Influence of item distribution pattern and abundance on efficiency of benthic core sampling
Behney, Adam C.; O'Shaughnessy, Ryan; Eichholz, Michael W.; Stafford, Joshua D.
2014-01-01
ore sampling is a commonly used method to estimate benthic item density, but little information exists about factors influencing the accuracy and time-efficiency of this method. We simulated core sampling in a Geographic Information System framework by generating points (benthic items) and polygons (core samplers) to assess how sample size (number of core samples), core sampler size (cm2), distribution of benthic items, and item density affected the bias and precision of estimates of density, the detection probability of items, and the time-costs. When items were distributed randomly versus clumped, bias decreased and precision increased with increasing sample size and increased slightly with increasing core sampler size. Bias and precision were only affected by benthic item density at very low values (500–1,000 items/m2). Detection probability (the probability of capturing ≥ 1 item in a core sample if it is available for sampling) was substantially greater when items were distributed randomly as opposed to clumped. Taking more small diameter core samples was always more time-efficient than taking fewer large diameter samples. We are unable to present a single, optimal sample size, but provide information for researchers and managers to derive optimal sample sizes dependent on their research goals and environmental conditions.
Liu, Chi-Hung; Hsu, Li-Ling; Hsiao, Cheng-Ting; Hsieh, Suh-Ing; Chang, Chun-Wei; Huang, Elaine Shinwei; Chang, Yeu-Jhy
2018-01-01
Background With the evolution of treatments for neurological diseases, the contents of core neurological examinations (NEs) for medical students may need to be modified. We aimed to establish a consensus on the core NE items for neurology clerks and compare viewpoints between different groups of panelists. Methods First, a pilot group proposed the core contents of NEs for neurology clerks. The proposed core NE items were then subject to a modified web-based Delphi process using the online software “SurveyMonkey”. A total of 30 panelists from different backgrounds (tutors or learners, neurologists or non-neurologists, community hospitals or medical centers, and different academic positions) participated in the modified Delphi process. Each panelist was asked to agree or disagree on the inclusion of each item using a 9-point Likert scale and was encouraged to provide feedback. We also compared viewpoints between different groups of panelists using the Mann-Whitney U test. Results Eighty-three items were used for the first round of the Delphi process. Of them, 18 without consensus of being a core NE item for the neurology clerks in the first round and another 14 items suggested by the panelists were further discussed in the second round. Finally, 75 items with different grades were included in the recommended NE items for neurology clerks. Conclusions Our findings provide a reference regarding the core NE items for milestone development for neurology clerkships. We hope that prioritizing the NE items in this order can help medical students to learn NE more efficiently. PMID:29771997
Core Items for a Standardized Resource Use Measure: Expert Delphi Consensus Survey.
Thorn, Joanna C; Brookes, Sara T; Ridyard, Colin; Riley, Ruth; Hughes, Dyfrig A; Wordsworth, Sarah; Noble, Sian M; Thornton, Gail; Hollingworth, William
2018-06-01
Resource use measurement by patient recall is characterized by inconsistent methods and a lack of validation. A validated standardized resource use measure could increase data quality, improve comparability between studies, and reduce research burden. To identify a minimum set of core resource use items that should be included in a standardized adult instrument for UK health economic evaluation from a provider perspective. Health economists with experience of UK-based economic evaluations were recruited to participate in an electronic Delphi survey. Respondents were asked to rate 60 resource use items (e.g., medication names) on a scale of 1 to 9 according to the importance of the item in a generic context. Items considered less important according to predefined consensus criteria were dropped and a second survey was developed. In the second round, respondents received the median score and their own score from round 1 for each item alongside summarized comments and were asked to rerate items. A final project team meeting was held to determine the recommended core set. Forty-five participants completed round 1. Twenty-six items were considered less important and were dropped, 34 items were retained for the second round, and no new items were added. Forty-two respondents (93.3%) completed round 2, and greater consensus was observed. After the final meeting, 10 core items were selected, with further items identified as suitable for "bolt-on" questionnaire modules. The consensus on 10 items considered important in a generic context suggests that a standardized instrument for core resource use items is feasible. Copyright © 2018. Published by Elsevier Inc.
Bleich, Sara N; Wolfson, Julia A; Jarlenski, Marian P
2015-01-01
Supply-side reductions to the calories in chain restaurants are a possible benefit of upcoming menu labeling requirements. To describe trends in calories available in large U.S. restaurants. Data were obtained from the MenuStat project, a census of menu items in 66 of the 100 largest U.S. restaurant chains, for 2012 and 2013 (N=19,417 items). Generalized linear models were used to calculate (1) the mean change in calories from 2012 to 2013, among items on the menu in both years; and (2) the difference in mean calories, comparing newly introduced items to those on the menu in 2012 only (overall and between core versus non-core items). Data were analyzed in 2014. Mean calories among items on menus in both 2012 and 2013 did not change. Large restaurant chains in the U.S. have recently had overall declines in calories in newly introduced menu items (-56 calories, 12% decline). These declines were concentrated mainly in new main course items (-67 calories, 10% decline). New beverage (-26 calories, 8% decline) and children's (-46 calories, 20% decline) items also had fewer mean calories. Among chain restaurants with a specific focus (e.g., burgers), average calories in new menu items not core to the business declined more than calories in core menu items. Large chain restaurants significantly reduced the number of calories in newly introduced menu items. Supply-side changes to the calories in chain restaurants may have a significant impact on obesity prevention. Copyright © 2015 American Journal of Preventive Medicine. Published by Elsevier Inc. All rights reserved.
Bleich, Sara N.; Wolfson, Julia A.; Jarlenski, Marian P.
2014-01-01
Background Supply-side reductions to the calories in chain restaurants are a possible benefit of upcoming menu labeling requirements. Purpose To describe trends in calories available in large U.S. restaurants. Methods Data were obtained from the MenuStat project, a census of menu items in 66 of the 100 largest U.S. restaurant chains, for 2012 and 2013 (N=19,417 items). Generalized linear models were used to calculate: (1) the mean change in calories from 2012 to 2013, among items on the menu in both years; and (2) the difference in mean calories, comparing newly introduced items to those on the menu in 2012 only (overall and between core versus non-core items). Data were analyzed in 2014. Results Mean calories among items on menus in both 2012 and 2013 did not change. Large restaurant chains in the U.S. have recently had overall declines in calories in newly introduced menu items (−56 calories, 12% decline). These declines were concentrated mainly in new main course items (−67 calories, 10% decline). New beverage (−26 calories, 8% decline) and children’s (−46 calories, 20% decline) items also had fewer mean calories. Among chain restaurants with a specific focus (e.g., burgers), average calories in new menu items not core to the business declined more than calories in core menu items. Conclusions Large chain restaurants significantly reduced the number of calories in newly introduced menu items. Supply-side changes to the calories in chain restaurants may have a significant impact on obesity prevention. PMID:25306397
Pulcini, C; Binda, F; Lamkang, A S; Trett, A; Charani, E; Goff, D A; Harbarth, S; Hinrichsen, S L; Levy-Hara, G; Mendelson, M; Nathwani, D; Gunturu, R; Singh, S; Srinivasan, A; Thamlikitkul, V; Thursky, K; Vlieghe, E; Wertheim, H; Zeng, M; Gandra, S; Laxminarayan, R
2018-04-03
With increasing global interest in hospital antimicrobial stewardship (AMS) programmes, there is a strong demand for core elements of AMS to be clearly defined on the basis of principles of effectiveness and affordability. To date, efforts to identify such core elements have been limited to Europe, Australia, and North America. The aim of this study was to develop a set of core elements and their related checklist items for AMS programmes that should be present in all hospitals worldwide, regardless of resource availability. A literature review was performed by searching Medline and relevant websites to retrieve a list of core elements and items that could have global relevance. These core elements and items were evaluated by an international group of AMS experts using a structured modified Delphi consensus procedure, using two-phased online in-depth questionnaires. The literature review identified seven core elements and their related 29 checklist items from 48 references. Fifteen experts from 13 countries in six continents participated in the consensus procedure. Ultimately, all seven core elements were retained, as well as 28 of the initial checklist items plus one that was newly suggested, all with ≥80% agreement; 20 elements and items were rephrased. This consensus on core elements for hospital AMS programmes is relevant to both high- and low-to-middle-income countries and could facilitate the development of national AMS stewardship guidelines and adoption by healthcare settings worldwide. Copyright © 2018 European Society of Clinical Microbiology and Infectious Diseases. All rights reserved.
Brookes, Sara T; Macefield, Rhiannon C; Williamson, Paula R; McNair, Angus G; Potter, Shelley; Blencowe, Natalie S; Strong, Sean; Blazeby, Jane M
2016-08-17
Methods for developing a core outcome or information set require involvement of key stakeholders to prioritise many items and achieve agreement as to the core set. The Delphi technique requires participants to rate the importance of items in sequential questionnaires (or rounds) with feedback provided in each subsequent round such that participants are able to consider the views of others. This study examines the impact of receiving feedback from different stakeholder groups, on the subsequent rating of items and the level of agreement between stakeholders. Randomized controlled trials were nested within the development of three core sets each including a Delphi process with two rounds of questionnaires, completed by patients and health professionals. Participants rated items from 1 (not essential) to 9 (absolutely essential). For round 2, participants were randomized to receive feedback from their peer stakeholder group only (peer) or both stakeholder groups separately (multiple). Decisions as to which items to retain following each round were determined by pre-specified criteria. Whilst type of feedback did not impact on the percentage of items for which a participant subsequently changed their rating, or the magnitude of change, it did impact on items retained at the end of round 2. Each core set contained discordant items retained by one feedback group but not the other (3-22 % discordant items). Consensus between patients and professionals in items to retain was greater amongst those receiving multiple group feedback in each core set (65-82 % agreement for peer-only feedback versus 74-94 % for multiple feedback). In addition, differences in round 2 scores were smaller between stakeholder groups receiving multiple feedback than between those receiving peer group feedback only. Variability in item scores across stakeholders was reduced following any feedback but this reduction was consistently greater amongst the multiple feedback group. In the development of a core outcome or information set, providing feedback within Delphi questionnaires from all stakeholder groups separately may influence the final core set and improve consensus between the groups. Further work is needed to better understand how participants rate and re-rate items within a Delphi process. The three randomized controlled trials reported here were each nested within the development of a core information or outcome set to investigate processes in core outcome and information set development. Outcomes were not health-related and therefore trial registration was not applicable.
2013-01-01
In 2003, the International Patient Decision Aid Standards (IPDAS) Collaboration was established to enhance the quality and effectiveness of patient decision aids by establishing an evidence-informed framework for improving their content, development, implementation, and evaluation. Over this 10 year period, the Collaboration has established: a) the background document on 12 core dimensions to inform the original modified Delphi process to establish the IPDAS checklist (74 items); b) the valid and reliable IPDAS instrument (47 items); and c) the IPDAS qualifying (6 items), certifying (6 items + 4 items for screening), and quality criteria (28 items). The objective of this paper is to describe the evolution of the IPDAS Collaboration and discuss the standardized process used to update the background documents on the theoretical rationales, evidence and emerging issues underlying the 12 core dimensions for assessing the quality of patient decision aids. PMID:24624947
Identifying Core Competencies of Infection Control Nurse Specialists in Hong Kong.
Chan, Wai Fong; Bond, Trevor G; Adamson, Bob; Chow, Meyrick
2016-01-01
To confirm a core competency scale for Hong Kong infection control nurses at the advanced nursing practice level from the core competency items proposed in a previous phase of this study. This would serve as the foundation of competency assurance in Hong Kong hospitals. A cross-sectional survey design was used. All public and private hospitals in Hong Kong. All infection control nurses in hospitals of Hong Kong. The 83-item proposed core competency list established in an earlier study was transformed into a questionnaire and sent to 112 infection control nurses in 48 hospitals in Hong Kong. They were asked to rate the importance of each infection prevention and control item using Likert-style response categories. Data were analyzed using the Rasch model. The response rate of 81.25% was achieved. Seven items were removed from the proposed core competency list, leaving a scale of 76 items that fit the measurement requirements of the unidimensional Rasch model. Essential core competency items of advanced practice for infection control nurses in Hong Kong were identified based on the measurement criteria of the Rasch model. Several items of the scale that reflect local Hong Kong contextual characteristics are distinguished from the overseas standards. This local-specific competency list could serve as the foundation for education and for certification of infection control nurse specialists in Hong Kong. Rasch measurement is an appropriate analytical tool for identifying core competencies of advanced practice nurses in other specialties and in other locations in a manner that incorporates practitioner judgment and expertise.
Sales, Célia Md; Neves, Inês Td; Alves, Paula G; Ashworth, Mark
2017-11-22
There is increasing interest in individualized patient-reported outcome measures (I-PROMS), where patients themselves indicate the specific problems they want to address in therapy and these problems are used as items within the outcome measurement tool. This paper examined the extent to which 279 items reported in an I-PROM (PSYCHLOPS) added qualitative information which was not captured by two well-established outcome measures (CORE-OM and PHQ-9). Comparison of items was only conducted for patients scoring above the "caseness" threshold on the standardized measures. 107 patients were participating in therapy within addiction and general psychiatric clinical settings. Almost every patient (95%) reported at least one item whose content was not covered by PHQ-9, and 71% reported at least one item not covered by CORE-OM. Results demonstrate the relevance of individualized outcome assessment for capturing data describing the issues of greatest concern to patients, as nomothetic measures do not always seem to capture the whole story. © 2017 The Authors Health Expectations Published by John Wiley & Sons Ltd.
Fundamentals of Marketing Core Curriculum. Test Items and Assessment Techniques.
ERIC Educational Resources Information Center
Smith, Clifton L.; And Others
This document contains multiple choice test items and assessment techniques for Missouri's fundamentals of marketing core curriculum. The core curriculum is divided into these nine occupational duties: (1) communications in marketing; (2) economics and marketing; (3) employment and advancement; (4) human relations in marketing; (5) marketing…
Advanced Marketing Core Curriculum. Test Items and Assessment Techniques.
ERIC Educational Resources Information Center
Smith, Clifton L.; And Others
This document contains duties and tasks, multiple-choice test items, and other assessment techniques for Missouri's advanced marketing core curriculum. The core curriculum begins with a list of 13 suggested textbook resources. Next, nine duties with their associated tasks are given. Under each task appears one or more citations to appropriate…
The EORTC CAT Core-The computer adaptive version of the EORTC QLQ-C30 questionnaire.
Petersen, Morten Aa; Aaronson, Neil K; Arraras, Juan I; Chie, Wei-Chu; Conroy, Thierry; Costantini, Anna; Dirven, Linda; Fayers, Peter; Gamper, Eva-Maria; Giesinger, Johannes M; Habets, Esther J J; Hammerlid, Eva; Helbostad, Jorunn; Hjermstad, Marianne J; Holzner, Bernhard; Johnson, Colin; Kemmler, Georg; King, Madeleine T; Kaasa, Stein; Loge, Jon H; Reijneveld, Jaap C; Singer, Susanne; Taphoorn, Martin J B; Thamsborg, Lise H; Tomaszewski, Krzysztof A; Velikova, Galina; Verdonck-de Leeuw, Irma M; Young, Teresa; Groenvold, Mogens
2018-06-21
To optimise measurement precision, relevance to patients and flexibility, patient-reported outcome measures (PROMs) should ideally be adapted to the individual patient/study while retaining direct comparability of scores across patients/studies. This is achievable using item banks and computerised adaptive tests (CATs). The European Organisation for Research and Treatment of Cancer (EORTC) Quality of Life Questionnaire Core 30 (QLQ-C30) is one of the most widely used PROMs in cancer research and clinical practice. Here we provide an overview of the research program to develop CAT versions of the QLQ-C30's 14 functional and symptom domains. The EORTC Quality of Life Group's strategy for developing CAT item banks consists of: literature search to identify potential candidate items; formulation of new items compatible with the QLQ-C30 item style; expert evaluations and patient interviews; field-testing and psychometric analyses, including factor analysis, item response theory calibration and simulation of measurement properties. In addition, software for setting up, running and scoring CAT has been developed. Across eight rounds of data collections, 9782 patients were recruited from 12 countries for the field-testing. The four phases of development resulted in a total of 260 unique items across the 14 domains. Each item bank consists of 7-34 items. Psychometric evaluations indicated higher measurement precision and increased statistical power of the CAT measures compared to the QLQ-C30 scales. Using CAT, sample size requirements may be reduced by approximately 20-35% on average without loss of power. The EORTC CAT Core represents a more precise, powerful and flexible measurement system than the QLQ-C30. It is currently being validated in a large independent, international sample of cancer patients. Copyright © 2018 Elsevier Ltd. All rights reserved.
Weidmer, Beverly A; Brach, Cindy; Hays, Ron D
2012-09-01
The complexity of health information often exceeds patients' skills to understand and use it. To develop survey items assessing how well healthcare providers communicate health information. Domains and items for the Consumer Assessment of Healthcare Providers and Systems (CAHPS) Item Set for Addressing Health Literacy were identified through an environmental scan and input from stakeholders. The draft item set was translated into Spanish and pretested in both English and Spanish. The revised item set was field tested with a randomly selected sample of adult patients from 2 sites using mail and telephonic data collection. Item-scale correlations, confirmatory factor analysis, and internal consistency reliability estimates were estimated to assess how well the survey items performed and identify composite measures. Finally, we regressed the CAHPS global rating of the provider item on the CAHPS core communication composite and the new health literacy composites. A total of 601 completed surveys were obtained (52% response rate). Two composite measures were identified: (1) Communication to Improve Health Literacy (16 items); and (2) How Well Providers Communicate About Medicines (6 items). These 2 composites were significantly uniquely associated with the global rating of the provider (communication to improve health literacy: P<0.001, b=0.28; and communication about medicines composite: P=0.02, b=0.04). The 2 composites and the CAHPS core communication composite accounted for 51% of the variance in the global rating of the provider. A 5-item subset of the Communication to Improve Health Literacy composite accounted for 90% of the variance of the original 16-item composite. This study provides support for reliability and validity of the CAHPS Item Set for Addressing Health Literacy. These items can serve to assess whether healthcare providers have communicated effectively with their patients and as a tool for quality improvement.
Mielenz, Thelma J; Callahan, Leigh F; Edwards, Michael C
2016-03-12
Examine the feasibility of performing an item response theory (IRT) analysis on two of the Centers for Disease Control and Prevention health-related quality of life (CDC HRQOL) modules - the 4-item Healthy Days Core Module (HDCM) and the 5-item Healthy days Symptoms Module (HDSM). Previous principal components analyses confirm that the two scales both assess a mix of mental (CDC-MH) and physical health (CDC-PH). The purpose is to conduct item response theory (IRT) analysis on the CDC-MH and CDC-PH scales separately. 2182 patients with self-reported or physician-diagnosed arthritis completed a cross-sectional survey including HDCM and HDSM items. Besides global health, the other 8 items ask the number of days that some statement was true; we chose to recode the data into 8 categories based on observed clustering. The IRT assumptions were assessed using confirmatory factor analysis and the data could be modeled using an unidimensional IRT model. The graded response model was used for IRT analyses and CDC-MH and CDC-PH scales were analyzed separately in flexMIRT. The IRT parameter estimates for the five-item CDC-PH all appeared reasonable. The three-item CDC-MH did not have reasonable parameter estimates. The CDC-PH scale is amenable to IRT analysis but the existing The CDC-MH scale is not. We suggest either using the 4-item Healthy Days Core Module (HDCM) and the 5-item Healthy days Symptoms Module (HDSM) as they currently stand or the CDC-PH scale alone if the primary goal is to measure physical health related HRQOL.
Gerbens, L A A; Apfelbacher, C J; Irvine, A D; Barbarot, S; de Booij, R J; Boyce, A E; Deleuran, M; Eichenfield, L F; Hof, M H; Middelkamp-Hup, M A; Roberts, A; Schmitt, J; Vestergaard, C; Wall, D; Weidinger, S; Williamson, P R; Flohr, C; Spuls, P I
2018-05-15
Evidence of immunomodulatory therapies to guide clinical management for atopic eczema (AE) is scarce, despite frequent and often off-label use. Patient registries provide valuable evidence for the effects of treatments under real world conditions which can inform treatment guidelines, give the opportunity for health economic evaluation and the evaluation of quality of care, as well as pharmacogenetic and -dynamic research which cannot be adequately addressed in clinical trials. The TREatment of ATopic eczema (TREAT) Registry Taskforce aims to seek international consensus on a core set of domains and items ('what to measure') for AE research registries, using a Delphi approach. Participants from six stakeholder groups were included: doctors, nurses, non-clinical researchers, patients, industry and regulatory body representatives. The eDelphi comprised 3 sequential online rounds, requesting participants to rate the importance of each proposed domain item. Participants could add domain items to the proposed list in round 1. A final consensus meeting was held to ratify the core set. 479 participants from 36 countries accessed the eDelphi platform, of whom 86%, 79% and 74% completed rounds 1, 2, and 3 respectively. At the face-to-face consensus meeting attended by 42 participants the final core set was established containing 19 domains with 69 domain items (49 baseline and 20 follow-up items). This core set of domains and items to be captured by national AE systemic therapy registries will standardise data collection and thereby allow direct comparability across registries and facilitate data pooling between countries. Ultimately, it will provide greater insight into the effectiveness, safety and cost-effectiveness of photo- and systemic immunomodulatory therapies. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
A Psychometric Evaluation of the Core Bereavement Items
ERIC Educational Resources Information Center
Holland, Jason M.; Nam, Ilsung; Neimeyer, Robert A.
2013-01-01
Despite being a routinely administered assessment of grieving, few studies have empirically examined the psychometric properties of the Core Bereavement Items (CBI). The present study investigated the factor structure, internal reliability, and concurrent validity of the CBI in a large, diverse sample of bereaved young adults (N = 1,366).…
Evaluation of Item Candidates: The PROMIS Qualitative Item Review
DeWalt, Darren A.; Rothrock, Nan; Yount, Susan; Stone, Arthur A.
2009-01-01
One of the PROMIS (Patient-Reported Outcome Measurement Information System) network's primary goals is the development of a comprehensive item bank for patient-reported outcomes of chronic diseases. For its first set of item banks, PROMIS chose to focus on pain, fatigue, emotional distress, physical function, and social function. An essential step for the development of an item pool is the identification, evaluation, and revision of extant questionnaire items for the core item pool. In this work, we also describe the systematic process wherein items are classified for subsequent statistical processing by the PROMIS investigators. Six phases of item development are documented: identification of extant items, item classification and selection, item review and revision, focus group input on domain coverage, cognitive interviews with individual items, and final revision before field testing. Identification of items refers to the systematic search for existing items in currently available scales. Expert item review and revision was conducted by trained professionals who reviewed the wording of each item and revised as appropriate for conventions adopted by the PROMIS network. Focus groups were used to confirm domain definitions and to identify new areas of item development for future PROMIS item banks. Cognitive interviews were used to examine individual items. Items successfully screened through this process were sent to field testing and will be subjected to innovative scale construction procedures. PMID:17443114
Automatically Scoring Short Essays for Content. CRESST Report 836
ERIC Educational Resources Information Center
Kerr, Deirdre; Mousavi, Hamid; Iseli, Markus R.
2013-01-01
The Common Core assessments emphasize short essay constructed response items over multiple choice items because they are more precise measures of understanding. However, such items are too costly and time consuming to be used in national assessments unless a way is found to score them automatically. Current automatic essay scoring techniques are…
Australian Biology Test Item Bank, Years 11 and 12. Volume II: Year 12.
ERIC Educational Resources Information Center
Brown, David W., Ed.; Sewell, Jeffrey J., Ed.
This document consists of test items which are applicable to biology courses throughout Australia (irrespective of course materials used); assess key concepts within course statement (for both core and optional studies); assess a wide range of cognitive processes; and are relevant to current biological concepts. These items are arranged under…
Australian Biology Test Item Bank, Years 11 and 12. Volume I: Year 11.
ERIC Educational Resources Information Center
Brown, David W., Ed.; Sewell, Jeffrey J., Ed.
This document consists of test items which are applicable to biology courses throughout Australia (irrespective of course materials used); assess key concepts within course statement (for both core and optional studies); assess a wide range of cognitive processes; and are relevant to current biological concepts. These items are arranged under…
Recommended core items to assess e-cigarette use in population-based surveys.
Pearson, Jennifer L; Hitchman, Sara C; Brose, Leonie S; Bauld, Linda; Glasser, Allison M; Villanti, Andrea C; McNeill, Ann; Abrams, David B; Cohen, Joanna E
2018-05-01
A consistent approach using standardised items to assess e-cigarette use in both youth and adult populations will aid cross-survey and cross-national comparisons of the effect of e-cigarette (and tobacco) policies and improve our understanding of the population health impact of e-cigarette use. Focusing on adult behaviour, we propose a set of e-cigarette use items, discuss their utility and potential adaptation, and highlight e-cigarette constructs that researchers should avoid without further item development. Reliable and valid items will strengthen the emerging science and inform knowledge synthesis for policy-making. Building on informal discussions at a series of international meetings of 65 experts from 15 countries, the authors provide recommendations for assessing e-cigarette use behaviour, relative perceived harm, device type, presence of nicotine, flavours and reasons for use. We recommend items assessing eight core constructs: e-cigarette ever use, frequency of use and former daily use; relative perceived harm; device type; primary flavour preference; presence of nicotine; and primary reason for use. These items should be standardised or minimally adapted for the policy context and target population. Researchers should be prepared to update items as e-cigarette device characteristics change. A minimum set of e-cigarette items is proposed to encourage consensus around items to allow for cross-survey and cross-jurisdictional comparisons of e-cigarette use behaviour. These proposed items are a starting point. We recognise room for continued improvement, and welcome input from e-cigarette users and scientific colleagues. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Development and validation of an item response theory-based Social Responsiveness Scale short form.
Sturm, Alexandra; Kuhfeld, Megan; Kasari, Connie; McCracken, James T
2017-09-01
Research and practice in autism spectrum disorder (ASD) rely on quantitative measures, such as the Social Responsiveness Scale (SRS), for characterization and diagnosis. Like many ASD diagnostic measures, SRS scores are influenced by factors unrelated to ASD core features. This study further interrogates the psychometric properties of the SRS using item response theory (IRT), and demonstrates a strategy to create a psychometrically sound short form by applying IRT results. Social Responsiveness Scale analyses were conducted on a large sample (N = 21,426) of youth from four ASD databases. Items were subjected to item factor analyses and evaluation of item bias by gender, age, expressive language level, behavior problems, and nonverbal IQ. Item selection based on item psychometric properties, DIF analyses, and substantive validity produced a reduced item SRS short form that was unidimensional in structure, highly reliable (α = .96), and free of gender, age, expressive language, behavior problems, and nonverbal IQ influence. The short form also showed strong relationships with established measures of autism symptom severity (ADOS, ADI-R, Vineland). Degree of association between all measures varied as a function of expressive language. Results identified specific SRS items that are more vulnerable to non-ASD-related traits. The resultant 16-item SRS short form may possess superior psychometric properties compared to the original scale and emerge as a more precise measure of ASD core symptom severity, facilitating research and practice. Future research using IRT is needed to further refine existing measures of autism symptomatology. © 2017 Association for Child and Adolescent Mental Health.
Osman, Helen; Jorm, Anthony F; Killackey, Eoin; Francey, Shona; Mulcahy, Dianne
2017-08-09
The aim of this study was to identify the core competencies required of mental health professionals working in the early psychosis field, which could function as an evidence-based tool to support the early psychosis workforce and in turn assist early psychosis service implementation and strengthen early psychosis model fidelity. The Delphi method was used to establish expert consensus on the core competencies. In the first stage, a systematic literature search was conducted to generate competency items. In the second stage, a panel consisting of expert early psychosis clinicians from around the world was formed. Panel members then rated each of the competency items on how essential they are to the clinical practice of all early psychosis clinicians. In total, 1023 pieces of literature including textbooks, journal articles and grey literature were reviewed. A final 542 competency items were identified for inclusion in the questionnaire. A total of 63 early psychosis experts participated in 3 rating rounds. Of the 542 competency items, 242 were endorsed as the required core competencies. There were 29 competency items that were endorsed by 62 or more experts, and these may be considered the foundational competencies for early psychosis practice. The study generated a set of core competencies that provide a common language for early psychosis clinicians across professional disciplines and country of practice, and potentially are a useful professional resource to support early psychosis workforce development and service reform. © 2017 John Wiley & Sons Australia, Ltd.
Validation of the CoRE Questionnaire for a Medical Journal Peer Review.
Doi, Suhail A R; Salzman-Scott, Sherry A; Onitilo, Adedayo A
2016-01-01
If a peer review instrument asks concrete questions (defined as items that can only generate disagreement if reviewers have different degrees of expertise), then questionnaires could become more meaningful in terms of resolving subjectivity thus leading to more reviewer agreement. A concrete item questionnaire with well-chosen questions can also help resolve disagreement when reviewers have the same level of expertise. We have recently created the core-item reviewer evaluation (CoRE) questionnaire for which decision-threshold score levels have been created, but which have not been validated. This prospective validation of these thresholds for the CoRE questionnaire demonstrated strong agreement between reviewer recommendations and their reported score levels when tested prospectively at Clinical Medicine and Research. We conclude that using the CoRE questionnaire will help reduce peer reviewer disagreement. More importantly, when reviewer expertise varies, editors can more easily detect this and decide which opinion reflects the greater expertise.
Dental responsibility loadings and the relative value of dental services.
Teusner, D N; Ju, X; Brennan, D S
2017-09-01
To estimate responsibility loadings for a comprehensive list of dental services, providing a standardized unit of clinical work effort. Dentists (n = 2500) randomly sampled from the Australian Dental Association membership (2011) were randomly assigned to one of 25 panels. Panels were surveyed by questionnaires eliciting responsibility loadings for eight common dental services (core items) and approximately 12 other items unique to that questionnaire. In total, loadings were elicited for 299 items listed in the Australian Dental Schedule 9th Edition. Data were weighted to reflect the age and sex distribution of the workforce. To assess reliability, regression models assessed differences in core item loadings by panel assignment. Estimated loadings were described by reporting the median and mean. Response rate was 37%. Panel composition did not vary by practitioner characteristics. Core item loadings did not vary by panel assignment. Oral surgery and endodontic service areas had the highest proportion (91%) of services with median loadings ≥1.5, followed by prosthodontics (78%), periodontics (76%), orthodontics (63%), restorative (62%) and diagnostic services (31%). Preventive services had median loadings ≤1.25. Dental responsibility loadings estimated by this study can be applied in the development of relative value scales. © 2017 Australian Dental Association.
Oude Voshaar, Martijn A H; Ten Klooster, Peter M; Glas, Cees A W; Vonkeman, Harald E; Taal, Erik; Krishnan, Eswar; Bernelot Moens, Hein J; Boers, Maarten; Terwee, Caroline B; van Riel, Piet L C M; van de Laar, Mart A F J
2015-12-01
To evaluate the content validity and measurement properties of the Patient-Reported Outcome Measurement Information System (PROMIS) physical function item bank and a 20-item short form in patients with RA in comparison with the HAQ disability index (HAQ-DI) and 36-item Short Form Health Survey (SF-36) physical functioning scale (PF-10). The content validity of the instruments was evaluated by linking their items to the International Classification of Functioning, Disability and Health (ICF) core set for RA. The measures were administered to 690 RA patients enrolled in the Dutch Rheumatoid Arthritis Monitoring registry. Measurement precision was evaluated using item response theory methods and construct validity was evaluated by correlating physical function scores with other clinical and patient-reported outcome measures. All 207 health concepts identified in the physical function measures referred to activities that are featured in the ICF. Twenty-three of 26 ICF RA core set domains are featured in the full PROMIS physical function item bank compared with 13 and 8 for the HAQ-DI and PF-10, respectively. As hypothesized, all three physical function instruments were highly intercorrelated (r 0.74-0.84), moderately correlated with disease activity measures (r 0.44-0.63) and weakly correlated with age (rs 0.07-0.14). Item response theory-based analysis revealed that a 20-item PROMIS physical function short form covered a wider range of physical function levels than the HAQ-DI or PF-10. The PROMIS physical function item bank demonstrated excellent measurement properties in RA. A content-driven 20-item short form may be a useful tool for assessing physical function in RA. © The Author 2015. Published by Oxford University Press on behalf of the British Society for Rheumatology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Plasma Interactions With Spacecraft
2009-04-01
software core 3 Table 2. N2kDB classes 8 Table 3. N2kDB Application Programmer Interface 11 Table 4. How to get number of items from N2kDB 14 Table 5...grid, timesteps, and pages of particles. Table 4 specifies how these functions are used to get useful quantities. The Getcount function gets the...number of items with data item names that start with the specified string. 13 Table 4. How to get number of items from N2kDB. Function Specifics
ERIC Educational Resources Information Center
Yamamoto, Kentaro; He, Qiwei; Shin, Hyo Jeong; von Davier, Mattias
2017-01-01
Approximately a third of the Programme for International Student Assessment (PISA) items in the core domains (math, reading, and science) are constructed-response items and require human coding (scoring). This process is time-consuming, expensive, and prone to error as often (a) humans code inconsistently, and (b) coding reliability in…
ERIC Educational Resources Information Center
Wei, Youhua; Thompson, Bruce; Cook, C. Colleen
2005-01-01
LibQUAL+[TM] data to date have not been subjected to the modern measurement theory called polytomous item response theory (IRT). The data interpreted here were collected from 42,090 participants who completed the "American English" version of the 22 core LibQUAL+[TM] items, and 12,552 participants from Australia and Europe who…
Shawahna, Ramzi
2017-12-01
The aim of this study was to develop and achieve consensus on a core list of important knowledge items that community pharmacists should know on women's issues in epilepsy. This was a consensual study using a modified Delphi technique. Knowledge items were collected from the literature and from nine key contacts who were interviewed on their views on what information community pharmacists should have on women's issues in epilepsy. More knowledge items were suggested by five researchers with interest in women's issues who were contacted to rate and comment on the knowledge items collected. Two iterative Delphi rounds were conducted among a panel of pharmacists (n=30) to achieve consensus on the knowledge items to be included in the core list. Ten panelists ranked the knowledge items by their importance using the Analytical Hierarchy Process (AHP). Consensus was achieved to include 68 knowledge under 13 categories in the final core list. Items ranked by their importance were related to the following: teratogenicity (10.3%), effect of pregnancy on epilepsy (7.4%), preconception counseling (10.3%), bone health (5.9%), catamenial epilepsy (7.4%), menopause and hormonal replacement therapy (2.9%), contraception (14.7%), menstrual disorders and infertility (8.8%), eclampsia (2.9%), breastfeeding (4.4%), folic acid and vitamin K (5.9%), counseling on general issues (14.7%), and sexuality (4.4%). Using consensual knowledge lists might promote congruence in educating and/or training community pharmacists on women's issues in epilepsy. Future studies are needed to investigate if such lists can improve health services provided to women with epilepsy (WWE). Copyright © 2017 Elsevier Inc. All rights reserved.
Jafari, Peyman; Bagheri, Zahra; Ayatollahi, Seyyed Mohamad Taghi; Soltani, Zahra
2012-03-13
Item response theory (IRT) is extensively used to develop adaptive instruments of health-related quality of life (HRQoL). However, each IRT model has its own function to estimate item and category parameters, and hence different results may be found using the same response categories with different IRT models. The present study used the Rasch rating scale model (RSM) to examine and reassess the psychometric properties of the Persian version of the PedsQL™ 4.0 Generic Core Scales. The PedsQL™ 4.0 Generic Core Scales was completed by 938 Iranian school children and their parents. Convergent, discriminant and construct validity of the instrument were assessed by classical test theory (CTT). The RSM was applied to investigate person and item reliability, item statistics and ordering of response categories. The CTT method showed that the scaling success rate for convergent and discriminant validity were 100% in all domains with the exception of physical health in the child self-report. Moreover, confirmatory factor analysis supported a four-factor model similar to its original version. The RSM showed that 22 out of 23 items had acceptable infit and outfit statistics (<1.4, >0.6), person reliabilities were low, item reliabilities were high, and item difficulty ranged from -1.01 to 0.71 and -0.68 to 0.43 for child self-report and parent proxy-report, respectively. Also the RSM showed that successive response categories for all items were not located in the expected order. This study revealed that, in all domains, the five response categories did not perform adequately. It is not known whether this problem is a function of the meaning of the response choices in the Persian language or an artifact of a mostly healthy population that did not use the full range of the response categories. The response categories should be evaluated in further validation studies, especially in large samples of chronically ill patients.
ERIC Educational Resources Information Center
Thomas, Ally
2016-01-01
With the advent of the newly developed Common Core State Standards and the Next Generation Science Standards, innovative assessments, including technology-enhanced items and tasks, will be needed to meet the challenges of developing valid and reliable assessments in a world of computer-based testing. In a recent critique of the next generation…
ERIC Educational Resources Information Center
Kerr, Deirdre; Mousavi, Hamid; Iseli, Markus R.
2013-01-01
The Common Core assessments emphasize short essay constructed-response items over multiple-choice items because they are more precise measures of understanding. However, such items are too costly and time consuming to be used in national assessments unless a way to score them automatically can be found. Current automatic essay-scoring techniques…
Using Localized Survey Items to Augment Standardized Benchmarking Measures: A LibQUAL+[TM] Study
ERIC Educational Resources Information Center
Thompson, Bruce; Cook, Colleen; Kyrillidou, Martha
2006-01-01
The LibQUAL+[TM] protocol solicits open-ended comments from users with regard to library service quality, gathers data on 22 core items, and, at the option of individual libraries, also garners ratings on five items drawn from a pool of more than 100 choices selected by libraries. In this article, the relationship of scores on these locally…
ERIC Educational Resources Information Center
Reise, Steven P.; Meijer, Rob R.; Ainsworth, Andrew T.; Morales, Leo S.; Hays, Ron D.
2006-01-01
Group-level parametric and non-parametric item response theory models were applied to the Consumer Assessment of Healthcare Providers and Systems (CAHPS[R]) 2.0 core items in a sample of 35,572 Medicaid recipients nested within 131 health plans. Results indicated that CAHPS responses are dominated by within health plan variation, and only weakly…
A Method for Generating Educational Test Items That Are Aligned to the Common Core State Standards
ERIC Educational Resources Information Center
Gierl, Mark J.; Lai, Hollis; Hogan, James B.; Matovinovic, Donna
2015-01-01
The demand for test items far outstrips the current supply. This increased demand can be attributed, in part, to the transition to computerized testing, but, it is also linked to dramatic changes in how 21st century educational assessments are designed and administered. One way to address this growing demand is with automatic item generation.…
Goetz, Christopher G; Liu, Yuanyuan; Stebbins, Glenn T; Wang, Lu; Tilley, Barbara C; Teresi, Jeanne A; Merkitch, Douglas; Luo, Sheng
2016-12-01
Assess MDS-UPDRS items for gender-, age-, and race/ethnicity-based differential item functioning. Assessing differential item functioning is a core rating scale validation step. For the MDS-UPDRS, differential item functioning occurs if item-score probability among people with similar levels of parkinsonism differ according to selected covariates (gender, age, race/ethnicity). If the magnitude of differential item functioning is clinically relevant, item-score interpretation must consider influences by these covariates. Differential item functioning can be nonuniform (covariate variably influences an item-score across different levels of parkinsonism) or uniform (covariate influences an item-score consistently over all levels of parkinsonism). Using the MDS-UPDRS translation database of more than 5,000 PD patients from 14 languages, we tested gender-, age-, and race/ethnicity-based differential item functioning. To designate an item as having clinically relevant differential item functioning, we required statistical confirmation by 2 independent methods, along with a McFadden pseudo-R 2 magnitude statistic greater than "negligible." Most items showed no gender-, age- or race/ethnicity-based differential item functioning. When differential item functioning was identified, the magnitude statistic was always in the "negligible" range, and the scale-level impact was minimal. The absence of clinically relevant differential item functioning across all items and all parts of the MDS-UPDRS is strong evidence that the scale can be used confidently. As studies of Parkinson's disease increasingly involve multinational efforts and the MDS-UPDRS has several validated non-English translations, the findings support the scale's broad applicability in populations with varying gender, age, and race/ethnicity distributions. © 2016 International Parkinson and Movement Disorder Society. © 2016 International Parkinson and Movement Disorder Society.
Gerbens, Louise A A; Boyce, Aaron E; Wall, Dmitri; Barbarot, Sebastien; de Booij, Richard J; Deleuran, Mette; Middelkamp-Hup, Maritza A; Roberts, Amanda; Vestergaard, Christian; Weidinger, Stephan; Apfelbacher, Christian J; Irvine, Alan D; Schmitt, Jochen; Williamson, Paula R; Spuls, Phyllis I; Flohr, Carsten
2017-02-27
Patients with moderate-to-severe atopic eczema (AE) often require photo- or systemic immunomodulatory therapies to induce disease remission and maintain long-term control. The current evidence to guide clinical management is small, despite the frequent and often off-label use of these treatments. Registries of patients on photo- and systemic immunomodulatory therapies could fill this gap, and the collection of a core set concerning these therapies in AE will allow direct comparisons across registries as well as data sharing and pooling. Using an eDelphi approach, the international TREatment of ATopic eczema (TREAT) Registry Taskforce aims to seek consensus between key stakeholders internationally on a core set of domains and domain items for AE patient registries with a research focus that collect data of children and adults on photo- and systemic immunomodulatory therapies. Participants from six stakeholder groups will be invited: doctors, nurses, non-clinical researchers, patients, as well as industry and regulatory body representatives. The eDelphi will comprise three sequential online rounds, requesting participants to rate the importance of each proposed domain and domain items. Participants will be able to add domains and domain items to the proposed list in round 1. A final consensus meeting will be held with representatives of each stakeholder group. Identifying a uniform core set of domains and domain items to be captured by AE patient registries will increase the utility of individual registries, and provide greater insight into the effectiveness, safety and cost-effectiveness of photo- and systemic immunomodulatory therapies to guide clinical management across dermatology centres and country borders. Not applicable. This eDelphi study was registered in the Core Outcome Measures for Effectiveness Trials (COMET) database.
de Steur, W O; Henneman, D; Allum, W H; Dikken, J L; van Sandick, J W; Reynolds, J; Mariette, C; Jensen, L; Johansson, J; Kolodziejczyk, P; Hardwick, R H; van de Velde, C J H
2014-03-01
Seven countries (Denmark, France, Ireland, the Netherlands, Poland, Sweden, United Kingdom) collaborated to initiate a EURECCA (European Registration of Cancer Care) Upper GI project. The aim of this study was to identify a core dataset of shared items in the different data registries which can be used for future collaboration between countries. Item lists from all participating Upper GI cancer registries were collected. Items were scored 'present' when included in the registry, or when the items could be deducted from other items in the registry. The definition of a common item was that it was present in at least six of the seven participating countries. The number of registered items varied between 40 (Poland) and 650 (Ireland). Among the 46 shared items were data on patient characteristics, staging and diagnostics, neoadjuvant treatment, surgery, postoperative course, pathology, and adjuvant treatment. Information on non-surgical treatment was available in only 4 registries. A list of 46 shared items from seven participating Upper GI cancer registries was created, providing a basis for future quality assurance and research in Upper GI cancer treatment on a European level. Copyright © 2013 Elsevier Ltd. All rights reserved.
Jalaludin, My; Fuziah, Mz; Hong, Jyh; Mohamad Adam, B; Jamaiyah, H
2012-01-01
Self-care plays an important role in diabetes management. One of the instruments used to evaluate self-care in patients with diabetes is the Summary of Diabetes Self-Care Activities (SDSCA) questionnaire. A validated instrument in the Malay language is used to assess self-care practice among children and adolescents with diabetes in Malaysia. To translate and evaluate the psychometric properties of the revised version of the SDSCA questionnaire in the Malay language. Forward and backward translations were performed. An expert panel reviewed all versions for conceptual and content equivalence. The final version was administered to paediatric patients with diabetes between August 2006 and September 2007. Reliability was analysed using Cronbach's alpha and validity was assessed using exploratory factor analysis. A total of 117 patients aged 10-18 years were enrolled from nine hospitals. The reliability of overall core items was 0.735 (with item 4) while the reliabilities of the four domains were in the range of 0.539-0.838. As core item number 4 was found to be problematic and it was subtituted by item 5a (from the expanded SDSCA) to suit local dietary education and practice; and the reliabilities of the overall core item (0.782) and the four domains (0.620 - 0.838) improved. Factor loadings of all the items were greater than 0.4, loaded into the original domains, and accounted for 73% of the total variance. The Malay translation of the revised English SDSCA is reliable and valid as a guide for Malaysian children and adolescents suffering from diabetes.
ERIC Educational Resources Information Center
Anderson, Daniel; Alonzo, Julie; Tindal, Gerald
2012-01-01
In this technical report, we describe the results of a study of mathematics items written to align with the Common Core State Standards (CCSS) in grades 6-8. In each grade, CCSS items were organized into forms, and the reliability of these forms was evaluated along with an experimental form including items aligned with the National Council of…
An empirical examination of the factor structure of compassion.
Gu, Jenny; Cavanagh, Kate; Baer, Ruth; Strauss, Clara
2017-01-01
Compassion has long been regarded as a core part of our humanity by contemplative traditions, and in recent years, it has received growing research interest. Following a recent review of existing conceptualisations, compassion has been defined as consisting of the following five elements: 1) recognising suffering, 2) understanding the universality of suffering in human experience, 3) feeling moved by the person suffering and emotionally connecting with their distress, 4) tolerating uncomfortable feelings aroused (e.g., fear, distress) so that we remain open to and accepting of the person suffering, and 5) acting or being motivated to act to alleviate suffering. As a prerequisite to developing a high quality compassion measure and furthering research in this field, the current study empirically investigated the factor structure of the five-element definition using a combination of existing and newly generated self-report items. This study consisted of three stages: a systematic consultation with experts to review items from existing self-report measures of compassion and generate additional items (Stage 1), exploratory factor analysis of items gathered from Stage 1 to identify the underlying structure of compassion (Stage 2), and confirmatory factor analysis to validate the identified factor structure (Stage 3). Findings showed preliminary empirical support for a five-factor structure of compassion consistent with the five-element definition. However, findings indicated that the 'tolerating' factor may be problematic and not a core aspect of compassion. This possibility requires further empirical testing. Limitations with items from included measures lead us to recommend against using these items collectively to assess compassion. Instead, we call for the development of a new self-report measure of compassion, using the five-element definition to guide item generation. We recommend including newly generated 'tolerating' items in the initial item pool, to determine whether or not factor-level issues are resolved once item-level issues are addressed.
Mori, Masanori; Kuwama, Yuichiro; Ashikaga, Takamaru; Parsons, Henrique A; Miyashita, Mitsunori
2018-01-01
Acculturation is the phenomenon of the attitudinal changes of individuals who come into continuous contact with another culture. Despite the long history of Japanese immigration to America, little is known about the impact of acculturation on perceptions of a good death. To examine differences in perceptions of a good cancer death among Japanese Americans (JA/A), Japanese living in America (J/A), and the Japanese living in Japan (J/J). We administered surveys among JA/A and J/A and used historical J/J data for reference. Primary endpoint was the proportion of respondents who expressed the necessity of core and optional items of the Good Death Inventory. Group differences ≥20% were deemed clinically important. In total, 441 survey responses in America and 2548 in Japan were obtained. More than 80% of respondents consistently considered nine of 10 core items necessary without significant group differences. No core item reached a ≥20% group difference. Three of the eight optional items reached ≥20% group difference: fighting against disease until one's last moment (49%, P < 0.0001; 52%, P < 0.0001; and 73% in JA/A, J/A, and J/J, respectively), knowing what to expect about one's condition in the future (83%, P < 0.0001; 80%, P < 0.0001; and 58%, respectively), and having faith (64%, P = 0.0548; 43%, P = 0.0127; and 38%, respectively). Although most core items of a good death were preserved throughout the levels of acculturation, perceptions of some optional items shifted away from Japanese attitudes as individuals became more acculturated. Understanding of different levels of acculturation may help clinicians provide culturally sensitive end-of-life care. Copyright © 2017 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.
Ow, Yen Ling Mandy; Thumboo, Julian; Cella, David; Cheung, Yin Bun; Yong Fong, Kok; Wee, Hwee Lin
2011-06-01
To identify health-related quality of life (HRQOL) domains of importance to multiethnic Asian systemic lupus erythematosus (SLE) patients, to identify content gaps in existing SLE-specific HRQOL measures, and to determine whether the Patient-Reported Outcomes Measurement Information System (PROMIS) item banks could serve as a core set of questions for HRQOL assessment among SLE patients. English-speaking patients with physician-diagnosed SLE from a specialist clinic in a tertiary care hospital in Singapore and a patient support group were recruited. Thematic analysis was performed to distill themes from transcripts through open coding by 2 independent coders and axial coding for refinement of categories. Items from 3 existing SLE-specific measures and PROMIS Version 1.0 Item Banks were compared with identified subthemes. Twenty-seven female and 2 male participants (21 Chinese, 4 Malay, 3 Indian, 1 other) ages 23-62 years participated in 6 focus groups and 2 individual interviews, respectively. Twenty-one domains and 92 subthemes were identified. Domains of family, relationships, stigma and discrimination, and freedom were unaddressed by existing SLE-specific measures. Forty subthemes from 14 domains were addressed by the PROMIS Version 1.0 Item Banks (Physical Function, Pain, Fatigue, Sleep Disturbance, Sleep-Related Impairment, Anger, Anxiety, and Depression banks). Family and stigma and discrimination (identified as content gaps) may be accentuated in the Asian sociocultural context. PROMIS item banks have tremendous potential to serve as a core set of items for HRQOL assessment in SLE patients. Additional items may be written to fill the gaps in existing PROMIS item banks. Copyright © 2011 by the American College of Rheumatology.
2012-01-01
Background Item response theory (IRT) is extensively used to develop adaptive instruments of health-related quality of life (HRQoL). However, each IRT model has its own function to estimate item and category parameters, and hence different results may be found using the same response categories with different IRT models. The present study used the Rasch rating scale model (RSM) to examine and reassess the psychometric properties of the Persian version of the PedsQLTM 4.0 Generic Core Scales. Methods The PedsQLTM 4.0 Generic Core Scales was completed by 938 Iranian school children and their parents. Convergent, discriminant and construct validity of the instrument were assessed by classical test theory (CTT). The RSM was applied to investigate person and item reliability, item statistics and ordering of response categories. Results The CTT method showed that the scaling success rate for convergent and discriminant validity were 100% in all domains with the exception of physical health in the child self-report. Moreover, confirmatory factor analysis supported a four-factor model similar to its original version. The RSM showed that 22 out of 23 items had acceptable infit and outfit statistics (<1.4, >0.6), person reliabilities were low, item reliabilities were high, and item difficulty ranged from -1.01 to 0.71 and -0.68 to 0.43 for child self-report and parent proxy-report, respectively. Also the RSM showed that successive response categories for all items were not located in the expected order. Conclusions This study revealed that, in all domains, the five response categories did not perform adequately. It is not known whether this problem is a function of the meaning of the response choices in the Persian language or an artifact of a mostly healthy population that did not use the full range of the response categories. The response categories should be evaluated in further validation studies, especially in large samples of chronically ill patients. PMID:22414135
Odukoya, Jonathan A; Adekeye, Olajide; Igbinoba, Angie O; Afolabi, A
2018-01-01
Teachers and Students worldwide often dance to the tune of tests and examinations. Assessments are powerful tools for catalyzing the achievement of educational goals, especially if done rightly. One of the tools for 'doing it rightly' is item analysis. The core objectives for this study, therefore, were: ascertaining the item difficulty and distractive indices of the university wide courses. A range of 112-1956 undergraduate students participated in this study. With the use of secondary data, the ex-post facto design was adopted for this project. In virtually all cases, majority of the items (ranging between 65% and 97% of the 70 items fielded in each course) did not meet psychometric standard in terms of difficulty and distractive indices and consequently needed to be moderated or deleted. Considering the importance of these courses, the need to apply item analyses when developing these tests was emphasized.
Nikiphorou, Elena; Mackie, Sarah L; Kirwan, John; Boers, Martin; Isaacs, John; Morgan, Ann W; Young, Adam
2017-04-01
To obtain consensus on the minimum data items for an observational cohort study in RA in the UK and to make available the process for similar studies and other rheumatic conditions. Individuals with a diverse range of expertise and backgrounds were invited to participate in a process of proposing a minimum core dataset (MCD) for research studies, commissioned by Arthritis Research UK as part of the larger INBANK project. The group included patients and representatives from clinical and academic rheumatology, outcomes science, stratified medicine, health economics, and national professional and academic bodies/committees. A process was devised based on OMERACT principles for reviewing aims/objectives, defining the scope, identifying the important research questions and selecting key domains. Following the initial multistakeholder meeting, subsequent teleconferences and email communications: consensus was obtained on the most important and relevant research questions; agreement on how the OMERACT Core Areas (life impact, pathophysiological manifestations, resource use and death) could form the basis of a MCD; and consensus on 22 items for inclusion into a MCD. Workshops were undertaken for two essential items that required further exploration: work/social participation and co-morbidity. Reaching a consensus for the proposed minimal data items for long-term observational cohort studies of RA in the UK posed novel challenges and opportunities, and was largely successful. Further work is needed for selecting instruments for two important items and for achieving compatibility with other UK national initiatives, and more widely across Europe. © The Author 2016. Published by Oxford University Press on behalf of the British Society for Rheumatology. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Brazier, John E.; Rowen, Donna; Barkham, Michael
2013-01-01
Background. The Clinical Outcomes in Routine Evaluation–Outcome Measure (CORE-OM) is used to evaluate the effectiveness of psychological therapies in people with common mental disorders. The objective of this study was to estimate a preference-based index for this population using CORE-6D, a health state classification system derived from the CORE-OM consisting of a 5-item emotional component and a physical item, and to demonstrate a novel method for generating states that are not orthogonal. Methods. Rasch analysis was used to identify 11 emotional health states from CORE-6D that were frequently observed in the study population and are, thus, plausible (in contrast, conventional statistical design might generate implausible states). Combined with the 3 response levels of the physical item of CORE-6D, they generate 33 plausible health states, 18 of which were selected for valuation. A valuation survey of 220 members of the public in South Yorkshire, United Kingdom, was undertaken using the time tradeoff (TTO) method. Regression analysis was subsequently used to predict values for all possible states described by CORE-6D. Results. A number of multivariate regression models were built to predict values for the 33 health states of CORE-6D, using the Rasch logit value of the emotional state and the response level of the physical item as independent variables. A cubic model with high predictive value (adjusted R2 = 0.990) was selected to predict TTO values for all 729 CORE-6D health states. Conclusion. The CORE-6D preference-based index will enable the assessment of cost-effectiveness of interventions for people with common mental disorders using existing and prospective CORE-OM data sets. The new method for generating states may be useful for other instruments with highly correlated dimensions. PMID:23178639
Validating a conceptual framework for the core concept of "cell-cell communication".
Michael, Joel; Martinkova, Patricia; McFarland, Jenny; Wright, Ann; Cliff, William; Modell, Harold; Wenderoth, Mary Pat
2017-06-01
We have created and validated a conceptual framework for the core physiology concept of "cell-cell communication." The conceptual framework is composed of 51 items arranged in a hierarchy that is, in some instances, four levels deep. We have validated it with input from faculty who teach at a wide variety of institutional types. All items making up the framework were deemed essential to moderately important. However, some of the main ideas were clearly judged to be more important than others. Furthermore, the lower in the hierarchy an item is, the less important it is thought to be. Finally, there was no significant difference in the ratings given by faculty at different types of institutions. Copyright © 2017 the American Physiological Society.
Development of the PROMIS nicotine dependence item banks.
Shadel, William G; Edelen, Maria Orlando; Tucker, Joan S; Stucky, Brian D; Hansen, Mark; Cai, Li
2014-09-01
Nicotine dependence is a core construct important for understanding cigarette smoking and smoking cessation behavior. This article describes analyses conducted to develop and evaluate item banks for assessing nicotine dependence among daily and nondaily smokers. Using data from a sample of daily (N = 4,201) and nondaily (N =1,183) smokers, we conducted a series of item factor analyses, item response theory analyses, and differential item functioning analyses (according to gender, age, and race/ethnicity) to arrive at a unidimensional set of nicotine dependence items for daily and nondaily smokers. We also evaluated performance of short forms (SFs) and computer adaptive tests (CATs) to efficiently assess dependence. A total of 32 items were included in the Nicotine Dependence item banks; 22 items are common across daily and nondaily smokers, 5 are unique to daily smokers, and 5 are unique to nondaily smokers. For both daily and nondaily smokers, the Nicotine Dependence item banks are strongly unidimensional, highly reliable (reliability = 0.97 and 0.97, respectively), and perform similarly across gender, age, and race/ethnicity groups. SFs common to daily and nondaily smokers consist of 8 and 4 items (reliability = 0.91 and 0.81, respectively). Results from simulated CATs showed that dependence can be assessed with very good precision for most respondents using fewer than 6 items adaptively selected from the item banks. Nicotine dependence on cigarettes can be assessed on the basis of these item banks via one of the SFs, by using CATs, or through a tailored set of items selected for a specific research purpose. © The Author 2014. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
The (mis)measurement of the Dark Triad Dirty Dozen: exploitation at the core of the scale
Kajonius, Petri J.; Persson, Björn N.; Rosenberg, Patricia
2016-01-01
Background. The dark side of human character has been conceptualized in the Dark Triad Model: Machiavellianism, psychopathy, and narcissism. These three dark traits are often measured using single long instruments for each one of the traits. Nevertheless, there is a necessity of short and valid personality measures in psychological research. As an independent research group, we replicated the factor structure, convergent validity and item response for one of the most recent and widely used short measures to operationalize these malevolent traits, namely, Jonason’s Dark Triad Dirty Dozen. We aimed to expand the understanding of what the Dirty Dozen really captures because the mixed results on construct validity in previous research. Method. We used the largest sample to date to respond to the Dirty Dozen (N = 3,698). We firstly investigated the factor structure using Confirmatory Factor Analysis and an exploratory distribution analysis of the items in the Dirty Dozen. Secondly, using a sub-sample (n = 500) and correlation analyses, we investigated the Dirty Dozen dark traits convergent validity to Machiavellianism measured by the Mach-IV, psychopathy measured by Eysenck’s Personality Questionnaire Revised, narcissism using the Narcissism Personality Inventory, and both neuroticism and extraversion from the Eysenck’s questionnaire. Finally, besides these Classic Test Theory analyses, we analyzed the responses for each Dirty Dozen item using Item Response Theory (IRT). Results. The results confirmed previous findings of a bi-factor model fit: one latent core dark trait and three dark traits. All three Dirty Dozen traits had a striking bi-modal distribution, which might indicate unconcealed social undesirability with the items. The three Dirty Dozen traits did converge too, although not strongly, with the contiguous single Dark Triad scales (r between .41 and .49). The probabilities of filling out steps on the Dirty Dozen narcissism-items were much higher than on the Dirty Dozen items for Machiavellianism and psychopathy. Overall, the Dirty Dozen instrument delivered the most predictive value with persons with average and high Dark Triad traits (theta > −0.5). Moreover, the Dirty Dozen scale was better conceptualized as a combined Machiavellianism-psychopathy factor, not narcissism, and is well captured with item 4: ‘I tend to exploit others towards my own end.’ Conclusion. The Dirty Dozen showed a consistent factor structure, a relatively convergent validity similar to that found in earlier studies. Narcissism measured using the Dirty Dozen, however, did not contribute with information to the core of the Dirty Dozen construct. More importantly, the results imply that the core of the Dirty Dozen scale, a manipulative and anti-social trait, can be measured by a Single Item Dirty Dark Dyad (SIDDD). PMID:26966673
The (mis)measurement of the Dark Triad Dirty Dozen: exploitation at the core of the scale.
Kajonius, Petri J; Persson, Björn N; Rosenberg, Patricia; Garcia, Danilo
2016-01-01
Background. The dark side of human character has been conceptualized in the Dark Triad Model: Machiavellianism, psychopathy, and narcissism. These three dark traits are often measured using single long instruments for each one of the traits. Nevertheless, there is a necessity of short and valid personality measures in psychological research. As an independent research group, we replicated the factor structure, convergent validity and item response for one of the most recent and widely used short measures to operationalize these malevolent traits, namely, Jonason's Dark Triad Dirty Dozen. We aimed to expand the understanding of what the Dirty Dozen really captures because the mixed results on construct validity in previous research. Method. We used the largest sample to date to respond to the Dirty Dozen (N = 3,698). We firstly investigated the factor structure using Confirmatory Factor Analysis and an exploratory distribution analysis of the items in the Dirty Dozen. Secondly, using a sub-sample (n = 500) and correlation analyses, we investigated the Dirty Dozen dark traits convergent validity to Machiavellianism measured by the Mach-IV, psychopathy measured by Eysenck's Personality Questionnaire Revised, narcissism using the Narcissism Personality Inventory, and both neuroticism and extraversion from the Eysenck's questionnaire. Finally, besides these Classic Test Theory analyses, we analyzed the responses for each Dirty Dozen item using Item Response Theory (IRT). Results. The results confirmed previous findings of a bi-factor model fit: one latent core dark trait and three dark traits. All three Dirty Dozen traits had a striking bi-modal distribution, which might indicate unconcealed social undesirability with the items. The three Dirty Dozen traits did converge too, although not strongly, with the contiguous single Dark Triad scales (r between .41 and .49). The probabilities of filling out steps on the Dirty Dozen narcissism-items were much higher than on the Dirty Dozen items for Machiavellianism and psychopathy. Overall, the Dirty Dozen instrument delivered the most predictive value with persons with average and high Dark Triad traits (theta > -0.5). Moreover, the Dirty Dozen scale was better conceptualized as a combined Machiavellianism-psychopathy factor, not narcissism, and is well captured with item 4: 'I tend to exploit others towards my own end.' Conclusion. The Dirty Dozen showed a consistent factor structure, a relatively convergent validity similar to that found in earlier studies. Narcissism measured using the Dirty Dozen, however, did not contribute with information to the core of the Dirty Dozen construct. More importantly, the results imply that the core of the Dirty Dozen scale, a manipulative and anti-social trait, can be measured by a Single Item Dirty Dark Dyad (SIDDD).
Writing Multiple Choice Outcome Questions to Assess Knowledge and Competence.
Brady, Erik D
2015-11-01
Few articles contemplate the need for good guidance in question item-writing in the continuing education (CE) space. Although many of the core principles of sound item design translate to the CE health education team, the need exists for specific examples for nurse educators that clearly describe how to measure changes in competence and knowledge using multiple choice items. In this article, some keys points and specific examples for nursing CE providers are shared. Copyright 2015, SLACK Incorporated.
Billington, D. Rex; Hsu, Patricia Hsien-Chuan; Feng, Xuan Joanna; Medvedev, Oleg N.; Kersten, Paula; Landon, Jason; Siegert, Richard J.
2016-01-01
The World Health Organisation Quality of Life (WHOQOL) questionnaires are widely used around the world and can claim strong cross-cultural validity due to their development in collaboration with international field centres. To enhance conceptual equivalence of quality of life across cultures, optional national items are often developed for use alongside the core instrument. The present study outlines the development of national items for the New Zealand WHOQOL-BREF. Focus groups with members of the community as well as health experts discussed what constitutes quality of life in their opinion. Based on themes extracted of aspects not contained in the existing WHOQOL instrument, 46 candidate items were generated and subsequently rated for their importance by a random sample of 585 individuals from the general population. Applying importance criteria reduced these items to 24, which were then sent to another large random sample (n = 808) to be rated alongside the existing WHOQOL-BREF. A final set of five items met the criteria for national items. Confirmatory factor analysis identified four national items as belonging to the psychological domain of quality of life, and one item to the social domain. Rasch analysis validated these results and generated ordinal-to-interval conversion algorithms to allow use of parametric statistics for domain scores with and without national items. PMID:27812203
A new scale for disaster nursing core competencies: Development and psychometric testing.
Al Thobaity, Abdulellah; Williams, Brett; Plummer, Virginia
2016-02-01
All nurses must have core competencies in preparing for, responding to and recovering from a disaster. In the Kingdom of Saudi Arabia (KSA), as in many other countries, disaster nursing core competencies are not fully understood and lack reliable, validated tools. Thus, it is imperative to develop a scale for exploring disaster nursing core competencies, roles and barriers in the KSA. This study's objective is to develop a valid, reliable scale that identifies and explores core competencies of disaster nursing, nurses' roles in disaster management and barriers to developing disaster nursing in the KSA. This study developed a new scale testing its validity and reliability. A principal component analysis (PCA) was used to develop and test psychometric properties of the new scale. The PCA used a purposive sample of nurses from emergency departments in two hospitals in the KSA. Participants rated 93 paper-based, self-report questionnaire items from 1 to 10 on a Likert scale. PCA using Varimax rotation was conducted to explore factors emerging from responses. The study's participants were 132 nurses (66% response rate). PCA of the 93 questionnaire items revealed 49 redundant items (which were deleted) and 3 factors with eigenvalues of >1. The remaining 44 items accounted for 77.3% of the total variance. The overall Cronbach's alpha was 0.96 for all factors: 0.98 for Factor 1, 0.92 for Factor 2 and 0.86 for Factor 3. This study provided a validated, reliable scale for exploring nurses' core competencies, nurses' roles and barriers to developing disaster nursing in the KSA. The new scale has many implications, such as for improving education, planning and curricula. Crown Copyright © 2015. Published by Elsevier Ltd. All rights reserved.
Deng, Nina; Anatchkova, Milena D; Waring, Molly E; Han, Kyung T; Ware, John E
2015-08-01
The Quality-of-life (QOL) Disease Impact Scale (QDIS(®)) standardizes the content and scoring of QOL impact attributed to different diseases using item response theory (IRT). This study examined the IRT invariance of the QDIS-standardized IRT parameters in an independent sample. The differential functioning of items and test (DFIT) of a static short-form (QDIS-7) was examined across two independent sources: patients hospitalized for acute coronary syndrome (ACS) in the TRACE-CORE study (N = 1,544) and chronically ill US adults in the QDIS standardization sample. "ACS-specific" IRT item parameters were calibrated and linearly transformed to compare to "standardized" IRT item parameters. Differences in IRT model-expected item, scale and theta scores were examined. The DFIT results were also compared in a standard logistic regression differential item functioning analysis. Item parameters estimated in the ACS sample showed lower discrimination parameters than the standardized discrimination parameters, but only small differences were found for thresholds parameters. In DFIT, results on the non-compensatory differential item functioning index (range 0.005-0.074) were all below the threshold of 0.096. Item differences were further canceled out at the scale level. IRT-based theta scores for ACS patients using standardized and ACS-specific item parameters were highly correlated (r = 0.995, root-mean-square difference = 0.09). Using standardized item parameters, ACS patients scored one-half standard deviation higher (indicating greater QOL impact) compared to chronically ill adults in the standardization sample. The study showed sufficient IRT invariance to warrant the use of standardized IRT scoring of QDIS-7 for studies comparing the QOL impact attributed to acute coronary disease and other chronic conditions.
Coulman, Karen D; Hopkins, James; Brookes, Sara T; Chalmers, Katy; Main, Barry; Owen-Smith, Amanda; Andrews, Robert C; Byrne, James; Donovan, Jenny L; Mazza, Graziella; Reeves, Barnaby C; Rogers, Chris A; Thompson, Janice L; Welbourn, Richard; Wordsworth, Sarah; Blazeby, Jane M
2016-11-01
Bariatric and metabolic surgery is used as a treatment for patients with severe and complex obesity. However, there is a need to improve outcome selection and reporting in bariatric surgery trials. A Core Outcome Set (COS), an agreed minimum set of outcomes reported in all studies of a specific condition, may achieve this. Here, we present the development of a COS for BARIAtric and metabolic surgery Clinical Trials-the BARIACT Study. Outcomes identified from systematic reviews and patient interviews informed a questionnaire survey. Patients and health professionals were surveyed three times and asked to rate the importance of each item on a 1-9 scale. Delphi methods provided anonymised feedback to participants. Items not meeting predefined criteria were discarded between rounds. Remaining items were discussed at consensus meetings, held separately with patients and professionals, where the COS was agreed. Data sources identified 2,990 outcomes, which were used to develop a 130-item questionnaire. Round 1 response rates were moderate but subsequently improved to above 75% for other rounds. After rounds 2 and 3, 81 and 14 items were discarded, respectively, leaving 35 items for discussion at consensus meetings. The final COS included nine items: "weight," "diabetes status," "cardiovascular risk," "overall quality of life (QOL)," "mortality," "technical complications of the specific operation," "any re-operation/re-intervention," "dysphagia/regurgitation," and "micronutrient status." The main limitation of this study was that it was based in the United Kingdom only. The COS is recommended to be used as a minimum in all trials of bariatric and metabolic surgery. Adoption of the COS will improve data synthesis and the value of research data. Future work will establish methods for the measurement of the outcomes in the COS.
Thorlacius, L.; Garg, A.; Ingram, J.R.; Villumsen, B.; Riis, P. Theut; Gottlieb, A.B.; Merola, J.F.; Dellavalle, R.; Ardon, C.; Baba, R.; Bechara, F.G.; Cohen, A.D.; Daham, N.; Davis, M.; Emtestam, L.; Fernández-Peñas, P.; Filippelli, M.; Gibbons, A.; Grant, T.; Guilbault, S.; Gulliver, S.; Harris, C; Harvent, C.; Houston, K.; Kirby, J.S.; Matusiak, L.; Mehdizadeh, A.; Mojica, T.; Okun, M.; Orgill, D.; Pallack, L.; Parks-Miller, A.; Prens, E.P.; Randell, S.; Rogers, C.; Rosen, C.F.; Choon, S.E.; van der Zee, H.H.; Christensen, R.; Jemec, G.B.E.
2018-01-01
Summary Background A core outcomes set (COS) is an agreed minimum set of outcomes that should be measured and reported in all clinical trials for a specific condition. Hidradenitis suppurativa (HS) has no agreed-upon COS. A central aspect in the COS development process is to identify a set of candidate outcome domains from a long list of items. Our long list had been developed from patient interviews, a systematic review of the literature and a healthcare professional survey, and initial votes had been cast in two e-Delphi surveys. In this manuscript, we describe two in-person consensus meetings of Delphi participants designed to ensure an inclusive approach to generation of domains from related items. Objectives To consider which items from a long list of candidate items to exclude and which to cluster into outcome domains. Methods The study used an international and multistakeholder approach, involving patients, dermatologists, surgeons, the pharmaceutical industry and medical regulators. The study format was a combination of formal presentations, small group work based on nominal group theory and a subsequent online confirmation survey. Results Forty-one individuals from 13 countries and four continents participated. Nine items were excluded and there was consensus to propose seven domains: disease course, physical signs, HS-specific quality of life, satisfaction, symptoms, pain and global assessments. Conclusions The HISTORIC consensus meetings I and II will be followed by further e-Delphi rounds to finalize the core domain set, building on the work of the in-person consensus meetings. PMID:29080368
Measurement of attitudes of U.K. dental practitioners to core job constructs.
Harris, R V; Ashcroft, A; Burnside, G; Dancer, J M; Smith, D; Grieveson, B
2009-03-01
To develop a measure to identify dental practitioner attitudes towards core job dimensions relating to job satisfaction and motivation and to test this against practice characteristics and provider attributes of U.K. practitioners. an 83-item questionnaire was developed from open-ended interviews with practitioners and use of items in previously used dentist job satisfaction questionnaires. This was subsequently sent to 684 practitioners. Item analysis reduced the item pool to 40 items and factor analysis (PCA) was undertaken. 440 (64%) dentists responded. Factor analysis resulted in six factors being identified as distinguishable job dimensions, overall Cronbach's alpha = 0.88. The factors were: 'restriction in being able to provide quality care (F1)', 'respect from being a dentist (F2)', 'control of work (F3)', 'running a practice (F4)', 'clinical skills (F5)', and 'caring for patients (F6)'. All six factors were correlated with a global job satisfaction score, although F1 was most strongly related (r = 0.60). Regression model analysis revealed that 'whether the dentist worked within the National Health Service or wholly or partly in the private sector' (p < 0.001), 'time since qualification' (p = 0.009), and the position of the dentist within the practice (whether a practice owner or associate dentist), (p = 0.047) were predictive of this factor. Six core job constructs of U.K. practitioners have been identified, together with several practice characteristics and practitioner attributes which predict these factors. The study demonstrates the importance of refining measures of dentists' job satisfaction to take account of the culture and the system in which the practitioner works.
Competency standards for newly graduated prosthetist/orthotists in Sweden.
Ramstrand, Nerrolyn; Ramstrand, Simon
2018-05-01
There are currently no national competency standards upon which to develop educational objectives for prosthetist/orthotists in Sweden. While standards have been developed in other countries, they cannot be applied without confirming their relevance in a Swedish context. To describe and obtain consensus on core competencies required for newly graduated prosthetist/orthotists in Sweden. Modified Delphi process. A modified Delphi technique was carried out. Focus groups were initially used to identify core competency domains. Two consecutive questionnaires, containing a list of potential competency items, were sent to a group of stakeholders with ties to the prosthetic and orthotic profession. Stakeholders were requested to rate their level of agreement with each competency item and provide written comments. Finally, two focus groups were conducted to obtain feedback on the draft competency standards. Forty-four competency items, listed under five key domains of practice, were identified as essential for newly graduated prosthetist/orthotists in Sweden. Many similarities exist in core competency descriptions for prosthetist/orthotists in Sweden when compared to other countries. Regional differences do however exist, and it is important to confirm the relevance of core competency items at a national level before they are applied. Clinical relevance Competency standards developed in this study can be used to guide development of learning objectives within an undergraduate prosthetic and orthotic program, provide a framework for workforce development, assist professional organizations in understanding the needs of their members, and prepare for international accreditation.
Testing to the Top: Everything But the Kitchen Sink?
ERIC Educational Resources Information Center
Dietel, Ron
2011-01-01
Two tests intended to measure student achievement of the Common Core State Standards will face intense scrutiny, but the test makers say they will include performance assessments and other items that are not multiple-choice questions. Incorporating performance items on this tests will bring up issues over scoring, costs, and validity.
Indicators of Family Care for Development for Use in Multicountry Surveys
Kariger, Patricia; Engle, Patrice; Britto, Pia M. Rebello; Sywulka, Sara M.; Menon, Purnima
2012-01-01
Indicators of family care for development are essential for ascertaining whether families are providing their children with an environment that leads to positive developmental outcomes. This project aimed to develop indicators from a set of items, measuring family care practices and resources important for caregiving, for use in epidemiologic surveys in developing countries. A mixed method (quantitative and qualitative) design was used for item selection and evaluation. Qualitative and quantitative analyses were conducted to examine the validity of candidate items in several country samples. Qualitative methods included the use of global expert panels to identify and evaluate the performance of each candidate item as well as in-country focus groups to test the content validity of the items. The quantitative methods included analyses of item-response distributions, using bivariate techniques. The selected items measured two family care practices (support for learning/stimulating environment and limit-setting techniques) and caregiving resources (adequacy of the alternate caregiver when the mother worked). Six play-activity items, indicative of support for learning/stimulating environment, were included in the core module of UNICEF's Multiple Cluster Indictor Survey 3. The other items were included in optional modules. This project provided, for the first time, a globally-relevant set of items for assessing family care practices and resources in epidemiological surveys. These items have multiple uses, including national monitoring and cross-country comparisons of the status of family care for development used globally. The obtained information will reinforce attention to efforts to improve the support for development of children. PMID:23304914
Irrational Delay Revisited: Examining Five Procrastination Scales in a Global Sample
Svartdal, Frode; Steel, Piers
2017-01-01
Scales attempting to measure procrastination focus on different facets of the phenomenon, yet they share a common understanding of procrastination as an unnecessary, unwanted, and disadvantageous delay. The present paper examines in a global sample (N = 4,169) five different procrastination scales – Decisional Procrastination Scale (DPS), Irrational Procrastination Scale (IPS), Pure Procrastination Scale (PPS), Adult Inventory of Procrastination Scale (AIP), and General Procrastination Scale (GPS), focusing on factor structures and item functioning using Confirmatory Factor Analysis and Item Response Theory. The results indicated that The PPS (12 items selected from DPS, AIP, and GPS) measures different facets of procrastination even better than the three scales it is based on. An even shorter version of the PPS (5 items focusing on irrational delay), corresponds well to the nine-item IPS. Both scales demonstrate good psychometric properties and appear to be superior measures of core procrastination attributes than alternative procrastination scales. PMID:29163302
Irrational Delay Revisited: Examining Five Procrastination Scales in a Global Sample.
Svartdal, Frode; Steel, Piers
2017-01-01
Scales attempting to measure procrastination focus on different facets of the phenomenon, yet they share a common understanding of procrastination as an unnecessary, unwanted, and disadvantageous delay. The present paper examines in a global sample ( N = 4,169) five different procrastination scales - Decisional Procrastination Scale (DPS), Irrational Procrastination Scale (IPS), Pure Procrastination Scale (PPS), Adult Inventory of Procrastination Scale (AIP), and General Procrastination Scale (GPS), focusing on factor structures and item functioning using Confirmatory Factor Analysis and Item Response Theory. The results indicated that The PPS (12 items selected from DPS, AIP, and GPS) measures different facets of procrastination even better than the three scales it is based on. An even shorter version of the PPS (5 items focusing on irrational delay), corresponds well to the nine-item IPS. Both scales demonstrate good psychometric properties and appear to be superior measures of core procrastination attributes than alternative procrastination scales.
Choosing Wisely: the American College of Rheumatology's Top 5 for pediatric rheumatology.
Rouster-Stevens, Kelly A; Ardoin, Stacy P; Cooper, Ashley M; Becker, Mara L; Dragone, Leonard L; Huttenlocher, Anna; Jones, Karla B; Kolba, Karen S; Moorthy, L Nandini; Nigrovic, Peter A; Stinson, Jennifer N; Ferguson, Polly J
2014-05-01
To create a pediatric rheumatology Top 5 list as part of the American Board of Internal Medicine Foundation's Choosing Wisely campaign. Delphi surveys of a core group of representative pediatric rheumatology providers from across North America generated candidate Top 5 items. Items with high content agreement and perceived to be of prevalent use and of high impact were included in a survey of all American College of Rheumatology (ACR) members who identified themselves as providing care to pediatric patients. Items with the highest ratings were subjected to literature review and further evaluation. A total of 121 candidate items were proposed in the initial Delphi survey and were reduced to 28 items in subsequent surveys. These 28 items were sent to 1,198 rheumatology providers who care for pediatric patients, and 397 (33%) responded. Based upon survey data and literature review, the Top 5 items were identified. These items focused on testing for antinuclear antibodies, autoantibody panels, Lyme disease, methotrexate toxicity monitoring, and use of routine radiographs. The ACR pediatric rheumatology Top 5 is one of the first pediatric subspecialty-specific Choosing Wisely Top 5 lists and provides an opportunity for patients and providers to discuss appropriate use of health care in pediatric rheumatology. Copyright © 2014 by the American College of Rheumatology.
Saravia, Luisa; González-Zapata, Laura I; Rendo-Urteaga, Tara; Ramos, Jamile; Collese, Tatiana Sadalla; Bove, Isabel; Delgado, Carlos; Tello, Florencia; Iglesia, Iris; Gonçalves Sousa, Ederson Dassler; De Moraes, Augusto César Ferreira; Carvalho, Heráclito Barbosa; Moreno, Luis A
2018-03-01
This study aimed to describe the development of a food frequency questionnaire (FFQ) to assess dietary intake in South American children and adolescents. A total of 345 children (aged 3-10 years) and 357 adolescents (aged 11-17 years) were included for analysis. The FFQ was designed to be self-administered and to assess dietary intake over the past 3 months. It was developed in Spanish and translated into Portuguese. Multiple approaches were considered to compile the food list, and 11 food groups were included. A food photo booklet was produced as supporting material. The FFQ items maintained a common core list among centers (47 items) and country-specific foods. The FFQ for Buenos Aires and Lima had a total of 63 items; there were 55 items for the FFQ in Medelin, 60 items for Montevideo, 58 items for Santiago, 67 items for Sao Paulo, and 68 items for Teresina. Alcohol was also incorporated in the adolescents' FFQ. We developed a semiquantitative, culturally adapted FFQ to assess dietary intake in children and adolescents in South America. It has an optimal size allowing its completion in a high proportion of the population; therefore, it can be used in epidemiological studies with South American children and adolescents. © 2018 The Obesity Society.
COS-STAR: a reporting guideline for studies developing core outcome sets (protocol).
Kirkham, Jamie J; Gorst, Sarah; Altman, Douglas G; Blazeby, Jane; Clarke, Mike; Devane, Declan; Gargon, Elizabeth; Williamson, Paula R
2015-08-22
Core outcome sets can increase the efficiency and value of research and, as a result, there are an increasing number of studies looking to develop core outcome sets (COS). However, the credibility of a COS depends on both the use of sound methodology in its development and clear and transparent reporting of the processes adopted. To date there is no reporting guideline for reporting COS studies. The aim of this programme of research is to develop a reporting guideline for studies developing COS and to highlight some of the important methodological considerations in the process. The study will include a reporting guideline item generation stage which will then be used in a Delphi study. The Delphi study is anticipated to include two rounds. The first round will ask stakeholders to score the items listed and to add any new items they think are relevant. In the second round of the process, participants will be shown the distribution of scores for all stakeholder groups separately and asked to re-score. A final consensus meeting will be held with an expert panel and stakeholder representatives to review the guideline item list. Following the consensus meeting, a reporting guideline will be drafted and review and testing will be undertaken until the guideline is finalised. The final outcome will be the COS-STAR (Core Outcome Set-STAndards for Reporting) guideline for studies developing COS and a supporting explanatory document. To assess the credibility and usefulness of a COS, readers of a COS development report need complete, clear and transparent information on its methodology and proposed core set of outcomes. The COS-STAR guideline will potentially benefit all stakeholders in COS development: COS developers, COS users, e.g. trialists and systematic reviewers, journal editors, policy-makers and patient groups.
ERIC Educational Resources Information Center
Donovan, Courtney; Green, Kathy E.; Seidel, Kent
2017-01-01
Core competencies essential for effective teaching were identified via a literature review and a review of standards for teacher education, and vetted by state groups with interests in teacher education. Survey items based on these competencies asked teacher candidates, graduates, and teacher education program faculty how well the program prepared…
ERIC Educational Resources Information Center
Scott, William H. O., Ed.
A basic buying list for libraries seeking to develop their Far East holdings is given in this bibliography. Over 1700 items include published material up to 1973--books, periodicals, films, filmstrips, tapes, and phonograph records--pertaining to China, Formosa, Japan, Korea, Mongolia and Tibet. The items are arranged geographically with topical…
Abdulelah, Juman; Sulaiman, Syed Azhar Syed; Hassali, Mohamed A; Blebil, Ali Q; Awaisu, Ahmed; Bredle, Jason M
2015-05-01
Various generic instruments exist to assess health-related quality of life (HRQOL) in patients with tuberculosis (TB), but a psychometrically sound disease-specific instrument is lacking. The present study aimed to develop and psychometrically validate a multidimensional TB-specific HRQOL instrument relevant to the value of patients with pulmonary TB in Iraq with an eye toward cross-cultural application. The core general HRQOL questionnaire is composed of the Functional Assessment of Cancer Therapy-General items. A modular approach was followed for the development of the Functional Assessment of Chronic Illness Therapy-Tuberculosis (FACIT-TB) questionnaire in which a set of items assessing quality-of-life (QOL) issues not sufficiently covered by the core Functional Assessment of Cancer Therapy-General items, but considered to be relevant to the target population, was added. Moreover, principal-component analysis was used to determine the new subscale structure of the questionnaire. In addition to the 27 items of the core questionnaire, a set of 20 items referring to disease symptoms related to the site of infection, adverse effects, and additional QOL dimensions such as fatigue, social stigma, and economic burden of the illness was included. Factor analysis demonstrated that the FACIT-TB construct comprised five domains. A rigorous method was applied in the development of the FACIT-TB measure to fully understand the impact of TB on patients' QOL. The instrument is psychometrically sound and portrays multiple important dimensions of HRQOL. FACIT-TB is relatively brief, is easy to administer and score, and is appropriate for use in clinical trials and practice. Copyright © 2015 International Society for Pharmacoeconomics and Outcomes Research (ISPOR). Published by Elsevier Inc. All rights reserved.
Decoding the content of recollection within the core recollection network and beyond.
Thakral, Preston P; Wang, Tracy H; Rugg, Michael D
2017-06-01
Recollection - retrieval of qualitative information about a past event - is associated with enhanced neural activity in a consistent set of neural regions (the 'core recollection network') seemingly regardless of the nature of the recollected content. Here, we employed multi-voxel pattern analysis (MVPA) to assess whether retrieval-related functional magnetic resonance imaging (fMRI) activity in core recollection regions - including the hippocampus, angular gyrus, medial prefrontal cortex, retrosplenial/posterior cingulate cortex, and middle temporal gyrus - contain information about studied content and thus demonstrate retrieval-related 'reinstatement' effects. During study, participants viewed objects and concrete words that were subjected to different encoding tasks. Test items included studied words, the names of studied objects, or unstudied words. Participants judged whether the items were recollected, familiar, or new by making 'remember', 'know', and 'new' responses, respectively. The study history of remembered test items could be reliably decoded using MVPA in most regions, as well as from the dorsolateral prefrontal cortex, a region where univariate recollection effects could not be detected. The findings add to evidence that members of the core recollection network, as well as at least one neural region where mean signal is insensitive to recollection success, carry information about recollected content. Importantly, the study history of recognized items endorsed with a 'know' response could be decoded with equal accuracy. The results thus demonstrate a striking dissociation between mean signal and multi-voxel indices of recollection. Moreover, they converge with prior findings in suggesting that, as it is operationalized by classification-based MVPA, reinstatement is not uniquely a signature of recollection. Copyright © 2016 Elsevier Ltd. All rights reserved.
Jafari, Peyman; Bagheri, Zahra; Hashemi, Seyyedeh Zahra; Shalileh, Keivan
2013-06-06
Limited studies have examined the effect of differential item functioning (DIF) on comparing health related quality of life (HRQoL) scores across child self-reports and parent proxy-reports. This study aims to determine whether parents and children respond differently to the items in the Persian version of the PedsQoLTM 4.0 measure. The PedsQLTM 4.0 Generic Core Scales was completed by 938 child-parent dyads. The graded response model (GRM) was used to detect DIF between parents and children. The IRT analyses were conducted using IRTPRO 2.1.On the whole, our findings showed that 50% (4 out of 8) of the items in the physical subscale and 40% (2 out of 5) in both emotional and school subscales were flagged with DIF. Among the DIF items, 62.5% (5 out of 8) were uniform and the remaining 37.5% (3 out of 8) were non-uniform. Parents and children interpret certain items of the PedsQLTM 4.0 in a different ways, except for the social subscale. Hence, we should be cautious about using parent proxy-report as a substitute for a child's ratings.
Maizura, Husna; Masilamani, Retneswari; Aris, Tahir
2009-04-01
This small, cross-sectional study assessed the reliability of 3 scales from the Job Content Questionnaire (JCQ)-decision latitude, psychological job demand, and social support-in a group of office workers in a multinational company in Kuala Lumpur. A universal sample of 30 white-collar workers from a department of the company self-administered the English version of the JCQ comprising 21 core items selected from the full recommended version of 49 items on-site. Reliability (internal consistency) was evaluated using Cronbach's alpha coefficients for each scale. Corrected item-total correlation was presented for each and every item. Cronbach's alpha coefficients were acceptable for decision latitude (.76) and social support (.79) but slightly lower for psychological job demand (.64). Values for all item-total correlations for all 3 scales were greater than .3. In conclusion, this study suggests that the JCQ is a reliable scale for assessing job stress in this group of workers.
Singer, Susanne; Araújo, Cláudia; Arraras, Juan Ignacio; Baumann, Ingo; Boehm, Andreas; Brokstad Herlofson, Bente; Castro Silva, Joaquim; Chie, Wei-Chu; Fisher, Sheila; Guntinas-Lichius, Orlando; Hammerlid, Eva; Irarrázaval, María Elisa; Jensen Hjermstad, Marianne; Jensen, Kenneth; Kiyota, Naomi; Licitra, Lisa; Nicolatou-Galitis, Ourania; Pinto, Monica; Santos, Marcos; Schmalz, Claudia; Sherman, Allen C; Tomaszewska, Iwona M; Verdonck de Leeuw, Irma; Yarom, Noam; Zotti, Paola; Hofmeister, Dirk
2015-09-01
The objective of this study was to pilot test an updated version of the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire Head and Neck Module (EORTC QLQ-H&N60). Patients with head and neck cancer were asked to complete a list of 60 head and neck cancer-specific items comprising the updated EORTC head and neck module and the core questionnaire EORTC QLQ-C30. Debriefing interviews were conducted to identify any irrelevant items and confusing or upsetting wording. Interviews were performed with 330 patients from 17 countries, representing different head and neck cancer sites and treatments. Forty-one of the 60 items were retained according to the predefined EORTC criteria for module development, for another 2 items the wording was refined, and 17 items were removed. The preliminary EORTC QLQ-H&N43 can now be used in academic research. Psychometrics will be tested in a larger field study. © 2014 Wiley Periodicals, Inc.
Langer, Michelle M.; Hill, Cheryl D.; Thissen, David; Burwinkle, Tasha M.; Varni, James W.; DeWalt, Darren A.
2008-01-01
Objective To demonstrate the value of item response theory (IRT) and differential item functioning (DIF) methods in examining a health-related quality of life (HRQOL) measure in children and adolescents. Study Design and Setting This illustration uses data from 5,429 children using the four subscales of the PedsQL™ 4.0 Generic Core Scales. The IRT model-based likelihood ratio test was used to detect and evaluate DIF between healthy children and children with a chronic condition. Results DIF was detected for a majority of items but cancelled out at the total test score level due to opposing directions of DIF. Post-hoc analysis indicated that this pattern of results may be due to multidimensionality. We discuss issues in detecting and handling DIF. Conclusion This paper describes how to perform DIF analyses in validating a questionnaire to ensure that scores have equivalent meaning across subgroups. It offers insight into ways information gained through the analysis can be used to evaluate an existing scale. PMID:18226750
Development and validation of the simulation-based learning evaluation scale.
Hung, Chang-Chiao; Liu, Hsiu-Chen; Lin, Chun-Chih; Lee, Bih-O
2016-05-01
The instruments that evaluate a student's perception of receiving simulated training are English versions and have not been tested for reliability or validity. The aim of this study was to develop and validate a Chinese version Simulation-Based Learning Evaluation Scale (SBLES). Four stages were conducted to develop and validate the SBLES. First, specific desired competencies were identified according to the National League for Nursing and Taiwan Nursing Accreditation Council core competencies. Next, the initial item pool was comprised of 50 items related to simulation that were drawn from the literature of core competencies. Content validity was established by use of an expert panel. Finally, exploratory factor analysis and confirmatory factor analysis were conducted for construct validity, and Cronbach's coefficient alpha determined the scale's internal consistency reliability. Two hundred and fifty students who had experienced simulation-based learning were invited to participate in this study. Two hundred and twenty-five students completed and returned questionnaires (response rate=90%). Six items were deleted from the initial item pool and one was added after an expert panel review. Exploratory factor analysis with varimax rotation revealed 37 items remaining in five factors which accounted for 67% of the variance. The construct validity of SBLES was substantiated in a confirmatory factor analysis that revealed a good fit of the hypothesized factor structure. The findings tally with the criterion of convergent and discriminant validity. The range of internal consistency for five subscales was .90 to .93. Items were rated on a 5-point scale from 1 (strongly disagree) to 5 (strongly agree). The results of this study indicate that the SBLES is valid and reliable. The authors recommend that the scale could be applied in the nursing school to evaluate the effectiveness of simulation-based learning curricula. Copyright © 2016 Elsevier Ltd. All rights reserved.
2011-01-01
Background Questionnaires are commonly used to collect patient, or user, experiences with health care encounters; however, their adaption to specific target groups limits comparison between groups. We present the construction of a generic questionnaire (maximum of ten questions) for user evaluation across a range of health care services. Methods Based on previous testing of six group-specific questionnaires, we first constructed a generic questionnaire with 23 items related to user experiences. All questions included a "not applicable" response option, as well as a follow-up question about the item's importance. Nine user groups from one health trust were surveyed. Seven groups received questionnaires by mail and two by personal distribution. Selection of core questions was based on three criteria: applicability (proportion "not applicable"), importance (mean scores on follow-up questions), and comprehensiveness (content coverage, maximum two items per dimension). Results 1324 questionnaires were returned providing subsample sizes ranging from 52 to 323. Ten questions were excluded because the proportion of "not applicable" responses exceeded 20% in at least one user group. The number of remaining items was reduced to ten by applying the two other criteria. The final short questionnaire included items on outcome (2), clinician services (2), user involvement (2), incorrect treatment (1), information (1), organisation (1), and accessibility (1). Conclusion The Generic Short Patient Experiences Questionnaire (GS-PEQ) is a short, generic set of questions on user experiences with specialist health care that covers important topics for a range of groups. It can be used alone or with other instruments in quality assessment or in research. The psychometric properties and the relevance of the GS-PEQ in other health care settings and countries need further evaluation. PMID:21510871
ConTour: Data-Driven Exploration of Multi-Relational Datasets for Drug Discovery.
Partl, Christian; Lex, Alexander; Streit, Marc; Strobelt, Hendrik; Wassermann, Anne-Mai; Pfister, Hanspeter; Schmalstieg, Dieter
2014-12-01
Large scale data analysis is nowadays a crucial part of drug discovery. Biologists and chemists need to quickly explore and evaluate potentially effective yet safe compounds based on many datasets that are in relationship with each other. However, there is a lack of tools that support them in these processes. To remedy this, we developed ConTour, an interactive visual analytics technique that enables the exploration of these complex, multi-relational datasets. At its core ConTour lists all items of each dataset in a column. Relationships between the columns are revealed through interaction: selecting one or multiple items in one column highlights and re-sorts the items in other columns. Filters based on relationships enable drilling down into the large data space. To identify interesting items in the first place, ConTour employs advanced sorting strategies, including strategies based on connectivity strength and uniqueness, as well as sorting based on item attributes. ConTour also introduces interactive nesting of columns, a powerful method to show the related items of a child column for each item in the parent column. Within the columns, ConTour shows rich attribute data about the items as well as information about the connection strengths to other datasets. Finally, ConTour provides a number of detail views, which can show items from multiple datasets and their associated data at the same time. We demonstrate the utility of our system in case studies conducted with a team of chemical biologists, who investigate the effects of chemical compounds on cells and need to understand the underlying mechanisms.
Brédart, A; Anota, A; Young, T; Tomaszewski, K A; Arraras, J I; Moura De Albuquerque Melo, H; Schmidt, H; Friend, E; Bergenmar, M; Costantini, A; Vassiliou, V; Hureaux, J; Marchal, F; Tomaszewska, I M; Chie, W-C; Ramage, J; Beaudeau, A; Conroy, T; Bleiker, E; Kulis, D; Bonnetain, F; Aaronson, N K
2018-01-01
Advances in cancer care delivery require revision and further development of questionnaires assessing patients' perceived quality of care. This study pre-tested the revised EORTC satisfaction with cancer care core questionnaire applicable in both the cancer inpatient and outpatient settings, and its new, outpatient-specific complementary module. The process of revision, development of the extended application, and pre-testing of these questionnaires was based on phases I to III of the "EORTC Quality of Life Group Module Development Guidelines." In phase III, patients in 11 countries in four European regions, South America and Asia completed provisional versions of the questionnaires. Fifty-seven relevant issues selected from literature reviews and input from experts were operationalized into provisional items, and subsequently translated into ten languages. Assessment of understanding, acceptability, redundancy and relevance by patients (n = 151) from oncology inpatient wards, and outpatient chemotherapy, radiotherapy and consultation settings, led to retention of, deletion of and merging of 40, 14 and 6 items respectively. Cronbach's alpha coefficients for hypothesized questionnaire scales were above 0.80. Our results provide preliminary support for the 33-item EORTC Satisfaction with cancer care core questionnaire and the 7-item complementary module specific for the outpatient care setting. A large scale phase IV cross-cultural psychometric study is now underway. © 2017 John Wiley & Sons Ltd.
Accumulation of Content Validation Evidence for the Critical Thinking Self-Assessment Scale.
Nair, Girija Gopinathan; Hellsten, Laurie-Ann M; Stamler, Lynnette Leeseberg
2017-04-01
Critical thinking skills (CTS) are essential for nurses; assessing students' acquisition of these skills is a mandate of nursing curricula. This study aimed to develop a self-assessment instrument of critical thinking skills (Critical Thinking Self-Assessment Scale [CTSAS]) for students' self-monitoring. An initial pool of 196 items across 6 core cognitive skills and 16 subskills were generated using the American Philosophical Association definition of CTS. Experts' content review of the items and their ratings provided evidence of content relevance using the item-level content validity index (I-CVI) and Aiken's content validity coefficient (VIk). 115 items were retained (range of I-CVI values = .70 to .94 and range of VIk values = .69-.95; significant at p< .05). The CTSAS is the first CTS instrument designed specifically for self-assessment purposes.
Using Likert-type and ipsative/forced choice items in sequence to generate a preference.
Ried, L Douglas
2014-01-01
Collaboration and implementation of a minimum, standardized set of core global educational and professional competencies seems appropriate given the expanding international evolution of pharmacy practice. However, winnowing down hundreds of competencies from a plethora of local, national and international competency frameworks to select the most highly preferred to be included in the core set is a daunting task. The objective of this paper is to describe a combination of strategies used to ascertain the most highly preferred items among a large number of disparate items. In this case, the items were >100 educational and professional competencies that might be incorporated as the core components of new and existing competency frameworks. Panelists (n = 30) from the European Union (EU) and United States (USA) were chosen to reflect a variety of practice settings. Each panelist completed two electronic surveys. The first survey presented competencies in a Likert-type format and the second survey presented many of the same competencies in an ipsative/forced choice format. Item mean scores were calculated for each competency, the competencies were ranked, and non-parametric statistical tests were used to ascertain the consistency in the rankings achieved by the two strategies. This exploratory study presented over 100 competencies to the panelists in the beginning. The two methods provided similar results, as indicated by the significant correlation between the rankings (Spearman's rho = 0.30, P < 0.09). A two-step strategy using Likert-type and ipsative/forced choice formats in sequence, appears to be useful in a situation where a clear preference is required from among a large number of choices. The ipsative/forced choice format resulted in some differences in the competency preferences because the panelists could not rate them equally by design. While this strategy was used for the selection of professional educational competencies in this exploratory study, it is applicable in other situations where a smaller set of highly preferred items might be selected from a large list of choices in other areas of inquiry (e.g., patient reported outcomes). Copyright © 2014 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Schultz, Madeleine; Lawrie, Gwendolyn A.; Bailey, Chantal H.; Bedford, Simon B.; Dargaville, Tim R.; O'Brien, Glennys; Tasker, Roy; Thompson, Christopher D.; Williams, Mark; Wright, Anthony H.
2017-03-01
A multi-institution collaborative team of Australian chemistry education researchers, teaching a total of over 3000 first year chemistry students annually, has explored a tool for diagnosing students' prior conceptions as they enter tertiary chemistry courses. Five core topics were selected and clusters of diagnostic items were assembled linking related concepts in each topic together. An ordered multiple choice assessment strategy was adopted to enable provision of formative feedback to students through combination of the specific distractors that they chose. Concept items were either sourced from existing research instruments or developed by the project team. The outcome is a diagnostic tool consisting of five topic clusters of five concept items that has been delivered in large introductory chemistry classes at five Australian institutions. Statistical analysis of data has enabled exploration of the composition and validity of the instrument including a comparison between delivery of the complete 25 item instrument with subsets of five items, clustered by topic. This analysis revealed that most items retained their validity when delivered in small clusters. Tensions between the assembly, validation and delivery of diagnostic instruments for the purposes of acquiring robust psychometric research data versus their pragmatic use are considered in this study.
Brief assessment of food insecurity accurately identifies high-risk US adults.
Gundersen, Craig; Engelhard, Emily E; Crumbaugh, Amy S; Seligman, Hilary K
2017-06-01
To facilitate the introduction of food insecurity screening into clinical settings, we examined the test performance of two-item screening questions for food insecurity against the US Department of Agriculture's Core Food Security Module. We examined sensitivity, specificity and accuracy of various two-item combinations of questions assessing food insecurity in the general population and high-risk population subgroups. 2013 Current Population Survey December Supplement, a population-based US survey. All survey participants from the general population and high-risk subgroups. The test characteristics of multiple two-item combinations of questions assessing food insecurity had adequate sensitivity (>97 %) and specificity (>70 %) for widespread adoption as clinical screening measures. We recommend two specific items for clinical screening programmes based on their widespread current use and high sensitivity for detecting food insecurity. These items query how often the household 'worried whether food would run out before we got money to buy more' and how often 'the food that we bought just didn't last and we didn't have money to get more'. The recommended items have sensitivity across high-risk population subgroups of ≥97 % and a specificity of ≥74 % for food insecurity.
Core and peripheral criteria of video game addiction in the game addiction scale for adolescents.
Brunborg, Geir Scott; Hanss, Daniel; Mentzoni, Rune Aune; Pallesen, Ståle
2015-05-01
Assessment of video game addiction often involves measurement of peripheral criteria that indicate high engagement with games, and core criteria that indicate problematic use of games. A survey of the Norwegian population aged 16-74 years (N=10,081, response rate 43.6%) was carried out in 2013, which included the Gaming Addiction Scale for Adolescents (GAS). Confirmatory factor analysis showed that a two-factor structure, which separated peripheral criteria from core criteria, fitted the data better (CFI=0.963; RMSEA=0.058) compared to the original one-factor solution where all items are determined to load only on one factor (CFI=0.905, RMSEA=0.089). This was also found when we analyzed men aged ≤33 years, men aged >33 years, women aged ≤33 years, and women aged >33 years separately. This indicates that the GAS measures both engagement and problems related to video games. Multi-group measurement invariance testing showed that the factor structure was valid in all four groups (configural invariance) for the two-factor structure but not for the one-factor structure. A novel approach to categorization of problem gamers and addicted gamers where only the core criteria items are used (the CORE 4 approach) was compared to the approach where all items are included (the GAS 7 approach). The current results suggest that the CORE 4 approach might be more appropriate for classification of problem gamers and addicted gamers compared to the GAS 7 approach.
Wu, Hua-hong; Li, Hui; Gao, Qian
2013-05-30
The quality of life in children with short stature was rarely studied in China, so we explore these children's quality of life and psychometric properties of the Chinese version of the Pediatric Quality of Life Inventory 4.0(PedsQL4.0) Generic Core Scales among children with short stature. A total of 201 children aged 8 ~ 18 years from the short stature clinic and other clinics of capital institute of pediatrics attended this study. The questionnaires include demographic information and PedsQL4.0 generic core scales. According to children's height, we divided them into three groups: short stature, normal short and normal group, then compared the score of scales by the height category. Moreover, we analyzed the reliability and validity of PedsQL4.0 generic core scales in these 201 children. The child self-report total PedsQL mean score, for the short stature, normal short and normal groups were 77.77 ± 9.69, 83.50 ± 8.56 and 87.36 ± 7.23; the parent-proxy total PedsQL mean score were 77.62 ± 10.50, 82.69 ± 8.35 and 84.91 ± 9.96 respectively. Both for children self- and parent proxy-reports, the Cronbach's α coefficients of total scale, psychosocial health and social functioning ranged between 0.74 and 0.80, it ranged between 0.51 and 0.66 in other dimensions. For child self-reports, the correlation coefficients of 17 items' scores (total 23 items) with the scores of dimensions they belong to were above 0.5, with the highest 0.759; the other 6 items' correlation coefficients were below 0.5, with the lowest 0.280. For parent proxy-reports, the correlation coefficients of 19 items' scores with the scores of dimension they belong to were above 0.5, with the highest 0.793, the other 4 items' below 0.5 with the lowest 0.243. The quality of life in children with short stature is worse than their normal peers by Peds QL4.0 generic core scales, the statues of their quality of life was positively related to their stature.
An Assessment of Mentoring Functions and Barriers to Mentoring
1999-12-01
were similarity between mentor and mentee and the quality of the supervisory relationship in terms of LMX and psychosocial and career development ... psychosocial (1985). These broad categories have remained at the core of mentoring from the time they were developed . Career development functions "help...internal consistency reported by Noe for the career development functions scale (7 items) was .89. The psychosocial functions scale, made up of 14 items
ERIC Educational Resources Information Center
Anderson, Daniel; Alonzo, Julie; Tindal, Gerald
2012-01-01
The purpose of this technical report is to document the piloting and scaling of new easyCBM mathematics test items aligned with the Common Core State Standards (CCSS) and to describe the process used to revise and supplement the 2012 research version easyCBM CCSS math tests in Grades 6-8. For all operational 2012 research version test forms (10…
The fertility quality of life (FertiQoL) tool: development and general psychometric properties†
Boivin, Jacky; Takefman, Janet; Braverman, Andrea
2011-01-01
BACKGROUND To develop the first international instrument to measure fertility quality of life (FertiQoL) in men and women experiencing fertility problems, to evaluate the preliminary psychometric properties of this new tool and to translate FertiQoL into multiple languages. METHOD We conducted a survey, both online and in fertility clinics in USA, Australia/New Zealand, Canada and UK. A total of 1414 people with fertility problems participated. The main outcome measure was the FertiQoL tool. RESULTS FertiQoL consists of 36 items that assess core (24 items) and treatment-related quality of life (QoL) (10 items) and overall life and physical health (2 items). Cronbach reliability statistics for the Core and Treatment FertiQoL (and subscales) were satisfactory and in the range of 0.72 and 0.92. Sensitivity analyses showed that FertiQoL detected expected relations between QoL and gender, parity and support-seeking. FertiQoL was translated into 20 languages by the same translation team with each translation verified by local bilingual fertility experts. CONCLUSIONS FertiQoL is a reliable measure of the impact of fertility problems and its treatment on QoL. Future research should establish its use in cross-cultural research and clinical work. PMID:21665875
Development and Validation of the Homeostasis Concept Inventory
McFarland, Jenny L.; Price, Rebecca M.; Wenderoth, Mary Pat; Martinková, Patrícia; Cliff, William; Michael, Joel; Modell, Harold; Wright, Ann
2017-01-01
We present the Homeostasis Concept Inventory (HCI), a 20-item multiple-choice instrument that assesses how well undergraduates understand this critical physiological concept. We used an iterative process to develop a set of questions based on elements in the Homeostasis Concept Framework. This process involved faculty experts and undergraduate students from associate’s colleges, primarily undergraduate institutions, regional and research-intensive universities, and professional schools. Statistical results provided strong evidence for the validity and reliability of the HCI. We found that graduate students performed better than undergraduates, biology majors performed better than nonmajors, and students performed better after receiving instruction about homeostasis. We used differential item analysis to assess whether students from different genders, races/ethnicities, and English language status performed differently on individual items of the HCI. We found no evidence of differential item functioning, suggesting that the items do not incorporate cultural or gender biases that would impact students’ performance on the test. Instructors can use the HCI to guide their teaching and student learning of homeostasis, a core concept of physiology. PMID:28572177
Silverstein, Michael J; Faraone, Stephen V; Alperin, Samuel; Leon, Terry L; Biederman, Joseph; Spencer, Thomas J; Adler, Lenard A
2018-02-01
The aim of this study is to validate the Adult ADHD Self-Report Scale (ASRS) and Adult ADHD Investigator Symptom Rating Scale (AISRS) expanded versions, including executive function deficits (EFDs) and emotional dyscontrol (EC) items, and to present ASRS and AISRS pilot normative data. Two patient samples (referred and primary care physician [PCP] controls) were pooled together for these analyses. Final analysis included 297 respondents, 171 with adult ADHD. Cronbach's alphas were high for all sections of the scales. Examining histograms of ASRS 31-item and AISRS 18-item total scores for ADHD controls, 95% cutoff scores were 70 and 23, respectively; histograms for pilot normative sample suggest cutoffs of 82 and 26, respectively. (a) ASRS- and AISRS-expanded versions have high validity in assessment of core 18 adult ADHD Diagnostic and Statistical Manual of Mental Disorders ( DSM) symptoms and EFD and EC symptoms. (b) ASRS (31-item) scores 70 to 82 and AISRS (18-item) scores from 23 to 26 suggest a high likelihood of adult ADHD.
Li, James T-C; Stoll, Doris A; Smith, June E; Lin, John J; Swing, Susan R
2003-09-01
The Accreditation Council for Graduate Medical Education (ACGME) and the American Board of Medical Specialties (ABMS) have identified six areas of general competency. This study surveyed graduates of allergy and immunology training programs about their perceived clinical competency and the adequacy of their subspecialty training. In August 2000 and May 2001, a questionnaire was mailed to 373 physicians who had completed a fellowship in allergy and immunology in the United States between 1995 and 2001. Physicians were asked to rate the perceived importance and adequacy of their training in, and their level of competency for, 57 general competencies and subspecialty-specific competencies and procedures. A total of 253 physicians responded (68%). All items in the six ACGME/ABMS general competencies had high ratings (>/= 90%) for perceived importance. One item in the practice-based learning area had low ratings for adequacy of training (57%) and intermediate for competency (75%). Two items in the system-based practice area had low ratings for training (65% and 67%) and intermediate for competency (86% and 88%). Generally, core specialty-specific items (allergic rhinitis, asthma, and urticaria) had high ratings (>/= 90%) for importance, training, and competency. Without exception, items with ratings of less than 70% for adequacy of training also had ratings of less than 90% for competency. The general competencies were considered important, but training in system-based practice and practice-based learning may be deficient. Although self-perceived competency in core areas of allergy and immunology was high, weaknesses in training and self-perceived competency in selected areas were identified.
Miller's Pyramid and Core Competency Assessment: A Study in Relationship Construct Validity.
Williams, Betsy White; Byrne, Phil D; Welindt, Dillon; Williams, Michael V
2016-01-01
Continuous professional development relies on the link between performance and an educational process aimed at improving knowledge and skill. One of the most broadly used frameworks for assessing skills is Miller's Pyramid. This Pyramid has a series of levels of achievement beginning with knowledge (at the base) and ending with routine application in the clinical setting. The purpose of this study was to determine the degree of convergence of two measurement methods, one based on Miller's framework, the second using the Accreditation Council for Graduate Medical Education/American Board of Medical Specialties (ACGME/ABMS) Core Competency framework. The data were gathered from the faculty of a large, Midwestern regional health care provider and hospital system. Data from 264 respondents were studied. The 360° data were from raters of physicians holding supervisory roles in the organization. The scale items were taken from an instrument that has been validated for both structure and known group prediction. The Miller scale was purposely built for this application. The questions were designed to describe each level of the model. The Miller scale was reduced to a single dimension. This result was then regressed on the items from the 360° item ratings. Results of a multivariate analysis of variance isolated a significant relationship between the Miller's Pyramid score and the competency items (P < 0.001). These findings demonstrate a relationship between measures based on Miller's framework and behavioral measures based on the ABMS/ACGME core competencies. Equally important is the finding that while they are related they are not identical. These findings have implications for continuous professional development programing design.
Development of a core outcome set for research and audit studies in reconstructive breast surgery.
Potter, S; Holcombe, C; Ward, J A; Blazeby, J M
2015-10-01
Appropriate outcome selection is essential if research is to guide decision-making and inform policy. Systematic reviews of the clinical, cosmetic and patient-reported outcomes of reconstructive breast surgery, however, have demonstrated marked heterogeneity, and results from individual studies cannot be compared or combined. Use of a core outcome set may improve the situation. The BRAVO study developed a core outcome set for reconstructive breast surgery. A long list of outcomes identified from systematic reviews and stakeholder interviews was used to inform a questionnaire survey. Key stakeholders defined as individuals involved in decision-making for reconstructive breast surgery, including patients, breast and plastic surgeons, specialist nurses and psychologists, were sampled purposively and sent the questionnaire (round 1). This asked them to rate the importance of each outcome on a 9-point Likert scale from 1 (not important) to 9 (extremely important). The proportion of respondents rating each item as very important (score 7-9) was calculated. This was fed back to participants in a second questionnaire (round 2). Respondents were asked to reprioritize outcomes based on the feedback received. Items considered very important after round 2 were discussed at consensus meetings, where the core outcome set was agreed. A total of 148 items were combined into 34 domains within six categories. Some 303 participants (51·4 per cent) (215 (49·5 per cent) of 434 patients; 88 (56·4 per cent) of 156 professionals) completed and returned the round 1 questionnaire, and 259 (85·5 per cent) reprioritized outcomes in round 2. Fifteen items were excluded based on questionnaire scores and 19 were carried forward to the consensus meetings, where a core outcome set containing 11 key outcomes was agreed. The BRAVO study has used robust consensus methodology to develop a core outcome set for reconstructive breast surgery. Widespread adoption by the reconstructive community will improve the quality of outcome assessment in effectiveness studies. Future work will evaluate how these key outcomes should best be measured. © 2015 The Authors. BJS published by John Wiley & Sons Ltd on behalf of BJS Society Ltd.
Kook, Seung Hee; Varni, James W
2008-06-02
The Pediatric Quality of Life Inventory (PedsQL) is a child self-report and parent proxy-report instrument designed to assess health-related quality of life (HRQOL) in healthy and ill children and adolescents. It has been translated into over 70 international languages and proposed as a valid and reliable pediatric HRQOL measure. This study aimed to assess the psychometric properties of the Korean translation of the PedsQL 4.0 Generic Core Scales. Following the guidelines for linguistic validation, the original US English scales were translated into Korean and cognitive interviews were administered. The field testing responses of 1425 school children and adolescents and 1431 parents to the Korean version of PedsQL 4.0 Generic Core Scales were analyzed utilizing confirmatory factor analysis and the Rasch model. Consistent with studies using the US English instrument and other translation studies, score distributions were skewed toward higher HRQOL in a predominantly healthy population. Confirmatory factor analysis supported a four-factor and a second order-factor model. The analysis using the Rasch model showed that person reliabilities are low, item reliabilities are high, and the majority of items fit the model's expectation. The Rasch rating scale diagnostics showed that PedsQL 4.0 Generic Core Scales in general have the optimal number of response categories, but category 4 (almost always a problem) is somewhat problematic for the healthy school sample. The agreements between child self-report and parent proxy-report were moderate. The results demonstrate the feasibility, validity, item reliability, item fit, and agreement between child self-report and parent proxy-report of the Korean version of PedsQL 4.0 Generic Core Scales for school population health research in Korea. However, the utilization of the Korean version of the PedsQL 4.0 Generic Core Scales for healthy school populations needs to consider low person reliability, ceiling effects and cultural differences, and further validation studies on Korean clinical samples are required.
NASA Astrophysics Data System (ADS)
Laursen, S. L.; Weston, T. J.; Thiry, H.
2012-12-01
URSSA is the Undergraduate Research Student Self-Assessment, an online survey instrument for programs and departments to use in assessing the student outcomes of undergraduate research (UR). URSSA focuses on what students learn from their UR experience, rather than whether they liked it. The online questionnaire includes both multiple-choice and open-ended items that focus on students' gains from undergraduate research. These gains include skills, knowledge, deeper understanding of the intellectual and practical work of science, growth in confidence, changes in identity, and career preparation. Other items probe students' participation in important research-related activities that lead to these gains (e.g. giving presentations, having responsibility for a project). These activities, and the gains themselves, are based in research and thus constitute a core set of items. Using these items as a group helps to align a particular program assessment with research-demonstrated outcomes. Optional items may be used to probe particular features that are augment the research experience (e.g. field trips, career seminars, housing arrangements). The URSSA items are based on extensive, interview-based research and evaluation work on undergraduate research by our group and others. This grounding in research means that URSSA measures what we know to be important about the UR experience The items were tested with students, revised and re-tested. Data from a large pilot sample of over 500 students enabled statistical testing of the items' validity and reliability. Optional items about UR program elements were developed in consultation with UR program developers and leaders. The resulting instrument is flexible. Users begin with a set of core items, then customize their survey with optional items to probe students' experiences of specific program elements. The online instrument is free and easy to use, with numeric results available as raw data, summary statistics, cross-tabs, and graphs, and as raw, downloadable data. Finally, URSSA has high content validity based on its research grounding and rigorous development. We will present examples of how URSSA has been used in evaluations of UR programs. A multi-year evaluation of a university-based UR program shows that URSSA items are sensitive to differences in students' prior level of experience with research. For example, experienced student researchers reported greater gains than did their peers new to UR in understanding the process of research and in coming to see themselves as scientists. These differences are consistent with interview data that suggest a developmental progression of gains as students pursue research and gain confidence in their ability to contribute meaningfully. A second example comes from a multi-site evaluation of sites funded by the National Science Foundation's Research Experience for Undergraduates (REU) program in Biology. This study acquired data from nearly 800 students at some 60 Bio REU sites in 2010 and 2011. Results reveal differences in gains among demographic groups, and the general strength of these well-planned programs relative to a comparison sample of UR programs that are not part of REU. Our presentation will demonstrate the evaluative use of URSSA and its potential applications to undergraduate research in the geosciences.
Space Station Furnace Facility. Volume 2: Appendix 1: Contract End Item specification (CEI), part 1
NASA Technical Reports Server (NTRS)
Seabrook, Craig
1992-01-01
This specification establishes the performance, design, development, and verification requirements for the Space Station Furnace Facility (SSFF) Core. The definition of the SSFF Core and its interfaces, specifies requirements for the SSFF Core performance, specifies requirements for the SSFF Core design, and construction are presented, and the verification requirements are established.
The role of object categories in hybrid visual and memory search
Cunningham, Corbin A.; Wolfe, Jeremy M.
2014-01-01
In hybrid search, observers (Os) search for any of several possible targets in a visual display containing distracting items and, perhaps, a target. Wolfe (2012) found that responses times (RT) in such tasks increased linearly with increases in the number of items in the display. However, RT increased linearly with the log of the number of items in the memory set. In earlier work, all items in the memory set were unique instances (e.g. this apple in this pose). Typical real world tasks involve more broadly defined sets of stimuli (e.g. any “apple” or, perhaps, “fruit”). The present experiments show how sets or categories of targets are handled in joint visual and memory search. In Experiment 1, searching for a digit among letters was not like searching for targets from a 10-item memory set, though searching for targets from an N-item memory set of arbitrary alphanumeric characters was like searching for targets from an N-item memory set of arbitrary objects. In Experiment 2, Os searched for any instance of N sets or categories held in memory. This hybrid search was harder than search for specific objects. However, memory search remained logarithmic. Experiment 3 illustrates the interaction of visual guidance and memory search when a subset of visual stimuli are drawn from a target category. Furthermore, we outline a conceptual model, supported by our results, defining the core components that would be necessary to support such categorical hybrid searches. PMID:24661054
Study on a novel core module based on optical fiber bundles for urine dry-chemistry analysis
NASA Astrophysics Data System (ADS)
Liu, Gaiqin; Ma, Zengwei; Li, Rui; Hu, Nan; Chen, Ping; Wang, Fei; Zhang, Ruiying; Chen, Longcong
2017-09-01
A core module with a novel optical structure is presented to analyze urine by the dry-chemistry method in this paper. It consists of a 32-bit microprocessor, optical fiber bundles, a high precision color sensor and a temperature sensor. The optical fiber bundles are adopted to control the spread path of light and reduce the influence of ambient light and the distance between the strip and sensor effectively. And the temperature sensor is applied to detect the environmental temperature to calibrate the measurement results. Therefore, all these can bring a lot of benefits to the core module, such as improving its test accuracy, reducing its volume and cost, and simplifying its assembly. Additionally, some parameters, including the calculation coefficient about reflectivity of each item, semi-quantitative intervals, the number of test items, may be modified by corresponding instructions in order to enhance its applicability. Meanwhile, its outputs can be chosen among the original data, normalized color values, reflectivity, and the semi-quantitative level of each test item by available instructions. Our results show that the module has high measurement accuracy of more than 95%, good stability, reliability, and consistency and can be easily used in various types of urine analyzers.
The PedsQL Multidimensional Fatigue Scale in pediatric rheumatology: reliability and validity.
Varni, James W; Burwinkle, Tasha M; Szer, Ilona S
2004-12-01
. The PedsQL (Pediatric Quality of Life Inventory) is a modular instrument designed to measure health related quality of life (HRQOL) in children and adolescents ages 2-18 years. The recently developed 18-item PedsQL Multidimensional Fatigue Scale was designed to measure fatigue in pediatric patients and comprises the General Fatigue Scale (6 items), Sleep/Rest Fatigue Scale (6 items), and Cognitive Fatigue Scale (6 items). The PedsQL 4.0 Generic Core Scales were developed as the generic core measure to be integrated with the PedsQL Disease-Specific Modules. The PedsQL 3.0 Rheumatology Module was designed to measure pediatric rheumatology-specific HRQOL. Methods. The PedsQL Multidimensional Fatigue Scale, Generic Core Scales, and Rheumatology Module were administered to 163 children and 154 parents (183 families accrued overall) recruited from a pediatric rheumatology clinic. Results. Internal consistency reliability for the PedsQL Multidimensional Fatigue Scale Total Score (a = 0.95 child, 0.95 parent report), General Fatigue Scale (a = 0.93 child, 0.92 parent), Sleep/Rest Fatigue Scale (a = 0.88 child, 0.90 parent), and Cognitive Fatigue Scale (a = 0.93 child, 0.96 parent) were excellent for group and individual comparisons. The validity of the PedsQL Multidimensional Fatigue Scale was confirmed through hypothesized intercorrelations with dimensions of generic and rheumatology-specific HRQOL. The PedsQL Multidimensional Fatigue Scale distinguished between healthy children and children with rheumatic diseases as a group, and was associated with greater disease severity. Children with fibromyalgia manifested greater fatigue than children with other rheumatic diseases. The results confirm the initial reliability and validity of the PedsQL Multidimensional Fatigue Scale in pediatric rheumatology.
Webster, Joseph B
2009-03-01
To determine the performance and change over time when incorporating questions in the core competency domains of practice-based learning and improvement (PBLI), systems-based practice (SBP), and professionalism (PROF) into the national PM&R Self-Assessment Examination for Residents (SAER). Prospective, longitudinal analysis. The national Self-Assessment Examination for Residents (SAER) in Physical Medicine and Rehabilitation, which is administered annually. Approximately 1100 PM&R residents who take the examination annually. Inclusion of progressively more challenging questions in the core competency domains of PBLI, SBP, and PROF. Individual test item level of difficulty (P value) and discrimination (point biserial index). Compared with the overall test, questions in the subtopic areas of PBLI, SBP, and PROF were relatively easier and less discriminating (correlation of resident performance on these domains compared with that on the total test). These differences became smaller during the 3-year time period. The difficulty level of the questions in each of the subtopic domains was raised during the 3 year period to a level close to the overall exam. Discrimination of the test items improved or remained stable. This study demonstrates that, with careful item writing and review, multiple-choice items in the PBLI, SBP, and PROF domains can be successfully incorporated into an annual, national self-assessment examination for residents. The addition of these questions had value in assessing competency while not compromising the overall validity and reliability of the exam. It is yet to be determined if resident performance on these questions corresponds to performance on other measures of competency in the areas of PBLI, SBP, and PROF.
Timing of Survey Administration After Hospice Patient Death: Stability of Bereaved Respondents
DiBiasio, Eleanor L.; Clark, Melissa A.; Gozalo, Pedro L.; Spence, Carol; Casarett, David J.; Teno, Joan M.
2017-01-01
Context The Centers for Medicare & Medicaid Services have elected to include a bereaved family member survey in public reporting of hospice quality data as mandated in the Affordable Care Act. However, it is not known what timepoint after death offers the most reliable responses. Objectives To examine the stability of bereaved family members’ survey responses when administered three, six and nine months after hospice patient death. Methods Bereaved family members from six geographically diverse hospices were interviewed three, six, and nine months after patient death. All respondents completed a core survey. Those whose family member died at home, in a free-standing inpatient unit, or in a nursing home also completed a site-specific module. Stability was based on top-box scoring of each item with kappa statistics, and multivariable regression models were used to assess directionality and predictors of change. To analyze the effects of grief, we assessed response stability among respondents at least one standard deviation from the mean change in grief between three and six months. Results We had 1532 surveys (536 three-month surveys, 529 six-month surveys, and 467 nine-month surveys) returned by 643 respondents (average age 61.7 years, 17.4% Black, 50.5% a child respondent) about hospice decedents (55.3% female, average age 78.6 years, 57.0% non-cancer, 40.0% at home.) The average kappa for core items between three and nine months was 0.54 (range: 0.42-0.74), 0.58 (0.41-0.69) for home-specific items, and 0.54 (0.39-0.63) for nursing home. Even among individuals demonstrating large grief changes, core items demonstrated moderate to high stability over time. Conclusion Bereaved family member responses are stable between three and nine months after the death of the patient. PMID:25647420
Gottvall, Maria; Vaez, Marjan
2017-01-01
A high proportion of refugees have been subjected to potentially traumatic experiences (PTEs), including torture. PTEs, and torture in particular, are powerful predictors of mental ill health. This paper reports the development and preliminary validation of a brief refugee trauma checklist applicable for survey studies. Methods: A pool of 232 items was generated based on pre-existing instruments. Conceptualization, item selection and item refinement was conducted based on existing literature and in collaboration with experts. Ten cognitive interviews using a Think Aloud Protocol (TAP) were performed in a clinical setting, and field testing of the proposed checklist was performed in a total sample of n = 137 asylum seekers from Syria. Results: The proposed refugee trauma history checklist (RTHC) consists of 2 × 8 items, concerning PTEs that occurred before and during the respondents’ flight, respectively. Results show low item non-response and adequate psychometric properties Conclusions: RTHC is a usable tool for providing self-report data on refugee trauma history surveys of community samples. The core set of included events can be augmented and slight modifications can be applied to RTHC for use also in other refugee populations and settings. PMID:28976937
Anorexia/cachexia-related quality of life for children with cancer.
Lai, Jin-Shei; Cella, David; Peterman, Amy; Barocas, Joshua; Goldman, Stewart
2005-10-01
Anorexia is a common symptom in patients with cancer, which can lead to poor tolerance of treatment and can contribute to cachexia in extreme cases. Children with advanced-stage cancer are especially vulnerable to malnutrition resulting from anorexia and cachexia. Currently, there are no instruments that measure common concerns specifically associated with anorexia and cachexia in children with cancer. The purpose of the current article was to test the psychometric properties of a newly developed pediatric Functional Assessment of Anorexia and Cachexia Therapy (peds-FAACT) for children with cancer. Ninety-six patients (ages 7-17 yrs) receiving cancer treatment and their parents were asked to complete the 12-item peds-FAACT. The authors implemented both classical test theory and item response theory to evaluate the agreement between parents and patients, internal consistency and unidimensionality of the scale, and stability of items across subgroups. As a result, a patient-reported six-item scale was recommended as the core measure for all pediatric patients with cancer and four additional peripheral items were recommended for adolescent patients. The peds-FAACT demonstrated good psychometric properties, differentiated patients with different functional performance status, and was determined to be a useful tool for future clinical trials.
Grigg, Kaine; Manderson, Lenore
2016-03-17
Racism and associated discrimination are pervasive and persistent challenges with multiple cumulative deleterious effects contributing to inequities in various health outcomes. Globally, research over the past decade has shown consistent associations between racism and negative health concerns. Such research confirms that race endures as one of the strongest predictors of poor health. Due to the lack of validated Australian measures of racist attitudes, RACES (Racism, Acceptance, and Cultural-Ethnocentrism Scale) was developed. Here, we examine RACES' psychometric properties, including the latent structure, utilising Item Response Theory (IRT). Unidimensional and Multidimensional Rating Scale Model (RSM) Rasch analyses were utilised with 296 Victorian primary school students and 182 adolescents and 220 adults from the Australian community. RACES was demonstrated to be a robust 24-item three-dimensional scale of Accepting Attitudes (12 items), Racist Attitudes (8 items), and Ethnocentric Attitudes (4 items). RSM Rasch analyses provide strong support for the instrument as a robust measure of racist attitudes in the Australian context, and for the overall factorial and construct validity of RACES across primary school children, adolescents, and adults. RACES provides a reliable and valid measure that can be utilised across the lifespan to evaluate attitudes towards all racial, ethnic, cultural, and religious groups. A core function of RACES is to assess the effectiveness of interventions to reduce community levels of racism and in turn inequities in health outcomes within Australia.
Core information set for oesophageal cancer surgery.
Blazeby, J M; Macefield, R; Blencowe, N S; Jacobs, M; McNair, A G K; Sprangers, M; Brookes, S T
2015-07-01
Surgeons provide patients with information before surgery, although standards of information are lacking and practice varies. The development and use of a 'core information set' as baseline information before surgery may improve understanding. A core set is a minimum set of information to use in all consultations before a specific procedure. This study developed a core information set for oesophageal cancer surgery. Information was identified from the literature, observations of clinical consultations and patient interviews. This was integrated to create a questionnaire survey. Stakeholders (patients and professionals) were surveyed twice to assess views on importance of information from 'not essential' to 'absolutely essential' using Delphi methods. Items not meeting predefined criteria were discarded after each survey and the final retained items were voted on, in separate patient and professional stakeholder meetings, to agree the core set. Some 67 information items were identified initially from multiple sources. Survey response rates were 76·5 per cent (185 of 242) and 54·8 per cent (126 of 230) for patients and professionals respectively (first round), and over 83 per cent in both groups thereafter. Health professionals rated short-term clinical outcomes most highly (technical complications), whereas patients prioritized information related to long-term benefits. The consensus meetings agreed the final set, which consisted of: in-hospital milestones to recovery, rates of open-and-close surgery, in-hospital mortality, major complications (reoperation), milestones in recovery after discharge, longer-term eating and drinking and overall quality of life, and chances of survival. This study has established a core information set for surgery for oesophageal cancer. © 2015 BJS Society Ltd Published by John Wiley & Sons Ltd.
1993-12-02
community) or not knowing MPT analysis evaluates human-in- loop costs and how to exploit data that were available to represent capabilities w;th intent...specification b. There is no closed loop ; the process may statements. An agency is preparing a System Specification with minimal security information...Item is one which musk alweys be provided by the CMI system to be AICC compliant. Core Items are those which a lesson may always depend upon being
Thompson, Joyce B; Fullerton, Judith T; Sawyer, Angela J
2011-08-01
a 2-year study was conducted to develop Global Standards for Midwifery Education in keeping with core documents of the International Confederation of Midwives. Elements of the standards were based on evidence available in the published and unpublished literature. Companion Guidelines to assist in implementing the standards were also developed. a modified Delphi survey process was conducted in two rounds following item validation by a panel of midwifery education experts. a global survey conducted in 88 countries. midwifery educators and clinicians associated with midwifery education located in any of the ICM member association countries. Additional participants included an Expert Midwifery Resource Group, other Key Stakeholders, midwifery regulators and policy makers. A total of 241 individuals from 46 ICM member association countries and ten non-member countries responded to one or both of the survey rounds. survey respondents expressed an opinion on whether to retain or to delete any of the proposed components of the standards. Version one had 109 proposed components and version two had 111 items for consideration. a majority consensus of .80 was required to accept an item without further deliberation. The Education Standards Task Force (expert panel) made final decisions in the four instances where this level of consensus was not reached, retaining all four items. The panel also amended the wording of selected items or added new items based on feedback received from survey respondents. The final document contains 10 Preface items, 35 glossary terms, and 37 discrete standards with 27 sub-sections. Copyright © 2011 Elsevier Ltd. All rights reserved.
Mowla, Arash; Kalantarhormozi, Mohammad Reza; Khazraee, Samaneh
2011-01-01
Differentiating major depressive disorder (MDD) without hypothyroidism from MDD associated with hypothyroidism can be challenging. Therefore some authors have suggested that thyroid function should be tested in all depressed patients. This study compared the clinical characteristics of patients with MDD associated with hypothyroidism with those of patients with MDD without hypothyroidism. Thyroid function tests were administered to 75 patients (60 female and 15 male) who met DSM-IV criteria for MDD. The 15 patients with hypothyroidism (8 with subclinical hypothyroidism and 7 with overt hypothyroidism) were compared with the other 60 patients with regard to depressive characteristics. The primary measure of depressive signs and symptoms used to assess depression severity and symptoms was the Hamilton Rating Scale for Depression, first 17 items (Ham-D-17). Baseline demographic data, including age and sex, were also compared. The two groups did not differ significantly in severity of overall depression at baseline, as measured by total score on the Ham-D-17 (P=0.471, Z=0.970). Patients with MDD without hypothyroidism had worse scores on item 1 (depressed mood), item 2 (feelings of guilt), item 3 (suicidality), item 6 (late insomnia), and item 16 (loss of weight). In contrast, depressed patients with hypothyroidism had more severe anxiety symptoms and greater agitation (items 9, 10, and 11). Our results may help clinicians differentiate MDD associated with hypothyroidism from MDD without hypothyroidism. Depressed patients with hypothyroidism had more anxiety symptoms and greater agitation, but they had fewer severe core depressive symptoms and biological signs of MDD. (Journal of Psychiatric Practice. 2011;17:67-71).
NASA Astrophysics Data System (ADS)
Lahaie, Sébastien; Parkes, David C.
We consider the problem of fair allocation in the package assignment model, where a set of indivisible items, held by single seller, must be efficiently allocated to agents with quasi-linear utilities. A fair assignment is one that is efficient and envy-free. We consider a model where bidders have superadditive valuations, meaning that items are pure complements. Our central result is that core outcomes are fair and even coalition-fair over this domain, while fair distributions may not even exist for general valuations. Of relevance to auction design, we also establish that the core is equivalent to the set of anonymous-price competitive equilibria, and that superadditive valuations are a maximal domain that guarantees the existence of anonymous-price competitive equilibrium. Our results are analogs of core equivalence results for linear prices in the standard assignment model, and for nonlinear, non-anonymous prices in the package assignment model with general valuations.
Wong, Alex W K; Lau, Stephen C L; Fong, Mandy W M; Cella, David; Lai, Jin-Shei; Heinemann, Allen W
2018-04-03
To determine the extent to which the content of the Quality of Life in Neurological Disorders (Neuro-QoL) covers the International Classification of Functioning, Disability and Health (ICF) Core Sets for multiple sclerosis (MS), stroke, spinal cord injury (SCI), and traumatic brain injury (TBI) using summary linkage indicators. Content analysis by linking content of the Neuro-QoL to corresponding ICF codes of each Core Set for MS, stroke, SCI, and TBI. Three academic centers. None. None. Four summary linkage indicators proposed by MacDermid et al were estimated to compare the content coverage between Neuro-QoL and the ICF codes of Core Sets for MS, stroke, MS, and TBI. Neuro-QoL represented 20% to 30% Core Set codes for different conditions in which more codes in Core Sets for MS (29%), stroke (28%), and TBI (28%) were covered than those for SCI in the long-term (20%) and early postacute (19%) contexts. Neuro-QoL represented nearly half of the unique Activity and Participation codes (43%-49%) and less than one third of the unique Body Function codes (12%-32%). It represented fewer Environmental Factors codes (2%-6%) and no Body Structures codes. Absolute linkage indicators found that at least 60% of Neuro-QoL items were linked to Core Set codes (63%-95%), but many items covered the same codes as revealed by unique linkage indicators (7%-13%), suggesting high concept redundancy among items. The Neuro-QoL links more closely to ICF Core Sets for stroke, MS, and TBI than to those for SCI, and primarily covers activity and participation ICF domains. Other instruments are needed to address concepts not measured by the Neuro-QoL when a comprehensive health assessment is needed. Copyright © 2018 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Baseline Design Compliance Matrix for the Rotary Mode Core Sampling System
DOE Office of Scientific and Technical Information (OSTI.GOV)
LECHELT, J.A.
2000-10-17
The purpose of the design compliance matrix (DCM) is to provide a single-source document of all design requirements associated with the fifteen subsystems that make up the rotary mode core sampling (RMCS) system. It is intended to be the baseline requirement document for the RMCS system and to be used in governing all future design and design verification activities associated with it. This document is the DCM for the RMCS system used on Hanford single-shell radioactive waste storage tanks. This includes the Exhauster System, Rotary Mode Core Sample Trucks, Universal Sampling System, Diesel Generator System, Distribution Trailer, X-Ray Cart System,more » Breathing Air Compressor, Nitrogen Supply Trailer, Casks and Cask Truck, Service Trailer, Core Sampling Riser Equipment, Core Sampling Support Trucks, Foot Clamp, Ramps and Platforms and Purged Camera System. Excluded items are tools such as light plants and light stands. Other items such as the breather inlet filter are covered by a different design baseline. In this case, the inlet breather filter is covered by the Tank Farms Design Compliance Matrix.« less
Nymark, C; Saboonchi, F; Mattiasson, A-C; Henriksson, P; Kiessling, A
2017-03-01
Reducing patient delay for patients afflicted by an acute myocardial infarction is a task of great complexity, which might be alleviated if more factors that influence this delay could be identified. Although a number of self-reported instruments associated with patient delay exist, none of these taps the content of the appraisal process related to patients' subjective emotions. The aim of this study was to develop and validate a questionnaire aimed at assessing patients' appraisal, emotions and action tendencies when afflicted by an acute myocardial infarction. An item pool was generated based on themes conceptualized in a recent qualitative study of acute myocardial infarction patients' thoughts, feelings and actions preceding the decision to seek medical care. The 'Think-Aloud Protocol' and test-retest analysis at item level were performed. The modified item pool was administered to 96 patients when treated for acute myocardial infarction. Explorative factor analysis and principal component analysis with the non-linear iterative partial least squares algorithm were performed to examine the underlying factor structure of the items. The findings indicated three core dimensions corresponding to three subscales, namely, 'symptom appraisal'; 'perceived inability to act'; 'autonomy preservation'. The results demonstrated acceptable measures of reliability and validity Conclusions: The PA-AMI questionnaire demonstrated satisfactory psychometric properties. Assessment of the included core dimensions may contribute to greater understanding of the appraisal processes for patients afflicted by an acute myocardial infarction.
Development and Validation of the Questionnaire of Vaping Craving.
Dowd, Ashley N; Motschman, Courtney A; Tiffany, Stephen T
2018-03-12
Craving may represent core motivational processes in tobacco dependence, but there is no psychometrically evaluated measure of craving for e-cigarettes (vaping craving). This research developed and validated a brief measure of vaping craving. The measure was evaluated in two studies. In Study 1, a 42-item questionnaire assessing a wide range of vaping craving content was administered to 209 current e-cigarette users. In Study 2, a 10-item questionnaire derived from Study 1 results was administered to 224 current e-cigarette users. Participants were recruited from Amazon's Mechanical Turk, an online labor market. Principal factor analysis identified the strongest loading items (.815 - .867) on the first extracted factor (77% of the factor variance) for inclusion in a 10-item Questionnaire of Vaping Craving (QVC). This item set, with an internal consistency (α) of .97, focused on desire and intent to vape, and anticipation of positive outcomes related to e-cigarette use. Confirmatory factor analysis revealed the items had strong factor loadings that were significantly predicted by the latent vaping craving construct (ps < .001). Higher vaping craving was significantly associated with the level of e-cigarette use, greater negative mood, and lower confidence in ability to quit vaping (ps < .01). Among participants who also smoked tobacco (87%), vaping craving was more strongly associated with e-cigarette dependence than tobacco dependence. The findings support the reliability and validity of the QVC and suggest it could be used in laboratory and clinical settings as a psychometrically sound measure of vaping craving.
Measuring the symptom burden associated with the treatment of chronic myeloid leukemia
Gonzalez, Araceli G. Garcia; Ault, Patricia; Mendoza, Tito R.; Sailors, Mary L.; Williams, Janet L.; Huang, Furong; Nazha, Aziz; Kantarjian, Hagop M.; Cleeland, Charles S.; Cortes, Jorge E.
2013-01-01
We developed a module of the MD Anderson Symptom Inventory (MDASI) for patients with chronic myeloid leukemia (CML). To develop the MDASI-CML, we identified CML-specific symptoms from qualitative interviews with 35 patients. A list of candidate symptoms was reduced by a panel of patients, caregivers, and clinicians to the 13 core MDASI symptom items and 6 CML-specific items; these items were subsequently administered to 30 patients. Cognitive debriefing confirmed that the items were clear, relevant, and easy to use. One additional CML-specific symptom item was added, for a total of 7. The refined MDASI-CML was administered to 152 patients once every 2 weeks for 1 year. The content, concurrent, known-group, and construct validity of the MDASI-CML were evaluated. The internal consistency and test-retest reliabilities of the module were adequate. Longitudinal analysis showed relatively stable symptom severity scores over time. The most severe symptoms were fatigue, drowsiness, disturbed sleep, muscle soreness and cramping, and trouble remembering things. Approximately one-third of the patients who completed the MDASI-CML reported persistent moderate-to-severe symptoms. The MDASI-CML is a valid and reliable symptom assessment instrument that can be used in clinical studies of symptom status in patients with CML. This trial was registered at www.clinicaltrials.gov as #NCT01046305. PMID:23777764
Daker-White, Gavin; Crowley, Tessa
2003-05-01
A cross-sectional questionnaire survey of 216 men and 191 women attending a genitourinary medicine (GUM) clinic was undertaken to explore the relationship between sexual symptoms and quality of sexual life, and to test the psychometric validity of a pilot self-report measure of Sexual Function and Quality of Sexual Life (SFQoSL). Statistical comparisons were made with three reference groups: volunteers attending GUM for psychosexual counselling, outpatients at an Obstetrics and Gynaecology Department, and staff. Exploratory principal components analysis (with varimax rotation) of questionnaire item responses suggested an 11 (in women) and 13 (in men) factor solution, incorporating four multi-item scales. Internal consistency (Cronbach's alpha) of core items was 0.84 in 186 women (19 items) and 0.87 in 210 men (22 items). Construct validity was supported in comparisons with reference groups using one-way analysis of variance and post-hoc Scheffé testing. Overall, 116 (54%) male and 132 (69%) female GUM outpatients had scores indicating sexual dysfunction. Thirty-seven (17%) men reported erectile dysfunction; 54 (28%) women reported vaginal dryness affecting sex; 48 (25%) women reported genital changes affecting sex; 45 (21%) men and 64 (34%) women reported problems reaching orgasm.
Development of six PROMIS pediatrics proxy-report item banks
2012-01-01
Background Pediatric self-report should be considered the standard for measuring patient reported outcomes (PRO) among children. However, circumstances exist when the child is too young, cognitively impaired, or too ill to complete a PRO instrument and a proxy-report is needed. This paper describes the development process including the proxy cognitive interviews and large-field-test survey methods and sample characteristics employed to produce item parameters for the Patient Reported Outcomes Measurement Information System (PROMIS) pediatric proxy-report item banks. Methods The PROMIS pediatric self-report items were converted into proxy-report items before undergoing cognitive interviews. These items covered six domains (physical function, emotional distress, social peer relationships, fatigue, pain interference, and asthma impact). Caregivers (n = 25) of children ages of 5 and 17 years provided qualitative feedback on proxy-report items to assess any major issues with these items. From May 2008 to March 2009, the large-scale survey enrolled children ages 8-17 years to complete the self-report version and caregivers to complete the proxy-report version of the survey (n = 1548 dyads). Caregivers of children ages 5 to 7 years completed the proxy report survey (n = 432). In addition, caregivers completed other proxy instruments, PedsQL™ 4.0 Generic Core Scales Parent Proxy-Report version, PedsQL™ Asthma Module Parent Proxy-Report version, and KIDSCREEN Parent-Proxy-52. Results Item content was well understood by proxies and did not require item revisions but some proxies clearly noted that determining an answer on behalf of their child was difficult for some items. Dyads and caregivers of children ages 5-17 years old were enrolled in the large-scale testing. The majority were female (85%), married (70%), Caucasian (64%) and had at least a high school education (94%). Approximately 50% had children with a chronic health condition, primarily asthma, which was diagnosed or treated within 6 months prior to the interview. The PROMIS proxy sample scored similar or better on the other proxy instruments compared to normative samples. Conclusions The initial calibration data was provided by a diverse set of caregivers of children with a variety of common chronic illnesses and racial/ethnic backgrounds. The PROMIS pediatric proxy-report item banks include physical function (mobility n = 23; upper extremity n = 29), emotional distress (anxiety n = 15; depressive symptoms n = 14; anger n = 5), social peer relationships (n = 15), fatigue (n = 34), pain interference (n = 13), and asthma impact (n = 17). PMID:22357192
From time-based to competency-based standards: core transitional competencies in plastic surgery.
Lutz, Kristina; Yazdani, Arjang; Ross, Douglas
2015-01-01
Competency-based medical education is becoming increasingly prevalent and is likely to be mandated by the Royal College in the near future. The objective of this study was to define the core technical competencies that should be possessed by plastic surgery residents as they transition into their senior (presently postgraduate year 3) years of training. A list of potential core competencies was generated using a modified Delphi method that included the investigators and 6 experienced, academic plastic surgeons from across Canada and the United States. Generated items were divided into 7 domains: basic surgical skills, anesthesia, hand surgery, cutaneous surgery, esthetic surgery, breast surgery, and craniofacial surgery. Members of the Delphi group were asked to rank particular skills on a 4-point scale with anchored descriptors. Item reduction resulted in a survey consisting of 48 skills grouped into the aforementioned domains. This self-administered survey was distributed to all Canadian program directors (n = 11) via e-mail for validation and further item reduction. The response rate was 100% (11/11). Using the average rankings of program directors, 26 "core" skills were identified. There was agreement of core skills across all domains except for breast surgery and esthetic surgery. Of them, 7 skills were determined to be above the level of a trainee at this stage; a further 15 skills were agreed to be important, but not core, competencies. Overall, 26 competencies have been identified as "core" for plastic surgery residents to possess as they begin their senior, on-service years. The nature of these skills makes them suitable for teaching in a formal, simulated environment, which would ensure that all plastic surgery trainees are competent in these tasks as they transition to their senior years of residency. Copyright © 2014 Association of Program Directors in Surgery. Published by Elsevier Inc. All rights reserved.
Quigley, Denise D; Martino, Steven C; Brown, Julie A; Hays, Ron D
2013-01-01
A doctor's ability to communicate effectively is key to establishing and maintaining positive doctor-patient relationships. The Consumer Assessment of Healthcare Providers and System (CAHPS(®)) Clinician and Group Survey is the standard for collecting and reporting information about patients' experiences of care in the USA. To evaluate how well CAHPS(®) Clinician and Group 2.0 core and supplemental survey items (CG-CAHPS) with a 12-month reference capture doctor-patient communication. Eleven of the 40 highest-rated physicians on the CG-CAHPS survey treating patients in a Midwest commercial health plan. Data were obtained via semi-structured interviews. Specific behaviors, practices, and opinions about doctor communication were coded and compared to the CG-CAHPS items. CG-CAHPS fully captures six of the nine behaviors most commonly mentioned by high-performing physicians: employing office staff with good people skills; involving office staff in communication with patients; spending enough time with patients; listening carefully; providing clear, simple explanations; and devising an action plan with each patient. Three physician behaviors identified as key were not captured in CG-CAHPS items: use of nonverbal communication; greeting patients and introducing oneself; and tracking personal information about patients. CG-CAHPS survey items capture many of the most commonly mentioned doctor-patient communication behaviors and practices identified by high-performing physicians. Nonverbal communication, greeting patients, and tracking personal information about patients were identified as key aspects of doctor-patient communication, but are not captured by the current CG-CAHPS. We recommend further research to assess patients' perceptions of specific verbal and nonverbal behaviors (such as leaning forward in a chair, casually asking about other family members), followed by the development of new items (if needed) that aim to capture what these specific behaviors represent to patients (e.g., listens attentively, seems to care about me as a person, empathy). We also recommend including items about greeting and tracking personal information about patients in future CAHPS item sets addressing doctor-patient communication. Enriching the content of the CAHPS communication measure can help health-care organizations improve doctor-patient communication and interactions.
Hagelstein, V; Ortland, I; Wilmer, A; Mitchell, S A; Jaehde, U
2016-12-01
Integrating the patient's perspective has become an increasingly important component of adverse event reporting. The National Cancer Institute has developed a Patient-Reported Outcomes version of the Common Terminology Criteria for Adverse Events (PRO-CTCAE™). This instrument has been translated into German and linguistically validated; however, its quantitative measurement properties have not been evaluated. A German language survey that included 31 PRO-CTCAE items, as well as the EORTC QLQ-C30 and the Oral Mucositis Daily Questionnaire (OMDQ), was distributed at 10 cancer treatment settings in Germany and Austria. Item quality was assessed by analysis of acceptability and comprehensibility. Reliability was evaluated by using Cronbach's' alpha and validity by principal components analysis (PCA), multitrait-multimethod matrix (MTMM) and known groups validity techniques. Of 660 surveys distributed to the study centres, 271 were returned (return rate 41%), and data from 262 were available for analysis. Participants' median age was 59.7 years, and 69.5% of the patients were female. Analysis of item quality supported the comprehensibility of the 31 PRO-CTCAE items. Reliability was very good; Cronbach's' alpha correlation coefficients were >0.9 for almost all item clusters. Construct validity of the PRO-CTCAE core item set was shown by identifying 10 conceptually meaningful item clusters via PCA. Moreover, construct validity was confirmed by the MTMM: monotrait-heteromethod comparison showed 100% high correlation, whereas heterotrait-monomethod comparison indicated 0% high correlation. Known groups validity was supported; PRO-CTCAE scores were significantly lower for those with impaired versus preserved health-related quality of life. A set of 31 items drawn from the German PRO-CTCAE item library demonstrated favourable measurement properties. These findings add to the body of evidence that PRO-CTCAE provides a rigorous method to capture patient self-reports of symptomatic toxicity for use in cancer clinical trials. © The Author 2016. Published by Oxford University Press on behalf of the European Society for Medical Oncology. All rights reserved. For permissions, please email: journals.permissions@oup.com.
Emotion Dysregulation and the Core Features of Autism Spectrum Disorder
ERIC Educational Resources Information Center
Samson, Andrea C.; Phillips, Jennifer M.; Parker, Karen J.; Shah, Shweta; Gross, James J.; Hardan, Antonio Y.
2014-01-01
The aim of this study was to examine the relationship between emotion dysregulation and the core features of Autism Spectrum Disorder (ASD), which include social/communication deficits, restricted/repetitive behaviors, and sensory abnormalities. An 18-item Emotion Dysregulation Index was developed on the basis of expert ratings of the Child…
Development of a State-Wide Competency Test for Marketing Education. Final Report.
ERIC Educational Resources Information Center
Smith, Clifton L.
A project was conducted to develop a valid, competency-referenced test on the core competencies identified for the Missouri Fundamentals of Marketing curriculum. During the project: (1) multiple-choice test items based on the core competencies in the Fundamentals of Marketing curriculum were developed; (2) instructions for onsite administration of…
Effects of Enhanced Anchored Instruction on Skills Aligned to Common Core Math Standards
ERIC Educational Resources Information Center
Bottge, Brian A.; Cho, Sun-Joo
2013-01-01
This study compared how students with learning difficulties in math (MLD) who were randomly assigned to two instructional conditions answered items on problem solving tests aligned to the Common Core State Standards Initiative for Mathematics. Posttest scores showed improvement in the math performance of students receiving Enhanced Anchored…
Core Ideas and Topics: Building Up or Drilling Down?
ERIC Educational Resources Information Center
Cooper, Melanie M.; Posey, Lynmarie A.; Underwood, Sonia M.
2017-01-01
In this paper we discuss how and why core ideas can serve as the framework upon which chemistry curricula and assessment items are developed. While there are a number of projects that have specified "big ideas" or "anchoring concepts", the ways that these ideas are subsequently developed may inadvertently lead to fragmentation…
Timing of Survey Administration After Hospice Patient Death: Stability of Bereaved Respondents.
DiBiasio, Eleanor L; Clark, Melissa A; Gozalo, Pedro L; Spence, Carol; Casarett, David J; Teno, Joan M
2015-07-01
The Centers for Medicare & Medicaid Services have elected to include a bereaved family member survey in public reporting of hospice quality data as mandated in the Affordable Care Act. However, it is not known what time point after death offers the most reliable responses. To examine the stability of bereaved family members' survey responses when administered three, six, and nine months after hospice patient death. Bereaved family members from six geographically diverse hospices were interviewed three, six, and nine months after patient death. All respondents completed a core survey. Those whose family member died at home, in a freestanding inpatient unit, or in a nursing home also completed a site-specific module. Stability was based on top-box scoring of each item with kappa statistics, and multivariable regression models were used to assess directionality and predictors of change. To analyze the effects of grief, we assessed response stability among respondents at least one SD from the mean change in grief between three and six months. We had 1532 surveys (536 three-month surveys, 529 six-month surveys, and 467 nine-month surveys) returned by 643 respondents (average age 61.7 years, 17.4% black, and 50.5% a child respondent) about hospice decedents (55.3% females, average age 78.6 years, 57.0% noncancer, and 40.0% at home). The average kappa for core items between three and nine months was 0.54 (range 0.42-0.74), 0.58 (0.41-0.69) for home-specific items, and 0.54 (0.39-0.63) for nursing home. Even among individuals demonstrating large grief changes, core items demonstrated moderate to high stability over time. Bereaved family member responses are stable between three and nine months after the death of the patient. Copyright © 2015 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.
Jørgensen, L; Garne, J P; Søgaard, M; Laursen, B S
2015-04-01
Women with breast cancer often experience significant distress. Currently, there are no questionnaires aimed at identifying women's unique and possible changing indicators for distress in surgical continuity of care for breast cancer. We developed and tested three questionnaires specifically for this use. We first searched PubMed, CINAHL and PsycINFO to retrieve information on previously described indicators. Next, we conducted a focus group interview with 6 specialised nurses, who have extensive experience about consequences of breast cancer for women in surgical continuity of care. The questionnaire was tested on 18 women scheduled for breast cancer surgery. Subsequently, the women were debriefed to gain knowledge about comprehensibility, readability and relevance of items, and the time needed to complete the questionnaire. After adjustment, the questionnaires were field-tested concomitantly with a clinical study, which both consisted of a survey and an interview study. Three multi-item questionnaires were developed specific to different time points in surgical continuity of care. The questionnaires share a core of statements divided into seven sub-scales: emotional and physical situation, social condition, sexuality, body image, religion and organisational factors. Besides the core of statements, each questionnaire has different statements depending on the time point of surgical continuity of care when it was to be responded to. The questionnaires contain comprehensive items that can identify indicators for distress in individual women taking part in surgical continuity of care. The items were understandable and the time used for filling in the questionnaires was reasonable. Copyright © 2014 Elsevier Ltd. All rights reserved.
Dueck, Amylou C; Mendoza, Tito R; Mitchell, Sandra A; Reeve, Bryce B; Castro, Kathleen M; Rogak, Lauren J; Atkinson, Thomas M; Bennett, Antonia V; Denicoff, Andrea M; O'Mara, Ann M; Li, Yuelin; Clauser, Steven B; Bryant, Donna M; Bearden, James D; Gillis, Theresa A; Harness, Jay K; Siegel, Robert D; Paul, Diane B; Cleeland, Charles S; Schrag, Deborah; Sloan, Jeff A; Abernethy, Amy P; Bruner, Deborah W; Minasian, Lori M; Basch, Ethan
2015-11-01
To integrate the patient perspective into adverse event reporting, the National Cancer Institute developed a patient-reported outcomes version of the Common Terminology Criteria for Adverse Events (PRO-CTCAE). To assess the construct validity, test-retest reliability, and responsiveness of PRO-CTCAE items. A total of 975 adults with cancer undergoing outpatient chemotherapy and/or radiation therapy enrolled in this questionnaire-based study between January 2011 and February 2012. Eligible participants could read English and had no clinically significant cognitive impairment. They completed PRO-CTCAE items on tablet computers in clinic waiting rooms at 9 US cancer centers and community oncology practices at 2 visits 1 to 6 weeks apart. A subset completed PRO-CTCAE items during an additional visit 1 business day after the first visit. Primary comparators were clinician-reported Eastern Cooperative Oncology Group Performance Status (ECOG PS) and the European Organisation for Research and Treatment of Cancer Core Quality of Life Questionnaire (QLQ-C30). A total of 940 of 975 (96.4%) and 852 of 940 (90.6%) participants completed PRO-CTCAE items at visits 1 and 2, respectively. At least 1 symptom was reported by 938 of 940 (99.8%) participants. Participants' median age was 59 years; 57.3% were female, 32.4% had a high school education or less, and 17.1% had an ECOG PS of 2 to 4. All PRO-CTCAE items had at least 1 correlation in the expected direction with a QLQ-C30 scale (111 of 124, P<.05 for all). Stronger correlations were seen between PRO-CTCAE items and conceptually related QLQ-C30 domains. Scores for 94 of 124 PRO-CTCAE items were higher in the ECOG PS 2 to 4 vs 0 to 1 group (58 of 124, P<.05 for all). Overall, 119 of 124 items met at least 1 construct validity criterion. Test-retest reliability was 0.7 or greater for 36 of 49 prespecified items (median [range] intraclass correlation coefficient, 0.76 [0.53-.96]). Correlations between PRO-CTCAE item changes and corresponding QLQ-C30 scale changes were statistically significant for 27 prespecified items (median [range] r=0.43 [0.10-.56]; all P≤.006). Evidence demonstrates favorable validity, reliability, and responsiveness of PRO-CTCAE in a large, heterogeneous US sample of patients undergoing cancer treatment. Studies evaluating other measurement properties of PRO-CTCAE are under way to inform further development of PRO-CTCAE and its inclusion in cancer trials.
Core competency model for the family planning public health nurse.
Hewitt, Caroline M; Roye, Carol; Gebbie, Kristine M
2014-01-01
A core competency model for family planning public health nurses has been developed, using a three stage Delphi Method with an expert panel of 40 family planning senior administrators, community/public health nursing faculty and seasoned family planning public health nurses. The initial survey was developed from the 2011 Title X Family Planning program priorities. The 32-item survey was distributed electronically via SurveyMonkey(®). Panelist attrition was low, and participation robust resulting in the final 28-item model, suggesting that the Delphi Method was a successful technique through which to achieve consensus. Competencies with at least 75% consensus were included in the model and those competencies were primarily related to education/counseling and administration of medications and contraceptives. The competencies identified have implications for education/training, certification and workplace performance. © 2014 Wiley Periodicals, Inc.
Three-dimensional structural representation of the sleep-wake adaptability.
Putilov, Arcady A
2016-01-01
Various characteristics of the sleep-wake cycle can determine the success or failure of individual adjustment to certain temporal conditions of the today's society. However, it remains to be explored how many such characteristics can be self-assessed and how they are inter-related one to another. The aim of the present report was to apply a three-dimensional structural representation of the sleep-wake adaptability in the form of "rugby cake" (scalene or triaxial ellipsoid) to explain the results of analysis of the pattern of correlations of the responses to the initial 320-item list of a new inventory with scores on the six scales designed for multidimensional self-assessment of the sleep-wake adaptability (Morning and Evening Lateness, Anytime and Nighttime Sleepability, and Anytime and Daytime Wakeability). The results obtained for sample consisting of 149 respondents were confirmed by the results of similar analysis of earlier collected responses of 139 respondents to the same list of 320 items and responses of 1213 respondents to the 72 items of one of the earlier established questionnaire tools. Empirical evidence was provided in support of the model-driven prediction of the possibility to identify items linked to as many as 36 narrow (6 core and 30 mixed) adaptabilities of the sleep-wake cycle. The results enabled the selection of 168 items for self-assessment of all these adaptabilities predicted by the rugby cake model.
Development of a Facebook Addiction Scale.
Andreassen, Cecilie Schou; Torsheim, Torbjørn; Brunborg, Geir Scott; Pallesen, Ståle
2012-04-01
The Bergen Facebook Addiction Scale (BFAS), initially a pool of 18 items, three reflecting each of the six core elements of addiction (salience, mood modification, tolerance, withdrawal, conflict, and relapse), was constructed and administered to 423 students together with several other standardized self-report scales (Addictive Tendencies Scale, Online Sociability Scale, Facebook Attitude Scale, NEO-FFI, BIS/BAS scales, and Sleep questions). That item within each of the six addiction elements with the highest corrected item-total correlation was retained in the final scale. The factor structure of the scale was good (RMSEA = .046, CFI = .99) and coefficient alpha was .83. The 3-week test-retest reliability coefficient was .82. The scores converged with scores for other scales of Facebook activity. Also, they were positively related to Neuroticism and Extraversion, and negatively related to Conscientiousness. High scores on the new scale were associated with delayed bedtimes and rising times.
Goldstein, Elizabeth; Farquhar, Marybeth; Crofton, Christine; Darby, Charles; Garfinkel, Steven
2005-12-01
To describe the developmental process for the CAHPS Hospital Survey. A pilot was conducted in three states with 19,720 hospital discharges. A rigorous, multi-step process was used to develop the CAHPS Hospital Survey. It included a public call for measures, multiple Federal Register notices soliciting public input, a review of the relevant literature, meetings with hospitals, consumers and survey vendors, cognitive interviews with consumer, a large-scale pilot test in three states and consumer testing and numerous small-scale field tests. The current version of the CAHPS Hospital Survey has survey items in seven domains, two overall ratings of the hospital and five items used for adjusting for the mix of patients across hospitals and for analytical purposes. The CAHPS Hospital Survey is a core set of questions that can be administered as a stand-alone questionnaire or combined with a broader set of hospital specific items.
Northwest Manufacturing Initiative
2012-03-27
crack growth and threshold stress corrosion cracking evaluation. Threshold stress corrosion cracking was done using the rising step load method with...Group Technology methods to establish manufacturing cells for production efficiency, to develop internal Lean Champions, and to implement rapid... different levels, advisory, core, etc. VI. Core steering committee composed of members that have a significant vested interest. Action Item: Draft
Validating a Conceptual Framework for the Core Concept of "Cell-Cell Communication"
ERIC Educational Resources Information Center
Michael, Joel; Martinkova, Patricia; McFarland, Jenny; Wright, Ann; Cliff, William; Modell, Harold; Wenderoth, Mary Pat
2017-01-01
We have created and validated a conceptual framework for the core physiology concept of "cell-cell communication." The conceptual framework is composed of 51 items arranged in a hierarchy that is, in some instances, four levels deep. We have validated it with input from faculty who teach at a wide variety of institutional types. All…
ERIC Educational Resources Information Center
Cheek, Jimmy G.; McGhee, Max B.
An activity was undertaken to develop written criterion-referenced tests for the common core component of Applied Principles of Agribusiness and Natural Resources Occupations. Intended for tenth grade students who have completed Fundamentals of Agribusiness and Natural Resources Occupations, applied principles were designed to consist of three…
Detection of abnormal item based on time intervals for recommender systems.
Gao, Min; Yuan, Quan; Ling, Bin; Xiong, Qingyu
2014-01-01
With the rapid development of e-business, personalized recommendation has become core competence for enterprises to gain profits and improve customer satisfaction. Although collaborative filtering is the most successful approach for building a recommender system, it suffers from "shilling" attacks. In recent years, the research on shilling attacks has been greatly improved. However, the approaches suffer from serious problem in attack model dependency and high computational cost. To solve the problem, an approach for the detection of abnormal item is proposed in this paper. In the paper, two common features of all attack models are analyzed at first. A revised bottom-up discretized approach is then proposed based on time intervals and the features for the detection. The distributions of ratings in different time intervals are compared to detect anomaly based on the calculation of chi square distribution (χ(2)). We evaluated our approach on four types of items which are defined according to the life cycles of these items. The experimental results show that the proposed approach achieves a high detection rate with low computational cost when the number of attack profiles is more than 15. It improves the efficiency in shilling attacks detection by narrowing down the suspicious users.
Balboni, Giulia; Incognito, Oriana; Belacchi, Carmen; Bonichini, Sabrina; Cubelli, Roberto
2017-02-01
The evaluation of adaptive behavior is informative in children with attention-deficit/hyperactivity disorder (ADHD) or specific learning disorders (SLD). However, the few investigations available have focused only on the gross level of domains of adaptive behavior. To investigate which item subsets of the Vineland-II can discriminate children with ADHD or SLD from peers with typical development. Student's t-tests, ROC analysis, logistic regression, and linear discriminant function analysis were used to compare 24 children with ADHD, 61 elementary students with SLD, and controls matched on age, sex, school level attended, and both parents' education level. Several item subsets that address not only ADHD core symptoms, but also understanding in social context and development of interpersonal relationships, allowed discrimination of children with ADHD from controls. The combination of four item subsets (Listening and attending, Expressing complex ideas, Social communication, and Following instructions) classified children with ADHD with both sensitivity and specificity of 87.5%. Only Reading skills, Writing skills, and Time and dates discriminated children with SLD from controls. Evaluation of Vineland-II scores at the level of item content categories is a useful procedure for an efficient clinical description. Copyright © 2016 Elsevier Ltd. All rights reserved.
National Workshop on Astrobiology: The Life Science Involvement of AAS I Laben
NASA Astrophysics Data System (ADS)
Adami, Giorgio
2006-12-01
The search for traces of past and present life is a complex and multidisciplinary research activity involving several scientific heritages and a specific industrial ability for planetary exploration. Laben was established in 1958 to design and manufacture electronic instruments for research in nuclear physics. In the mid 2004 the company was merged with Alenia Spazio. It is now part of Alcatel Alenia Space, a French Italian joint venture. Alcatel Alenia Space Italia SpA is a Finmeccanica Company. Currently the plant of Vimodrone provides a wide heritage in life science oriented to space application. The experience in Space Life Science is consolidated in the following research areas:
Smits, Marleen; Keizer, Ellen; Ram, Paul; Giesen, Paul
2017-12-02
Telephone triage is a core but vulnerable part of the care process at out-of-hours general practitioner (GP) cooperatives. In the Netherlands, different instruments have been used for assessing the quality of telephone triage. These instruments focussed mainly on communicational aspects, and less on the medical quality of triage decisions. Our aim was to develop and test a minimum set of items to assess the quality of telephone triage. A national survey among all GP cooperatives in the Netherlands was performed to examine the most important aspects of telephone triage. Next, corresponding items from existing instruments were searched on these topics. Subsequently, an expert panel judged these items on importance, completeness and formulation. The concept KERNset consisted of 24 items about the telephone conversation: 13 medical, ten communicational and one regarding both types. It was pilot tested on measurement characteristics, reliability, validity and variation between triagists. In this pilot study, 114 anonymous calls from four GP cooperatives spread across the Netherlands were judged by three out of eight raters, both internal and external raters. Cronbach's alpha was .94 for the medical items and .75 for the communicational items. Inter-rater reliability: complete agreement between the external raters was 45% and reasonable agreement 73% (difference of maximally one point on the five-point scale). Intra-rater reliability: complete agreement within raters was 55% and reasonable agreement 84%. There were hardly any differences between internal and external raters, but there were differences in strictness between individual raters. The construct validity was confirmed by the high correlation between the general impression of the call and the items of the KERNset. Of the differences within items 19% could be explained by differences between triage nurses, which means the KERNset is able to demonstrate differences between triage nurses. The KERNset can be used to assess the quality of telephone triage. The validity is good and differences between calls and between triage nurses can be measured. A more intensive training for raters could improve the reliability.
The Bergen Shopping Addiction Scale: reliability and validity of a brief screening test.
Andreassen, Cecilie S; Griffiths, Mark D; Pallesen, Ståle; Bilder, Robert M; Torsheim, Torbjørn; Aboujaoude, Elias
2015-01-01
Although excessive and compulsive shopping has been increasingly placed within the behavioral addiction paradigm in recent years, items in existing screens arguably do not assess the core criteria and components of addiction. To date, assessment screens for shopping disorders have primarily been rooted within the impulse-control or obsessive-compulsive disorder paradigms. Furthermore, existing screens use the terms 'shopping,' 'buying,' and 'spending' interchangeably, and do not necessarily reflect contemporary shopping habits. Consequently, a new screening tool for assessing shopping addiction was developed. Initially, 28 items, four for each of seven addiction criteria (salience, mood modification, conflict, tolerance, withdrawal, relapse, and problems), were constructed. These items and validated scales (i.e., Compulsive Buying Measurement Scale, Mini-International Personality Item Pool, Hospital Anxiety and Depression Scale, Rosenberg Self-Esteem Scale) were then administered to 23,537 participants (M age = 35.8 years, SD age = 13.3). The highest loading item from each set of four pooled items reflecting the seven addiction criteria were retained in the final scale, The Bergen Shopping Addiction Scale (BSAS). The factor structure of the BSAS was good (RMSEA = 0.064, CFI = 0.983, TLI = 0.973) and coefficient alpha was 0.87. The scores on the BSAS converged with scores on the Compulsive Buying Measurement Scale (CBMS; 0.80), and were positively correlated with extroversion and neuroticism, and negatively with conscientiousness, agreeableness, and intellect/imagination. The scores of the BSAS were positively associated with anxiety, depression, and low self-esteem and inversely related to age. Females scored higher than males on the BSAS. The BSAS is the first scale to fully embed shopping addiction within an addiction paradigm. A recommended cutoff score for the new scale and future research directions are discussed.
Barbic, Skye P; Bartlett, Susan J; Mayo, Nancy E
2015-07-01
To describe the practical steps in identifying items and evaluating scoring strategies for a new measure of emotional vitality in informal caregivers of individuals who have experienced a significant health event. The psychometric properties of responses to selected items from validated health-related quality of life and other psychosocial questionnaires administered four times over a one-year period were evaluated using Rasch Measurement Theory. Community. A total of 409 individuals providing informal care at home to older adults who had experienced a recent stroke. Rasch Measurement Theory was used to test the ordering of response option thresholds, fit, spread of the item locations, residual correlations, person separation index, and stability across time. Based on a theoretical framework developed in earlier work, we identified 22 candidate items from a pool of relevant psychosocial measures available. Of these, additional evaluation resulted in 19 items that could be used to assess the five core domains. The overall model fit was reasonable (χ(2) = 202.26, DF = 117, p = 0.06), stable across time, with borderline evidence of multidimensionality (10%). Items and people covered a continuum ranging from -3.7 to +2.7 logits, reflecting coverage of the measurement continuum, with a person separation index of 0.85. Mean fit of caregivers was lower than expected (-1.31 ±1.10 logits). Established methods from the Rasch Measurement Theory were applied to develop a prototype measure of emotional vitality that is acceptable, reliable, and can be used to obtain an interval level score for use in future research and clinical settings. © The Author(s) 2014.
The Bergen Shopping Addiction Scale: reliability and validity of a brief screening test
Andreassen, Cecilie S.; Griffiths, Mark D.; Pallesen, Ståle; Bilder, Robert M.; Torsheim, Torbjørn; Aboujaoude, Elias
2015-01-01
Although excessive and compulsive shopping has been increasingly placed within the behavioral addiction paradigm in recent years, items in existing screens arguably do not assess the core criteria and components of addiction. To date, assessment screens for shopping disorders have primarily been rooted within the impulse-control or obsessive-compulsive disorder paradigms. Furthermore, existing screens use the terms ‘shopping,’ ‘buying,’ and ‘spending’ interchangeably, and do not necessarily reflect contemporary shopping habits. Consequently, a new screening tool for assessing shopping addiction was developed. Initially, 28 items, four for each of seven addiction criteria (salience, mood modification, conflict, tolerance, withdrawal, relapse, and problems), were constructed. These items and validated scales (i.e., Compulsive Buying Measurement Scale, Mini-International Personality Item Pool, Hospital Anxiety and Depression Scale, Rosenberg Self-Esteem Scale) were then administered to 23,537 participants (Mage = 35.8 years, SDage = 13.3). The highest loading item from each set of four pooled items reflecting the seven addiction criteria were retained in the final scale, The Bergen Shopping Addiction Scale (BSAS). The factor structure of the BSAS was good (RMSEA = 0.064, CFI = 0.983, TLI = 0.973) and coefficient alpha was 0.87. The scores on the BSAS converged with scores on the Compulsive Buying Measurement Scale (CBMS; 0.80), and were positively correlated with extroversion and neuroticism, and negatively with conscientiousness, agreeableness, and intellect/imagination. The scores of the BSAS were positively associated with anxiety, depression, and low self-esteem and inversely related to age. Females scored higher than males on the BSAS. The BSAS is the first scale to fully embed shopping addiction within an addiction paradigm. A recommended cutoff score for the new scale and future research directions are discussed. PMID:26441749
Doostfatemeh, Marziyeh; Ayatollahi, Seyyed Mohammad Taghi; Jafari, Peyman
2015-08-01
In child-parent agreement studies in the field of paediatric health-related quality of life (HRQoL), little attention has been paid to the effect of gender in parental proxy rating of children's HRQoL. This study aims to test the potential interchangeability of parent dyads in reporting children's HRQoL on both item and scale levels of the PedsQL™ 4.0 instrument, using the approach of differential item functioning (DIF). The PedsQL™ 4.0 Generic Core Scales were completed by 576 father-and-mother dyads. A polytomous item response theory model, graded response model, was used to detect DIF across fathers and mothers. Assessment at item level showed that fathers and mothers perceived the meaning of items of the PedsQL™ 4.0 consistently. Regarding the scale level, a moderate to high level of agreement was observed between mothers' and fathers' reports on all similar subscales. Although the significant mean score differences in total, physical and emotional functioning indicated that fathers gave higher scores to their children, the small effect size implied that this difference may not be practically meaningful. Our findings revealed that discrepancy in parent dyads in rating children's HRQoL is a "real" difference and not an artefact due to measurement non-invariance. Fathers were seen to have slightly different insights into their children, especially for emotional functioning, but overall the results were not all that different. This suggests that paternal proxy-reports can be included in studies along with maternal proxy-reports, and the two may be combined when looking at parent-child agreement. Parent-child agreement studies in Iran are not affected by parents' gender, and therefore, researchers may rely on the assumption of the interchangeability of fathers and mothers in these studies.
Reporting studies on time to diagnosis: proposal of a guideline by an international panel (REST).
Launay, Elise; Cohen, Jérémie F; Bossuyt, Patrick M; Buekens, Pierre; Deeks, Jonathan; Dye, Timothy; Feltbower, Richard; Ferrari, Andrea; Kramer, Michael; Leeflang, Mariska; Moher, David; Moons, Karel G; von Elm, Erik; Ravaud, Philippe; Chalumeau, Martin
2016-09-27
Studies on time to diagnosis are an increasing field of clinical research that may help to plan corrective actions and identify inequities in access to healthcare. Specific features of time to diagnosis studies, such as how participants were selected and how time to diagnosis was defined and measured, are poorly reported. The present study aims to derive a reporting guideline for studies on time to diagnosis. Each item of a list previously used to evaluate the completeness of reporting of studies on time to diagnosis was independently evaluated by a core panel of international experts (n = 11) for relevance and readability before an open electronic discussion allowed consensus to be reached on a refined list. The list was then submitted with an explanatory document to first, last and/or corresponding authors (n = 98) of published systematic reviews on time to diagnosis (n = 45) for relevance and readability, and finally approved by the core expert panel. The refined reporting guideline consists of a 19-item checklist: six items are about the process of participant selection (with a suggested flowchart), six about the definition and measurement of time to diagnosis, and three about optional analyses of associations between time to diagnosis and participant characteristics and health outcomes. Of 24 responding authors of systematic reviews, more than 21 (≥88 %) rated the items as relevant, and more than 17 (≥70 %) as readable; 19 of 22 (86 %) authors stated that they would potentially use the reporting guideline in the future. We propose a reporting guideline (REST) that could help authors, reviewers, and editors of time to diagnosis study reports to improve the completeness and the accuracy of their reporting.
Item generation in the development of an inpatient experience questionnaire: a qualitative study
2013-01-01
Background Patient experience is a key feature of quality improvement in modern health-care delivery. Measuring patient experience is one of several tools used to assess and monitor the quality of health services. This study aims to develop a tool for assessing patient experience with inpatient care in public hospitals in Hong Kong. Methods Based on the General Inpatient Questionnaire (GIQ) framework of the Care Quality Commission as a discussion guide, a qualitative study involving focus group discussions and in-depth individual interviews with patients was employed to develop a tool for measuring inpatient experience in Hong Kong. Results All participants agreed that a patient satisfaction survey is an important platform for collecting patients’ views on improving the quality of health-care services. Findings of the focus group discussions and in-depth individual interviews identified nine key themes as important hospital quality indicators: prompt access, information provision, care and involvement in decision making, physical and emotional needs, coordination of care, respect and privacy, environment and facilities, handling of patient feedback, and overall care from health-care professionals and quality of care. Privacy, complaint mechanisms, patient involvement, and information provision were further highlighted as particularly important areas for item revision by the in-depth individual interviews. Thus, the initial version of the Hong Kong Inpatient Experience Questionnaire (HKIEQ), comprising 58 core items under nine themes, was developed. Conclusions A set of dimensions and core items of the HKIEQ was developed and the instrument will undergo validity and reliability tests through a validation survey. A valid and reliable tool is important in accurately assessing patient experience with care delivery in hospitals to improve the quality of health-care services. PMID:23835186
Van Lerbeirghe, J; Van Lerbeirghe, J; Van Schaeybroeck, P; Robijn, H; Rasschaert, R; Sys, J; Parlevliet, T; Hallaert, G; Van Wambeke, P; Depreitere, B
2018-01-01
The core outcome measures index (COMI) is a validated multidimensional instrument for assessing patient-reported outcome in patients with back problems. The aim of the present study is to translate the COMI into Dutch and validate it for use in native Dutch speakers with low back pain. The COMI was translated into Dutch following established guidelines and avoiding region-specific terminology. A total of 89 Dutch-speaking patients with low back pain were recruited from 8 centers, located in the Dutch-speaking part of Belgium. Patients completed a questionnaire booklet including the validated Dutch version of the Roland Morris disability questionnaire, EQ-5D, the WHOQoL-Bref, the Numeric Rating Scale (NRS) for pain, and the Dutch translation of the COMI. Two weeks later, patients completed the Dutch COMI translation again, with a transition scale assessing changes in their condition. The patterns of correlations between the individual COMI items and the validated reference questionnaires were comparable to those reported for other validated language versions of the COMI. The intraclass correlation for the COMI summary score was 0.90 (95% CI 0.84-0.94). It was 0.75 and 0.70 for the back and leg pain score, respectively. The minimum detectable change for the COMI summary score was 1.74. No significant differences were observed between repeated scores of individual COMI items or for the summary score. The reproducibility of the Dutch translation of the COMI is comparable to that of other validated spine outcome measures. The COMI items correlate well with the established item-specific scores. The Dutch translation of the COMI, validated by this work, is a reliable and valuable tool for spine centers treating Dutch-speaking patients and can be used in registries and outcome studies.
Umar, Sulaiman; Man, Norsida; Nawi, Nolila Mohd; Latif, Ismail Abd; Samah, Bahaman Abu
2017-06-01
The study described the perceived importance of, and proficiency in core agricultural extension competencies among extension workers in Peninsular Malaysia; and evaluating the resultant deficits in the competencies. The Borich's Needs Assessment Model was used to achieve the objectives of the study. A sample of 298 respondents was randomly selected and interviewed using a pre-tested structured questionnaire. Thirty-three core competency items were assessed. Instrument validity and reliability were ensured. The cross-sectional data obtained was analysed using SPSS for descriptive statistics including mean weighted discrepancy score (MWDS). Results of the study showed that on a scale of 5, the most important core extension competency items according to respondents' perception were: "Making good use of information and communication technologies/access and use of web-based resources" (M=4.86, SD=0.23); "Conducting needs assessments" (M=4.84, SD=0.16); "organizing extension campaigns" (M=4.82, SD=0.47) and "Managing groups and teamwork" (M=4.81, SD=0.76). In terms of proficiency, the highest competency identified by the respondents was "Conducting farm and home visits (M=3.62, SD=0.82) followed by 'conducting meetings effectively' (M=3.19, SD=0.72); "Conducting focus group discussions" (M=3.16, SD=0.32) and "conducting community forums" (M=3.13, SD=0.64). The discrepancies implying competency deficits were widest in "Acquiring and allocating resources" (MWDS=12.67); use of information and communication technologies (ICTs) and web-based resources in agricultural extension (MWDS=12.59); and report writing and sharing the results and impacts (MWDS=11.92). It is recommended that any intervention aimed at developing the capacity of extension workers in Peninsular Malaysia should prioritize these core competency items in accordance with the deficits established in this study. Copyright © 2017 Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Nelson, Philip
2015-03-01
I'll describe an intermediate-level course on ``Physical Models of Living Systems.'' The only prerequisite is first-year university physics and calculus. The course is a response to rapidly growing interest among undergraduates in a broad range of science and engineering majors. Students acquire several research skills that are often not addressed in traditional courses:
Exploration into the Effects of the Schema-Based Instruction: A Bottom-Up Approach
ERIC Educational Resources Information Center
Fujii, Kazuma
2016-01-01
The purpose of this paper is to explore the effective use of the core schema-based instruction (SBI) in a classroom setting. The core schema is a schematic representation of the common underlying meaning of a given lexical item, and was first proposed on the basis of the cognitive linguistic perspectives by the Japanese applied linguists Tanaka,…
Zhao, L; Wang, Z; Qin, Z; Leslie, E; He, J; Xiong, Y; Xu, F
2018-03-01
The identification of physical-activity-friendly built environment (BE) constructs is highly useful for physical activity promotion and maintenance. The Physical Activity Neighborhood Environment Scale (PANES) was developed for assessing BE correlates. However, PANES reliability has not been investigated among adults in China. A cross-sectional study. With multistage sampling approaches, 1568 urban adults (aged 35-74 years) were recruited for the initial survey on all 17 items of PANES Chinese version (PANES-CHN), with the survey repeated 7 days later for each participant. Intraclass correlation coefficient (ICC) was used to assess the test-retest reliability of PANES-CHN for each item. Totally, 1551 participants completed both surveys (follow-up rate = 98.9%). Among participants (mean age: 54.7 ± 11.1 years), 47.8% were men, 22.1% were elders, and 22.7% had ≥13 years of education. Overall, the PANES-CHN demonstrated at least substantial reliability with ICCs ranging from 0.66 to 0.95 (core items), from 0.75 to 0.95 (recommended items), and from 0.78 to 0.87 (optional items). Similar outcomes were observed when data were analyzed by gender or age groups. The PANES-CHN has excellent test-retest reliability and thus has valuable utility for assessing urban BE attributes among Chinese adults. Copyright © 2017 The Royal Society for Public Health. Published by Elsevier Ltd. All rights reserved.
The EORTC module for quality of life in patients with thyroid cancer: phase III.
Singer, Susanne; Jordan, Susan; Locati, Laura D; Pinto, Monica; Tomaszewska, Iwona M; Araújo, Cláudia; Hammerlid, Eva; Vidhubala, E; Husson, Olga; Kiyota, Naomi; Brannan, Christine; Salem, Dina; Gamper, Eva M; Arraras, Juan Ignacio; Ioannidis, Georgios; Andry, Guy; Inhestern, Johanna; Grégoire, Vincent; Licitra, Lisa
2017-04-01
The purpose of the study was to pilot-test a questionnaire measuring health-related quality of life (QoL) in thyroid cancer patients to be used with the European Organisation for Research and Treatment of Cancer (EORTC) core questionnaire EORTC QLQ-C30. A provisional questionnaire with 47 items was administered to patients treated for thyroid cancer within the last 2 years. Patients were interviewed about time and help needed to complete the questionnaire, and whether they found the items understandable, confusing or annoying. Items were kept in the questionnaire if they fulfilled pre-defined criteria: relevant to the patients, easy to understand, not confusing, few missing values, neither floor nor ceiling effects, and high variance. A total of 182 thyroid cancer patients in 15 countries participated ( n = 115 with papillary, n = 31 with follicular, n = 22 with medullary, n = 6 with anaplastic, and n = 8 with other types of thyroid cancer). Sixty-six percent of the patients needed 15 min or less to complete the questionnaire. Of the 47 items, 31 fulfilled the predefined criteria and were kept unchanged, 14 were removed, and 2 were changed. Shoulder dysfunction was mentioned by 5 patients as missing and an item covering this issue was added. To conclude, the EORTC quality of life module for thyroid cancer (EORTC QLQ-THY34) is ready for the final validation phase IV. © 2017 Society for Endocrinology.
The diagnostic utility of separation anxiety disorder symptoms: An item response theory analysis
Cooper-Vince, Christine E.; Emmert-Aronson, Benjamin O.; Pincus, Donna B.; Comer, Jonathan S.
2013-01-01
At present, it is not clear whether the current definition of separation anxiety disorder (SAD) is the optimal classification of developmentally inappropriate, severe, and interfering separation anxiety in youth. Much remains to be learned about the relative contributions of individual SAD symptoms for informing diagnosis. Two-parameter logistic Item Response Theory analyses were conducted on the eight core SAD symptoms in an outpatient anxiety sample of treatment-seeking children (N=359, 59.3% female, MAge=11.2) and their parents to determine the diagnostic utility of each of these symptoms. Analyses considered values of item threshold, which characterize the SAD severity level at which each symptom has a 50% chance of being endorsed, and item discrimination, which characterize how well each symptom distinguishes individuals with higher and lower levels of SAD. Distress related to separation and fear of being alone without major attachment figures showed the strongest discrimination properties and the lowest thresholds for being endorsed. In contrast, worry about harm befalling attachment figures showed the poorest discrimination properties, and nightmares about separation showed the highest threshold for being endorsed. Distress related to separation demonstrated crossing differential item functioning associated with age—at lower separation anxiety levels excessive fear at separation was more likely to be endorsed for children ≥9 years, whereas at higher levels this symptom was more likely to be endorsed by children <9 years. Implications are discussed for optimizing the taxonomy of SAD in youth. PMID:23963543
Improved Taxation Rate for Bin Packing Games
NASA Astrophysics Data System (ADS)
Kern, Walter; Qiu, Xian
A cooperative bin packing game is a N-person game, where the player set N consists of k bins of capacity 1 each and n items of sizes a 1, ⋯ ,a n . The value of a coalition of players is defined to be the maximum total size of items in the coalition that can be packed into the bins of the coalition. We present an alternative proof for the non-emptiness of the 1/3-core for all bin packing games and show how to improve this bound ɛ= 1/3 (slightly). We conjecture that the true best possible value is ɛ= 1/7.
Pescosolido, Bernice A; Medina, Tait R; Martin, Jack K; Long, J Scott
2013-05-01
We used the Stigma in Global Context-Mental Health Study to assess the core sentiments that represent consistent, salient public health intervention targets. Data from 16 countries employed a nationally representative sampling strategy, international collaboration for instrument development, and case vignettes with Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition depression and schizophrenia criteria. We measured knowledge and prejudice with existing questions and scales, and employed exploratory data analysis to examine the public response to 43 items. Across countries, levels of recognition, acceptance of neurobiological attributions, and treatment endorsement were high. However, a core of 5 prejudice items was consistently high, even in countries with low overall stigma levels. The levels were generally lower for depression than schizophrenia, and exclusionary sentiments for more intimate venues and in authority-based roles showed the greatest stigma. Negative responses to schizophrenia and depression were highly correlated across countries. These results challenge researchers to reconfigure measurement strategies and policymakers to reconsider efforts to improve population mental health. Efforts should prioritize inclusion, integration, and competences for the reduction of cultural barriers to recognition, response, and recovery.
Teresi, Jeanne A; Ocepek-Welikson, Katja; Ramirez, Mildred; Kleinman, Marjorie; Ornstein, Katherine; Siu, Albert
2016-01-01
Background The Family Satisfaction with End-of-Life Care is an internationally used measure of satisfaction with cancer care. However, the Family Satisfaction with End-of-Life Care has not been studied for equivalence of item endorsement across different socio-demographic groups using differential item functioning. Aims The aims of this secondary data analysis were (1) to examine potential differential item functioning in the family satisfaction item set with respect to type of caregiver, race, and patient age, gender, and education and (2) to provide parameters and documentation of differential item functioning for an item bank. Design A mixed qualitative and quantitative analysis was conducted. A priori hypotheses regarding potential group differences in item response were established. Item response theory and Wald tests were used for the analyses of differential item functioning, accompanied by magnitude and impact measures. Results Very little significant differential item functioning was observed for patient's age and gender. For race, 13 items showed differential item functioning after multiple comparison adjustment, 10 with non-uniform differential item functioning. No items evidenced differential item functioning of high magnitude, and the impact was negligible. For education, 5 items evidenced uniform differential item functioning after adjustment, none of high magnitude. Differential item functioning impact was trivial. One item evidenced differential item functioning for the caregiver relationship variable. Conclusion Differential item functioning was observed primarily for race and education. No differential item functioning of high magnitude was observed for any item, and the overall impact of differential item functioning was negligible. One item, satisfaction with “the patient's pain relief,” might be singled out for further study, given that this item was both hypothesized and observed to show differential item functioning for race and education. PMID:25160692
Hinton, Devon E; Hinton, Alexander L; Eng, Kok-Thay; Choung, Sophearith
2012-09-01
This article describes a culturally sensitive assessment tool for traumatized Cambodians, the Cambodian "Somatic Symptom and Syndrome Inventory" (SSI), and reports the outcome of a needs assessment conducted in rural Cambodia using the instrument. Villagers locally identified (N = 139) as still suffering the effects of the Pol Pot genocide were evaluated. All 139 had post-traumatic stress disorder (PTSD) as assessed by the PTSD Checklist (PCL), and they had elevated SSI scores. The severity of the SSI items varied by level of PTSD severity, and several items--for example, dizziness, dizziness on standing, khyâl (a windlike substance) attacks, and "thinking a lot"--were extremely elevated in those participants with higher levels of PTSD. The SSI was more highly correlated to self-perceived health (Short Form Health Survey-3) and past trauma events (Harvard Trauma Questionnaire) than was the PCL. The study shows the SSI items to be a core aspect of the Cambodian trauma ontology.
Fact Sheet: Early Warning Signs of Psychosis
... 2 items) Genetics (1 item) Brain Anatomy and Physiology (1 item) Other Treatments (4 items) Coping with ... 2 items) Genetics (1 item) Brain Anatomy and Physiology (1 item) Other Treatments (4 items) Coping with ...
Treatment of Children with Mental Illness
... 1 item) Post-Traumatic Stress Disorder (4 items) Schizophrenia (5 items) Social Phobia (1 item) Populations Children ... 1 item) Post-Traumatic Stress Disorder (4 items) Schizophrenia (5 items) Social Phobia (1 item) Populations Children ...
Federal Register 2010, 2011, 2012, 2013, 2014
2010-06-24
..., 5 cores, 2 digging stick handles, 2 flake perforators, 2 hafted drills, 1 piece of incised bone, 7 pestles, 2 projectile points, and 1 fragment of worked bone. The 32 lots of objects are 4 lots of animal... worked bone fragment, 1 bottle fragment, 13 bullet cartridges, 3 copper pendants, 6 cores, 1 digging...
ERIC Educational Resources Information Center
Martens, Matthew P.; Brown, Natashia T.; Donovan, Brooke M.; Dude, Kim
2005-01-01
A commonly used instrument to assess negative consequences of substance use among college students is the Core Alcohol and Drug Survey (CADS; C. A. Presley, P. W. Meilman, & J. S. Leichliter, 1998; C. A. Presley, P. W. Meilman, & R. Lyerla, 1993). Results from 2 studies suggest that a subset of CADS negative consequences items can be…
Park, In Sook; Suh, Yeon Ok; Park, Hae Sook; Kang, So Young; Kim, Kwang Sung; Kim, Gyung Hee; Choi, Yeon-Hee; Kim, Hyun-Ju
2017-01-01
The purpose of this study was to improve the quality of items on the Korean Nursing Licensing Examination by developing and evaluating case-based items that reflect integrated nursing knowledge. We conducted a cross-sectional observational study to develop new case-based items. The methods for developing test items included expert workshops, brainstorming, and verification of content validity. After a mock examination of undergraduate nursing students using the newly developed case-based items, we evaluated the appropriateness of the items through classical test theory and item response theory. A total of 50 case-based items were developed for the mock examination, and content validity was evaluated. The question items integrated 34 discrete elements of integrated nursing knowledge. The mock examination was taken by 741 baccalaureate students in their fourth year of study at 13 universities. Their average score on the mock examination was 57.4, and the examination showed a reliability of 0.40. According to classical test theory, the average level of item difficulty of the items was 57.4% (80%-100% for 12 items; 60%-80% for 13 items; and less than 60% for 25 items). The mean discrimination index was 0.19, and was above 0.30 for 11 items and 0.20 to 0.29 for 15 items. According to item response theory, the item discrimination parameter (in the logistic model) was none for 10 items (0.00), very low for 20 items (0.01 to 0.34), low for 12 items (0.35 to 0.64), moderate for 6 items (0.65 to 1.34), high for 1 item (1.35 to 1.69), and very high for 1 item (above 1.70). The item difficulty was very easy for 24 items (below -2.0), easy for 8 items (-2.0 to -0.5), medium for 6 items (-0.5 to 0.5), hard for 3 items (0.5 to 2.0), and very hard for 9 items (2.0 or above). The goodness-of-fit test in terms of the 2-parameter item response model between the range of 2.0 to 0.5 revealed that 12 items had an ideal correct answer rate. We surmised that the low reliability of the mock examination was influenced by the timing of the test for the examinees and the inappropriate difficulty of the items. Our study suggested a methodology for the development of future case-based items for the Korean Nursing Licensing Examination.
A new, female-specific irritability rating scale
Born, Leslie; Koren, Gideon; Lin, Elizabeth; Steiner, Meir
2008-01-01
Objective Irritability is a prominent symptom in the spectrum of female-specific mood disorders, and in some women, irritability is serious enough to disrupt their lives and warrant treatment. The objective of this research was to develop a new, female-specific state measure of irritability. Methods We constructed self-rating and observer rating scales using items derived from spontaneous descriptions of irritability by women with mood disturbances related to the menstrual cycle, childbearing or menopause. Following a pretest, the scales were shortened to the core items of irritability (annoyance, anger, tension, hostility, sensitivity to noise and touch) and tested on a new cohort of patients. Results The 14-item Self-Rating Scale and the 5-item Observer Rating Scale showed evidence for internal consistency (Self-Rating: n = 36 patients, Cronbach's α = 0.9257, mean interitem correlation = 0.4690; Observer Rating: Cronbach's α = 0.7418, mean interitem correlation = 0.3616), Self-Rating test–retest reliability (n = 29 patients, rs = 0.704, p = 0.01) and interrater reliability (n = 20 patients; τb = 1.000, p = 0.001). Conclusion This new, female-specific scale for rating irritability has the potential to further the evaluation of this prominent symptom cluster and increase specificity in clinical assessments of emotional disturbances related to reproductive cyclicity in women. PMID:18592028
The core content of the undergraduate curriculum in Manchester.
O'Neill, P A; Metcalfe, D; David, T J
1999-02-01
To identify the core content for the new undergraduate medical curriculum in Manchester. The initial step was to produce a list of 'index clinical situations' (ICSs), for which a newly graduated doctor must have a required level of competence. Using repeated consultation with consultants and general practitioners involved in medical education in the North-West of England, a list of 215 ICSs was agreed. Specialists and generalists were then asked to identify the components of the knowledge base and the performance (skills) base for each ICS. The knowledge base was divided into technical (biomedical facts/concepts) and contextual (effect/management of disease within the individual, family and society) domains. The performance base was divided into intellectual (problem solving and decision making) and interpersonal (history, examination, communication and procedural skills) domains. Forty specialties were consulted and 11,021 items (defined as a piece of knowledge, a concept or a skill) were identified. There was considerable overlap in the items listed, such that when the returns for each ICS were amalgamated, the 215 ICSs contained 6434 items with a mean of 34 +/- 14.2 per situation (range 6-85). UTILISATION: We have used the defined ICSs in the design of the trigger material used in the weekly problem-based learning sessions. Over 4 years almost all (207/215, 96%) of the ICS are covered, with many being revisited at several points in the curriculum.
From Paresis to PANDAS and PANS
... 4 items) Genetics (18 items) Brain Anatomy and Physiology (7 items) RDoC (6 items) Research Funding (36 ... 4 items) Genetics (18 items) Brain Anatomy and Physiology (7 items) RDoC (6 items) Research Funding (36 ...
Leppert, Wojciech; Majkowicz, Mikolaj
2013-05-01
Limited data exist on the validation of the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire - Core 15 - Palliative Care in advanced cancer patients. To adapt the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire - Core 15 - Palliative Care to the Polish clinical setting and to evaluate its psychometric properties in advanced cancer patients. Two quality-of-life measurements were performed at baseline and after 7 days. The concurrent validity of the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire - Core 15 - Palliative Care was established by the Pearson correlation coefficients with the modified Edmonton Symptom Assessment System, the Karnofsky Performance Status and the Brief Pain Inventory - Short Form. Reliability was assessed using Cronbach's alpha coefficients and the Spearman correlation coefficients of the baseline and of the second measurement of European Organization for Research and Treatment of Cancer Quality of Life Questionnaire - Core 15 - Palliative Care items. A total of 160 consecutive patients in one academic palliative medicine centre were included. A total of 129 patients completed the study. The concurrent validity revealed significant correlations of the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire - Core 15 - Palliative Care pain scale with the Brief Pain Inventory - Short Form, the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire - Core 15 - Palliative Care symptom items with the modified Edmonton Symptom Assessment System and European Organization for Research and Treatment of Cancer Quality of Life Questionnaire - Core 15 - Palliative Care functional scales with the Karnofsky Performance Status scores. High Cronbach's alpha and standardised Cronbach's alpha values were found in the case of both functional (range: 0.830-0.925; 0.830-0.932) and symptom scales (range: 0.784-0.940; 0.794-0.941) of the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire - Core 15 - Palliative Care, respectively. The Spearman correlation coefficients between the first and the second measurements were significant (p < 0.0001) for all European Organization for Research and Treatment of Cancer Quality of Life Questionnaire - Core 15 - Palliative Care items. Polish version of the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire - Core 15 - Palliative Care is a valid and reliable tool recommended for quality-of-life assessment and monitoring in advanced cancer patients.
Akiyama, Tsuyoshi; Tsuda, Hitoshi; Matsumoto, Satoko; Miyake, Yuko; Kawamura, Yoshiya; Noda, Toshie; Akiskal, Kareen K; Akiskal, Hagop S
2005-03-01
In Japan, Kraepelin's descriptions on four "fundamental states" of manic depressive illness, the concepts of schizoid temperament by Kretschmer and obsessional and melancholic type temperament by Shimoda and Tellenbach have been widely accepted. This research investigates the construct validity of these temperaments through factor analysis. TEMPS-A measured depressive, cyclothymic, hyperthymic and irritable temperaments and MPT rigidity, esoteric and isolation subscales measured, respectively, melancholic type and schizoid temperaments. Factor analysis was implemented with TEMPS-A alone and TEMPS-A and MPT combined data. With TEMPS-A alone analysis, Factor 1 included 1 depressive, 11 cyclothymic and 12 irritable temperament items with a factor loading higher than 0.4; Factor 2 included 1 depressive and 10 hyperthymic temperament items; and Factor 3 included 2 depressive temperament items only. With TEMPS-A and MPT combined data, Factor 1 included 3 depressive, 11 cyclothymic and 5 irritable temperament items with a factor loading higher than 0.4 (interpreted as the central cyclothymic tendency for all affective temperaments along Kretschmerian lines and accounting for 11.7% of the variance); Factor 2 included 6 hyperthymic temperament items (6.22% of variance); Factor 3 included 1 cyclothymic, 7 irritable and 1 schizoid temperament items (interpreted as the irritable temperament and accounting for 3.24% of the variance); Factor 4 included 1 depressive temperament and 5 melancholic type items (interpreted as the latter, accounting for 2.66% of the variance); Factor 5 included 5 depressive temperament items, along interpersonal sensitivity and passivity lines, and accounting for 2.31% of the variance; and Factor 6 included 4 schizoid temperament items accounting for 2.07% of the variance. We did not use the Kasahara scale, which some believe to better capture the Japanese melancholic type. Sample was 70% male. These analyses confirm the factor validity of depressive, hyperthymic, cyclothymic and irritable temperaments (TEMPS-A), as well as the melancholic type and the schizoid temperament (MPT). Traits of the depressive and melancholic types emerge as rather distinct. Indeed, our results permit the delineation of an interpersonally sensitive type that "gives in to others" as the core features of the depressive temperament; this is to be contrasted with the higher functioning, perfectionistic, work-oriented melancholic type. Mood dysregulation is represented by the largest number of traits in this population. Contrary to a widely held belief that the melancholic type with its devotion to work and to others is the signature temperament in Japan, cyclothymic traits account for the largest variance in this nonclinical population. Hyperthymic temperament, melancholic type and schizoid temperaments appear largely independent of mood dysregulation. In this Japanese population, TEMPS-A may identify temperament constructs more comprehensively when implemented with melancholic type and schizoid temperament question items added to it. The proposed new Japanese Temperament and Personality (JTP) Scale has self-rated items divided into six subscales.
2013-01-01
Patient-reported outcomes (PROs) play an increasingly important role in clinical practice and research. Modern psychometric methods such as item response theory (IRT) enable the creation of item banks that support fixed-length forms as well as computerized adaptive testing (CAT), often resulting in improved measurement precision and responsiveness. Here we describe and discuss the case for developing an international core set of PROs building from the US PROMIS® network. PROMIS is a U.S.-based cooperative group of research sites and centers of excellence convened to develop and standardize PRO measures across studies and settings. If extended to a global collaboration, PROMIS has the potential to transform PRO measurement by creating a shared, unifying terminology and metric for reporting of common symptoms and functional life domains. Extending a common set of standardized PRO measures to the international community offers great potential for improving patient-centered research, clinical trials reporting, population monitoring, and health care worldwide. Benefits of such standardization include the possibility of: international syntheses (such as meta-analyses) of research findings; international population monitoring and policy development; health services administrators and planners access to relevant information on the populations they serve; better assessment and monitoring of patients by providers; and improved shared decision making. The goal of the current PROMIS International initiative is to ensure that item banks are translated and culturally adapted for use in adults and children in as many countries as possible. The process includes 3 key steps: translation/cultural adaptation, calibration, and validation. A universal translation, an approach focusing on commonalities, rather than differences across versions developed in regions or countries speaking the same language, is proposed to ensure conceptual equivalence for all items. International item calibration using nationally representative samples of adults and children within countries is essential to demonstrate that all items possess expected strong measurement properties. Finally, it is important to demonstrate that the PROMIS measures are valid, reliable and responsive to change when used in an international context. IRT item banking will allow for tailoring within countries and facilitate growth and evolution of PROs through contributions from the international measurement community. A number of opportunities and challenges of international development of PROs item banks are discussed. PMID:24359143
Alonso, Jordi; Bartlett, Susan J; Rose, Matthias; Aaronson, Neil K; Chaplin, John E; Efficace, Fabio; Leplège, Alain; Lu, Aiping; Tulsky, David S; Raat, Hein; Ravens-Sieberer, Ulrike; Revicki, Dennis; Terwee, Caroline B; Valderas, Jose M; Cella, David; Forrest, Christopher B
2013-12-20
Patient-reported outcomes (PROs) play an increasingly important role in clinical practice and research. Modern psychometric methods such as item response theory (IRT) enable the creation of item banks that support fixed-length forms as well as computerized adaptive testing (CAT), often resulting in improved measurement precision and responsiveness. Here we describe and discuss the case for developing an international core set of PROs building from the US PROMIS® network.PROMIS is a U.S.-based cooperative group of research sites and centers of excellence convened to develop and standardize PRO measures across studies and settings. If extended to a global collaboration, PROMIS has the potential to transform PRO measurement by creating a shared, unifying terminology and metric for reporting of common symptoms and functional life domains. Extending a common set of standardized PRO measures to the international community offers great potential for improving patient-centered research, clinical trials reporting, population monitoring, and health care worldwide. Benefits of such standardization include the possibility of: international syntheses (such as meta-analyses) of research findings; international population monitoring and policy development; health services administrators and planners access to relevant information on the populations they serve; better assessment and monitoring of patients by providers; and improved shared decision making.The goal of the current PROMIS International initiative is to ensure that item banks are translated and culturally adapted for use in adults and children in as many countries as possible. The process includes 3 key steps: translation/cultural adaptation, calibration, and validation. A universal translation, an approach focusing on commonalities, rather than differences across versions developed in regions or countries speaking the same language, is proposed to ensure conceptual equivalence for all items. International item calibration using nationally representative samples of adults and children within countries is essential to demonstrate that all items possess expected strong measurement properties. Finally, it is important to demonstrate that the PROMIS measures are valid, reliable and responsive to change when used in an international context.IRT item banking will allow for tailoring within countries and facilitate growth and evolution of PROs through contributions from the international measurement community. A number of opportunities and challenges of international development of PROs item banks are discussed.
Comprehensive clinical assessment in community setting: applicability of the MDS-HC.
Morris, J N; Fries, B E; Steel, K; Ikegami, N; Bernabei, R; Carpenter, G I; Gilgen, R; Hirdes, J P; Topinková, E
1997-08-01
To describe the results of an international trial of the home care version of the MDS assessment and problem identification system (the MDS-HC), including reliability estimates, a comparison of MDS-HC reliabilities with reliabilities of the same items in the MDS 2.0 nursing home assessment instrument, and an examination of the types of problems found in home care clients using the MDS-HC. Independent, dual assessment of clients of home-care agencies by trained clinicians using a draft of the MDS-HC, with additional descriptive data regarding problem profiles for home care clients. Reliability data from dual assessments of 241 randomly selected clients of home care agencies in five countries, all of whom volunteered to test the MDS-HC. Also included are an expanded sample of 780 home care assessments from these countries and 187 dually assessed residents from 21 nursing homes in the United States. The array of MDS-HC assessment items included measures in the following areas: personal items, cognitive patterns, communication/hearing, vision, mood and behavior, social functioning, informal support services, physical functioning, continence, disease diagnoses health conditions and preventive health measures, nutrition/hydration, dental status, skin condition, environmental assessment, service utilization, and medications. Forty-seven percent of the functional, health status, social environment, and service items in the MDS-HC were taken from the MDS 2.0 for nursing homes. For this item set, it is estimated that the average weighted Kappa is .74 for the MDS-HC and .75 for the MDS 2.0. Similarly, high reliability values were found for items newly introduced in the MDS-HC (weighted Kappa = .70). Descriptive findings also characterize the problems of home care clients, with subanalyses within cognitive performance levels. Findings indicate that the core set of items in the MDS 2.0 work equally well in community and nursing home settings. New items are highly reliable. In tandem, these instruments can be used within the international community, assisting and planning care for older adults within a broad spectrum of service settings, including nursing homes and home care programs. With this community-based, second-generation problem and care plan-driven assessment instrument, disability assessment can be performed consistently across the world.
Development of a work addiction scale.
Andreassen, Cecilie Schou; Griffiths, Mark D; Hetland, Jørn; Pallesen, Ståle
2012-06-01
Research into excessive work has gained increasing attention over the last 20 years. Terms such as "workaholism,"work addiction" and "excessive work" have been used interchangeably. Given the increase in empirical research, this study presents the development of the Bergen Work Addiction Scale (BWAS), a new psychometrically validated scale for the assessment of work addiction. A pool of 14 items, with two reflecting each of seven core elements of addiction (i.e., salience, mood modification, tolerance, withdrawal, conflict, relapse, and problems) was initially constructed. The items were then administered to two samples, one recruited by a web survey following a television broadcast about workaholism (n = 11,769) and one comprising participants in the second wave of a longitudinal internet-based survey about working life (n = 368). The items with the highest corrected item-total correlation from within each of the seven addiction elements were retained in the final scale. The assumed one-factor solution of the refined seven-item scale was acceptable (root mean square error of approximation = 0.077, Comparative Fit Index = 0.96, Tucker-Lewis Index = 0.95) and the internal reliability of the two samples were 0.84 and 0.80, respectively. The scores of the BWAS converged with scores on other workaholism scales, except for a Work Enjoyment subscale. A suggested cut-off for categorization of workaholics showed good discriminative ability in terms of working hours, leadership position, and subjective health complaints. It is concluded that the BWAS has good psychometric properties. © 2012 The Authors. Scandinavian Journal of Psychology © 2012 The Scandinavian Psychological Associations.
Method using a density field for locating related items for data mining
Wylie, Brian N.
2002-01-01
A method for locating related items in a geometric space transforms relationships among items to geometric locations. The method locates items in the geometric space so that the distance between items corresponds to the degree of relatedness. The method facilitates communication of the structure of the relationships among the items. The method makes use of numeric values as a measure of similarity between each pairing of items. The items are given initial coordinates in the space. An energy is then determined for each item from the item's distance and similarity to other items, and from the density of items assigned coordinates near the item. The distance and similarity component can act to draw items with high similarities close together, while the density component can act to force all items apart. If a terminal condition is not yet reached, then new coordinates can be determined for one or more items, and the energy determination repeated. The iteration can terminate, for example, when the total energy reaches a threshold, when each item's energy is below a threshold, after a certain amount of time or iterations.
ERIC Educational Resources Information Center
Burns, Daniel J.; Martens, Nicholas J.; Bertoni, Alicia A.; Sweeney, Emily J.; Lividini, Michelle D.
2006-01-01
In a repeated testing paradigm, list items receiving item-specific processing are more likely to be recovered across successive tests (item gains), whereas items receiving relational processing are likely to be forgotten progressively less on successive tests. Moreover, analysis of cumulative-recall curves has shown that item-specific processing…
Obbarius, Nina; Fischer, Felix; Obbarius, Alexander; Nolte, Sandra; Liegl, Gregor; Rose, Matthias
2018-04-10
To develop the first item bank to measure Stress Resilience (SR) in clinical populations. Qualitative item development resulted in an initial pool of 131 items covering a broad theoretical SR concept. These items were tested in n=521 patients at a psychosomatic outpatient clinic. Exploratory and Confirmatory Factor Analysis (CFA), as well as other state-of-the-art item analyses and IRT were used for item evaluation and calibration of the final item bank. Out of the initial item pool of 131 items, we excluded 64 items (54 factor loading <.5, 4 residual correlations >.3, 2 non-discriminative Item Response Curves, 4 Differential Item Functioning). The final set of 67 items indicated sufficient model fit in CFA and IRT analyses. Additionally, a 10-item short form with high measurement precision (SE≤.32 in a theta range between -1.8 and +1.5) was derived. Both the SR item bank and the SR short form were highly correlated with an existing static legacy tool (Connor-Davidson Resilience Scale). The final SR item bank and 10-item short form showed good psychometric properties. When further validated, they will be ready to be used within a framework of Computer-Adaptive Tests for a comprehensive assessment of the Stress-Construct. Copyright © 2018. Published by Elsevier Inc.
Wiklander, Maria; Rydström, Lise-Lott; Ygge, Britt-Marie; Navér, Lars; Wettergren, Lena; Eriksson, Lars E
2013-11-14
HIV is a stigmatizing medical condition. The concept of HIV stigma is multifaceted, with personalized stigma (perceived stigmatizing consequences of others knowing of their HIV status), disclosure concerns, negative self-image, and concerns with public attitudes described as core aspects of stigma for individuals with HIV infection. There is limited research on HIV stigma in children. The aim of this study was to test a short version of the 40-item HIV Stigma Scale (HSS-40), adapted for 8-18 years old children with HIV infection living in Sweden. A Swedish version of the HSS-40 was adapted for children by an expert panel and evaluated by think aloud interviews. A preliminary short version with twelve items covering the four dimensions of stigma in the HSS-40 was tested. The psychometric evaluation included inspection of missing values, principal component analysis (PCA), internal consistency, and correlations with measures of health-related quality of life (HRQoL). Fifty-eight children, representing 71% of all children with HIV infection in Sweden meeting the inclusion criteria, completed the 12-item questionnaire. Four items concerning participants' experiences of others' reactions to their HIV had unacceptable rates of missing values and were therefore excluded. The remaining items constituted an 8-item scale, the HIV Stigma Scale for Children (HSSC-8), measuring HIV-related disclosure concerns, negative self-image, and concerns with public attitudes. Evidence for internal validity was supported by a PCA, suggesting a three factor solution with all items loading on the same subscales as in the original HSS-40. The scale demonstrated acceptable internal consistency, with exception for the disclosure concerns subscale. Evidence for external validity was supported in correlational analyses with measures of HRQoL, where higher levels of stigma correlated with poorer HRQoL. The results suggest feasibility, reliability, as well as internal and external validity of the HSSC-8, an HIV stigma scale for children with HIV infection, measuring disclosure concerns, negative self-image, and concerns with public attitudes. The present study shows that different aspects of HIV stigma can be assessed among children with HIV in the age group 8-18.
Insomnia severity is an indicator of suicidal ideation during a depression clinical trial.
McCall, W Vaughn; Blocker, Jill N; D'Agostino, Ralph; Kimball, James; Boggs, Niki; Lasater, Barbara; Rosenquist, Peter B
2010-10-01
Insomnia has been linked to suicidal ideas and suicide death in cross-sectional and longitudinal population-based studies. A link between insomnia and suicide has not been previously examined in the setting of a clinical trial. Herein we describe the relationship between insomnia and suicidal thinking during the course of a clinical trial for depression with insomnia. Sixty patients aged 41.5±12.5 years (2/3 women) with major depressive episode and symptoms of insomnia received open-label fluoxetine for 9 weeks and also received blinded, randomized eszopiclone 3mg or placebo at bedtime after the first week of fluoxetine. Insomnia symptoms were assessed with the Insomnia Severity Index (ISI), and suicidal ideation was assessed with The Scale for Suicide Ideation (SSI). Depression symptoms were assessed with the depressed mood item and the anhedonia item from the Hamilton Rating Scale for Depression-24 (HRSD24), as well as a sum score for all non-sleep and non-suicide items from the HRSD (HRSD20). Measurements were taken at baseline and weeks 1, 2, 4, 6, and 8. SSI was examined by generalized linear mixed models for repeated measures as the outcome of interest for all 60 participants with ISI and various mood symptoms as independent variables, with adjustment for age, gender, treatment assignment, and baseline SSI. Higher levels of insomnia corresponded to significantly greater intensity of suicidal thinking (p<0.01). The depressed mood item of the HRSD, and the sum of the HRSD20, both corresponded to greater suicidal thinking (p<0.001). The anhedonia item did not correspond with suicidal thinking. When both ISI and the depressed mood item, or ISI and the anhedonia item, were included together in the same model, the ISI remained an independent predictor of suicidal thinking. The results support the concept that insomnia may be a useful indicator for suicidal ideation and now extend this idea into clinical trials. Insomnia remains an independent indicator of suicidal ideation, even taking into account the core symptoms of depression such as depressed mood and anhedonia. The complaint of insomnia during a depression clinical trial might indicate that more direct questioning about suicide is warranted.
A Xhosa language translation of the CORE-OM using South African university student samples.
Campbell, Megan M; Young, Charles
2016-10-01
The translation of well established psychometric tools from English into Xhosa may assist in improving access to psychological services for Xhosa speakers. The aim of this study was to translate the Clinical Outcomes in Routine Evaluation - Outcome Measure (CORE-OM), a measure of general distress and dysfunction developed in the UK, into Xhosa for use at South African university student counselling centres. The CORE-OM and embedded CORE-10 were translated into Xhosa using a five-stage translation design. This design included (a) forward-translation, (b) back-translation, (c) committee approach, (d) qualitative piloting, and (e) quantitative piloting on South African university students. Clinical and general samples were drawn from English-medium South African universities. Clinical samples were generated from university student counselling centres. General student samples were generated through random stratified cluster sampling of full-time university students. Qualitative feedback from the translation process and results from quantitative piloting of the 34-item CORE-OM English and Xhosa versions supported the reduction of the scale to 10 items. This reduced scale is referred to as the South African CORE-10 (SA CORE-10). A measurement and structural model of the SA CORE-10 English version was developed and cross-validated using an English-speaking university student sample. Equivalence of this model with the SA CORE-10 Xhosa version was investigated using a first-language Xhosa-speaking university sample. Partial measurement equivalence was achieved at the metric level. The resultant SA CORE-10 Xhosa and English versions provide core measures of distress and dysfunction. Additional, culture- and language-specific domains could be added to increase sensitivity and specificity. © The Author(s) 2016.
Leff, Stephen S; Baum, Katherine T; Bevans, Katherine B; Blum, Nathan J
2015-02-01
To describe the development and psychometric evaluation of the Core Competency Measure (CCM), an instrument designed to assess professional competencies as defined by the Maternal Child Health Bureau (MCHB) and targeted by Leadership Education in Neurodevelopmental and Related Disabilities (LEND) programs. The CCM is a 44-item self-report measure comprised of six subscales to assess clinical, interdisciplinary, family-centered/cultural, community, research, and advocacy/policy competencies. The CCM was developed in an iterative fashion through participatory action research, and then nine cohorts of LEND trainees (N = 144) from 14 different disciplines completed the CCM during the first week of the training program. A 6-factor confirmatory factor analysis model was fit to data from the 44 original items. After three items were removed, the model adequately fit the data (comparative fit indices = .93, root mean error of approximation = .06) with all factor loadings exceeding .55. The measure was determined to be quite reliable as adequate internal consistency and test-retest reliability were found for each subscale. The instrument's construct validity was supported by expected differences in self-rated competencies among fellows representing various disciplines, and the convergent validity was supported by the pattern of inter-correlations between subscale scores. The CCM appears to be a reliable and valid measure of MCHB core competencies for our sample of LEND trainees. It provides an assessment of key training areas addressed by the LEND program. Although the measure was developed within only one LEND Program, with additional research it has the potential to serve as a standardized tool to evaluate the strengths and limitations of MCHB training, both within and between programs.
Core Outcome Set–STAndards for Reporting: The COS-STAR Statement
Kirkham, Jamie J.; Gorst, Sarah; Altman, Douglas G.; Blazeby, Jane M.; Clarke, Mike; Devane, Declan; Moher, David; Schmitt, Jochen; Tugwell, Peter; Tunis, Sean; Williamson, Paula R.
2016-01-01
Background Core outcome sets (COS) can enhance the relevance of research by ensuring that outcomes of importance to health service users and other people making choices about health care in a particular topic area are measured routinely. Over 200 COS to date have been developed, but the clarity of these reports is suboptimal. COS studies will not achieve their goal if reports of COS are not complete and transparent. Methods and Findings In recognition of these issues, an international group that included experienced COS developers, methodologists, journal editors, potential users of COS (clinical trialists, systematic reviewers, and clinical guideline developers), and patient representatives developed the Core Outcome Set–STAndards for Reporting (COS-STAR) Statement as a reporting guideline for COS studies. The developmental process consisted of an initial reporting item generation stage and a two-round Delphi survey involving nearly 200 participants representing key stakeholder groups, followed by a consensus meeting. The COS-STAR Statement consists of a checklist of 18 items considered essential for transparent and complete reporting in all COS studies. The checklist items focus on the introduction, methods, results, and discussion section of a manuscript describing the development of a particular COS. A limitation of the COS-STAR Statement is that it was developed without representative views of low- and middle-income countries. COS have equal relevance to studies conducted in these areas, and, subsequently, this guideline may need to evolve over time to encompass any additional challenges from developing COS in these areas. Conclusions With many ongoing COS studies underway, the COS-STAR Statement should be a helpful resource to improve the reporting of COS studies for the benefit of all COS users. PMID:27755541
Core Outcome Set-STAndards for Development: The COS-STAD recommendations.
Kirkham, Jamie J; Davis, Katherine; Altman, Douglas G; Blazeby, Jane M; Clarke, Mike; Tunis, Sean; Williamson, Paula R
2017-11-01
The use of core outcome sets (COS) ensures that researchers measure and report those outcomes that are most likely to be relevant to users of their research. Several hundred COS projects have been systematically identified to date, but there has been no formal quality assessment of these studies. The Core Outcome Set-STAndards for Development (COS-STAD) project aimed to identify minimum standards for the design of a COS study agreed upon by an international group, while other specific guidance exists for the final reporting of COS development studies (Core Outcome Set-STAndards for Reporting [COS-STAR]). An international group of experienced COS developers, methodologists, journal editors, potential users of COS (clinical trialists, systematic reviewers, and clinical guideline developers), and patient representatives produced the COS-STAD recommendations to help improve the quality of COS development and support the assessment of whether a COS had been developed using a reasonable approach. An open survey of experts generated an initial list of items, which was refined by a 2-round Delphi survey involving nearly 250 participants representing key stakeholder groups. Participants assigned importance ratings for each item using a 1-9 scale. Consensus that an item should be included in the set of minimum standards was defined as at least 70% of the voting participants from each stakeholder group providing a score between 7 and 9. The Delphi survey was followed by a consensus discussion with the study management group representing multiple stakeholder groups. COS-STAD contains 11 minimum standards that are the minimum design recommendations for all COS development projects. The recommendations focus on 3 key domains: the scope, the stakeholders, and the consensus process. The COS-STAD project has established 11 minimum standards to be followed by COS developers when planning their projects and by users when deciding whether a COS has been developed using reasonable methods.
ERIC Educational Resources Information Center
Jimenez, Bree A.; Staples, Kelli
2015-01-01
This study investigated the effect of systematic early numeracy skill instruction on grade-aligned 4th and 5th grade Common Core math skill acquisition for three 4th and 5th grade students with a significant intellectual disability. Students were taught early numeracy skills (e.g., number identification, making sets to five items, simple addition)…
Yang, Fang Yu; Zhao, Rong Rong; Liu, Yi Si; Wu, Ying; Jin, Ning Ning; Li, Rui Ying; Shi, Shu Ping; Shao, Yue Ying; Guo, Ming; Arthur, David; Elliott, Malcolm
2013-12-01
A review of the literature showed that the core competencies needed by newly graduated Chinese nurses were not as of yet undocumented. To develop a psychometrically sound instrument for identifying and measuring the core competencies needed by Chinese nursing baccalaureate graduates. Descriptive correlational and multicentre study. Seven major tertiary teaching hospitals and three major medical universities in Beijing. 790 subjects, including patients, nursing faculty members, doctors and nurses. A reliable and valid self-report instrument, consisting of 58 items, was developed using multiple methods. It was then distributed to 790 subjects to measure nursing competency in a broader Chinese context. The psychometric characteristics of reliability and validity were supported by descriptive and inferential analyses. The final instrument consists of six dimensions with 47 items. The content validity index was 0.90. The overall scale reliability was 0.97 with dimensions range from 0.87 to 0.94. Six domains of core competencies were identified: professionalism; direct care; support and communication; application of professional knowledge; personal traits; and critical thinking and innovation. The findings of this study provide valuable evidence for a psychometrically sound measurement tool, as well as for competency-based nursing curriculum reform. Copyright © 2013 Elsevier Ltd. All rights reserved.
Flens, Gerard; Smits, Niels; Terwee, Caroline B; Dekker, Joost; Huijbrechts, Irma; de Beurs, Edwin
2017-03-01
We developed a Dutch-Flemish version of the patient-reported outcomes measurement information system (PROMIS) adult V1.0 item bank for depression as input for computerized adaptive testing (CAT). As item bank, we used the Dutch-Flemish translation of the original PROMIS item bank (28 items) and additionally translated 28 U.S. depression items that failed to make the final U.S. item bank. Through psychometric analysis of a combined clinical and general population sample ( N = 2,010), 8 added items were removed. With the final item bank, we performed several CAT simulations to assess the efficiency of the extended (48 items) and the original item bank (28 items), using various stopping rules. Both item banks resulted in highly efficient and precise measurement of depression and showed high similarity between the CAT simulation scores and the full item bank scores. We discuss the implications of using each item bank and stopping rule for further CAT development.
10 CFR Appendix I to Part 1050 - DOE Form 3735.2-Foreign Gifts Statement
Code of Federal Regulations, 2010 CFR
2010-01-01
... should always be indicated in item 1; if the employee is the recipient of the gift then items 5 and 6... information should be included in items 5 and 6. Item 2.Self explanatory. Items 3 and 4.The Office or Division... employee or a spouse or dependent. Items 5 and 6.See above, Item 1. Item 7.Self explanatory. Item 8.Self...
Provost, Mélanie; Koompalum, Dayin; Dong, Diane; Martin, Bradley C
2006-01-01
To develop a comprehensive instrument assessing quality of health-related web sites. Phase I consisted of a literature review to identify constructs thought to indicate web site quality and to identify items. During content analysis, duplicate items were eliminated and items that were not clear, meaningful, or measurable were reworded or removed. Some items were generated by the authors. Phase II: a panel consisting of six healthcare and MIS reviewers was convened to assess each item for its relevance and importance to the construct and to assess item clarity and measurement feasibility. Three hundred and eighty-four items were generated from 26 sources. The initial content analysis reduced the scale to 104 items. Four of the six expert reviewers responded; high concordance on the relevance, importance and measurement feasibility of each item was observed: 3 out of 4, or all raters agreed on 76-85% of items. Based on the panel ratings, 9 items were removed, 3 added, and 10 revised. The WebMedQual consists of 8 categories, 8 sub-categories, 95 items and 3 supplemental items to assess web site quality. The constructs are: content (19 items), authority of source (18 items), design (19 items), accessibility and availability (6 items), links (4 items), user support (9 items), confidentiality and privacy (17 items), e-commerce (6 items). The "WebMedQual" represents a first step toward a comprehensive and standard quality assessment of health web sites. This scale will allow relatively easy assessment of quality with possible numeric scoring.
Questionnaire of core beliefs related to drug use and craving for assessment of relapse risk.
Martínez-González, José Miguel; Vilar López, Raquel; Lozano-Rojas, Oscar; Verdejo-García, Antonio
2017-07-12
This study was aimed at designing a questionnaire for the assessment of addiction-related core beliefs and craving. The sample comprised 215 patients (85.8% males and 14.2% females) in treatment for dependence to alcohol (40%), cocaine (36.3%) and cannabis (23.7%). Descriptive statistics were used to characterize the sample. Variance, regression and factorial analyses were conducted to study the questionnaire structure and its relation with variables such as abstinence and craving. Items about drug-related beliefs yielded a four-factor structure: what patient think that they could not do without drug use, lack of withdrawal, conditions required to use drugs again, and use of drugs as the only way to feel good. Items related to craving yielded three factors: negative emotions as precipitants of drug use, positive emotions, and difficulties attributed to coping with craving. Furthermore, beliefs were more important to predict craving than abstinence time. The present questionnaire allows to assess a set of significant factors to design prevention relapse programs.
Computerized Adaptive Testing with Item Clones. Research Report.
ERIC Educational Resources Information Center
Glas, Cees A. W.; van der Linden, Wim J.
To reduce the cost of item writing and to enhance the flexibility of item presentation, items can be generated by item-cloning techniques. An important consequence of cloning is that it may cause variability on the item parameters. Therefore, a multilevel item response model is presented in which it is assumed that the item parameters of a…
A Study of the Homogeneity of Items Produced From Item Forms Across Different Taxonomic Levels.
ERIC Educational Resources Information Center
Weber, Margaret B.; Argo, Jana K.
This study determined whether item forms ( rules for constructing items related to a domain or set of tasks) would enable naive item writers to generate multiple-choice items at three taxonomic levels--knowledge, comprehension, and application. Students wrote 120 multiple-choice items from 20 item forms, corresponding to educational objectives…
Code of Federal Regulations, 2013 CFR
2013-07-01
... Munitions List Items (MLIs)/Commerce Control List Items (CCLIs) requiring demilitarization? 102-36.435... Personal Property Whose Disposal Requires Special Handling Munitions List Items/commerce Control List Items (mlis/cclis) § 102-36.435 How do we identify Munitions List Items (MLIs)/Commerce Control List Items...
Code of Federal Regulations, 2012 CFR
2012-01-01
... Munitions List Items (MLIs)/Commerce Control List Items (CCLIs) requiring demilitarization? 102-36.435... Personal Property Whose Disposal Requires Special Handling Munitions List Items/commerce Control List Items (mlis/cclis) § 102-36.435 How do we identify Munitions List Items (MLIs)/Commerce Control List Items...
Code of Federal Regulations, 2011 CFR
2011-01-01
... Munitions List Items (MLIs)/Commerce Control List Items (CCLIs) requiring demilitarization? 102-36.435... Personal Property Whose Disposal Requires Special Handling Munitions List Items/commerce Control List Items (mlis/cclis) § 102-36.435 How do we identify Munitions List Items (MLIs)/Commerce Control List Items...
Code of Federal Regulations, 2014 CFR
2014-01-01
... Munitions List Items (MLIs)/Commerce Control List Items (CCLIs) requiring demilitarization? 102-36.435... Personal Property Whose Disposal Requires Special Handling Munitions List Items/commerce Control List Items (mlis/cclis) § 102-36.435 How do we identify Munitions List Items (MLIs)/Commerce Control List Items...
Code of Federal Regulations, 2010 CFR
2010-07-01
... Munitions List Items (MLIs)/Commerce Control List Items (CCLIs) requiring demilitarization? 102-36.435... Personal Property Whose Disposal Requires Special Handling Munitions List Items/commerce Control List Items (mlis/cclis) § 102-36.435 How do we identify Munitions List Items (MLIs)/Commerce Control List Items...
Code of Federal Regulations, 2013 CFR
2013-07-01
... Munitions List Items (MLIs)/Commerce Control List Items (CCLIs)? 102-36.430 Section 102-36.430 Public... Disposal Requires Special Handling Munitions List Items/commerce Control List Items (mlis/cclis) § 102-36.430 May we dispose of excess Munitions List Items (MLIs)/Commerce Control List Items (CCLIs)? You may...
Code of Federal Regulations, 2012 CFR
2012-01-01
... Munitions List Items (MLIs)/Commerce Control List Items (CCLIs)? 102-36.430 Section 102-36.430 Public... Disposal Requires Special Handling Munitions List Items/commerce Control List Items (mlis/cclis) § 102-36.430 May we dispose of excess Munitions List Items (MLIs)/Commerce Control List Items (CCLIs)? You may...
Code of Federal Regulations, 2014 CFR
2014-01-01
... Munitions List Items (MLIs)/Commerce Control List Items (CCLIs)? 102-36.430 Section 102-36.430 Public... Disposal Requires Special Handling Munitions List Items/commerce Control List Items (mlis/cclis) § 102-36.430 May we dispose of excess Munitions List Items (MLIs)/Commerce Control List Items (CCLIs)? You may...
Code of Federal Regulations, 2011 CFR
2011-01-01
... Munitions List Items (MLIs)/Commerce Control List Items (CCLIs)? 102-36.430 Section 102-36.430 Public... Disposal Requires Special Handling Munitions List Items/commerce Control List Items (mlis/cclis) § 102-36.430 May we dispose of excess Munitions List Items (MLIs)/Commerce Control List Items (CCLIs)? You may...
Item Writer Judgments of Item Difficulty versus Actual Item Difficulty: A Case Study
ERIC Educational Resources Information Center
Sydorenko, Tetyana
2011-01-01
This study investigates how accurate one item writer can be on item difficulty estimates and whether factors affecting item writer judgments correspond to predictors of actual item difficulty. The items were based on conversational dialogs (presented as videos online) that focus on pragmatic functions. Thirty-five 2nd-, 3rd-, and 4th-year learners…
Code of Federal Regulations, 2010 CFR
2010-07-01
... Munitions List Items (MLIs)/Commerce Control List Items (CCLIs)? 102-36.430 Section 102-36.430 Public... Disposal Requires Special Handling Munitions List Items/commerce Control List Items (mlis/cclis) § 102-36.430 May we dispose of excess Munitions List Items (MLIs)/Commerce Control List Items (CCLIs)? You may...
Paz, Sylvia H; Spritzer, Karen L; Reise, Steven P; Hays, Ron D
2017-06-01
About 70% of Latinos, 5 years old or older, in the United States speak Spanish at home. Measurement equivalence of the PROMIS ® pain interference (PI) item bank by language of administration (English versus Spanish) has not been evaluated. A sample of 527 adult Spanish-speaking Latinos completed the Spanish version of the 41-item PROMIS ® pain interference item bank. We evaluate dimensionality, monotonicity and local independence of the Spanish-language items. Then we evaluate differential item functioning (DIF) using ordinal logistic regression with item response theory scores estimated from DIF-free "anchor" items. One of the 41 items in the Spanish version of the PROMIS ® PI item bank was identified as having significant uniform DIF. English- and Spanish-speaking subjects with the same level of pain interference responded differently to 1 of the 41 items in the PROMIS ® PI item bank. This item was not retained due to proprietary issues. The original English language item parameters can be used when estimating PROMIS ® PI scores.
A Time and Place for Everything: Developmental Differences in the Building Blocks of Episodic Memory
Lee, Joshua K.; Wendelken, J. Carter; Bunge, Silvia A.; Ghetti, Simona
2015-01-01
This research investigated whether episodic memory development can be explained by improvements in relational binding processes, involved in forming novel associations between events and the context in which they occurred. Memory for item-space, item-time, and item-item relations was assessed in an ethnically diverse sample of 151 children aged 7 to 11 years and 28 young adults. Item-space memory reached adult performance by 9½ years, whereas item-time and item-item memory improved into adulthood. In path analysis, item-space, but not item-time best explained item-item memory. Across age groups, relational binding related to source memory and performance on standardized memory assessments. In conclusion, relational binding development depends on relation type, but relational binding overall supports episodic memory development. PMID:26493950
Chandler, L S; Terhorst, L; Rogers, J C; Holm, M B
2016-07-01
The purpose of this study was to establish the validity, reliability, stability and sensitivity to change of the family-centred Movement Assessment of Children (MAC) in typically developing infants/toddlers from 2 months (1 month 16 days) to 2 years (24 months 15 days) of age. Assessment of infant/toddler motor development is critical so that infants and toddlers who are at-risk for developmental delay or whose functional motor development is delayed can be monitored and receive therapy to improve their developmental outcomes. Infants/toddlers are thought to be more responsive during the MAC assessment because parents and siblings participate and elicit responses. Two hundred seventy six children and 405 assessments contributed to the establishment of age-related parameters for typically developing infants and toddlers on the MAC. The MAC assesses three core domains of functional movement (head control, upper extremities and hands, pelvis and lower extremities), and generates a core total score. Four explanatory domains serve to alert examiners to factors that may impact atypical development (general observations, special senses, primitive reflexes/reactions, muscle tone). Construct validity of functional motor development was examined using the relationship between incremental increases in scores and increases in participants' ages. Subsamples were used to establish inter-rater reliability, test-retest reliability, stability and sensitivity to change. Construct validity was established and inter-rater reliability ICCs for the core items and core total ranged from 0.83 to 0.99. Percent agreement for the explanatory items ranged from 0.72 to 0.96. Stability within age grouping was consistent from baseline to 6 months post-baseline, and sensitivity to change from baseline to 6 months was significant for all core items and the total score. The MAC has proven to be a well-constructed assessment of infant and toddler functional motor development. It is a family-centred and efficient tool that can be used to assess and follow-up of infants and toddlers from 2 months to 2 years. © 2016 John Wiley & Sons Ltd.
Faulks, Denise; Norderyd, Johanna; Molina, Gustavo; Macgiolla Phadraig, Caoimhin; Scagnet, Gabriela; Eschevins, Caroline; Hennequin, Martine
2013-01-01
Children in dentistry are traditionally described in terms of medical diagnosis and prevalence of oral disease. This approach gives little information regarding a child's capacity to maintain oral health or regarding the social determinants of oral health. The biopsychosocial approach, embodied in the International Classification of Functioning, Disability and Health - Child and Youth version (ICF-CY) (WHO), provides a wider picture of a child's real-life experience, but practical tools for the application of this model are lacking. This article describes the preliminary empirical study necessary for development of such a tool - an ICF-CY Core Set for Oral Health. An ICF-CY questionnaire was used to identify the medical, functional, social and environmental context of 218 children and adolescents referred to special care or paediatric dental services in France, Sweden, Argentina and Ireland (mean age 8 years ± 3.6 yrs). International Classification of Disease (ICD-10) diagnoses included disorders of the nervous system (26.1%), Down syndrome (22.0%), mental retardation (17.0%), autistic disorders (16.1%), and dental anxiety alone (11.0%). The most frequently impaired items in the ICF Body functions domain were 'Intellectual functions', 'High-level cognitive functions', and 'Attention functions'. In the Activities and Participation domain, participation restriction was frequently reported for 25 items including 'Handling stress', 'Caring for body parts', 'Looking after one's health' and 'Speaking'. In the Environment domain, facilitating items included 'Support of friends', 'Attitude of friends' and 'Support of immediate family'. One item was reported as an environmental barrier - 'Societal attitudes'. The ICF-CY can be used to highlight common profiles of functioning, activities, participation and environment shared by children in relation to oral health, despite widely differing medical, social and geographical contexts. The results of this empirical study might be used to develop an ICF-CY Core Set for Oral Health - a holistic but practical tool for clinical and epidemiological use.
Selecting Items for Criterion-Referenced Tests.
ERIC Educational Resources Information Center
Mellenbergh, Gideon J.; van der Linden, Wim J.
1982-01-01
Three item selection methods for criterion-referenced tests are examined: the classical theory of item difficulty and item-test correlation; the latent trait theory of item characteristic curves; and a decision-theoretic approach for optimal item selection. Item contribution to the standardized expected utility of mastery testing is discussed. (CM)
A Mixed Effects Randomized Item Response Model
ERIC Educational Resources Information Center
Fox, J.-P.; Wyrick, Cheryl
2008-01-01
The randomized response technique ensures that individual item responses, denoted as true item responses, are randomized before observing them and so-called randomized item responses are observed. A relationship is specified between randomized item response data and true item response data. True item response data are modeled with a (non)linear…
ERIC Educational Resources Information Center
Swinford, Ashleigh
2016-01-01
With rigor outlined in state and Common Core standards and the addition of constructed-response test items to most state tests, math constructed-response questions have become increasingly popular in today's classroom. Although constructed-response problems can present a challenge for students, they do offer a glimpse of students' learning through…
What a Cognitivist Can Take from Discursive Research
ERIC Educational Resources Information Center
Brown, Margaret
2016-01-01
To an unreconstructed cognitivist with experience of developing General Certificate of Secondary Education (GCSE) examinations and national tests in mathematics, the three papers which provide the core of this special issue (Morgan & Sfard; Morgan, Morgan & Tang) provided a refreshing opportunity to consider examination items from a…
Correlates of a Single-Item Indicator Versus a Multi-Item Scale of Outness About Same-Sex Attraction
Noor, Syed W.; Galos, Dylan L.; Simon Rosser, B. R.
2017-01-01
In this study, we investigated if a single-item indicator measured the degree to which people were open about their same-sex attraction (“out”) as accurately as a multi-item scale. For the multi-item scale, we used the Outness Inventory, which includes three subscales: family, world, and religion. We examined correlations between the single- and multi-item measures; between the single-item indicator and the subscales of the multi-item scale; and between the measures and internalized homonegativity, social attitudes towards homosexuality, and depressive symptoms. In addition, we calculated Tjur’s R2 as a measure of predictive power of the single-item indicator, multi-item scale, and subscales of the multi-item scale in predicting two health-related outcomes: depressive symptoms and condomless anal sex with multiple partners. There was a strong correlation between the single- and multi-item measures (r = 0.73). Furthermore, there were strong correlations between the single-item indicator and each subscale of the multi-item scale: family (r = 0.70), world (r = 0.77), and religion (r = 0.50). In addition, the correlations between the single-item indicator and internalized homonegativity (r = −0.63), social attitudes towards homosexuality (r = −0.38), and depression (r = −0.14) were higher than those between the multi-item scale and internalized homonegativity (r = −0.55), social attitudes towards homosexuality (r = −0.21), and depression (r = −0.13). Contrary to the premise that multi-item measures are superior to single-item measures, our collective findings indicate that the single-item indicator of outness performs better than the multi-item scale of outness. PMID:26292840
Clinical pharmacology and therapeutics in undergraduate medical education in the UK: the future.
Walley, T; Bligh, J; Orme, M; Breckenridge, A
1994-01-01
1. Changes in undergraduate medical education will involve the development of a core curriculum of material of essential knowledge and of the skills for self directed learning both as a student and a postgraduate. A survey of departments or individuals teaching clinical pharmacology and therapeutics was conducted to consider what a core curriculum in these subjects might contain and how changes in the school curriculum would affect teaching in the future. 2. A questionnaire was developed based on an American consensus statement on the core curriculum in clinical pharmacology and therapeutics. Freetext answers were encouraged. Twenty-seven medical schools were surveyed; 21 (78%) replied. 3. Items of core knowledge (as defined by the American statement) were generally rated important or very important. The most important were considered to be (in order): prescribing for the elderly, management of overdose and adverse drug reactions. All of these were widely taught (85-100%). The least important items were the efficacy and toxicity of nonprescription drugs (taught by 35%) and the process of drug development and approval (taught nevertheless by 95%). 4. Core skills were generally rated less important, and less often taught. It was felt by many respondents that these skills, as defined, were excessively detailed for British undergraduates and more appropriate for postgraduate education. 5. Core attitudes were rated as being of intermediate importance, but not widely taught as it was felt that these could best be inculcated by example rather than formal teaching. Again, many felt that these attitudes were inappropriate for a UK core curriculum.(ABSTRACT TRUNCATED AT 250 WORDS) PMID:8186060
Quality of Life in Patients With Brain Metastases Using the EORTC QLQ-BN20+2 and QLQ-C15-PAL
DOE Office of Scientific and Technical Information (OSTI.GOV)
Caissie, Amanda; Nguyen, Janet; Chen, Emily
Purpose: The 20-item European Organisation for Research and Treatment of Cancer Quality of Life Questionnaire-Brain Neoplasm (QLQ-BN20) is a validated quality-of-life (QOL) questionnaire for patients with primary brain tumors. The European Organisation for Research and Treatment of Cancer Quality of Life Questionnaire-Core 15 Palliative (QLQ-C15-PAL) core palliative questionnaire is a 15-item version of the core 30-item QLQ-C30 and was developed to decrease the burden on patients with advanced cancer. The combination of the QLQ-BN20 and QLQ-C30 to assess QOL may be too burdensome for patients. The primary aim of this study was to assess QOL in patients before and aftermore » treatment for brain metastases using the QLQ-BN20+2 and QLQ-C15-PAL, a version of the QLQ-BN20 questionnaire with 2 additional questions assessing cognitive functioning that were not addressed in the QLQ-C15-PAL. Methods and Materials: Patients with brain metastases completed the QLQ-C15-PAL and QLQ-BN20+2 questionnaires to assess QOL before and 1 month after radiation. Linear regression analysis was used to assess changes in QOL scores over time, as well as to explore associations between the QLQ-BN20+2 and QLQ-C15-PAL scales, patient demographics, and clinical variables. Spearman correlation assessed associations between the QLQ-BN20+2 and QLQ-C15-PAL scales. Results: Among 108 patients, the majority (55%) received whole-brain radiotherapy only, with 65% of patients completing follow-up at 1 month after treatment. The most prominent symptoms at baseline were future uncertainty (QLQ-BN20+2) and fatigue (QLQ-C15-PAL). After treatment, significant improvement was seen for the QLQ-C15-PAL insomnia scale, as well as the QLQ-BN20+2 scales of future uncertainty, visual disorder, and concentration difficulty. Baseline Karnofsky Performance Status was negatively correlated to QLQ-BN20+2 motor dysfunction but positively related to QLQ-C15-PAL physical functioning and QLQ-BN20+2 cognitive functioning at baseline and follow-up. QLQ-BN20+2 scales of future uncertainty and motor dysfunction correlated with the most QLQ-C15-PAL scales, including overall QOL (negative association) at baseline and follow-up. Conclusion: After radiation, the questionnaires showed maintenance of QOL and improvement of QOL scores such as future uncertainty, which featured prominently in this patient population. It is proposed that the 37-item QLQ-BN20+2 and QLQ-C15-PAL, as opposed to the 50-item QLQ-BN20 and QLQ-C30, may be used together as a universal QOL assessment tool in this setting.« less
41 CFR 101-30.701-1 - Item reduction study.
Code of Federal Regulations, 2011 CFR
2011-07-01
... 41 Public Contracts and Property Management 2 2011-07-01 2007-07-01 true Item reduction study. 101....7-Item Reduction Program § 101-30.701-1 Item reduction study. Item reduction study means the study... so identified, a replacement item shall be proposed. The result of item reduction studies will...
41 CFR 101-30.701-1 - Item reduction study.
Code of Federal Regulations, 2010 CFR
2010-07-01
... 41 Public Contracts and Property Management 2 2010-07-01 2010-07-01 true Item reduction study. 101....7-Item Reduction Program § 101-30.701-1 Item reduction study. Item reduction study means the study... so identified, a replacement item shall be proposed. The result of item reduction studies will...
Detection of Differential Item Functioning Using the Lasso Approach
ERIC Educational Resources Information Center
Magis, David; Tuerlinckx, Francis; De Boeck, Paul
2015-01-01
This article proposes a novel approach to detect differential item functioning (DIF) among dichotomously scored items. Unlike standard DIF methods that perform an item-by-item analysis, we propose the "LR lasso DIF method": logistic regression (LR) model is formulated for all item responses. The model contains item-specific intercepts,…
Li, Zhandong; Shi, Qiuling; Liu, Meng; Jia, Liqun; He, Bin; Yang, Yufei; Liu, Jie; Lin, Hongsheng; Lin, Huei-Kai; Li, Pingping; Wang, Xin Shelley
2017-11-01
The MD Anderson Symptom Inventory (MDASI) is a brief, yet thorough, patient-reported outcomes measure for assessing the severity of common cancer-related symptoms and their interference with daily functioning. We report the development of an MDASI version tailored for use with Traditional Chinese Medicine in China (the MDASI-TCM). Chinese-speaking patients with mixed cancer types (n = 317) participated in the study. The development and validation process included four steps: 1) identify candidate TCM-specific items, with input from patients, oncologists, and TCM specialists; 2) eliminate candidate TCM items lacking relevance, based on patient report; 3) psychometrically examine the MDASI-TCM's validity and reliability in cancer patients receiving TCM-based care; and 4) cognitively debrief patients to assess the MDASI-TCM's relevance, understandability, and acceptability. Seven TCM-specific symptom items (sweating, feeling cold, constipation, bitter taste, coughing, palpitations, and heat in palms/soles) were clinically and psychometrically meaningful to add to the core MDASI. Approximately 61% of patients had moderate to severe symptoms (rated ≥5 on the MDASI-TCM's 0-10 scale). Cronbach α coefficients were .90 for symptom-severity items and .93 for interference items, indicating internal consistency reliability. Known-group validity was substantiated by the MDASI-TCM's detection of differences in symptom severity according to performance status (P < .001) and interference levels by cancer stage (P < .05). Cognitive debriefing indicated that patients found the MDASI-TCM to be an understandable, easy-to-use tool. The Chinese MDASI-TCM is a valid, reliable, and concise measure of symptom severity and interference that can be used to assess Chinese cancer patients and survivors receiving TCM-based care. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Climate of Respect Evaluation in ICUs: Development of an Instrument (ICU-CORE).
Beach, Mary Catherine; Topazian, Rachel; Chan, Kitty S; Sugarman, Jeremy; Geller, Gail
2018-06-01
To develop a valid, reliable measure that reflected the environment of respectfulness within the ICU setting. We developed a preliminary survey instrument based on conceptual domains of respect identified through prior qualitative analyses of ICU patient, family member, and clinician perspectives. The initial instrument consisted of 21 items. After five cognitive interviews and 16 pilot surveys, we revised the instrument to include 23 items. We used standard psychometric methods to analyze the instrument. Eight ICUs serving adult patients affiliated with a large university health system. ICU clinicians. None. Based on 249 responses, we identified three factors and created subscales: General Respect, Respectful Behaviors, and Disrespectful Behaviors. The General Respect subscale had seven items (α = 0.932) and reflected how often patients in the ICU are treated with respect, in a dignified manner, as an individual, equally to all other patients, on the "same level" as the ICU team, as a person, and as you yourself would want to be treated. The Respectful Behaviors subscale had 10 items (α = 0.926) and reflected how often the ICU team responds to patient and/or family anxiety, makes an effort to get to know the patient and family as people, listens carefully, explains things thoroughly, gives the opportunity to provide input into care, protects patient modesty, greets when entering room, and talks to sedated patients. The subscale measuring disrespect has four items (α = 0.702) and reflects how often the ICU team dismisses family concerns, talks down to patients and families, speaks disrespectfully behind their backs, and gets frustrated with patients and families. We created a reliable set of scales to measure the climate of respectfulness in intensive care settings. These measures can be used for ongoing quality improvement that aim to enhance the experience of ICU patients and their families.
Saadatpour, Leila; Hemati, Simin; Habibi, Farzaneh; Behzadi, Erfan; Hashemi-Jazi, Marsa Sadat; Kheirabadi, Gholamreza; Mirbagher, Leila; Gholamrezaei, Ali
2015-09-01
Various symptoms frequently affect cancer patients' quality of life. Appropriate assessment of these symptoms provides valuable data for cancer management. This study aimed to validate the Persian version of the M. D. Anderson Symptom Inventory (MDASI-P). This cross-sectional study was conducted at four cancer treatment centers in two cities in Iran. Breast cancer and colorectal cancer patients aged 18 years and older were consecutively included in the study. The standard forward-backward translation method was applied. Patients completed the MDASI-P along with the previously validated Persian version of the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire-Core 30 (EORTC QLQ-C30). Construct validity (factor analysis), criterion validity (against the EORTC QLQ-C30), and reliability (Cronbach's alpha) were analyzed. A total of 146 breast cancer and 94 colorectal cancer patients were studied. Factor analysis for the symptom severity items resulted in a three-factor solution, further reduced to a two-factor solution: general symptoms and gastrointestinal symptoms. Correlation of the MDASI-P symptom severity items with corresponding EORTC QLQ-C30 symptom items (r = 0.48-0.75) and MDASI-P interference items with corresponding EORTC QLQ-C30 functioning domains (r = -0.46 to -0.23) supported the criterion validity. Cronbach's alpha was 0.90, 0.88, and 0.77 for the total questionnaire, symptom severity items, and the interference subscale, respectively. The MDASI-P is a feasible, valid, and reliable instrument for evaluation of symptoms in Persian-speaking cancer patients and can be used to improve symptom management in these patients. Copyright © 2015 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.
Harasym, Peter H; Woloschuk, Wayne; Cunning, Leslie
2008-12-01
Physician-patient communication is a clinical skill that can be learned and has a positive impact on patient satisfaction and health outcomes. A concerted effort at all medical schools is now directed at teaching and evaluating this core skill. Student communication skills are often assessed by an Objective Structure Clinical Examination (OSCE). However, it is unknown what sources of error variance are introduced into examinee communication scores by various OSCE components. This study primarily examined the effect different examiners had on the evaluation of students' communication skills assessed at the end of a family medicine clerkship rotation. The communication performance of clinical clerks from Classes 2005 and 2006 were assessed using six OSCE stations. Performance was rated at each station using the 28-item Calgary-Cambridge guide. Item Response Theory analysis using a Multifaceted Rasch model was used to partition the various sources of error variance and generate a "true" communication score where the effects of examiner, case, and items are removed. Variance and reliability of scores were as follows: communication scores (.20 and .87), examiner stringency/leniency (.86 and .91), case (.03 and .96), and item (.86 and .99), respectively. All facet scores were reliable (.87-.99). Examiner variance (.86) was more than four times the examinee variance (.20). About 11% of the clerks' outcome status shifted using "true" rather than observed/raw scores. There was large variability in examinee scores due to variation in examiner stringency/leniency behaviors that may impact pass-fail decisions. Exploring the benefits of examiner training and employing "true" scores generated using Item Response Theory analyses prior to making pass/fail decisions are recommended.
Østergaard, Søren D; Opler, Mark G A; Correll, Christoph U
2017-12-01
There is currently a "measurement gap" between research and clinical care in schizophrenia. The main reason behind this gap is that the most widely used rating scale in schizophrenia research, the 30-item Positive and Negative Syndrome Scale (PANSS), takes so long to administer that it is rarely used in clinical practice. This compromises the translation of research findings into clinical care and vice versa. The aim of this paper is to discuss how this measurement gap can be closed. Specifically, the main points of discussion are 1) the practical problems associated with using the full 30-item PANSS in clinical practice; 2) how the brief, six-item version of the Positive and Negative Syndrome Scale (PANSS-6) was derived empirically from the full 30-item PANSS and what the initial results obtained with PANSS-6 entail; and 3) how PANSS-6 ratings, guided by the newly developed, 15-25-minute, stand-alone Simplified Negative and Positive Symptoms Interview (SNAPSI), might help bridge the measurement gap between research and clinical care in schizophrenia. The full 30-item PANSS is often used in research studies, but is too time consuming to allow for routine clinical use. Recent studies suggest that the much briefer PANSS-6 is a psychometrically valid measure of core positive and negative symptoms of schizophrenia and that the scale is sensitive to symptom improvement following pharmacological treatment. SNAPSI is a brief interview that yields the information needed to rate PANSS-6 (and other brief rating scales). We believe that PANSS-6 ratings guided by SNAPSI will help bridge the measurement gap between research and clinical care in schizophrenia.
Ashley, Laura; Smith, Adam B; Keding, Ada; Jones, Helen; Velikova, Galina; Wright, Penny
2013-12-01
To provide new insights into the psychometrics of the revised Illness Perception Questionnaire (IPQ-R) in cancer patients. To undertake, for the first time using data from breast, colorectal and prostate cancer patients, a confirmatory factor analysis (CFA) to assess the validity of the IPQ-R's core seven-factor structure. Also, for the first time in any illness group, to undertake Rasch analysis to explore the extent to which the IPQ-R factors form unidimensional scales, with linear measurement properties and no Differential Item Functioning (DIF). Patients with potentially curable breast, colorectal or prostate cancer, within 6months post-diagnosis, completed the IPQ-R online (N=531). CFA was conducted, including multi-sample analysis, and for each IPQ-R factor fit to the Rasch model was assessed by examining, amongst other things, item fit, DIF and unidimensionality. The CFA showed a moderate fit of the data to the IPQ-R model, and stability across diagnosis, although fit was significantly improved following the removal of selected items. All seven factors achieved fit to the Rasch model, and exhibited unidimensionality and minimal DIF, although in most cases this was after some item rescoring and/or deletion. In both analyses, IPQ-R items 12, 18 and 24 were indicated as misfitting and removed. Given the rigorous standard of Rasch measurement, and the generic nature of the IPQ-R, it stood up well to the demands of the Rasch model in this study. Importantly, the results show that with some relatively minor, pragmatic modifications the IPQ-R could possess Rasch-standard measurement in cancer patients. © 2013.
Emergency department team communication with the patient: the patient's perspective.
McCarthy, Danielle M; Ellison, Emily P; Venkatesh, Arjun K; Engel, Kirsten G; Cameron, Kenzie A; Makoul, Gregory; Adams, James G
2013-08-01
Effective communication is important for the delivery of quality care. The Emergency Department (ED) environment poses significant challenges to effective communication. The objective of this study was to determine patients' perceptions of their ED team's communication skills. This was a cross-sectional study in an urban, academic ED. Patients completed the Communication Assessment Tool for Teams (CAT-T) survey upon ED exit. The CAT-T was adapted from the psychometrically validated Communication Assessment Tool (CAT) to measure patient perceptions of communication with a medical team. The 14 core CAT-T items are associated with a 5-point scale (5 = excellent); results are reported as the percent of participants who responded "excellent." Responses were analyzed for differences based on age, sex, race, and operational metrics (wait time, ED daily census). There were 346 patients identified; the final sample for analysis was 226 patients (53.5% female, 48.2% Caucasian), representing a response rate of 65.3%. The scores on CAT-T items (reported as % "excellent") ranged from 50.0% to 76.1%. The highest-scoring items were "let me talk without interruptions" (76.1%), "talked in terms I could understand" (75.2%), and "treated me with respect" (74.3%). The lowest-scoring item was "encouraged me to ask questions" (50.0%). No differences were noted based on patient sex, race, age, wait time, or daily census of the ED. The patients in this study perceived that the ED teams were respectful and allowed them to talk without interruptions; however, lower ratings were given for items related to actively engaging the patient in decision-making and asking questions. Copyright © 2013 Elsevier Inc. All rights reserved.
Hjermstad, Marianne Jensen; Bergenmar, Mia; Fisher, Sheila E; Montel, Sébastien; Nicolatou-Galitis, Ourania; Raber-Durlacher, Judith; Singer, Susanne; Verdonck-de Leeuw, Irma; Weis, Joachim; Yarom, Noam; Herlofson, Bente B
2012-09-01
Assessment of oral and dental problems is seldom routine in clinical oncology, despite the potential negative impact of these problems on nutritional status, social function and quality of life (QoL). The aim was to develop a supplementary module to the European Organisation for Research and Treatment of Cancer Core Questionnaire (EORTC QLQ-C30) focusing on oral health and related QoL issues in all cancer diagnoses. The module development followed the EORTC guidelines. Phases 1&2 were conducted in France, Germany, Greece, Netherlands, Norway and United Kingdom, while seven countries representing seven languages were included in Phase 3. Eighty-five QoL-items were identified from systematic literature searches. Semi-structured interviews with health-care professionals experienced in oncology and oral/dental care (n=18) and patients (n=133) resulted in a provisional module with 41 items. In phase 3 this was further tested in 178 European patients representing different phases of disease and treatment. Results from the interviews, clinical experiences and statistical analyses resulted in the EORTC QLQ-OH17. The module consists of 17 items conceptualised into four multi-item scales (pain/discomfort, xerostomia, eating, information) and three single items related to use of dentures and future worries. This study provides a useful tool intended for use in conjunction with the EORTC QLQ-C30 for assessment of oral and dental problems. The increased awareness may lead to proper interventions, thereby preventing more serious problems and negative impact on QoL. The reliability and validity, the cross-cultural applicability and the psychometric properties of the module will be tested in a larger international study. Copyright © 2012 Elsevier Ltd. All rights reserved.
Baumann, Sophie; Gaertner, Beate; Schnuerer, Inga; Bischof, Gallus; John, Ulrich; Freyer-Adam, Jennis
2013-09-01
Little is known about the applicability of the transtheoretical model of intentional behavior change (TTM) to individuals with unhealthy alcohol use that is primarily characterized by low readiness to change. This study examined the psychometric properties of short measures by assessing three core constructs of the TTM: the 20-item Processes of Change (POC-20) scale, and short versions of the Alcohol Decisional Balance Scale (ADBS) and the Alcohol Abstinence Self-Efficacy (AASE) scale. A sample of 427 individuals with unhealthy alcohol use (Mage = 30 years, 65% men), identified at job agencies in northeastern Germany, completed all three scales. Item difficulty (d), selectivity (rit), and Cronbach's alpha were calculated. Confirmatory factory analyses were used to test for construct validity and latent mean differences across the stages. The psychometric properties of the 8-item AASE were adequate (d range: 0.59-0.78; rit range: 0.59-0.68; α range: 0.74-0.81), except for one subscale. Most items of the POC-20 and the 10-item ADBS were difficult (dPOC range: 0.08-0.40; dADBS range: 0.21-0.58); selectivity (ritPOC range: 0.26-0.62; ritADBS range: 0.34-0.68) and internal consistency (αPOC range: 0.41-0.76; αADBS range: 0.64-0.78) were low to moderate. Construct validity was acceptable (Comparative Fit Index range: 0.95-0.99). The association between stages and TTM constructs partially followed expected patterns. Suggestions for modifications of TTM measures are discussed for better applicability among proactively recruited samples of individuals with unhealthy alcohol use and with primarily low readiness to change. (PsycINFO Database Record (c) 2013 APA, all rights reserved).
Use of a Video Scoring Anchor for Rapid Serial Assessment of Social Communication in Toddlers.
Marrus, Natasha; Kennon-McGill, Stefanie; Harris, Brooke; Zhang, Yi; Glowinski, Anne L; Constantino, John N
2018-03-14
Reciprocal social behavior (RSB), an early-emerging capacity to engage in social contingency-which is foundational for both social learning and social competency-is hypothesized to be disrupted in autism spectrum disorder (ASD). The ability to quantify the full range of RSB during the toddler period, when core symptoms of ASD often arise, is pivotal for evaluating early risk for ASD, characterizing social development, and tracking response to early interventions. However, important parameters of variation in RSB-especially prior to the development of verbal language-can be nuanced and difficult to characterize using questionnaire-based methods. To address this challenge, we developed a system for measuring quantitative variation in RSB in toddlers (ages 18 - 30 months) that incorporated not only standard questionnaire data from caregivers but also a novel set of video-referenced items, through which a respondent compares the behavior of a subject to that observed in a short video of a young child manifesting a highly competent level of social communication. Testing of this measure in a general population sample of twins confirmed that both the video-referenced items and the RSB Total Score (video-referenced items plus non-video-referenced items) displayed unimodal, continuous distributions, strong internal consistency, marked preservation of individual differences, and extremely high heritability. In addition, video-referenced items were particularly sensitive to quantifying incremental changes in social communication, a major element of RSB, over the course of early childhood development. Scores on the vrRSB clearly differentiated children with and without ASD and these data comprise an initial validation of this promising method for quantifying early RSB-cross-sectionally, over time, and as a function of early intervention.
Ohde, Sachiko; Deshpande, Gautam A; Takahashi, Osamu; Fukui, Tsuguya
2014-07-12
In Japan, all trainee physicians must begin clinical practice in a standardized, mandatory junior residency program, which encompasses the first two years of post-graduate medical training (PGY1 - PGY2). Implemented in 2004 to foster primary care skills, the comprehensive rotation program (CRP) requires junior residents to spend 14 months rotating through a comprehensive array of clinical departments including internal medicine, surgery, anesthesiology, obstetrics-gynecology (OBGYN), pediatrics, psychiatry, and rural medicine. In 2010, Japan's health ministry relaxed this curricular requirement, allowing training programs to offer a limited rotation program (LRP), in which core departments constitute 10 months of training, with electives geared towards residents' choice of career specialty comprising the remaining 14 months. The effectiveness of primary care skill acquisition during early training warrants evaluation. This study assesses self-reported confidence with clinical competencies, as well as case experience, between residents in CRP versus LRP curricula. A nation-wide cross-sectional study of all PGY2 physicians in Japan was conducted in March 2011. Primary outcomes were self-report confidence for 98 clinical competency items, and number of cases experienced for 85 common diseases. We compared confidence scores and case experience between residents in CRP and LRP programs, adjusting for parameters relevant to training. Among 7506 PGY2 residents, 5052 replied to the survey (67.3%). Of 98 clinical competency items, CRP residents reported higher confidence in 12 items compared to those in an LRP curriculum, 10 of which remained significantly higher after adjustment. CRP trainees reported lower confidence scores in none of the items. Out of 85 diseases, LRP residents reported less experience with 11 diseases. CRP trainees reported lower case experience with one disease, though this did not remain significant on adjusted analysis. Confidence and case experience with OBGYN- and pediatrics-related items were particularly low among LRP trainees. Residents in the specialty-oriented LRP curriculum showed less confidence and less case experience compared to peers training in the broader CRP residency curriculum. In order to foster competence in independent primary care practice, junior residency programs requiring experience in a breadth of core departments should continue to be mandated to ensure adequate primary care skills.
Item validity vs. item discrimination index: a redundancy?
NASA Astrophysics Data System (ADS)
Panjaitan, R. L.; Irawati, R.; Sujana, A.; Hanifah, N.; Djuanda, D.
2018-03-01
In several literatures about evaluation and test analysis, it is common to find that there are calculations of item validity as well as item discrimination index (D) with different formula for each. Meanwhile, other resources said that item discrimination index could be obtained by calculating the correlation between the testee’s score in a particular item and the testee’s score on the overall test, which is actually the same concept as item validity. Some research reports, especially undergraduate theses tend to include both item validity and item discrimination index in the instrument analysis. It seems that these concepts might overlap for both reflect the test quality on measuring the examinees’ ability. In this paper, examples of some results of data processing on item validity and item discrimination index were compared. It would be discussed whether item validity and item discrimination index can be represented by one of them only or it should be better to present both calculations for simple test analysis, especially in undergraduate theses where test analyses were included.
Refining the Measurement of Distress Intolerance
McHugh, R. Kathryn; Otto, Michael W.
2012-01-01
Distress intolerance is an important transdiagnostic variable that has long been implicated in the development and maintenance of psychological disorders. Self-report measurement strategies for distress intolerance have emerged from several different models of psychopathology and these measures have been applied inconsistently in the literature in the absence of a clear gold standard. The absence of a consistent assessment strategy has limited the ability to compare across studies and samples, thus hampering the advancement of this research agenda. This study evaluated the latent factor structure of existing measures of DI to examine the degree to which they are capturing the same construct. Results of confirmatory factor analysis in 3 samples totaling 400 participants provided support for a single factor latent structure. Individual items of these four scales were then correlated with this factor to identify those that best capture the core construct. Results provided consistent supported for 10 items that demonstrated the strongest concordance with this factor. The use of these 10 items as a unifying measure in the study of DI and future directions for the evaluation of its utility are discussed. PMID:22697451
ERIC Educational Resources Information Center
Li, Yanmei
2012-01-01
In a common-item (anchor) equating design, the common items should be evaluated for item parameter drift. Drifted items are often removed. For a test that contains mostly dichotomous items and only a small number of polytomous items, removing some drifted polytomous anchor items may result in anchor sets that no longer resemble mini-versions of…
ERIC Educational Resources Information Center
Thurman, Carol
2009-01-01
The increased use of polytomous item formats has led assessment developers to pay greater attention to the detection of differential item functioning (DIF) in these items. DIF occurs when an item performs differently for two contrasting groups of respondents (e.g., males versus females) after controlling for differences in the abilities of the…
Qualitative Development of the PROMIS® Pediatric Stress Response Item Banks
Gardner, William; Pajer, Kathleen; Riley, Anne W.; Forrest, Christopher B.
2013-01-01
Objective To describe the qualitative development of the Patient-Reported Outcome Measurement Information System (PROMIS®) Pediatric Stress Response item banks. Methods Stress response concepts were specified through a literature review and interviews with content experts, children, and parents. A library comprising 2,677 items derived from 71 instruments was developed. Items were classified into conceptual categories; new items were written and redundant items were removed. Items were then revised based on cognitive interviews (n = 39 children), readability analyses, and translatability reviews. Results 2 pediatric Stress Response sub-domains were identified: somatic experiences (43 items) and psychological experiences (64 items). Final item pools cover the full range of children’s stress experiences. Items are comprehensible among children aged ≥8 years and ready for translation. Conclusions Child- and parent-report versions of the item banks assess children’s somatic and psychological states when demands tax their adaptive capabilities. PMID:23124904
Lynch, Andrew D; Dodds, Nathan E; Yu, Lan; Pilkonis, Paul A; Irrgang, James J
2016-05-11
The content and wording of the Patient Reported Outcome Measurement Information System (PROMIS) Physical Function and Pain Interference item banks have not been qualitatively assessed by individuals with knee joint impairments. The purpose of this investigation was to identify items in the PROMIS Physical Function and Pain Interference Item Banks that are irrelevant, unclear, or otherwise difficult to respond to for individuals with impairment of the knee and to suggest modifications based on cognitive interviews. Twenty-nine individuals with knee joint impairments qualitatively assessed items in the Pain Interference and Physical Function Item Banks in a mixed-methods cognitive interview. Field notes were analyzed to identify themes and frequency counts were calculated to identify items not relevant to individuals with knee joint impairments. Issues with clarity were identified in 23 items in the Physical Function Item Bank, resulting in the creation of 43 new or modified items, typically changing words within the item to be clearer. Interpretation issues included whether or not the knee joint played a significant role in overall health and age/gender differences in items. One quarter of the original items (31 of 124) in the Physical Function Item Bank were identified as irrelevant to the knee joint. All 41 items in the Pain Interference Item Bank were identified as clear, although individuals without significant pain substituted other symptoms which interfered with their life. The Physical Function Item Bank would benefit from additional items that are relevant to individuals with knee joint impairments and, by extension, to other lower extremity impairments. Several issues in clarity were identified that are likely to be present in other patient cohorts as well.
Identifying content for the glaucoma-specific item bank to measure quality-of-life parameters.
Khadka, Jyoti; McAlinden, Colm; Craig, Jamie E; Fenwick, Eva K; Lamoureux, Ecosse L; Pesudovs, Konrad
2015-01-01
Patient-reported outcomes (PROs) have become essential clinical trial end points. However, a comprehensive, multidimensional, patient-relevant, and precise glaucoma-specific PRO instrument is not available. Therefore, the purpose of this study was to identify content for a new, glaucoma-specific, quality-of-life (QOL) item bank. Content identification was undertaken in 5 phases: (1) identification of extant items in glaucoma-specific instruments and the qualitative literature; (2) focus groups and interviews with glaucoma patients; (3) item classification and selection; (4) expert review and revision of items; and (5) cognitive interviews with patients. A total of 737 unique items (extant items from PRO instruments, 247; qualitative articles, 14 items; focus groups and semistructured interviews, 476 items) were identified. These items were classified into 10 QOL domains. Four criteria (item redundancy, item inconsistent with domain definition, item content too narrow to have wider applicability, and item clarity) were used to remove and refine the items. After the cognitive interviews, the final minimally representative item set had a total of 342 unique items belonging to 10 domains: activity limitation (88), mobility (20), visual symptoms (19), ocular surface symptoms (22), general symptoms (15), convenience (39), health concerns (45), emotional well-being (49), social issues (23), and economic issues (22). The systematic content identification process identified 10 QOL domains, which were important to patients with glaucoma. The majority of the items were identified from the patient-specific focus groups and semistructured interviews suggesting that the existing PRO instruments do not adequately address QOL issues relevant to individuals with glaucoma.
Developing and investigating the use of single-item measures in organizational research.
Fisher, Gwenith G; Matthews, Russell A; Gibbons, Alyssa Mitchell
2016-01-01
The validity of organizational research relies on strong research methods, which include effective measurement of psychological constructs. The general consensus is that multiple item measures have better psychometric properties than single-item measures. However, due to practical constraints (e.g., survey length, respondent burden) there are situations in which certain single items may be useful for capturing information about constructs that might otherwise go unmeasured. We evaluated 37 items, including 18 newly developed items as well as 19 single items selected from existing multiple-item scales based on psychometric characteristics, to assess 18 constructs frequently measured in organizational and occupational health psychology research. We examined evidence of reliability; convergent, discriminant, and content validity assessments; and test-retest reliabilities at 1- and 3-month time lags for single-item measures using a multistage and multisource validation strategy across 3 studies, including data from N = 17 occupational health subject matter experts and N = 1,634 survey respondents across 2 samples. Items selected from existing scales generally demonstrated better internal consistency reliability and convergent validity, whereas these particular new items generally had higher levels of content validity. We offer recommendations regarding when use of single items may be more or less appropriate, as well as 11 items that seem acceptable, 14 items with mixed results that might be used with caution due to mixed results, and 12 items we do not recommend using as single-item measures. Although multiple-item measures are preferable from a psychometric standpoint, in some circumstances single-item measures can provide useful information. (c) 2016 APA, all rights reserved).
ERIC Educational Resources Information Center
Arendasy, Martin E.; Sommer, Markus
2012-01-01
The use of new test administration technologies such as computerized adaptive testing in high-stakes educational and occupational assessments demands large item pools. Classic item construction processes and previous approaches to automatic item generation faced the problems of a considerable loss of items after the item calibration phase. In this…
Calibrating Item Families and Summarizing the Results Using Family Expected Response Functions
ERIC Educational Resources Information Center
Sinharay, Sandip; Johnson, Matthew S.; Williamson, David M.
2003-01-01
Item families, which are groups of related items, are becoming increasingly popular in complex educational assessments. For example, in automatic item generation (AIG) systems, a test may consist of multiple items generated from each of a number of item models. Item calibration or scoring for such an assessment requires fitting models that can…
A New Item Selection Procedure for Mixed Item Type in Computerized Classification Testing.
ERIC Educational Resources Information Center
Lau, C. Allen; Wang, Tianyou
This paper proposes a new Information-Time index as the basis for item selection in computerized classification testing (CCT) and investigates how this new item selection algorithm can help improve test efficiency for item pools with mixed item types. It also investigates how practical constraints such as item exposure rate control, test…
Item Purification Does Not Always Improve DIF Detection: A Counterexample with Angoff's Delta Plot
ERIC Educational Resources Information Center
Magis, David; Facon, Bruno
2013-01-01
Item purification is an iterative process that is often advocated as improving the identification of items affected by differential item functioning (DIF). With test-score-based DIF detection methods, item purification iteratively removes the items currently flagged as DIF from the test scores to get purified sets of items, unaffected by DIF. The…
Jones, Conor M; Baker, Justin N; Keesey, Rachel M; Eliason, Ruth J; Lanctot, Jennifer Q; Clegg, Jennifer L; Mandrell, Belinda N; Ness, Kirsten K; Krull, Kevin R; Srivastava, Deokumar; Forrest, Christopher B; Hudson, Melissa M; Robison, Leslie L; Huang, I-Chan
2018-04-18
To compare importance ratings of patient-reported outcomes (PROs) items from the viewpoints of childhood cancer survivors, parents, and clinicians for further developing short-forms to use in survivorship care. 101 cancer survivors, 101 their parents, and 36 clinicians were recruited from St. Jude Children's Research Hospital. Participants were asked to select eight items that they deemed useful for clinical decision making from each of the four Patient-Reported Outcomes Measurement Information System Pediatric item banks. These item banks were pain interference (20 items), fatigue (23 items), psychological stress (19 items), and positive affect (37 items). Compared to survivors, clinicians rated more items across four domains that were statistically different than did parents (23 vs. 13 items). Clinicians rated five items in pain interference domain (ORs 2.33-6.01; p's < 0.05) and three items in fatigue domain (ORs 2.22-3.80; p's < .05) as more important but rated three items in psychological stress domain (ORs 0.14-0.42; p's < .05) and six items in positive affect domain (ORs 0.17-0.35; p's < .05) as less important than did survivors. In contrast, parents rated seven items in positive affect domain (ORs 0.25-0.47; p's < .05) as less important than did survivors. Survivors, parents, and clinicians viewed importance of PRO items for survivorship care differently. These perspectives should be used to assist the development of PROs tools.
Viewing Reading Recovery as a Restructuring Phenomenon
ERIC Educational Resources Information Center
Rinehart, James S.; Short, Paula Myrick
2010-01-01
This study investigated components of Reading Recovery that relate to a restructuring paradigm. Specifically, Reading Recovery was analyzed as a way to redesign teachers' work, empower teachers, and affect the core technology of teaching. Data were collected by a survey that consisted of open-ended questions and of categorical response items.…
Adaptations and Access to Assessment of Common Core Content
ERIC Educational Resources Information Center
Kettler, Ryan J.
2015-01-01
This chapter introduces theory that undergirds the role of testing adaptations in assessment, provides examples of item modifications and testing accommodations, reviews research relevant to each, and introduces a new paradigm that incorporates opportunity to learn (OTL), academic enablers, testing adaptations, and inferences that can be made from…
Development and Validation of a Depression Scale for Asian Adolescents
ERIC Educational Resources Information Center
Woo, Bernardine S. C.; Chang, W. C.; Fung, Daniel S. S.; Koh, Jessie B. K.; Leong, Joyce S. F.; Kee, Carolyn H. Y.; Seah, Cheryl K. F.
2004-01-01
Items covering both core and culture-specific facets of depression were generated based on literature review and clinical experience. They were modified following focus group discussions with depressed adolescents and adolescents in the community. The newly constructed Asian Adolescent Depression Scale (AADS) was administered to a clinical and a…
Library Users' Service Desires: A LibQUAL+ Study
ERIC Educational Resources Information Center
Thompson, Bruce; Kyrillidou, Martha; Cook, Colleen
2008-01-01
The present study was conducted to explore library users' desired service quality levels on the twenty-two core LibQUAL+ items. Specifically, we explored similarities and differences in users' desired library service quality levels across user groups (i.e., undergraduate students, graduate students, and faculty), across geographic locations (i.e.,…
Assessment of Psychopharmacology on the American Board of Psychiatry and Neurology Examinations
ERIC Educational Resources Information Center
Juul, Dorthea; Winstead, Daniel K.; Sheiber, Stephen C.
2005-01-01
OBJECTIVE: To report the assessment of psychopharmacology on the certification and recertification exams in general psychiatry and in the subspecialties administered by the American Board of Psychiatry and Neurology (ABPN). METHODS: The ABPN's core competencies for psychiatrists were reviewed. The number of items addressing psychopharmacology or…
77 FR 41670 - Definition of Terms
Federal Register 2010, 2011, 2012, 2013, 2014
2012-07-16
... cryptography'', 2. On page 642, add the term ``Explosives'', 3. On page 650, add the term ``Nuclear reactor... ``Commerce Control List''. * * * * * Nuclear reactor. (Cat 0 and 2) includes the items within or attached directly to the reactor vessel, the equipment which controls the level of power in the core, and the...
10 CFR 52.137 - Contents of applications; technical information.
Code of Federal Regulations, 2010 CFR
2010-01-01
... limits on its operation, and presents a safety analysis of the structures, systems, and components and of... products. The description shall be sufficient to permit understanding of the system designs and their relationship to the safety evaluations. Items such as the reactor core, reactor coolant system, instrumentation...
Current Risk Management Practices in Psychotherapy Supervision.
Mehrtens, Ilayna K; Crapanzano, Kathleen; Tynes, L Lee
2017-12-01
Psychotherapy competence is a core skill for psychiatry residents, and psychotherapy supervision is a time-honored approach to teaching this skill. To explore the current supervision practices of psychiatry training programs, a 24-item questionnaire was sent to all program directors of Accreditation Council for Graduate Medical Education (ACGME)-approved adult psychiatry programs. The questionnaire included items regarding adherence to recently proposed therapy supervision practices aimed at reducing potential liability risk. The results suggested that current therapy supervision practices do not include sufficient management of the potential liability involved in therapy supervision. Better protections for patients, residents, supervisors and the institutions would be possible with improved credentialing practices and better documentation of informed consent and supervision policies and procedures. © 2017 American Academy of Psychiatry and the Law.
Towards a model of contemporary parenting: the parenting behaviours and dimensions questionnaire.
Reid, Carly A Y; Roberts, Lynne D; Roberts, Clare M; Piek, Jan P
2015-01-01
The assessment of parenting has been problematic due to theoretical disagreement, concerns over generalisability, and problems with the psychometric properties of current parenting measures. The aim of this study was to develop a comprehensive, psychometrically sound self-report parenting measure for use with parents of preadolescent children, and to use this empirical scale development process to identify the core dimensions of contemporary parenting behaviour. Following item generation and parent review, 846 parents completed an online survey comprising 116 parenting items. Exploratory and confirmatory factor analyses supported a six factor parenting model, comprising Emotional Warmth, Punitive Discipline, Anxious Intrusiveness, Autonomy Support, Permissive Discipline and Democratic Discipline. This measure will allow for the comprehensive and consistent assessment of parenting in future research and practice.
Selected list of books and journals for the small medical library.
Brandon, A N; Hill, D R
1979-04-01
This revised list of 492 books and 138 journals is intended as a selection guide for small or medium-sized hospital libraries or for the small medical library serving a specified clientele. It can also be used as a core list by small hospital library consortia. Books and journals are categorized by subject, with the books being followed by an author index and the journals by an alphabetical title listing. Items suggested for initial purchase by smaller libraries are indicated by an asterisk. To purchase the entire collection of books and to pay for annual subscriptions to all the journals would require an expenditure of about $22,500. The cost of only the asterisked items, recommended for first purchase, totals approximately $6,100.
Selected list of books and journals for the small medical library.
Brandon, A N
1977-04-01
This revised list of 472 books and 138 journals is intended as a selection guide for small or medium-sized hospital libraries or for the small medical library serving a specified clientele. It can also be used as a core list by small hospital library consortia. Books and journals are categorized by subject, with the books being followed by an author index and the journals by an alphabetical title listing. Items suggested for initial purchase by smaller libraries are indicated by an asterisk. To purchase the entire collection of books and to pay for annual subscriptions to all the journals would require an expenditure of about $18,200. The cost of only the asterisked items recommended for first purchase totals approximately $4,500.
Kawasaki, Yohei; Ide, Kazuki; Akutagawa, Maiko; Yamada, Hiroshi; Furukawa, Toshiaki A.; Ono, Yutaka
2016-01-01
Background Several studies have shown that total depressive symptom scores in the general population approximate an exponential pattern, except for the lower end of the distribution. The Center for Epidemiologic Studies Depression Scale (CES-D) consists of 20 items, each of which may take on four scores: “rarely,” “some,” “occasionally,” and “most of the time.” Recently, we reported that the item responses for 16 negative affect items commonly exhibit exponential patterns, except for the level of “rarely,” leading us to hypothesize that the item responses at the level of “rarely” may be related to the non-exponential pattern typical of the lower end of the distribution. To verify this hypothesis, we investigated how the item responses contribute to the distribution of the sum of the item scores. Methods Data collected from 21,040 subjects who had completed the CES-D questionnaire as part of a Japanese national survey were analyzed. To assess the item responses of negative affect items, we used a parameter r, which denotes the ratio of “rarely” to “some” in each item response. The distributions of the sum of negative affect items in various combinations were analyzed using log-normal scales and curve fitting. Results The sum of the item scores approximated an exponential pattern regardless of the combination of items, whereas, at the lower end of the distributions, there was a clear divergence between the actual data and the predicted exponential pattern. At the lower end of the distributions, the sum of the item scores with high values of r exhibited higher scores compared to those predicted from the exponential pattern, whereas the sum of the item scores with low values of r exhibited lower scores compared to those predicted. Conclusions The distributional pattern of the sum of the item scores could be predicted from the item responses of such items. PMID:27806132
Mitchell, Alex J; Smith, Adam B; Al-salihy, Zerak; Rahim, Twana A; Mahmud, Mahmud Q; Muhyaldin, Asma S
2011-10-01
We aimed to redefine the optimal self-report symptoms of depression suitable for creation of an item bank that could be used in computer adaptive testing or to develop a simplified screening tool for DSM-V. Four hundred subjects (200 patients with primary depression and 200 non-depressed subjects), living in Iraqi Kurdistan were interviewed. The Mini International Neuropsychiatric Interview (MINI) was used to define the presence of major depression (DSM-IV criteria). We examined symptoms of depression using four well-known scales delivered in Kurdish. The Partial Credit Model was applied to each instrument. Common-item equating was subsequently used to create an item bank and differential item functioning (DIF) explored for known subgroups. A symptom level Rasch analysis reduced the original 45 items to 24 items of the original after the exclusion of 21 misfitting items. A further six items (CESD13 and CESD17, HADS-D4, HADS-D5 and HADS-D7, and CDSS3 and CDSS4) were removed due to misfit as the items were added together to form the item bank, and two items were subsequently removed following the DIF analysis by diagnosis (CESD20 and CDSS9, both of which were harder to endorse for women). Therefore the remaining optimal item bank consisted of 17 items and produced an area under the curve (AUC) of 0.987. Using a bank restricted to the optimal nine items revealed only minor loss of accuracy (AUC = 0.989, sensitivity 96%, specificity 95%). Finally, when restricted to only four items accuracy was still high (AUC was still 0.976; sensitivity 93%, specificity 96%). An item bank of 17 items may be useful in computer adaptive testing and nine or even four items may be used to develop a simplified screening tool for DSM-V major depressive disorder (MDD). Further examination of this item bank should be conducted in different cultural settings.
A Comparison of Three Types of Test Development Procedures Using Classical and Latent Trait Methods.
ERIC Educational Resources Information Center
Benson, Jeri; Wilson, Michael
Three methods of item selection were used to select sets of 38 items from a 50-item verbal analogies test and the resulting item sets were compared for internal consistency, standard errors of measurement, item difficulty, biserial item-test correlations, and relative efficiency. Three groups of 1,500 cases each were used for item selection. First…
Parameter Estimation in Rasch Models for Examinee-Selected Items
ERIC Educational Resources Information Center
Liu, Chen-Wei; Wang, Wen-Chung
2017-01-01
The examinee-selected-item (ESI) design, in which examinees are required to respond to a fixed number of items in a given set of items (e.g., choose one item to respond from a pair of items), always yields incomplete data (i.e., only the selected items are answered and the others have missing data) that are likely nonignorable. Therefore, using…
ERIC Educational Resources Information Center
Wang, Wen-Chung
2004-01-01
Scale indeterminacy in analysis of differential item functioning (DIF) within the framework of item response theory can be resolved by imposing 3 anchor item methods: the equal-mean-difficulty method, the all-other anchor item method, and the constant anchor item method. In this article, applicability and limitations of these 3 methods are…
ERIC Educational Resources Information Center
Çokluk, Ömay; Gül, Emrah; Dogan-Gül, Çilem
2016-01-01
The study aims to examine whether differential item function is displayed in three different test forms that have item orders of random and sequential versions (easy-to-hard and hard-to-easy), based on Classical Test Theory (CTT) and Item Response Theory (IRT) methods and bearing item difficulty levels in mind. In the correlational research, the…
State Assessment Program Item Banks: Model Language for Request for Proposals (RFP) and Contracts
ERIC Educational Resources Information Center
Swanson, Leonard C.
2010-01-01
This document provides recommendations for request for proposal (RFP) and contract language that state education agencies can use to specify their requirements for access to test item banks. An item bank is a repository for test items and data about those items. Item banks are used by state agency staff to view items and associated data; to…
Examination of the PROMIS upper extremity item bank.
Hung, Man; Voss, Maren W; Bounsanga, Jerry; Crum, Anthony B; Tyser, Andrew R
Clinical measurement. The psychometric properties of the PROMIS v1.2 UE item bank were tested on various samples prior to its release, but have not been fully evaluated among the orthopaedic population. This study assesses the performance of the UE item bank within the UE orthopaedic patient population. The UE item bank was administered to 1197 adult patients presenting to a tertiary orthopaedic clinic specializing in hand and UE conditions and was examined using traditional statistics and Rasch analysis. The UE item bank fits a unidimensional model (outfit MNSQ range from 0.64 to 1.70) and has adequate reliabilities (person = 0.84; item = 0.82) and local independence (item residual correlations range from -0.37 to 0.34). Only one item exhibits gender differential item functioning. Most items target low levels of function. The UE item bank is a useful clinical assessment tool. Additional items covering higher functions are needed to enhance validity. Supplemental testing is recommended for patients at higher levels of function until more high function UE items are developed. 2c. Copyright © 2016 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
Attention! Can choices for low value food over high value food be trained?
Zoltak, Michael J; Veling, Harm; Chen, Zhang; Holland, Rob W
2018-05-01
People choose high value food items over low value food items, because food choices are guided by the comparison of values placed upon choice alternatives. This value comparison process is also influenced by the amount of attention people allocate to different items. Recent research shows that choices for food items can be increased by training attention toward these items, with a paradigm named cued-approach training (CAT). However, previous work till now has only examined the influence of CAT on choices between two equally valued items. It has remained unclear whether CAT can increase choices for low value items when people choose between a low and high value food item. To address this question in the current study participants were cued to make rapid responses in CAT to certain low and high value items. Next, they made binary choices between low and high value items, where we systematically varied whether the low and high value items were cued or uncued. In two experiments, we found that participants overall preferred high over low value food items for real consumption. More important, their choices for low value items increased when only the low value item had been cued in CAT compared to when both low and high value items had not been cued. Exploratory analyses revealed that this effect was more pronounced for participants with a relatively small value difference between low and high value items. The present research thus suggests that CAT may be used to boost the choice and consumption of low value items via enhanced attention toward these items, as long as the value difference is not too large. Implications for facilitating choices for healthy food are discussed. Copyright © 2017 Elsevier Ltd. All rights reserved.
Improved Approximation Algorithms for Item Pricing with Bounded Degree and Valuation
NASA Astrophysics Data System (ADS)
Hamane, Ryoso; Itoh, Toshiya
When a store sells items to customers, the store wishes to decide the prices of the items to maximize its profit. If the store sells the items with low (resp. high) prices, the customers buy more (resp. less) items, which provides less profit to the store. It would be hard for the store to decide the prices of items. Assume that a store has a set V of n items and there is a set C of m customers who wish to buy those items. The goal of the store is to decide the price of each item to maximize its profit. We refer to this maximization problem as an item pricing problem. We classify the item pricing problems according to how many items the store can sell or how the customers valuate the items. If the store can sell every item i with unlimited (resp. limited) amount, we refer to this as unlimited supply (resp. limited supply). We say that the item pricing problem is single-minded if each customer j∈C wishes to buy a set ej⊆V of items and assigns valuation w(ej)≥0. For the single-minded item pricing problems (in unlimited supply), Balcan and Blum regarded them as weighted k-hypergraphs and gave several approximation algorithms. In this paper, we focus on the (pseudo) degree of k-hypergraphs and the valuation ratio, i. e., the ratio between the smallest and the largest valuations. Then for the single-minded item pricing problems (in unlimited supply), we show improved approximation algorithms (for k-hypergraphs, general graphs, bipartite graphs, etc.) with respect to the maximum (pseudo) degree and the valuation ratio.
CTTITEM: SAS macro and SPSS syntax for classical item analysis.
Lei, Pui-Wa; Wu, Qiong
2007-08-01
This article describes the functions of a SAS macro and an SPSS syntax that produce common statistics for conventional item analysis including Cronbach's alpha, item difficulty index (p-value or item mean), and item discrimination indices (D-index, point biserial and biserial correlations for dichotomous items and item-total correlation for polytomous items). These programs represent an improvement over the existing SAS and SPSS item analysis routines in terms of completeness and user-friendliness. To promote routine evaluations of item qualities in instrument development of any scale, the programs are available at no charge for interested users. The program codes along with a brief user's manual that contains instructions and examples are downloadable from suen.ed.psu.edu/-pwlei/plei.htm.
Berthaume, Michael A.; Dumont, Elizabeth R.; Godfrey, Laurie R.; Grosse, Ian R.
2014-01-01
Teeth are often assumed to be optimal for their function, which allows researchers to derive dietary signatures from tooth shape. Most tooth shape analyses normalize for tooth size, potentially masking the relationship between relative food item size and tooth shape. Here, we model how relative food item size may affect optimal tooth cusp radius of curvature (RoC) during the fracture of brittle food items using a parametric finite-element (FE) model of a four-cusped molar. Morphospaces were created for four different food item sizes by altering cusp RoCs to determine whether optimal tooth shape changed as food item size changed. The morphospaces were also used to investigate whether variation in efficiency metrics (i.e. stresses, energy and optimality) changed as food item size changed. We found that optimal tooth shape changed as food item size changed, but that all optimal morphologies were similar, with one dull cusp that promoted high stresses in the food item and three cusps that acted to stabilize the food item. There were also positive relationships between food item size and the coefficients of variation for stresses in food item and optimality, and negative relationships between food item size and the coefficients of variation for stresses in the enamel and strain energy absorbed by the food item. These results suggest that relative food item size may play a role in selecting for optimal tooth shape, and the magnitude of these selective forces may change depending on food item size and which efficiency metric is being selected. PMID:25320068
Item Difficulty Modeling of Paragraph Comprehension Items
ERIC Educational Resources Information Center
Gorin, Joanna S.; Embretson, Susan E.
2006-01-01
Recent assessment research joining cognitive psychology and psychometric theory has introduced a new technology, item generation. In algorithmic item generation, items are systematically created based on specific combinations of features that underlie the processing required to correctly solve a problem. Reading comprehension items have been more…
Item-focussed Trees for the Identification of Items in Differential Item Functioning.
Tutz, Gerhard; Berger, Moritz
2016-09-01
A novel method for the identification of differential item functioning (DIF) by means of recursive partitioning techniques is proposed. We assume an extension of the Rasch model that allows for DIF being induced by an arbitrary number of covariates for each item. Recursive partitioning on the item level results in one tree for each item and leads to simultaneous selection of items and variables that induce DIF. For each item, it is possible to detect groups of subjects with different item difficulties, defined by combinations of characteristics that are not pre-specified. The way a DIF item is determined by covariates is visualized in a small tree and therefore easily accessible. An algorithm is proposed that is based on permutation tests. Various simulation studies, including the comparison with traditional approaches to identify items with DIF, show the applicability and the competitive performance of the method. Two applications illustrate the usefulness and the advantages of the new method.
Forrest, Christopher B; Devine, Janine; Bevans, Katherine B; Becker, Brandon D; Carle, Adam C; Teneralli, Rachel E; Moon, JeanHee; Tucker, Carole A; Ravens-Sieberer, Ulrike
2018-01-01
To describe the psychometric evaluation and item response theory calibration of the PROMIS Pediatric Life Satisfaction item banks, child-report, and parent-proxy editions. A pool of 55 life satisfaction items was administered to 1992 children 8-17 years old and 964 parents of children 5-17 years old. Analyses included descriptive statistics, reliability, factor analysis, differential item functioning, and assessment of construct validity. Thirteen items were deleted because of poor psychometric performance. An 8-item short form was administered to a national sample of 996 children 8-17 years old, and 1294 parents of children 5-17 years old. The combined sample (2988 children and 2258 parents) was used in item response theory (IRT) calibration analyses. The final item banks were unidimensional, the items were locally independent, and the items were free from impactful differential item functioning. The 8-item and 4-item short form scales showed excellent reliability, convergent validity, and discriminant validity. Life satisfaction decreased with declining socio-economic status, presence of a special health care need, and increasing age for girls, but not boys. After IRT calibration, we found that 4- and 8-item short forms had a high degree of precision (reliability) across a wide range (>4 SD units) of the latent variable. The PROMIS Pediatric Life Satisfaction item banks and their short forms provide efficient, precise, and valid assessments of life satisfaction in children and youth.
The Effects of Goal Relevance and Perceptual Features on Emotional Items and Associative Memory
Mao, Wei B.; An, Shu; Yang, Xiao F.
2017-01-01
Showing an emotional item in a neutral background scene often leads to enhanced memory for the emotional item and impaired associative memory for background details. Meanwhile, both top–down goal relevance and bottom–up perceptual features played important roles in memory binding. We conducted two experiments and aimed to further examine the effects of goal relevance and perceptual features on emotional items and associative memory. By manipulating goal relevance (asking participants to categorize only each item image as living or non-living or to categorize each whole composite picture consisted of item image and background scene as natural scene or manufactured scene) and perceptual features (controlling visual contrast and visual familiarity) in two experiments, we found that both high goal relevance and salient perceptual features (high salience of items vs. high familiarity of items) could promote emotional item memory, but they had different effects on associative memory for emotional items and neutral backgrounds. Specifically, high goal relevance and high perceptual-salience of items could jointly impair the associative memory for emotional items and neutral backgrounds, while the effect of item familiarity on associative memory for emotional items would be modulated by goal relevance. High familiarity of items could increase associative memory for negative items and neutral backgrounds only in the low goal relevance condition. These findings suggest the effect of emotion on associative memory is not only related to attentional capture elicited by emotion, but also can be affected by goal relevance and perceptual features of stimulus. PMID:28790943
The Effects of Goal Relevance and Perceptual Features on Emotional Items and Associative Memory.
Mao, Wei B; An, Shu; Yang, Xiao F
2017-01-01
Showing an emotional item in a neutral background scene often leads to enhanced memory for the emotional item and impaired associative memory for background details. Meanwhile, both top-down goal relevance and bottom-up perceptual features played important roles in memory binding. We conducted two experiments and aimed to further examine the effects of goal relevance and perceptual features on emotional items and associative memory. By manipulating goal relevance (asking participants to categorize only each item image as living or non-living or to categorize each whole composite picture consisted of item image and background scene as natural scene or manufactured scene) and perceptual features (controlling visual contrast and visual familiarity) in two experiments, we found that both high goal relevance and salient perceptual features (high salience of items vs. high familiarity of items) could promote emotional item memory, but they had different effects on associative memory for emotional items and neutral backgrounds. Specifically, high goal relevance and high perceptual-salience of items could jointly impair the associative memory for emotional items and neutral backgrounds, while the effect of item familiarity on associative memory for emotional items would be modulated by goal relevance. High familiarity of items could increase associative memory for negative items and neutral backgrounds only in the low goal relevance condition. These findings suggest the effect of emotion on associative memory is not only related to attentional capture elicited by emotion, but also can be affected by goal relevance and perceptual features of stimulus.
Forrest, Christopher B; Ravens-Sieberer, Ulrike; Devine, Janine; Becker, Brandon D; Teneralli, Rachel; Moon, JeanHee; Carle, Adam; Tucker, Carole A; Bevans, Katherine B
2018-03-01
The purpose of this study is to describe the psychometric evaluation and item response theory calibration of the PROMIS Pediatric Positive Affect item bank, child-report and parent-proxy editions. The initial item pool comprising 53 items, previously developed using qualitative methods, was administered to 1,874 children 8-17 years old and 909 parents of children 5-17 years old. Analyses included descriptive statistics, reliability, factor analysis, differential item functioning, and construct validity. A total of 14 items were deleted, because of poor psychometric performance, and an 8-item short form constructed from the remaining 39 items was administered to a national sample of 1,004 children 8-17 years old, and 1,306 parents of children 5-17 years old. The combined sample was used in item response theory (IRT) calibration analyses. The final item bank appeared unidimensional, the items appeared locally independent, and the items were free from differential item functioning. The scales showed excellent reliability and convergent and discriminant validity. Positive affect decreased with children's age and was lower for those with a special health care need. After IRT calibration, we found that 4 and 8 item short forms had a high degree of precision (reliability) across a wide range of the latent trait (>4 SD units). The PROMIS Pediatric Positive Affect item bank and its short forms provide an efficient, precise, and valid assessment of positive affect in children and youth.
2013-01-01
Introduction: Craving is useful in the diagnosis of drug dependence, but it is unclear how various items used to assess craving might influence the diagnostic performance of craving measures. This study determined the diagnostic performance of individual items and item subgroups of the 32-item Questionnaire on Smoking Urges (QSU) as a function of item wording, level of craving intensity, and item stability. Methods: Nondaily and daily smokers (n = 222) completed the QSU on 6 separate occasions, and item responses were averaged across the administrations. Nicotine dependence was assessed with the Wisconsin Inventory of Smoking Dependence Motives. The discriminative performance of the QSU items was evaluated with receiver-operating characteristic curves and area under the curve statistics. Results: Although each of the QSU items and selected subgroups of items significantly discriminated dependent from nondependent smokers, certain item subgroups outperformed others. There was no difference in discriminative performance between use of the specific terms urge and crave or between items assessing intention to smoke relative to those assessing desire to smoke, but there were significant differences in the two major factors represented on the QSU and in craving items reflecting more intense relative to less intense craving. Stability of the item scores was strongly related to the discriminative performance of craving. Conclusions: Items indexing stable, high-intensity aspects of craving that reflect the negative reinforcing effects of smoking will likely be most useful for diagnostic purposes. Future directions and implications are discussed. PMID:23817585
Germeroth, Lisa J; Wray, Jennifer M; Gass, Julie C; Tiffany, Stephen T
2013-12-01
Craving is useful in the diagnosis of drug dependence, but it is unclear how various items used to assess craving might influence the diagnostic performance of craving measures. This study determined the diagnostic performance of individual items and item subgroups of the 32-item Questionnaire on Smoking Urges (QSU) as a function of item wording, level of craving intensity, and item stability. Nondaily and daily smokers (n = 222) completed the QSU on 6 separate occasions, and item responses were averaged across the administrations. Nicotine dependence was assessed with the Wisconsin Inventory of Smoking Dependence Motives. The discriminative performance of the QSU items was evaluated with receiver-operating characteristic curves and area under the curve statistics. Although each of the QSU items and selected subgroups of items significantly discriminated dependent from nondependent smokers, certain item subgroups outperformed others. There was no difference in discriminative performance between use of the specific terms urge and crave or between items assessing intention to smoke relative to those assessing desire to smoke, but there were significant differences in the two major factors represented on the QSU and in craving items reflecting more intense relative to less intense craving. Stability of the item scores was strongly related to the discriminative performance of craving. Items indexing stable, high-intensity aspects of craving that reflect the negative reinforcing effects of smoking will likely be most useful for diagnostic purposes. Future directions and implications are discussed.
Dissociative effects of orthographic distinctiveness in pure and mixed lists: an item-order account.
McDaniel, Mark A; Cahill, Michael; Bugg, Julie M; Meadow, Nathaniel G
2011-10-01
We apply the item-order theory of list composition effects in free recall to the orthographic distinctiveness effect. The item-order account assumes that orthographically distinct items advantage item-specific encoding in both mixed and pure lists, but at the expense of exploiting relational information present in the list. Experiment 1 replicated the typical free recall advantage of orthographically distinct items in mixed lists and the elimination of that advantage in pure lists. Supporting the item-order account, recognition performances indicated that orthographically distinct items received greater item-specific encoding than did orthographically common items in mixed and pure lists (Experiments 1 and 2). Furthermore, order memory (input-output correspondence and sequential contiguity effects) was evident in recall of pure unstructured common lists, but not in recall of unstructured distinct lists (Experiment 1). These combined patterns, although not anticipated by prevailing views, are consistent with an item-order account.
Michaelides, Michalis P.
2010-01-01
Many studies have investigated the topic of change or drift in item parameter estimates in the context of item response theory (IRT). Content effects, such as instructional variation and curricular emphasis, as well as context effects, such as the wording, position, or exposure of an item have been found to impact item parameter estimates. The issue becomes more critical when items with estimates exhibiting differential behavior across test administrations are used as common for deriving equating transformations. This paper reviews the types of effects on IRT item parameter estimates and focuses on the impact of misbehaving or aberrant common items on equating transformations. Implications relating to test validity and the judgmental nature of the decision to keep or discard aberrant common items are discussed, with recommendations for future research into more informed and formal ways of dealing with misbehaving common items. PMID:21833230
Michaelides, Michalis P
2010-01-01
Many studies have investigated the topic of change or drift in item parameter estimates in the context of item response theory (IRT). Content effects, such as instructional variation and curricular emphasis, as well as context effects, such as the wording, position, or exposure of an item have been found to impact item parameter estimates. The issue becomes more critical when items with estimates exhibiting differential behavior across test administrations are used as common for deriving equating transformations. This paper reviews the types of effects on IRT item parameter estimates and focuses on the impact of misbehaving or aberrant common items on equating transformations. Implications relating to test validity and the judgmental nature of the decision to keep or discard aberrant common items are discussed, with recommendations for future research into more informed and formal ways of dealing with misbehaving common items.
Insomnia Severity is an Indicator of Suicidal Ideation During a Depression Clinical Trial
McCall, W. Vaughn; Blocker, Jill N.; D’Agostino, Ralph; Kimball, James; Boggs, Niki; Lasater, Barbara; Rosenquist, Peter B.
2010-01-01
Objective Insomnia has been linked to suicidal ideas and suicide death in cross-sectional and longitudinal population-based studies. A link between insomnia and suicide has not been previously examined in the setting of a clinical trial. Herein we describe the relationship between insomnia and suicidal thinking during the course of a clinical trial for depression with insomnia. Methods Sixty patients aged 41.5 ± 12.5 years (2/3 women) with major depressive episode and symptoms of insomnia received open label fluoxetine for 9 weeks and also received blinded, randomized eszopiclone 3 mg or placebo at bedtime after the first week of fluoxetine. Insomnia symptoms were assessed with the Insomnia Severity Index (ISI), and suicidal ideation was assessed with The Scale for Suicide Ideation (SSI). Depression symptoms were assessed with the depressed mood item and the anhedonia item from the Hamilton Rating Scale for Depression-24 (HRSD24), as well as a sum score for all non-sleep and non-suicide items from the HRSD (HRSD20). Measurements were taken at baseline and weeks 1, 2, 4, 6, and 8. SSI was examined by generalized linear mixed models for repeated measures as the outcome of interest for all 60 participants with ISI and various mood symptoms as independent variables, with adjustment for age, gender, treatment assignment, and baseline SSI. Results Higher levels of insomnia corresponded to significantly greater intensity of suicidal thinking (p<0.01). The depressed mood item of the HRSD, and the sum of the HRSD20, both corresponded to greater suicidal thinking (p<0.001). The anhedonia item did not correspond with suicidal thinking. When both ISI and the depressed mood item, or ISI and the anhedonia item, were included together in the same model, the ISI remained an independent predictor of suicidal thinking. Conclusions The results support the concept that insomnia may be a useful indicator for suicidal ideation, and now extend this idea into clinical trials. Insomnia remains an independent indicator of suicidal ideation even taking into account the core symptoms of depression such as depressed mood and anhedonia. The complaint of insomnia during a depression clinical trial might indicate that more direct questioning about suicide is warranted. PMID:20478741
Corbett, Anne; Achterberg, Wilco; Husebo, Bettina; Lobbezoo, Frank; de Vet, Henrica; Kunz, Miriam; Strand, Liv; Constantinou, Marios; Tudose, Catalina; Kappesser, Judith; de Waal, Margot; Lautenbacher, Stefan
2014-12-10
Pain is common in people with dementia, yet identification is challenging. A number of pain assessment tools exist, utilizing observation of pain-related behaviours, vocalizations and facial expressions. Whilst they have been developed robustly, these often lack sufficient evidence of psychometric properties, like reliability, face and construct validity, responsiveness and usability, and are not internationally implemented. The EU-COST initiative "Pain in impaired cognition, especially dementia" aims to combine the expertise of clinicians and researchers to address this important issue by building on previous research in the area, identifying existing pain assessment tools for dementia, and developing consensus for items for a new universal meta-tool for use in research and clinical settings. This paper reports on the initial phase of this collaboration task. All existing observational pain behaviour tools were identified and elements categorised using a three-step reduction process. Selection and refinement of items for the draft Pain Assessment in Impaired Cognition (PAIC) meta-tool was achieved through scrutiny of the evidence, consensus of expert opinion, frequency of use and alignment with the American Geriatric Society guidelines. The main aim of this process was to identify key items with potential empirical, rather than theoretical value to take forward for testing. 12 eligible assessment tools were identified, and pain items categorised according to behaviour, facial expression and vocalisation according to the AGS guidelines (Domains 1 - 3). This has been refined to create the PAIC meta-tool for validation and further refinement. A decision was made to create a supporting comprehensive toolkit to support the core assessment tool to provide additional resources for the assessment of overlapping symptoms in dementia, including AGS domains four to six, identification of specific types of pain and assessment of duration and location of pain. This multidisciplinary, cross-cultural initiative has created a draft meta-tool for capturing pain behaviour to be used across languages and culture, based on the most promising items used in existing tools. The draft PAIC meta-tool will now be taken forward for evaluation according to COSMIN guidelines and the EU-COST protocol in order to exclude invalid items, refine included items and optimise the meta-tool.
Richter, Randy R; Sebelski, Chris A; Austin, Tricia M
2016-09-01
The quality of abstract reporting in physical therapy literature is unknown. The purpose of this study was to provide baseline data for judging the future impact of the 2010 Consolidated Standards of Reporting Trials statement specifically referencing the 2008 Consolidated Standards of Reporting Trials statement for reporting of abstracts of randomized controlled trials across and between a broad sample and a core sample of physical therapy literature. A cross-sectional, bibliographic analysis was conducted. Abstracts of randomized controlled trials from 2009 were retrieved from PubMed, PEDro, and CENTRAL. Eligibility was determined using PEDro criteria. For outcomes measures, items from the Consolidated Standards of Reporting Trials statement for abstract reporting were used for assessment. Raters were not blinded to citation details. Using a computer-generated set of random numbers, 150 abstracts from 112 journals comprised the broad sample. A total of 53 abstracts comprised the core sample. Fourteen of 20 Consolidated Standards of Reporting Trials items for both samples were reported in less than 50% of the abstracts. Significantly more abstracts in the core sample reported (% difference core - broad; 95% confidence interval) title (28.4%; 12.9%-41.2%), blinding (15.2%; 1.6%-29.8%), setting (47.6%; 32.4%-59.4%), and confidence intervals (13.1%; 5.0%-25.1%). These findings provide baseline data for determining if continuing efforts to improve abstract reporting are heeded.
Development of tailorable advanced blanket insulation for advanced space transportation systems
NASA Technical Reports Server (NTRS)
Calamito, Dominic P.
1987-01-01
Two items of Tailorable Advanced Blanket Insulation (TABI) for Advanced Space Transportation Systems were produced. The first consisted of flat panels made from integrally woven, 3-D fluted core having parallel fabric faces and connecting ribs of Nicalon silicon carbide yarns. The triangular cross section of the flutes were filled with mandrels of processed Q-Fiber Felt. Forty panels were prepared with only minimal problems, mostly resulting from the unavailability of insulation with the proper density. Rigidizing the fluted fabric prior to inserting the insulation reduced the production time. The procedures for producing the fabric, insulation mandrels, and TABI panels are described. The second item was an effort to determine the feasibility of producing contoured TABI shapes from gores cut from flat, insulated fluted core panels. Two gores of integrally woven fluted core and single ply fabric (ICAS) were insulated and joined into a large spherical shape employing a tadpole insulator at the mating edges. The fluted core segment of each ICAS consisted of an Astroquartz face fabric and Nicalon face and rib fabrics, while the single ply fabric segment was Nicalon. Further development will be required. The success of fabricating this assembly indicates that this concept may be feasible for certain types of space insulation requirements. The procedures developed for weaving the ICAS, joining the gores, and coating certain areas of the fabrics are presented.
The Role of Item Models in Automatic Item Generation
ERIC Educational Resources Information Center
Gierl, Mark J.; Lai, Hollis
2012-01-01
Automatic item generation represents a relatively new but rapidly evolving research area where cognitive and psychometric theories are used to produce tests that include items generated using computer technology. Automatic item generation requires two steps. First, test development specialists create item models, which are comparable to templates…
ERIC Educational Resources Information Center
Rudner, Lawrence
This digest discusses the advantages and disadvantages of using item banks, and it provides useful information for those who are considering implementing an item banking project in their school districts. The primary advantage of item banking is in test development. Using an item response theory method, such as the Rasch model, items from multiple…
ERIC Educational Resources Information Center
Qian, Xiaoyu; Nandakumar, Ratna; Glutting, Joseoph; Ford, Danielle; Fifield, Steve
2017-01-01
In this study, we investigated gender and minority achievement gaps on 8th-grade science items employing a multilevel item response methodology. Both gaps were wider on physics and earth science items than on biology and chemistry items. Larger gender gaps were found on items with specific topics favoring male students than other items, for…
The Effect of the Position of an Item within a Test on the Item Difficulty Value.
ERIC Educational Resources Information Center
Rubin, Lois S.; Mott, David E. W.
An investigation of the effect on the difficulty value of an item due to position placement within a test was made. Using a 60-item operational test comprised of 5 subtests, 60 items were placed as experimental items on a number of spiralled test forms in three different positions (first, middle, last) within the subtest composed of like items.…
ERIC Educational Resources Information Center
Kostin, Irene
2004-01-01
The purpose of this study is to explore the relationship between a set of item characteristics and the difficulty of TOEFL[R] dialogue items. Identifying characteristics that are related to item difficulty has the potential to improve the efficiency of the item-writing process The study employed 365 TOEFL dialogue items, which were coded on 49…
ERIC Educational Resources Information Center
Marie, S. Maria Josephine Arokia; Edannur, Sreekala
2015-01-01
This paper focused on the analysis of test items constructed in the paper of teaching Physical Science for B.Ed. class. It involved the analysis of difficulty level and discrimination power of each test item. Item analysis allows selecting or omitting items from the test, but more importantly item analysis is a tool to help the item writer improve…
Remembered but Unused: The Accessory Items in Working Memory that Do Not Guide Attention
ERIC Educational Resources Information Center
Peters, Judith C.; Goebel, Rainer; Roelfsema, Pieter R.
2009-01-01
If we search for an item, a representation of this item in our working memory guides attention to matching items in the visual scene. We can hold multiple items in working memory. Do all these items guide attention in parallel? We asked participants to detect a target object in a stream of objects while they maintained a second item in memory for…
Kim, Stella H; Strutt, Adriana M; Olabarrieta-Landa, Laiene; Lequerica, Anthony H; Rivera, Diego; De Los Reyes Aragon, Carlos Jose; Utria, Oscar; Arango-Lasprilla, Juan Carlos
2018-02-23
The Boston Naming Test (BNT) is a widely used measure of confrontation naming ability that has been criticized for its questionable construct validity for non-English speakers. This study investigated item difficulty and construct validity of the Spanish version of the BNT to assess cultural and linguistic impact on performance. Subjects were 1298 healthy Spanish speaking adults from Colombia. They were administered the 60- and 15-item Spanish version of the BNT. A Rasch analysis was computed to assess dimensionality, item hierarchy, targeting, reliability, and item fit. Both versions of the BNT satisfied requirements for unidimensionality. Although internal consistency was excellent for the 60-item BNT, order of difficulty did not increase consistently with item number and there were a number of items that did not fit the Rasch model. For the 15-item BNT, a total of 5 items changed position on the item hierarchy with 7 poor fitting items. Internal consistency was acceptable. Construct validity of the BNT remains a concern when it is administered to non-English speaking populations. Similar to previous findings, the order of item presentation did not correspond with increasing item difficulty, and both versions were inadequate at assessing high naming ability.
Negative effects of item repetition on source memory.
Kim, Kyungmi; Yi, Do-Joon; Raye, Carol L; Johnson, Marcia K
2012-08-01
In the present study, we explored how item repetition affects source memory for new item-feature associations (picture-location or picture-color). We presented line drawings varying numbers of times in Phase 1. In Phase 2, each drawing was presented once with a critical new feature. In Phase 3, we tested memory for the new source feature of each item from Phase 2. Experiments 1 and 2 demonstrated and replicated the negative effects of item repetition on incidental source memory. Prior item repetition also had a negative effect on source memory when different source dimensions were used in Phases 1 and 2 (Experiment 3) and when participants were explicitly instructed to learn source information in Phase 2 (Experiments 4 and 5). Importantly, when the order between Phases 1 and 2 was reversed, such that item repetition occurred after the encoding of critical item-source combinations, item repetition no longer affected source memory (Experiment 6). Overall, our findings did not support predictions based on item predifferentiation, within-dimension source interference, or general interference from multiple traces of an item. Rather, the findings were consistent with the idea that prior item repetition reduces attention to subsequent presentations of the item, decreasing the likelihood that critical item-source associations will be encoded.
Better assessment of physical function: item improvement is neglected but essential
2009-01-01
Introduction Physical function is a key component of patient-reported outcome (PRO) assessment in rheumatology. Modern psychometric methods, such as Item Response Theory (IRT) and Computerized Adaptive Testing, can materially improve measurement precision at the item level. We present the qualitative and quantitative item-evaluation process for developing the Patient Reported Outcomes Measurement Information System (PROMIS) Physical Function item bank. Methods The process was stepwise: we searched extensively to identify extant Physical Function items and then classified and selectively reduced the item pool. We evaluated retained items for content, clarity, relevance and comprehension, reading level, and translation ease by experts and patient surveys, focus groups, and cognitive interviews. We then assessed items by using classic test theory and IRT, used confirmatory factor analyses to estimate item parameters, and graded response modeling for parameter estimation. We retained the 20 Legacy (original) Health Assessment Questionnaire Disability Index (HAQ-DI) and the 10 SF-36's PF-10 items for comparison. Subjects were from rheumatoid arthritis, osteoarthritis, and healthy aging cohorts (n = 1,100) and a national Internet sample of 21,133 subjects. Results We identified 1,860 items. After qualitative and quantitative evaluation, 124 newly developed PROMIS items composed the PROMIS item bank, which included revised Legacy items with good fit that met IRT model assumptions. Results showed that the clearest and best-understood items were simple, in the present tense, and straightforward. Basic tasks (like dressing) were more relevant and important versus complex ones (like dancing). Revised HAQ-DI and PF-10 items with five response options had higher item-information content than did comparable original Legacy items with fewer response options. IRT analyses showed that the Physical Function domain satisfied general criteria for unidimensionality with one-, two-, three-, and four-factor models having comparable model fits. Correlations between factors in the test data sets were > 0.90. Conclusions Item improvement must underlie attempts to improve outcome assessment. The clear, personally important and relevant, ability-framed items in the PROMIS Physical Function item bank perform well in PRO assessment. They will benefit from further study and application in a wider variety of rheumatic diseases in diverse clinical groups, including those at the extremes of physical functioning, and in different administration modes. PMID:20015354
Better assessment of physical function: item improvement is neglected but essential.
Bruce, Bonnie; Fries, James F; Ambrosini, Debbie; Lingala, Bharathi; Gandek, Barbara; Rose, Matthias; Ware, John E
2009-01-01
Physical function is a key component of patient-reported outcome (PRO) assessment in rheumatology. Modern psychometric methods, such as Item Response Theory (IRT) and Computerized Adaptive Testing, can materially improve measurement precision at the item level. We present the qualitative and quantitative item-evaluation process for developing the Patient Reported Outcomes Measurement Information System (PROMIS) Physical Function item bank. The process was stepwise: we searched extensively to identify extant Physical Function items and then classified and selectively reduced the item pool. We evaluated retained items for content, clarity, relevance and comprehension, reading level, and translation ease by experts and patient surveys, focus groups, and cognitive interviews. We then assessed items by using classic test theory and IRT, used confirmatory factor analyses to estimate item parameters, and graded response modeling for parameter estimation. We retained the 20 Legacy (original) Health Assessment Questionnaire Disability Index (HAQ-DI) and the 10 SF-36's PF-10 items for comparison. Subjects were from rheumatoid arthritis, osteoarthritis, and healthy aging cohorts (n = 1,100) and a national Internet sample of 21,133 subjects. We identified 1,860 items. After qualitative and quantitative evaluation, 124 newly developed PROMIS items composed the PROMIS item bank, which included revised Legacy items with good fit that met IRT model assumptions. Results showed that the clearest and best-understood items were simple, in the present tense, and straightforward. Basic tasks (like dressing) were more relevant and important versus complex ones (like dancing). Revised HAQ-DI and PF-10 items with five response options had higher item-information content than did comparable original Legacy items with fewer response options. IRT analyses showed that the Physical Function domain satisfied general criteria for unidimensionality with one-, two-, three-, and four-factor models having comparable model fits. Correlations between factors in the test data sets were > 0.90. Item improvement must underlie attempts to improve outcome assessment. The clear, personally important and relevant, ability-framed items in the PROMIS Physical Function item bank perform well in PRO assessment. They will benefit from further study and application in a wider variety of rheumatic diseases in diverse clinical groups, including those at the extremes of physical functioning, and in different administration modes.
Mao, Xinrui; Tian, Mengxi; Liu, Yi; Li, Bingcan; Jin, Yan; Wu, Yanhong; Guo, Chunyan
2017-01-01
Retrieval inhibition hypothesis of directed forgetting effects assumed TBF (to-be-forgotten) items were not retrieved intentionally, while selective rehearsal hypothesis assumed the memory representation of retrieved TBF (to-be-forgotten) items was weaker than TBR (to-be-remembered) items. Previous studies indicated that directed forgetting effects of item-cueing method resulted from selective rehearsal at encoding, but the mechanism of retrieval inhibition that affected directed forgetting of TBF (to-be-forgotten) items was not clear. Strategic retrieval is a control process allowing the selective retrieval of target information, which includes retrieval orientation and strategic recollection. Retrieval orientation via the comparison of tasks refers to the specific form of processing resulted by retrieval efforts. Strategic recollection is the type of strategies to recollect studied items for the retrieval success of targets. Using a "directed forgetting" paradigm combined with a memory exclusion task, our investigation of strategic retrieval in directed forgetting assisted to explore how retrieval inhibition played a role on directed forgetting effects. When TBF items were targeted, retrieval orientation showed more positive ERPs to new items, indicating that TBF items demanded more retrieval efforts. The results of strategic recollection indicated that: (a) when TBR items were retrieval targets, late parietal old/new effects were only evoked by TBR items but not TBF items, indicating the retrieval inhibition of TBF items; (b) when TBF items were retrieval targets, the late parietal old/new effect were evoked by both TBR items and TBF items, indicating that strategic retrieval could overcome retrieval inhibition of TBF items. These findings suggested the modulation of strategic retrieval on retrieval inhibition of directed forgetting, supporting that directed forgetting effects were not only caused by selective rehearsal, but also retrieval inhibition.
Mao, Xinrui; Tian, Mengxi; Liu, Yi; Li, Bingcan; Jin, Yan; Wu, Yanhong; Guo, Chunyan
2017-01-01
Retrieval inhibition hypothesis of directed forgetting effects assumed TBF (to-be-forgotten) items were not retrieved intentionally, while selective rehearsal hypothesis assumed the memory representation of retrieved TBF (to-be-forgotten) items was weaker than TBR (to-be-remembered) items. Previous studies indicated that directed forgetting effects of item-cueing method resulted from selective rehearsal at encoding, but the mechanism of retrieval inhibition that affected directed forgetting of TBF (to-be-forgotten) items was not clear. Strategic retrieval is a control process allowing the selective retrieval of target information, which includes retrieval orientation and strategic recollection. Retrieval orientation via the comparison of tasks refers to the specific form of processing resulted by retrieval efforts. Strategic recollection is the type of strategies to recollect studied items for the retrieval success of targets. Using a “directed forgetting” paradigm combined with a memory exclusion task, our investigation of strategic retrieval in directed forgetting assisted to explore how retrieval inhibition played a role on directed forgetting effects. When TBF items were targeted, retrieval orientation showed more positive ERPs to new items, indicating that TBF items demanded more retrieval efforts. The results of strategic recollection indicated that: (a) when TBR items were retrieval targets, late parietal old/new effects were only evoked by TBR items but not TBF items, indicating the retrieval inhibition of TBF items; (b) when TBF items were retrieval targets, the late parietal old/new effect were evoked by both TBR items and TBF items, indicating that strategic retrieval could overcome retrieval inhibition of TBF items. These findings suggested the modulation of strategic retrieval on retrieval inhibition of directed forgetting, supporting that directed forgetting effects were not only caused by selective rehearsal, but also retrieval inhibition. PMID:28900411
Buck, Harleah G; Harkness, Karen; Ali, Muhammad Usman; Carroll, Sandra L; Kryworuchko, Jennifer; McGillion, Michael
2017-04-01
Caregivers (CGs) contribute important assistance with heart failure (HF) self-care, including daily maintenance, symptom monitoring, and management. Until CGs' contributions to self-care can be quantified, it is impossible to characterize it, account for its impact on patient outcomes, or perform meaningful cost analyses. The purpose of this study was to conduct psychometric testing and item reduction on the recently developed 34-item Caregiver Contribution to Heart Failure Self-care (CACHS) instrument using classical and item response theory methods. Fifty CGs (mean age 63 years ±12.84; 70% female) recruited from a HF clinic completed the CACHS in 2014 and results evaluated using classical test theory and item response theory. Items would be deleted for low (<.05) or high (>.95) endorsement, low (<.3) or high (>.7) corrected item-total correlations, significant pairwise correlation coefficients, floor or ceiling effects, relatively low latent trait and item information function levels (<1.5 and p > .5), and differential item functioning. After analysis, 14 items were excluded, resulting in a 20-item instrument (self-care maintenance eight items; monitoring seven items; and management five items). Most items demonstrated moderate to high discrimination (median 2.13, minimum .77, maximum 5.05), and appropriate item difficulty (-2.7 to 1.4). Internal consistency reliability was excellent (Cronbach α = .94, average inter-item correlation = .41) with no ceiling effects. The newly developed 20-item version of the CACHS is supported by rigorous instrument development and represents a novel instrument to measure CGs' contribution to HF self-care. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Zhao, Yue
2017-03-01
In patient-reported outcome research that utilizes item response theory (IRT), using statistical significance tests to detect misfit is usually the focus of IRT model-data fit evaluations. However, such evaluations rarely address the impact/consequence of using misfitting items on the intended clinical applications. This study was designed to evaluate the impact of IRT item misfit on score estimates and severity classifications and to demonstrate a recommended process of model-fit evaluation. Using secondary data sources collected from the Patient-Reported Outcome Measurement Information System (PROMIS) wave 1 testing phase, analyses were conducted based on PROMIS depression (28 items; 782 cases) and pain interference (41 items; 845 cases) item banks. The identification of misfitting items was assessed using Orlando and Thissen's summed-score item-fit statistics and graphical displays. The impact of misfit was evaluated according to the agreement of both IRT-derived T-scores and severity classifications between inclusion and exclusion of misfitting items. The examination of the presence and impact of misfit suggested that item misfit had a negligible impact on the T-score estimates and severity classifications with the general population sample in the PROMIS depression and pain interference item banks, implying that the impact of item misfit was insignificant. Findings support the T-score estimates in the two item banks as robust against item misfit at both the group and individual levels and add confidence to the use of T-scores for severity diagnosis in the studied sample. Recommendations on approaches for identifying item misfit (statistical significance) and assessing the misfit impact (practical significance) are given.
FIM-Minimum Data Set Motor Item Bank: Short Forms Development and Precision Comparison in Veterans.
Li, Chih-Ying; Romero, Sergio; Simpson, Annie N; Bonilha, Heather S; Simpson, Kit N; Hong, Ickpyo; Velozo, Craig A
2018-03-01
To improve the practical use of the short forms (SFs) developed from the item bank, we compared the measurement precision of the 4- and 8-item SFs generated from a motor item bank composed of the FIM and the Minimum Data Set (MDS). The FIM-MDS motor item bank allowed scores generated from different instruments to be co-calibrated. The 4- and 8-item SFs were developed based on Rasch analysis procedures. This article compared person strata, ceiling/floor effects, and test SE plots for each administration form and examined 95% confidence interval error bands of anchored person measures with the corresponding SFs. We used 0.3 SE as a criterion to reflect a reliability level of .90. Veterans' inpatient rehabilitation facilities and community living centers. Veterans (N=2500) who had both FIM and the MDS data within 6 days during 2008 through 2010. Not applicable. Four- and 8-item SFs of FIM, MDS, and FIM-MDS motor item bank. Six SFs were generated with 4 and 8 items across a range of difficulty levels from the FIM-MDS motor item bank. The three 8-item SFs all had higher correlations with the item bank (r=.82-.95), higher person strata, and less test error than the corresponding 4-item SFs (r=.80-.90). The three 4-item SFs did not meet the criteria of SE <0.3 for any theta values. Eight-item SFs could improve clinical use of the item bank composed of existing instruments across the continuum of care in veterans. We also found that the number of items, not test specificity, determines the precision of the instrument. Copyright © 2017 American Congress of Rehabilitation Medicine. All rights reserved.
Haverman, Lotte; Grootenhuis, Martha A; Raat, Hein; van Rossum, Marion A J; van Dulmen-den Broeder, Eline; Hoppenbrouwers, Karel; Correia, Helena; Cella, David; Roorda, Leo D; Terwee, Caroline B
2016-03-01
The Patient-Reported Outcomes Measurement Information System (PROMIS(®)) is a new, state-of-the-art assessment system for measuring patient-reported health and well-being of adults and children. It has the potential to be more valid, reliable, and responsive than existing PROMs. The items banks are designed to be self-reported and completed by children aged 8-18 years. The PROMIS items can be administered in short forms or through computerized adaptive testing. This paper describes the translation and cultural adaption of nine PROMIS item banks (151 items) for children in Dutch-Flemish. The translation was performed by FACITtrans using standardized PROMIS methodology and approved by the PROMIS Statistical Center. The translation included four forward translations, two back-translations, three independent reviews (at least two Dutch, one Flemish), and pretesting in 24 children from the Netherlands and Flanders. For some items, it was necessary to have separate translations for Dutch and Flemish: physical function-mobility (three items), anger (one item), pain interference (two items), and asthma impact (one item). Challenges faced in the translation process included scarcity or overabundance of possible translations, unclear item descriptions, constructs broader/smaller in the target language, difficulties in rank ordering items, differences in unit of measurement, irrelevant items, or differences in performance of activities. By addressing these challenges, acceptable translations were obtained for all items. The Dutch-Flemish PROMIS items are linguistically equivalent to the original USA version. Short forms are now available for use, and entire item banks are ready for cross-cultural validation in the Netherlands and Flanders.
Approximation Preserving Reductions among Item Pricing Problems
NASA Astrophysics Data System (ADS)
Hamane, Ryoso; Itoh, Toshiya; Tomita, Kouhei
When a store sells items to customers, the store wishes to determine the prices of the items to maximize its profit. Intuitively, if the store sells the items with low (resp. high) prices, the customers buy more (resp. less) items, which provides less profit to the store. So it would be hard for the store to decide the prices of items. Assume that the store has a set V of n items and there is a set E of m customers who wish to buy those items, and also assume that each item i ∈ V has the production cost di and each customer ej ∈ E has the valuation vj on the bundle ej ⊆ V of items. When the store sells an item i ∈ V at the price ri, the profit for the item i is pi = ri - di. The goal of the store is to decide the price of each item to maximize its total profit. We refer to this maximization problem as the item pricing problem. In most of the previous works, the item pricing problem was considered under the assumption that pi ≥ 0 for each i ∈ V, however, Balcan, et al. [In Proc. of WINE, LNCS 4858, 2007] introduced the notion of “loss-leader, ” and showed that the seller can get more total profit in the case that pi < 0 is allowed than in the case that pi < 0 is not allowed. In this paper, we derive approximation preserving reductions among several item pricing problems and show that all of them have algorithms with good approximation ratio.
Paap, Muirne C S; Kroeze, Karel A; Terwee, Caroline B; van der Palen, Job; Veldkamp, Bernard P
2017-11-01
Examining item usage is an important step in evaluating the performance of a computerized adaptive test (CAT). We study item usage for a newly developed multidimensional CAT which draws items from three PROMIS domains, as well as a disease-specific one. The multidimensional item bank used in the current study contained 194 items from four domains: the PROMIS domains fatigue, physical function, and ability to participate in social roles and activities, and a disease-specific domain (the COPD-SIB). The item bank was calibrated using the multidimensional graded response model and data of 795 patients with chronic obstructive pulmonary disease. To evaluate the item usage rates of all individual items in our item bank, CAT simulations were performed on responses generated based on a multivariate uniform distribution. The outcome variables included active bank size and item overuse (usage rate larger than the expected item usage rate). For average θ-values, the overall active bank size was 9-10%; this number quickly increased as θ-values became more extreme. For values of -2 and +2, the overall active bank size equaled 39-40%. There was 78% overlap between overused items and active bank size for average θ-values. For more extreme θ-values, the overused items made up a much smaller part of the active bank size: here the overlap was only 35%. Our results strengthen the claim that relatively short item banks may suffice when using polytomous items (and no content constraints/exposure control mechanisms), especially when using MCAT.
Cohen, Matthew L; Kisala, Pamela A; Dyson-Hudson, Trevor A; Tulsky, David S
2018-05-01
To develop modern patient-reported outcome measures that assess pain interference and pain behavior after spinal cord injury (SCI). Grounded-theory based qualitative item development; large-scale item calibration field-testing; confirmatory factor analyses; graded response model item response theory analyses; statistical linking techniques to transform scores to the Patient Reported Outcome Measurement Information System (PROMIS) metric. Five SCI Model Systems centers and one Department of Veterans Affairs medical center in the United States. Adults with traumatic SCI. N/A. Spinal Cord Injury - Quality of Life (SCI-QOL) Pain Interference item bank, SCI-QOL Pain Interference short form, and SCI-QOL Pain Behavior scale. Seven hundred fifty-seven individuals with traumatic SCI completed 58 items addressing various aspects of pain. Items were then separated by whether they assessed pain interference or pain behavior, and poorly functioning items were removed. Confirmatory factor analyses confirmed that each set of items was unidimensional, and item response theory analyses were used to estimate slopes and thresholds for the items. Ultimately, 7 items (4 from PROMIS) comprised the Pain Behavior scale and 25 items (18 from PROMIS) comprised the Pain Interference item bank. Ten of these 25 items were selected to form the Pain Interference short form. The SCI-QOL Pain Interference item bank and the SCI-QOL Pain Behavior scale demonstrated robust psychometric properties. The Pain Interference item bank is available as a computer adaptive test or short form for research and clinical applications, and scores are transformed to the PROMIS metric.
Heinemann, Allen W; Kisala, Pamela A; Hahn, Elizabeth A; Tulsky, David S
2015-05-01
To develop a spinal cord injury (SCI)-focused version of PROMIS and Neuro-QOL social domain item banks; evaluate the psychometric properties of items developed for adults with SCI; and report information to facilitate clinical and research use. We used a mixed-methods design to develop and evaluate Ability to Participate in Social Roles and Activities and Satisfaction with Social Roles and Activities items. Focus groups helped define the constructs; cognitive interviews helped revise items; and confirmatory factor analysis and item response theory methods helped calibrate item banks and evaluate differential item functioning related to demographic and injury characteristics. Five SCI Model System sites and one Veterans Administration medical center. The calibration sample consisted of 641 individuals; a reliability sample consisted of 245 individuals residing in the community. A subset of 27 Ability to Participate and 35 Satisfaction items demonstrated good measurement properties and negligible differential item functioning related to demographic and injury characteristics. The SCI-specific measures correlate strongly with the PROMIS and Neuro-QOL versions. Ten item short forms correlate >0.96 with the full banks. Variable-length CATs with a minimum of 4 items, variable-length CATs with a minimum of 8 items, fixed-length CATs of 10 items, and the 10-item short forms demonstrate construct coverage and measurement error that is comparable to the full item bank. The Ability to Participate and Satisfaction with Social Roles and Activities CATs and short forms demonstrate excellent psychometric properties and are suitable for clinical and research applications.
Toward a More Systematic Assessment of Smoking: Development of a Smoking Module for PROMIS®
Tucker, Joan S.; Shadel, William G.; Stucky, Brian D.; Cai, Li
2012-01-01
Introduction The aim of the PROMIS® Smoking Initiative is to develop, evaluate, and standardize item banks to assess cigarette smoking behavior and biopsychosocial constructs associated with smoking for both daily and non-daily smokers. Methods We used qualitative methods to develop the item pool (following the PROMIS® approach: e.g., literature search, “binning and winnowing” of items, and focus groups and cognitive interviews to finalize wording and format), and quantitative methods (e.g., factor analysis) to develop the item banks. Results We considered a total of 1622 extant items, and 44 new items for inclusion in the smoking item banks. A final set of 277 items representing 11 conceptual domains was selected for field testing in a national sample of smokers. Using data from 3021 daily smokers in the field test, an iterative series of exploratory factor analyses and project team discussions resulted in six item banks: Positive Consequences of Smoking (40 items), Smoking Dependence/Craving (55 items), Health Consequences of Smoking (26 items), Psychosocial Consequences of Smoking (37 items), Coping Aspects of Smoking (30 items), and Social Factors of Smoking (23 items). Conclusions Inclusion of a smoking domain in the PROMIS® framework will standardize measurement of key smoking constructs using state-of-the-art psychometric methods, and make them widely accessible to health care providers, smoking researchers and the large community of researchers using PROMIS® who might not otherwise include an assessment of smoking in their design. Next steps include reducing the number of items in each domain, conducting confirmatory analyses, and duplicating the process for non-daily smokers. PMID:22770824
Toward a more systematic assessment of smoking: development of a smoking module for PROMIS®.
Edelen, Maria O; Tucker, Joan S; Shadel, William G; Stucky, Brian D; Cai, Li
2012-11-01
The aim of the PROMIS® Smoking Initiative is to develop, evaluate, and standardize item banks to assess cigarette smoking behavior and biopsychosocial constructs associated with smoking for both daily and non-daily smokers. We used qualitative methods to develop the item pool (following the PROMIS® approach: e.g., literature search, "binning and winnowing" of items, and focus groups and cognitive interviews to finalize wording and format), and quantitative methods (e.g., factor analysis) to develop the item banks. We considered a total of 1622 extant items, and 44 new items for inclusion in the smoking item banks. A final set of 277 items representing 11 conceptual domains was selected for field testing in a national sample of smokers. Using data from 3021 daily smokers in the field test, an iterative series of exploratory factor analyses and project team discussions resulted in six item banks: Positive Consequences of Smoking (40 items), Smoking Dependence/Craving (55 items), Health Consequences of Smoking (26 items), Psychosocial Consequences of Smoking (37 items), Coping Aspects of Smoking (30 items), and Social Factors of Smoking (23 items). Inclusion of a smoking domain in the PROMIS® framework will standardize measurement of key smoking constructs using state-of-the-art psychometric methods, and make them widely accessible to health care providers, smoking researchers and the large community of researchers using PROMIS® who might not otherwise include an assessment of smoking in their design. Next steps include reducing the number of items in each domain, conducting confirmatory analyses, and duplicating the process for non-daily smokers. Copyright © 2012 Elsevier Ltd. All rights reserved.
A Stepwise Test Characteristic Curve Method to Detect Item Parameter Drift
ERIC Educational Resources Information Center
Guo, Rui; Zheng, Yi; Chang, Hua-Hua
2015-01-01
An important assumption of item response theory is item parameter invariance. Sometimes, however, item parameters are not invariant across different test administrations due to factors other than sampling error; this phenomenon is termed item parameter drift. Several methods have been developed to detect drifted items. However, most of the…
Optimal Item Selection with Credentialing Examinations.
ERIC Educational Resources Information Center
Hambleton, Ronald K.; And Others
The study compared two promising item response theory (IRT) item-selection methods, optimal and content-optimal, with two non-IRT item selection methods, random and classical, for use in fixed-length certification exams. The four methods were used to construct 20-item exams from a pool of approximately 250 items taken from a 1985 certification…
An NCME Instructional Module on Polytomous Item Response Theory Models
ERIC Educational Resources Information Center
Penfield, Randall David
2014-01-01
A polytomous item is one for which the responses are scored according to three or more categories. Given the increasing use of polytomous items in assessment practices, item response theory (IRT) models specialized for polytomous items are becoming increasingly common. The purpose of this ITEMS module is to provide an accessible overview of…
77 FR 26791 - Records Schedules; Availability and Request for Comments
Federal Register 2010, 2011, 2012, 2013, 2014
2012-05-07
...-374- 09-7, 1 item, 1 temporary item). Master files of an electronic information system containing...-2012-0006, 1 item, 1 temporary item). Master files of an electronic information system used to document...-0001, 2 items, 2 temporary items). Master files and outputs of an electronic information system used to...
ERIC Educational Resources Information Center
Spaan, Mary
2007-01-01
This article follows the development of test items (see "Language Assessment Quarterly", Volume 3 Issue 1, pp. 71-79 for the article "Test and Item Specifications Development"), beginning with a review of test and item specifications, then proceeding to writing and editing of items, pretesting and analysis, and finally selection of an item for a…
ERIC Educational Resources Information Center
Hewitt, Margaret A.; Homan, Susan P.
2004-01-01
Test validity issues considered by test developers and school districts rarely include individual item readability levels. In this study, items from a major standardized test were examined for individual item readability level and item difficulty. The Homan-Hewitt Readability Formula was applied to items across three grade levels. Results of…
Item Information and Discrimination Functions for Trinary PCM Items.
ERIC Educational Resources Information Center
Akkermans, Wies; Muraki, Eiji
1997-01-01
For trinary partial credit items, the shape of the item information and item discrimination functions is examined in relation to the item parameters. Conditions under which these functions are unimodal and bimodal are discussed, and the locations and values of maxima are derived. Practical relevance of the results is discussed. (SLD)
Shen, Linjun; Li, Feiming; Wattleworth, Roberta; Filipetto, Frank
2010-10-01
The Comprehensive Osteopathic Medical Licensing Examination conducted a trial of multimedia items in the 2008-2009 Level 3 testing cycle to determine (1) if multimedia items were able to test additional elements of medical knowledge and skills and (2) how to develop effective multimedia items. Forty-four content-matched multimedia and text multiple-choice items were randomly delivered to Level 3 candidates. Logistic regression and paired-samples t tests were used for pairwise and group-level comparisons, respectively. Nine pairs showed significant differences in either difficulty or/and discrimination. Content analysis found that, if text narrations were less direct, multimedia materials could make items easier. When textbook terminologies were replaced by multimedia presentations, multimedia items could become more difficult. Moreover, a multimedia item was found not uniformly difficult for candidates at different ability levels, possibly because multimedia and text items tested different elements of a same concept. Multimedia items may be capable of measuring some constructs different from what text items can measure. Effective multimedia items with reasonable psychometric properties can be intentionally developed.
Measurement of self-evaluative motives: a shopping scenario.
Wajda, Theresa A; Kolbe, Richard; Hu, Michael Y; Cui, Annie Peng
2008-08-01
To develop measures of consumers' self-evaluative motives of Self-verification, Self-enhancement, and Self-improvement within the context of a mall shopping environment, an initial set of 49 items was generated by conducting three focus-group sessions. These items were subsequently converted into shopping-dependent motive statements. 250 undergraduate college students responded on a 7-point scale to each statement as these related to the acquisition of recent personal shopping goods. An exploratory factor analysis yielded five factors, accounting for 57.7% of the variance, three of which corresponded to the Self-verification motive (five items), Self-enhancement motive (three items), and Self-improvement motive (six items). These 14 items, along with 9 reconstructed items, yielded 23 items retained and subjected to additional testing. In a final round of data collection, 169 college students provided data for exploratory factor analysis. 11 items were used in confirmatory factor analysis. Analysis indicated that the 11-item scale adequately captured measures of the three self-evaluative motives. However, further data reduction produced a 9-item scale with marked improvement in statistical fit over the 11-item scale.
Oberauer, Klaus; Farrell, Simon; Jarrold, Christopher; Pasiecznik, Kazimir; Greaves, Martin
2012-05-01
Four experiments examined the effect of phonological similarity between items and distractors on complex span performance. Item-distractor similarity benefited serial recall when distractors followed the items they were similar to, but not when distractors preceded the items they were similar to. These findings are predicted by C-SOB (contextual serial order in a box), a computational model of complex span. The model assumes that distractors are involuntarily encoded into memory, being associated to the preceding item's list position. Distractors interfere with items by superposition of distributed representations that are associated to the same position. Superposition distorts item memory; this distortion is less severe when the distractor is similar to the item. Further support for the assumption that distractors are encoded at the position of the preceding item comes from the finding that intrusions of distractors at recall tended to come from the position of the target item. In addition, intruding distractors tend to replace items to which they are similar, showing that lack of distinctiveness also contributes to interference. (c) 2012 APA, all rights reserved.
Further evaluation of leisure items in the attention condition of functional analyses.
Roscoe, Eileen M; Carreau, Abbey; MacDonald, Jackie; Pence, Sacha T
2008-01-01
Research suggests that including leisure items in the attention condition of a functional analysis may produce engagement that masks sensitivity to attention. In this study, 4 individuals' initial functional analyses indicated that behavior was maintained by nonsocial variables (n = 3) or by attention (n = 1). A preference assessment was used to identify items for subsequent functional analyses. Four conditions were compared, attention with and without leisure items and control with and without leisure items. Following this, either high- or low-preference items were included in the attention condition. Problem behavior was more probable during the attention condition when no leisure items or low-preference items were included, and lower levels of problem behavior were observed during the attention condition when high-preference leisure items were included. These findings suggest how preferred items may hinder detection of behavioral function.
Stochl, Jan; Böhnke, Jan R; Pickett, Kate E; Croudace, Tim J
2016-05-20
Recent developments in psychometric modeling and technology allow pooling well-validated items from existing instruments into larger item banks and their deployment through methods of computerized adaptive testing (CAT). Use of item response theory-based bifactor methods and integrative data analysis overcomes barriers in cross-instrument comparison. This paper presents the joint calibration of an item bank for researchers keen to investigate population variations in general psychological distress (GPD). Multidimensional item response theory was used on existing health survey data from the Scottish Health Education Population Survey (n = 766) to calibrate an item bank consisting of pooled items from the short common mental disorder screen (GHQ-12) and the Affectometer-2 (a measure of "general happiness"). Computer simulation was used to evaluate usefulness and efficacy of its adaptive administration. A bifactor model capturing variation across a continuum of population distress (while controlling for artefacts due to item wording) was supported. The numbers of items for different required reliabilities in adaptive administration demonstrated promising efficacy of the proposed item bank. Psychometric modeling of the common dimension captured by more than one instrument offers the potential of adaptive testing for GPD using individually sequenced combinations of existing survey items. The potential for linking other item sets with alternative candidate measures of positive mental health is discussed since an optimal item bank may require even more items than these.
Psychological distress in cancer survivors: the further development of an item bank.
Smith, Adam B; Armes, Jo; Richardson, Alison; Stark, Dan P
2013-02-01
Assessment of psychological distress by patient report is necessary to meet patients' needs throughout the cancer journey. We have previously developed an item bank to assess psychological distress but not evaluated it for cancer survivors. Our first aim in this study was to test whether we could extend our item bank to include cancer survivors. The second aim was to examine whether the item bank could assess positive affect as a single construct alongside negative psychological symptoms. Responses from 1315 cancer survivors to the Hospital Anxiety and Depression Scale (HADS) and the Positive and Negative Affect Scale (PANAS) were considered for inclusion in a pre-existing item bank created from a heterogeneous sample of 4914 cancer patients. Differential item functioning (DIF) was used to assess whether HADS responses drawn from the two samples were equivalent. Common-item equating was used to anchor the shared (HADS) items, whilst the PANAS items were added. Item fit was evaluated at each stage, and misfitting items were removed. Unidimensionality was assessed with a principal components factor analysis. The DIF analysis did not reveal any differences between the HADS item locations from the two samples. Three misfitting PANAS items were removed, resulting in a final unidimensional bank of 80 items with good internal reliability (α = 0.85). The new item bank is valid for use across the cancer journey, including cancer survivors, and modestly improves the assessment of all levels of psychological distress and positive psychological function. Copyright © 2011 John Wiley & Sons, Ltd.
Determining an Imaging Literacy Curriculum for Radiation Oncologists: An International Delphi Study
DOE Office of Scientific and Technical Information (OSTI.GOV)
Giuliani, Meredith E., E-mail: Meredith.Giuliani@rmp.uhn.on.ca; Department of Radiation Oncology, University of Toronto, Toronto, Ontario; Gillan, Caitlin
2014-03-15
Purpose: Rapid evolution of imaging technologies and their integration into radiation therapy practice demands that radiation oncology (RO) training curricula be updated. The purpose of this study was to develop an entry-to-practice image literacy competency profile. Methods and Materials: A list of 263 potential imaging competency items were assembled from international objectives of training. Expert panel eliminated redundant or irrelevant items to create a list of 97 unique potential competency items. An international 2-round Delphi process was conducted with experts in RO. In round 1, all experts scored, on a 9-point Likert scale, the degree to which they agreed anmore » item should be included in the competency profile. Items with a mean score ≥7 were included, those 4 to 6 were reviewed in round 2, and items scored <4 were excluded. In round 2, items were discussed and subsequently ranked for inclusion or exclusion in the competency profile. Items with >75% voting for inclusion were included in the final competency profile. Results: Forty-nine radiation oncologists were invited to participate in round 1, and 32 (65%) did so. Participants represented 24 centers in 6 countries. Of the 97 items ranked in round 1, 80 had a mean score ≥7, 1 item had a score <4, and 16 items with a mean score of 4 to 6 were reviewed and rescored in round 2. In round 2, 4 items had >75% of participants voting for inclusion and were included; the remaining 12 were excluded. The final list of 84 items formed the final competency profile. The 84 enabling competency items were aggregated into the following 4 thematic groups of key competencies: (1) imaging fundamentals (42 items); (2) clinical application (27 items); (3) clinical management (5 items); and (4) professional practice (10 items). Conclusions: We present an imaging literacy competency profile which could constitute the minimum training standards in radiation oncology residency programs.« less
NASA Astrophysics Data System (ADS)
Chen, Juan; Ma, Guosheng
2018-02-01
Curriculum is the means to cultivate higher vocational talents. On the basis of analyzing the core curriculum problems of curriculum reform and Agro-ecological environmental specialties in higher vocational colleges, this paper puts forward the optimization and integration measures of 6 core courses, including “Eco-environment Repair Technology”, “Agro-environmental Management Plan”, “Environmental Engineering Design”, “Environmental Pest Management Technology”, “Agro-chemical Pollution Control Technology”, “Agro-environmental Testing and Analysis”. It integrates the vocational qualification certificate education and professional induction certificate training items, and enhances the adaptability, skills and professionalism of professional core curriculum.
Item Analyses of Memory Differences
Salthouse, Timothy A.
2017-01-01
Objective Although performance on memory and other cognitive tests is usually assessed with a score aggregated across multiple items, potentially valuable information is also available at the level of individual items. Method The current study illustrates how analyses of variance with item as one of the factors, and memorability analyses in which item accuracy in one group is plotted as a function of item accuracy in another group, can provide a more detailed characterization of the nature of group differences in memory. Data are reported for two memory tasks, word recall and story memory, across age, ability, repetition, delay, and longitudinal contrasts. Results The item-level analyses revealed evidence for largely uniform differences across items in the age, ability, and longitudinal contrasts, but differential patterns across items in the repetition contrast, and unsystematic item relations in the delay contrast. Conclusion Analyses at the level of individual items have the potential to indicate the manner by which group differences in the aggregate test score are achieved. PMID:27618285
Recall dynamics reveal the retrieval of emotional context.
Long, Nicole M; Danoff, Michelle S; Kahana, Michael J
2015-10-01
Memory is often better for emotional rather than neutral stimuli. The benefit for emotional items could be the result of an associative mechanism whereby items are associated to a slowly updating context. Through this process, emotional features are integrated with context during study, and are reactivated during test. The presence of emotion in context would both provide a stronger retrieval cue, enhancing memory of emotional items, as well as lead to emotional clustering, whereby emotionally similar items are recalled consecutively. To measure whether associative mechanisms can explain the enhancement for emotional items, we conducted a free recall study in which most items were emotionally neutral to minimize effects of mood induction and to more closely reflect naturalistic settings. We found that emotional items were significantly more likely to be recalled than neutral items and that participants were more likely to transition between emotional items rather than between emotional and neutral items. Together, these results suggest that contextual encoding and retrieval mechanisms may drive the benefit for emotional items both within and outside the laboratory.
Paz, Sylvia H; Spritzer, Karen L; Morales, Leo S; Hays, Ron D
2013-03-29
To evaluate the equivalence of the PROMIS® wave 1 physical functioning item bank, by age (50 years or older versus 18-49). A total of 114 physical functioning items with 5 response choices were administered to English- (n=1504) and Spanish-language (n=640) adults. Item frequencies, means and standard deviations, item-scale correlations, and internal consistency reliability were estimated. Differential Item Functioning (DIF) by age was evaluated. Thirty of the 114 items were fagged for DIF based on an R-squared of 0.02 or above criterion. The expected total score was higher for those respondents who were 18-49 than those who were 50 or older. Those who were 50 years or older versus 18-49 years old with the same level of physical functioning responded differently to 30 of the 114 items in the PROMIS® physical functioning item bank. This study yields essential information about the equivalence of the physical functioning items in older versus younger individuals.
Haberman, Shelby J; Sinharay, Sandip; Chon, Kyong Hee
2013-07-01
Residual analysis (e.g. Hambleton & Swaminathan, Item response theory: principles and applications, Kluwer Academic, Boston, 1985; Hambleton, Swaminathan, & Rogers, Fundamentals of item response theory, Sage, Newbury Park, 1991) is a popular method to assess fit of item response theory (IRT) models. We suggest a form of residual analysis that may be applied to assess item fit for unidimensional IRT models. The residual analysis consists of a comparison of the maximum-likelihood estimate of the item characteristic curve with an alternative ratio estimate of the item characteristic curve. The large sample distribution of the residual is proved to be standardized normal when the IRT model fits the data. We compare the performance of our suggested residual to the standardized residual of Hambleton et al. (Fundamentals of item response theory, Sage, Newbury Park, 1991) in a detailed simulation study. We then calculate our suggested residuals using data from an operational test. The residuals appear to be useful in assessing the item fit for unidimensional IRT models.
The measurement of psychological literacy: a first approximation
Roberts, Lynne D.; Heritage, Brody; Gasson, Natalie
2015-01-01
Psychological literacy, the ability to apply psychological knowledge to personal, family, occupational, community and societal challenges, is promoted as the primary outcome of an undergraduate education in psychology. As the concept of psychological literacy becomes increasingly adopted as the core business of undergraduate psychology training courses world-wide, there is urgent need for the construct to be accurately measured so that student and institutional level progress can be assessed and monitored. Key to the measurement of psychological literacy is determining the underlying factor-structure of psychological literacy. In this paper we provide a first approximation of the measurement of psychological literacy by identifying and evaluating self-report measures for psychological literacy. Multi-item and single-item self-report measures of each of the proposed nine dimensions of psychological literacy were completed by two samples (N = 218 and N = 381) of undergraduate psychology students at an Australian university. Single and multi-item measures of each dimension were weakly to moderately correlated. Exploratory and confirmatory factor analyses of multi-item measures indicated a higher order three factor solution best represented the construct of psychological literacy. The three factors were reflective processes, generic graduate attributes, and psychology as a helping profession. For the measurement of psychological literacy to progress there is a need to further develop self-report measures and to identify/develop and evaluate objective measures of psychological literacy. Further approximations of the measurement of psychological literacy remain an imperative, given the construct's ties to measuring institutional efficacy in teaching psychology to an undergraduate audience. PMID:25741300
Development of the International Spinal Cord Injury Activities and Participation Basic Data Set.
Post, M W; Charlifue, S; Biering-Sørensen, F; Catz, A; Dijkers, M P; Horsewell, J; Noonan, V K; Noreau, L; Tate, D G; Sinnott, K A
2016-07-01
Consensus decision-making process. The objective of this study was to develop an International Spinal Cord Injury (SCI) Activities and Participation (A&P) Basic Data Set. International working group. A committee of experts was established to select and define A&P data elements to be included in this data set. A draft data set was developed and posted on the International Spinal Cord Society (ISCoS) and American Spinal Injury Association websites and was also disseminated among appropriate organizations for review. Suggested revisions were considered, and a final version of the A&P Data Set was completed. Consensus was reached to define A&P and to incorporate both performance and satisfaction ratings. Items that were considered core to each A&P domain were selected from two existing questionnaires. Four items measuring activities were selected from the Spinal Cord Independence Measure III to provide basic data on task execution in activities of daily living. Eight items were selected from the Craig Handicap Assessment and Reporting Technique to provide basic data on the frequency of participation. An additional rating of satisfaction on a three-point scale for each item completes the total of 24 A&P variables. Collection of the International SCI A&P Basic Data Set variables in all future research on SCI outcomes is advised to facilitate comparison of results across published studies from around the world. Additional standardised instruments to assess activities of daily living or participation can be administered, depending on the purpose of a particular study.
Using a Linear Regression Method to Detect Outliers in IRT Common Item Equating
ERIC Educational Resources Information Center
He, Yong; Cui, Zhongmin; Fang, Yu; Chen, Hanwei
2013-01-01
Common test items play an important role in equating alternate test forms under the common item nonequivalent groups design. When the item response theory (IRT) method is applied in equating, inconsistent item parameter estimates among common items can lead to large bias in equated scores. It is prudent to evaluate inconsistency in parameter…
ERIC Educational Resources Information Center
He, Yong
2013-01-01
Common test items play an important role in equating multiple test forms under the common-item nonequivalent groups design. Inconsistent item parameter estimates among common items can lead to large bias in equated scores for IRT true score equating. Current methods extensively focus on detection and elimination of outlying common items, which…
ERIC Educational Resources Information Center
Lee, HwaYoung; Dodd, Barbara G.
2012-01-01
This study investigated item exposure control procedures under various combinations of item pool characteristics and ability distributions in computerized adaptive testing based on the partial credit model. Three variables were manipulated: item pool characteristics (120 items for each of easy, medium, and hard item pools), two ability…
ERIC Educational Resources Information Center
Zickar, Michael J.; Ury, Karen L.
2002-01-01
Attempted to relate content features of personality items to item parameter estimates from the partial credit model of E. Muraki (1990) by administering the Adjective Checklist (L. Goldberg, 1992) to 329 undergraduates. As predicted, the discrimination parameter was related to the item subtlety ratings of personality items but the level of word…
Effects of Ignoring Item Interaction on Item Parameter Estimation and Detection of Interacting Items
ERIC Educational Resources Information Center
Chen, Cheng-Te; Wang, Wen-Chung
2007-01-01
This study explores the effects of ignoring item interaction on item parameter estimation and the efficiency of using the local dependence index Q[subscript 3] and the SAS NLMIXED procedure to detect item interaction under the three-parameter logistic model and the generalized partial credit model. Through simulations, it was found that ignoring…
ERIC Educational Resources Information Center
Fukuhara, Hirotaka; Kamata, Akihito
2011-01-01
A differential item functioning (DIF) detection method for testlet-based data was proposed and evaluated in this study. The proposed DIF model is an extension of a bifactor multidimensional item response theory (MIRT) model for testlets. Unlike traditional item response theory (IRT) DIF models, the proposed model takes testlet effects into…
Item Response Models for Examinee-Selected Items
ERIC Educational Resources Information Center
Wang, Wen-Chung; Jin, Kuan-Yu; Qiu, Xue-Lan; Wang, Lei
2012-01-01
In some tests, examinees are required to choose a fixed number of items from a set of given items to answer. This practice creates a challenge to standard item response models, because more capable examinees may have an advantage by making wiser choices. In this study, we developed a new class of item response models to account for the choice…
ERIC Educational Resources Information Center
Scheuneman, Janice Dowd; Gerritz, Kalle
1990-01-01
Differential item functioning (DIF) methodology for revealing sources of item difficulty and performance characteristics of different groups was explored. A total of 150 Scholastic Aptitude Test items and 132 Graduate Record Examination general test items were analyzed. DIF was evaluated for males and females and Blacks and Whites. (SLD)
Item Structural Properties as Predictors of Item Difficulty and Item Association.
ERIC Educational Resources Information Center
Solano-Flores, Guillermo
1993-01-01
Studied the ability of logical test design (LTD) to predict student performance in reading Roman numerals for 211 sixth graders in Mexico City tested on Roman numeral items varying on LTD-related and non-LTD-related variables. The LTD-related variable item iterativity was found to be the best predictor of item difficulty. (SLD)
A Note on Item-Restscore Association in Rasch Models
ERIC Educational Resources Information Center
Kreiner, Svend
2011-01-01
To rule out the need for a two-parameter item response theory (IRT) model during item analysis by Rasch models, it is important to check the Rasch model's assumption that all items have the same item discrimination. Biserial and polyserial correlation coefficients measuring the association between items and restscores are often used in an informal…
Investigating Item Exposure Control Methods in Computerized Adaptive Testing
ERIC Educational Resources Information Center
Ozturk, Nagihan Boztunc; Dogan, Nuri
2015-01-01
This study aims to investigate the effects of item exposure control methods on measurement precision and on test security under various item selection methods and item pool characteristics. In this study, the Randomesque (with item group sizes of 5 and 10), Sympson-Hetter, and Fade-Away methods were used as item exposure control methods. Moreover,…
The Consequences of Ignoring Item Parameter Drift in Longitudinal Item Response Models
ERIC Educational Resources Information Center
Lee, Wooyeol; Cho, Sun-Joo
2017-01-01
Utilizing a longitudinal item response model, this study investigated the effect of item parameter drift (IPD) on item parameters and person scores via a Monte Carlo study. Item parameter recovery was investigated for various IPD patterns in terms of bias and root mean-square error (RMSE), and percentage of time the 95% confidence interval covered…
ERIC Educational Resources Information Center
Lee, Woo-yeol; Cho, Sun-Joo
2017-01-01
Cross-level invariance in a multilevel item response model can be investigated by testing whether the within-level item discriminations are equal to the between-level item discriminations. Testing the cross-level invariance assumption is important to understand constructs in multilevel data. However, in most multilevel item response model…
An NCME Instructional Module on Latent DIF Analysis Using Mixture Item Response Models
ERIC Educational Resources Information Center
Cho, Sun-Joo; Suh, Youngsuk; Lee, Woo-yeol
2016-01-01
The purpose of this ITEMS module is to provide an introduction to differential item functioning (DIF) analysis using mixture item response models. The mixture item response models for DIF analysis involve comparing item profiles across latent groups, instead of manifest groups. First, an overview of DIF analysis based on latent groups, called…
17 CFR 240.17Ad-1 - Definitions.
Code of Federal Regulations, 2011 CFR
2011-04-01
... or mails the item to, or the item is awaiting pick-up by, the presentor or a person designated by the... transfer agent dispatches or mails the item to, or the item is awaiting pick-up by, the outside registrar... registrar dispatches or mails the item to, or the item is awaiting pick-up by, the presenting transfer agent...
17 CFR 240.17Ad-1 - Definitions.
Code of Federal Regulations, 2010 CFR
2010-04-01
... or mails the item to, or the item is awaiting pick-up by, the presentor or a person designated by the... transfer agent dispatches or mails the item to, or the item is awaiting pick-up by, the outside registrar... registrar dispatches or mails the item to, or the item is awaiting pick-up by, the presenting transfer agent...
48 CFR 22.1803 - Contract clause.
Code of Federal Regulations, 2010 CFR
2010-10-01
... COTS items, but for minor modifications (as defined at paragraph (3)(ii) of the definition of “commercial item” at 2.101); (3) Items that would be COTS items if they were not bulk cargo; or (4) Commercial services that are— (i) Part of the purchase of a COTS item (or an item that would be a COTS item, but for...
Item Pool Design for an Operational Variable-Length Computerized Adaptive Test
ERIC Educational Resources Information Center
He, Wei; Reckase, Mark D.
2014-01-01
For computerized adaptive tests (CATs) to work well, they must have an item pool with sufficient numbers of good quality items. Many researchers have pointed out that, in developing item pools for CATs, not only is the item pool size important but also the distribution of item parameters and practical considerations such as content distribution…
Classical Item Analysis Using Latent Variable Modeling: A Note on a Direct Evaluation Procedure
ERIC Educational Resources Information Center
Raykov, Tenko; Marcoulides, George A.
2011-01-01
A directly applicable latent variable modeling procedure for classical item analysis is outlined. The method allows one to point and interval estimate item difficulty, item correlations, and item-total correlations for composites consisting of categorical items. The approach is readily employed in empirical research and as a by-product permits…
Item Estimates under Low-Stakes Conditions: How Should Omits Be Treated?
ERIC Educational Resources Information Center
DeMars, Christine
Using data from a pilot test of science and math from students in 30 high schools, item difficulties were estimated with a one-parameter model (partial-credit model for the multi-point items). Some items were multiple-choice items, and others were constructed-response items (open-ended). Four sets of estimates were obtained: estimates for males…
ERIC Educational Resources Information Center
Tay, Louis; Vermunt, Jeroen K.; Wang, Chun
2013-01-01
We evaluate the item response theory with covariates (IRT-C) procedure for assessing differential item functioning (DIF) without preknowledge of anchor items (Tay, Newman, & Vermunt, 2011). This procedure begins with a fully constrained baseline model, and candidate items are tested for uniform and/or nonuniform DIF using the Wald statistic.…
Student Personality Differences Are Related to Their Responses on Instructor Evaluation Forms
ERIC Educational Resources Information Center
McCann, Stewart; Gardner, Christopher
2014-01-01
The relation of student personality to student evaluations of teaching (SETs) was determined in a sample of 144 undergraduates. Student Big Five personality variables and core self-evaluation (CSE) were assessed. Students rated their most preferred instructor (MPI) and least preferred instructor (LPI) on 11 common evaluation items. Pearson and…
Predicting Lexical Proficiency in Language Learner Texts Using Computational Indices
ERIC Educational Resources Information Center
Crossley, Scott A.; Salsbury, Tom; McNamara, Danielle S.; Jarvis, Scott
2011-01-01
The authors present a model of lexical proficiency based on lexical indices related to vocabulary size, depth of lexical knowledge, and accessibility to core lexical items. The lexical indices used in this study come from the computational tool Coh-Metrix and include word length scores, lexical diversity values, word frequency counts, hypernymy…
The Development and Validation of a Learning Progression for Argumentation in Science
ERIC Educational Resources Information Center
Osborne, Jonathan F.; Henderson, J. Bryan; MacPherson, Anna; Szu, Evan; Wild, Andrew; Yao, Shi-Ying
2016-01-01
Given the centrality of argumentation in the Next Generation Science Standards, there is an urgent need for an empirically validated learning progression of this core practice and the development of high-quality assessment items. Here, we introduce a hypothesized three-tiered learning progression for scientific argumentation. The learning…
Lenard, N R; Zheng, H; Berthoud, H-R
2010-06-01
To test the hypothesis that micro-opioid receptor signaling in the nucleus accumbens contributes to hedonic (over)eating and obesity. To investigate the effects of chronic micro-opioid antagonism in the nucleus accumbens core or shell on intake of a palatable diet, and the development of diet-induced obesity in rats. Chronic blockade of micro-opioid receptor signaling in the nucleus accumbens core or shell was achieved by means of repeated injections (every 4-5 days) of the irreversible receptor antagonist beta-funaltrexamine (BFNA) over 3-5 weeks. The diet consisted of either a choice of high-fat chow, chocolate-flavored Ensure and regular chow (each nutritionally complete) or regular chow only. Intake of each food item, body weight and body fat mass were monitored throughout the study. The BFNA injections aimed at either the core or shell of the nucleus accumbens resulted in significantly attenuated intake of palatable diet, body weight gain and fat accretion, compared with vehicle control injections. The injection of BFNA in the core did not significantly change these parameters in chow-fed control rats. The injection of BFNA in the core and shell differentially affected intake of the two palatable food items: in the core, BFNA significantly reduced the intake of high-fat, but not of Ensure, whereas in the shell, it significantly reduced the intake of Ensure, but not of high-fat, compared with vehicle treatment. Endogenous micro-opioid receptor signaling in the nucleus accumbens core and shell is necessary for palatable diet-induced hyperphagia and obesity to fully develop in rats. Sweet and non-sweet fatty foods may be differentially processed in subcomponents of the ventral striatum.
Findings from the ISMP Medication Safety Self-Assessment for hospitals.
Smetzer, Judy L; Vaida, Allen J; Cohen, Michael R; Tranum, Diane; Pittman, Mary A; Armstrong, Carl W
2003-11-01
Hospital medication practices should be assessed, awareness of the characteristics of a safe medication system heightened, and baseline data to identify national priorities established. A cross-sectional survey of U.S. hospitals (N = 6,180) was conducted in May 2000. The survey instrument contained 194 self-assessment items organized into 20 core characteristics and 10 larger domains. Hospitals were asked to voluntarily submit their confidential assessment data to the Institute for Safe Medication Practices (ISMP) for aggregate analysis. A weighting structure was applied to the individual items and used to calculate core characteristic scores, domain scores, and overall self-assessment scores. These scores were then compared to identify areas most in need of improvement. The 1,435 participating hospitals scored highest in domains related to drug storage and distribution; environmental factors; infusion pumps; and medication labeling, packaging, and nomenclature issues. These hospitals scored lowest in domains related to accessible patient information, communication of medication orders, patient education, and quality processes such as double-check systems and organizational culture. Enormous opportunities exist to improve medication safety, especially in domains related to culture, information management, and communication.
Applying Item Response Theory methods to design a learning progression-based science assessment
NASA Astrophysics Data System (ADS)
Chen, Jing
Learning progressions are used to describe how students' understanding of a topic progresses over time and to classify the progress of students into steps or levels. This study applies Item Response Theory (IRT) based methods to investigate how to design learning progression-based science assessments. The research questions of this study are: (1) how to use items in different formats to classify students into levels on the learning progression, (2) how to design a test to give good information about students' progress through the learning progression of a particular construct and (3) what characteristics of test items support their use for assessing students' levels. Data used for this study were collected from 1500 elementary and secondary school students during 2009--2010. The written assessment was developed in several formats such as the Constructed Response (CR) items, Ordered Multiple Choice (OMC) and Multiple True or False (MTF) items. The followings are the main findings from this study. The OMC, MTF and CR items might measure different components of the construct. A single construct explained most of the variance in students' performances. However, additional dimensions in terms of item format can explain certain amount of the variance in student performance. So additional dimensions need to be considered when we want to capture the differences in students' performances on different types of items targeting the understanding of the same underlying progression. Items in each item format need to be improved in certain ways to classify students more accurately into the learning progression levels. This study establishes some general steps that can be followed to design other learning progression-based tests as well. For example, first, the boundaries between levels on the IRT scale can be defined by using the means of the item thresholds across a set of good items. Second, items in multiple formats can be selected to achieve the information criterion at all the defined boundaries. This ensures the accuracy of the classification. Third, when item threshold parameters vary a bit, the scoring rubrics and the items need to be reviewed to make the threshold parameters similar across items. This is because one important design criterion of the learning progression-based items is that ideally, a student should be at the same level across items, which means that the item threshold parameters (d1, d 2 and d3) should be similar across items. To design a learning progression-based science assessment, we need to understand whether the assessment measures a single construct or several constructs and how items are associated with the constructs being measured. Results from dimension analyses indicate that items of different carbon transforming processes measure different aspects of the carbon cycle construct. However, items of different practices assess the same construct. In general, there are high correlations among different processes or practices. It is not clear whether the strong correlations are due to the inherent links among these process/practice dimensions or due to the fact that the student sample does not show much variation in these process/practice dimensions. Future data are needed to examine the dimensionalities in terms of process/practice in detail. Finally, based on item characteristics analysis, recommendations are made to write more discriminative CR items and better OMC, MTF options. Item writers can follow these recommendations to write better learning progression-based items.
Chen, Cheng-Te; Chen, Yu-Lan; Lin, Yu-Ching; Hsieh, Ching-Lin; Tzeng, Jeng-Yi
2018-01-01
Objective The purpose of this study was to construct a computerized adaptive test (CAT) for measuring self-care performance (the CAT-SC) in children with developmental disabilities (DD) aged from 6 months to 12 years in a content-inclusive, precise, and efficient fashion. Methods The study was divided into 3 phases: (1) item bank development, (2) item testing, and (3) a simulation study to determine the stopping rules for the administration of the CAT-SC. A total of 215 caregivers of children with DD were interviewed with the 73-item CAT-SC item bank. An item response theory model was adopted for examining the construct validity to estimate item parameters after investigation of the unidimensionality, equality of slope parameters, item fitness, and differential item functioning (DIF). In the last phase, the reliability and concurrent validity of the CAT-SC were evaluated. Results The final CAT-SC item bank contained 56 items. The stopping rules suggested were (a) reliability coefficient greater than 0.9 or (b) 14 items administered. The results of simulation also showed that 85% of the estimated self-care performance scores would reach a reliability higher than 0.9 with a mean test length of 8.5 items, and the mean reliability for the rest was 0.86. Administering the CAT-SC could reduce the number of items administered by 75% to 84%. In addition, self-care performances estimated by the CAT-SC and the full item bank were very similar to each other (Pearson r = 0.98). Conclusion The newly developed CAT-SC can efficiently measure self-care performance in children with DD whose performances are comparable to those of TD children aged from 6 months to 12 years as precisely as the whole item bank. The item bank of the CAT-SC has good reliability and a unidimensional self-care construct, and the CAT can estimate self-care performance with less than 25% of the items in the item bank. Therefore, the CAT-SC could be useful for measuring self-care performance in children with DD in clinical and research settings. PMID:29561879
Chen, Cheng-Te; Chen, Yu-Lan; Lin, Yu-Ching; Hsieh, Ching-Lin; Tzeng, Jeng-Yi; Chen, Kuan-Lin
2018-01-01
The purpose of this study was to construct a computerized adaptive test (CAT) for measuring self-care performance (the CAT-SC) in children with developmental disabilities (DD) aged from 6 months to 12 years in a content-inclusive, precise, and efficient fashion. The study was divided into 3 phases: (1) item bank development, (2) item testing, and (3) a simulation study to determine the stopping rules for the administration of the CAT-SC. A total of 215 caregivers of children with DD were interviewed with the 73-item CAT-SC item bank. An item response theory model was adopted for examining the construct validity to estimate item parameters after investigation of the unidimensionality, equality of slope parameters, item fitness, and differential item functioning (DIF). In the last phase, the reliability and concurrent validity of the CAT-SC were evaluated. The final CAT-SC item bank contained 56 items. The stopping rules suggested were (a) reliability coefficient greater than 0.9 or (b) 14 items administered. The results of simulation also showed that 85% of the estimated self-care performance scores would reach a reliability higher than 0.9 with a mean test length of 8.5 items, and the mean reliability for the rest was 0.86. Administering the CAT-SC could reduce the number of items administered by 75% to 84%. In addition, self-care performances estimated by the CAT-SC and the full item bank were very similar to each other (Pearson r = 0.98). The newly developed CAT-SC can efficiently measure self-care performance in children with DD whose performances are comparable to those of TD children aged from 6 months to 12 years as precisely as the whole item bank. The item bank of the CAT-SC has good reliability and a unidimensional self-care construct, and the CAT can estimate self-care performance with less than 25% of the items in the item bank. Therefore, the CAT-SC could be useful for measuring self-care performance in children with DD in clinical and research settings.
Daniels, Vijay J; Bordage, Georges; Gierl, Mark J; Yudkowsky, Rachel
2014-10-01
Objective structured clinical examinations (OSCEs) are used worldwide for summative examinations but often lack acceptable reliability. Research has shown that reliability of scores increases if OSCE checklists for medical students include only clinically relevant items. Also, checklists are often missing evidence-based items that high-achieving learners are more likely to use. The purpose of this study was to determine if limiting checklist items to clinically discriminating items and/or adding missing evidence-based items improved score reliability in an Internal Medicine residency OSCE. Six internists reviewed the traditional checklists of four OSCE stations classifying items as clinically discriminating or non-discriminating. Two independent reviewers augmented checklists with missing evidence-based items. We used generalizability theory to calculate overall reliability of faculty observer checklist scores from 45 first and second-year residents and predict how many 10-item stations would be required to reach a Phi coefficient of 0.8. Removing clinically non-discriminating items from the traditional checklist did not affect the number of stations (15) required to reach a Phi of 0.8 with 10 items. Focusing the checklist on only evidence-based clinically discriminating items increased test score reliability, needing 11 stations instead of 15 to reach 0.8; adding missing evidence-based clinically discriminating items to the traditional checklist modestly improved reliability (needing 14 instead of 15 stations). Checklists composed of evidence-based clinically discriminating items improved the reliability of checklist scores and reduced the number of stations needed for acceptable reliability. Educators should give preference to evidence-based items over non-evidence-based items when developing OSCE checklists.
Vegetable parenting practices scale. Item response modeling analyses
Chen, Tzu-An; O’Connor, Teresia; Hughes, Sheryl; Beltran, Alicia; Baranowski, Janice; Diep, Cassandra; Baranowski, Tom
2015-01-01
Objective To evaluate the psychometric properties of a vegetable parenting practices scale using multidimensional polytomous item response modeling which enables assessing item fit to latent variables and the distributional characteristics of the items in comparison to the respondents. We also tested for differences in the ways item function (called differential item functioning) across child’s gender, ethnicity, age, and household income groups. Method Parents of 3–5 year old children completed a self-reported vegetable parenting practices scale online. Vegetable parenting practices consisted of 14 effective vegetable parenting practices and 12 ineffective vegetable parenting practices items, each with three subscales (responsiveness, structure, and control). Multidimensional polytomous item response modeling was conducted separately on effective vegetable parenting practices and ineffective vegetable parenting practices. Results One effective vegetable parenting practice item did not fit the model well in the full sample or across demographic groups, and another was a misfit in differential item functioning analyses across child’s gender. Significant differential item functioning was detected across children’s age and ethnicity groups, and more among effective vegetable parenting practices than ineffective vegetable parenting practices items. Wright maps showed items only covered parts of the latent trait distribution. The harder- and easier-to-respond ends of the construct were not covered by items for effective vegetable parenting practices and ineffective vegetable parenting practices, respectively. Conclusions Several effective vegetable parenting practices and ineffective vegetable parenting practices scale items functioned differently on the basis of child’s demographic characteristics; therefore, researchers should use these vegetable parenting practices scales with caution. Item response modeling should be incorporated in analyses of parenting practice questionnaires to better assess differences across demographic characteristics. PMID:25895694
Terwee, C B; Roorda, L D; de Vet, H C W; Dekker, J; Westhovens, R; van Leeuwen, J; Cella, D; Correia, H; Arnold, B; Perez, B; Boers, M
2014-08-01
The Patient-Reported Outcomes Measurement Information System (PROMIS(®)) is a new, state-of-the-art assessment system for measuring patient-reported health and well-being of adults and children that has the potential to be more valid, reliable and responsive than existing PROMs. The PROMIS items can be administered in short forms or, more efficiently, through computerized adaptive testing. This paper describes the translation of 563 items from 17 PROMIS item banks (domains) for adults from the English source into Dutch-Flemish. The translation was performed by FACITtrans using standardized methodology and approved by the PROMIS Statistical Center. The translation included four forward translations, two back-translations, three to five independent reviews (at least two Dutch, one Flemish) and pre-testing in 70 adults (age range 20-77) from the Netherlands and Flanders. A small number of items required separate translations for Dutch and Flemish: physical function (five items), pain behaviour (two items), pain interference (one item), social isolation (one item) and global health (one item). Challenges faced in the translation process included: scarcity or overabundance of possible translations, unclear item descriptions, constructs broader/smaller in the target language, difficulties in rank ordering items, differences in unit of measurement, irrelevant items or differences in performance of activities. By addressing these challenges, acceptable translations were obtained for all items. The methodology used and experience gained in this study can be used as an example for researchers in other countries interested in translating PROMIS. The Dutch-Flemish PROMIS items are linguistically equivalent. Short forms will soon be available for use and entire item banks are ready for cross-cultural validation in the Netherlands and Flanders.
Development of the PROMIS positive emotional and sensory expectancies of smoking item banks.
Tucker, Joan S; Shadel, William G; Edelen, Maria Orlando; Stucky, Brian D; Li, Zhen; Hansen, Mark; Cai, Li
2014-09-01
The positive emotional and sensory expectancies of cigarette smoking include improved cognitive abilities, positive affective states, and pleasurable sensorimotor sensations. This paper describes development of Positive Emotional and Sensory Expectancies of Smoking item banks that will serve to standardize the assessment of this construct among daily and nondaily cigarette smokers. Data came from daily (N = 4,201) and nondaily (N =1,183) smokers who completed an online survey. To identify a unidimensional set of items, we conducted item factor analyses, item response theory analyses, and differential item functioning analyses. Additionally, we evaluated the performance of fixed-item short forms (SFs) and computer adaptive tests (CATs) to efficiently assess the construct. Eighteen items were included in the item banks (15 common across daily and nondaily smokers, 1 unique to daily, 2 unique to nondaily). The item banks are strongly unidimensional, highly reliable (reliability = 0.95 for both), and perform similarly across gender, age, and race/ethnicity groups. A SF common to daily and nondaily smokers consists of 6 items (reliability = 0.86). Results from simulated CATs indicated that, on average, less than 8 items are needed to assess the construct with adequate precision using the item banks. These analyses identified a new set of items that can assess the positive emotional and sensory expectancies of smoking in a reliable and standardized manner. Considerable efficiency in assessing this construct can be achieved by using the item bank SF, employing computer adaptive tests, or selecting subsets of items tailored to specific research or clinical purposes. © The Author 2014. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Baylor, Carolyn; Yorkston, Kathryn; Eadie, Tanya; Kim, Jiseon; Chung, Hyewon; Amtmann, Dagmar
2015-01-01
Purpose The purpose of this study was to calibrate the items for the Communicative Participation Item Bank (CPIB) using Item Response Theory (IRT). One overriding objective was to examine if the IRT item parameters would be consistent across different diagnostic groups, thereby allowing creation of a disorder-generic instrument. The intended outcomes were the final item bank and a short form ready for clinical and research applications. Methods Self-report data were collected from 701 individuals representing four diagnoses: multiple sclerosis, Parkinson’s disease, amyotrophic lateral sclerosis and head and neck cancer. Participants completed the CPIB and additional self-report questionnaires. CPIB data were analyzed using the IRT Graded Response Model (GRM). Results The initial set of 94 candidate CPIB items were reduced to an item bank of 46 items demonstrating unidimensionality, local independence, good item fit, and good measurement precision. Differential item function (DIF) analyses detected no meaningful differences across diagnostic groups. A 10-item, disorder-generic short form was generated. Conclusions The CPIB provides speech-language pathologists with a unidimensional, self-report outcomes measurement instrument dedicated to the construct of communicative participation. This instrument may be useful to clinicians and researchers wanting to implement measures of communicative participation in their work. PMID:23816661
Kisala, Pamela A; Tulsky, David S; Pace, Natalie; Victorson, David; Choi, Seung W; Heinemann, Allen W
2015-05-01
To develop a calibrated item bank and computer adaptive test (CAT) to assess the effects of stigma on health-related quality of life in individuals with spinal cord injury (SCI). Grounded-theory based qualitative item development methods, large-scale item calibration field testing, confirmatory factor analysis, and item response theory (IRT)-based psychometric analyses. Five SCI Model System centers and one Department of Veterans Affairs medical center in the United States. Adults with traumatic SCI. SCI-QOL Stigma Item Bank A sample of 611 individuals with traumatic SCI completed 30 items assessing SCI-related stigma. After 7 items were iteratively removed, factor analyses confirmed a unidimensional pool of items. Graded Response Model IRT analyses were used to estimate slopes and thresholds for the final 23 items. The SCI-QOL Stigma item bank is unique not only in the assessment of SCI-related stigma but also in the inclusion of individuals with SCI in all phases of its development. Use of confirmatory factor analytic and IRT methods provide flexibility and precision of measurement. The item bank may be administered as a CAT or as a 10-item fixed-length short form and can be used for research and clinical applications.
Kisala, Pamela A.; Tulsky, David S.; Pace, Natalie; Victorson, David; Choi, Seung W.; Heinemann, Allen W.
2015-01-01
Objective To develop a calibrated item bank and computer adaptive test (CAT) to assess the effects of stigma on health-related quality of life in individuals with spinal cord injury (SCI). Design Grounded-theory based qualitative item development methods, large-scale item calibration field testing, confirmatory factor analysis, and item response theory (IRT)-based psychometric analyses. Setting Five SCI Model System centers and one Department of Veterans Affairs medical center in the United States. Participants Adults with traumatic SCI. Main Outcome Measures SCI-QOL Stigma Item Bank Results A sample of 611 individuals with traumatic SCI completed 30 items assessing SCI-related stigma. After 7 items were iteratively removed, factor analyses confirmed a unidimensional pool of items. Graded Response Model IRT analyses were used to estimate slopes and thresholds for the final 23 items. Conclusions The SCI-QOL Stigma item bank is unique not only in the assessment of SCI-related stigma but also in the inclusion of individuals with SCI in all phases of its development. Use of confirmatory factor analytic and IRT methods provide flexibility and precision of measurement. The item bank may be administered as a CAT or as a 10-item fixed-length short form and can be used for research and clinical applications. PMID:26010973
Hajian, Sepideh; Mehrabi, Esmat; Simbar, Masoumeh; Houshyari, Mohammad; Zayeri, Farid; Hajian, Parastoo
2016-01-01
Background Cancer diagnosis for everybody may be perceived as crisis and breast cancer, as the most common malignancy in women, can influence their well-being and multiple aspects of their health. So understanding that how women in various contexts and communities adjust to the illness is necessary to facilitate this adjustment and improve their quality of life. Objectives The aim of this study was to: 1) identify the core components of coping strategies to adjust to the illness in Iranian women with breast cancer perspective, 2) to develop and determine psychometric properties of a native self-report instrument to assess coping behaviors and measure the degree of adjustment with the breast cancer. Methods The present exploratory mixed method study was conducted in two consecutive stages: 1) the hermeneutic phenomenological study was done to explore the life experiences of coping styles to adjust with the breast cancer using in-depth interviews with patients that lead to item generation; 2) psychometric properties (validity and reliability) of the instrument were evaluated recruiting 340 eligible women. The item pool was reduced systematically and resulted in a 49-item instrument. Results From the qualitative stage, item pool containing 78 items related to coping strategies to adjust with the breast cancer. After eliminating unwanted statements from the results, qualitative and quantitative face and content validity, the 10 factors extracted employing construct validity were: feeling of guilt, abstention-diversion, role preservation and seeking support, efforts for threat control, confronting, fear and anxiety, role wasting, maturation and growth, isolation, and fatalism. These factors accounted for the 59.1% of variance observed. The Cronbach reliability test was carried out and alpha value of 10 factors was calculated from 0.78 to 0.87 confirming all factors were internally consistent. The scale’s stability was tested using the test-retest method. Conclusions The 49-item AIMI-IBC revealed acceptable psychometric properties. This instrument provides healthcare professionals to systematically assess the coping strategies of Iranian women with breast cancer and measure the degree of adjustment with illness. PMID:27761211
Development and content validation of a patient-reported endometriosis pain daily diary.
van Nooten, Floortje E; Cline, Jennifer; Elash, Celeste A; Paty, Jean; Reaney, Matthew
2018-01-04
Endometriosis is a common gynecological disorder that causes inflammation and pelvic pain. Endometriosis-related pain is best captured with patient-reported outcome (PRO) measures, however, assessment of endometriosis-related pain in clinical trials has been difficult in the absence of a reliable and valid PRO instrument. We describe the development of the Endometriosis Pain Daily Diary (EPDD), an electronic PRO developed as a survey instrument to assess endometriosis-related pain and its impact on patients' lives. The EPDD was initially developed on the basis of an existing Endometriosis Pain and Bleeding Diary, a targeted review of relevant literature, clinical expert interviews, and open-ended (concept elicitation) patient interviews in the United States (US) and Japan which captured patients' experience with endometriosis. Cognitive interviews of patients with endometriosis were conducted to evaluate patient comprehension of the EPDD items. A conceptual model of endometriosis was developed, and meetings with US and European regulatory authorities provided feedback for validating the EPDD in the context of clinical trials. Translatability assessments of the EPDD were conducted to confirm its appropriate interpretation and ease of completion across 17 languages. The iterative development progressed through three versions of the instrument. The EPDDv1 included 18 items relating to dysmenorrhea/pelvic pain, dyspareunia and sexual activity, bleeding, hot flashes, daily activities, and use of rescue medication. The EPDDv2 was a larger 43-item survey tested in cognitive interviews and subsequently revised to yield the current 11-item EPDDv3, consisting of five core items relating to dysmenorrhea, non-menstrual pelvic pain, and dyspareunia, and six extension items relating to sexual activity, daily activities, and use of rescue medication. The EPDD is a PRO for the evaluation of endometriosis-related pain and its associated impacts on patients' lives. The EPDD represents an important step in providing a PRO that is relevant to patients with endometriosis-related pain in the context of a clinical study setting (ie, fit-for-purpose), designed to evaluate pain associated with endometriosis, including regulatory agency support for its further exploration in clinical trials.
Cunningham, S G; Carinci, F; Brillante, M; Leese, G P; McAlpine, R R; Azzopardi, J; Beck, P; Bratina, N; Bocquet, V; Doggen, K; Jarosz-Chobot, P K; Jecht, M; Lindblad, U; Moulton, T; Metelko, Ž; Nagy, A; Olympios, G; Pruna, S; Skeie, S; Storms, F; Di Iorio, C T; Massi Benedetti, M
2016-01-01
A set of core diabetes indicators were identified in a clinical review of current evidence for the EUBIROD project. In order to allow accurate comparisons of diabetes indicators, a standardised currency for data storage and aggregation was required. We aimed to define a robust European data dictionary with appropriate clinical definitions that can be used to analyse diabetes outcomes and provide the foundation for data collection from existing electronic health records for diabetes. Existing clinical datasets used by 15 partner institutions across Europe were collated and common data items analysed for consistency in terms of recording, data definition and units of measurement. Where necessary, data mappings and algorithms were specified in order to allow partners to meet the standard definitions. A series of descriptive elements were created to document metadata for each data item, including recording, consistency, completeness and quality. While datasets varied in terms of consistency, it was possible to create a common standard that could be used by all. The minimum dataset defined 53 data items that were classified according to their feasibility and validity. Mappings and standardised definitions were used to create an electronic directory for diabetes care, providing the foundation for the EUBIROD data analysis repository, also used to implement the diabetes registry and model of care for Cyprus. The development of data dictionaries and standards can be used to improve the quality and comparability of health information. A data dictionary has been developed to be compatible with other existing data sources for diabetes, within and beyond Europe.
Parallel coding of conjunctions in visual search.
Found, A
1998-10-01
Two experiments investigated whether the conjunctive nature of nontarget items influenced search for a conjunction target. Each experiment consisted of two conditions. In both conditions, the target item was a red bar tilted to the right, among white tilted bars and vertical red bars. As well as color and orientation, display items also differed in terms of size. Size was irrelevant to search in that the size of the target varied randomly from trial to trial. In one condition, the size of items correlated with the other attributes of display items (e.g., all red items were big and all white items were small). In the other condition, the size of items varied randomly (i.e., some red items were small and some were big, and some white items were big and some were small). Search was more efficient in the size-correlated condition, consistent with the parallel coding of conjunctions in visual search.
Douglas, Raymond S; Tsirbas, Angelo; Gordon, Mark; Lee, Diana; Khadavi, Nicole; Garneau, Helene Chokron; Goldberg, Robert A; Cahill, Kenneth; Dolman, Peter J; Elner, Victor; Feldon, Steve; Lucarelli, Mark; Uddin, Jimmy; Kazim, Michael; Smith, Terry J; Khanna, Dinesh
2009-09-01
To identify components of a provisional clinical response index for thyroid eye disease using a modified Delphi technique. The International Thyroid Eye Disease Society conducted a structured, 3-round Delphi exercise establishing consensus for a core set of measures for clinical trials in thyroid eye disease. The steering committee discussed the results in a face-to-face meeting (nominal group technique) and evaluated each criterion with respect to its feasibility, reliability, redundancy, and validity. Redundant measures were consolidated or excluded. Criteria were parsed into 11 domains for the Delphi surveys. Eighty-four respondents participated in the Delphi 1 survey, providing 220 unique items. Ninety-two members (100% of the respondents from Delphi 1 plus 8 new participants) responded in Delphi 2 and rated the same 220 items. Sixty-four members (76% of participants) rated 153 criteria in Delphi 3 (67 criteria were excluded because of redundancy). Criteria with a mean greater than 6 (1 = least appropriate to 9 = most appropriate) were further evaluated by the nominal group technique and provisional core measures were chosen. Using a Delphi exercise, we developed provisional core measures for assessing disease activity and severity in clinical trials of therapies for thyroid eye disease. These measures will be iteratively refined for use in multicenter clinical trials.
Douglas, Raymond S.; Tsirbas, Angelo; Gordon, Mark; Lee, Diana; Khadavi, Nicole; Garneau, Helene Chokron; Goldberg, Robert A.; Cahill, Kenneth; Dolman, Peter J.; Elner, Victor; Feldon, Steve; Lucarelli, Mark; Uddin, Jimmy; Kazim, Michael; Smith, Terry J.; Khanna, Dinesh
2014-01-01
To identify components of a provisional clinical response index for thyroid eye disease (CRI-TED) using a modified Delphi technique. The International Thyroid Eye Disease Society (ITEDS) conducted a structured, 3-round Delphi exercise establishing consensus for a core set of measures for clinical trials in TED. The steering committee discussed the results in a face-to-face meeting (nominal group technique) and evaluated each criterion with respect to its feasibility, reliability, redundancy, and validity. Redundant measures were consolidated or excluded. Criteria were parsed into 11 domains for the Delphi surveys. Eighty four respondents participated in the Delphi-1 survey, providing 220 unique items. Ninety- two members (100% of the respondents from Delphi 1 plus eight new participants) responded in Delphi-2 and rated the same 220 items. Sixty-four members (76% of participants) rated 153 criteria in Delphi-3 (67 criteria were excluded due to redundancy). Criteria with a mean greater than 6 (1 least appropriate to 9 most appropriate) were further evaluated by the nominal group technique and provisional core measures were chosen. Using a Delphi exercise, we developed provisional core measures for assessing disease activity and severity in clinical trials of therapies for TED. These measures will be iteratively refined for use in multicenter clinical trials. PMID:19752424
Negative affect impairs associative memory but not item memory.
Bisby, James A; Burgess, Neil
2013-12-17
The formation of associations between items and their context has been proposed to rely on mechanisms distinct from those supporting memory for a single item. Although emotional experiences can profoundly affect memory, our understanding of how it interacts with different aspects of memory remains unclear. We performed three experiments to examine the effects of emotion on memory for items and their associations. By presenting neutral and negative items with background contexts, Experiment 1 demonstrated that item memory was facilitated by emotional affect, whereas memory for an associated context was reduced. In Experiment 2, arousal was manipulated independently of the memoranda, by a threat of shock, whereby encoding trials occurred under conditions of threat or safety. Memory for context was equally impaired by the presence of negative affect, whether induced by threat of shock or a negative item, relative to retrieval of the context of a neutral item in safety. In Experiment 3, participants were presented with neutral and negative items as paired associates, including all combinations of neutral and negative items. The results showed both above effects: compared to a neutral item, memory for the associate of a negative item (a second item here, context in Experiments 1 and 2) is impaired, whereas retrieval of the item itself is enhanced. Our findings suggest that negative affect impairs associative memory while recognition of a negative item is enhanced. They support dual-processing models in which negative affect or stress impairs hippocampal-dependent associative memory while the storage of negative sensory/perceptual representations is spared or even strengthened.
Austvoll-Dahlgren, Astrid; Guttersrud, Øystein; Nsangi, Allen; Semakula, Daniel; Oxman, Andrew D
2017-01-01
Background The Claim Evaluation Tools database contains multiple-choice items for measuring people’s ability to apply the key concepts they need to know to be able to assess treatment claims. We assessed items from the database using Rasch analysis to develop an outcome measure to be used in two randomised trials in Uganda. Rasch analysis is a form of psychometric testing relying on Item Response Theory. It is a dynamic way of developing outcome measures that are valid and reliable. Objectives To assess the validity, reliability and responsiveness of 88 items addressing 22 key concepts using Rasch analysis. Participants We administrated four sets of multiple-choice items in English to 1114 people in Uganda and Norway, of which 685 were children and 429 were adults (including 171 health professionals). We scored all items dichotomously. We explored summary and individual fit statistics using the RUMM2030 analysis package. We used SPSS to perform distractor analysis. Results Most items conformed well to the Rasch model, but some items needed revision. Overall, the four item sets had satisfactory reliability. We did not identify significant response dependence between any pairs of items and, overall, the magnitude of multidimensionality in the data was acceptable. The items had a high level of difficulty. Conclusion Most of the items conformed well to the Rasch model’s expectations. Following revision of some items, we concluded that most of the items were suitable for use in an outcome measure for evaluating the ability of children or adults to assess treatment claims. PMID:28550019
Massof, Robert W
2014-10-01
A simple theoretical framework explains patient responses to items in rating scale questionnaires. Fixed latent variables position each patient and each item on the same linear scale. Item responses are governed by a set of fixed category thresholds, one for each ordinal response category. A patient's item responses are magnitude estimates of the difference between the patient variable and the patient's estimate of the item variable, relative to his/her personally defined response category thresholds. Differences between patients in their personal estimates of the item variable and in their personal choices of category thresholds are represented by random variables added to the corresponding fixed variables. Effects of intervention correspond to changes in the patient variable, the patient's response bias, and/or latent item variables for a subset of items. Intervention effects on patients' item responses were simulated by assuming the random variables are normally distributed with a constant scalar covariance matrix. Rasch analysis was used to estimate latent variables from the simulated responses. The simulations demonstrate that changes in the patient variable and changes in response bias produce indistinguishable effects on item responses and manifest as changes only in the estimated patient variable. Changes in a subset of item variables manifest as intervention-specific differential item functioning and as changes in the estimated person variable that equals the average of changes in the item variables. Simulations demonstrate that intervention-specific differential item functioning produces inefficiencies and inaccuracies in computer adaptive testing. © The Author(s) 2013 Reprints and permissions: sagepub.co.uk/journalsPermissions.nav.
Improving Measurement Efficiency of the Inner EAR Scale with Item Response Theory.
Jessen, Annika; Ho, Andrew D; Corrales, C Eduardo; Yueh, Bevan; Shin, Jennifer J
2018-02-01
Objectives (1) To assess the 11-item Inner Effectiveness of Auditory Rehabilitation (Inner EAR) instrument with item response theory (IRT). (2) To determine whether the underlying latent ability could also be accurately represented by a subset of the items for use in high-volume clinical scenarios. (3) To determine whether the Inner EAR instrument correlates with pure tone thresholds and word recognition scores. Design IRT evaluation of prospective cohort data. Setting Tertiary care academic ambulatory otolaryngology clinic. Subjects and Methods Modern psychometric methods, including factor analysis and IRT, were used to assess unidimensionality and item properties. Regression methods were used to assess prediction of word recognition and pure tone audiometry scores. Results The Inner EAR scale is unidimensional, and items varied in their location and information. Information parameter estimates ranged from 1.63 to 4.52, with higher values indicating more useful items. The IRT model provided a basis for identifying 2 sets of items with relatively lower information parameters. Item information functions demonstrated which items added insubstantial value over and above other items and were removed in stages, creating a 8- and 3-item Inner EAR scale for more efficient assessment. The 8-item version accurately reflected the underlying construct. All versions correlated moderately with word recognition scores and pure tone averages. Conclusion The 11-, 8-, and 3-item versions of the Inner EAR scale have strong psychometric properties, and there is correlational validity evidence for the observed scores. Modern psychometric methods can help streamline care delivery by maximizing relevant information per item administered.
Development of an item bank for computerized adaptive test (CAT) measurement of pain.
Petersen, Morten Aa; Aaronson, Neil K; Chie, Wei-Chu; Conroy, Thierry; Costantini, Anna; Hammerlid, Eva; Hjermstad, Marianne J; Kaasa, Stein; Loge, Jon H; Velikova, Galina; Young, Teresa; Groenvold, Mogens
2016-01-01
Patient-reported outcomes should ideally be adapted to the individual patient while maintaining comparability of scores across patients. This is achievable using computerized adaptive testing (CAT). The aim here was to develop an item bank for CAT measurement of the pain domain as measured by the EORTC QLQ-C30 questionnaire. The development process consisted of four steps: (1) literature search, (2) formulation of new items and expert evaluations, (3) pretesting and (4) field-testing and psychometric analyses for the final selection of items. In step 1, we identified 337 pain items from the literature. Twenty-nine new items fitting the QLQ-C30 item style were formulated in step 2 that were reduced to 26 items by expert evaluations. Based on interviews with 31 patients from Denmark, France and the UK, the list was further reduced to 21 items in step 3. In phase 4, responses were obtained from 1103 cancer patients from five countries. Psychometric evaluations showed that 16 items could be retained in a unidimensional item bank. Evaluations indicated that use of the CAT measure may reduce sample size requirements with 15-25% compared to using the QLQ-C30 pain scale. We have established an item bank of 16 items suitable for CAT measurement of pain. While being backward compatible with the QLQ-C30, the new item bank will significantly improve measurement precision of pain. We recommend initiating CAT measurement by screening for pain using the two original QLQ-C30 pain items. The EORTC pain CAT is currently available for "experimental" purposes.
Cordier, Reinie; Speyer, Renée; Schindler, Antonio; Michou, Emilia; Heijnen, Bas Joris; Baijens, Laura; Karaduman, Ayşe; Swan, Katina; Clavé, Pere; Joosten, Annette Veronica
2018-02-01
The Swallowing Quality of Life questionnaire (SWAL-QOL) is widely used clinically and in research to evaluate quality of life related to swallowing difficulties. It has been described as a valid and reliable tool, but was developed and tested using classic test theory. This study describes the reliability and validity of the SWAL-QOL using item response theory (IRT; Rasch analysis). SWAL-QOL data were gathered from 507 participants at risk of oropharyngeal dysphagia (OD) across four European countries. OD was confirmed in 75.7% of participants via videofluoroscopy and/or fiberoptic endoscopic evaluation, or a clinical diagnosis based on meeting selected criteria. Patients with esophageal dysphagia were excluded. Data were analysed using Rasch analysis. Item and person reliability was good for all the items combined. However, person reliability was poor for 8 subscales and item reliability was poor for one subscale. Eight subscales exhibited poor person separation and two exhibited poor item separation. Overall item and person fit statistics were acceptable. However, at an individual item fit level results indicated unpredictable item responses for 28 items, and item redundancy for 10 items. The item-person dimensionality map confirmed these findings. Results from the overall Rasch model fit and Principal Component Analysis were suggestive of a second dimension. For all the items combined, none of the item categories were 'category', 'threshold' or 'step' disordered; however, all subscales demonstrated category disordered functioning. Findings suggest an urgent need to further investigate the underlying structure of the SWAL-QOL and its psychometric characteristics using IRT.
Methodology for Developing and Evaluating the PROMIS® Smoking Item Banks
Cai, Li; Stucky, Brian D.; Tucker, Joan S.; Shadel, William G.; Edelen, Maria Orlando
2014-01-01
Introduction: This article describes the procedures used in the PROMIS® Smoking Initiative for the development and evaluation of item banks, short forms (SFs), and computerized adaptive tests (CATs) for the assessment of 6 constructs related to cigarette smoking: nicotine dependence, coping expectancies, emotional and sensory expectancies, health expectancies, psychosocial expectancies, and social motivations for smoking. Methods: Analyses were conducted using response data from a large national sample of smokers. Items related to each construct were subjected to extensive item factor analyses and evaluation of differential item functioning (DIF). Final item banks were calibrated, and SF assessments were developed for each construct. The performance of the SFs and the potential use of the item banks for CAT administration were examined through simulation study. Results: Item selection based on dimensionality assessment and DIF analyses produced item banks that were essentially unidimensional in structure and free of bias. Simulation studies demonstrated that the constructs could be accurately measured with a relatively small number of carefully selected items, either through fixed SFs or CAT-based assessment. Illustrative results are presented, and subsequent articles provide detailed discussion of each item bank in turn. Conclusions: The development of the PROMIS smoking item banks provides researchers with new tools for measuring smoking-related constructs. The use of the calibrated item banks and suggested SF assessments will enhance the quality of score estimates, thus advancing smoking research. Moreover, the methods used in the current study, including innovative approaches to item selection and SF construction, may have general relevance to item bank development and evaluation. PMID:23943843
Evaluation of item candidates for a diabetic retinopathy quality of life item bank.
Fenwick, Eva K; Pesudovs, Konrad; Khadka, Jyoti; Rees, Gwyn; Wong, Tien Y; Lamoureux, Ecosse L
2013-09-01
We are developing an item bank assessing the impact of diabetic retinopathy (DR) on quality of life (QoL) using a rigorous multi-staged process combining qualitative and quantitative methods. We describe here the first two qualitative phases: content development and item evaluation. After a comprehensive literature review, items were generated from four sources: (1) 34 previously validated patient-reported outcome measures; (2) five published qualitative articles; (3) eight focus groups and 18 semi-structured interviews with 57 DR patients; and (4) seven semi-structured interviews with diabetes or ophthalmic experts. Items were then evaluated during 3 stages, namely binning (grouping) and winnowing (reduction) based on key criteria and panel consensus; development of item stems and response options; and pre-testing of items via cognitive interviews with patients. The content development phase yielded 1,165 unique items across 7 QoL domains. After 3 sessions of binning and winnowing, items were reduced to a minimally representative set (n = 312) across 9 domains of QoL: visual symptoms; ocular surface symptoms; activity limitation; mobility; emotional; health concerns; social; convenience; and economic. After 8 cognitive interviews, 42 items were amended resulting in a final set of 314 items. We have employed a systematic approach to develop items for a DR-specific QoL item bank. The psychometric properties of the nine QoL subscales will be assessed using Rasch analysis. The resulting validated item bank will allow clinicians and researchers to better understand the QoL impact of DR and DR therapies from the patient's perspective.
Developing an item bank to measure the coping strategies of people with hereditary retinal diseases.
Prem Senthil, Mallika; Khadka, Jyoti; De Roach, John; Lamey, Tina; McLaren, Terri; Campbell, Isabella; Fenwick, Eva K; Lamoureux, Ecosse L; Pesudovs, Konrad
2018-05-05
Our understanding of the coping strategies used by people with visual impairment to manage stress related to visual loss is limited. This study aims to develop a sophisticated coping instrument in the form of an item bank implemented via Computerised adaptive testing (CAT) for hereditary retinal diseases. Items on coping were extracted from qualitative interviews with patients which were supplemented by items from a literature review. A systematic multi-stage process of item refinement was carried out followed by expert panel discussion and cognitive interviews. The final coping item bank had 30 items. Rasch analysis was used to assess the psychometric properties. A CAT simulation was carried out to estimate an average number of items required to gain precise measurement of hereditary retinal disease-related coping. One hundred eighty-nine participants answered the coping item bank (median age = 58 years). The coping scale demonstrated good precision and targeting. The standardised residual loadings for items revealed six items grouped together. Removal of the six items reduced the precision of the main coping scale and worsened the variance explained by the measure. Therefore, the six items were retained within the main scale. Our CAT simulation indicated that, on average, less than 10 items are required to gain a precise measurement of coping. This is the first study to develop a psychometrically robust coping instrument for hereditary retinal diseases. CAT simulation indicated that on an average, only four and nine items were required to gain measurement at moderate and high precision, respectively.
41 CFR 101-27.204 - Types of shelf-life items.
Code of Federal Regulations, 2010 CFR
2010-07-01
... 41 Public Contracts and Property Management 2 2010-07-01 2010-07-01 true Types of shelf-life items...-Management of Shelf-Life Materials § 101-27.204 Types of shelf-life items. Shelf-life items are classified as nonextendable (Type I) and extendable (Type II). Type I items have a definite storage life after which the item...
41 CFR 101-27.204 - Types of shelf-life items.
Code of Federal Regulations, 2011 CFR
2011-07-01
... 41 Public Contracts and Property Management 2 2011-07-01 2007-07-01 true Types of shelf-life items...-Management of Shelf-Life Materials § 101-27.204 Types of shelf-life items. Shelf-life items are classified as nonextendable (Type I) and extendable (Type II). Type I items have a definite storage life after which the item...
Code of Federal Regulations, 2012 CFR
2012-04-01
... 17 Commodity and Securities Exchanges 3 2012-04-01 2012-04-01 false Inclusion of items, differentiation between items and answers, omission of instructions. 260.7a-16 Section 260.7a-16 Commodity and... INDENTURE ACT OF 1939 Formal Requirements § 260.7a-16 Inclusion of items, differentiation between items and...
Code of Federal Regulations, 2014 CFR
2014-04-01
... 17 Commodity and Securities Exchanges 4 2014-04-01 2014-04-01 false Inclusion of items, differentiation between items and answers, omission of instructions. 260.7a-16 Section 260.7a-16 Commodity and... INDENTURE ACT OF 1939 Formal Requirements § 260.7a-16 Inclusion of items, differentiation between items and...
Code of Federal Regulations, 2013 CFR
2013-04-01
... 17 Commodity and Securities Exchanges 3 2013-04-01 2013-04-01 false Inclusion of items, differentiation between items and answers, omission of instructions. 260.7a-16 Section 260.7a-16 Commodity and... INDENTURE ACT OF 1939 Formal Requirements § 260.7a-16 Inclusion of items, differentiation between items and...
ERIC Educational Resources Information Center
Matlock, Ki Lynn; Turner, Ronna
2016-01-01
When constructing multiple test forms, the number of items and the total test difficulty are often equivalent. Not all test developers match the number of items and/or average item difficulty within subcontent areas. In this simulation study, six test forms were constructed having an equal number of items and average item difficulty overall.…
A Quasi-Parametric Method for Fitting Flexible Item Response Functions
ERIC Educational Resources Information Center
Liang, Longjuan; Browne, Michael W.
2015-01-01
If standard two-parameter item response functions are employed in the analysis of a test with some newly constructed items, it can be expected that, for some items, the item response function (IRF) will not fit the data well. This lack of fit can also occur when standard IRFs are fitted to personality or psychopathology items. When investigating…
ERIC Educational Resources Information Center
Gierl, Mark J.; Lai, Hollis
2013-01-01
Changes to the design and development of our educational assessments are resulting in the unprecedented demand for a large and continuous supply of content-specific test items. One way to address this growing demand is with automatic item generation (AIG). AIG is the process of using item models to generate test items with the aid of computer…
The Effects of Judgment-Based Stratum Classifications on the Efficiency of Stratum Scored CATs.
ERIC Educational Resources Information Center
Finney, Sara J.; Smith, Russell W.; Wise, Steven L.
Two operational item pools were used to investigate the performance of stratum computerized adaptive tests (CATs) when items were assigned to strata based on empirical estimates of item difficulty or human judgments of item difficulty. Items from the first data set consisted of 54 5-option multiple choice items from a form of the ACT mathematics…
The Selection of Test Items for Decision Making with a Computer Adaptive Test.
ERIC Educational Resources Information Center
Spray, Judith A.; Reckase, Mark D.
The issue of test-item selection in support of decision making in adaptive testing is considered. The number of items needed to make a decision is compared for two approaches: selecting items from an item pool that are most informative at the decision point or selecting items that are most informative at the examinee's ability level. The first…
Code of Federal Regulations, 2010 CFR
2010-04-01
... 17 Commodity and Securities Exchanges 3 2010-04-01 2010-04-01 false Inclusion of items, differentiation between items and answers, omission of instructions. 260.7a-16 Section 260.7a-16 Commodity and... INDENTURE ACT OF 1939 Formal Requirements § 260.7a-16 Inclusion of items, differentiation between items and...
Automatic Item Generation: A More Efficient Process for Developing Mathematics Achievement Items?
ERIC Educational Resources Information Center
Embretson, Susan E.; Kingston, Neal M.
2018-01-01
The continual supply of new items is crucial to maintaining quality for many tests. Automatic item generation (AIG) has the potential to rapidly increase the number of items that are available. However, the efficiency of AIG will be mitigated if the generated items must be submitted to traditional, time-consuming review processes. In two studies,…
41 CFR 101-27.204 - Types of shelf-life items.
Code of Federal Regulations, 2014 CFR
2014-07-01
... 41 Public Contracts and Property Management 2 2014-07-01 2012-07-01 true Types of shelf-life items...-Management of Shelf-Life Materials § 101-27.204 Types of shelf-life items. Shelf-life items are classified as nonextendable (Type I) and extendable (Type II). Type I items have a definite storage life after which the item...
41 CFR 101-27.204 - Types of shelf-life items.
Code of Federal Regulations, 2013 CFR
2013-07-01
... 41 Public Contracts and Property Management 2 2013-07-01 2012-07-01 true Types of shelf-life items...-Management of Shelf-Life Materials § 101-27.204 Types of shelf-life items. Shelf-life items are classified as nonextendable (Type I) and extendable (Type II). Type I items have a definite storage life after which the item...
Code of Federal Regulations, 2011 CFR
2011-04-01
... 17 Commodity and Securities Exchanges 3 2011-04-01 2011-04-01 false Inclusion of items, differentiation between items and answers, omission of instructions. 260.7a-16 Section 260.7a-16 Commodity and... INDENTURE ACT OF 1939 Formal Requirements § 260.7a-16 Inclusion of items, differentiation between items and...
Kisala, Pamela A.; Victorson, David; Pace, Natalie; Heinemann, Allen W.; Choi, Seung W.; Tulsky, David S.
2015-01-01
Objective To describe the development and psychometric properties of the SCI-QOL Psychological Trauma item bank and short form. Design Using a mixed-methods design, we developed and tested a Psychological Trauma item bank with patient and provider focus groups, cognitive interviews, and item response theory based analytic approaches, including tests of model fit, differential item functioning (DIF) and precision. Setting We tested a 31-item pool at several medical institutions across the United States, including the University of Michigan, Kessler Foundation, Rehabilitation Institute of Chicago, the University of Washington, Craig Hospital and the James J. Peters/Bronx Veterans Administration hospital. Participants A total of 716 individuals with SCI completed the trauma items Results The 31 items fit a unidimensional model (CFI=0.952; RMSEA=0.061) and demonstrated good precision (theta range between 0.6 and 2.5). Nine items demonstrated negligible DIF with little impact on score estimates. The final calibrated item bank contains 19 items Conclusion The SCI-QOL Psychological Trauma item bank is a psychometrically robust measurement tool from which a short form and a computer adaptive test (CAT) version are available. PMID:26010967
Method of locating related items in a geometric space for data mining
Hendrickson, B.A.
1999-07-27
A method for locating related items in a geometric space transforms relationships among items to geometric locations. The method locates items in the geometric space so that the distance between items corresponds to the degree of relatedness. The method facilitates communication of the structure of the relationships among the items. The method is especially beneficial for communicating databases with many items, and with non-regular relationship patterns. Examples of such databases include databases containing items such as scientific papers or patents, related by citations or keywords. A computer system adapted for practice of the present invention can include a processor, a storage subsystem, a display device, and computer software to direct the location and display of the entities. The method comprises assigning numeric values as a measure of similarity between each pairing of items. A matrix is constructed, based on the numeric values. The eigenvectors and eigenvalues of the matrix are determined. Each item is located in the geometric space at coordinates determined from the eigenvectors and eigenvalues. Proper construction of the matrix and proper determination of coordinates from eigenvectors can ensure that distance between items in the geometric space is representative of the numeric value measure of the items' similarity. 12 figs.
Item response theory analyses of the Delis-Kaplan Executive Function System card sorting subtest.
Spencer, Mercedes; Cho, Sun-Joo; Cutting, Laurie E
2018-02-02
In the current study, we examined the dimensionality of the 16-item Card Sorting subtest of the Delis-Kaplan Executive Functioning System assessment in a sample of 264 native English-speaking children between the ages of 9 and 15 years. We also tested for measurement invariance for these items across age and gender groups using item response theory (IRT). Results of the exploratory factor analysis indicated that a two-factor model that distinguished between verbal and perceptual items provided the best fit to the data. Although the items demonstrated measurement invariance across age groups, measurement invariance was violated for gender groups, with two items demonstrating differential item functioning for males and females. Multigroup analysis using all 16 items indicated that the items were more effective for individuals whose IRT scale scores were relatively high. A single-group explanatory IRT model using 14 non-differential item functioning items showed that for perceptual ability, females scored higher than males and that scores increased with age for both males and females; for verbal ability, the observed increase in scores across age differed for males and females. The implications of these findings are discussed.
Method of locating related items in a geometric space for data mining
Hendrickson, Bruce A.
1999-01-01
A method for locating related items in a geometric space transforms relationships among items to geometric locations. The method locates items in the geometric space so that the distance between items corresponds to the degree of relatedness. The method facilitates communication of the structure of the relationships among the items. The method is especially beneficial for communicating databases with many items, and with non-regular relationship patterns. Examples of such databases include databases containing items such as scientific papers or patents, related by citations or keywords. A computer system adapted for practice of the present invention can include a processor, a storage subsystem, a display device, and computer software to direct the location and display of the entities. The method comprises assigning numeric values as a measure of similarity between each pairing of items. A matrix is constructed, based on the numeric values. The eigenvectors and eigenvalues of the matrix are determined. Each item is located in the geometric space at coordinates determined from the eigenvectors and eigenvalues. Proper construction of the matrix and proper determination of coordinates from eigenvectors can ensure that distance between items in the geometric space is representative of the numeric value measure of the items' similarity.
Assessing psychological well-being: self-report instruments for the NIH Toolbox.
Salsman, John M; Lai, Jin-Shei; Hendrie, Hugh C; Butt, Zeeshan; Zill, Nicholas; Pilkonis, Paul A; Peterson, Christopher; Stoney, Catherine M; Brouwers, Pim; Cella, David
2014-02-01
Psychological well-being (PWB) has a significant relationship with physical and mental health. As a part of the NIH Toolbox for the Assessment of Neurological and Behavioral Function, we developed self-report item banks and short forms to assess PWB. Expert feedback and literature review informed the selection of PWB concepts and the development of item pools for positive affect, life satisfaction, and meaning and purpose. Items were tested with a community-dwelling US Internet panel sample of adults aged 18 and above (N = 552). Classical and item response theory (IRT) approaches were used to evaluate unidimensionality, fit of items to the overall measure, and calibrations of those items, including differential item function (DIF). IRT-calibrated item banks were produced for positive affect (34 items), life satisfaction (16 items), and meaning and purpose (18 items). Their psychometric properties were supported based on the results of factor analysis, fit statistics, and DIF evaluation. All banks measured the concepts precisely (reliability ≥0.90) for more than 98% of participants. These adult scales and item banks for PWB provide the flexibility, efficiency, and precision necessary to promote future epidemiological, observational, and intervention research on the relationship of PWB with physical and mental health.
Vaughn, Kalif E; Rawson, Katherine A; Pyc, Mary A
2013-12-01
A wealth of previous research has established that retrieval practice promotes memory, particularly when retrieval is successful. Although successful retrieval promotes memory, it remains unclear whether successful retrieval promotes memory equally well for items of varying difficulty. Will easy items still outperform difficult items on a final test if all items have been correctly recalled equal numbers of times during practice? In two experiments, normatively difficult and easy Lithuanian-English word pairs were learned via test-restudy practice until each item had been correctly recalled a preassigned number of times (from 1 to 11 correct recalls). Despite equating the numbers of successful recalls during practice, performance on a delayed final cued-recall test was lower for difficult than for easy items. Experiment 2 was designed to diagnose whether the disadvantage for difficult items was due to deficits in cue memory, target memory, and/or associative memory. The results revealed a disadvantage for the difficult versus the easy items only on the associative recognition test, with no differences on cue recognition, and even an advantage on target recognition. Although successful retrieval enhanced memory for both difficult and easy items, equating retrieval success during practice did not eliminate normative item difficulty differences.
Restricted interests and teacher presentation of items.
Stocco, Corey S; Thompson, Rachel H; Rodriguez, Nicole M
2011-01-01
Restricted and repetitive behavior (RRB) is more pervasive, prevalent, frequent, and severe in individuals with autism spectrum disorders (ASDs) than in their typical peers. One subtype of RRB is restricted interests in items or activities, which is evident in the manner in which individuals engage with items (e.g., repetitious wheel spinning), the types of items or activities they select (e.g., preoccupation with a phone book), or the range of items or activities they select (i.e., narrow range of items). We sought to describe the relation between restricted interests and teacher presentation of items. Overall, we observed 5 teachers interacting with 2 pairs of students diagnosed with an ASD. Each pair included 1 student with restricted interests. During these observations, teachers were free to present any items from an array of 4 stimuli selected by experimenters. We recorded student responses to teacher presentation of items and analyzed the data to determine the relation between teacher presentation of items and the consequences for presentation provided by the students. Teacher presentation of items corresponded with differential responses provided by students with ASD, and those with restricted preferences experienced a narrower array of items.
Selected list of books and journals for the small medical library.
Brandon, A N
1977-01-01
This revised list of 472 books and 138 journals is intended as a selection guide for small or medium-sized hospital libraries or for the small medical library serving a specified clientele. It can also be used as a core list by small hospital library consortia. Books and journals are categorized by subject, with the books being followed by an author index and the journals by an alphabetical title listing. Items suggested for initial purchase by smaller libraries are indicated by an asterisk. To purchase the entire collection of books and to pay for annual subscriptions to all the journals would require an expenditure of about $18,200. The cost of only the asterisked items recommended for first purchase totals approximately $4,500. PMID:321057
Selected list of books and journals for the small medical library.
Brandon, A N; Hill, D R
1979-01-01
This revised list of 492 books and 138 journals is intended as a selection guide for small or medium-sized hospital libraries or for the small medical library serving a specified clientele. It can also be used as a core list by small hospital library consortia. Books and journals are categorized by subject, with the books being followed by an author index and the journals by an alphabetical title listing. Items suggested for initial purchase by smaller libraries are indicated by an asterisk. To purchase the entire collection of books and to pay for annual subscriptions to all the journals would require an expenditure of about $22,500. The cost of only the asterisked items, recommended for first purchase, totals approximately $6,100. PMID:380695
Selected list of books and journals for the small medical library.
Brandon, A N; Hill, D R
1981-04-01
This revised list of 539 books and 136 journals is intended as a selection guide for small or medium-sized hospital libraries or for small medical libraries in comparable health care facilities. It can also be used as a core list by consortia of small hospital libraries. Books and journals are categorized by subject; the book list is followed by an author index and the list of journals by an alphabetical title listing. Items suggested for initial purchase by smaller libraries, 137 books and 54 journals, are indicated by asterisks. To purchase the entire collection of books and to pay for annual subscriptions to all the journals would require an expenditure of about $30,000. The cost of only the asterisked items, which are recommended for first purchase, totals approximately $8,900.
Towards a Model of Contemporary Parenting: The Parenting Behaviours and Dimensions Questionnaire
Reid, Carly A. Y.; Roberts, Lynne D.; Roberts, Clare M.; Piek, Jan P.
2015-01-01
The assessment of parenting has been problematic due to theoretical disagreement, concerns over generalisability, and problems with the psychometric properties of current parenting measures. The aim of this study was to develop a comprehensive, psychometrically sound self-report parenting measure for use with parents of preadolescent children, and to use this empirical scale development process to identify the core dimensions of contemporary parenting behaviour. Following item generation and parent review, 846 parents completed an online survey comprising 116 parenting items. Exploratory and confirmatory factor analyses supported a six factor parenting model, comprising Emotional Warmth, Punitive Discipline, Anxious Intrusiveness, Autonomy Support, Permissive Discipline and Democratic Discipline. This measure will allow for the comprehensive and consistent assessment of parenting in future research and practice. PMID:26043107
Core verbal working-memory capacity: the limit in words retained without covert articulation.
Chen, Zhijian; Cowan, Nelson
2009-07-01
Verbal working memory may combine phonological and conceptual units. We disentangle their contributions by extending a prior procedure (Chen & Cowan, 2005) in which items recalled from lists of previously seen word singletons and of previously learned word pairs depended on the list length in chunks. Here we show that a constant capacity of about 3 chunks holds across list lengths and list types, provided that covert phonological rehearsal is prevented. What remains is a core verbal working-memory capacity.
[Mokken scaling of the Cognitive Screening Test].
Diesfeldt, H F A
2009-10-01
The Cognitive Screening Test (CST) is a twenty-item orientation questionnaire in Dutch, that is commonly used to evaluate cognitive impairment. This study applied Mokken Scale Analysis, a non-parametric set of techniques derived from item response theory (IRT), to CST-data of 466 consecutive participants in psychogeriatric day care. The full item set and the standard short version of fourteen items both met the assumptions of the monotone homogeneity model, with scalability coefficient H = 0.39, which is considered weak. In order to select items that would fulfil the assumption of invariant item ordering or the double monotonicity model, the subjects were randomly partitioned into a training set (50% of the sample) and a test set (the remaining half). By means of an automated item selection eleven items were found to measure one latent trait, with H = 0.67 and item H coefficients larger than 0.51. Cross-validation of the item analysis in the remaining half of the subjects gave comparable values (H = 0.66; item H coefficients larger than 0.56). The selected items involve year, place of residence, birth date, the monarch's and prime minister's names, and their predecessors. Applying optimal discriminant analysis (ODA) it was found that the full set of twenty CST items performed best in distinguishing two predefined groups of patients of lower or higher cognitive ability, as established by an independent criterion derived from the Amsterdam Dementia Screening Test. The chance corrected predictive value or prognostic utility was 47.5% for the full item set, 45.2% for the fourteen items of the standard short version of the CST, and 46.1% for the homogeneous, unidimensional set of selected eleven items. The results of the item analysis support the application of the CST in cognitive assessment, and revealed a more reliable 'short' version of the CST than the standard short version (CST14).
Development and Initial Validation of Military Deployment-Related TBI Quality-of-Life Item Banks.
Toyinbo, Peter A; Vanderploeg, Rodney D; Donnell, Alison J; Mutolo, Sandra A; Cook, Karon F; Kisala, Pamela A; Tulsky, David S
2016-01-01
To investigate unique factors that affect health-related quality of life (QOL) in individuals with military deployment-related traumatic brain injury (MDR-TBI) and to develop appropriate assessment tools, consistent with the TBI-QOL/PROMIS/Neuro-QOL systems. Three focus groups from each of the 4 Veterans Administration (VA) Polytrauma Rehabilitation Centers, consisting of 20 veterans with mild to severe MDR-TBI, and 36 VA providers were involved in early stage of new item banks development. The item banks were field tested in a sample (N = 485) of veterans enrolled in VA and diagnosed with an MDR-TBI. Focus groups and survey. Developed item banks and short forms for Guilt, Posttraumatic Stress Disorder/Trauma, and Military-Related Loss. Three new item banks representing unique domains of MDR-TBI health outcomes were created: 15 new Posttraumatic Stress Disorder items plus 16 SCI-QOL legacy Trauma items, 37 new Military-Related Loss items plus 18 TBI-QOL legacy Grief/Loss items, and 33 new Guilt items. Exploratory and confirmatory factor analyses plus bifactor analysis of the items supported sufficient unidimensionality of the new item pools. Convergent and discriminant analyses results, as well as known group comparisons, provided initial support for the validity and clinical utility of the new item response theory-calibrated item banks and their short forms. This work provides a unique opportunity to identify issues specific to individuals with MDR-TBI and ensure that they are captured in QOL assessment, thus extending the existing TBI-QOL measurement system.
On the Relationship Between Tooth Shape and Masticatory Efficiency: A Finite Element Study.
Berthaume, Michael A
2016-05-01
Dental topography has successfully linked disparate tooth shapes to distinct dietary categories, but not to masticatory efficiency. Here, the relationship between four dental topographic metrics and brittle food item breakdown efficiency during compressive biting was investigated using a parametric finite element model of a bunodont molar. Food item breakdown efficiency was chosen to represent masticatory efficiency as it isolated tooth-food item interactions, where most other categories of masticatory efficiency include several aspects of the masticatory process. As relative food item size may affect the presence/absence of any relationship, four isometrically scaled, hemispherical, proxy food items were considered. Topographic metrics were uncorrelated to food item breakdown efficiency irrespective of relative food item size, and dental topographic metrics were largely uncorrelated to one another. The lack of a correlation between topographic metrics and food item breakdown efficiency is not unexpected as not all food items break down in the same manner (e.g., nuts are crushed, leaves are sheared), and only one food item shape was considered. In addition, food item breakdown efficiency describes tooth-food item interactions and requires location and shape specific information, which are absent from dental topographic metrics. This makes it unlikely any one efficiency metric will be correlated to all topographic metrics. These results emphasize the need to take into account how food items break down during biting, ingestion, and mastication when investigating the mechanical relationship between food item shape, size, mechanical properties, and breakdown, and tooth shape. © 2016 Wiley Periodicals, Inc.
Recall dynamics reveal the retrieval of emotional context
Long, Nicole M.; Danoff, Michelle S.
2015-01-01
Memory is often better for emotional rather than neutral stimuli. The benefit for emotional items could be the result of an associative mechanism whereby items are associated to a slowly updating context. Through this process, emotional features are integrated with context during study, and are reactivated during test. The presence of emotion in context would both provide a stronger retrieval cue, enhancing memory of emotional items, as well as lead to emotional clustering, whereby emotionally similar items are recalled consecutively. To measure whether associative mechanisms can explain the enhancement for emotional items, we conducted a free recall study in which most items were emotionally neutral to minimize effects of mood induction and to more closely reflect naturalistic settings. We found that emotional items were significantly more likely to be recalled than neutral items and that participants were more likely to transition between emotional items rather than between emotional and neutral items. Together, these results suggest that contextual encoding and retrieval mechanisms may drive the benefit for emotional items both within and outside the laboratory. PMID:25604771
Psychometrics of the self-report safe driving behavior measure for older adults.
Classen, Sherrilene; Wen, Pey-Shan; Velozo, Craig A; Bédard, Michel; Winter, Sandra M; Brumback, Babette; Lanford, Desiree N
2012-01-01
We investigated the psychometric properties of the 68-item Safe Driving Behavior Measure (SDBM) with 80 older drivers, 80 caregivers, and 2 evaluators from two sites. Using Rasch analysis, we examined unidimensionality and local dependence; rating scale; item- and person-level psychometrics; and item hierarchy of older drivers, caregivers, and driving evaluators who had completed the SDBM. The evidence suggested the SDBM is unidimensional, but pairs of items showed local dependency. Across the three rater groups, the data showed good person (≥3.4) and item (≥3.6) separation as well as good person (≥.93) and item reliability (≥.92). Cronbach's α was ≥.96, and few items were misfitting. Some of the items did not follow the hypothesized order of item difficulty. The SDBM classified the older drivers into six ability levels, but to fully calibrate the instrument it must be refined in terms of its items (e.g., item exclusion) and then tested among participants of lesser ability. Copyright © 2012 by the American Occupational Therapy Association, Inc.
ERIC Educational Resources Information Center
Sikstrom, Sverker
2006-01-01
An item that stands out (is isolated) from its context is better remembered than an item consistent with the context. This isolation effect cannot be accounted for by increased attention, because it occurs when the isolated item is presented as the first item, or by impoverished memory of nonisolated items, because the isolated item is better…
ERIC Educational Resources Information Center
Snyder, James
2010-01-01
This dissertation research examined the changes in item RIT calibration that occurred when adding audio to a set of currently calibrated RIT items and then placing these new items as field test items in the modified assessments on the NWEA MAP test platform. The researcher used test results from over 600 students in the Poway School District in…
ERIC Educational Resources Information Center
Michaelides, Michalis P.
2006-01-01
Consistent behavior is a desirable characteristic that common items are expected to have when administered to different groups. Findings from the literature have established that items do not always behave in consistent ways; item indices and IRT item parameter estimates of the same items differ when obtained from different administrations.…
Towards shared patient records: an architecture for using routine data for nationwide research.
Knaup, Petra; Garde, Sebastian; Merzweiler, Angela; Graf, Norbert; Schilling, Freimut; Weber, Ralf; Haux, Reinhold
2006-01-01
Ubiquitous information is currently one of the most challenging slogans in medical informatics research. An adequate architecture for shared electronic patient records is needed which can use data for multiple purposes and which is extensible for new research questions. We introduce eardap as architecture for using routine data for nationwide clinical research in a multihospital environment. eardap can be characterized as terminology-based. Main advantage of our approach is the extensibility by new items and new research questions. Once the definition of items for a research question is finished, a consistent, corresponding database can be created without any informatics skills. Our experiences in pediatric oncology in Germany have shown the applicability of eardap. The functions of our core system were in routine clinical use in several hospitals. We validated the terminology management system (TMS) and the module generation tool with the basic data set of pediatric oncology. The multiple usability depends mainly on the quality of item planning in the TMS. High quality harmonization will lead to a higher amount of multiply used data. When using eardap, special emphasis is to be placed on interfaces to local hospital information systems and data security issues.
Assessing child and adolescent pragmatic language competencies: toward evidence-based assessments.
Russell, Robert L; Grizzle, Kenneth L
2008-06-01
Using language appropriately and effectively in social contexts requires pragmatic language competencies (PLCs). Increasingly, deficits in PLCs are linked to child and adolescent disorders, including autism spectrum, externalizing, and internalizing disorders. As the role of PLCs expands in diagnosis and treatment of developmental psychopathology, psychologists and educators will need to appraise and select clinical and research PLC instruments for use in assessments and/or studies. To assist in this appraisal, 24 PLC instruments, containing 1,082 items, are assessed by addressing four questions: (1) Can PLC domains targeted by assessment items be reliably identified?, (2) What are the core PLC domains that emerge across the 24 instruments?, (3) Do PLC questionnaires and tests assess similar PLC domains?, and (4) Do the instruments achieve content, structural, diagnostic, and ecological validity? Results indicate that test and questionnaire items can be reliably categorized into PLC domains, that PLC domains featured in questionnaires and tests significantly differ, and that PLC instruments need empirical confirmation of their dimensional structure, content validity across all developmental age bands, and ecological validity. Progress in building a better evidence base for PLC assessments should be a priority in future research.
Development and validation of the Vietnamese primary care assessment tool.
Hoa, Nguyen Thi; Tam, Nguyen Minh; Peersman, Wim; Derese, Anselme; Markuns, Jeffrey F
2018-01-01
To adapt the consumer version of the Primary Care Assessment Tool (PCAT) for Vietnam and determine its internal consistency and validity. A quantitative cross sectional study. 56 communes in 3 representative provinces of central Vietnam. Total of 3289 people who used health care services at health facility at least once over the past two years. The Vietnamese adult expanded consumer version of the PCAT (VN PCAT-AE) is an instrument for evaluation of primary care in Vietnam with 70 items comprising six scales representing four core primary care domains, and three additional scales representing three derivative domains. Sixteen other items from the original tool were not included in the final instrument, due to problems with missing values, floor or ceiling effects, and item-total correlations. All the retained scales have a Cronbach's alpha above 0.70 except for the subscale of Family Centeredness. The VN PCAT-AE demonstrates adequate internal consistency and validity to be used as an effective tool for measuring the quality of primary care in Vietnam from the consumer perspective. Additional work in the future to optimize valid measurement in all domains consistent with the original version of the tool may be helpful as the primary care system in Vietnam further develops.
Child and Adolescent Perceptions of Oral Health Over the Life Course
Maida, Carl A.; Marcus, Marvin; Hays, Ron D.; Coulter, Ian D.; Ramos-Gomez, Francisco; Lee, Steve Y.; McClory, Patricia S.; Van, Laura V.; Wang, Yan; Shen, Jie; Cai, Li; Spolsky, Vladimir W.; Crall, James J.; Liu, Honghu
2016-01-01
Purpose To elicit perceptions of oral health in children and adolescents as an initial step in the in the development of oral health item banks for the Patient-Reported Oral Health Outcomes Measurement Information System project. Methods We conducted focus groups with ethnically, socioeconomically, and geographically diverse youth (8-12, 13-17 years) to identify perceptions of oral health status. We performed content analysis, including a thematic and narrative analysis, to identify important themes. Results We identified three unique themes that the youth associated with their oral health status: 1) understanding the value of maintaining good oral health over the life course, with respect to longevity and quality of life in the adult years; 2) positive association between maintaining good oral health and interpersonal relationships at school, and dating, for older youth; and 3) knowledge of the benefits of orthodontic treatment to appearance and positive self-image, while holding a strong view as to the discomfort associated with braces. Conclusions The results provide valuable information about core domains for the oral health item banks to be developed and generated content for new items to be developed and evaluated with cognitive interviews and in a field test. PMID:26038216
Measuring Advance Care Planning: Optimizing the Advance Care Planning Engagement Survey.
Sudore, Rebecca L; Heyland, Daren K; Barnes, Deborah E; Howard, Michelle; Fassbender, Konrad; Robinson, Carole A; Boscardin, John; You, John J
2017-04-01
A validated 82-item Advance Care Planning (ACP) Engagement Survey measures a broad range of behaviors. However, concise surveys are needed. The objective of this study was to validate shorter versions of the survey. The survey included 57 process (e.g., readiness) and 25 action items (e.g., discussions). For item reduction, we systematically eliminated questions based on face validity, item nonresponse, redundancy, ceiling effects, and factor analysis. We assessed internal consistency (Cronbach's alpha) and construct validity with cross-sectional correlations and the ability of the progressively shorter survey versions to detect change one week after exposure to an ACP intervention (Pearson correlation coefficients). Five hundred one participants (four Canadian and three US sites) were included in item reduction (mean age 69 years [±10], 41% nonwhite). Because of high correlations between readiness and action items, all action items were removed. Because of high correlations and ceiling effects, two process items were removed. Successive factor analysis then created 55-, 34-, 15-, nine-, and four-item versions; 664 participants (from three US ACP clinical trials) were included in validity analysis (age 65 years [±8], 72% nonwhite, 34% Spanish speaking). Cronbach's alphas were high for all versions (four items 0.84-55 items 0.97). Compared with the original survey, cross-sectional correlations were high (four items 0.85; 55 items 0.97) as were delta correlations (four items 0.68; 55 items 0.93). Shorter versions of the ACP Engagement Survey are valid, internally consistent, and able to detect change across a broad range of ACP behaviors for English and Spanish speakers. Shorter ACP surveys can efficiently measure broad ACP behaviors in research and clinical settings. Published by Elsevier Inc.
Austvoll-Dahlgren, Astrid; Guttersrud, Øystein; Nsangi, Allen; Semakula, Daniel; Oxman, Andrew D
2017-05-25
The Claim Evaluation Tools database contains multiple-choice items for measuring people's ability to apply the key concepts they need to know to be able to assess treatment claims. We assessed items from the database using Rasch analysis to develop an outcome measure to be used in two randomised trials in Uganda. Rasch analysis is a form of psychometric testing relying on Item Response Theory. It is a dynamic way of developing outcome measures that are valid and reliable. To assess the validity, reliability and responsiveness of 88 items addressing 22 key concepts using Rasch analysis. We administrated four sets of multiple-choice items in English to 1114 people in Uganda and Norway, of which 685 were children and 429 were adults (including 171 health professionals). We scored all items dichotomously. We explored summary and individual fit statistics using the RUMM2030 analysis package. We used SPSS to perform distractor analysis. Most items conformed well to the Rasch model, but some items needed revision. Overall, the four item sets had satisfactory reliability. We did not identify significant response dependence between any pairs of items and, overall, the magnitude of multidimensionality in the data was acceptable. The items had a high level of difficulty. Most of the items conformed well to the Rasch model's expectations. Following revision of some items, we concluded that most of the items were suitable for use in an outcome measure for evaluating the ability of children or adults to assess treatment claims. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Paschoal, Sérgio Márcio Pacheco; Filho, Wilson Jacob; Litvoc, Júlio
2008-01-01
OBJECTIVE To describe item reduction and its distribution into dimensions in the construction process of a quality of life evaluation instrument for the elderly. METHODS The sampling method was chosen by convenience through quotas, with selection of elderly subjects from four programs to achieve heterogeneity in the “health status”, “functional capacity”, “gender”, and “age” variables. The Clinical Impact Method was used, consisting of the spontaneous and elicited selection by the respondents of relevant items to the construct Quality of Life in Old Age from a previously elaborated item pool. The respondents rated each item’s importance using a 5-point Likert scale. The product of the proportion of elderly selecting the item as relevant (frequency) and the mean importance score they attributed to it (importance) represented the overall impact of that item in their quality of life (impact). The items were ordered according to their impact scores and the top 46 scoring items were grouped in dimensions by three experts. A review of the negative items was performed. RESULTS One hundred and ninety three people (122 women and 71 men) were interviewed. Experts distributed the 46 items into eight dimensions. Closely related items were grouped and dimensions not reaching the minimum expected number of items received additional items resulting in eight dimensions and 43 items. DISCUSSION The sample was heterogeneous and similar to what was expected. The dimensions and items demonstrated the multidimensionality of the construct. The Clinical Impact Method was appropriate to construct the instrument, which was named Elderly Quality of Life Index - EQoLI. An accuracy process will be examined in the future. PMID:18438571
Transitional probabilities count more than frequency, but might not be used for memorization.
Endress, Ansgar D; Langus, Alan
2017-02-01
Learners often need to extract recurring items from continuous sequences, in both vision and audition. The best-known example is probably found in word-learning, where listeners have to determine where words start and end in fluent speech. This could be achieved through universal and experience-independent statistical mechanisms, for example by relying on Transitional Probabilities (TPs). Further, these mechanisms might allow learners to store items in memory. However, previous investigations have yielded conflicting evidence as to whether a sensitivity to TPs is diagnostic of the memorization of recurring items. Here, we address this issue in the visual modality. Participants were familiarized with a continuous sequence of visual items (i.e., arbitrary or everyday symbols), and then had to choose between (i) high-TP items that appeared in the sequence, (ii) high-TP items that did not appear in the sequence, and (iii) low-TP items that appeared in the sequence. Items matched in TPs but differing in (chunk) frequency were much harder to discriminate than items differing in TPs (with no significant sensitivity to chunk frequency), and learners preferred unattested high-TP items over attested low-TP items. Contrary to previous claims, these results cannot be explained on the basis of the similarity of the test items. Learners thus weigh within-item TPs higher than the frequency of the chunks, even when the TP differences are relatively subtle. We argue that these results are problematic for distributional clustering mechanisms that analyze continuous sequences, and provide supporting computational results. We suggest that the role of TPs might not be to memorize items per se, but rather to prepare learners to memorize recurring items once they are presented in subsequent learning situations with richer cues. Copyright © 2016 Elsevier Inc. All rights reserved.
Holman, Rebecca; Glas, Cees AW; Lindeboom, Robert; Zwinderman, Aeilko H; de Haan, Rob J
2004-01-01
Background Whenever questionnaires are used to collect data on constructs, such as functional status or health related quality of life, it is unlikely that all respondents will respond to all items. This paper examines ways of dealing with responses in a 'not applicable' category to items included in the AMC Linear Disability Score (ALDS) project item bank. Methods The data examined in this paper come from the responses of 392 respondents to 32 items and form part of the calibration sample for the ALDS item bank. The data are analysed using the one-parameter logistic item response theory model. The four practical strategies for dealing with this type of response are: cold deck imputation; hot deck imputation; treating the missing responses as if these items had never been offered to those individual patients; and using a model which takes account of the 'tendency to respond to items'. Results The item and respondent population parameter estimates were very similar for the strategies involving hot deck imputation; treating the missing responses as if these items had never been offered to those individual patients; and using a model which takes account of the 'tendency to respond to items'. The estimates obtained using the cold deck imputation method were substantially different. Conclusions The cold deck imputation method was not considered suitable for use in the ALDS item bank. The other three methods described can be usefully implemented in the ALDS item bank, depending on the purpose of the data analysis to be carried out. These three methods may be useful for other data sets examining similar constructs, when item response theory based methods are used. PMID:15200681
de Sá Junior, Antonio Reis; de Andrade, Arthur Guerra; Andrade, Laura Helena; Gorenstein, Clarice; Wang, Yuan-Pang
2018-07-01
This study examines the response pattern of depressive symptoms in a nationwide student sample, through item analyses of a rating scale by both classical test theory (CTT) and item response theory (IRT). The 21-item Beck Depression Inventory-II (BDI-II) was administered to 12,711 college students. First, the psychometric properties of the scale were described. Thereafter, the endorsement probability of depressive symptom in each scale item was analyzed through CTT and IRT. Graphical plots depicted the endorsement probability of scale items and intensity of depression. Three items of different difficulty level were compared through CTT and IRT approach. Four in five students reported the presence of depressive symptoms. The BDI-II items presented good reliability and were distributed along the symptomatic continuum of depression. Similarly, in both CTT and IRT approaches, the item 'changes in sleep' was easily endorsed, 'loss of interest' moderately and 'suicidal thoughts' hardly. Graphical representation of BDI-II of both methods showed much equivalence in terms of item discrimination and item difficulty. The item characteristic curve of the IRT method provided informative evaluation of item performance. The inventory was applied only in college students. Depressive symptoms were frequent psychopathological manifestations among college students. The performance of the BDI-II items indicated convergent results from both methods of analysis. While the CTT was easy to understand and to apply, the IRT was more complex to understand and to implement. Comprehensive assessment of the functioning of each BDI-II item might be helpful in efficient detection of depressive conditions in college students. Copyright © 2018 Elsevier B.V. All rights reserved.
Methodology for developing and evaluating the PROMIS smoking item banks.
Hansen, Mark; Cai, Li; Stucky, Brian D; Tucker, Joan S; Shadel, William G; Edelen, Maria Orlando
2014-09-01
This article describes the procedures used in the PROMIS Smoking Initiative for the development and evaluation of item banks, short forms (SFs), and computerized adaptive tests (CATs) for the assessment of 6 constructs related to cigarette smoking: nicotine dependence, coping expectancies, emotional and sensory expectancies, health expectancies, psychosocial expectancies, and social motivations for smoking. Analyses were conducted using response data from a large national sample of smokers. Items related to each construct were subjected to extensive item factor analyses and evaluation of differential item functioning (DIF). Final item banks were calibrated, and SF assessments were developed for each construct. The performance of the SFs and the potential use of the item banks for CAT administration were examined through simulation study. Item selection based on dimensionality assessment and DIF analyses produced item banks that were essentially unidimensional in structure and free of bias. Simulation studies demonstrated that the constructs could be accurately measured with a relatively small number of carefully selected items, either through fixed SFs or CAT-based assessment. Illustrative results are presented, and subsequent articles provide detailed discussion of each item bank in turn. The development of the PROMIS smoking item banks provides researchers with new tools for measuring smoking-related constructs. The use of the calibrated item banks and suggested SF assessments will enhance the quality of score estimates, thus advancing smoking research. Moreover, the methods used in the current study, including innovative approaches to item selection and SF construction, may have general relevance to item bank development and evaluation. © The Author 2013. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Development of the PROMIS health expectancies of smoking item banks.
Edelen, Maria Orlando; Tucker, Joan S; Shadel, William G; Stucky, Brian D; Cerully, Jennifer; Li, Zhen; Hansen, Mark; Cai, Li
2014-09-01
Smokers' health-related outcome expectancies are associated with a number of important constructs in smoking research, yet there are no measures currently available that focus exclusively on this domain. This paper describes the development and evaluation of item banks for assessing the health expectancies of smoking. Using data from a sample of daily (N = 4,201) and nondaily (N = 1,183) smokers, we conducted a series of item factor analyses, item response theory analyses, and differential item functioning analyses (according to gender, age, and race/ethnicity) to arrive at a unidimensional set of health expectancies items for daily and nondaily smokers. We also evaluated the performance of short forms (SFs) and computer adaptive tests (CATs) to efficiently assess health expectancies. A total of 24 items were included in the Health Expectancies item banks; 13 items are common across daily and nondaily smokers, 6 are unique to daily, and 5 are unique to nondaily. For both daily and nondaily smokers, the Health Expectancies item banks are unidimensional, reliable (reliability = 0.95 and 0.96, respectively), and perform similarly across gender, age, and race/ethnicity groups. A SF common to daily and nondaily smokers consists of 6 items (reliability = 0.87). Results from simulated CATs showed that health expectancies can be assessed with good precision with an average of 5-6 items adaptively selected from the item banks. Health expectancies of smoking can be assessed on the basis of these item banks via SFs, CATs, or through a tailored set of items selected for a specific research purpose. © The Author 2014. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Development of the PROMIS negative psychosocial expectancies of smoking item banks.
Stucky, Brian D; Edelen, Maria Orlando; Tucker, Joan S; Shadel, William G; Cerully, Jennifer; Kuhfeld, Megan; Hansen, Mark; Cai, Li
2014-09-01
Negative psychosocial expectancies of smoking include aspects of social disapproval and disappointment in oneself. This paper describes analyses conducted to develop and evaluate item banks for assessing psychosocial expectancies among daily and nondaily smokers. Using data from a sample of daily (N = 4,201) and nondaily (N =1,183) smokers, we conducted a series of item factor analyses, item response theory analyses, and differential item functioning analyses (according to gender, age, and race/ethnicity) to arrive at a unidimensional set of psychosocial expectancies items for daily and nondaily smokers. We also evaluated performance of short forms (SFs) and computer adaptive tests (CATs) to efficiently assess psychosocial expectancies. A total of 21 items were included in the Psychosocial Expectancies item banks: 14 items are common across daily and nondaily smokers, 6 are unique to daily, and 1 is unique to nondaily. For both daily and nondaily smokers, the Psychosocial Expectancies item banks are strongly unidimensional, highly reliable (reliability = 0.95 and 0.93, respectively), and perform similarly across gender, age, and race/ethnicity groups. A SF common to daily and nondaily smokers consists of 6 items (reliability = 0.85). Results from simulated CATs showed that, on average, fewer than 8 items are needed to assess psychosocial expectancies with adequate precision when using the item banks. Psychosocial expectancies of smoking can be assessed on the basis of these item banks via the SF, by using CAT, or through a tailored set of items selected for a specific research purpose. © The Author 2014. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Underestimating numerosity of items in visual search tasks.
Cassenti, Daniel N; Kelley, Troy D; Ghirardelli, Thomas G
2010-10-01
Previous research on numerosity judgments addressed attended items, while the present research addresses underestimation for unattended items in visual search tasks. One potential cause of underestimation for unattended items is that estimates of quantity may depend on viewing a large portion of the display within foveal vision. Another theory follows from the occupancy model: estimating quantity of items in greater proximity to one another increases the likelihood of an underestimation error. Three experimental manipulations addressed aspects of underestimation for unattended items: the size of the distracters, the distance of the target from fixation, and whether items were clustered together. Results suggested that the underestimation effect for unattended items was best explained within a Gestalt grouping framework.
Haggerty, Jeannie L.; Bouharaoui, Fatima; Santor, Darcy A.
2011-01-01
Evaluating the extent to which groups or subgroups of individuals differ with respect to primary healthcare experience depends on first ruling out the possibility of bias. Objective: To determine whether item or subscale performance differs systematically between French/English, high/low education subgroups and urban/rural residency. Method: A sample of 645 adult users balanced by French/English language (in Quebec and Nova Scotia, respectively), high/low education and urban/rural residency responded to six validated instruments: the Primary Care Assessment Survey (PCAS); the Primary Care Assessment Tool – Short Form (PCAT-S); the Components of Primary Care Index (CPCI); the first version of the EUROPEP (EUROPEP-I); the Interpersonal Processes of Care Survey, version II (IPC-II); and part of the Veterans Affairs National Outpatient Customer Satisfaction Survey (VANOCSS). We normalized subscale scores to a 0-to-10 scale and tested for between-group differences using ANOVA tests. We used a parametric item response model to test for differences between subgroups in item discriminability and item difficulty. We re-examined group differences after removing items with differential item functioning. Results: Experience of care was assessed more positively in the English-speaking (Nova Scotia) than in the French-speaking (Quebec) respondents. We found differential English/French item functioning in 48% of the 153 items: discriminability in 20% and differential difficulty in 28%. English items were more discriminating generally than the French. Removing problematic items did not change the differences in French/English assessments. Differential item functioning by high/low education status affected 27% of items, with items being generally more discriminating in high-education groups. Between-group comparisons were unchanged. In contrast, only 9% of items showed differential item functioning by geography, affecting principally the accessibility attribute. Removing problematic items reversed a previously non-significant finding, revealing poorer first-contact access in rural than in urban areas. Conclusion: Differential item functioning does not bias or invalidate French/English comparisons on subscales, but additional development is required to make French and English items equivalent. These instruments are relatively robust by educational status and geography, but results suggest potential differences in the underlying construct in low-education and rural respondents. PMID:23205035
Grilo Diogo, Pedro; Barbosa, Joselina; Ferreira, Maria Amélia
2015-12-19
The Tuning Project is an initiative funded by the European Commission that developed core competences for primary medical degrees in Europe. Students' grouped self-assessments are used for program evaluation and improvement of curricula. The TEST study aimed to assess how do Portuguese medical graduates self-assess their acquisition of core competences and experiences of contact with patients in core settings according to the Tuning framework. Translation of the Tuning's competences (Clinical Practice - CP), Knowledge (K) items and Clinical Settings (CS) was performed. Questionnaires were created in paper and electronic formats and distributed to 1591 graduates from seven Portuguese medical schools (July 2014). Items were rated in a 6-point Likert scale (0-5) of levels of competence. Exploratory factor analysis (EFA) was conducted and Cronbach's alpha was used to evaluate the internal consistency of the questionnaire. Kruskal-Wallis and Dunn's tests were used for multiple comparisons. Three hundred eighty seven questionnaires were analyzed, corresponding to 24% of the target population. EFA yielded an 11-factor solution for CP and a 6-factor solution for K items. The median value of CP factors was 2.8 (p25 = 2.0; p75 = 3.5) and the median value of K factors was 2.6 (2.0; 3.2). Factor scores ranged from 1.3 (Legal principles) to 4.0 (Ethical principles). Clinical presentations, psychological aspects of illness, evidence-based medicine and promotion of health showed the highest results. Lower scores were detected in medical emergencies, practical procedures, prescribing drugs and legal principles. More than 90% of graduates experienced having contact with patients in 8 CS but only 24% of graduates had contact in all 14 CS. Graduates had the least contact with patients in the emergency rooms, intensive care units, palliative, rehabilitation and anesthetic care. Significant differences (p < 0.05) among schools were detected in 8 factors and 7 settings. We developed a valid questionnaire supporting national SWOT analysis on the acquisition of core competences in medical education. Results suggest that Portuguese graduates are not fully prepared for clinical practice. Curricular improvements in core competences and the educational development of the transition period between undergraduate and postgraduate education ought to be considered. Outcome-based program evaluation relying on graduates' grouped self-assessments contributes to inform changes in medical education.
Multi-institutional validation of a web-based core competency assessment system.
Tabuenca, Arnold; Welling, Richard; Sachdeva, Ajit K; Blair, Patrice G; Horvath, Karen; Tarpley, John; Savino, John A; Gray, Richard; Gulley, Julie; Arnold, Teresa; Wolfe, Kevin; Risucci, Donald A
2007-01-01
The Association of Program Directors in Surgery and the Division of Education of the American College of Surgeons developed and implemented a web-based system for end-of-rotation faculty assessment of ACGME core competencies of residents. This study assesses its reliability and validity across multiple programs. Each assessment included ratings (1-5 scale) on 23 items reflecting the 6 core competencies. A total of 4241 end-of-rotation assessments were completed for 332 general surgery residents (> or =5 evaluations each) at 5 sites during the 2004-2005 and 2005-2006 academic years. The mean rating for each resident on each item was computed for each academic year. The mean rating of items representing each competency was computed for each resident. Additional data included USMLE and ABSITE scores, PGY, and status in program (categorical, designated preliminary, and undesignated preliminary). Coefficient alpha was greater than 0.90 for each competency score. Mean ratings for each competency increased significantly (p < 0.01) as a function of PGY. Mean ratings for professionalism and interpersonal/communication skills (IPC) were significantly higher than all other competencies at all PGY levels. Competency ratings of PGY 1 residents correlated significantly with USMLE Step I, ranging from (r = 0.26, p < 0.01) for Professionalism to (r = 0.41, p < 0.001) for Systems-Based Practice. Ratings of Knowledge (r = 0.31, p < 0.01), Practice-Based Learning & Improvement (PBLI; r = 0.22, p < 0.05), and Systems-Based Practice (r = 0.20, p < 0.05) correlated significantly with 2005 ABSITE Total Percentile. Ratings of all competencies correlated significantly with the 2006 ABSITE Total Percentile Score (range: r = 0.20, p < 0.05 for professionalism to r = 0.35, p < 0.001 for knowledge). Categorical and designated preliminary residents received significantly higher ratings (p < 0.05) than nondesignated preliminaries for knowledge, patient care, PBLI, and systems-based practice only. Faculty ratings of core competencies are internally consistent. The pattern of statistically significant correlations between competency ratings and USMLE and ABSITE scores supports the postdictive and concurrent validity, respectively, of faculty perceptions of resident knowledge. The pattern of increased ratings as a function of PGY supports the construct validity of faculty ratings of resident core competencies.
Pilkonis, Paul A.; Choi, Seung W.; Reise, Steven P.; Stover, Angela M.; Riley, William T.; Cella, David
2011-01-01
The authors report on the development and calibration of item banks for depression, anxiety, and anger as part of the Patient-Reported Outcomes Measurement Information System (PROMIS®). Comprehensive literature searches yielded an initial bank of 1,404 items from 305 instruments. After qualitative item analysis (including focus groups and cognitive interviewing), 168 items (56 for each construct) were written in a first person, past tense format with a 7-day time frame and five response options reflecting frequency. The calibration sample included nearly 15,000 respondents. Final banks of 28, 29, and 29 items were calibrated for depression, anxiety, and anger, respectively, using item response theory. Test information curves showed that the PROMIS item banks provided more information than conventional measures in a range of severity from approximately −1 to +3 standard deviations (with higher scores indicating greater distress). Short forms consisting of seven to eight items provided information comparable to legacy measures containing more items. PMID:21697139
Pilkonis, Paul A; Choi, Seung W; Reise, Steven P; Stover, Angela M; Riley, William T; Cella, David
2011-09-01
The authors report on the development and calibration of item banks for depression, anxiety, and anger as part of the Patient-Reported Outcomes Measurement Information System (PROMIS®). Comprehensive literature searches yielded an initial bank of 1,404 items from 305 instruments. After qualitative item analysis (including focus groups and cognitive interviewing), 168 items (56 for each construct) were written in a first person, past tense format with a 7-day time frame and five response options reflecting frequency. The calibration sample included nearly 15,000 respondents. Final banks of 28, 29, and 29 items were calibrated for depression, anxiety, and anger, respectively, using item response theory. Test information curves showed that the PROMIS item banks provided more information than conventional measures in a range of severity from approximately -1 to +3 standard deviations (with higher scores indicating greater distress). Short forms consisting of seven to eight items provided information comparable to legacy measures containing more items.
Chan, Kitty S; Gross, Alden L; Pezzin, Liliana E; Brandt, Jason; Kasper, Judith D
2015-12-01
To harmonize measures of cognitive performance using item response theory (IRT) across two international aging studies. Data for persons ≥65 years from the Health and Retirement Study (HRS, N = 9,471) and the English Longitudinal Study of Aging (ELSA, N = 5,444). Cognitive performance measures varied (HRS fielded 25, ELSA 13); 9 were in common. Measurement precision was examined for IRT scores based on (a) common items, (b) common items adjusted for differential item functioning (DIF), and (c) DIF-adjusted all items. Three common items (day of date, immediate word recall, and delayed word recall) demonstrated DIF by survey. Adding survey-specific items improved precision but mainly for HRS respondents at lower cognitive levels. IRT offers a feasible strategy for harmonizing cognitive performance measures across other surveys and for other multi-item constructs of interest in studies of aging. Practical implications depend on sample distribution and the difficulty mix of in-common and survey-specific items. © The Author(s) 2015.
Mokken scaling of the Myocardial Infarction Dimensional Assessment Scale (MIDAS).
Thompson, David R; Watson, Roger
2011-02-01
The purpose of this study was to examine the hierarchical and cumulative nature of the 35 items of the Myocardial Infarction Dimensional Assessment Scale (MIDAS), a disease-specific health-related quality of life measure. Data from 668 participants who completed the MIDAS were analysed using the Mokken Scaling Procedure, which is a computer program that searches polychotomous data for hierarchical and cumulative scales on the basis of a range of diagnostic criteria. Fourteen MIDAS items were retained in a Mokken scale and these items included physical activity, insecurity, emotional reaction and dependency items but excluded items related to diet, medication or side-effects. Item difficulty, in item response theory terms, ran from physical activity items (low difficulty) to insecurity, suggesting that the most severe quality of life effect of myocardial infarction is loneliness and isolation. Items from the MIDAS form a strong and reliable Mokken scale, which provides new insight into the relationship between items in the MIDAS and the measurement of quality of life after myocardial infarction. © 2010 Blackwell Publishing Ltd.
Okada, Takayuki
2013-01-01
The author suggested that it is essential for lawyers and psychiatrists to have a common understanding of the mutual division of roles between them when determining criminal responsibility (CR) and, for this purpose, proposed an 8-step structured CR decision-making process. The 8 steps are: (1) gathering of information related to mental function and condition, (2) recognition of mental function and condition,(3) psychiatric diagnosis, (4) description of the relationship between psychiatric symptom or psychopathology and index offense, (5) focus on capacities of differentiation between right and wrong and behavioral control, (6) specification of elements of cognitive/volitional prong in legal context, (7) legal evaluation of degree of cognitive/volitional prong, and (8) final interpretation of CR as a legal conclusion. The author suggested that the CR decision-making process should proceed not in a step-like pattern from (1) to (2) to (3) to (8), but in a step-like pattern from (1) to (2) to (4) to (5) to (6) to (7) to (8), and that not steps after (5), which require the interpretation or the application of section 39 of the Penal Code, but Step (4), must be the core of psychiatric expert evidence. When explaining the relationship between the mental disorder and offense described in Step (4), the Seven Focal Points (7FP) are often used. The author urged basic precautions to prevent the misuse of 7FP, which are: (a) the priority of each item is not equal and the relative importance differs from case to case; (b) each item is not exclusively independent, there may be overlap between items; (c) the criminal responsibility shall not be judged because one item is applicable or because a number of items are applicable, i. e., 7FP are not "criteria," for example, the aim is not to decide such things as 'the motive is understandable' or 'the conduct is appropriate', but should be to describe how psychopathological factors affected the offense specifically in the context of understandability of motive or appropriateness of conduct; (d) it is essential to evaluate each item from a neutral point of view rather than only from one perspective, for example, looking at the case from the aspects of both comprehensibility and incomprehensibility of motive or from aspects of both oriented, purposeful, organized behavior and disoriented, purposeless, disorganized behavior during the offense; (e) depending on the case, there are some items that do not require any consideration (there are some cases in which there are less than seven items); (f) 7FP are not exhaustive and there are instances in which, depending on the case, there should be a focus on points that are not included in these.
ERIC Educational Resources Information Center
Wang, Wei
2013-01-01
Mixed-format tests containing both multiple-choice (MC) items and constructed-response (CR) items are now widely used in many testing programs. Mixed-format tests often are considered to be superior to tests containing only MC items although the use of multiple item formats leads to measurement challenges in the context of equating conducted under…
ERIC Educational Resources Information Center
Kim, Jiyoung; Chi, Youngshin; Huensch, Amanda; Jun, Heesung; Li, Hongli; Roullion, Vanessa
2010-01-01
This article discusses a case study on an item writing process that reflects on our practical experience in an item development project. The purpose of the article is to share our lessons from the experience aiming to demystify item writing process. The study investigated three issues that naturally emerged during the project: how item writers use…
Development and community-based validation of eight item banks to assess mental health.
Batterham, Philip J; Sunderland, Matthew; Carragher, Natacha; Calear, Alison L
2016-09-30
There is a need for precise but brief screening of mental health problems in a range of settings. The development of item banks to assess depression and anxiety has resulted in new adaptive and static screeners that accurately assess severity of symptoms. However, expansion to a wider array of mental health problems is required. The current study developed item banks for eight mental health problems: social anxiety disorder, panic disorder, post-traumatic stress disorder, obsessive-compulsive disorder, adult attention-deficit hyperactivity disorder, drug use, psychosis and suicidality. The item banks were calibrated in a population-based Australian adult sample (N=3175) by administering large item pools (45-75 items) and excluding items on the basis of local dependence or measurement non-invariance. Item Response Theory parameters were estimated for each item bank using a two-parameter graded response model. Each bank consisted of 19-47 items, demonstrating excellent fit and precision across a range of -1 to 3 standard deviations from the mean. No previous study has developed such a broad range of mental health item banks. The calibrated item banks will form the basis of a new system of static and adaptive measures to screen for a broad array of mental health problems in the community. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
When Listening Is Better Than Reading: Performance Gains on Cardiac Auscultation Test Questions.
Short, Kathleen; Bucak, S Deniz; Rosenthal, Francine; Raymond, Mark R
2018-05-01
In 2007, the United States Medical Licensing Examination embedded multimedia simulations of heart sounds into multiple-choice questions. This study investigated changes in item difficulty as determined by examinee performance over time. The data reflect outcomes obtained following initial use of multimedia items from 2007 through 2012, after which an interface change occurred. A total of 233,157 examinees responded to 1,306 cardiology test items over the six-year period; 138 items included multimedia simulations of heart sounds, while 1,168 text-based items without multimedia served as controls. The authors compared changes in difficulty of multimedia items over time with changes in difficulty of text-based cardiology items over time. Further, they compared changes in item difficulty for both groups of items between graduates of Liaison Committee on Medical Education (LCME)-accredited and non-LCME-accredited (i.e., international) medical schools. Examinee performance on cardiology test items with multimedia heart sounds improved by 12.4% over the six-year period, while performance on text-based cardiology items improved by approximately 1.4%. These results were similar for graduates of LCME-accredited and non-LCME-accredited medical schools. Examinees' ability to interpret auscultation findings in test items that include multimedia presentations increased from 2007 to 2012.
Electronic Quality of Life Assessment Using Computer-Adaptive Testing
2016-01-01
Background Quality of life (QoL) questionnaires are desirable for clinical practice but can be time-consuming to administer and interpret, making their widespread adoption difficult. Objective Our aim was to assess the performance of the World Health Organization Quality of Life (WHOQOL)-100 questionnaire as four item banks to facilitate adaptive testing using simulated computer adaptive tests (CATs) for physical, psychological, social, and environmental QoL. Methods We used data from the UK WHOQOL-100 questionnaire (N=320) to calibrate item banks using item response theory, which included psychometric assessments of differential item functioning, local dependency, unidimensionality, and reliability. We simulated CATs to assess the number of items administered before prespecified levels of reliability was met. Results The item banks (40 items) all displayed good model fit (P>.01) and were unidimensional (fewer than 5% of t tests significant), reliable (Person Separation Index>.70), and free from differential item functioning (no significant analysis of variance interaction) or local dependency (residual correlations < +.20). When matched for reliability, the item banks were between 45% and 75% shorter than paper-based WHOQOL measures. Across the four domains, a high standard of reliability (alpha>.90) could be gained with a median of 9 items. Conclusions Using CAT, simulated assessments were as reliable as paper-based forms of the WHOQOL with a fraction of the number of items. These properties suggest that these item banks are suitable for computerized adaptive assessment. These item banks have the potential for international development using existing alternative language versions of the WHOQOL items. PMID:27694100
Park, Jong Cook; Kim, Kwang Sig
2012-03-01
The reliability of test is determined by each items' characteristics. Item analysis is achieved by classical test theory and item response theory. The purpose of the study was to compare the discrimination indices with item response theory using the Rasch model. Thirty-one 4th-year medical school students participated in the clinical course written examination, which included 22 A-type items and 3 R-type items. Point biserial correlation coefficient (C(pbs)) was compared to method of extreme group (D), biserial correlation coefficient (C(bs)), item-total correlation coefficient (C(it)), and corrected item-total correlation coeffcient (C(cit)). Rasch model was applied to estimate item difficulty and examinee's ability and to calculate item fit statistics using joint maximum likelihood. Explanatory power (r2) of Cpbs is decreased in the following order: C(cit) (1.00), C(it) (0.99), C(bs) (0.94), and D (0.45). The ranges of difficulty logit and standard error and ability logit and standard error were -0.82 to 0.80 and 0.37 to 0.76, -3.69 to 3.19 and 0.45 to 1.03, respectively. Item 9 and 23 have outfit > or =1.3. Student 1, 5, 7, 18, 26, 30, and 32 have fit > or =1.3. C(pbs), C(cit), and C(it) are good discrimination parameters. Rasch model can estimate item difficulty parameter and examinee's ability parameter with standard error. The fit statistics can identify bad items and unpredictable examinee's responses.
Miller, Leonie M; Roodenrys, Steven
2012-11-01
The frequency effect in short-term serial recall is influenced by the composition of lists. In pure lists, a robust advantage in the recall of high-frequency (HF) words is observed, yet in alternating mixed lists, HF and low-frequency (LF) words are recalled equally well. It has been argued that the preexisting associations between all list items determine a single, global level of supportive activation that assists item recall. Preexisting associations between items are assumed to be a function of language co-occurrence; HF-HF associations are high, LF-LF associations are low, and mixed associations are intermediate in activation strength. This account, however, is based on results when alternating lists with equal numbers of HF and LF words were used. It is possible that directional association between adjacent list items is responsible for the recall patterns reported. In the present experiment, the recall of three forms of mixed lists-those with equal numbers of HF and LF items and pure lists-was examined to test the extent to which item-to-item associations are present in serial recall. Furthermore, conditional probabilities were used to examine more closely the evidence for a contribution, since correct-in-position scoring may mask recall that is dependent on the recall of prior items. The results suggest that an item-to-item effect is clearly present for early but not late list items, and they implicate an additional factor, perhaps the availability of resources at output, in the recall of late list items.
Converging evidence for control of color-word Stroop interference at the item level.
Bugg, Julie M; Hutchison, Keith A
2013-04-01
Prior studies have shown that cognitive control is implemented at the list and context levels in the color-word Stroop task. At first blush, the finding that Stroop interference is reduced for mostly incongruent items as compared with mostly congruent items (i.e., the item-specific proportion congruence [ISPC] effect) appears to provide evidence for yet a third level of control, which modulates word reading at the item level. However, evidence to date favors the view that ISPC effects reflect the rapid prediction of high-contingency responses and not item-specific control. In Experiment 1, we first show that an ISPC effect is obtained when the relevant dimension (i.e., color) signals proportion congruency, a problematic pattern for theories based on differential response contingencies. In Experiment 2, we replicate and extend this pattern by showing that item-specific control settings transfer to new stimuli, ruling out alternative frequency-based accounts. In Experiment 3, we revert to the traditional design in which the irrelevant dimension (i.e., word) signals proportion congruency. Evidence for item-specific control, including transfer of the ISPC effect to new stimuli, is apparent when 4-item sets are employed but not when 2-item sets are employed. We attribute this pattern to the absence of high-contingency responses on incongruent trials in the 4-item set. These novel findings provide converging evidence for reactive control of color-word Stroop interference at the item level, reveal theoretically important factors that modulate reliance on item-specific control versus contingency learning, and suggest an update to the item-specific control account (Bugg, Jacoby, & Chanani, 2011).
Functional recovery in patients with schizophrenia: recommendations from a panel of experts.
Lahera, Guillermo; Gálvez, José L; Sánchez, Pedro; Martínez-Roig, Miguel; Pérez-Fuster, J V; García-Portilla, Paz; Herrera, Berta; Roca, Miquel
2018-06-05
The management of schizophrenia is evolving towards a more comprehensive model based on functional recovery. The concept of functional recovery goes beyond clinical remission and encompasses multiple aspects of the patient's life, making it difficult to settle on a definition and to develop reliable assessment criteria. In this consensus process based on a panel of experts in schizophrenia, we aimed to provide useful insights on functional recovery and its involvement in clinical practice and clinical research. After a literature review of functional recovery in schizophrenia, a scientific committee of 8 members prepared a 75-item questionnaire, including 6 sections: (I) the concept of functional recovery (9 items), (II) assessment of functional recovery (23 items), (III) factors influencing functional recovery (16 items), (IV) psychosocial interventions and functional recovery (8 items), (V) pharmacological treatment and functional recovery (14 items), and (VI) the perspective of patients and their relatives on functional recovery (5 items). The questionnaire was sent to a panel of 53 experts, who rated each item on a 9-point Likert scale. Consensus was achieved in a 2-round Delphi dynamics, using the median (interquartile range) scores to consider consensus in either agreement (scores 7-9) or disagreement (scores 1-3). Items not achieving consensus in the first round were sent back to the experts for a second consideration. After the two recursive rounds, consensus was achieved in 64 items (85.3%): 61 items (81.3%) in agreement and 3 (4.0%) in disagreement, all of them from section II (assessment of functional recovery). Items not reaching consensus were related to the concepts of functional recovery (1 item, 1.3%), functional assessment (5 items, 6.7%), factors influencing functional recovery (3 items, 4.0%), and psychosocial interventions (2 items, 5.6%). Despite the lack of a well-defined concept of functional recovery, we identified a trend towards a common archetype of the definition and factors associated with functional recovery, as well as its applicability in clinical practice and clinical research.
Approximation Algorithms for the Highway Problem under the Coupon Model
NASA Astrophysics Data System (ADS)
Hamane, Ryoso; Itoh, Toshiya; Tomita, Kouhei
When a store sells items to customers, the store wishes to decide the prices of items to maximize its profit. Intuitively, if the store sells the items with low (resp. high) prices, the customers buy more (resp. less) items, which provides less profit to the store. So it would be hard for the store to decide the prices of items. Assume that the store has a set V of n items and there is a set E of m customers who wish to buy the items, and also assume that each item i ∈ V has the production cost di and each customer ej ∈ E has the valuation vj on the bundle ej ⊆ V of items. When the store sells an item i ∈ V at the price ri, the profit for the item i is pi = ri - di. The goal of the store is to decide the price of each item to maximize its total profit. We refer to this maximization problem as the item pricing problem. In most of the previous works, the item pricing problem was considered under the assumption that pi ≥ 0 for each i ∈ V, however, Balcan, et al. [In Proc. of WINE, LNCS 4858, 2007] introduced the notion of “loss-leader, ” and showed that the seller can get more total profit in the case that pi < 0 is allowed than in the case that pi < 0 is not allowed. In this paper, we consider the line highway problem (in which each customer is interested in an interval on the line of the items) and the cycle highway problem (in which each customer is interested in an interval on the cycle of the items), and show approximation algorithms for the line highway problem and the cycle highway problem in which the smallest valuation is s and the largest valuation is l (this is called an [s, l]-valuation setting) or all valuations are identical (this is called a single valuation setting).
Rasch validation of the Arabic version of the lower extremity functional scale.
Alnahdi, Ali H
2018-02-01
The purpose of this study was to examine the internal construct validity of the Arabic version of the Lower Extremity Functional Scale (20-item Arabic LEFS) using Rasch analysis. Patients (n = 170) with lower extremity musculoskeletal dysfunction were recruited. Rasch analysis of 20-item Arabic LEFS was performed. Once the initial Rasch analysis indicated that the 20-item Arabic LEFS did not fit the Rasch model, follow-up analyses were conducted to improve the fit of the scale to the Rasch measurement model. These modifications included removing misfitting individuals, changing item scoring structure, removing misfitting items, addressing bias caused by response dependency between items and differential item functioning (DIF). Initial analysis indicated deviation of the 20-item Arabic LEFS from the Rasch model. Disordered thresholds in eight items and response dependency between six items were detected with the scale as a whole did not meet the requirement of unidimensionality. Refinements led to a 15-item Arabic LEFS that demonstrated excellent internal consistency (person separation index [PSI] = 0.92) and satisfied all the requirement of the Rasch model. Rasch analysis did not support the 20-item Arabic LEFS as a unidimensional measure of lower extremity function. The refined 15-item Arabic LEFS met all the requirement of the Rasch model and hence is a valid objective measure of lower extremity function. The Rasch-validated 15-item Arabic LEFS needs to be further tested in an independent sample to confirm its fit to the Rasch measurement model. Implications for Rehabilitation The validity of the 20-item Arabic Lower Extremity Functional Scale to measure lower extremity function is not supported. The 15-item Arabic version of the LEFS is a valid measure of lower extremity function and can be used to quantify lower extremity function in patients with lower extremity musculoskeletal disorders.
Gopichandran, Vijayaprasad; Wouters, Edwin; Chetlapalli, Satish Kumar
2015-05-03
Trust in physicians is the unwritten covenant between the patient and the physician that the physician will do what is in the best interest of the patient. This forms the undercurrent of all healthcare relationships. Several scales exist for assessment of trust in physicians in developed healthcare settings, but to our knowledge none of these have been developed in a developing country context. To develop and validate a new trust in physician scale for a developing country setting. Dimensions of trust in physicians, which were identified in a previous qualitative study in the same setting, were used to develop a scale. This scale was administered among 616 adults selected from urban and rural areas of Tamil Nadu, south India, using a multistage sampling cross sectional survey method. The individual items were analysed using a classical test approach as well as item response theory. Cronbach's α was calculated and the item to total correlation of each item was assessed. After testing for unidimensionality and absence of local dependence, a 2 parameter logistic Semajima's graded response model was fit and item characteristics assessed. Competence, assurance of treatment, respect for the physician and loyalty to the physician were important dimensions of trust. A total of 31 items were developed using these dimensions. Of these, 22 were selected for final analysis. The Cronbach's α was 0.928. The item to total correlations were acceptable for all the 22 items. The item response analysis revealed good item characteristic curves and item information for all the items. Based on the item parameters and item information, a final 12 item scale was developed. The scale performs optimally in the low to moderate trust range. The final 12 item trust in physician scale has a good construct validity and internal consistency. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Gopichandran, Vijayaprasad; Wouters, Edwin; Chetlapalli, Satish Kumar
2015-01-01
Trust in physicians is the unwritten covenant between the patient and the physician that the physician will do what is in the best interest of the patient. This forms the undercurrent of all healthcare relationships. Several scales exist for assessment of trust in physicians in developed healthcare settings, but to our knowledge none of these have been developed in a developing country context. Objectives To develop and validate a new trust in physician scale for a developing country setting. Methods Dimensions of trust in physicians, which were identified in a previous qualitative study in the same setting, were used to develop a scale. This scale was administered among 616 adults selected from urban and rural areas of Tamil Nadu, south India, using a multistage sampling cross sectional survey method. The individual items were analysed using a classical test approach as well as item response theory. Cronbach's α was calculated and the item to total correlation of each item was assessed. After testing for unidimensionality and absence of local dependence, a 2 parameter logistic Semajima's graded response model was fit and item characteristics assessed. Results Competence, assurance of treatment, respect for the physician and loyalty to the physician were important dimensions of trust. A total of 31 items were developed using these dimensions. Of these, 22 were selected for final analysis. The Cronbach's α was 0.928. The item to total correlations were acceptable for all the 22 items. The item response analysis revealed good item characteristic curves and item information for all the items. Based on the item parameters and item information, a final 12 item scale was developed. The scale performs optimally in the low to moderate trust range. Conclusions The final 12 item trust in physician scale has a good construct validity and internal consistency. PMID:25941182
48 CFR 204.7106 - Contract modifications.
Code of Federal Regulations, 2010 CFR
2010-10-01
... 204.7106 Contract modifications. (a) If new items are added, assign new contract line or subline item... those item numbers. (2) If the contracting officer decides to assign new identifications to existing contract or exhibit line items, the following rules apply— (i) Definitized and undefinitized items. (A) The...
Using Mutual Information for Adaptive Item Comparison and Student Assessment
ERIC Educational Resources Information Center
Liu, Chao-Lin
2005-01-01
The author analyzes properties of mutual information between dichotomous concepts and test items. The properties generalize some common intuitions about item comparison, and provide principled foundations for designing item-selection heuristics for student assessment in computer-assisted educational systems. The proposed item-selection strategies…
Evaluating the Psychometric Characteristics of Generated Multiple-Choice Test Items
ERIC Educational Resources Information Center
Gierl, Mark J.; Lai, Hollis; Pugh, Debra; Touchie, Claire; Boulais, André-Philippe; De Champlain, André
2016-01-01
Item development is a time- and resource-intensive process. Automatic item generation integrates cognitive modeling with computer technology to systematically generate test items. To date, however, items generated using cognitive modeling procedures have received limited use in operational testing situations. As a result, the psychometric…
Faulks, Denise; Norderyd, Johanna; Molina, Gustavo; Macgiolla Phadraig, Caoimhin; Scagnet, Gabriela; Eschevins, Caroline; Hennequin, Martine
2013-01-01
Children in dentistry are traditionally described in terms of medical diagnosis and prevalence of oral disease. This approach gives little information regarding a child’s capacity to maintain oral health or regarding the social determinants of oral health. The biopsychosocial approach, embodied in the International Classification of Functioning, Disability and Health - Child and Youth version (ICF-CY) (WHO), provides a wider picture of a child’s real-life experience, but practical tools for the application of this model are lacking. This article describes the preliminary empirical study necessary for development of such a tool - an ICF-CY Core Set for Oral Health. An ICF-CY questionnaire was used to identify the medical, functional, social and environmental context of 218 children and adolescents referred to special care or paediatric dental services in France, Sweden, Argentina and Ireland (mean age 8 years ±3.6yrs). International Classification of Disease (ICD-10) diagnoses included disorders of the nervous system (26.1%), Down syndrome (22.0%), mental retardation (17.0%), autistic disorders (16.1%), and dental anxiety alone (11.0%). The most frequently impaired items in the ICF Body functions domain were ‘Intellectual functions’, ‘High-level cognitive functions’, and ‘Attention functions’. In the Activities and Participation domain, participation restriction was frequently reported for 25 items including ‘Handling stress’, ‘Caring for body parts’, ‘Looking after one’s health’ and ‘Speaking’. In the Environment domain, facilitating items included ‘Support of friends’, ‘Attitude of friends’ and ‘Support of immediate family’. One item was reported as an environmental barrier – ‘Societal attitudes’. The ICF-CY can be used to highlight common profiles of functioning, activities, participation and environment shared by children in relation to oral health, despite widely differing medical, social and geographical contexts. The results of this empirical study might be used to develop an ICF-CY Core Set for Oral Health - a holistic but practical tool for clinical and epidemiological use. PMID:23614000
Development of a measure of model fidelity for mental health Crisis Resolution Teams.
Lloyd-Evans, Brynmor; Bond, Gary R; Ruud, Torleif; Ivanecka, Ada; Gray, Richard; Osborn, David; Nolan, Fiona; Henderson, Claire; Mason, Oliver; Goater, Nicky; Kelly, Kathleen; Ambler, Gareth; Morant, Nicola; Onyett, Steve; Lamb, Danielle; Fahmy, Sarah; Brown, Ellie; Paterson, Beth; Sweeney, Angela; Hindle, David; Fullarton, Kate; Frerichs, Johanna; Johnson, Sonia
2016-12-01
Crisis Resolution Teams (CRTs) provide short-term intensive home treatment to people experiencing mental health crisis. Trial evidence suggests CRTs can be effective at reducing hospital admissions and increasing satisfaction with acute care. When scaled up to national level however, CRT implementation and outcomes have been variable. We aimed to develop and test a fidelity scale to assess adherence to a model of best practice for CRTs, based on best available evidence. A concept mapping process was used to develop a CRT fidelity scale. Participants (n = 68) from a range of stakeholder groups prioritised and grouped statements (n = 72) about important components of the CRT model, generated from a literature review, national survey and qualitative interviews. These data were analysed using Ariadne software and the resultant cluster solution informed item selection for a CRT fidelity scale. Operational criteria and scoring anchor points were developed for each item. The CORE CRT fidelity scale was then piloted in 75 CRTs in the UK to assess the range of scores achieved and feasibility for use in a 1-day fidelity review process. Trained reviewers (n = 16) rated CRT service fidelity in a vignette exercise to test the scale's inter-rater reliability. There were high levels of agreement within and between stakeholder groups regarding the most important components of the CRT model. A 39-item measure of CRT model fidelity was developed. Piloting indicated that the scale was feasible for use to assess CRT model fidelity and had good face validity. The wide range of item scores and total scores across CRT services in the pilot demonstrate the measure can distinguish lower and higher fidelity services. Moderately good inter-rater reliability was found, with an estimated correlation between individual ratings of 0.65 (95% CI: 0.54 to 0.76). The CORE CRT Fidelity Scale has been developed through a rigorous and systematic process. Promising initial testing indicates its value in assessing adherence to a model of CRT best practice and to support service improvement monitoring and planning. Further research is required to establish its psychometric properties and international applicability.
Palmer, Victoria J; Chondros, Patty; Piper, Donella; Callander, Rosemary; Weavell, Wayne; Godbee, Kali; Potiriadis, Maria; Richard, Lauralie; Densely, Konstancja; Herrman, Helen; Furler, John; Pierce, David; Schuster, Tibor; Iedema, Rick; Gunn, Jane
2015-03-24
User engagement in mental health service design is heralded as integral to health systems quality and performance, but does engagement improve health outcomes? This article describes the CORE study protocol, a novel stepped wedge cluster randomised controlled trial (SWCRCT) to improve psychosocial recovery outcomes for people with severe mental illness. An SWCRCT with a nested process evaluation will be conducted over nearly 4 years in Victoria, Australia. 11 teams from four mental health service providers will be randomly allocated to one of three dates 9 months apart to start the intervention. The intervention, a modified version of Mental Health Experience Co-Design (MH ECO), will be delivered to 30 service users, 30 carers and 10 staff in each cluster. Outcome data will be collected at baseline (6 months) and at completion of each intervention wave. The primary outcome is improvement in recovery score using the 24-item Revised Recovery Assessment Scale for service users. Secondary outcomes are improvements to user and carer mental health and well-being using the shortened 8-item version of the WHOQOL Quality of Life scale (EUROHIS), changes to staff attitudes using the 19-item Staff Attitudes to Recovery Scale and recovery orientation of services using the 36-item Recovery Self Assessment Scale (provider version). Intervention and usual care periods will be compared using a linear mixed effects model for continuous outcomes and a generalised linear mixed effects model for binary outcomes. Participants will be analysed in the group that the cluster was assigned to at each time point. The University of Melbourne, Human Research Ethics Committee (1340299.3) and the Federal and State Departments of Health Committees (Project 20/2014) granted ethics approval. Baseline data results will be reported in 2015 and outcomes data in 2017. Australian and New Zealand Clinical Trials Registry ACTRN12614000457640. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Palmer, Victoria J; Chondros, Patty; Piper, Donella; Callander, Rosemary; Weavell, Wayne; Godbee, Kali; Potiriadis, Maria; Richard, Lauralie; Densely, Konstancja; Herrman, Helen; Furler, John; Pierce, David; Schuster, Tibor; Iedema, Rick; Gunn, Jane
2015-01-01
Introduction User engagement in mental health service design is heralded as integral to health systems quality and performance, but does engagement improve health outcomes? This article describes the CORE study protocol, a novel stepped wedge cluster randomised controlled trial (SWCRCT) to improve psychosocial recovery outcomes for people with severe mental illness. Methods An SWCRCT with a nested process evaluation will be conducted over nearly 4 years in Victoria, Australia. 11 teams from four mental health service providers will be randomly allocated to one of three dates 9 months apart to start the intervention. The intervention, a modified version of Mental Health Experience Co-Design (MH ECO), will be delivered to 30 service users, 30 carers and 10 staff in each cluster. Outcome data will be collected at baseline (6 months) and at completion of each intervention wave. The primary outcome is improvement in recovery score using the 24-item Revised Recovery Assessment Scale for service users. Secondary outcomes are improvements to user and carer mental health and well-being using the shortened 8-item version of the WHOQOL Quality of Life scale (EUROHIS), changes to staff attitudes using the 19-item Staff Attitudes to Recovery Scale and recovery orientation of services using the 36-item Recovery Self Assessment Scale (provider version). Intervention and usual care periods will be compared using a linear mixed effects model for continuous outcomes and a generalised linear mixed effects model for binary outcomes. Participants will be analysed in the group that the cluster was assigned to at each time point. Ethics and dissemination The University of Melbourne, Human Research Ethics Committee (1340299.3) and the Federal and State Departments of Health Committees (Project 20/2014) granted ethics approval. Baseline data results will be reported in 2015 and outcomes data in 2017. Trial registration number Australian and New Zealand Clinical Trials Registry ACTRN12614000457640. PMID:25805530
Development and validation of a Malawian version of the primary care assessment tool.
Dullie, Luckson; Meland, Eivind; Hetlevik, Øystein; Mildestvedt, Thomas; Gjesdal, Sturla
2018-05-16
Malawi does not have validated tools for assessing primary care performance from patients' experience. The aim of this study was to develop a Malawian version of Primary Care Assessment Tool (PCAT-Mw) and to evaluate its reliability and validity in the assessment of the core primary care dimensions from adult patients' perspective in Malawi. A team of experts assessed the South African version of the primary care assessment tool (ZA-PCAT) for face and content validity. The adapted questionnaire underwent forward and backward translation and a pilot study. The tool was then used in an interviewer administered cross-sectional survey in Neno district, Malawi, to test validity and reliability. Exploratory factor analysis was performed on a random half of the sample to evaluate internal consistency, reliability and construct validity of items and scales. The identified constructs were then tested with confirmatory factor analysis. Likert scale assumption testing and descriptive statistics were done on the final factor structure. The PCAT-Mw was further tested for intra-rater and inter-rater reliability. From the responses of 631 patients, a 29-item PCAT-Mw was constructed comprising seven multi-item scales, representing five primary care dimensions (first contact, continuity, comprehensiveness, coordination and community orientation). All the seven scales achieved good internal consistency, item-total correlations and construct validity. Cronbach's alpha coefficient ranged from 0.66 to 0.91. A satisfactory goodness of fit model was achieved (GFI = 0.90, CFI = 0.91, RMSEA = 0.05, PCLOSE = 0.65). The full range of possible scores was observed for all scales. Scaling assumptions tests were achieved for all except the two comprehensiveness scales. Intra-class correlation coefficient (ICC) was 0.90 (n = 44, 95% CI 0.81-0.94, p < 0.001) for intra-rater reliability and 0.84 (n = 42, 95% CI 0.71-0.96, p < 0.001) for inter-rater reliability. Comprehensive metric analyses supported the reliability and validity of PCAT-Mw in assessing the core concepts of primary care from adult patients' experience. This tool could be used for health service research in primary care in Malawi.
Phonological Similarity in Serial Recall: Constraints on Theories of Memory
ERIC Educational Resources Information Center
Lewandowsky, Stephan; Farrell, Simon
2008-01-01
In short-term serial recall, similar-sounding items are remembered more poorly than items that do not sound alike. When lists mix similar and dissimilar items, performance on the dissimilar items is of considerable theoretical interest. Farrell and Lewandowsky [Farrell, S., & Lewandowsky, S. (2003). Dissimilar items benefit from phonological…
Testing Three-Item Versions for Seven of Young's Maladaptive Schema
ERIC Educational Resources Information Center
Blau, Gary; DiMino, John; Sheridan, Natalie; Pred, Robert S.; Beverly, Clyde; Chessler, Marcy
2015-01-01
The Young Schema Questionnaire (YSQ) in either long-form (205- item) or short-form (75-item or 90-item) versions has demonstrated its clinical usefulness for assessing early maladaptive schemas. However, even a 75 or 90-item "short form", particularly when combined with other measures, can represent a lengthy…
Item Purification in Differential Item Functioning Using Generalized Linear Mixed Models
ERIC Educational Resources Information Center
Liu, Qian
2011-01-01
For this dissertation, four item purification procedures were implemented onto the generalized linear mixed model for differential item functioning (DIF) analysis, and the performance of these item purification procedures was investigated through a series of simulations. Among the four procedures, forward and generalized linear mixed model (GLMM)…
Applying Hierarchical Model Calibration to Automatically Generated Items.
ERIC Educational Resources Information Center
Williamson, David M.; Johnson, Matthew S.; Sinharay, Sandip; Bejar, Isaac I.
This study explored the application of hierarchical model calibration as a means of reducing, if not eliminating, the need for pretesting of automatically generated items from a common item model prior to operational use. Ultimately the successful development of automatic item generation (AIG) systems capable of producing items with highly similar…
Code of Federal Regulations, 2010 CFR
2010-10-01
... and contract clauses for the acquisition of commercial items. 512.301 Section 512.301 Federal... ACQUISITION OF COMMERCIAL ITEMS Solicitation Provisions and Contract Clauses for the Acquisition of Commercial Items 512.301 Solicitation provisions and contract clauses for the acquisition of commercial items. (a...
Item-Writing Guidelines for Physics
ERIC Educational Resources Information Center
Regan, Tom
2015-01-01
A teacher learning how to write test questions (test items) will almost certainly encounter item-writing guidelines--lists of item-writing do's and don'ts. Item-writing guidelines usually are presented as applicable across all assessment settings. Table I shows some guidelines that I believe to be generally applicable and two will be briefly…
Unidimensional Interpretations for Multidimensional Test Items
ERIC Educational Resources Information Center
Kahraman, Nilufer
2013-01-01
This article considers potential problems that can arise in estimating a unidimensional item response theory (IRT) model when some test items are multidimensional (i.e., show a complex factorial structure). More specifically, this study examines (1) the consequences of model misfit on IRT item parameter estimates due to unintended minor item-level…
Test item linguistic complexity and assessments for deaf students.
Cawthon, Stephanie
2011-01-01
Linguistic complexity of test items is one test format element that has been studied in the context of struggling readers and their participation in paper-and-pencil tests. The present article presents findings from an exploratory study on the potential relationship between linguistic complexity and test performance for deaf readers. A total of 64 students completed 52 multiple-choice items, 32 in mathematics and 20 in reading. These items were coded for linguistic complexity components of vocabulary, syntax, and discourse. Mathematics items had higher linguistic complexity ratings than reading items, but there were no significant relationships between item linguistic complexity scores and student performance on the test items. The discussion addresses issues related to the subject area, student proficiency levels in the test content, factors to look for in determining a "linguistic complexity effect," and areas for further research in test item development and deaf students.
Sinharay, Sandip
2017-09-01
Benefiting from item preknowledge is a major type of fraudulent behavior during educational assessments. Belov suggested the posterior shift statistic for detection of item preknowledge and showed its performance to be better on average than that of seven other statistics for detection of item preknowledge for a known set of compromised items. Sinharay suggested a statistic based on the likelihood ratio test for detection of item preknowledge; the advantage of the statistic is that its null distribution is known. Results from simulated and real data and adaptive and nonadaptive tests are used to demonstrate that the Type I error rate and power of the statistic based on the likelihood ratio test are very similar to those of the posterior shift statistic. Thus, the statistic based on the likelihood ratio test appears promising in detecting item preknowledge when the set of compromised items is known.
North American Veterinary Licensing Examination pacing study.
Subhiyah, Raja G; Boyce, John R
2010-01-01
The National Board of Veterinary Medical Examiners was interested in the possible effects of word count on the outcomes of the North American Veterinary Licensing Examination. In this study, the authors investigated the effects of increasing word count on the pacing of examinees during each section of the examination and on the performance of examinees on the items. Specifically, the authors analyzed the effect of item word count on the average time spent on each item within a section of the examination, the average number of items omitted at the end of a section, and the average difficulty of items as a function of presentation order. The average word count per item increased from 2001 to 2008. As expected, there was a relationship between word count and time spent on the item. No significant relationship was found between word count and item difficulty, and an analysis of omitted items and pacing patterns showed no indication of overall pacing problems.
Spatial transposition gradients in visual working memory.
Rerko, Laura; Oberauer, Klaus; Lin, Hsuan-Yu
2014-01-01
In list memory, access to individual items reflects limits of temporal distinctiveness. This is reflected in the finding that neighbouring list items tend to be confused most often. This article investigates the analogous effect of spatial proximity in a visual working-memory task. Items were presented in different locations varying in spatial distance. A retro-cue indicated the location of the item relevant for the subsequent memory test. In two recognition experiments, probes matching spatially close neighbours of the relevant item led to more false alarms than probes matching distant neighbours or non-neighbouring memory items. In two probed-recall experiments, one with simultaneous, the other with sequential memory item presentation, items closer to the cued location were more frequently chosen for recall than more distant items. These results reflect a spatial transposition gradient analogous to the temporal transposition gradient in serial recall and challenge fixed-capacity models of visual working memory (WM).
Jung, Myun Sook; Choi, Hyeong Wook; Li, Dong Mei
2010-02-01
The purpose of this study was to analyze nursing-related content in middle, and high school textbooks under the National Common Basic Curriculum in Korea. Nursing-related content from 43 middle school textbooks and 13 high school textbooks was analyzed. There were 28 items of nursing-related content in the selected textbooks. Among them, 13 items were in the 'nursing activity' area, 6 items were in the 'nurse as an occupation' area, 2 items were in the 'major and career choice' area, 6 items were 'just one word' and 1 item in 'others'. The main nursing related content which portrayed in the middle and high school textbooks were caring for patients (7 items accounting for 46.5%), nurses working in hospitals (6 items accounting for 21.4%). In terms of gender perspective, female nurses (15 items accounting for 53.6%) were most prevalent.
Sources of difficulty in assessment: example of PISA science items
NASA Astrophysics Data System (ADS)
Le Hebel, Florence; Montpied, Pascale; Tiberghien, Andrée; Fontanieu, Valérie
2017-03-01
The understanding of what makes a question difficult is a crucial concern in assessment. To study the difficulty of test questions, we focus on the case of PISA, which assesses to what degree 15-year-old students have acquired knowledge and skills essential for full participation in society. Our research question is to identify PISA science item characteristics that could influence the item's proficiency level. It is based on an a-priori item analysis and a statistical analysis. Results show that only the cognitive complexity and the format out of the different characteristics of PISA science items determined in our a-priori analysis have an explanatory power on an item's proficiency levels. The proficiency level cannot be explained by the dependence/independence of the information provided in the unit and/or item introduction and the competence. We conclude that in PISA, it appears possible to anticipate a high proficiency level, that is, students' low scores for items displaying a high cognitive complexity. In the case of a middle or low cognitive complexity level item, the cognitive complexity level is not sufficient to predict item difficulty. Other characteristics play a crucial role in item difficulty. We discuss anticipating the difficulties in assessment in a broader perspective.
NASA Astrophysics Data System (ADS)
Kim, Jungja; Ceong, Heetaek; Won, Yonggwan
In market-basket analysis, weighted association rule (WAR) discovery can mine the rules that include more beneficial information by reflecting item importance for special products. In the point-of-sale database, each transaction is composed of items with similar properties, and item weights are pre-defined and fixed by a factor such as the profit. However, when items are divided into more than one group and the item importance must be measured independently for each group, traditional weighted association rule discovery cannot be used. To solve this problem, we propose a new weighted association rule mining methodology. The items should be first divided into subgroups according to their properties, and the item importance, i.e. item weight, is defined or calculated only with the items included in the subgroup. Then, transaction weight is measured by appropriately summing the item weights from each subgroup, and the weighted support is computed as the fraction of the transaction weights that contains the candidate items relative to the weight of all transactions. As an example, our proposed methodology is applied to assess the vulnerability to threats of computer systems that provide networked services. Our algorithm provides both quantitative risk-level values and qualitative risk rules for the security assessment of networked computer systems using WAR discovery. Also, it can be widely used for new applications with many data sets in which the data items are distinctly separated.
Pilkonis, Paul A.; Yu, Lan; Dodds, Nathan E.; Johnston, Kelly L.; Lawrence, Suzanne; Hilton, Thomas F.; Daley, Dennis C.; Patkar, Ashwin A.; McCarty, Dennis
2015-01-01
Background Two item banks for substance use were developed as part of the Patient-Reported Outcomes Measurement Information System (PROMIS®): severity of substance use and positive appeal of substance use. Methods Qualitative item analysis (including focus groups, cognitive interviewing, expert review, and item revision) reduced an initial pool of more than 5,300 items for substance use to 119 items included in field testing. Items were written in a first-person, past-tense format, with 5 response options reflecting frequency or severity. Both 30-day and 3-month time frames were tested. The calibration sample of 1,336 respondents included 875 individuals from the general population (ascertained through an internet panel) and 461patients from addiction treatment centers participating in the National Drug Abuse Treatment Clinical Trials Network. Results Final banks of 37 and 18 items were calibrated for severity of substance use and positive appeal of substance use, respectively, using the two-parameter graded response model from item response theory (IRT). Initial calibrations were similar for the 30-day and 3-month time frames, and final calibrations used data combined across the time frames, making the items applicable with either interval. Seven-item static short forms were also developed from each item bank. Conclusions Test information curves showed that the PROMIS item banks provided substantial information in a broad range of severity, making them suitable for treatment, observational, and epidemiological research in both clinical and community settings. PMID:26423364
Blome, Christine; von Usslar, Kathrin; Augustin, Matthias
2016-06-01
Qualitative interviews are used to assess understandability and content validity of patient-reported outcomes. However, the common approach of asking patients to paraphrase items may not be sufficient to completely reveal item content as understood by patients. We used qualitative interviews to elicit more detailed information about patients' understanding of treatment goal items for the Patient Benefit Index 2.0 (PBI 2.0). This questionnaire measures patient-relevant benefit from treatments for skin diseases by assessing goal importance prior to and goal attainment after treatment. We interviewed 16 patients with psoriasis, atopic dermatitis, leg ulcers, and vitiligo. Patients were asked to elaborate in detail on their understanding of 15 treatment goal items. Subsequently, they were asked to suggest changes in item wording and to name missing treatment goals. Interview transcripts were analyzed according to an adapted approach of content analysis. The task was easy for the patients to understand, and they shared detailed information on what each goal meant to them. Results of the content analysis induced a range of revisions of the PBI 2.0 items, including changes in wording (four items) and item order (two items). Four items were deleted because they were found to be redundant or irrelevant, and one item was added to the list of treatment goals. Asking patients to elaborate on their item understanding in qualitative interviews provided detailed insight into item content and understandability. This method has helped considerably to improve feasibility and content validity of the PBI 2.0.
Adjusting for cross-cultural differences in computer-adaptive tests of quality of life.
Gibbons, C J; Skevington, S M
2018-04-01
Previous studies using the WHOQOL measures have demonstrated that the relationship between individual items and the underlying quality of life (QoL) construct may differ between cultures. If unaccounted for, these differing relationships can lead to measurement bias which, in turn, can undermine the reliability of results. We used item response theory (IRT) to assess differential item functioning (DIF) in WHOQOL data from diverse language versions collected in UK, Zimbabwe, Russia, and India (total N = 1332). Data were fitted to the partial credit 'Rasch' model. We used four item banks previously derived from the WHOQOL-100 measure, which provided excellent measurement for physical, psychological, social, and environmental quality of life domains (40 items overall). Cross-cultural differential item functioning was assessed using analysis of variance for item residuals and post hoc Tukey tests. Simulated computer-adaptive tests (CATs) were conducted to assess the efficiency and precision of the four items banks. Splitting item parameters by DIF results in four linked item banks without DIF or other breaches of IRT model assumptions. Simulated CATs were more precise and efficient than longer paper-based alternatives. Assessing differential item functioning using item response theory can identify measurement invariance between cultures which, if uncontrolled, may undermine accurate comparisons in computer-adaptive testing assessments of QoL. We demonstrate how compensating for DIF using item anchoring allowed data from all four countries to be compared on a common metric, thus facilitating assessments which were both sensitive to cultural nuance and comparable between countries.
Pollard, Beth; Dixon, Diane; Dieppe, Paul; Johnston, Marie
2009-01-01
Background The International Classification of Functioning, Disability and Health (ICF) proposes three main health outcomes, Impairment (I), Activity Limitation (A) and Participation Restriction (P), but good measures of these constructs are needed The aim of this study was to use both Classical Test Theory (CTT) and Item Response Theory (IRT) methods to carry out an item analysis to improve measurement of these three components in patients having joint replacement surgery mainly for osteoarthritis (OA). Methods A geographical cohort of patients about to undergo lower limb joint replacement was invited to participate. Five hundred and twenty four patients completed ICF items that had been previously identified as measuring only a single ICF construct in patients with osteoarthritis. There were 13 I, 26 A and 20 P items. The SF-36 was used to explore the construct validity of the resultant I, A and P measures. The CTT and IRT analyses were run separately to identify items for inclusion or exclusion in the measurement of each construct. The results from both analyses were compared and contrasted. Results Overall, the item analysis resulted in the removal of 4 I items, 9 A items and 11 P items. CTT and IRT identified the same 14 items for removal, with CTT additionally excluding 3 items, and IRT a further 7 items. In a preliminary exploration of reliability and validity, the new measures appeared acceptable. Conclusion New measures were developed that reflect the ICF components of Impairment, Activity Limitation and Participation Restriction for patients with advanced arthritis. The resulting Aberdeen IAP measures (Ab-IAP) comprising I (Ab-I, 9 items), A (Ab-A, 17 items), and P (Ab-P, 9 items) met the criteria of conventional psychometric (CTT) analyses and the additional criteria (information and discrimination) of IRT. The use of both methods was more informative than the use of only one of these methods. Thus combining CTT and IRT appears to be a valuable tool in the development of measures. PMID:19422677
Development and assessment of floor and ceiling items for the PROMIS physical function item bank
2013-01-01
Introduction Disability and Physical Function (PF) outcome assessment has had limited ability to measure functional status at the floor (very poor functional abilities) or the ceiling (very high functional abilities). We sought to identify, develop and evaluate new floor and ceiling items to enable broader and more precise assessment of PF outcomes for the NIH Patient-Reported-Outcomes Measurement Information System (PROMIS). Methods We conducted two cross-sectional studies using NIH PROMIS item improvement protocols with expert review, participant survey and focus group methods. In Study 1, respondents with low PF abilities evaluated new floor items, and those with high PF abilities evaluated new ceiling items for clarity, importance and relevance. In Study 2, we compared difficulty ratings of new floor items by low functioning respondents and ceiling items by high functioning respondents to reference PROMIS PF-10 items. We used frequencies, percentages, means and standard deviations to analyze the data. Results In Study 1, low (n = 84) and high (n = 90) functioning respondents were mostly White, women, 70 years old, with some college, and disability scores of 0.62 and 0.30. More than 90% of the 31 new floor and 31 new ceiling items were rated as clear, important and relevant, leaving 26 ceiling and 30 floor items for Study 2. Low (n = 246) and high (n = 637) functioning Study 2 respondents were mostly White, women, 70 years old, with some college, and Health Assessment Questionnaire (HAQ) scores of 1.62 and 0.003. Compared to difficulty ratings of reference items, ceiling items were rated to be 10% more to greater than 40% more difficult to do, and floor items were rated to be about 12% to nearly 90% less difficult to do. Conclusions These new floor and ceiling items considerably extend the measurable range of physical function at either extreme. They will help improve instrument performance in populations with broad functional ranges and those concentrated at one or the other extreme ends of functioning. Optimal use of these new items will be assisted by computerized adaptive testing (CAT), reducing questionnaire burden and insuring item administration to appropriate individuals. PMID:24286166
ERIC Educational Resources Information Center
Sharp, John G.; Hemmings, Brian; Kay, Russell; Callinan, Carol
2013-01-01
This article presents findings arising from the first UK application of a revised 70-item lecturer self-efficacy questionnaire recently developed for use in the Australian higher education context. Intended to probe and systematically measure confidence in the core functions of research, teaching and other academic or service-related activities…
ERIC Educational Resources Information Center
Yuan, Kun; Le, Vi-Nhuan
2014-01-01
In 2010, the William and Flora Hewlett Foundation's Education Program has established the Deeper Learning Initiative, which focuses on students' development of deeper learning skills (i.e., the mastery of core academic content, critical-thinking, problem-solving, collaboration, communication, and "learn-how-to-learn" skills). Two test…
Modeling Information Accumulation in Psychological Tests Using Item Response Times
ERIC Educational Resources Information Center
Ranger, Jochen; Kuhn, Jörg-Tobias
2015-01-01
In this article, a latent trait model is proposed for the response times in psychological tests. The latent trait model is based on the linear transformation model and subsumes popular models from survival analysis, like the proportional hazards model and the proportional odds model. Core of the model is the assumption that an unspecified monotone…
29 CFR 1960.12 - Dissemination of occupational safety and health program information.
Code of Federal Regulations, 2011 CFR
2011-07-01
... establishment, and keep posted, a poster informing employees of the provisions of the Act, Executive Order 12196... furnish the core text of a poster to agencies. Each agency shall add the following items: (1) Details of...) Relevant information about any agency safety and health committees. Such posters and additions shall not be...
29 CFR 1960.12 - Dissemination of occupational safety and health program information.
Code of Federal Regulations, 2013 CFR
2013-07-01
... establishment, and keep posted, a poster informing employees of the provisions of the Act, Executive Order 12196... furnish the core text of a poster to agencies. Each agency shall add the following items: (1) Details of...) Relevant information about any agency safety and health committees. Such posters and additions shall not be...
29 CFR 1960.12 - Dissemination of occupational safety and health program information.
Code of Federal Regulations, 2012 CFR
2012-07-01
... establishment, and keep posted, a poster informing employees of the provisions of the Act, Executive Order 12196... furnish the core text of a poster to agencies. Each agency shall add the following items: (1) Details of...) Relevant information about any agency safety and health committees. Such posters and additions shall not be...
29 CFR 1960.12 - Dissemination of occupational safety and health program information.
Code of Federal Regulations, 2010 CFR
2010-07-01
... establishment, and keep posted, a poster informing employees of the provisions of the Act, Executive Order 12196... furnish the core text of a poster to agencies. Each agency shall add the following items: (1) Details of...) Relevant information about any agency safety and health committees. Such posters and additions shall not be...
29 CFR 1960.12 - Dissemination of occupational safety and health program information.
Code of Federal Regulations, 2014 CFR
2014-07-01
... establishment, and keep posted, a poster informing employees of the provisions of the Act, Executive Order 12196... furnish the core text of a poster to agencies. Each agency shall add the following items: (1) Details of...) Relevant information about any agency safety and health committees. Such posters and additions shall not be...
10 CFR 431.15 - Materials incorporated by reference.
Code of Federal Regulations, 2011 CFR
2011-01-01
... Method With Indirect Measurement of the Stray-Load Loss and Direct Measurement of the Stator Winding (I2R), Rotor Winding (I2 R), Core and Windage-Friction Losses, IBR approved for §§ 431.12; 431.19; 431.20... with Loss Segregation, and the correction to the calculation at item (28) in Section 10.2 Form B-Test...
ERIC Educational Resources Information Center
Verheul, Roel; Andrea, Helene; Berghout, Caspar C.; Dolan, Conor; Busschbach, Jan J. V.; van der Kroft, Petra J. A.; Bateman, Anthony W.; Fonagy, Peter
2008-01-01
This article describes a series of studies involving 2,730 participants on the development and validity testing of the Severity Indices of Personality Problems (SIPP), a self-report questionnaire covering important core components of (mal)adaptive personality functioning. Results show that the 16 facets constituted homogeneous item clusters (i.e.,…
ERIC Educational Resources Information Center
Anderson, Daniel; Irvin, P. Shawn; Patarapichayatham, Chalie; Alonzo, Julie; Tindal, Gerald
2012-01-01
In the following technical report, we describe the development and scaling of the easyCBM CCSS middle school mathematics measures, designed for use within a response to intervention framework. All items were developed in collaboration with experienced middle school mathematics teachers and were written to align with the Common Core State…
Temporal Clustering and Sequencing in Short-Term Memory and Episodic Memory
ERIC Educational Resources Information Center
Farrell, Simon
2012-01-01
A model of short-term memory and episodic memory is presented, with the core assumptions that (a) people parse their continuous experience into episodic clusters and (b) items are clustered together in memory as episodes by binding information within an episode to a common temporal context. Along with the additional assumption that information…
Assessing Lexical Proficiency Using Analytic Ratings: A Case for Collocation Accuracy
ERIC Educational Resources Information Center
Crossley, Scott A.; Salsbury, Tom; Mcnamara, Danielle S.
2015-01-01
This study analyzes lexical proficiency in oral and written texts produced by second language (L2) learners of English. The purpose of the study is to examine relationships between analytic scores of depth of lexical knowledge, breadth of lexical knowledge, and access to core lexical items and holistic scores of lexical proficiency. A corpus of…
Corpus-Based Studies on Nursing Textbooks
ERIC Educational Resources Information Center
Mohamad, Alif Fairus Nor; Jin, Ng Yu
2013-01-01
English for Specific Purposes (ESP) educators often face dilemma in deciding what lexical items to teach their students. In the field of English for Nursing Purposes (ENP), there is no exception on this issue as well. Only by analyzing the nursing corpus made up of essential core textbooks that can provide better insights and guide to both nursing…
ERIC Educational Resources Information Center
Laing-Kean, Claudine A. M.
2010-01-01
Programs supported by the Carl D. Perkins Act of 2006 are required to operate under the state or national content standards, and are expected to carry out evaluation procedures that address accountability. The Indiana high school course, "Advanced Life Science: Foods" ("ALS: Foods") operates under the auspices of the Perkins…
The Autism Impact Measure (AIM): Initial Development of a New Tool for Treatment Outcome Measurement
ERIC Educational Resources Information Center
Kanne, Stephen M.; Mazurek, Micah O.; Sikora, Darryn; Bellando, Jayne; Branum-Martin, Lee; Handen, Benjamin; Katz, Terry; Freedman, Brian; Powell, Mary Paige; Warren, Zachary
2014-01-01
The current study describes the development and psychometric properties of a new measure targeting sensitivity to change of core autism spectrum disorder (ASD) symptoms, the Autism Impact Measure (AIM). The AIM uses a 2-week recall period with items rated on two corresponding 5-point scales (frequency and impact). Psychometric properties were…
Is There a Core in Sociology? Results from a Survey
ERIC Educational Resources Information Center
Wagenaar, Theodore C.
2004-01-01
I report on a study of 301 sociologists to determine which concepts, topics, and skills they deem most important to cover in the introductory course and in the sociology curriculum. Respondents indicated high agreement that the list of skills, topics, and concepts adequately represented the range of possible items. I use both the raw ratings and…
Measuring Functional Creativity: Non-Expert Raters and the Creative Solution Diagnosis Scale
ERIC Educational Resources Information Center
Cropley, David H.; Kaufman, James C.
2012-01-01
The Creative Solution Diagnosis Scale (CSDS) is a 30-item scale based on a core of four criteria: Relevance & Effectiveness, Novelty, Elegance, and Genesis. The CSDS offers potential for the consensual assessment of functional product creativity. This article describes an empirical study in which non-expert judges rated a series of mousetrap…
Psychology in Teacher Education: A Perspective from Singapore's Pre-Service Teachers
ERIC Educational Resources Information Center
Tan, Ai-Girl
2006-01-01
This paper reports on Singaporean pre-service teachers' views of psychology and knowledge and the skills of psychology which are important for them. A total of 353 teachers taking the core module of educational psychology participated in the study. They rated the degree of appropriateness of items that described the discipline of psychology and…
Guillemin, I; Marrel, A; Arnould, B; Capuron, L; Dupuy, A; Ginon, E; Layé, S; Lecerf, J-M; Prost, M; Rogeaux, M; Urdapilleta, I; Allaert, F-A
2016-01-01
Providing well-being and maintaining good health are main objectives subjects seek from diet. This manuscript describes the development and preliminary validation of an instrument assessing well-being associated with food and eating habits in a general healthy population. Qualitative data from 12 groups of discussion (102 subjects) conducted with healthy subjects were used to develop the core of the Well-being related to Food Questionnaire (Well-BFQ). Twelve other groups of discussion with subjects with joint (n = 34), digestive (n = 32) or repetitive infection complaints (n = 30) were performed to develop items specific to these complaints. Five main themes emerged from the discussions and formed the modular backbone of the questionnaire: "Grocery shopping", "Cooking", "Dining places", "Commensality", "Eating and drinking". Each module has a common structure: items about subject's food behavior and items about immediate and short-term benefits. An additional theme - "Eating habits and health" - assesses subjects' beliefs about expected benefits of food and eating habits on health, disease prevention and protection, and quality of ageing. A preliminary validation was conducted with 444 subjects with balanced diet; non-balanced diet; and standard diet. The structure of the questionnaire was further determined using principal component analyses exploratory factor analyses, with confirmation of the sub-sections food behaviors, immediate benefits (pleasure, security, relaxation), direct short-term benefits (digestion and satiety, energy and psychology), and deferred long-term benefits (eating habits and health). Thirty-three subscales and 14 single items were further defined. Confirmatory analyses confirmed the structure, with overall moderate to excellent convergent and divergent validity and internal consistency reliability. The Well-BFQ is a unique, modular tool that comprehensively assesses the full picture of well-being related to food and eating habits in the general population. Copyright © 2015 Elsevier Ltd. All rights reserved.
Sparsey™: event recognition via deep hierarchical sparse distributed codes
Rinkus, Gerard J.
2014-01-01
The visual cortex's hierarchical, multi-level organization is captured in many biologically inspired computational vision models, the general idea being that progressively larger scale (spatially/temporally) and more complex visual features are represented in progressively higher areas. However, most earlier models use localist representations (codes) in each representational field (which we equate with the cortical macrocolumn, “mac”), at each level. In localism, each represented feature/concept/event (hereinafter “item”) is coded by a single unit. The model we describe, Sparsey, is hierarchical as well but crucially, it uses sparse distributed coding (SDC) in every mac in all levels. In SDC, each represented item is coded by a small subset of the mac's units. The SDCs of different items can overlap and the size of overlap between items can be used to represent their similarity. The difference between localism and SDC is crucial because SDC allows the two essential operations of associative memory, storing a new item and retrieving the best-matching stored item, to be done in fixed time for the life of the model. Since the model's core algorithm, which does both storage and retrieval (inference), makes a single pass over all macs on each time step, the overall model's storage/retrieval operation is also fixed-time, a criterion we consider essential for scalability to the huge (“Big Data”) problems. A 2010 paper described a nonhierarchical version of this model in the context of purely spatial pattern processing. Here, we elaborate a fully hierarchical model (arbitrary numbers of levels and macs per level), describing novel model principles like progressive critical periods, dynamic modulation of principal cells' activation functions based on a mac-level familiarity measure, representation of multiple simultaneously active hypotheses, a novel method of time warp invariant recognition, and we report results showing learning/recognition of spatiotemporal patterns. PMID:25566046
Koller, M; Hjermstad, M J; Tomaszewski, K A; Tomaszewska, I M; Hornslien, K; Harle, A; Arraras, J I; Morag, O; Pompili, C; Ioannidis, G; Georgiou, M; Navarra, C; Chie, W-C; Johnson, C D; Himpel, A; Schulz, C; Bohrer, T; Janssens, A; Kulis, D; Bottomley, A
2017-11-01
The European Organization for Research and Treatment of Cancer (EORTC) QLQ-LC13 was the first module to be used in conjunction with the core questionnaire, the QLQ-C30. Since the publication of the LC13 in 1994, major advances have occurred in the treatment of lung cancer. Given this, an update of the EORTC QLQ-LC13 was undertaken. The study followed phases I to III of the EORTC Module Development Guidelines. Phase I generated relevant quality-of-life issues using a mix of sources including the involvement of 108 lung cancer patients. Phase II transformed issues into questionnaire items. In an international multicenter study (phase III), patients completed both the EORTC QLQ-C30 and the 48-item provisional lung cancer module generated in phases I and II. Patients rated each of the items regarding relevance, comprehensibility, and acceptance. Patient ratings were assessed against a set of prespecified statistical criteria. Descriptive statistics and basic psychometric analyses were carried out. The phase III study enrolled 200 patients with histologically confirmed lung cancer from 12 centers in nine countries (Cyprus, Germany, Italy, Israel, Spain, Norway, Poland, Taiwan, and the UK). Mean age was 64 years (39 - 91), 59% of the patients were male, 82% had non-small-cell lung cancer, and 56% were treated with palliative intent. Twenty-nine of the 48 questions met the criteria for inclusion. The resulting module with 29 questions, thus currently named EORTC QLQ-LC29, retained 12 of the 13 original items, supplemented with 17 items that primarily assess treatment side-effects of traditional and newer therapies. © The Author 2017. Published by Oxford University Press on behalf of the European Society for Medical Oncology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Assessing the utility of diagnostic criteria: a multisite study on gender identity disorder.
Paap, Muirne C S; Kreukels, Baudewijntje P C; Cohen-Kettenis, Peggy T; Richter-Appelt, Hertha; de Cuypere, Griet; Haraldsen, Ira R
2011-01-01
Studies involving patients with gender identity disorder (GID) are inconsistent with regard to outcomes and often difficult to compare because of the vague descriptions of the diagnostic process. A multisite study is needed to scrutinize the utility and generality of different aspects of the diagnostic criteria for GID. To investigate the way in which the diagnosis-specific Diagnostic and Statistical Manual of Mental Disorders, 4th Edition, Text Revision criteria for GID were used to reach a psychiatric diagnosis in four European countries: the Netherlands (Amsterdam), Norway (Oslo), Germany (Hamburg), and Belgium (Ghent). The main goal was to compare item (symptom) characteristics across countries. The current study included all new applicants to the four GID clinics who were seen between January 2007 and March 2009, were at least 16 years of age at their first visit, and had completed the diagnostic assessment (N = 214, mean age = 32 ± 12.2 years). Mokken scale analysis, a form of Nonparametric Item Response Theory (NIRT) was performed. Operationalization and quantification of the core criteria A and B resulted in a 23-item score sheet that was filled out by the participating clinicians after they had made a diagnosis. We found that, when ordering the 23 items according to their means for each country separately, the rank ordering was similar among the four countries for 21 of the items. Furthermore, only one scale emerged, which combined criteria A and B when all data were analyzed together. Our results indicate that patients' symptoms were interpreted in a similar fashion in all four countries. However, we did not find support for the treatment of A and B as two separate criteria. We recommend the use of NIRT in future studies, especially in studies with small sample sizes and/or with data that show a poor fit to parametric IRT models. © 2010 International Society for Sexual Medicine.
Bernhard, Gerda; Knibbe, Ronald A.; von Wolff, Alessa; Dingoyan, Demet; Schulz, Holger; Mösko, Mike
2015-01-01
Background Cultural competence of healthcare professionals (HCPs) is recognized as a strategy to reduce cultural disparities in healthcare. However, standardised, valid and reliable instruments to assess HCPs’ cultural competence are notably lacking. The present study aims to 1) identify the core components of cultural competence from a healthcare perspective, 2) to develop a self-report instrument to assess cultural competence of HCPs and 3) to evaluate the psychometric properties of the new instrument. Methods The conceptual model and initial item pool, which were applied to the cross-cultural competence instrument for the healthcare profession (CCCHP), were derived from an expert survey (n = 23), interviews with HCPs (n = 12), and a broad narrative review on assessment instruments and conceptual models of cultural competence. The item pool was reduced systematically, which resulted in a 59-item instrument. A sample of 336 psychologists, in advanced psychotherapeutic training, and 409 medical students participated, in order to evaluate the construct validity and reliability of the CCCHP. Results Construct validity was supported by principal component analysis, which led to a 32-item six-component solution with 50% of the total variance explained. The different dimensions of HCPs’ cultural competence are: Cross-Cultural Motivation/Curiosity, Cross-Cultural Attitudes, Cross-Cultural Skills, Cross-Cultural Knowledge/Awareness and Cross-Cultural Emotions/Empathy. For the total instrument, the internal consistency reliability was .87 and the dimension’s Cronbach’s α ranged from .54 to .84. The discriminating power of the CCCHP was indicated by statistically significant mean differences in CCCHP subscale scores between predefined groups. Conclusions The 32-item CCCHP exhibits acceptable psychometric properties, particularly content and construct validity to examine HCPs’ cultural competence. The CCCHP with its five dimensions offers a comprehensive assessment of HCPs’ cultural competence, and has the ability to distinguish between groups that are expected to differ in cultural competence. This instrument can foster professional development through systematic self-assessment and thus contributes to improve the quality of patient care. PMID:26641876
Bernhard, Gerda; Knibbe, Ronald A; von Wolff, Alessa; Dingoyan, Demet; Schulz, Holger; Mösko, Mike
2015-01-01
Cultural competence of healthcare professionals (HCPs) is recognized as a strategy to reduce cultural disparities in healthcare. However, standardised, valid and reliable instruments to assess HCPs' cultural competence are notably lacking. The present study aims to 1) identify the core components of cultural competence from a healthcare perspective, 2) to develop a self-report instrument to assess cultural competence of HCPs and 3) to evaluate the psychometric properties of the new instrument. The conceptual model and initial item pool, which were applied to the cross-cultural competence instrument for the healthcare profession (CCCHP), were derived from an expert survey (n = 23), interviews with HCPs (n = 12), and a broad narrative review on assessment instruments and conceptual models of cultural competence. The item pool was reduced systematically, which resulted in a 59-item instrument. A sample of 336 psychologists, in advanced psychotherapeutic training, and 409 medical students participated, in order to evaluate the construct validity and reliability of the CCCHP. Construct validity was supported by principal component analysis, which led to a 32-item six-component solution with 50% of the total variance explained. The different dimensions of HCPs' cultural competence are: Cross-Cultural Motivation/Curiosity, Cross-Cultural Attitudes, Cross-Cultural Skills, Cross-Cultural Knowledge/Awareness and Cross-Cultural Emotions/Empathy. For the total instrument, the internal consistency reliability was .87 and the dimension's Cronbach's α ranged from .54 to .84. The discriminating power of the CCCHP was indicated by statistically significant mean differences in CCCHP subscale scores between predefined groups. The 32-item CCCHP exhibits acceptable psychometric properties, particularly content and construct validity to examine HCPs' cultural competence. The CCCHP with its five dimensions offers a comprehensive assessment of HCPs' cultural competence, and has the ability to distinguish between groups that are expected to differ in cultural competence. This instrument can foster professional development through systematic self-assessment and thus contributes to improve the quality of patient care.
Adleman, Jenna; Gillan, Caitlin; Caissie, Amanda; Davis, Carol-Anne; Liszewski, Brian; McNiven, Andrea; Giuliani, Meredith
2017-06-01
To develop an entry-to-practice quality and safety competency profile for radiation oncology residency. A comprehensive list of potential quality and safety competency items was generated from public and professional resources and interprofessional focus groups. Redundant or out-of-scope items were eliminated through investigator consensus. Remaining items were subjected to an international 2-round modified Delphi process involving experts in radiation oncology, radiation therapy, and medical physics. During Round 1, each item was scored independently on a 9-point Likert scale indicating appropriateness for inclusion in the competency profile. Items indistinctly ranked for inclusion or exclusion were re-evaluated through web conference discussion and reranked in Round 2. An initial 1211 items were compiled from 32 international sources and distilled to 105 unique potential quality and safety competency items. Fifteen of the 50 invited experts participated in round 1: 10 radiation oncologists, 4 radiation therapists, and 1 medical physicist from 13 centers in 5 countries. Round 1 rankings resulted in 80 items included, 1 item excluded, and 24 items indeterminate. Two areas emerged more prominently within the latter group: change management and human factors. Web conference with 5 participants resulted in 9 of these 24 items edited for content or clarity. In Round 2, 12 participants rescored all indeterminate items resulting in 10 items ranked for inclusion. The final 90 enabling competency items were organized into thematic groups consisting of 18 key competencies under headings adapted from Deming's System of Profound Knowledge. This quality and safety competency profile may inform minimum training standards for radiation oncology residency programs. Copyright © 2017 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Adleman, Jenna; Gillan, Caitlin; Radiation Medicine Program, Princess Margaret Cancer Centre, Toronto, Ontario
Purpose: To develop an entry-to-practice quality and safety competency profile for radiation oncology residency. Methods and Materials: A comprehensive list of potential quality and safety competency items was generated from public and professional resources and interprofessional focus groups. Redundant or out-of-scope items were eliminated through investigator consensus. Remaining items were subjected to an international 2-round modified Delphi process involving experts in radiation oncology, radiation therapy, and medical physics. During Round 1, each item was scored independently on a 9-point Likert scale indicating appropriateness for inclusion in the competency profile. Items indistinctly ranked for inclusion or exclusion were re-evaluated through webmore » conference discussion and reranked in Round 2. Results: An initial 1211 items were compiled from 32 international sources and distilled to 105 unique potential quality and safety competency items. Fifteen of the 50 invited experts participated in round 1: 10 radiation oncologists, 4 radiation therapists, and 1 medical physicist from 13 centers in 5 countries. Round 1 rankings resulted in 80 items included, 1 item excluded, and 24 items indeterminate. Two areas emerged more prominently within the latter group: change management and human factors. Web conference with 5 participants resulted in 9 of these 24 items edited for content or clarity. In Round 2, 12 participants rescored all indeterminate items resulting in 10 items ranked for inclusion. The final 90 enabling competency items were organized into thematic groups consisting of 18 key competencies under headings adapted from Deming's System of Profound Knowledge. Conclusions: This quality and safety competency profile may inform minimum training standards for radiation oncology residency programs.« less
LeBouthillier, Daniel M; Thibodeau, Michel A; Alberts, Nicole M; Hadjistavropoulos, Heather D; Asmundson, Gordon J G
2015-04-01
Individuals with medical conditions are likely to have elevated health anxiety; however, research has not demonstrated how medical status impacts response patterns on health anxiety measures. Measurement bias can undermine the validity of a questionnaire by overestimating or underestimating scores in groups of individuals. We investigated whether the Short Health Anxiety Inventory (SHAI), a widely-used measure of health anxiety, exhibits medical condition-based bias on item and subscale levels, and whether the SHAI subscales adequately assess the health anxiety continuum. Data were from 963 individuals with diabetes, breast cancer, or multiple sclerosis, and 372 healthy individuals. Mantel-Haenszel tests and item characteristic curves were used to classify the severity of item-level differential item functioning in all three medical groups compared to the healthy group. Test characteristic curves were used to assess scale-level differential item functioning and whether the SHAI subscales adequately assess the health anxiety continuum. Nine out of 14 items exhibited differential item functioning. Two items exhibited differential item functioning in all medical groups compared to the healthy group. In both Thought Intrusion and Fear of Illness subscales, differential item functioning was associated with mildly deflated scores in medical groups with very high levels of the latent traits. Fear of Illness items poorly discriminated between individuals with low and very low levels of the latent trait. While individuals with medical conditions may respond differentially to some items, clinicians and researchers can confidently use the SHAI with a variety of medical populations without concern of significant bias. Copyright © 2015 Elsevier Inc. All rights reserved.
Efficient Algorithms for Segmentation of Item-Set Time Series
NASA Astrophysics Data System (ADS)
Chundi, Parvathi; Rosenkrantz, Daniel J.
We propose a special type of time series, which we call an item-set time series, to facilitate the temporal analysis of software version histories, email logs, stock market data, etc. In an item-set time series, each observed data value is a set of discrete items. We formalize the concept of an item-set time series and present efficient algorithms for segmenting a given item-set time series. Segmentation of a time series partitions the time series into a sequence of segments where each segment is constructed by combining consecutive time points of the time series. Each segment is associated with an item set that is computed from the item sets of the time points in that segment, using a function which we call a measure function. We then define a concept called the segment difference, which measures the difference between the item set of a segment and the item sets of the time points in that segment. The segment difference values are required to construct an optimal segmentation of the time series. We describe novel and efficient algorithms to compute segment difference values for each of the measure functions described in the paper. We outline a dynamic programming based scheme to construct an optimal segmentation of the given item-set time series. We use the item-set time series segmentation techniques to analyze the temporal content of three different data sets—Enron email, stock market data, and a synthetic data set. The experimental results show that an optimal segmentation of item-set time series data captures much more temporal content than a segmentation constructed based on the number of time points in each segment, without examining the item set data at the time points, and can be used to analyze different types of temporal data.
Tulsky, David S; Kisala, Pamela A; Tate, Denise G; Spungen, Ann M; Kirshblum, Steven C
2015-05-01
To describe the development and psychometric properties of the Spinal Cord Injury--Quality of Life (SCI-QOL) Bladder Management Difficulties and Bowel Management Difficulties item banks and Bladder Complications scale. Using a mixed-methods design, a pool of items assessing bladder and bowel-related concerns were developed using focus groups with individuals with spinal cord injury (SCI) and SCI clinicians, cognitive interviews, and item response theory (IRT) analytic approaches, including tests of model fit and differential item functioning. Thirty-eight bladder items and 52 bowel items were tested at the University of Michigan, Kessler Foundation Research Center, the Rehabilitation Institute of Chicago, the University of Washington, Craig Hospital, and the James J. Peters VA Medical Center, Bronx, NY. Seven hundred fifty-seven adults with traumatic SCI. The final item banks demonstrated unidimensionality (Bladder Management Difficulties CFI=0.965; RMSEA=0.093; Bowel Management Difficulties CFI=0.955; RMSEA=0.078) and acceptable fit to a graded response IRT model. The final calibrated Bladder Management Difficulties bank includes 15 items, and the final Bowel Management Difficulties item bank consists of 26 items. Additionally, 5 items related to urinary tract infections (UTI) did not fit with the larger Bladder Management Difficulties item bank but performed relatively well independently (CFI=0.992, RMSEA=0.050) and were thus retained as a separate scale. The SCI-QOL Bladder Management Difficulties and Bowel Management Difficulties item banks are psychometrically robust and are available as computer adaptive tests or short forms. The SCI-QOL Bladder Complications scale is a brief, fixed-length outcomes instrument for individuals with a UTI.
Crins, Martine H. P.; Roorda, Leo D.; Smits, Niels; de Vet, Henrica C. W.; Westhovens, Rene; Cella, David; Cook, Karon F.; Revicki, Dennis; van Leeuwen, Jaap; Boers, Maarten; Dekker, Joost; Terwee, Caroline B.
2015-01-01
The Dutch-Flemish PROMIS Group translated the adult PROMIS Pain Interference item bank into Dutch-Flemish. The aims of the current study were to calibrate the parameters of these items using an item response theory (IRT) model, to evaluate the cross-cultural validity of the Dutch-Flemish translations compared to the original English items, and to evaluate their reliability and construct validity. The 40 items in the bank were completed by 1085 Dutch chronic pain patients. Before calibrating the items, IRT model assumptions were evaluated using confirmatory factor analysis (CFA). Items were calibrated using the graded response model (GRM), an IRT model appropriate for items with more than two response options. To evaluate cross-cultural validity, differential item functioning (DIF) for language (Dutch vs. English) was examined. Reliability was evaluated based on standard errors and Cronbach’s alpha. To evaluate construct validity correlations with scores on legacy instruments (e.g., the Disabilities of the Arm, Shoulder and Hand Questionnaire) were calculated. Unidimensionality of the Dutch-Flemish PROMIS Pain Interference item bank was supported by CFA tests of model fit (CFI = 0.986, TLI = 0.986). Furthermore, the data fit the GRM and showed good coverage across the pain interference continuum (threshold-parameters range: -3.04 to 3.44). The Dutch-Flemish PROMIS Pain Interference item bank has good cross-cultural validity (only two out of 40 items showing DIF), good reliability (Cronbach’s alpha = 0.98), and good construct validity (Pearson correlations between 0.62 and 0.75). A computer adaptive test (CAT) and Dutch-Flemish PROMIS short forms of the Dutch-Flemish PROMIS Pain Interference item bank can now be developed. PMID:26214178
Calorie Changes in Large Chain Restaurants: Declines in New Menu Items but Room for Improvement.
Bleich, Sara N; Wolfson, Julia A; Jarlenski, Marian P
2016-01-01
Large chain restaurants reduced the number of calories in newly introduced menu items in 2013 by about 60 calories (or 12%) relative to 2012. This paper describes trends in calories available in large U.S. chain restaurants to understand whether previously documented patterns persist. Data (a census of items for included restaurants) were obtained from the MenuStat project. This analysis included 66 of the 100 largest U.S. restaurants that are available in all three of the data years (2012-2014; N=23,066 items). Generalized linear models were used to examine: (1) per-item calorie changes from 2012 to 2014 among items on the menu in all years; and (2) mean calories in new items in 2013 and 2014 compared with items on the menu in 2012 only. Data were analyzed in 2014. Overall, calories in newly introduced menu items declined by 71 (or 15%) from 2012 to 2013 (p=0.001) and by 69 (or 14%) from 2012 to 2014 (p=0.03). These declines were concentrated mainly in new main course items (85 fewer calories in 2013 and 55 fewer calories in 2014; p=0.01). Although average calories in newly introduced menu items are declining, they are higher than items common to the menu in all 3 years. No differences in mean calories among items on menus in 2012, 2013, or 2014 were found. The previously observed declines in newly introduced menu items among large restaurant chains have been maintained, which suggests the beginning of a trend toward reducing calories. Copyright © 2016 American Journal of Preventive Medicine. Published by Elsevier Inc. All rights reserved.
Hays, Ron D; Spritzer, Karen L; Amtmann, Dagmar; Lai, Jin-Shei; Dewitt, Esi Morgan; Rothrock, Nan; Dewalt, Darren A; Riley, William T; Fries, James F; Krishnan, Eswar
2013-11-01
To create upper-extremity and mobility subdomain scores from the Patient-Reported Outcomes Measurement Information System (PROMIS) physical functioning adult item bank. Expert reviews were used to identify upper-extremity and mobility items from the PROMIS item bank. Psychometric analyses were conducted to assess empirical support for scoring upper-extremity and mobility subdomains. Data were collected from the U.S. general population and multiple disease groups via self-administered surveys. The sample (N=21,773) included 21,133 English-speaking adults who participated in the PROMIS wave 1 data collection and 640 Spanish-speaking Latino adults recruited separately. Not applicable. We used English- and Spanish-language data and existing PROMIS item parameters for the physical functioning item bank to estimate upper-extremity and mobility scores. In addition, we fit graded response models to calibrate the upper-extremity items and mobility items separately, compare separate to combined calibrations, and produce subdomain scores. After eliminating items because of local dependency, 16 items remained to assess upper extremity and 17 items to assess mobility. The estimated correlation between upper extremity and mobility was .59 using existing PROMIS physical functioning item parameters (r=.60 using parameters calibrated separately for upper-extremity and mobility items). Upper-extremity and mobility subdomains shared about 35% of the variance in common, and produced comparable scores whether calibrated separately or together. The identification of the subset of items tapping these 2 aspects of physical functioning and scored using the existing PROMIS parameters provides the option of scoring these subdomains in addition to the overall physical functioning score. Copyright © 2013 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Development of a PROMIS item bank to measure pain interference.
Amtmann, Dagmar; Cook, Karon F; Jensen, Mark P; Chen, Wen-Hung; Choi, Seung; Revicki, Dennis; Cella, David; Rothrock, Nan; Keefe, Francis; Callahan, Leigh; Lai, Jin-Shei
2010-07-01
This paper describes the psychometric properties of the PROMIS-pain interference (PROMIS-PI) bank. An initial candidate item pool (n=644) was developed and evaluated based on the review of existing instruments, interviews with patients, and consultation with pain experts. From this pool, a candidate item bank of 56 items was selected and responses to the items were collected from large community and clinical samples. A total of 14,848 participants responded to all or a subset of candidate items. The responses were calibrated using an item response theory (IRT) model. A final 41-item bank was evaluated with respect to IRT assumptions, model fit, differential item function (DIF), precision, and construct and concurrent validity. Items of the revised bank had good fit to the IRT model (CFI and NNFI/TLI ranged from 0.974 to 0.997), and the data were strongly unidimensional (e.g., ratio of first and second eigenvalue=35). Nine items exhibited statistically significant DIF. However, adjusting for DIF had little practical impact on score estimates and the items were retained without modifying scoring. Scores provided substantial information across levels of pain; for scores in the T-score range 50-80, the reliability was equivalent to 0.96-0.99. Patterns of correlations with other health outcomes supported the construct validity of the item bank. The scores discriminated among persons with different numbers of chronic conditions, disabling conditions, levels of self-reported health, and pain intensity (p<0.0001). The results indicated that the PROMIS-PI items constitute a psychometrically sound bank. Computerized adaptive testing and short forms are available. Copyright 2010 International Association for the Study of Pain. All rights reserved.
Crins, Martine H P; Roorda, Leo D; Smits, Niels; de Vet, Henrica C W; Westhovens, Rene; Cella, David; Cook, Karon F; Revicki, Dennis; van Leeuwen, Jaap; Boers, Maarten; Dekker, Joost; Terwee, Caroline B
2015-01-01
The Dutch-Flemish PROMIS Group translated the adult PROMIS Pain Interference item bank into Dutch-Flemish. The aims of the current study were to calibrate the parameters of these items using an item response theory (IRT) model, to evaluate the cross-cultural validity of the Dutch-Flemish translations compared to the original English items, and to evaluate their reliability and construct validity. The 40 items in the bank were completed by 1085 Dutch chronic pain patients. Before calibrating the items, IRT model assumptions were evaluated using confirmatory factor analysis (CFA). Items were calibrated using the graded response model (GRM), an IRT model appropriate for items with more than two response options. To evaluate cross-cultural validity, differential item functioning (DIF) for language (Dutch vs. English) was examined. Reliability was evaluated based on standard errors and Cronbach's alpha. To evaluate construct validity correlations with scores on legacy instruments (e.g., the Disabilities of the Arm, Shoulder and Hand Questionnaire) were calculated. Unidimensionality of the Dutch-Flemish PROMIS Pain Interference item bank was supported by CFA tests of model fit (CFI = 0.986, TLI = 0.986). Furthermore, the data fit the GRM and showed good coverage across the pain interference continuum (threshold-parameters range: -3.04 to 3.44). The Dutch-Flemish PROMIS Pain Interference item bank has good cross-cultural validity (only two out of 40 items showing DIF), good reliability (Cronbach's alpha = 0.98), and good construct validity (Pearson correlations between 0.62 and 0.75). A computer adaptive test (CAT) and Dutch-Flemish PROMIS short forms of the Dutch-Flemish PROMIS Pain Interference item bank can now be developed.