Heinemann, Allen W; Kisala, Pamela A; Hahn, Elizabeth A; Tulsky, David S
2015-05-01
To develop a spinal cord injury (SCI)-focused version of PROMIS and Neuro-QOL social domain item banks; evaluate the psychometric properties of items developed for adults with SCI; and report information to facilitate clinical and research use. We used a mixed-methods design to develop and evaluate Ability to Participate in Social Roles and Activities and Satisfaction with Social Roles and Activities items. Focus groups helped define the constructs; cognitive interviews helped revise items; and confirmatory factor analysis and item response theory methods helped calibrate item banks and evaluate differential item functioning related to demographic and injury characteristics. Five SCI Model System sites and one Veterans Administration medical center. The calibration sample consisted of 641 individuals; a reliability sample consisted of 245 individuals residing in the community. A subset of 27 Ability to Participate and 35 Satisfaction items demonstrated good measurement properties and negligible differential item functioning related to demographic and injury characteristics. The SCI-specific measures correlate strongly with the PROMIS and Neuro-QOL versions. Ten item short forms correlate >0.96 with the full banks. Variable-length CATs with a minimum of 4 items, variable-length CATs with a minimum of 8 items, fixed-length CATs of 10 items, and the 10-item short forms demonstrate construct coverage and measurement error that is comparable to the full item bank. The Ability to Participate and Satisfaction with Social Roles and Activities CATs and short forms demonstrate excellent psychometric properties and are suitable for clinical and research applications.
McCaffrey, Stacey A; Black, Ryan A; Butler, Stephen F
2018-03-01
The PainCAS is a web-based clinical tool for assessing and tracking pain and opioid risk in chronic pain patients. Despite evidence for its utility within the clinical setting, the PainCAS scales have never been subject to psychometric evaluation. The current study is the first to evaluate the psychometric properties of the PainCAS Interference with Daily Activities, Psychological/Emotional Distress, and Pain scales. Patients (N = 4797) from treatment centers and hospitals in 16 different states completed the PainCAS as part of routine clinical assessment. A subsample (n = 73) from two hospital-based treatment centers also completed comparator measures. Rasch Rating Scale Models were employed to evaluate the Interference with Daily Activities and Psychological/Emotional Distress scales, and empirical evaluation included assessment of dimensionality, discrimination, item fit, reliability, information, and person-to-item targeting. Additionally, convergent and discriminant validity were evaluated through classical test theory approaches. Convergent validity of the Pain scales was evaluated through correlations with corresponding comparator items. One Interference with Daily Activities item was removed due to poor functioning and discrimination. The retained items from the Interference with Daily Activities and Psychological/Emotional Distress scales conformed to unidimensional Rasch measurement models, yielding satisfactory item fit, reliability, precision, and coverage. Further, results provided support for the convergent and discriminant validity of these two scales. Convergent validity between the PainCAS Pain and BPI Pain items was also strong. Taken together, results provide strong psychometric support for these PainCAS Pain scales. Strengths and limitations of the current study are discussed.
Paap, Muirne C S; Kroeze, Karel A; Terwee, Caroline B; van der Palen, Job; Veldkamp, Bernard P
2017-11-01
Examining item usage is an important step in evaluating the performance of a computerized adaptive test (CAT). We study item usage for a newly developed multidimensional CAT which draws items from three PROMIS domains, as well as a disease-specific one. The multidimensional item bank used in the current study contained 194 items from four domains: the PROMIS domains fatigue, physical function, and ability to participate in social roles and activities, and a disease-specific domain (the COPD-SIB). The item bank was calibrated using the multidimensional graded response model and data of 795 patients with chronic obstructive pulmonary disease. To evaluate the item usage rates of all individual items in our item bank, CAT simulations were performed on responses generated based on a multivariate uniform distribution. The outcome variables included active bank size and item overuse (usage rate larger than the expected item usage rate). For average θ-values, the overall active bank size was 9-10%; this number quickly increased as θ-values became more extreme. For values of -2 and +2, the overall active bank size equaled 39-40%. There was 78% overlap between overused items and active bank size for average θ-values. For more extreme θ-values, the overused items made up a much smaller part of the active bank size: here the overlap was only 35%. Our results strengthen the claim that relatively short item banks may suffice when using polytomous items (and no content constraints/exposure control mechanisms), especially when using MCAT.
Mulcahey, M J; Merenda, Lisa; Tian, Feng; Kozin, Scott; James, Michelle; Gogola, Gloria; Ni, Pengsheng
2013-01-01
This study examined the psychometric properties of item pools relevant to upper-extremity function and activity performance and evaluated simulated 5-, 10-, and 15-item computer adaptive tests (CATs). In a multicenter, cross-sectional study of 200 children and youth with brachial plexus birth palsy (BPBP), parents responded to upper-extremity (n = 52) and activity (n = 34) items using a 5-point response scale. We used confirmatory and exploratory factor analysis, ordinal logistic regression, item maps, and standard errors to evaluate the psychometric properties of the item banks. Validity was evaluated using analysis of variance and Pearson correlation coefficients. Results show that the two item pools have acceptable model fit, scaled well for children and youth with BPBP, and had good validity, content range, and precision. Simulated CATs performed comparably to the full item banks, suggesting that a reduced number of items provide similar information to the entire set of items. Copyright © 2013 by the American Occupational Therapy Association, Inc.
Arimoto, Azusa; Gregg, Misuzu F; Nagata, Satoko; Miki, Yuko; Murashima, Sachiyo
2012-07-01
Evaluation of doctoral programs in nursing is becoming more important with the rapid increase in the programs in Japan. This study aimed to evaluate doctoral nursing programs by faculty members and to analyze the relationship of the evaluation with educational and research activities of faculty members in Japan. Target settings were all 46 doctoral nursing programs. Eighty-five faculty members from 28 programs answered the questionnaire, which included 17 items for program evaluation, 12 items for faculty evaluation, 9 items for resource evaluation, 3 items for overall evaluations, and educational and research activities. A majority gave low evaluations for sources of funding, the number of faculty members and support staff, and administrative systems. Faculty members who financially supported a greater number of students gave a higher evaluation for extramural funding support, publication, provision of diverse learning experiences, time of supervision, and research infrastructure. The more time a faculty member spent on advising doctoral students, the higher were their evaluations on the supportive learning environment, administrative systems, time of supervision, and timely feedback on students' research. The findings of this study indicate a need for improvement in research infrastructure, funding sources, and human resources to achieve quality nursing doctoral education in Japan. Copyright © 2011 Elsevier Ltd. All rights reserved.
Rasch analysis of the patient-rated wrist evaluation questionnaire.
Esakki, Saravanan; MacDermid, Joy C; Vincent, Joshua I; Packham, Tara L; Walton, David; Grewal, Ruby
2018-01-01
The Patient-Rated Wrist Evaluation (PRWE) was developed as a wrist joint specific measure of pain and disability and evidence of sound validity has been accumulated through classical psychometric methods. Rasch analysis (RA) has been endorsed as a newer method for analyzing the clinical measurement properties of self-report outcome measures. The purpose of this study was to evaluate the PRWE using Rasch modeling. We employed the Rasch model to assess overall fit, response scaling, individual item fit, differential item functioning (DIF), local dependency, unidimensionality and person separation index (PSI). A convenience sample of 382 patients with distal radius fracture was recruited from the hand and upper limb clinic at large academic healthcare organization, London, Ontario, Canada, 6-month post-injury scores of the PRWE was used. RA was conducted on the 3 subscales (pain, specific activities, and usual activities) of the PRWE separately. The pain subscale adequately fit the Rasch model when item 4 "Pain - When it is at its worst" was deleted to eliminate non-uniform DIF by age group, and item 5 "How often do you have pain" was rescored by collapsing into 8 intervals to eliminate disordered thresholds. Uniform DIF for "Use my affected hand to push up from the chair" (by work status) and "Use bathroom tissue with my affected hand" (by injured hand) was addressed by splitting the items for analysis. After background rescoring of 2 items in pain subscale, 2 items in specific activities and 3 items in usual activities, all three subscales of the PRWE were well targeted and had high reliability (PSI = 0.86). These changes provided a unidimensional, interval-level scaled measure. Like a previous analysis of the Patient-Rated Wrist and Hand Evaluation, this study found the PRWE could be fit to the Rasch model with rescoring of multiple items. However, the modifications required to achieve fit were not the same across studies, our fit statistics also suggested one of the pain items should be deleted. This study adds to the pool of evidence supporting the PRWE, but cannot confidently provide a Rasch-based scoring algorithm.
Oude Voshaar, Martijn A H; Ten Klooster, Peter M; Vonkeman, Harald E; van de Laar, Mart A F J
2017-11-01
Traditional patient-reported physical function instruments often poorly differentiate patients with mild-to-moderate disability. We describe the development and psychometric evaluation of a generic item bank for measuring everyday activity limitations in outpatient populations. Seventy-two items generated from patient interviews and mapped to the International Classification of Functioning, Disability and Health (ICF) domestic life chapter were administered to 1128 adults representative of the Dutch population. The partial credit model was fitted to the item responses and evaluated with respect to its assumptions, model fit, and differential item functioning (DIF). Measurement performance of a computerized adaptive testing (CAT) algorithm was compared with the SF-36 physical functioning scale (PF-10). A final bank of 41 items was developed. All items demonstrated acceptable fit to the partial credit model and measurement invariance across age, sex, and educational level. Five- and ten-item CAT simulations were shown to have high measurement precision, which exceeded that of SF-36 physical functioning scale across the physical function continuum. Floor effects were absent for a 10-item empirical CAT simulation, and ceiling effects were low (13.5%) compared with SF-36 physical functioning (38.1%). CAT also discriminated better than SF-36 physical functioning between age groups, number of chronic conditions, and respondents with or without rheumatic conditions. The Rasch assessment of everyday activity limitations (REAL) item bank will hopefully prove a useful instrument for assessing everyday activity limitations. T-scores obtained using derived measures can be used to benchmark physical function outcomes against the general Dutch adult population.
Oude Voshaar, Martijn A H; Ten Klooster, Peter M; Glas, Cees A W; Vonkeman, Harald E; Taal, Erik; Krishnan, Eswar; Bernelot Moens, Hein J; Boers, Maarten; Terwee, Caroline B; van Riel, Piet L C M; van de Laar, Mart A F J
2015-12-01
To evaluate the content validity and measurement properties of the Patient-Reported Outcome Measurement Information System (PROMIS) physical function item bank and a 20-item short form in patients with RA in comparison with the HAQ disability index (HAQ-DI) and 36-item Short Form Health Survey (SF-36) physical functioning scale (PF-10). The content validity of the instruments was evaluated by linking their items to the International Classification of Functioning, Disability and Health (ICF) core set for RA. The measures were administered to 690 RA patients enrolled in the Dutch Rheumatoid Arthritis Monitoring registry. Measurement precision was evaluated using item response theory methods and construct validity was evaluated by correlating physical function scores with other clinical and patient-reported outcome measures. All 207 health concepts identified in the physical function measures referred to activities that are featured in the ICF. Twenty-three of 26 ICF RA core set domains are featured in the full PROMIS physical function item bank compared with 13 and 8 for the HAQ-DI and PF-10, respectively. As hypothesized, all three physical function instruments were highly intercorrelated (r 0.74-0.84), moderately correlated with disease activity measures (r 0.44-0.63) and weakly correlated with age (rs 0.07-0.14). Item response theory-based analysis revealed that a 20-item PROMIS physical function short form covered a wider range of physical function levels than the HAQ-DI or PF-10. The PROMIS physical function item bank demonstrated excellent measurement properties in RA. A content-driven 20-item short form may be a useful tool for assessing physical function in RA. © The Author 2015. Published by Oxford University Press on behalf of the British Society for Rheumatology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Rose, M; Bjorner, J B; Becker, J; Fries, J F; Ware, J E
2008-01-01
The Patient-Reported Outcomes Measurement Information System (PROMIS) was initiated to improve precision, reduce respondent burden, and enhance the comparability of health outcomes measures. We used item response theory (IRT) to construct and evaluate a preliminary item bank for physical function assuming four subdomains. Data from seven samples (N=17,726) using 136 items from nine questionnaires were evaluated. A generalized partial credit model was used to estimate item parameters, which were normed to a mean of 50 (SD=10) in the US population. Item bank properties were evaluated through Computerized Adaptive Test (CAT) simulations. IRT requirements were fulfilled by 70 items covering activities of daily living, lower extremity, and central body functions. The original item context partly affected parameter stability. Items on upper body function, and need for aid or devices did not fit the IRT model. In simulations, a 10-item CAT eliminated floor and decreased ceiling effects, achieving a small standard error (< 2.2) across scores from 20 to 50 (reliability >0.95 for a representative US sample). This precision was not achieved over a similar range by any comparable fixed length item sets. The methods of the PROMIS project are likely to substantially improve measures of physical function and to increase the efficiency of their administration using CAT.
[Validity and reliability of a scale to assess self-efficacy for physical activity in elderly].
Borges, Rossana Arruda; Rech, Cassiano Ricardo; Meurer, Simone Teresinha; Benedetti, Tânia Rosane Bertoldo
2015-04-01
This study aimed to analyze the confirmatory factor validity and reliability of a self-efficacy scale for physical activity in a sample of 118 elderly (78% women) from 60 to 90 years of age. Mplus 6.1 was used to evaluate the confirmatory factor analysis. Reliability was tested by internal consistency and temporal stability. The original scale consisted of five items with dichotomous answers (yes/no), independently for walking and moderate and vigorous physical activity. The analysis excluded the item related to confidence in performing physical activities when on vacation. Two constructs were identified, called "self-efficacy for walking" and "self-efficacy for moderate and vigorous physical activity", with a factor load ≥ 0.50. Internal consistency was adequate both for walking (> 0.70) and moderate and vigorous physical activity (> 0.80), and temporal stability was adequate for all the items. In conclusion, the self-efficacy scale for physical activity showed adequate validity, reliability, and internal consistency for evaluating this construct in elderly Brazilians.
Cabanas-Sánchez, Verónica; Tejero-González, Carlos M; Veiga, Oscar L
2012-01-01
One of the main problems of health in the first world is the increase of physical inactivity. In this respect, adolescence has been identified as a critic period with high decline of physical activity. Therefore, a relevant line of research is the understanding of this social phenomenon. The aim of this study was to design a scale to assess perceived barriers to physical activity on adolescents. A convenience sample of 160 Spanish adolescents (84 girls), between 12 and 18 years old, was recruited for this study. Firstly, there were designed 40 items whose pertinence was evaluated through content validation by experts. Later, the participants were divided in two randomized groups, and Exploratory Factor Analysis and Confirmatory Factor Analysis were performed to define a short scale of 12 items. Cronbach Alfa Coefficent was used to evaluate internal consistence of the instrument. The scale reports four dimensions: incompatibility barriers (2 items), self-concept barriers (4 items), amotivation barriers (4 items) and social barriers (2 items). The scale showed enough construct validity (χ2=60.78; d.f.=48; p=0.100; GFI=0.88; CFI=0.94; RMSEA=0.58) and high internal reliability (α=0.80). Moreover, the scale was able to explain 67% of the data variance. The Short Scale of Perceived Barriers to Physical Activity in Adolescents is a valid and reliable instrument.
Development of a Questionnaire Assessing School Physical Activity Environment
ERIC Educational Resources Information Center
Robertson-Wilson, Jennifer; Levesque, Lucie; Holden, Ronald R.
2007-01-01
This study was designed to develop the Questionnaire Assessing School Physical Activity Environment (Q--SPACE) based on student perceptions. Twenty-eight items rated on 4-point Likert scales were administered to 244 middle school students in 9 schools. Exploratory factor analysis was used to evaluate the underlying structure of the items and 2…
Rasch measurement: the Arm Activity measure (ArmA) passive function sub-scale.
Ashford, Stephen; Siegert, Richard J; Alexandrescu, Roxana
2016-01-01
To evaluate the conformity of the Arm Activity measure (ArmA) passive function sub-scale to the Rasch model. A consecutive cohort of patients (n = 92) undergoing rehabilitation, including upper limb rehabilitation and spasticity management, at two specialist rehabilitation units were included. Rasch analysis was used to examine scaling and conformity to the model. Responses were analysed using Rasch unidimensional measurement models (RUMM 2030). The following aspects were considered: overall model and individual item fit statistics and fit residuals, internal reliability, item response threshold ordering, item bias, local dependency and unidimensionality. ArmA contains both active and passive function sub-scales, but in this analysis only the passive function sub-scale was considered. Four of the seven items in the ArmA passive function sub-scale initially had disordered thresholds. These items were rescored to four response options, which resulted in ordered thresholds for all items. Once the items with disordered thresholds had been rescored, item bias was not identified for age, global disability level or diagnosis, but with a small difference in difficulty between males and females for one item of the scale. Local dependency was not observed and the unidimensionality of the sub-scale was supported and good fit to the Rasch model was identified. The person separation index (PSI) was 0.95 indicating that the scale is able to reliably differentiate at least two groups of patients. The ArmA passive function sub-scale was shown in this evaluation to conform to the Rasch model once disordered thresholds had been addressed. Using the logit scores produced by the Rasch model it was possible to convert this back to the original scale range. Implications for Rehabilitation The ArmA passive function sub-scale was shown, in this evaluation, to conform to the Rasch model once disordered thresholds had been addressed and therefore to be a clinically applicable and potentially useful hierarchical measure. Using Rasch logit scores it has be possible to convert back to the original ordinal scale range and provide an indication of real change to enable evaluation of clinical outcome of importance to patients and clinicians.
Sakakibara, Brodie M.; Miller, William C.; Backman, Catherine L.
2012-01-01
Objective To explore shortened response formats for use with the Activities-specific Balance Confidence scale and then: 1) evaluate the unidimensionality of the scale; 2) evaluate the item difficulty; 3) evaluate the scale for redundancy and content gaps; and 4) evaluate the item standard error of measurement (SEM) and internal consistency reliability among aging individuals (≥50 years) with a lower-limb amputation living in the community. Design Secondary analysis of cross-sectional survey and chart review data. Setting Out-patient amputee clinics, Ontario, Canada. Participants Four hundred forty eight community living adults, at least 50 years old (mean = 68 years), who have used a prosthesis for at least 6 months for a major unilateral lower limb amputation. Three hundred twenty five (72.5%) were men. Intervention N/a Main Outcome Measure(s) Activities-specific Balance Confidence Scale. Results A 5-option response format outperformed 4- and 6-option formats. Factor analyses confirmed a unidimensional scale. The distance between response options is not the same for all items on the scale, evident by the Partial Credit Model (PCM) having a better fit to the data than the Rating Scale Model. Two items, however, did not fit the PCM within statistical reason. Revising the wording of the two items may resolve the misfit, and improve the construct validity and lower the SEM. Overall, the difficulty of the scale’s items is appropriate for use with aging individuals with lower-limb amputation, and is most reliable (Cronbach ∝ = 0.94) for use with individuals with moderately low balance confidence levels. Conclusions The ABC-scale with a simplified 5-option response format is a valid and reliable measure of balance confidence for use with individuals aging with a lower limb amputation. PMID:21704978
Rose, Matthias; Bjorner, Jakob B; Gandek, Barbara; Bruce, Bonnie; Fries, James F; Ware, John E
2014-05-01
To document the development and psychometric evaluation of the Patient-Reported Outcomes Measurement Information System (PROMIS) Physical Function (PF) item bank and static instruments. The items were evaluated using qualitative and quantitative methods. A total of 16,065 adults answered item subsets (n>2,200/item) on the Internet, with oversampling of the chronically ill. Classical test and item response theory methods were used to evaluate 149 PROMIS PF items plus 10 Short Form-36 and 20 Health Assessment Questionnaire-Disability Index items. A graded response model was used to estimate item parameters, which were normed to a mean of 50 (standard deviation [SD]=10) in a US general population sample. The final bank consists of 124 PROMIS items covering upper, central, and lower extremity functions and instrumental activities of daily living. In simulations, a 10-item computerized adaptive test (CAT) eliminated floor and decreased ceiling effects, achieving higher measurement precision than any comparable length static tool across four SDs of the measurement range. Improved psychometric properties were transferred to the CAT's superior ability to identify differences between age and disease groups. The item bank provides a common metric and can improve the measurement of PF by facilitating the standardization of patient-reported outcome measures and implementation of CATs for more efficient PF assessments over a larger range. Copyright © 2014. Published by Elsevier Inc.
Neural correlates of economic value and valuation context: an event-related potential study.
Tyson-Carr, John; Kokmotou, Katerina; Soto, Vicente; Cook, Stephanie; Fallon, Nicholas; Giesbrecht, Timo; Stancak, Andrej
2018-05-01
The value of environmental cues and internal states is continuously evaluated by the human brain, and it is this subjective value that largely guides decision making. The present study aimed to investigate the initial value attribution process, specifically the spatiotemporal activation patterns associated with values and valuation context, using electroencephalographic event-related potentials (ERPs). Participants completed a stimulus rating task in which everyday household items marketed up to a price of £4 were evaluated with respect to their desirability or material properties. The subjective values of items were evaluated as willingness to pay (WTP) in a Becker-DeGroot-Marschak auction. On the basis of the individual's subjective WTP values, the stimuli were divided into high- and low-value items. Source dipole modeling was applied to estimate the cortical sources underlying ERP components modulated by subjective values (high vs. low WTP) and the evaluation condition (value-relevant vs. value-irrelevant judgments). Low-WTP items and value-relevant judgments both led to a more pronounced N2 visual evoked potential at right frontal scalp electrodes. Source activity in right anterior insula and left orbitofrontal cortex was larger for low vs. high WTP at ∼200 ms. At a similar latency, source activity in right anterior insula and right parahippocampal gyrus was larger for value-relevant vs. value-irrelevant judgments. A stronger response for low- than high-value items in anterior insula and orbitofrontal cortex appears to reflect aversion to low-valued item acquisition, which in an auction experiment would be perceived as a relative loss. This initial low-value bias occurs automatically irrespective of the valuation context. NEW & NOTEWORTHY We demonstrate the spatiotemporal characteristics of the brain valuation process using event-related potentials and willingness to pay as a measure of subjective value. The N2 component resolves values of objects with a bias toward low-value items. The value-related changes of the N2 component are part of an automatic valuation process.
USDA-ARS?s Scientific Manuscript database
This study aimed to evaluate the psychometric properties of four self-efficacy scales (i.e., self-efficacy for fruit (FSE), vegetable (VSE), and water (WSE) intakes, and physical activity (PASE)) and to investigate their differences in item functioning across sex, age, and body weight status groups ...
[Inter-rater concordance of the "Nursing Activities Score" in intensive care].
Valls-Matarín, Josefa; Salamero-Amorós, Maria; Roldán-Gil, Carmen; Quintana-Riera, Salvador
2015-01-01
To evaluate inter-rater concordance in the valuation of the "Nursing Activities Score". Cross-sectional descriptive study conducted from December 2012 until June 2013 in a general intensive care unit with twelve beds. Three evaluator nurses, simultaneously and independently, through the patient daily charts, scored the nursing workload using Nursing Activities Score scale in all patients admitted over 18 years old. Three hundreds and thirty-nine records were collected. The intra-class correlation coefficient (ICC) between evaluators was 0.92 (0.89-0.94). A perfect concordance was obtained in 39.1% of the items, with 52.2% having a high, and 8.7% having lower concordance, corresponding to two of the items with multiple scoring options. Significant differences between two of the evaluators (P=.049) were found. Although the inter-rater concordance was high, more accurate records are needed to reduce the variability of the items with multiple options and to allow more accuracy in the interpretation and measurement of the data regarding nursing workload. Copyright © 2015 Elsevier España, S.L.U. All rights reserved.
78 FR 42821 - Agency Information Collection (Brand Name or Equal) Activities Under OMB Review
Federal Register 2010, 2011, 2012, 2013, 2014
2013-07-17
... the brand name item. This evidence may be in the form of descriptive literature or material, such as... use the information to evaluate whether or not the item offered meets the specification requirements...
78 FR 21711 - Proposed Information Collection (Brand Name or Equal) Activity: Comment Request
Federal Register 2010, 2011, 2012, 2013, 2014
2013-04-11
... in fact, equal to the brand name item. This evidence may be in the form of descriptive literature or... will use the information to evaluate whether or not the item offered meets the specification...
75 FR 9489 - Proposed Information Collection (Brand Name or Equal) Activity: Comment Request
Federal Register 2010, 2011, 2012, 2013, 2014
2010-03-02
... in fact, equal to the brand name item. This evidence may be in the form of descriptive literature or... will use the information to evaluate whether or not the item offered meets the specification...
Development of the Computer-Adaptive Version of the Late-Life Function and Disability Instrument
Tian, Feng; Kopits, Ilona M.; Moed, Richard; Pardasaney, Poonam K.; Jette, Alan M.
2012-01-01
Background. Having psychometrically strong disability measures that minimize response burden is important in assessing of older adults. Methods. Using the original 48 items from the Late-Life Function and Disability Instrument and newly developed items, a 158-item Activity Limitation and a 62-item Participation Restriction item pool were developed. The item pools were administered to a convenience sample of 520 community-dwelling adults 60 years or older. Confirmatory factor analysis and item response theory were employed to identify content structure, calibrate items, and build the computer-adaptive testings (CATs). We evaluated real-data simulations of 10-item CAT subscales. We collected data from 102 older adults to validate the 10-item CATs against the Veteran’s Short Form-36 and assessed test–retest reliability in a subsample of 57 subjects. Results. Confirmatory factor analysis revealed a bifactor structure, and multi-dimensional item response theory was used to calibrate an overall Activity Limitation Scale (141 items) and an overall Participation Restriction Scale (55 items). Fit statistics were acceptable (Activity Limitation: comparative fit index = 0.95, Tucker Lewis Index = 0.95, root mean square error approximation = 0.03; Participation Restriction: comparative fit index = 0.95, Tucker Lewis Index = 0.95, root mean square error approximation = 0.05). Correlation of 10-item CATs with full item banks were substantial (Activity Limitation: r = .90; Participation Restriction: r = .95). Test–retest reliability estimates were high (Activity Limitation: r = .85; Participation Restriction r = .80). Strength and pattern of correlations with Veteran’s Short Form-36 subscales were as hypothesized. Each CAT, on average, took 3.56 minutes to administer. Conclusions. The Late-Life Function and Disability Instrument CATs demonstrated strong reliability, validity, accuracy, and precision. The Late-Life Function and Disability Instrument CAT can achieve psychometrically sound disability assessment in older persons while reducing respondent burden. Further research is needed to assess their ability to measure change in older adults. PMID:22546960
ERIC Educational Resources Information Center
Gutl, Christian; Lankmayr, Klaus; Weinhofer, Joachim; Hofler, Margit
2011-01-01
Research in automated creation of test items for assessment purposes became increasingly important during the recent years. Due to automatic question creation it is possible to support personalized and self-directed learning activities by preparing appropriate and individualized test items quite easily with relatively little effort or even fully…
Evaluation of item candidates for a diabetic retinopathy quality of life item bank.
Fenwick, Eva K; Pesudovs, Konrad; Khadka, Jyoti; Rees, Gwyn; Wong, Tien Y; Lamoureux, Ecosse L
2013-09-01
We are developing an item bank assessing the impact of diabetic retinopathy (DR) on quality of life (QoL) using a rigorous multi-staged process combining qualitative and quantitative methods. We describe here the first two qualitative phases: content development and item evaluation. After a comprehensive literature review, items were generated from four sources: (1) 34 previously validated patient-reported outcome measures; (2) five published qualitative articles; (3) eight focus groups and 18 semi-structured interviews with 57 DR patients; and (4) seven semi-structured interviews with diabetes or ophthalmic experts. Items were then evaluated during 3 stages, namely binning (grouping) and winnowing (reduction) based on key criteria and panel consensus; development of item stems and response options; and pre-testing of items via cognitive interviews with patients. The content development phase yielded 1,165 unique items across 7 QoL domains. After 3 sessions of binning and winnowing, items were reduced to a minimally representative set (n = 312) across 9 domains of QoL: visual symptoms; ocular surface symptoms; activity limitation; mobility; emotional; health concerns; social; convenience; and economic. After 8 cognitive interviews, 42 items were amended resulting in a final set of 314 items. We have employed a systematic approach to develop items for a DR-specific QoL item bank. The psychometric properties of the nine QoL subscales will be assessed using Rasch analysis. The resulting validated item bank will allow clinicians and researchers to better understand the QoL impact of DR and DR therapies from the patient's perspective.
Massey, Kevin; Barnes, Marilyn J D; Villines, Dana; Goldstein, Julie D; Pierson, Anna Lee Hisey; Scherer, Cheryl; Vander Laan, Betty; Summerfelt, Wm Thomas
2015-01-01
Chaplains are increasingly seen as key members of interdisciplinary palliative care teams, yet the specific interventions and hoped for outcomes of their work are poorly understood. This project served to develop a standard terminology inventory for the chaplaincy field, to be called the chaplaincy taxonomy. The research team used a mixed methods approach to generate, evaluate and validate items for the taxonomy. We conducted a literature review, retrospective chart review, focus groups, self-observation, experience sampling, concept mapping, and reliability testing. Chaplaincy activities focused primarily on palliative care in an intensive care unit setting in order to capture a broad cross section of chaplaincy activities. Literature and chart review resulted in 438 taxonomy items for testing. Chaplain focus groups generated an additional 100 items and removed 421 items as duplications. Self-Observation, Experience Sampling and Concept Mapping provided validity that the taxonomy items were actual activities that chaplains perform in their spiritual care. Inter-rater reliability for chaplains to identify taxonomy items from vignettes was 0.903. The 100 item chaplaincy taxonomy provides a strong foundation for a normative inventory of chaplaincy activities and outcomes. A deliberative process is proposed to further expand and refine the taxonomy to create a standard terminological inventory for the field of chaplaincy. A standard terminology could improve the ways inter-disciplinary palliative care teams communicate about chaplaincy activities and outcomes.
75 FR 26345 - Agency Information Collection (Brand Name or Equal) Activities Under OMB Review
Federal Register 2010, 2011, 2012, 2013, 2014
2010-05-11
... in fact, equal to the brand name item. This evidence may be in the form of descriptive literature or... will use the information to evaluate whether or not the item offered meets the specification...
78 FR 42593 - Agency Information Collection (Brand Name or Equal) Activities Under OMB Review
Federal Register 2010, 2011, 2012, 2013, 2014
2013-07-16
... the brand name item. This evidence may be in the form of descriptive literature or material, such as... information to evaluate whether or not the item offered meets the specification requirements. An agency may...
Kerner, Matthew S
2005-06-01
Using the theory of planned behavior as a conceptual framework, scales assessing Attitude to Leisure-time Physical Activity, Expectations of Others, Perceived Control, and Intention to Engage in Leisure-time Physical Activity were developed for use among middle-school students. The study sample included 349 boys and 400 girls, 10 to 14 years of age (M=11.9 yr., SD=.9). Unipolar and bipolar scales with seven response choices were developed, with each scale item phrased in a Likert-type format. Following revisions, 22 items were retained in the Attitude to Leisure-time Physical Activity Scale, 10 items in the Expectations of Others Scale, 3 items in the Perceived Control Scale, and 17 items in the Intention to Engage in Leisure-time Physical Activity Scale. Adequate internal consistency was indicated by standardized coefficients alpha ranging from .75 to .89. Current results must be extended to assess discriminant and predictive validities and to check various reliabilities with new samples, then evaluation of intervention techniques for promotion of positive attitudes about leisure-time physical activity, including perception of control and intentions to engage in leisure-time physical activity.
Koyama, Utako; Murayama, Nobuko
2011-08-01
This qualitative and quantitative research was conducted to develop an empowerment scale for health promotion volunteers (hereinafter referred to as the ESFHPV), key persons responsible for creating healthy communities. A focus group interview was conducted with four groups of health promotion volunteers from two cities in S Public Health Center of N Prefecture. A qualitative analysis was employed and a 32-item draft scale was created. The reliability and validity of this scale were then evaluated using quantitative methods. A questionnaire survey was conducted in 2009 for all 660 health promotion volunteers across the 2 cities. Of 401 respondents (response rate, 60.8%), 356 (53.9%) provided valid responses and were thus included in the analysis. 1) Internal consistency was confirmed by item-total correlation analysis (I-T analysis), assessment of Cronbach's coefficient alpha for all except one item and good-poor analysis (G-P analysis). Four items were excluded from the 32-item draft scale because of correlation coefficients more than 0.7, leaving 28 items for analysis. 2) Based on the results obtained from the factor analysis performed on the 28 provisional empowerment questions, 28 items were chosen for inclusion in the ESFHPV. These items consisted of four sub-scales, namely 'activity for healthy community' (10 items), 'intention for solving health problems of the community' (10 items), 'democratic organization activity' (four items) and 'growth as individual health promotion volunteers' (four items). 3) The Cronbach's coefficient alpha for the ESFHPV and its four sub-scales were 0.93, 0.88, 0.89, 0.84 and 0.79 respectively. The coefficients of I-T analysis were between 0.33 and 0.69. 4) The health promotion volunteers who attended other community activities demonstrated significantly high scores for the ESFHPV and the four sub-scales. Persons who were above 60 years, had a longer duration of activity as a health promotion volunteer and were housewives showed significantly high scores on the first sub-scale, 'growth as individual health promotion volunteers' To measure the empowerment levels of health promotion volunteers, a 28-item scale was developed and its reliability and validity were confirmed. Health promotion volunteers as well as the public health nurses who assist them can use this scale to assess the empowerment levels of other health promotion volunteers.
Levac, Danielle; Nawrotek, Joanna; Deschenes, Emilie; Giguere, Tia; Serafin, Julie; Bilodeau, Martin; Sveistrup, Heidi
2016-06-01
Virtual reality active video games are increasingly popular physical therapy interventions for children with cerebral palsy. However, physical therapists require educational resources to support decision making about game selection to match individual patient goals. Quantifying the movements elicited during virtual reality active video game play can inform individualized game selection in pediatric rehabilitation. The objectives of this study were to develop and evaluate the feasibility and reliability of the Movement Rating Instrument for Virtual Reality Game Play (MRI-VRGP). Item generation occurred through an iterative process of literature review and sample videotape viewing. The MRI-VRGP includes 25 items quantifying upper extremity, lower extremity, and total body movements. A total of 176 videotaped 90-second game play sessions involving 7 typically developing children and 4 children with cerebral palsy were rated by 3 raters trained in MRI-VRGP use. Children played 8 games on 2 virtual reality and active video game systems. Intraclass correlation coefficients (ICCs) determined intra-rater and interrater reliability. Excellent intrarater reliability was evidenced by ICCs of >0.75 for 17 of the 25 items across the 3 raters. Interrater reliability estimates were less precise. Excellent interrater reliability was achieved for far reach upper extremity movements (ICC=0.92 [for right and ICC=0.90 for left) and for squat (ICC=0.80) and jump items (ICC=0.99), with 9 items achieving ICCs of >0.70, 12 items achieving ICCs of between 0.40 and 0.70, and 4 items achieving poor reliability (close-reach upper extremity-ICC=0.14 for right and ICC=0.07 for left) and single-leg stance (ICC=0.55 for right and ICC=0.27 for left). Poor video quality, differing item interpretations between raters, and difficulty quantifying the high-speed movements involved in game play affected reliability. With item definition clarification and further psychometric property evaluation, the MRI-VRGP could inform the content of educational resources for therapists by ranking games according to frequency and type of elicited body movements.
Nawrotek, Joanna; Deschenes, Emilie; Giguere, Tia; Serafin, Julie; Bilodeau, Martin; Sveistrup, Heidi
2016-01-01
Background Virtual reality active video games are increasingly popular physical therapy interventions for children with cerebral palsy. However, physical therapists require educational resources to support decision making about game selection to match individual patient goals. Quantifying the movements elicited during virtual reality active video game play can inform individualized game selection in pediatric rehabilitation. Objective The objectives of this study were to develop and evaluate the feasibility and reliability of the Movement Rating Instrument for Virtual Reality Game Play (MRI-VRGP). Methods Item generation occurred through an iterative process of literature review and sample videotape viewing. The MRI-VRGP includes 25 items quantifying upper extremity, lower extremity, and total body movements. A total of 176 videotaped 90-second game play sessions involving 7 typically developing children and 4 children with cerebral palsy were rated by 3 raters trained in MRI-VRGP use. Children played 8 games on 2 virtual reality and active video game systems. Intraclass correlation coefficients (ICCs) determined intra-rater and interrater reliability. Results Excellent intrarater reliability was evidenced by ICCs of >0.75 for 17 of the 25 items across the 3 raters. Interrater reliability estimates were less precise. Excellent interrater reliability was achieved for far reach upper extremity movements (ICC=0.92 [for right and ICC=0.90 for left) and for squat (ICC=0.80) and jump items (ICC=0.99), with 9 items achieving ICCs of >0.70, 12 items achieving ICCs of between 0.40 and 0.70, and 4 items achieving poor reliability (close-reach upper extremity-ICC=0.14 for right and ICC=0.07 for left) and single-leg stance (ICC=0.55 for right and ICC=0.27 for left). Conclusions Poor video quality, differing item interpretations between raters, and difficulty quantifying the high-speed movements involved in game play affected reliability. With item definition clarification and further psychometric property evaluation, the MRI-VRGP could inform the content of educational resources for therapists by ranking games according to frequency and type of elicited body movements. PMID:27251029
Hill, Bridget; Pallant, Julie; Williams, Gavin; Olver, John; Ferris, Scott; Bialocerkowski, Andrea
2016-12-01
To evaluate the internal construct validity and dimensionality of a new patient-reported outcome measure for people with traumatic brachial plexus injury (BPI) based on the International Classification of Functioning, Disability and Health definition of activity. Cross-sectional study. Outpatient clinics. Adults (age range, 18-82y) with a traumatic BPI (N=106). There were 106 people with BPI who completed a 51-item 5-response questionnaire. Responses were analyzed in 4 phases (missing responses, item correlations, exploratory factor analysis, and Rasch analysis) to evaluate the properties of fit to the Rasch model, threshold response, local dependency, dimensionality, differential item functioning, and targeting. Not applicable, as this study addresses the development of an outcome measure. Six items were deleted for missing responses, and 10 were deleted for high interitem correlations >.81. The remaining 35 items, while demonstrating fit to the Rasch model, showed evidence of local dependency and multidimensionality. Items were divided into 3 subscales: dressing and grooming (8 items), arm and hand (17 items), and no hand (6 items). All 3 subscales demonstrated fit to the model with no local dependency, minimal disordered thresholds, no unidimensionality or differential item functioning for age, time postinjury, or self-selected dominance. Subscales were combined into 3 subtests and demonstrated fit to the model, no misfit, and unidimensionality, allowing calculation of a summary score. This preliminary analysis supports the internal construct validity of the Brachial Assessment Tool, a unidimensional targeted 4-response patient-reported outcome measure designed to solely assess activity after traumatic BPI regardless of level of injury, age at recruitment, premorbid limb dominance, and time postinjury. Further examination is required to determine test-retest reliability and responsiveness. Copyright © 2016 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Zamprogno, Helia; Hansen, Bernie D; Bondell, Howard D; Sumrell, Andrea Thomson; Simpson, Wendy; Robertson, Ian D; Brown, James; Pease, Anthony P; Roe, Simon C; Hardie, Elizabeth M; Wheeler, Simon J; Lascelles, B Duncan X
2010-12-01
To determine the items (question topics) for a subjective instrument to assess degenerative joint disease (DJD)-associated chronic pain in cats and determine the instrument design most appropriate for use by cat owners. 100 randomly selected client-owned cats from 6 months to 20 years old. Cats were evaluated to determine degree of radiographic DJD and signs of pain throughout the skeletal system. Two groups were identified: high DJD pain and low DJD pain. Owner-answered questions about activity and signs of pain were compared between the 2 groups to define items relating to chronic DJD pain. Interviews with 45 cat owners were performed to generate items. Fifty-three cat owners who had not been involved in any other part of the study, 19 veterinarians, and 2 statisticians assessed 6 preliminary instrument designs. 22 cats were selected for each group; 19 important items were identified, resulting in 12 potential items for the instrument; and 3 additional items were identified from owner interviews. Owners and veterinarians selected a 5-point descriptive instrument design over 11-point or visual analogue scale formats. Behaviors relating to activity were substantially different between healthy cats and cats with signs of DJD-associated pain. Fifteen items were identified as being potentially useful, and the preferred instrument design was identified. This information could be used to construct an owner-based questionnaire to assess feline DJD-associated pain. Once validated, such a questionnaire would assist in evaluating potential analgesic treatments for these patients.
Neural Signaling of Food Healthiness Associated with Emotion Processing.
Herwig, Uwe; Dhum, Matthias; Hittmeyer, Anna; Opialla, Sarah; Scherpiet, Sigrid; Keller, Carmen; Brühl, Annette B; Siegrist, Michael
2016-01-01
The ability to differentiate healthy from unhealthy foods is important in order to promote good health. Food, however, may have an emotional connotation, which could be inversely related to healthiness. The neurobiological background of differentiating healthy and unhealthy food and its relations to emotion processing are not yet well understood. We addressed the neural activations, particularly considering the single subject level, when one evaluates a food item to be of a higher, compared to a lower grade of healthiness with a particular view on emotion processing brain regions. Thirty-seven healthy subjects underwent functional magnetic resonance imaging while evaluating the healthiness of food presented as photographs with a subsequent rating on a visual analog scale. We compared individual evaluations of high and low healthiness of food items and also considered gender differences. We found increased activation when food was evaluated to be healthy in the left dorsolateral prefrontal cortex and precuneus in whole brain analyses. In ROI analyses, perceived and rated higher healthiness was associated with lower amygdala activity and higher ventral striatal and orbitofrontal cortex activity. Females exerted a higher activation in midbrain areas when rating food items as being healthy. Our results underline the close relationship between food and emotion processing, which makes sense considering evolutionary aspects. Actively evaluating and deciding whether food is healthy is accompanied by neural signaling associated with reward and self-relevance, which could promote salutary nutrition behavior. The involved brain regions may be amenable to mechanisms of emotion regulation in the context of psychotherapeutic regulation of food intake.
Desselle, Bonnie C; English, Robin; Hescock, George; Hauser, Andrea; Roy, Melissa; Yang, Tong; Chauvin, Sheila W
2012-12-01
Active engagement in the learning process is important to enhance learners' knowledge acquisition and retention and the development of their thinking skills. This study evaluated whether a 1-hour faculty development workshop increased the use of active teaching strategies and enhanced residents' active learning and thinking. Faculty teaching in a pediatrics residency participated in a 1-hour workshop (intervention) approximately 1 month before a scheduled lecture. Participants' responses to a preworkshop/postworkshop questionnaire targeted self-efficacy (confidence) for facilitating active learning and thinking and providing feedback about workshop quality. Trained observers assessed each lecture (3-month baseline phase and 3-month intervention phase) using an 8-item scale for use of active learning strategies and a 7-item scale for residents' engagement in active learning. Observers also assessed lecturer-resident interactions and the extent to which residents were asked to justify their answers. Responses to the workshop questionnaire (n = 32/34; 94%) demonstrated effectiveness and increased confidence. Faculty in the intervention phase demonstrated increased use of interactive teaching strategies for 6 items, with 5 reaching statistical significance (P ≤ .01). Residents' active learning behaviors in lectures were higher in the intervention arm for all 7 items, with 5 reaching statistical significance. Faculty in the intervention group demonstrated increased use of higher-order questioning (P = .02) and solicited justifications for answers (P = .01). A 1-hour faculty development program increased faculty use of active learning strategies and residents' engagement in active learning during resident core curriculum lectures.
Effects of levomilnacipran ER on fatigue symptoms associated with major depressive disorder
Fava, Maurizio; Gommoll, Carl; Chen, Changzheng; Greenberg, William M.; Ruth, Adam
2016-01-01
The aim of this study was to evaluate the effects of levomilnacipran extended-release (ER) on depression-related fatigue in adults with major depressive disorder. Post-hoc analyses of five phase III trials were carried out, with evaluation of fatigue symptoms based on score changes in four items: Montgomery–Åsberg Depression Rating Scale (MADRS) item 7 (lassitude), and 17-item Hamilton Depression Rating Scale (HAMD17) items 7 (work/activities), 8 (retardation), and 13 (somatic symptoms). Symptom remission was analyzed on the basis of score shifts from baseline to end of treatment: MADRS item 7 and HAMD17 item 7 (from ≥2 to ≤1); HAMD17 items 8 and 13 (from ≥1 to 0). The mean change in MADRS total score was analyzed in patients with low and high fatigue (MADRS item 7 baseline score <4 and ≥4, respectively). Patients receiving levomilnacipran ER had significantly greater mean improvements and symptom remission (no/minimal residual fatigue) on all fatigue-related items: lassitude (35 vs. 28%), work/activities (43 vs. 35%), retardation (46 vs. 39%), somatic symptoms (26 vs. 18%; all Ps<0.01 versus placebo). The mean change in MADRS total score was significantly greater with levomilnacipran ER versus placebo in both low (least squares mean difference=−2.8, P=0.0018) and high (least squares mean difference=−3.1, P<0.0001) fatigue subgroups. Levomilnacipran ER treatment was effective in reducing depression-related fatigue in adult patients with major depressive disorder and was associated with remission of fatigue symptoms. PMID:26584326
Qualitative Development and Content Validation of the PROMIS Pediatric Sleep Health Items.
Bevans, Katherine B; Meltzer, Lisa J; De La Motte, Anna; Kratchman, Amy; Viél, Dominique; Forrest, Christopher B
2018-04-25
To develop the Patient Reported Outcome Measurement Information System (PROMIS) Pediatric Sleep Health item pool and evaluate its content validity. Participants included 8 expert sleep clinician-researchers, 64 children ages 8-17 years, and 54 parents of children ages 5-17 years. We started with item concepts and expressions from the PROMIS Sleep Disturbance and Sleep Related Impairment adult measures. Additional pediatric sleep health concepts were generated by expert (n = 8), child (n = 28), and parent (n = 33) concept elicitation interviews and a systematic review of existing pediatric sleep health questionnaires. Content validity of the item pool was evaluated with item translatability review, readability analysis, and child (n = 36) and parent (n = 21) cognitive interviews. The final pediatric Sleep Health item pool includes 43 items that assess sleep disturbance (children's capacity to fall and stay asleep, sleep quality, dreams, and parasomnias) and sleep-related impairments (daytime sleepiness, low energy, difficulty waking up, and the impact of sleep and sleepiness on cognition, affect, behavior, and daily activities). Items are translatable and relevant and well understood by children ages 8-17 and parents of children ages 5-17. Rigorous qualitative procedures were used to develop and evaluate the content validity of the PROMIS Pediatric Sleep Health item pool. Once the item pool's psychometric properties are established, the scales will be useful for measuring children's subjective experiences of sleep.
Malec, James F; Kragness, Miriam; Evans, Randall W; Finlay, Karen L; Kent, Ann; Lezak, Muriel D
2003-01-01
To evaluate the internal consistency of the Mayo-Portland Adaptability Inventory (MPAI), further refine the instrument, and provide reference data based on a large, geographically diverse sample of persons with acquired brain injury (ABI). 386 persons, most with moderate to severe ABI. Outpatient, community-based, and residential rehabilitation facilities for persons with ABI located in the United States: West, Midwest, and Southeast. Rasch, item cluster, principal components, and traditional psychometric analyses for internal consistency of MPAI data and subscales. With rescoring of rating scales for 4 items, a 29-item version of the MPAI showed satisfactory internal consistency by Rasch (Person Reliability=.88; Item Reliability=.99) and traditional psychometric indicators (Cronbach's alpha=.89). Three rationally derived subscales for Ability, Activity, and Participation demonstrated psychometric properties that were equivalent to subscales derived empirically through item cluster and factor analyses. For the 3 subscales, Person Reliability ranged from.78 to.79; Item Reliability, from.98 to.99; and Cronbach's alpha, from.76 to.83. Subscales correlated moderately (Pearson r =.49-.65) with each other and strongly with the overall scale (Pearson r=.82-.86). Outcome after ABI is represented by the unitary dimension described by the MPAI. MPAI subscales further define regions of this dimension that may be useful for evaluation of clinical cases and program evaluation.
NASA Astrophysics Data System (ADS)
Bruno, B. C.; Hsia, M.; Wiener, C.
2012-12-01
Climate change is not just an atmospheric phenomenon. It has serious impacts on the ocean, such as sea level rise, ocean acidification, and coral bleaching. Ocean FEST (Families Exploring Science Together) aims to educate participants about how increasing carbon dioxide is affecting our oceans, and to inspire students to pursue ocean, earth and environmental science careers. Throughout the program, participants examine their everyday decisions and the impact of their choices on the planet's climate and oceans. Ocean FEST is a two-hour program that explores the ocean and relevant environmental topics through six hands-on science activities. Activities are designed so students can see how globally important issues (e.g., climate change and ocean acidification) have local effects (e.g., sea level rise, coastal erosion, coral bleaching). The program ends with a career component, drawing parallels between the program activities and the activities done by "real scientists" in their jobs. Over the past three years, we have conducted over 60 Ocean FEST events. Evaluations are conducted at selected events using electronic surveys, which students and parents complete immediately prior to (pre-survey) and following (post-survey) the program. Survey items were developed and cognitively tested in collaboration with professional evaluators from the American Institute of Research. The nine-item survey includes items on science content knowledge, personal responsibility, and career interest. For each survey item, participants are asked to indicate agreement (coded as 2.0), disagreement (1.0) or don't know (1.5). By comparing the pre- and post-survey results, we can evaluate program efficacy. For example, one survey item is: "I can do something every day to help fight global climate change." Student mean data moved from 1.78 pre-survey to 1.89 post-survey, which is a statistically significant gain at p<.000. Mean parent data for this same item moved from 1.90 pre-survey to 1.96 post-survey, which is again a statistically significant gain at p<.000. In summary, we have found positive statistically significant gains on all survey items for students, and on all but one survey item for parents. These results strongly indicate program efficacy. For more information, please visit our web site: oceanfest.soest.hawaii.edu
Development of a Multidimensional Functional Health Scale for Older Adults in China.
Mao, Fanzhen; Han, Yaofeng; Chen, Junze; Chen, Wei; Yuan, Manqiong; Alicia Hong, Y; Fang, Ya
2016-05-01
A first step to achieve successful aging is assessing functional wellbeing of older adults. This study reports the development of a culturally appropriate brief scale (the Multidimensional Functional Health Scale for Chinese Elderly, MFHSCE) to assess the functional health of Chinese elderly. Through systematic literature review, Delphi method, cultural adaptation, synthetic statistical item selection, Cronbach's alpha and confirmatory factor analysis, we conducted development of item pool, two rounds of item selection, and psychometric evaluation. Synthetic statistical item selection and psychometric evaluation was processed among 539 and 2032 older adults, separately. The MFHSCE consists of 30 items, covering activities of daily living, social relationships, physical health, mental health, cognitive function, and economic resources. The Cronbach's alpha was 0.92, and the comparative fit index was 0.917. The MFHSCE has good internal consistency and construct validity; it is also concise and easy to use in general practice, especially in communities in China.
Abasi, Mohammad Hadi; Eslami, Ahmad Ali; Rakhshani, Fatemeh; Shiri, Mansoor
2016-01-01
Self-regulation is one of the current psychological concepts that have been known as a determinant of leisure time physical activity. Due to cultural and social diversity in different societies and age groups, application of specific questionnaires is essential to perform investigations about physical activities. The aim of this study is development and evaluation of psychometric properties of a self-regulation questionnaire about leisure time physical activity in Iranian male adolescents. This cross-sectional study was conducted in 2013, and data of 603 male students from 12 high schools in Isfahan were collected. A comprehensive literature review and similar questionnaire review were conducted and 25 items were selected or developed to measure self-regulation. Comprehensibility of items was evaluated in a pilot study and an expert panel evaluated face and content validity. Exploratory factors analysis (EFA) was used for evaluation of construct validity and extraction of sub-constructs of self-regulation. Leisure time physical activity was assessed using International Physical Activity Questionnaire (IPAQ). The mean age of the participants was 16.3 years (SD =1.0) and the range was 15-19 years. Cronbach's α coefficient of the questionnaire in the pilot and main study was 0.84 and 0.90, respectively. EFA resulted in four sub-constructs including "enlistment of social support", "goal setting", "self-construction", and "self-monitoring", which explained 63.6% of the variance of self-regulation. Results of this investigation provide some support to the validity and reliability of the 16-item questionnaire of self-regulation abut leisure time physical activity in the target group.
Paap, Muirne C S; Lenferink, Lonneke I M; Herzog, Nadine; Kroeze, Karel A; van der Palen, Job
2016-06-27
Health-related quality of life (HRQoL) is widely used as an outcome measure in the evaluation of treatment interventions in patients with chronic obstructive pulmonary disease (COPD). In order to address challenges associated with existing fixed-length measures (e.g., too long to be used routinely, too short to ensure both content validity and reliability), a COPD-specific item bank (COPD-SIB) was developed. Items were selected based on literature review and interviews with Dutch COPD patients, with a strong focus on both content validity and item comprehension. The psychometric quality of the item bank was evaluated using Mokken Scale Analysis and parametric Item Response Theory, using data of 666 COPD patients. The final item bank contains 46 items that form a strong scale, tapping into eight important themes that were identified based on literature review and patient interviews: Coping with disease/symptoms, adaptability; Autonomy; Anxiety about the course/end-state of the disease, hopelessness; Positive psychological functioning; Situations triggering or enhancing breathing problems; Symptoms; Activity; Impact. The 46-item COPD-SIB has good psychometric properties and content validity. Items are available in Dutch and English. The COPD-SIB can be used as a stand-alone instrument, or to inform computerised adaptive testing.
Houston, Megan N; Hoch, Johanna M; Van Lunen, Bonnie L; Hoch, Matthew C
2015-11-01
The Disablement in the Physically Active scale (DPA) is a generic patient-reported outcome designed to evaluate constructs of disability in physically active populations. The purpose of this study was to analyze the DPA scale structure for summary components. Four hundred and fifty-six collegiate athletes completed a demographic form and the DPA. A principal component analysis (PCA) was conducted with oblique rotation. Factors with eigenvalues >1 that explained >5 % of the variance were retained. The PCA revealed a two-factor structure consistent with paradigms used to develop the original DPA. Items 1-12 loaded on Factors 1 and Items 13-16 loaded on Factor 2. Items 1-12 pertain to impairment, activity limitations, and participation restrictions. Items 13-16 address psychosocial and emotional well-being. Consideration of item content suggested Factor 1 concerned physical function, while Factor 2 concerned mental well-being. Thus, items clustered around Factor 1 and 2 were identified as physical (DPA-PSC) and mental (DPA-MSC) summary components, respectively. Together, the factors accounted for 65.1 % of the variance. The PCA revealed a two-factor structure for the DPA that resulted in DPA-PSC and DPA-MSC. Analyzing the DPA as separate constructs may provide distinct information that could help to prescribe treatment and rehabilitation strategies.
Nathan, Nicole; Wolfenden, Luke; Morgan, Philip J; Bell, Andrew C; Barker, Daniel; Wiggers, John
2013-06-13
Valid tools measuring characteristics of the school environment associated with the physical activity and dietary behaviours of children are needed to accurately evaluate the impact of initiatives to improve school environments. The aim of this study was to assess the validity of Principal self-report of primary school healthy eating and physical activity environments. Primary school Principals (n = 42) in New South Wales, Australia were invited to complete a telephone survey of the school environment; the School Environment Assessment Tool - SEAT. Equivalent observational data were collected by pre-service teachers located within the school. The SEAT, involved 65 items that assessed food availability via canteens, vending machines and fundraisers and the presence of physical activity facilities, equipment and organised physical activities. Kappa statistics were used to assess agreement between the two measures. Almost 70% of the survey demonstrated moderate to almost perfect agreement. Substantial agreement was found for 10 of 13 items assessing foods sold for fundraising, 3 of 6 items assessing physical activity facilities of the school, and both items assessing organised physical activities that occurred at recess and lunch and school sport. Limited agreement was found for items assessing foods sold through canteens and access to small screen recreation. The SEAT provides researchers and policy makers with a valid tool for assessing aspects of the school food and physical activity environment.
Is Your Neighborhood Designed to Support Physical Activity? A Brief Streetscape Audit Tool.
Sallis, James F; Cain, Kelli L; Conway, Terry L; Gavand, Kavita A; Millstein, Rachel A; Geremia, Carrie M; Frank, Lawrence D; Saelens, Brian E; Glanz, Karen; King, Abby C
2015-09-03
Macro level built environment factors (eg, street connectivity, walkability) are correlated with physical activity. Less studied but more modifiable microscale elements of the environment (eg, crosswalks) may also affect physical activity, but short audit measures of microscale elements are needed to promote wider use. This study evaluated the relation of a 15-item neighborhood environment audit tool with a full version of the tool to assess neighborhood design on physical activity in 4 age groups. From the 120-item Microscale Audit of Pedestrian Streetscapes (MAPS) measure of street design, sidewalks, and street crossings, we developed the 15-item version (MAPS-Mini) on the basis of associations with physical activity and attribute modifiability. As a sample of a likely walking route, MAPS-Mini was conducted on a 0.25-mile route from participant residences toward the nearest nonresidential destination for children (n = 758), adolescents (n = 897), younger adults (n = 1,655), and older adults (n = 367). Active transportation and leisure physical activity were measured with age-appropriate surveys, and accelerometers provided objective physical activity measures. Mixed-model regressions were conducted for each MAPS item and a total environment score, adjusted for demographics, participant clustering, and macrolevel walkability. Total scores of MAPS-Mini and the 120-item MAPS correlated at r = .85. Total microscale environment scores were significantly related to active transportation in all age groups. Items related to active transport in 3 age groups were presence of sidewalks, curb cuts, street lights, benches, and buffer between street and sidewalk. The total score was related to leisure physical activity and accelerometer measures only in children. The MAPS-Mini environment measure is short enough to be practical for use by community groups and planning agencies and is a valid substitute for the full version that is 8 times longer.
ERIC Educational Resources Information Center
Jaynes, William E.; And Others
Alumni attitudes concerning their college experience, present work, and present recreational activities were analyzed in relation to the time in college, using a semantic differential format. Four items were used for each type of rating, one evaluative, another activity-oriented, and two potency oriented. The evaluation dimension concerns the…
Khalil, Mohammed K; Kirkley, Debbie L; Kibble, Jonathan D
2013-01-01
This article describes the development of an interactive computer-based laboratory manual, created to facilitate the teaching and learning of medical histology. The overarching goal of developing the manual is to facilitate self-directed group interactivities that actively engage students during laboratory sessions. The design of the manual includes guided instruction for students to navigate virtual slides, exercises for students to monitor learning, and cases to provide clinical relevance. At the end of the laboratory activities, student groups can generate a laboratory report that may be used to provide formative feedback. The instructional value of the manual was evaluated by a questionnaire containing both closed-ended and open-ended items. Closed-ended items using a five-point Likert-scale assessed the format and navigation, instructional contents, group process, and learning process. Open-ended items assessed student's perception on the effectiveness of the manual in facilitating their learning. After implementation for two consecutive years, student evaluation of the manual was highly positive and indicated that it facilitated their learning by reinforcing and clarifying classroom sessions, improved their understanding, facilitated active and cooperative learning, and supported self-monitoring of their learning. Copyright © 2013 American Association of Anatomists.
Paz, Sylvia H; Spritzer, Karen L; Morales, Leo S; Hays, Ron D
2013-09-01
To evaluate the equivalence of the PROMIS(®) physical functioning item bank by language of administration (English versus Spanish). The PROMIS(®) wave 1 English-language physical functioning bank consists of 124 items, and 114 of these were translated into Spanish. Item frequencies, means and standard deviations, item-scale correlations, and internal consistency reliability were calculated. The IRT assumption of unidimensionality was evaluated by fitting a single-factor confirmatory factor analytic model. IRT threshold and discrimination parameters were estimated using Samejima's Graded Response Model. DIF by language of administration was evaluated. Item means ranged from 2.53 (SD = 1.36) to 4.62 (SD = 0.82). Coefficient alpha was 0.99, and item-rest correlations ranged from 0.41 to 0.89. A one-factor model fits the data well (CFI = 0.971, TLI = 0.970, and RMSEA = 0.052). The slope parameters ranged from 0.45 ("Are you able to run 10 miles?") to 4.50 ("Are you able to put on a shirt or blouse?"). The threshold parameters ranged from -1.92 ("How much do physical health problems now limit your usual physical activities (such as walking or climbing stairs)?") to 6.06 ("Are you able to run 10 miles?"). Fifty of the 114 items were flagged for DIF based on an R(2) of 0.02 or above criterion. The expected total score was higher for Spanish- than English-language respondents. English- and Spanish-speaking subjects with the same level of underlying physical function responded differently to 50 of 114 items. This study has important implications in the study of physical functioning among diverse populations.
Science Library of Test Items. Volume Four: Practical Testing Guide.
ERIC Educational Resources Information Center
New South Wales Dept. of Education, Sydney (Australia).
As one in a series of test items collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, the guide gives a wide range of questions and activities for the manipulation of scientific equipment to allow assessment of students' practical laboratory skills. Instructions are given to make norm-referenced or…
Petrillo, Jennifer; Bressler, Neil M; Lamoureux, Ecosse; Ferreira, Alberto; Cano, Stefan
2017-08-14
The NEI VFQ-25 has undergone psychometric evaluation in patients with varying ocular conditions and the general population. However, important limitations which may affect the interpretation of clinical trial results have been previously identified, such as concerns with reliability and validity. The purpose of this study was to evaluate the National Eye Institute Visual Functioning Questionnaire (NEI VFQ-25) and make recommendations for a revised scoring structure, with a view to improving its psychometric performance and interpretability. Rasch Measurement Theory analyses were conducted in two stages using pooled baseline NEI VFQ-25 data for 2487 participants with retinal diseases enrolled in six clinical trials. In stage 1, we examined: scale-to-sample targeting; thresholds for item response options; item fit statistics; stability; local dependence; and reliability. In stage 2, a post-hoc revision of the scoring structure (VFQ-28R) was created and psychometrically re-evaluated. In stage 1, we found that the NEI VFQ-25 was mis-targeted to the sample, and had disordered response thresholds (15/25 items) and mis-fitting items (8/25 items). However, items appeared to be stable (differential item functioning for three items), have minimal item dependency (one pair of items) and good reliability (person-separation index, 0.93). In stage 2, the modified Rasch-scored NEI VFQ-28-R was assessed. It comprised two broad domains: Activity Limitation (19 items) and Socio-Emotional Functioning (nine items). The NEI VFQ-28-R demonstrated improved performance with fewer disordered response thresholds (no items), less item misfit (three items) and improved population targeting (reduced ceiling effect) compared with the NEI VFQ-25. Compared with the original version, the proposed NEI VFQ-28-R, with Rasch-based scoring and a two-domain structure, appears to offer improved psychometric performance and interpretability of the vision-related quality of life scale for the population analysed.
Development of the NIH PROMIS ® Sexual Function and Satisfaction measures in patients with cancer.
Flynn, Kathryn E; Lin, Li; Cyranowski, Jill M; Reeve, Bryce B; Reese, Jennifer Barsky; Jeffery, Diana D; Smith, Ashley Wilder; Porter, Laura S; Dombeck, Carrie B; Bruner, Deborah Watkins; Keefe, Francis J; Weinfurt, Kevin P
2013-02-01
We describe the development and validation of the Patient-Reported Outcomes Measurement Information System(®) Sexual Function and Satisfaction (PROMIS(®) SexFS; National Institutes of Health) measures, version 1.0, for cancer populations. To develop a customizable self-report measure of sexual function and satisfaction as part of the U.S. National Institutes of Health PROMIS Network. Our multidisciplinary working group followed a comprehensive protocol for developing psychometrically robust patient-reported outcome measures including qualitative (scale development) and quantitative (psychometric evaluation) development. We performed an extensive literature review, conducted 16 focus groups with cancer patients and multiple discussions with clinicians, and evaluated candidate items in cognitive testing with patients. We administered items to 819 cancer patients. Items were calibrated using item-response theory and evaluated for reliability and validity. The PROMIS SexFS measures, version 1.0, include 81 items in 11 domains: Interest in Sexual Activity, Lubrication, Vaginal Discomfort, Erectile Function, Global Satisfaction with Sex Life, Orgasm, Anal Discomfort, Therapeutic Aids, Sexual Activities, Interfering Factors, and Screener Questions. In addition to content validity (patients indicate that items cover important aspects of their experiences) and face validity (patients indicate that items measure sexual function and satisfaction), the measure shows evidence for discriminant validity (domains discriminate between groups expected to be different) and convergent validity (strong correlations between scores on PROMIS and scores on conceptually similar older measures of sexual function), as well as favorable test-retest reliability among people not expected to change (interclass correlations from two administrations of the instrument, 1 month apart). The PROMIS SexFS offers researchers a reliable and valid set of tools to measure self-reported sexual function and satisfaction among diverse men and women. The measures are customizable; researchers can select the relevant domains and items comprising those domains for their study. © 2013 International Society for Sexual Medicine.
Development of the NIH PROMIS® Sexual Function and Satisfaction Measures in Patients with Cancer
Flynn, Kathryn E.; Lin, Li; Cyranowski, Jill M.; Reeve, Bryce B.; Reese, Jennifer Barsky; Jeffery, Diana D.; Smith, Ashley Wilder; Porter, Laura S.; Dombeck, Carrie B.; Bruner, Deborah Watkins; Keefe, Francis J.; Weinfurt, Kevin P.
2013-01-01
Introduction We describe the development and validation of the PROMIS Sexual Function and Satisfaction (PROMIS SexFS) measures version 1.0 for cancer populations. Aim To develop a customizable self-report measure of sexual function and satisfaction as part of the U.S. National Institutes of Health PROMIS® Network. Methods Our multidisciplinary working group followed a comprehensive protocol for developing psychometrically robust patient reported outcome (PRO) measures including qualitative (scale development) and quantitative (psychometric evaluation) development. We performed an extensive literature review, conducted 16 focus groups with cancer patients and multiple discussions with clinicians, and evaluated candidate items in cognitive testing with patients. We administered items to 819 cancer patients. Items were calibrated using item response theory and evaluated for reliability and validity. Main Outcome Measures The PROMIS Sexual Function and Satisfaction (PROMIS SexFS) measures version 1.0 include 79 items in 11 domains: interest in sexual activity, lubrication, vaginal discomfort, erectile function, global satisfaction with sex life, orgasm, anal discomfort, therapeutic aids, sexual activities, interfering factors, and screener questions. Results In addition to content validity (patients indicate that items cover important aspects of their experiences) and face validity (patients indicate that items measure sexual function and satisfaction), the measure shows evidence for discriminant validity (domains discriminate between groups expected to be different), convergent validity (strong correlations between scores on PROMIS and scores on conceptually-similar older measures of sexual function), as well as favorable test-retest reliability among people not expected to change (inter-class correlations from 2 administrations of the instrument, 1 month apart). Conclusions The PROMIS SexFS offers researchers a reliable and valid set of tools to measure self-reported sexual function and satisfaction among diverse men and women. The measures are customizable; researchers can select the relevant domains and items comprising those domains for their study. PMID:23387911
Légaré, France; Borduas, Francine; Freitas, Adriana; Jacques, André; Godin, Gaston; Luconi, Francesca; Grimshaw, Jeremy
2014-01-01
Decision-makers in organizations providing continuing professional development (CPD) have identified the need for routine assessment of its impact on practice. We sought to develop a theory-based instrument for evaluating the impact of CPD activities on health professionals' clinical behavioral intentions. Our multipronged study had four phases. 1) We systematically reviewed the literature for instruments that used socio-cognitive theories to assess healthcare professionals' clinically-oriented behavioral intentions and/or behaviors; we extracted items relating to the theoretical constructs of an integrated model of healthcare professionals' behaviors and removed duplicates. 2) A committee of researchers and CPD decision-makers selected a pool of items relevant to CPD. 3) An international group of experts (n = 70) reached consensus on the most relevant items using electronic Delphi surveys. 4) We created a preliminary instrument with the items found most relevant and assessed its factorial validity, internal consistency and reliability (weighted kappa) over a two-week period among 138 physicians attending a CPD activity. Out of 72 potentially relevant instruments, 47 were analyzed. Of the 1218 items extracted from these, 16% were discarded as improperly phrased and 70% discarded as duplicates. Mapping the remaining items onto the constructs of the integrated model of healthcare professionals' behaviors yielded a minimum of 18 and a maximum of 275 items per construct. The partnership committee retained 61 items covering all seven constructs. Two iterations of the Delphi process produced consensus on a provisional 40-item questionnaire. Exploratory factorial analysis following test-retest resulted in a 12-item questionnaire. Cronbach's coefficients for the constructs varied from 0.77 to 0.85. A 12-item theory-based instrument for assessing the impact of CPD activities on health professionals' clinical behavioral intentions showed adequate validity and reliability. Further studies could assess its responsiveness to behavior change following CPD activities and its capacity to predict health professionals' clinical performance.
Légaré, France; Borduas, Francine; Freitas, Adriana; Jacques, André; Godin, Gaston; Luconi, Francesca; Grimshaw, Jeremy
2014-01-01
Background Decision-makers in organizations providing continuing professional development (CPD) have identified the need for routine assessment of its impact on practice. We sought to develop a theory-based instrument for evaluating the impact of CPD activities on health professionals' clinical behavioral intentions. Methods and Findings Our multipronged study had four phases. 1) We systematically reviewed the literature for instruments that used socio-cognitive theories to assess healthcare professionals' clinically-oriented behavioral intentions and/or behaviors; we extracted items relating to the theoretical constructs of an integrated model of healthcare professionals' behaviors and removed duplicates. 2) A committee of researchers and CPD decision-makers selected a pool of items relevant to CPD. 3) An international group of experts (n = 70) reached consensus on the most relevant items using electronic Delphi surveys. 4) We created a preliminary instrument with the items found most relevant and assessed its factorial validity, internal consistency and reliability (weighted kappa) over a two-week period among 138 physicians attending a CPD activity. Out of 72 potentially relevant instruments, 47 were analyzed. Of the 1218 items extracted from these, 16% were discarded as improperly phrased and 70% discarded as duplicates. Mapping the remaining items onto the constructs of the integrated model of healthcare professionals' behaviors yielded a minimum of 18 and a maximum of 275 items per construct. The partnership committee retained 61 items covering all seven constructs. Two iterations of the Delphi process produced consensus on a provisional 40-item questionnaire. Exploratory factorial analysis following test-retest resulted in a 12-item questionnaire. Cronbach's coefficients for the constructs varied from 0.77 to 0.85. Conclusion A 12-item theory-based instrument for assessing the impact of CPD activities on health professionals' clinical behavioral intentions showed adequate validity and reliability. Further studies could assess its responsiveness to behavior change following CPD activities and its capacity to predict health professionals' clinical performance. PMID:24643173
Baldwin, Constance; Chandran, Latha; Gusic, Maryellen
2011-01-01
The academic community needs a sound framework for the promotion and advancement of educators. The Group on Educational Affairs of the Association of American Medical Colleges organized a consensus conference that affirmed the use of five domains for documenting the quantity and quality of scholarly engagement in educational activities: teaching, curriculum, advising/mentoring, educational leadership/administration, and learner assessment. In this article, we offer detailed guidelines to evaluate these five domains of educator performance and the essential elements of scholarly activity. The guidelines are adapted from our developmental educator portfolio template and educator portfolio analysis tool, previously published in MedEdPORTAL. A short tool for educator performance evaluation that summarizes items in the guidelines is proposed for discussion. Our goal in this article is to itemize criteria for systematic faculty evaluation that can be applied in any institutional setting to assist promotion decision makers in their task of evaluating medical school faculty.
Lee, Kunsei; Kim, Hyun Joo; You, Myoungsoon; Lee, Jin-Seok; Eun, Sang Jun; Jeong, Hyoseon; Ahn, Hye Mi; Lee, Jin Yong
2017-03-01
This study aims to identify which activities of a public community hospital (PHC) should be included in their definition of publicness and tries to achieve a consensus among experts using the Delphi method. We conduct 2 rounds of the Delphi process with 17 panel members using a developed draft of tentative activities for publicness including 5 main categories covering 27 items. The questions remain the same in both rounds and the applicability of each of the 27 items to publicness is measured on a 9-point scale. If the participants believe government funding is needed, we ask how much they think the government should support each item on a 0% to 100% scale. After conducting 2 rounds of the Delphi process, 22 out of the 27 items reached a consensus as activities defining the publicness of the PHCs. Among the 5 major categories, in category C, activities preventing market failure, all 10 items were considered activities of publicness. Nine of these were evaluated as items that should be compensated at 100% of total financial loss by the Korean government. Throughout results, we were able to define the activities of the PCH that encompassed its publicness and confirm that there are "good deficits" in the context of the PCHs. Thus, some PCH deficits are unavoidable and not wasted as these monies support a necessary role and function in providing public health. The Korean government should therefore consider taking actions such as exempting such "good deficits" or providing additional financial aid to reimburse the PHCs for "good deficits."
Development and content validation of a patient-reported endometriosis pain daily diary.
van Nooten, Floortje E; Cline, Jennifer; Elash, Celeste A; Paty, Jean; Reaney, Matthew
2018-01-04
Endometriosis is a common gynecological disorder that causes inflammation and pelvic pain. Endometriosis-related pain is best captured with patient-reported outcome (PRO) measures, however, assessment of endometriosis-related pain in clinical trials has been difficult in the absence of a reliable and valid PRO instrument. We describe the development of the Endometriosis Pain Daily Diary (EPDD), an electronic PRO developed as a survey instrument to assess endometriosis-related pain and its impact on patients' lives. The EPDD was initially developed on the basis of an existing Endometriosis Pain and Bleeding Diary, a targeted review of relevant literature, clinical expert interviews, and open-ended (concept elicitation) patient interviews in the United States (US) and Japan which captured patients' experience with endometriosis. Cognitive interviews of patients with endometriosis were conducted to evaluate patient comprehension of the EPDD items. A conceptual model of endometriosis was developed, and meetings with US and European regulatory authorities provided feedback for validating the EPDD in the context of clinical trials. Translatability assessments of the EPDD were conducted to confirm its appropriate interpretation and ease of completion across 17 languages. The iterative development progressed through three versions of the instrument. The EPDDv1 included 18 items relating to dysmenorrhea/pelvic pain, dyspareunia and sexual activity, bleeding, hot flashes, daily activities, and use of rescue medication. The EPDDv2 was a larger 43-item survey tested in cognitive interviews and subsequently revised to yield the current 11-item EPDDv3, consisting of five core items relating to dysmenorrhea, non-menstrual pelvic pain, and dyspareunia, and six extension items relating to sexual activity, daily activities, and use of rescue medication. The EPDD is a PRO for the evaluation of endometriosis-related pain and its associated impacts on patients' lives. The EPDD represents an important step in providing a PRO that is relevant to patients with endometriosis-related pain in the context of a clinical study setting (ie, fit-for-purpose), designed to evaluate pain associated with endometriosis, including regulatory agency support for its further exploration in clinical trials.
Abasi, Mohammad Hadi; Eslami, Ahmad Ali; Rakhshani, Fatemeh; Shiri, Mansoor
2016-01-01
Background: Self-regulation is one of the current psychological concepts that have been known as a determinant of leisure time physical activity. Due to cultural and social diversity in different societies and age groups, application of specific questionnaires is essential to perform investigations about physical activities. The aim of this study is development and evaluation of psychometric properties of a self-regulation questionnaire about leisure time physical activity in Iranian male adolescents. Materials and Methods: This cross-sectional study was conducted in 2013, and data of 603 male students from 12 high schools in Isfahan were collected. A comprehensive literature review and similar questionnaire review were conducted and 25 items were selected or developed to measure self-regulation. Comprehensibility of items was evaluated in a pilot study and an expert panel evaluated face and content validity. Exploratory factors analysis (EFA) was used for evaluation of construct validity and extraction of sub-constructs of self-regulation. Leisure time physical activity was assessed using International Physical Activity Questionnaire (IPAQ). Results: The mean age of the participants was 16.3 years (SD =1.0) and the range was 15-19 years. Cronbach's α coefficient of the questionnaire in the pilot and main study was 0.84 and 0.90, respectively. EFA resulted in four sub-constructs including “enlistment of social support”, “goal setting”, “self-construction”, and “self-monitoring”, which explained 63.6% of the variance of self-regulation. Conclusions: Results of this investigation provide some support to the validity and reliability of the 16-item questionnaire of self-regulation abut leisure time physical activity in the target group. PMID:27095993
Pasternak, Amy; Sideridis, Georgios; Fragala-Pinkham, Maria; Glanzman, Allan M; Montes, Jacqueline; Dunaway, Sally; Salazar, Rachel; Quigley, Janet; Pandya, Shree; O'Riley, Susan; Greenwood, Jonathan; Chiriboga, Claudia; Finkel, Richard; Tennekoon, Gihan; Martens, William B; McDermott, Michael P; Fournier, Heather Szelag; Madabusi, Lavanya; Harrington, Timothy; Cruz, Rosangel E; LaMarca, Nicole M; Videon, Nancy M; Vivo, Darryl C De; Darras, Basil T
2016-12-01
In this study we evaluated the suitability of a caregiver-reported functional measure, the Pediatric Evaluation of Disability Inventory-Computer Adaptive Test (PEDI-CAT), for children and young adults with spinal muscular atrophy (SMA). PEDI-CAT Mobility and Daily Activities domain item banks were administered to 58 caregivers of children and young adults with SMA. Rasch analysis was used to evaluate test properties across SMA types. Unidimensional content for each domain was confirmed. The PEDI-CAT was most informative for type III SMA, with ability levels distributed close to 0.0 logits in both domains. It was less informative for types I and II SMA, especially for mobility skills. Item and person abilities were not distributed evenly across all types. The PEDI-CAT may be used to measure functional performance in SMA, but additional items are needed to identify small changes in function and best represent the abilities of all types of SMA. Muscle Nerve 54: 1097-1107, 2016. © 2016 Wiley Periodicals, Inc.
Silva, Adriana Lucia Pastore E; Croci, Alberto Tesconi; Gobbi, Riccardo Gomes; Hinckel, Betina Bremer; Pecora, José Ricardo; Demange, Marco Kawamura
2017-01-01
Translation, cultural adaptation, and validation of the new version of the Knee Society Score - The 2011 KS Score - into Brazilian Portuguese and verification of its measurement properties, reproducibility, and validity. In 2012, the new version of the Knee Society Score was developed and validated. This scale comprises four separate subscales: (a) objective knee score (seven items: 100 points); (b) patient satisfaction score (five items: 40 points); (c) patient expectations score (three items: 15 points); and (d) functional activity score (19 items: 100 points). A total of 90 patients aged 55-85 years were evaluated in a clinical cross-sectional study. The pre-operative translated version was applied to patients with TKA referral, and the post-operative translated version was applied to patients who underwent TKA. Each patient answered the same questionnaire twice and was evaluated by two experts in orthopedic knee surgery. Evaluations were performed pre-operatively and three, six, or 12 months post-operatively. The reliability of the questionnaire was evaluated using the intraclass correlation coefficient (ICC) between the two applications. Internal consistency was evaluated using Cronbach's alpha. The ICC found no difference between the means of the pre-operative, three-month, and six-month post-operative evaluations between sub-scale items. The Brazilian Portuguese version of The 2011 KS Score is a valid and reliable instrument for objective and subjective evaluation of the functionality of Brazilian patients who undergo TKA and revision TKA.
Doğanay Erdoğan, Beyza; Elhan, Atilla Halİl; Kaskatı, Osman Tolga; Öztuna, Derya; Küçükdeveci, Ayşe Adile; Kutlay, Şehim; Tennant, Alan
2017-10-01
This study aimed to explore the potential of an inclusive and fully integrated measurement system for the Activities component of the International Classification of Functioning, Disability and Health (ICF), incorporating four classical scales, including the Health Assessment Questionnaire (HAQ), and a Computerized Adaptive Testing (CAT). Three hundred patients with rheumatoid arthritis (RA) answered relevant questions from four questionnaires. Rasch analysis was performed to create an item bank using this item pool. A further 100 RA patients were recruited for a CAT application. Both real and simulated CATs were applied and the agreement between these CAT-based scores and 'paper-pencil' scores was evaluated with intraclass correlation coefficient (ICC). Anchoring strategies were used to obtain a direct translation from the item bank common metric to the HAQ score. Mean age of 300 patients was 52.3 ± 11.7 years; disease duration was 11.3 ± 8.0 years; 74.7% were women. After testing for the assumptions of Rasch analysis, a 28-item Activities item bank was created. The agreement between CAT-based scores and paper-pencil scores were high (ICC = 0.993). Using those HAQ items in the item bank as anchoring items, another Rasch analysis was performed with HAQ-8 scores as separate items together with anchoring items. Finally a conversion table of the item bank common metric to the HAQ scores was created. A fully integrated and inclusive health assessment system, illustrating the Activities component of the ICF, was built to assess RA patients. Raw score to metric conversions and vice versa were available, giving access to the metric by a simple look-up table. © 2015 Asia Pacific League of Associations for Rheumatology and Wiley Publishing Asia Pty Ltd.
Psychometric properties of the communication Confidence Rating Scale for Aphasia (CCRSA): phase 1.
Cherney, Leora R; Babbitt, Edna M; Semik, Patrick; Heinemann, Allen W
2011-01-01
Confidence is a construct that has not been explored previously in aphasia research. We developed the Communication Confidence Rating Scale for Aphasia (CCRSA) to assess confidence in communicating in a variety of activities and evaluated its psychometric properties using rating scale (Rasch) analysis. The CCRSA was administered to 21 individuals with aphasia before and after participation in a computer-based language therapy study. Person reliability of the 8-item CCRSA was .77. The 5-category rating scale demonstrated monotonic increases in average measures from low to high ratings. However, one item ("I follow news, sports, stories on TV/movies") misfit the construct defined by the other items (mean square infit = 1.69, item-measure correlation = .41). Deleting this item improved reliability to .79; the 7 remaining items demonstrated excellent fit to the underlying construct, although there was a modest ceiling effect in this sample. Pre- to posttreatment changes on the 7-item CCRSA measure were statistically significant using a paired samples t test. Findings support the reliability and sensitivity of the CCRSA in assessing participants' self-report of communication confidence. Further evaluation of communication confidence is required with larger and more diverse samples.
Teresi, Jeanne A; Ocepek-Welikson, Katja; Cook, Karon F; Kleinman, Marjorie; Ramirez, Mildred; Reid, M Carrington; Siu, Albert
2016-01-01
Reducing the response burden of standardized pain measures is desirable, particularly for individuals who are frail or live with chronic illness, e.g., those suffering from cancer and those in palliative care. The Patient Reported Outcome Measurement Information System ® (PROMIS ® ) project addressed this issue with the provision of computerized adaptive tests (CAT) and short form measures that can be used clinically and in research. Although there has been substantial evaluation of PROMIS item banks, little is known about the performance of PROMIS short forms, particularly in ethnically diverse groups. Reviewed in this article are findings related to the differential item functioning (DIF) and reliability of the PROMIS pain interference short forms across diverse sociodemographic groups. DIF hypotheses were generated for the PROMIS short form pain interference items. Initial analyses tested item response theory (IRT) model assumptions of unidimensionality and local independence. Dimensionality was evaluated using factor analytic methods; local dependence (LD) was tested using IRT-based LD indices. Wald tests were used to examine group differences in IRT parameters, and to test DIF hypotheses. A second DIF-detection method used in sensitivity analyses was based on ordinal logistic regression with a latent IRT-derived conditioning variable. Magnitude and impact of DIF were investigated, and reliability and item and scale information statistics were estimated. The reliability of the short form item set was excellent. However, there were a few items with high local dependency, which affected the estimation of the final discrimination parameters. As a result, the item, "How much did pain interfere with enjoyment of social activities?" was excluded in the DIF analyses for all subgroup comparisons. No items were hypothesized to show DIF for race and ethnicity; however, five items showed DIF after adjustment for multiple comparisons in both primary and sensitivity analyses: ability to concentrate, enjoyment of recreational activities, tasks away from home, participation in social activities, and socializing with others. The magnitude of DIF was small and the impact negligible. Three items were consistently identified with DIF for education: enjoyment of life, ability to concentrate, and enjoyment of recreational activities. No item showed DIF above the magnitude threshold and the impact of DIF on the overall measure was minimal. No item showed gender DIF after correction for multiple comparisons in the primary analyses. Four items showed consistent age DIF: enjoyment of life, ability to concentrate, day to day activities, and enjoyment of recreational activities, none with primary magnitude values above threshold. Conditional on the pain state, Spanish speakers were hypothesized to report less pain interference on one item, enjoyment of life. The DIF findings confirmed the hypothesis; however, the magnitude was small. Using an arbitrary cutoff point of theta ( θ ) ≥ 1.0 to classify respondents with acute pain interference, the highest number of changes were for the education groups analyses. There were 231 respondents (4% of the total sample) who changed from the designation of no acute pain interference to acute interference after the DIF adjustment. There was no change in the designations for race/ethnic subgroups, and a small number of changes for respondents aged 65 to 84. Although significant DIF was observed after correction for multiple comparisons, all DIF was of low magnitude and impact. However, some individual-level impact was observed for low education groups. Reliability estimates were high. Thus, the PROMIS short form pain items examined in this ethnically diverse sample performed relatively well; although one item was problematic and removed from the analyses. It is concluded that the majority of the PROMIS pain interference short form items can be recommended for use among ethnically diverse groups, including those in palliative care and with cancer and chronic illness.
Teresi, Jeanne A.; Ocepek-Welikson, Katja; Cook, Karon F.; Kleinman, Marjorie; Ramirez, Mildred; Reid, M. Carrington; Siu, Albert
2017-01-01
Reducing the response burden of standardized pain measures is desirable, particularly for individuals who are frail or live with chronic illness, e.g., those suffering from cancer and those in palliative care. The Patient Reported Outcome Measurement Information System® (PROMIS®) project addressed this issue with the provision of computerized adaptive tests (CAT) and short form measures that can be used clinically and in research. Although there has been substantial evaluation of PROMIS item banks, little is known about the performance of PROMIS short forms, particularly in ethnically diverse groups. Reviewed in this article are findings related to the differential item functioning (DIF) and reliability of the PROMIS pain interference short forms across diverse sociodemographic groups. Methods DIF hypotheses were generated for the PROMIS short form pain interference items. Initial analyses tested item response theory (IRT) model assumptions of unidimensionality and local independence. Dimensionality was evaluated using factor analytic methods; local dependence (LD) was tested using IRT-based LD indices. Wald tests were used to examine group differences in IRT parameters, and to test DIF hypotheses. A second DIF-detection method used in sensitivity analyses was based on ordinal logistic regression with a latent IRT-derived conditioning variable. Magnitude and impact of DIF were investigated, and reliability and item and scale information statistics were estimated. Results The reliability of the short form item set was excellent. However, there were a few items with high local dependency, which affected the estimation of the final discrimination parameters. As a result, the item, “How much did pain interfere with enjoyment of social activities?” was excluded in the DIF analyses for all subgroup comparisons. No items were hypothesized to show DIF for race and ethnicity; however, five items showed DIF after adjustment for multiple comparisons in both primary and sensitivity analyses: ability to concentrate, enjoyment of recreational activities, tasks away from home, participation in social activities, and socializing with others. The magnitude of DIF was small and the impact negligible. Three items were consistently identified with DIF for education: enjoyment of life, ability to concentrate, and enjoyment of recreational activities. No item showed DIF above the magnitude threshold and the impact of DIF on the overall measure was minimal. No item showed gender DIF after correction for multiple comparisons in the primary analyses. Four items showed consistent age DIF: enjoyment of life, ability to concentrate, day to day activities, and enjoyment of recreational activities, none with primary magnitude values above threshold. Conditional on the pain state, Spanish speakers were hypothesized to report less pain interference on one item, enjoyment of life. The DIF findings confirmed the hypothesis; however, the magnitude was small. Using an arbitrary cutoff point of theta (θ) ≥ 1.0 to classify respondents with acute pain interference, the highest number of changes were for the education groups analyses. There were 231 respondents (4% of the total sample) who changed from the designation of no acute pain interference to acute interference after the DIF adjustment. There was no change in the designations for race/ethnic subgroups, and a small number of changes for respondents aged 65 to 84. Conclusions Although significant DIF was observed after correction for multiple comparisons, all DIF was of low magnitude and impact. However, some individual-level impact was observed for low education groups. Reliability estimates were high. Thus, the PROMIS short form pain items examined in this ethnically diverse sample performed relatively well; although one item was problematic and removed from the analyses. It is concluded that the majority of the PROMIS pain interference short form items can be recommended for use among ethnically diverse groups, including those in palliative care and with cancer and chronic illness. PMID:28983449
Does the hippocampus mediate objective binding or subjective remembering?
Slotnick, Scott D
2010-01-15
Human functional magnetic resonance imaging (fMRI) evidence suggests the hippocampus is associated with context memory to a greater degree than item memory (where only context memory requires item-in-context binding). A separate line of fMRI research suggests the hippocampus is associated with "remember" responses to a greater degree than "know" or familiarity based responses (where only remembering reflects the subjective experience of specific detail). Previous studies, however, have confounded context memory with remembering and item memory with knowing. The present fMRI study independently tested the binding hypothesis and remembering hypothesis of hippocampal function by evaluating activity within hippocampal regions-of-interest (ROIs). At encoding, participants were presented with colored and gray abstract shapes and instructed to remember each shape and whether it was colored or gray. At retrieval, old and new shapes were presented in gray and participants classified each shape as "old and previously colored", "old and previously gray", or "new", followed by a "remember" or "know" response. In 3 of 11 hippocampal ROIs, activity was significantly greater for context memory than item memory, the context memory-item memory by remember-know interaction was significant, and activity was significantly greater for context memory-knowing than item memory-remembering. This pattern of activity only supports the binding hypothesis. The analogous pattern of activity that would have supported the remembering hypothesis was never observed in the hippocampus. However, a targeted analysis revealed remembering specific activity in the left inferior parietal cortex. The present results suggest parietal cortex may be associated with subjective remembering while the hippocampus mediates binding.
Wang, Jing-Jing; Chen, Tzu-An; Baranowski, Tom; Lau, Patrick W C
2017-09-16
This study aimed to evaluate the psychometric properties of four self-efficacy scales (i.e., self-efficacy for fruit (FSE), vegetable (VSE), and water (WSE) intakes, and physical activity (PASE)) and to investigate their differences in item functioning across sex, age, and body weight status groups using item response modeling (IRM) and differential item functioning (DIF). Four self-efficacy scales were administrated to 763 Hong Kong Chinese children (55.2% boys) aged 8-13 years. Classical test theory (CTT) was used to examine the reliability and factorial validity of scales. IRM was conducted and DIF analyses were performed to assess the characteristics of item parameter estimates on the basis of children's sex, age and body weight status. All self-efficacy scales demonstrated adequate to excellent internal consistency reliability (Cronbach's α: 0.79-0.91). One FSE misfit item and one PASE misfit item were detected. Small DIF were found for all the scale items across children's age groups. Items with medium to large DIF were detected in different sex and body weight status groups, which will require modification. A Wright map revealed that items covered the range of the distribution of participants' self-efficacy for each scale except VSE. Several self-efficacy scales' items functioned differently by children's sex and body weight status. Additional research is required to modify the four self-efficacy scales to minimize these moderating influences for application.
Rasch analysis of the Patient Rated Elbow Evaluation questionnaire.
Vincent, Joshua I; MacDermid, Joy C; King, Graham J W; Grewal, Ruby
2015-06-20
The Patient Rated Elbow Evaluation (PREE) was developed as an elbow joint specific measure of pain and disability and validated with classical psychometric methods. More recently, Rasch analysis has contributed new methods for analyzing the clinical measurement properties of self-report outcome measures. The objective of the study was to determine aspects of validity of the PREE using the Rasch model to assess the overall fit of the PREE data, the response scaling, individual item fit, differential item functioning (DIF), local dependency, unidimensionality and person separation index (PSI). A convenience sample of 236 patients (Age range 21-79 years; M: F- 97:139) with elbow disorders were recruited from the Roth│McFarlane Hand and Upper Limb Centre, London, Ontario, Canada. The baseline scores of the PREE were used. Rasch analysis was conducted using RUMM 2030 software on the 3 sub scales of the PREE separately. The 3 sub scales showed misfit initially with disordered thresholds on17 out of 20 items), uniform DIF was observed for two items ("Carrying a 10lbs object" from specific activities subscale for age group; and "household work" from the usual activities subscale for gender); multidimensionality and local dependency. The Pain subscale satisfied Rasch expectations when item 2 "Pain - At rest" was split for age group, while the usual activities subscale readily stood up to Rasch requirements when the item 2 "household work" was split for gender. The specific activities subscale demonstrated fit to the Rasch model when sub test analysis accounted for local dependency. All three subscales of the PREE were well targeted and had high reliability (PSI >0.80). The three subscales of the PREE appear to be robust when tested against the Rasch model when subject to a few alterations. The value of changing the 0-10 format is questionable given its widespread use; further Rasch-based analysis of whether these findings are stable in other samples is warranted.
The ADL taxonomy for persons with mental disorders - adaptation and evaluation.
Holmqvist, Kajsa Lidström; Holmefur, Marie
2018-05-03
There is a lack of occupation-focused instruments to assess Activities of Daily Living (ADL) that are intended for persons with mental disorders. The ADL Taxonomy is an instrument that is widely-used within clinical practice for persons with physical impairment. The aim of this study was to adapt the ADL Taxonomy for persons with mental disorders and evaluate its validity. An expert group of Occupational Therapists (OTs) from psychiatric care adapted the ADL Taxonomy to fit the client group, including creating three new items. OTs in psychiatric care collected client data and evaluated the instrument for usability. Rasch analysis was used to evaluate the contruct validity of 16 activities separately. The OTs collected 123 assessments from clients with various mental disorders. Ten activities had excellent, and four had acceptable, psychometric properties with regard to item and person fit and unidimensionality. The activity managing the day/time gave complex results and would benefit from further development. The OTs found the test version intelligible, relevant and easy to use. The ADL Taxonomy for persons with mental disorders has 16 activities with three to six actions each, and is now ready for clinical use.
Tools of the trade: Improving nurses' ability to access and evaluate research.
Sleutel, Martha R; Bullion, John W; Sullivan, Ronnie
2018-03-01
To evaluate the effect of a manager-required RN competency on staff nurses' perceived knowledge, ability and frequency of information-seeking activities. Basing clinical practice on research and standards of care is essential to delivering appropriate care with optimal outcomes. Nurses' information-seeking abilities are critical for acquiring evidence-based answers to aid clinical decision-making, yet nurses under-utilize library resources and report barriers. A unit manager sought to test the effect of an innovative competency for acquiring and appraising evidence for practice. This longitudinal descriptive study evaluated 28 nurses before and after a 1-hr class, as well as 5 months later. The class covered library information services and the basics of critiquing research articles. Nurses had statistically significant improvements in four of five items measuring knowledge/ability and four of five items measuring frequency of information-seeking activities. At 5 months, most knowledge/ability items increased. There was no effect of nurse characteristics on outcomes. A required competency improved nurses' knowledge, ability and frequency of acquiring and appraising evidence with a single 1-hr class and a hands-on practice activity. Unit managers can have great impact on nurses' use of evidence for practice. © 2018 John Wiley & Sons Ltd.
Osborne, Richard H; Elsworth, Gerald R; Whitfield, Kathryn
2007-05-01
This paper describes the development and validation of the Health Education Impact Questionnaire (heiQ). The aim was to develop a user-friendly, relevant, and psychometrically sound instrument for the comprehensive evaluation of patient education programs, which can be applied across a broad range of chronic conditions. Item development for the heiQ was guided by a Program Logic Model, Concept Mapping, interviews with stakeholders and psychometric analyses. Construction (N=591) and confirmatory (N=598) samples were drawn from consumers of patient education programs and hospital outpatients. The properties of the heiQ were investigated using item response theory and structural equation modeling. Over 90 candidate items were generated, with 42 items selected for inclusion in the final scale. Eight independent dimensions were derived: Positive and Active Engagement in Life (five items, Cronbach's alpha (alpha)=0.86); Health Directed Behavior (four items, alpha=0.80); Skill and Technique Acquisition (five items, alpha=0.81); Constructive Attitudes and Approaches (five items, alpha=0.81); Self-Monitoring and Insight (seven items, alpha=0.70); Health Service Navigation (five items, alpha=0.82); Social Integration and Support (five items, alpha=0.86); and Emotional Wellbeing (six items, alpha=0.89). The heiQ has high construct validity and is a reliable measure of a broad range of patient education program benefits. The heiQ will provide valuable information to clinicians, researchers, policymakers and other stakeholders about the value of patient education programs in chronic disease management.
[Evaluation of the factorial and metric equivalence of the Sexual Assertiveness Scale (SAS) by sex].
Sierra, Juan Carlos; Santos-Iglesias, Pablo; Vallejo-Medina, Pablo
2012-05-01
Sexual assertiveness refers to the ability to initiate sexual activity, refuse unwanted sexual activity, and use contraceptive methods to avoid sexually transmitted diseases, developing healthy sexual behaviors. The Sexual Assertiveness Scale (SAS) assesses these three dimensions. The purpose of this study is to evaluate, using structural equation modeling and differential item functioning, the equivalence of the scale between men and women. Standard scores are also provided. A total of 4,034 participants from 21 Spanish provinces took part in the study. Quota sampling method was used. Results indicate a strict equivalent dimensionality of the Sexual Assertiveness Scale across sexes. One item was flagged by differential item functioning, although it does not affect the scale. Therefore, there is no significant bias in the scale when comparing across sexes. Standard scores show similar Initiation assertiveness scores for men and women, and higher scores on Refusal and Sexually Transmitted Disease Prevention for women. This scale can be used on men and women with sufficient psychometric guarantees.
Mental health in primary care: an evaluation using the Item Response Theory.
Rocha, Hugo André da; Santos, Alaneir de Fátima Dos; Reis, Ilka Afonso; Santos, Marcos Antônio da Cunha; Cherchiglia, Mariângela Leal
2018-01-01
OBJECTIVE To determine the items of the Brazilian National Program for Improving Access and Quality of Primary Care that better evaluate the capacity to provide mental health care. METHODS This is a cross-sectional study carried out using the Graded Response Model of the Item Response Theory using secondary data from the second cycle of the National Program for Improving Access and Quality of Primary Care, which evaluates 30,523 primary care teams in the period from 2013 to 2014 in Brazil. The internal consistency, correlation between items, and correlation between items and the total score were tested using the Cronbach's alpha, Spearman's correlation, and point biserial coefficients, respectively. The assumptions of unidimensionality and local independence of the items were tested. Word clouds were used as one way to present the results. RESULTS The items with the greatest ability to discriminate were scheduling of the agenda according to risk stratification, keeping of records of the most serious cases of users in psychological distress, and provision of group care. The items that required a higher level of mental health care in the parameter of location were the provision of any type of group care and the provision of educational and mental health promotion activities. Total Cronbach's alpha coefficient was 0.87. The items that obtained the highest correlation with total score were the recording of the most serious cases of users in psychological distress and scheduling of the agenda according to risk stratification. The final scores obtained oscillated between -2.07 (minimum) and 1.95 (maximum). CONCLUSIONS There are important aspects in the discrimination of the capacity to provide mental health care by primary health care teams: risk stratification for care management, follow-up of the most serious cases, group care, and preventive and health promotion actions.
Darzins, Susan W; Imms, Christine; Di Stefano, Marilyn
2017-05-01
To explore the operationalization of activity and participation-related measurement constructs through comparison of item phrasing, item response categories and scoring (scale properties) for two separate instruments targeting activities of daily living. Personal Care Participation Assessment and Resource Tool (PC-PART) item content was linked to ICF categories using established linking rules. Previously reported ICF-linked FIM content categories and ICF-linked PC-PART content categories were compared to identify common ICF categories between the instruments. Scale properties of both instruments were compared using a patient scenario to explore the instruments' separate measurement constructs. The PC-PART and FIM shared 15 of the 53 level two ICF-linked categories identified across both instruments. Examination of the instruments' scale properties for items with overlapping ICF content, and exploration through a patient scenario, provided supportive evidence that the instruments measure different constructs. While the PC-PART and FIM share common ICF-linked content, they measure separate constructs. Measurement construct was influenced by the instruments' scale properties. The FIM was observed to measure activity limitations and the PC-PART measured participation restrictions. Scrutiny of instruments' scale properties in addition to item content is critical in the operationalization of activity and participation-related measurement constructs. Implications for Rehabilitation When selecting outcome measures for use in rehabilitation it is necessary to examine both the content of the instruments' items and item phrasing, response categories and scoring, to clarify the construct being measured. Measurement of activity limitations as well as participation restrictions in activities of daily living required for community life provides a more comprehensive measurement of rehabilitation outcomes than measurement of either construct alone. To measure the effects of interventions used in rehabilitation, it is necessary to select measures with relevant content and scale properties that enable evaluation of change in the constructs that are expected to change, as a result of the rehabilitation intervention.
Inter-rater reliability of the Sødring Motor Evaluation of Stroke patients (SMES).
Halsaa, K E; Sødring, K M; Bjelland, E; Finsrud, K; Bautz-Holter, E
1999-12-01
The Sødring Motor Evaluation of Stroke patients is an instrument for physiotherapists to evaluate motor function and activities in stroke patients. The rating reflects quality as well as quantity of the patient's unassisted performance within three domains: leg, arm and gross function. The inter-rater reliability of the method was studied in a sample of 30 patients admitted to a stroke rehabilitation unit. Three therapists were involved in the study; two therapists assessed the same patient on two consecutive days in a balanced design. Cohen's weighted kappa and McNemar's test of symmetry were used as measures of item reliability, and the intraclass correlation coefficient was used to express the reliability of the sumscores. For 24 out of 32 items the weighted kappa statistic was excellent (0.75-0.98), while 7 items had a kappa statistic within the range 0.53-0.74 (fair to good). The reliability of one item was poor (0.13). The intraclass correlation coefficient for the three sumscores was 0.97, 0.91 and 0.97. We conclude that the Sødring Motor Evaluation of Stroke patients is a reliable measure of motor function in stroke patients undergoing rehabilitation.
Hart, Dennis L; Werneke, Mark W; George, Steven Z; Matheson, James W; Wang, Ying-Chih; Cook, Karon F; Mioduski, Jerome E; Choi, Seung W
2009-08-01
Screening people for elevated levels of fear-avoidance beliefs is uncommon, but elevated levels of fear could worsen outcomes. Developing short screening tools might reduce the data collection burden and facilitate screening, which could prompt further testing or management strategy modifications to improve outcomes. The purpose of this study was to develop efficient yet accurate screening methods for identifying elevated levels of fear-avoidance beliefs regarding work or physical activities in people receiving outpatient rehabilitation. A secondary analysis of data collected prospectively from people with a variety of common neuromusculoskeletal diagnoses was conducted. Intake Fear-Avoidance Beliefs Questionnaire (FABQ) data were collected from 17,804 people who had common neuromusculoskeletal conditions and were receiving outpatient rehabilitation in 121 clinics in 26 states (in the United States). Item response theory (IRT) methods were used to analyze the FABQ data, with particular emphasis on differential item functioning among clinically logical groups of subjects, and to identify screening items. The accuracy of screening items for identifying subjects with elevated levels of fear was assessed with receiver operating characteristic analyses. Three items for fear of physical activities and 10 items for fear of work activities represented unidimensional scales with adequate IRT model fit. Differential item functioning was negligible for variables known to affect functional status outcomes: sex, age, symptom acuity, surgical history, pain intensity, condition severity, and impairment. Items that provided maximum information at the median for the FABQ scales were selected as screening items to dichotomize subjects by high versus low levels of fear. The accuracy of the screening items was supported for both scales. This study represents a retrospective analysis, which should be replicated using prospective designs. Future prospective studies should assess the reliability and validity of using one FABQ item to screen people for high levels of fear-avoidance beliefs. The lack of differential item functioning in the FABQ scales in the sample tested in this study suggested that FABQ screening could be useful in routine clinical practice and allowed the development of single-item screening for fear-avoidance beliefs that accurately identified subjects with elevated levels of fear. Because screening was accurate and efficient, single IRT-based FABQ screening items are recommended to facilitate improved evaluation and care of heterogeneous populations of people receiving outpatient rehabilitation.
Indicators of Family Care for Development for Use in Multicountry Surveys
Kariger, Patricia; Engle, Patrice; Britto, Pia M. Rebello; Sywulka, Sara M.; Menon, Purnima
2012-01-01
Indicators of family care for development are essential for ascertaining whether families are providing their children with an environment that leads to positive developmental outcomes. This project aimed to develop indicators from a set of items, measuring family care practices and resources important for caregiving, for use in epidemiologic surveys in developing countries. A mixed method (quantitative and qualitative) design was used for item selection and evaluation. Qualitative and quantitative analyses were conducted to examine the validity of candidate items in several country samples. Qualitative methods included the use of global expert panels to identify and evaluate the performance of each candidate item as well as in-country focus groups to test the content validity of the items. The quantitative methods included analyses of item-response distributions, using bivariate techniques. The selected items measured two family care practices (support for learning/stimulating environment and limit-setting techniques) and caregiving resources (adequacy of the alternate caregiver when the mother worked). Six play-activity items, indicative of support for learning/stimulating environment, were included in the core module of UNICEF's Multiple Cluster Indictor Survey 3. The other items were included in optional modules. This project provided, for the first time, a globally-relevant set of items for assessing family care practices and resources in epidemiological surveys. These items have multiple uses, including national monitoring and cross-country comparisons of the status of family care for development used globally. The obtained information will reinforce attention to efforts to improve the support for development of children. PMID:23304914
Two-Year Follow-up of the Collision Auto Repair Safety Study (CARSS)
Bejan, Anca; Parker, David L.; Brosseau, Lisa M.; Xi, Min; Skan, Maryellen
2015-01-01
This paper presents an evaluation of the sustainability of health and safety improvements in small auto collision shops 1 year after the implementation of a year-long targeted intervention. During the first year (active phase), owners received quarterly phone calls, written reminders, safety newsletters, and access to online services and in-person assistance with creating safety programs and respirator fit testing. During the second year (passive phase), owners received up to three postcard reminders regarding the availability of free health and safety resources. Forty-five shops received an evaluation at baseline and at the end of the first year (Y1). Of these, 33 were evaluated at the end of the second year (Y2), using the same 92-item assessment tool. At Y1, investigators found that between 70 and 81% of the evaluated items were adequate in each business (mean = 73% items, SD = 11%). At Y2, between 63 and 89% of items were deemed adequate (mean = 73% items, SD = 9.5%). Three safety areas demonstrated statistically significant (P < 0.05) changes: compressed gasses (8% improvement), personal protective equipment (7% improvement), and respiratory protection (6% decline). The number of postcard reminders sent to each business did not affect the degree to which shops maintained safety improvements made during the first year of the intervention. However, businesses that received more postcards were more likely to request assistance services than those receiving fewer. PMID:25539646
Evaluation of efficacy in a liver pretransplantation orientation group.
Guimaro, M Simon; Lacerda, S Silva; Bacoccina, T D; Karam, C Hegedus; de Sá, J Roberto; Ferraz-Neto, B H; Andreoli, P Bruno de Araújo
2007-10-01
The medical context recognizes the efficiency of working with groups of patients. Group interventions can intensify the understanding, ability, and notion of recognizing the patient's own condition, increasing the responsibility for him- or herself. This survey sought to evaluate the efficacy of an interdisciplinary orientation group for hepatic transplantation preoperatively. The opinions of all patients on a waiting list for liver transplantation and their accompanying persons were evaluated from August to December 2005 through a questionnaire with 17 relevant items concerning the transplantation process. The group efficacy was evaluated according to the percentage of correct answers from the subjects before and after attending the group. The results showed a 59% increase in correct answers for the evaluated items after group attendance. The items which showed significant improvement were: what should I do after being called for transplantation; average time of admission to hospital and ICU; use of immunosuppressive drugs; clinical conditions for transplantation; frequency of appointments with the surgeon within the first month; physical activities; diet; blood transfusion; and forgetting medication. A ceiling effect was observed upon reevaluation of the previous conditions for transplantation item. The percentage of health improvement after attending the group demonstrated an impact of the interdisciplinary orientation intervention on the instruction of patients and their accompanying persons, thus representing an important step in their training process.
NASA Astrophysics Data System (ADS)
Laursen, S. L.; Weston, T. J.; Thiry, H.
2012-12-01
URSSA is the Undergraduate Research Student Self-Assessment, an online survey instrument for programs and departments to use in assessing the student outcomes of undergraduate research (UR). URSSA focuses on what students learn from their UR experience, rather than whether they liked it. The online questionnaire includes both multiple-choice and open-ended items that focus on students' gains from undergraduate research. These gains include skills, knowledge, deeper understanding of the intellectual and practical work of science, growth in confidence, changes in identity, and career preparation. Other items probe students' participation in important research-related activities that lead to these gains (e.g. giving presentations, having responsibility for a project). These activities, and the gains themselves, are based in research and thus constitute a core set of items. Using these items as a group helps to align a particular program assessment with research-demonstrated outcomes. Optional items may be used to probe particular features that are augment the research experience (e.g. field trips, career seminars, housing arrangements). The URSSA items are based on extensive, interview-based research and evaluation work on undergraduate research by our group and others. This grounding in research means that URSSA measures what we know to be important about the UR experience The items were tested with students, revised and re-tested. Data from a large pilot sample of over 500 students enabled statistical testing of the items' validity and reliability. Optional items about UR program elements were developed in consultation with UR program developers and leaders. The resulting instrument is flexible. Users begin with a set of core items, then customize their survey with optional items to probe students' experiences of specific program elements. The online instrument is free and easy to use, with numeric results available as raw data, summary statistics, cross-tabs, and graphs, and as raw, downloadable data. Finally, URSSA has high content validity based on its research grounding and rigorous development. We will present examples of how URSSA has been used in evaluations of UR programs. A multi-year evaluation of a university-based UR program shows that URSSA items are sensitive to differences in students' prior level of experience with research. For example, experienced student researchers reported greater gains than did their peers new to UR in understanding the process of research and in coming to see themselves as scientists. These differences are consistent with interview data that suggest a developmental progression of gains as students pursue research and gain confidence in their ability to contribute meaningfully. A second example comes from a multi-site evaluation of sites funded by the National Science Foundation's Research Experience for Undergraduates (REU) program in Biology. This study acquired data from nearly 800 students at some 60 Bio REU sites in 2010 and 2011. Results reveal differences in gains among demographic groups, and the general strength of these well-planned programs relative to a comparison sample of UR programs that are not part of REU. Our presentation will demonstrate the evaluative use of URSSA and its potential applications to undergraduate research in the geosciences.
Transfer Student Success: Educationally Purposeful Activities Predictive of Undergraduate GPA
ERIC Educational Resources Information Center
Fauria, Renee M.; Fuller, Matthew B.
2015-01-01
Researchers evaluated the effects of Educationally Purposeful Activities (EPAs) on transfer and nontransfer students' cumulative GPAs. Hierarchical, linear, and multiple regression models yielded seven statistically significant educationally purposeful items that influenced undergraduate student GPAs. Statistically significant positive EPAs for…
Jalaludin, My; Fuziah, Mz; Hong, Jyh; Mohamad Adam, B; Jamaiyah, H
2012-01-01
Self-care plays an important role in diabetes management. One of the instruments used to evaluate self-care in patients with diabetes is the Summary of Diabetes Self-Care Activities (SDSCA) questionnaire. A validated instrument in the Malay language is used to assess self-care practice among children and adolescents with diabetes in Malaysia. To translate and evaluate the psychometric properties of the revised version of the SDSCA questionnaire in the Malay language. Forward and backward translations were performed. An expert panel reviewed all versions for conceptual and content equivalence. The final version was administered to paediatric patients with diabetes between August 2006 and September 2007. Reliability was analysed using Cronbach's alpha and validity was assessed using exploratory factor analysis. A total of 117 patients aged 10-18 years were enrolled from nine hospitals. The reliability of overall core items was 0.735 (with item 4) while the reliabilities of the four domains were in the range of 0.539-0.838. As core item number 4 was found to be problematic and it was subtituted by item 5a (from the expanded SDSCA) to suit local dietary education and practice; and the reliabilities of the overall core item (0.782) and the four domains (0.620 - 0.838) improved. Factor loadings of all the items were greater than 0.4, loaded into the original domains, and accounted for 73% of the total variance. The Malay translation of the revised English SDSCA is reliable and valid as a guide for Malaysian children and adolescents suffering from diabetes.
Jette, Alan M.; McDonough, Christine M.; Haley, Stephen M.; Ni, Pengsheng; Olarsch, Sippy; Latham, Nancy; Hambleton, Ronald K.; Felson, David; Kim, Young-jo; Hunter, David
2012-01-01
Objective To develop and evaluate a prototype measure (OA-DISABILITY-CAT) for osteoarthritis research using Item Response Theory (IRT) and Computer Adaptive Test (CAT) methodologies. Study Design and Setting We constructed an item bank consisting of 33 activities commonly affected by lower extremity (LE) osteoarthritis. A sample of 323 adults with LE osteoarthritis reported their degree of limitation in performing everyday activities and completed the Health Assessment Questionnaire-II (HAQ-II). We used confirmatory factor analyses to assess scale unidimensionality and IRT methods to calibrate the items and examine the fit of the data. Using CAT simulation analyses, we examined the performance of OA-DISABILITY-CATs of different lengths compared to the full item bank and the HAQ-II. Results One distinct disability domain was identified. The 10-item OA-DISABILITY-CAT demonstrated a high degree of accuracy compared with the full item bank (r=0.99). The item bank and the HAQ-II scales covered a similar estimated scoring range. In terms of reliability, 95% of OA-DISABILITY reliability estimates were over 0.83 versus 0.60 for the HAQ-II. Except at the highest scores the 10-item OA-DISABILITY-CAT demonstrated superior precision to the HAQ-II. Conclusion The prototype OA-DISABILITY-CAT demonstrated promising measurement properties compared to the HAQ-II, and is recommended for use in LE osteoarthritis research. PMID:19216052
Darzins, Susan; Imms, Christine; Di Stefano, Marilyn; Taylor, Nicholas F; Pallant, Julie F
2014-11-05
The Personal Care Participation Assessment and Resource Tool (PC-PART) is a 43-item, clinician-administered assessment, designed to identify patients' unmet needs (participation restrictions) in activities of daily living (ADL) required for community life. This information is important for identifying problems that need addressing to enable, for example, discharge from inpatient settings to community living. The objective of this study was to evaluate internal construct validity of the PC-PART using Rasch methods. Fit to the Rasch model was evaluated for 41 PC-PART items, assessing threshold ordering, overall model fit, individual item fit, person fit, internal consistency, Differential Item Functioning (DIF), targeting of items and dimensionality. Data used in this research were taken from admission data from a randomised controlled trial conducted at two publically funded inpatient rehabilitation units in Melbourne, Australia, with 996 participants (63% women; mean age 74 years) and with various impairment types. PC-PART items assessed as one scale, and original PC-PART domains evaluated as separate scales, demonstrated poor fit to the Rasch model. Adequate fit to the Rasch model was achieved in two newly formed PC-PART scales: Self-Care (16 items) and Domestic Life (14 items). Both scales were unidimensional, had acceptable internal consistency (PSI =0.85, 0.76, respectively) and well-targeted items. Rasch analysis did not support conventional summation of all PC-PART item scores to create a total score. However, internal construct validity of the newly formed PC-PART scales, Self-Care and Domestic Life, was supported. Their Rasch-derived scores provided interval-level measurement enabling summation of scores to form a total score on each scale. These scales may assist clinicians, managers and researchers in rehabilitation settings to assess and measure changes in ADL participation restrictions relevant to community living. Data used in this research were gathered during a registered randomised controlled trial: Australian and New Zealand Clinical Trials Registry ACTRN12609000973213. Ethics committee approval was gained for secondary analysis of data for this study.
2010-01-01
Objectives. To evaluate, by age, the performance of 2 disability measures based on needing help: one using 5 classic activities of daily living (ADL) and another using an expanded set of 14 activities including instrumental activities of daily living (IADL), walking, getting outside, and ADL (IADL/ADL). Methods. Guttman and item response theory (IRT) scaling methods are used with a large (N = 25,470) nationally representative household survey of individuals aged 18 years and older. Results. Guttman scalability of the ADL items increases steadily with age, reaching a high level at ages 75 years and older. That is reflected in an IRT model by age-related differential item functioning (DIF) resulting in age-biased measurement of ADL. Guttman scalability of the IADL/ADL items also increases with age but is lower than the ADL. Although age-related DIF also occurs with IADL/ADL items, DIF is lower in magnitude and balances out without causing age bias. Discussion. An IADL/ADL scale measuring need for help is hierarchical, unidimensional, and unbiased by age. It has greater content validity for measuring need for help in the community and shows greater sensitivity by age than the classic ADL measure. As demand for community services is increasing among adults of all ages, an expanded IADL/ADL measure is more useful than ADL. PMID:20100786
DOE Office of Scientific and Technical Information (OSTI.GOV)
Weaver, Phyllis C.
2013-12-12
The U.S. Department of Energy (DOE) Oak Ridge Office of Environmental Management (EM-OR) requested Oak Ridge Associated Universities (ORAU), working under the Oak Ridge Institute for Science and Education (ORISE) contract, to provide technical and independent waste management planning support under the American Recovery and Reinvestment Act (ARRA). Specifically, DOE EM-OR requested ORAU to plan and implement a sampling and analysis campaign to target certain items associated with URS|CH2M Oak Ridge, LLC (UCOR) surveillance and maintenance (S&M) process inventory waste. Eight populations of historical and reoccurring S&M waste at the Oak Ridge National Laboratory (ORNL) have been identified in themore » Waste Handling Plan for Surveillance and Maintenance Activities at the Oak Ridge National Laboratory, DOE/OR/01-2565&D2 (WHP) (DOE 2012) for evaluation and processing for final disposal. This waste was generated during processing, surveillance, and maintenance activities associated with the facilities identified in the process knowledge (PK) provided in Appendix A. A list of items for sampling and analysis were generated from a subset of materials identified in the WHP populations (POPs) 4, 5, 6, 7, and 8, plus a small number of items not explicitly addressed by the WHP. Specifically, UCOR S&M project personnel identified 62 miscellaneous waste items that would require some level of evaluation to identify the appropriate pathway for disposal. These items are highly diverse, relative to origin; composition; physical description; contamination level; data requirements; and the presumed treatment, storage, and disposal facility (TSDF). Because of this diversity, ORAU developed a structured approach to address item-specific data requirements necessary for acceptance in a presumed TSDF that includes the Environmental Management Waste Management Facility (EMWMF)—using the approved Waste Lot (WL) 108.1 profile—the Y-12 Sanitary Landfill (SLF) if appropriate; EnergySolutions Clive; and the Nevada National Security Site (NNSS) (ORAU 2013b). Finally, the evaluation of these wastes was more suited to a judgmental sampling approach rather than a statistical design, meaning data were collected for each individual item, thereby providing information for item-byitem disposition decisions. ORAU prepared a sampling and analysis plan (SAP) that outlined data collection strategies, methodologies, and analytical guidelines and requirements necessary for characterizing targeted items (ORAU 2013b). The SAP described an approach to collect samples that allowed evaluation as to whether or not the waste would be eligible for disposal at the EMWMF. If the waste was determined not to be eligible for EMWMF disposal, then there would be adequate information collected that would allow the waste to be profiled for one of the alternate TSDFs listed above.« less
Graffigna, Guendalina; Barello, Serena; Bonanomi, Andrea; Lozza, Edoardo; Hibbard, Judith
2015-12-23
The Patient Activation Measure (PAM13) is an instrument that assesses patient knowledge, skills, and confidence for disease self-management. This cross-sectional study was aimed to validate a culturally-adapted Italian Patient Activation Measure (PAM13-I) for patients with chronic conditions. 519 chronic patients were involved in the Italian validation study and responded to PAM13-I. The PAM 13 was translated into Italian by a standardized forward-backward translation. Data quality was assessed by mean, median, item response, missing values, floor and ceiling effects, internal consistency (Cronbach's alpha and average inter-item correlation), item-rest correlations. Rasch Model and differential item functioning assessed scale properties. Mean PAM13-I score was 66.2. Rasch analysis showed that the PAM13-I is a good measure of patient activation. The level of internal consistency was good (α = 0.88). For all items, the distribution of answers was left-skewed, with a small floor effect (range 1.7-4.5 %) and a moderate ceiling effect (range 27.6-55.0 %). The Italian version formed a unidimensional, probabilistic Guttman-like scale explaining 41 % of the variance. The PAM13-I has been demonstrated to be a valid and reliable measure of patient activation and the present study suggests its applicability to the Italian-speaking chronic patient population. The measure has good psychometric properties and appears to be consistent with the developmental nature of the patient activation phenomenon, although it presents a different ranking order of the items comparing to the American version. PAM13-I can be a useful assessment tool to evaluate interventions aimed at improving patient engagement in healthcare and to train doctors in attuning their communication to the level of patients' activation. Future research could be conducted to further confirm the validity of the PAM13-I.
Fabricant, Peter D; Robles, Alex; Downey-Zayas, Timothy; Do, Huong T; Marx, Robert G; Widmann, Roger F; Green, Daniel W
2013-10-01
Having simple and reliable validated outcome measures is vital to conducting high-quality outcomes research in the field of orthopaedic surgery. Activity level is a key prognostic variable for patients with sports injuries. There is a paucity of such activity scales for children and adolescents who are otherwise healthy and athletically active. In addition to frequency and intensity of athletic activity, level of play and coach/trainer supervision are important variables unique to children and adolescents that are not captured in available adult scoring systems. To create and validate a concise and comprehensive activity rating scale for athletically active children and adolescents 10 to 18 years of age. Cohort study (diagnosis); Level of evidence, 2. Item generation was performed with a panel of orthopaedic surgeons and adolescent athletes. Item reduction, pilot testing and scale refinement resulted in a final 8-item instrument, the Hospital for Special Surgery Pediatric Functional Activity Brief Scale (HSS Pedi-FABS). Existing methods were used to determine reliability and validation. The Flesch-Kincaid score was calculated at a 6.6th-grade reading level (approximately 13 years old); therefore, although all subjects provided their own answers, parents were allowed to assist children younger than 13 years with reading the questionnaire. Scale reliability was excellent (test-retest reliability, intraclass correlation coefficient = 0.91; internal consistency, Cronbach alpha = .914), and there were no floor or ceiling effects. There was also robust construct validity: Convergent validity testing revealed positive correlations between the HSS Pedi-FABS and level of competition in athletic activity, number of reported hours of athletic activity per week, and existing comparable adult and pediatric scales. Discriminant validity was shown with age, body mass index, and type of sport as measured by the Daniel scale. The 8-item HSS Pedi-FABS can be used to reliably and accurately evaluate activity level as a prognostic variable for clinical research studies. It is a simple, reliable, and valid metric to assess activity in children and adolescents 10 to 18 years of age. This instrument will lead to better evaluation of posttreatment outcomes and patient-reported activity for child and adolescent athletes.
Constantine, Melissa L; Pauls, Rachel N; Rogers, Rebecca R; Rockwood, Todd H
2017-12-01
The Prolapse/Incontinence Sexual Questionnaire-International Urogynecology Association (IUGA) Revised (PISQ-IR) measures sexual function in women with pelvic floor disorders (PFDs) yet is unwieldy, with six individual subscale scores for sexually active women and four for women who are not. We hypothesized that a valid and responsive summary score could be created for the PISQ-IR. Item response data from participating women who completed a revised version of the PISQ-IR at three clinical sites were used to generate item weights using a magnitude estimation (ME) and Q-sort (Q) approaches. Item weights were applied to data from the original PISQ-IR validation to generate summary scores. Correlation and factor analysis methods were used to evaluate validity and responsiveness of summary scores. Weighted and nonweighted summary scores for the sexually active PISQ-IR demonstrated good criterion validity with condition-specific measures: Incontinence Severity Index = 0.12, 0.11, 0.11; Pelvic Floor Distress Inventory-20 = 0.39, 0.39, 0.12; Epidemiology of Prolapse and Incontinence Questionnaire-Q35 = 0.26 0,.25, 0.40); Female Sexual Functioning Index subscale total score = 0.72, 0.75, 0.72 for nonweighted, ME, and Q summary scores, respectively. Responsiveness evaluation showed weighted and nonweighted summary scores detected moderate effect sizes (Cohen's d > 0.5). Weighted items for those NSA demonstrated significant floor effects and did not meet criterion validity. A PISQ-IR summary score for use with sexually active women, nonweighted or calculated with ME or Q item weights, is a valid and reliable measure for clinical use. The summary scores provide value for assesing clinical treatment of pelvic floor disorders.
Validating the food behavior questions from the elementary school SPAN questionnaire.
Thiagarajah, Krisha; Fly, Alyce D; Hoelscher, Deanna M; Bai, Yeon; Lo, Kaman; Leone, Angela; Shertzer, Julie A
2008-01-01
The School Physical Activity and Nutrition (SPAN) questionnaire was developed as a surveillance instrument to measure physical activity, nutrition attitudes, and dietary and physical activity behaviors in children and adolescents. The SPAN questionnaire has 2 versions. This study was conducted to evaluate the validity of food consumption items from the elementary school version of the SPAN questionnaire. Validity was assessed by comparing food items selected on the questionnaire with food items reported from a single 24-hour recall covering the same reference period. 5 elementary schools in Indiana. Fourth-grade student volunteers (N = 121) from 5 elementary schools. Agreement between responses to SPAN questionnaire items and reference values obtained through 24-hour dietary recall. The agreement between the questionnaire and the 24-hour recall was measured using Spearman correlation, percentage agreement, and kappa statistic. Correlation between SPAN item responses and recall data ranged from .25 (bread and related products) to .67 (gravy). The percentage agreement ranged from 26% (bread and related products) to 90% (gravy). The kappa statistic varied from .06 (chocolate candy) to .60 (beans). Results from this study indicate that the SPAN questionnaire can be administered in the classroom quickly and easily to measure many previous day dietary behaviors of fourth graders. However, questions addressing consumption of "vegetables," "candy," and "snacks" need further investigation.
Development and Testing of the Nurse Manager EBP Competency Scale.
Shuman, Clayton J; Ploutz-Snyder, Robert J; Titler, Marita G
2018-02-01
The purpose of this study was to develop and evaluate the validity and reliability of an instrument to measure nurse manager competencies regarding evidence-based practice (EBP). The Nurse Manager EBP Competency Scale consists of 16 items for respondents to indicate their perceived level of competency on a 0 to 3 Likert-type scale. Content validity was demonstrated through expert panel review and pilot testing. Principal axis factoring and Cronbach's alpha evaluated construct validity and internal consistency reliability, respectively. Eighty-three nurse managers completed the scale. Exploratory factor analysis resulted in a 16-item scale with two subscales, EBP Knowledge ( n = 6 items, α = .90) and EBP Activity ( n = 10 items, α = .94). Cronbach's alpha for the entire scale was .95. The Nurse Manager EBP Competency Scale is a brief measure of nurse manager EBP competency with evidence of validity and reliability. The scale can enhance our understanding in future studies regarding how nurse manager EBP competency affects implementation.
Okochi, Jiro; Utsunomiya, Sakiko; Takahashi, Tai
2005-01-01
Background The International Classification of Functioning, Disability and Health (ICF) was published by the World Health Organization (WHO) to standardize descriptions of health and disability. Little is known about the reliability and clinical relevance of measurements using the ICF and its qualifiers. This study examines the test-retest reliability of ICF codes, and the rate of immeasurability in long-term care settings of the elderly to evaluate the clinical applicability of the ICF and its qualifiers, and the ICF checklist. Methods Reliability of 85 body function (BF) items and 152 activity and participation (AP) items of the ICF was studied using a test-retest procedure with a sample of 742 elderly persons from 59 institutional and at home care service centers. Test-retest reliability was estimated using the weighted kappa statistic. The clinical relevance of the ICF was estimated by calculating immeasurability rate. The effect of the measurement settings and evaluators' experience was analyzed by stratification of these variables. The properties of each item were evaluated using both the kappa statistic and immeasurability rate to assess the clinical applicability of WHO's ICF checklist in the elderly care setting. Results The median of the weighted kappa statistics of 85 BF and 152 AP items were 0.46 and 0.55 respectively. The reproducibility statistics improved when the measurements were performed by experienced evaluators. Some chapters such as genitourinary and reproductive functions in the BF domain and major life area in the AP domain contained more items with lower test-retest reliability measures and rated as immeasurable than in the other chapters. Some items in the ICF checklist were rated as unreliable and immeasurable. Conclusion The reliability of the ICF codes when measured with the current ICF qualifiers is relatively low. The result in increase in reliability according to evaluators' experience suggests proper education will have positive effects to raise the reliability. The ICF checklist contains some items that are difficult to be applied in the geriatric care settings. The improvements should be achieved by selecting the most relevant items for each measurement and by developing appropriate qualifiers for each code according to the interest of the users. PMID:16050960
Tucker, Carole A; Escorpizo, Reuben; Cieza, Alarcos; Lai, Jin Shei; Stucki, Gerold; Ustun, T. Bedirhan; Kostanjsek, Nenad; Cella, David; Forrest, Christopher B.
2014-01-01
Background The Patient Reported Outcomes Measurement Information System (PROMIS®) is a U.S. National Institutes of Health initiative that has produced self-reported item banks for physical, mental, and social health. Objective To describe the content of PROMIS at the item level using the World Health Organization’s International Classification of Functioning, Disability and Health (ICF). Methods All PROMIS adult items (publicly available as of 2012) were assigned to relevant ICF concepts. The content of the PROMIS adult item banks were then described using the mapped ICF code descriptors. Results The 1006 items in the PROMIS instruments could all be mapped to ICF concepts at the second level of classification, with the exception of 3 items of global or general health that mapped across the first-level classification of ICF activity and participation component (d categories). Individual PROMIS item banks mapped from 1 to 5 separate ICF codes indicating one-to-one, one-to-many and many-to-one mappings between PROMIS item banks and ICF second level classification codes. PROMIS supports measurement of the majority of major concepts in the ICF Body Functions (b) and Activity & Participation (d) components using PROMIS item banks or subsets of PROMIS items that could, with care, be used to develop customized instruments. Given the focus of PROMIS is on measurement of person health outcomes, concepts in body structures (s) and some body functions (b), as well as many ICF environmental factor have minimal coverage in PROMIS. Discussion The PROMIS-ICF mapped items provide a basis for users to evaluate the ICF related content of specific PROMIS instruments, and to select PROMIS instruments in ICF based measurement applications. PMID:24760532
Two-year follow-up of the Collision Auto Repair Safety Study (CARSS).
Bejan, Anca; Parker, David L; Brosseau, Lisa M; Xi, Min; Skan, Maryellen
2015-06-01
This paper presents an evaluation of the sustainability of health and safety improvements in small auto collision shops 1 year after the implementation of a year-long targeted intervention. During the first year (active phase), owners received quarterly phone calls, written reminders, safety newsletters, and access to online services and in-person assistance with creating safety programs and respirator fit testing. During the second year (passive phase), owners received up to three postcard reminders regarding the availability of free health and safety resources. Forty-five shops received an evaluation at baseline and at the end of the first year (Y1). Of these, 33 were evaluated at the end of the second year (Y2), using the same 92-item assessment tool. At Y1, investigators found that between 70 and 81% of the evaluated items were adequate in each business (mean = 73% items, SD = 11%). At Y2, between 63 and 89% of items were deemed adequate (mean = 73% items, SD = 9.5%). Three safety areas demonstrated statistically significant (P < 0.05) changes: compressed gasses (8% improvement), personal protective equipment (7% improvement), and respiratory protection (6% decline). The number of postcard reminders sent to each business did not affect the degree to which shops maintained safety improvements made during the first year of the intervention. However, businesses that received more postcards were more likely to request assistance services than those receiving fewer. © The Author 2014. Published by Oxford University Press on behalf of the British Occupational Hygiene Society.
Ranganath, Veena K; Yoon, Jeonglim; Khanna, Dinesh; Park, Grace S; Furst, Daniel E; Elashoff, David A; Jawaheer, Damini; Sharp, John T; Gold, Richard H; Keystone, Edward C; Paulus, Harold E
2007-01-01
Objective To evaluate concordance and agreement of the original DAS44/ESR‐4 item composite disease activity status measure with nine simpler derivatives when classifying patient responses by European League of Associations for Rheumatology (EULAR) criteria, using an early rheumatoid factor positive (RF+) rheumatoid arthritis (RA) patient cohort. Methods Disease‐modifying anti‐rheumatic drug‐naïve RF+ patients (n = 223; mean duration of symptoms, 6 months) were categorised as ACR none/20/50/70 responders. One‐way analysis of variance and two‐sample t tests were used to investigate the relationship between the ACR response groups and each composite measure. EULAR reached/change cut‐point scores were calculated for each composite measure. EULAR (good/moderate/none) responses for each composite measure and the degree of agreement with the DAS44/ESR‐4 item were calculated for 203 patients. Results Patients were mostly female (78%) with moderate to high disease activity. A centile‐based nomogram compared equivalent composite measure scores. Changes from baseline in the composite measures in patients with ACRnone were significantly less than those of ACR20/50/70 responders, and those for ACR50 were significantly different from those for ACR70. EULAR reached/change cut‐point scores for our cohort were similar to published cut‐points. When compared with the DAS44/ESR‐4 item, EULAR (good/moderate/none) percentage agreements were 92 with the DAS44/ESR‐3 item, 74 with the Clinical Disease Activity Index, and 80 with the DAS28/ESR‐4 item, the DAS28/CRP‐4 item and the Simplified Disease Activity Index. Conclusion The relationships of nine different RA composite measures against the DAS44/ESR‐4 item when applied to a cohort of seropositive patients with early RA are described. Each of these simplified status and response measures could be useful in assessing patients with RA, but the specific measure selected should be pre‐specified and described for each study. PMID:17472996
Yamaguchi, Yukio; Kai, Yuko; Kumamoto, Hiroko
2009-12-01
The purpose of the present trial was to develop and evaluate an educational program for promotion of healthy nutrition and physical activity by health volunteers. The educational program consisted of the following four phases: preliminary self-learning by mail (3 weeks), basic learning (3 sessions of 3 hours), practice of planned activities (2 months), and a report session (1 session of 3 hours). Beginner volunteers (n=18, mean age 63.3 +/- 6.4) were recruited from two volunteer health organizations in Kurume city. They then participated in a program that taught basic health knowledge regarding nutrition and physical activity, how to plan effective support activities, and methods for self-evaluation. In the preliminary self-learning phase, an assessment sheet, health information, and homework (goal setting, etc.) were delivered to the volunteers by mail. In the basic learning phase, volunteers attended a 3 day seminar on essential principles for behavioral change and assessment methods for volunteer activity. In addition, effective support activities were planned through group discussion. After a 2-month practice of support activities, each group reported and discussed the results of their activity in a 3-hour report session. Main outcome measures were health knowledge (15 items, 0-1 points), self-efficacy for life style support (5 items, 0-100%), and evaluation of the educational program (9 items, 1-5 points). All measures were self-administered. Significant increases in rate of true answers for health knowledge were observed during the preliminary self-learning and before basic learning phases (54.8% --> 67.1%, P < 0.05), and before and after basic learning phases (67.1% --> 87.6%, P < 0.05). Self-efficacy for life style support were significantly higher after the report session than before the preliminary self-learning phase (35.1% --> 53.1%, P < 0.05). In the two-month practice, all groups received feedback through questionnaires completed by participants who took part in their planned activity. The mean scores for the overall evaluation of the program, the effectiveness of the course materials and group-work, the staff, and the course contents were all higher than 4.0 points. These findings indicate that this program is structured effectively and is appropriate for educating beginner health volunteers regarding promotion of healthy nutrition and physical activity.
Ward, Dianne S; Mazzucca, Stephanie; McWilliams, Christina; Hales, Derek
2015-09-26
Early care and education (ECE) centers are important settings influencing young children's diet and physical activity (PA) behaviors. To better understand their impact on diet and PA behaviors as well as to evaluate public health programs aimed at ECE settings, we developed and tested the Environment and Policy Assessment and Observation - Self-Report (EPAO-SR), a self-administered version of the previously validated, researcher-administered EPAO. Development of the EPAO-SR instrument included modification of items from the EPAO, community advisory group and expert review, and cognitive interviews with center directors and classroom teachers. Reliability and validity data were collected across 4 days in 3-5 year old classrooms in 50 ECE centers in North Carolina. Center teachers and directors completed relevant portions of the EPAO-SR on multiple days according to a standardized protocol, and trained data collectors completed the EPAO for 4 days in the centers. Reliability and validity statistics calculated included percent agreement, kappa, correlation coefficients, coefficients of variation, deviations, mean differences, and intraclass correlation coefficients (ICC), depending on the response option of the item. Data demonstrated a range of reliability and validity evidence for the EPAO-SR instrument. Reporting from directors and classroom teachers was consistent and similar to the observational data. Items that produced strongest reliability and validity estimates included beverages served, outside time, and physical activity equipment, while items such as whole grains served and amount of teacher-led PA had lower reliability (observation and self-report) and validity estimates. To overcome lower reliability and validity estimates, some items need administration on multiple days. This study demonstrated appropriate reliability and validity evidence for use of the EPAO-SR in the field. The self-administered EPAO-SR is an advancement of the measurement of ECE settings and can be used by researchers and practitioners to assess the nutrition and physical activity environments of ECE settings.
Development and validation of Iranian children’s participation assessment scale
Amini, Malek; Hassani Mehraban, Afsoon; Haghni, Hamid; Asgharnezhad, Ali Asghar; Khayatzadeh Mahani, Mohammad
2016-01-01
Background: Participation is mostly cultural and familial based, and there is not any assessment scales for evaluating kids’ participation in Iranian context, therefore the purpose of this study was developing children’s participation assessment scale for Iranian children. Methods: Development of this scale occurred in two phases; phase I: planning: following reviewing the literature and adopting and compiling some items of available evaluation tools in the area (such as CAPE, CPQ, CLASS, Life-H) and receiving advice from two expert panels, the preliminary94- item questionnaire was prepared. Phase II: construct: the survey study was carried out on40 children and 21 of their parents to assess the popularity of the activity in Iran; thus, the items of the questionnaire reduced to 92 and after face and content validity, the final version prepared with 71 items. Results: The final 71-item questionnaire was developed in two parent-report and child-report versions. The 71 items based on the literature and expert panels’ advice were categorized in 8 areas of occupation according to Occupational Therapy Practice Framework (ADL, IADL, Play, leisure, social participation, education, work, and sleep/rest). Conclusion: Iranian children’s participation assessment is a useful and culturally relevant tool to measure participation of Iranian children. It can be used in rigorous clinical and population-based research. PMID:27390703
Preston, N.; Levesley, M.; Mon‐Williams, M.; O'Connor, R.J.
2017-01-01
Abstract Background and purpose Upper limb activity measures for children with cerebral palsy have a number of limitations, for example, lack of validity and poor responsiveness. To overcome these limitations, we developed the Children's Arm Rehabilitation Measure (ChARM), a parent‐reported questionnaire validated for children with cerebral palsy aged 5–16 years. This paper describes both the development of the ChARM items and response categories and its psychometric testing and further refinement using the Rasch measurement model. Methods To generate valid items for the ChARM, we collected goals of therapy specifically developed by therapists, children with cerebral palsy, and their parents for improving activity limitation of the upper limb. The activities, which were the focus of these goals, formed the basis for the items. Therapists typically break an activity into natural stages for the purpose of improving activity performance, and these natural orders of achievement formed each item's response options. Items underwent face validity testing with health care professionals, parents of children with cerebral palsy, academics, and lay persons. A Rasch analysis was performed on ChARM questionnaires completed by the parents of 170 children with cerebral palsy from 12 hospital paediatric services. The ChARM was amended, and the procedure repeated on 148 ChARMs (from children's mean age: 10 years and 1 month; range: 4 years and 8 months to 16 years and 11 months; 85 males; Manual Ability Classification System Levels I = 9, II = 26, III = 48, IV = 45, and V = 18). Results The final 19‐item unidimensional questionnaire displayed fit to the Rasch model (chi‐square p = .18), excellent reliability (person separation index = 0.95, α = 0.95), and no floor or ceiling effects. Items showed no response bias for gender, distribution of impairment, age, or learning disability. Discussion The ChARM is a psychometrically sound measure of upper limb activity validated for children with cerebral palsy aged 5–16 years. The ChARM is freely available for use to clinicians and nonprofit organisations. PMID:28112465
Mental health in primary care: an evaluation using the Item Response Theory
da Rocha, Hugo André; dos Santos, Alaneir de Fátima; Reis, Ilka Afonso; Santos, Marcos Antônio da Cunha; Cherchiglia, Mariângela Leal
2018-01-01
ABSTRACT OBJECTIVE To determine the items of the Brazilian National Program for Improving Access and Quality of Primary Care that better evaluate the capacity to provide mental health care. METHODS This is a cross-sectional study carried out using the Graded Response Model of the Item Response Theory using secondary data from the second cycle of the National Program for Improving Access and Quality of Primary Care, which evaluates 30,523 primary care teams in the period from 2013 to 2014 in Brazil. The internal consistency, correlation between items, and correlation between items and the total score were tested using the Cronbach’s alpha, Spearman’s correlation, and point biserial coefficients, respectively. The assumptions of unidimensionality and local independence of the items were tested. Word clouds were used as one way to present the results. RESULTS The items with the greatest ability to discriminate were scheduling of the agenda according to risk stratification, keeping of records of the most serious cases of users in psychological distress, and provision of group care. The items that required a higher level of mental health care in the parameter of location were the provision of any type of group care and the provision of educational and mental health promotion activities. Total Cronbach’s alpha coefficient was 0.87. The items that obtained the highest correlation with total score were the recording of the most serious cases of users in psychological distress and scheduling of the agenda according to risk stratification. The final scores obtained oscillated between -2.07 (minimum) and 1.95 (maximum). CONCLUSIONS There are important aspects in the discrimination of the capacity to provide mental health care by primary health care teams: risk stratification for care management, follow-up of the most serious cases, group care, and preventive and health promotion actions. PMID:29489992
Brown, Heidi Wendell; Wise, Meg E.; Westenberg, Danielle; Schmuhl, Nicholas B.; Brezoczky, Kelly Lewis; Rogers, Rebecca G.; Constantine, Melissa L.
2017-01-01
Introduction and hypothesis Fewer than 30% of women with accidental bowel leakage (ABL) seek care, despite the existence of effective, minimally invasive therapies. We developed and validated a condition-specific instrument to assess barriers to care-seeking for ABL in women. Methods Adult women with ABL completed an electronic survey about condition severity, patient activation, previous care-seeking, and demographics. The Barriers to Care-seeking for Accidental Bowel Leakage (BCABL) instrument contained 42 potential items completed at baseline and again 2 weeks later. Paired t tests evaluated test–retest reliability. Factor analysis evaluated factor structure and guided item retention. Cronbach’s alpha evaluated internal consistency. Within and across factor item means generated a summary BCABL score used to evaluate scale validity with six external criterion measures. Results Among 1,677 click-throughs, 736 (44%) entered the survey; 95% of eligible female respondents (427 out of 458) provided complete data. Fifty-three percent of respondents had previously sought care for their ABL; median age was 62 years (range 27–89); mean Vaizey score was 12.8 (SD = 5.0), indicating moderate to severe ABL. Test–retest reliability was excellent for all items. Factor extraction via oblique rotation resulted in the final structure of 16 items in six domains, within which internal consistency was high. All six external criterion measures correlated significantly with BCABL score. Conclusions The BCABL questionnaire, with 16 items mapping to six domains, has excellent criterion validity and test–retest reliability when administered electronically in women with ABL. The BCABL can be used to identify care-seeking barriers for ABL in different populations, inform targeted interventions, and measure their effectiveness. PMID:28236039
Dirven, L.; Meijer, W.; Sikkes, S.A.M.; Reijneveld, J.C.; Aaronson, N.K.; Uitdehaag, B.M.J.; Taphoorn, M. J. B.
2014-01-01
BACKGROUND: Next to health-related quality of life, information on daily life functioning in brain tumour patients is essential. Instrumental Activities of Daily Living (I-ADL) are complex daily activities, such as food preparation and shopping. I-ADL may be negatively influenced by a cognitive decline, characteristic of brain tumor patients. OBJECTIVE: In the first phase of this project, we generated a provisional list of items measuring I-ADL that are relevant for primary brain tumour patients. METHODS: Questions from the Amsterdam IADL Questionnaire®, a 70-item questionnaire developed and validated to measure I-ADL in patients with dementia, were evaluated for relevance to brain tumour patients. In addition, new activities were generated. In the first step, 6 professional experts in neuro-oncology and 10 primary brain tumour patient-proxy dyads were asked to evaluate items in the Amsterdam IADL Questionnaire®. Experts had to indicate if these activities (1) could be considered as I-ADL, (2) were affected in brain tumour patients and (3) were clearly formulated. Patients and their proxies only needed to answer the latter two questions. In the second step, the same 6 experts, and in addition 6 other patient-proxy dyads were asked to generate new activities. To do so, in-depth interviews were conducted. Decision rules were determined to aid in deciding which items to retain (step 1) or to add (step 2). Activities that were indicated as IADL, affected and clearly formulated were retained. Activities that were considered as IADL and affected, but not clearly formulated, were rephrased. New activities that were frequently generated were added to the existing list of items. RESULTS: In step 1, experts indicated that 37% of the activities described in the Amsterdam IADL questionnaire® fulfilled all three criteria: conform the definition of IADL, clearly formulated and affected in brain tumour patients. Twenty-three per cent of the activities were affected and conform the provided definition, but not clearly formulated. According to patients and their proxies, 19% and 17% of the activities were clearly formulated and affected in brain tumour patients, respectively. Moreover, 1-3% of the activities were indicated to be affected, but not clearly formulated. Several new activities (concerning social interaction and work) were generated in step 2. With the decision rules as guide, it was decided in consensus that a total of 30 questions of the Amsterdam IADL questionnaire® could also be used to measure I-ADL in primary brain tumour patients. In addition, 16 new questions covering other relevant activities for brain tumour patients were added. CONCLUSION: This first phase resulted in a provisional questionnaire of 46 items intending to measure I-ADL in primary brain tumour patients. The next step is to validate this provisional questionnaire in a larger sample of patients.
Evaluation of Faculty Performance in Extension and Service. AIR 1989 Annual Forum Paper.
ERIC Educational Resources Information Center
Montgomery, James R.; And Others
A widespread perception exists that faculty with public service or extension activities are not treated equitably either in annual evaluations for merit salary increases or in peer evaluation for promotion. To determine the items considered important in making personnel decisions in extension and service areas, a survey was sent to chief academic…
Cramm, Jane Murray; Rutten-Van Mölken, Maureen PMH; Nieboer, Anna Petra
2012-01-01
Objective We investigated whether patients with chronic obstructive pulmonary disease (COPD) who were enrolled in disease-management programmes (DMPs) felt that they received a better quality of care than non-enrolled COPD patients. Methods Our cross-sectional study was performed among patients (n=665) enrolled in four DMPs in the Netherlands. We also evaluated COPD patients (n=227) not enrolled in such programmes. Patients’ assessment of chronic-illness care (PACIC) was measured with a 20-item questionnaire. The instrument had five pre-defined domains: patient activation (three items), delivery-system/practice design (three items), goal setting/tailoring (five items), problem solving/contextual (four items), and follow-up/coordination (five items). Results The mean overall PACIC score (scale: 1–5) of enrolled DMP patients was 2.94, and that of non-enrolled DMP patients was 2.73 (p≤0.01). Differences in the same direction were found in the subscales of patient activation (p≤0.01), delivery-system/practice design (p≤0.001), and problem solving/contextual (p≤0.001). Conclusions Our results suggest that even in the early stages of implementation, DMPs for COPD may significantly improve care. PMID:23593052
Calley, Darren Q; Jackson, Steven; Collins, Heather; George, Steven Z
2010-12-01
Cross-sectional. To evaluate the accuracy with which physical therapists identify fear-avoidance beliefs in patients with low back pain by comparing therapist ratings of perceived patient fear-avoidance to the Fear-Avoidance Beliefs Questionnaire (FABQ), Tampa Scale of Kinesiophobia 11-item (TSK-11), and Pain Catastrophizing Scale (PCS). To compare the concurrent validity of therapist ratings of perceived patient fear-avoidance and a 2-item questionnaire on fear of physical activity and harm, with clinical measures of fear-avoidance (FABQ, TSK-11, PCS), pain intensity as assessed with a numeric pain rating scale (NPRS), and disability as assessed with the Oswestry Disability Questionnaire (ODQ). The need to consider psychosocial factors for identifying patients at risk for disability and chronic low back pain has been well documented. Yet the ability of physical therapists to identify fear-avoidance beliefs using direct observation has not been studied. Eight physical therapists and 80 patients with low back pain from 3 physical therapy clinics participated in the study. Patients completed the FABQ, TSK-11, PCS, ODQ, NPRS, and a dichotomous 2-item fear-avoidance screening questionnaire. Following the initial evaluation, physical therapists rated perceived patient fear-avoidance on a 0-to-10 scale and recorded 2 influences on their ratings. Spearman correlation and independent t tests determined the level of association of therapist 0-to-10 ratings and 2-item screening with fear-avoidance and clinical measures. Therapist ratings of perceived patient fear-avoidance had fair to moderate interrater reliability (ICC2,1 = 0.663). Therapist ratings did not strongly correlate with FABQ or TSK-11 scores. Instead, they unexpectedly had stronger associations with ODQ and PCS scores. Both 2-item screening questions were associated with FABQ-physical activity scores, while the fear of physical activity question was also associated with FABQ-work, TSK-11, PCS, and ODQ scores. Therapists' ratings of perceived patient fear-avoidance were not associated with self-reported fear-avoidance scores, showing a potential disconnect between therapist judgments and commonly used fear-avoidance measures. Instead, therapist ratings had small but statistically significant correlations with pain catastrophizing and disability, findings that may support therapists' inability to discriminate fear-avoidance from these other factors. The 2-item screening questions based on fear of physical activity and harm showed potential to identify elevated FABQ physical activity scores. Differential diagnosis, level 2b.
Yuen, Eva; Knight, Tess; Dodson, Sarity; Chirgwin, Jacqueline; Busija, Lucy; Ricciardelli, Lina A; Burney, Susan; Parente, Phillip; Livingston, Patricia M
2018-05-01
Caregivers have been largely neglected in health literacy measurement. We assess the construct validity, and internal consistency of the Health Literacy of Caregivers Scale-Cancer (HLCS-C), and present a revised, psychometrically robust scale. Using data from 297 cancer caregivers (12.4% response rate) recruited from Melbourne, Australia between January-July 2014, confirmatory factor analysis (CFA) was conducted to evaluate the HLCS-C's proposed factor structure. Items were evaluated for: item difficulty, unidimensionality and overall item fit within their domain. Item-threshold-ordering was examined though one-parameter Item Response Theory models. Internal consistency was assessed using Raykov's reliability coefficient. CFA results identified 42 poorly performing/redundant items which were subsequently removed. A 10-factor model was fitted to 46 acceptable items with no correlated residuals or factor cross-loadings accepted. Adequate fit was revealed (χ 2 WLSMV = 1463.807[df = 944], p < .001, RMSEA = 0.043, CFI = 0.980, TLI = 0.978, WRMR = 1.00). Ten domains were identified: Proactivity and determination to seek information; Adequate information about cancer and cancer management; Supported by healthcare providers (HCP) to understand information; Social support; Cancer-related communication with the care recipient (CR); Understanding CR needs and preferences; Self-care; Understanding the healthcare system; Capacity to process health information; and Active engagement with HCP. Internal consistency was adequate across domains (0.78-0.92). The revised HLCS-C demonstrated good structural, convergent, and discriminant validity, and high internal consistency. The scale may be useful for the development and evaluation of caregiver interventions. © 2017 John Wiley & Sons Ltd.
Ghazanfari, Zeinab; Niknami, Shamsaddin; Ghofranipour, Fazlollah; Hajizadeh, Ebrahim; Montazeri, Ali
2010-11-09
This study carried out to develop a scale for assessing diabetic patients' perceptions about physical activity and to test its psychometric properties (The Physical Activity Questionnaire for Diabetic Patients-PAQ-DP). An item pool extracted from the Theory of Planned Behavior literature was generated. Then an expert panel evaluated the items by assessing content validity index and content validity ratio. Consequently exploratory factor analysis (EFA) was performed to indicate the scale constructs. In addition reliability analyses including internal consistency and test-retest analysis were carried out. In all a sample of 127 women with diabetes participated in the study. Twenty-two items were initially extracted from the literature. A six-factor solution (containing 19 items) emerged as a result of an exploratory factor analysis namely: instrumental attitude, subjective norm, perceived behavioral control, affective attitude, self-identity, and intention explaining 60.30% of the variance observed. Additional analyses indicated satisfactory results for internal consistency (Cronbach's alpha ranging from 0.54 to 0.8) and intraclass correlation coefficients (ranging from 0.40 to 0.92). The Physical Activity Questionnaire for Diabetic Patients (PAQ-DP) is the first instrument that applies the Theory of Planned Behavior in its constructs. The findings indicated that the PAQ-DP is a reliable and valid measure for assessing physical activity perceptions and now is available and can be used in future studies.
2010-01-01
Background This study carried out to develop a scale for assessing diabetic patients' perceptions about physical activity and to test its psychometric properties (The Physical Activity Questionnaire for Diabetic Patients-PAQ-DP). Methods An item pool extracted from the Theory of Planned Behavior literature was generated. Then an expert panel evaluated the items by assessing content validity index and content validity ratio. Consequently exploratory factor analysis (EFA) was performed to indicate the scale constructs. In addition reliability analyses including internal consistency and test-retest analysis were carried out. Results In all a sample of 127 women with diabetes participated in the study. Twenty-two items were initially extracted from the literature. A six-factor solution (containing 19 items) emerged as a result of an exploratory factor analysis namely: instrumental attitude, subjective norm, perceived behavioral control, affective attitude, self-identity, and intention explaining 60.30% of the variance observed. Additional analyses indicated satisfactory results for internal consistency (Cronbach's alpha ranging from 0.54 to 0.8) and intraclass correlation coefficients (ranging from 0.40 to 0.92). Conclusions The Physical Activity Questionnaire for Diabetic Patients (PAQ-DP) is the first instrument that applies the Theory of Planned Behavior in its constructs. The findings indicated that the PAQ-DP is a reliable and valid measure for assessing physical activity perceptions and now is available and can be used in future studies. PMID:21062466
Abasi, Mohammad Hadi; Eslami, Ahmad Ali; Rakhshani, Fatemeh; Shiri, Mansoor
2016-01-01
Attention to different aspects of self-efficacy leads to actual evaluation of self-efficacy about physical activity. This study was carried out in order to design and determine psychometric characteristics of a questionnaire for evaluation of self-efficacy about leisure time physical activity (SELPA) among Iranian adolescent boys, with an emphasis on regulatory self-efficacy. This descriptive-analytic study was conducted in 734 male adolescents aged 15-19 years in Isfahan. After item generation and item selection based on review of literature and other questionnaires, content validity index (CVI) and content validity ratio (CVR) were determined and items were modified employing the opinions of expert panel (N = 10). Comprehensibility of the questionnaire was determined by members of target group (N = 35). Exploratory factors analysis (EFA) was operated on sample 1 (N 1 = 325) and confirmatory factors analysis (CFA) on sample 2 (N 2 = 347). Reliability of SELPA was estimated via internal consistency method. According to EFA, barrier self-efficacy and scheduling self-efficacy are the two main aspects of SELPA with the total variance of 65%. The suggested model was confirmed by CFA and all fitness indices of the corrected model were good. Cronbach's alpha was totally estimated as 0.89 and for barrier and scheduling self-efficacy, it was 0.86 and 0.81, respectively. The results provide some evidence for acceptable validity and reliability of SELPA in Iranian adolescent boys. However, further investigations, especially for evaluation of predictive power of the questionnaire, are necessary.
Oyeyemi, Adewale L; Kasoma, Sandra S; Onywera, Vincent O; Assah, Felix; Adedoyin, Rufus A; Conway, Terry L; Moss, Sarah J; Ocansey, Reginald; Kolbe-Alexander, Tracy L; Akinroye, Kingsley K; Prista, Antonio; Larouche, Richard; Gavand, Kavita A; Cain, Kelli L; Lambert, Estelle V; Aryeetey, Richmond; Bartels, Clare; Tremblay, Mark S; Sallis, James F
2016-03-08
Built environment and policy interventions are effective strategies for controlling the growing worldwide deaths from physical inactivity-related non-communicable diseases. To improve built environment research and develop African specific evidence, it is important to first tailor built environment measures to African contexts and assess their psychometric properties across African countries. This study reports on the adaptation and test-retest reliability of the Neighborhood Environment Walkability Scale in seven sub-Saharan African countries (NEWS-Africa). The original NEWS comprising 8 subscales measuring reported physical and social attributes of neighborhood environments was systematically adapted for Africa through extensive input from physical activity and public health researchers, built environment professionals, and residents in seven African countries: Cameroon, Ghana, Kenya, Mozambique, Nigeria, South Africa and Uganda. Cognitive testing of NEWS-Africa was conducted among diverse residents (N = 109, 50 youth [12 - 17 years] and 59 adults [22 - 67 years], 69 % from low socioeconomic status [SES] neighborhoods). NEWS-Africa was translated into local languages and evaluated for 2-week test-retest reliability in adult participants (N = 301; female = 50.2 %; age = 32.3 ± 12.9 years) purposively recruited from neighborhoods varying in walkability (high and low walkable) and SES (high and low income) and from villages in six of seven participating countries. The original 67 NEWS items was expanded to 89 scores (76 individual NEWS items and 13 computed scales). Several modifications were made to individual items, and some new items were added to capture important attributes in the African environment. A new scale on personal safety was created, and the aesthetics scale was enlarged to reflect African specific characteristics. Over 95 % of all NEWS-Africa scores (items plus computed scales) demonstrated evidence of "excellent" (ICCs > .75 %) or "good" (ICCs = 0.60 to 0.74) reliability. Seven (53.8 %) of the 13 computed NEWS scales demonstrated "excellent" agreement and the other six had "good" agreement. No items or scales demonstrated "poor" reliability (ICCs < .40). The systematic adaptation and initial psychometric evaluation of NEWS-Africa indicates the instrument is feasible and reliable for use with adults of diverse demographic characteristics in Africa. The measure is likely to be useful for research, surveillance of built environment conditions for planning purposes, and to evaluate physical activity and policy interventions in Africa.
Raglio, Alfredo; Gnesi, Marco; Monti, Maria Cristina; Oasi, Osmano; Gianotti, Marta; Attardo, Lapo; Gontero, Giulia; Morotti, Lara; Boffelli, Sara; Imbriani, Chiara; Montomoli, Cristina; Imbriani, Marcello
2017-11-01
Music therapy (MT) interventions are aimed at creating and developing a relationship between patient and therapist. However, there is a lack of validated observational instruments to consistently evaluate the MT process. The purpose of this study was the validation of Music Therapy Session Assessment Scale (MT-SAS), designed to assess the relationship between therapist and patient during active MT sessions. Videotapes of a single 30-min session per patient were considered. A pilot study on the videotapes of 10 patients was carried out to help refine the items, define the scoring system and improve inter-rater reliability among the five raters. Then, a validation study on 100 patients with different clinical conditions was carried out. The Italian MT-SAS was used throughout the process, although we also provide an English translation. The final scale consisted of 7 binary items accounting for eye contact, countenance, and nonverbal and sound-music communication. In the pilot study, raters were found to share an acceptable level of agreement in their assessments. Explorative factorial analysis disclosed a single homogeneous factor including 6 items (thus supporting an ordinal total score), with only the item about eye contact being unrelated to the others. Moreover, the existence of 2 different archetypal profiles of attuned and disattuned behaviours was highlighted through multiple correspondence analysis. As suggested by the consistent results of 2 different analyses, MT-SAS is a reliable tool that globally evaluates sonorous-musical and nonverbal behaviours related to emotional attunement and empathetic relationship between patient and therapist during active MT sessions. Copyright © 2017 John Wiley & Sons, Ltd.
Elvén, Maria; Hochwälder, Jacek; Dean, Elizabeth; Söderlund, Anne
2018-05-01
A systematically developed and evaluated instrument is needed to support investigations of physiotherapists' clinical reasoning integrated with the process of clients' behavior change. This study's aim was to develop an instrument to assess physiotherapy students' and physiotherapists' clinical reasoning focused on clients' activity-related behavior and behavior change, and initiate its evaluation, including feasibility and content validity. The study was conducted in three phases: 1) determination of instrument structure and item generation, based on a model, guidelines for assessing clinical reasoning, and existing measures; 2) cognitive interviews with five physiotherapy students to evaluate item understanding and feasibility; and 3) a Delphi process with 18 experts to evaluate content relevance. Phase 1 resulted in an instrument with four domains: Physiotherapist; Input from client; Functional behavioral analysis; and Strategies for behavior change. The instrument consists of case scenarios followed by items in which key features are identified, prioritized, or interpreted. Phase 2 resulted in revisions of problems and approval of feasibility. Phase 3 demonstrated high level of consensus regarding the instrument's content relevance. This feasible and content-validated instrument shows potential for use in investigations of physiotherapy students' and physiotherapists' clinical reasoning, however continued development and testing are needed.
Fan, Yuying; Li, Qiujie; Yang, Shufen; Guo, Ying; Yang, Libin; Zhao, Shibin
2014-01-01
Purpose. Researchers developed evaluation tools measuring employment relevant satisfaction for nursing new graduates. The evaluation tools were designed to be relevant to nursing managers who make employment decisions and nursing new graduates who were just employed. Methods. In-depth interviews and an expert panel were established to review the activities that evaluate the employee and employer satisfaction of nursing new graduates. Based on individual interviews and literature review, evaluation items were selected. A two-round Delphi study was then conducted from September 2008 to May 2009 with a panel of experts from a range of nursing colleges in China. Results. The response rate was 100% and Kendall's W was 0.73 in the second round of Delphi study. After two rounds of Delphi surveys, a list of 5 employee satisfaction items and 4 employer satisfaction items was identified for nursing new graduates. Conclusions. The findings of this study identified a different but multidimensional set of factors for employment relevant satisfaction, which confirmed the importance of certain fundamental aspects of practice. We developed the evaluation tools to assess the employer and employee satisfaction of nursing new graduates, which provided a database for further study. PMID:25097876
Fan, Yuying; Li, Qiujie; Yang, Shufen; Guo, Ying; Yang, Libin; Zhao, Shibin
2014-01-01
Researchers developed evaluation tools measuring employment relevant satisfaction for nursing new graduates. The evaluation tools were designed to be relevant to nursing managers who make employment decisions and nursing new graduates who were just employed. In-depth interviews and an expert panel were established to review the activities that evaluate the employee and employer satisfaction of nursing new graduates. Based on individual interviews and literature review, evaluation items were selected. A two-round Delphi study was then conducted from September 2008 to May 2009 with a panel of experts from a range of nursing colleges in China. The response rate was 100% and Kendall's W was 0.73 in the second round of Delphi study. After two rounds of Delphi surveys, a list of 5 employee satisfaction items and 4 employer satisfaction items was identified for nursing new graduates. The findings of this study identified a different but multidimensional set of factors for employment relevant satisfaction, which confirmed the importance of certain fundamental aspects of practice. We developed the evaluation tools to assess the employer and employee satisfaction of nursing new graduates, which provided a database for further study.
Wendt, Anne; Harmes, J Christine
2009-01-01
This article is a continuation of the research on the development and evaluation of innovative item formats for the NCLEX examinations that was published in the March/April 2009 edition of Nurse Educator. The authors discuss the innovative item templates and evaluate the statistical characteristics and level of cognitive processing required to answer the examination items.
O'Connor, Teresia M; Cerin, Ester; Hughes, Sheryl O; Robles, Jessica; Thompson, Deborah I; Mendoza, Jason A; Baranowski, Tom; Lee, Rebecca E
2014-01-15
Latino preschoolers (3-5 year old children) have among the highest rates of obesity. Low levels of physical activity (PA) are a risk factor for obesity. Characterizing what Latino parents do to encourage or discourage their preschooler to be physically active can help inform interventions to increase their PA. The objective was therefore to develop and assess the psychometrics of a new instrument: the Preschooler Physical Activity Parenting Practices (PPAPP) among a Latino sample, to assess parenting practices used to encourage or discourage PA among preschool-aged children. Cross-sectional study of 240 Latino parents who reported the frequency of using PA parenting practices. 95% of respondents were mothers; 42% had more than a high school education. Child mean age was 4.5 (±0.9) years (52% male). Test-retest reliability was assessed in 20%, 2 weeks later. We assessed the fit of a priori models using Confirmatory factor analyses (CFA). In a separate sub-sample (35%), preschool-aged children wore accelerometers to assess associations with their PA and PPAPP subscales. The a-priori models showed poor fit to the data. A modified factor structure for encouraging PPAPP had one multiple-item scale: engagement (15 items), and two single-items (have outdoor toys; not enroll in sport-reverse coded). The final factor structure for discouraging PPAPP had 4 subscales: promote inactive transport (3 items), promote screen time (3 items), psychological control (4 items) and restricting for safety (4 items). Test-retest reliability (ICC) for the two scales ranged from 0.56-0.85. Cronbach's alphas ranged from 0.5-0.9. Several sub-factors correlated in the expected direction with children's objectively measured PA. The final models for encouraging and discouraging PPAPP had moderate to good fit, with moderate to excellent test-retest reliabilities. The PPAPP should be further evaluated to better assess its associations with children's PA and offers a new tool for measuring PPAPP among Latino families with preschool-aged children.
ERIC Educational Resources Information Center
Pierce, W. David; Sydie, R. A.; Stratkotter, Rainer
2003-01-01
Male and female participants (N = 274) made judgments about the social concepts of "feminist," "man," and "woman" on 63 semantic differential items. Factor analysis identified three basic dimensions termed evaluative, potency, and activity as well as two secondary factors called expressiveness and sexuality. Results for the evaluative dimension…
A Novel Teaching Tool Combined With Active-Learning to Teach Antimicrobial Spectrum Activity.
MacDougall, Conan
2017-03-25
Objective. To design instructional methods that would promote long-term retention of knowledge of antimicrobial pharmacology, particularly the spectrum of activity for antimicrobial agents, in pharmacy students. Design. An active-learning approach was used to teach selected sessions in a required antimicrobial pharmacology course. Students were expected to review key concepts from the course reader prior to the in-class sessions. During class, brief concept reviews were followed by active-learning exercises, including a novel schematic method for learning antimicrobial spectrum of activity ("flower diagrams"). Assessment. At the beginning of the next quarter (approximately 10 weeks after the in-class sessions), 360 students (three yearly cohorts) completed a low-stakes multiple-choice examination on the concepts in antimicrobial spectrum of activity. When data for students was pooled across years, the mean number of correct items was 75.3% for the items that tested content delivered with the active-learning method vs 70.4% for items that tested content delivered via traditional lecture (mean difference 4.9%). Instructor ratings on student evaluations of the active-learning approach were high (mean scores 4.5-4.8 on a 5-point scale) and student comments were positive about the active-learning approach and flower diagrams. Conclusion. An active-learning approach led to modestly higher scores in a test of long-term retention of pharmacology knowledge and was well-received by students.
A Novel Teaching Tool Combined With Active-Learning to Teach Antimicrobial Spectrum Activity
2017-01-01
Objective. To design instructional methods that would promote long-term retention of knowledge of antimicrobial pharmacology, particularly the spectrum of activity for antimicrobial agents, in pharmacy students. Design. An active-learning approach was used to teach selected sessions in a required antimicrobial pharmacology course. Students were expected to review key concepts from the course reader prior to the in-class sessions. During class, brief concept reviews were followed by active-learning exercises, including a novel schematic method for learning antimicrobial spectrum of activity (“flower diagrams”). Assessment. At the beginning of the next quarter (approximately 10 weeks after the in-class sessions), 360 students (three yearly cohorts) completed a low-stakes multiple-choice examination on the concepts in antimicrobial spectrum of activity. When data for students was pooled across years, the mean number of correct items was 75.3% for the items that tested content delivered with the active-learning method vs 70.4% for items that tested content delivered via traditional lecture (mean difference 4.9%). Instructor ratings on student evaluations of the active-learning approach were high (mean scores 4.5-4.8 on a 5-point scale) and student comments were positive about the active-learning approach and flower diagrams. Conclusion. An active-learning approach led to modestly higher scores in a test of long-term retention of pharmacology knowledge and was well-received by students. PMID:28381885
Hoffman, Aubri S; Abhyankar, Purva; Sheridan, Stacey; Bekker, Hilary; LeBlanc, Annie; Levin, Carrie; Ropka, Mary; Shaffer, Victoria; Stacey, Dawn; Stalmeier, Peep; Vo, Ha; Wills, Celia; Thomson, Richard
2018-01-01
This Explanation and Elaboration (E&E) article expands on the 26 items in the Standards for UNiversal reporting of Decision Aid Evaluations guidelines. The E&E provides a rationale for each item and includes examples for how each item has been reported in published papers evaluating patient decision aids. The E&E focuses on items key to reporting studies evaluating patient decision aids and is intended to be illustrative rather than restrictive. Authors and reviewers may wish to use the E&E broadly to inform structuring of patient decision aid evaluation reports, or use it as a reference to obtain details about how to report individual checklist items. PMID:29467235
15 CFR 400.27 - Criteria applicable to evaluation of applications for production authority.
Code of Federal Regulations, 2013 CFR
2013-01-01
... value-added activity; (4) Extent of value-added activity; (5) Overall effect on import levels of... determining cause of imports. Thus, without undertaking a review of the economic factors enumerated in § 400..., taking into account imports both as individual items and as components of imported products. (b) Economic...
15 CFR 400.27 - Criteria applicable to evaluation of applications for production authority.
Code of Federal Regulations, 2014 CFR
2014-01-01
... value-added activity; (4) Extent of value-added activity; (5) Overall effect on import levels of... determining cause of imports. Thus, without undertaking a review of the economic factors enumerated in § 400..., taking into account imports both as individual items and as components of imported products. (b) Economic...
Kasper, Judith D.; Brandt, Jason; Pezzin, Liliana E.
2012-01-01
Objective. To examine the measurement equivalence of items on disability across three international surveys of aging. Method. Data for persons aged 65 and older were drawn from the Health and Retirement Survey (HRS, n = 10,905), English Longitudinal Study of Aging (ELSA, n = 5,437), and Survey of Health, Ageing and Retirement in Europe (SHARE, n = 13,408). Differential item functioning (DIF) was assessed using item response theory (IRT) methods for activities of daily living (ADL) and instrumental activities of daily living (IADL) items. Results. HRS and SHARE exhibited measurement equivalence, but 6 of 11 items in ELSA demonstrated meaningful DIF. At the scale level, this item-level DIF affected scores reflecting greater disability. IRT methods also spread out score distributions and shifted scores higher (toward greater disability). Results for mean disability differences by demographic characteristics, using original and DIF-adjusted scores, were the same overall but differed for some subgroup comparisons involving ELSA. Discussion. Testing and adjusting for DIF is one means of minimizing measurement error in cross-national survey comparisons. IRT methods were used to evaluate potential measurement bias in disability comparisons across three international surveys of aging. The analysis also suggested DIF was mitigated for scales including both ADL and IADL and that summary indexes (counts of limitations) likely underestimate mean disability in these international populations. PMID:22156662
Assessing depression outcome in patients with moderate dementia: sensitivity of the HoNOS65+ scale.
Canuto, Alessandra; Rudhard-Thomazic, Valérie; Herrmann, François R; Delaloye, Christophe; Giannakopoulos, Panteleimon; Weber, Kerstin
2009-08-15
To date, there is no widely accepted clinical scale to monitor the evolution of depressive symptoms in demented patients. We assessed the sensitivity to treatment of a validated French version of the Health of the Nation Outcome Scale (HoNOS) 65+ compared to five routinely used scales. Thirty elderly inpatients with ICD-10 diagnosis of dementia and depression were evaluated at admission and discharge using paired t-test. Using the Brief Psychiatric Rating Scale (BPRS) "depressive mood" item as gold standard, a receiver operating characteristic curve (ROC) analysis assessed the validity of HoNOS65+F "depressive symptoms" item score changes. Unlike Geriatric Depression Scale, Mini Mental State Examination and Activities of Daily Living scores, BPRS scores decreased and Global Assessment Functioning Scale score increased significantly from admission to discharge. Amongst HoNOS65+F items, "behavioural disturbance", "depressive symptoms", "activities of daily life" and "drug management" items showed highly significant changes between the first and last day of hospitalization. The ROC analysis revealed that changes in the HoNOS65+F "depressive symptoms" item correctly classified 93% of the cases with good sensitivity (0.95) and specificity (0.88) values. These data suggest that the HoNOS65+F "depressive symptoms" item may provide a valid assessment of the evolution of depressive symptoms in demented patients.
Zhao, Yue
2017-03-01
In patient-reported outcome research that utilizes item response theory (IRT), using statistical significance tests to detect misfit is usually the focus of IRT model-data fit evaluations. However, such evaluations rarely address the impact/consequence of using misfitting items on the intended clinical applications. This study was designed to evaluate the impact of IRT item misfit on score estimates and severity classifications and to demonstrate a recommended process of model-fit evaluation. Using secondary data sources collected from the Patient-Reported Outcome Measurement Information System (PROMIS) wave 1 testing phase, analyses were conducted based on PROMIS depression (28 items; 782 cases) and pain interference (41 items; 845 cases) item banks. The identification of misfitting items was assessed using Orlando and Thissen's summed-score item-fit statistics and graphical displays. The impact of misfit was evaluated according to the agreement of both IRT-derived T-scores and severity classifications between inclusion and exclusion of misfitting items. The examination of the presence and impact of misfit suggested that item misfit had a negligible impact on the T-score estimates and severity classifications with the general population sample in the PROMIS depression and pain interference item banks, implying that the impact of item misfit was insignificant. Findings support the T-score estimates in the two item banks as robust against item misfit at both the group and individual levels and add confidence to the use of T-scores for severity diagnosis in the studied sample. Recommendations on approaches for identifying item misfit (statistical significance) and assessing the misfit impact (practical significance) are given.
Paz, Sylvia H; Spritzer, Karen L; Reise, Steven P; Hays, Ron D
2017-06-01
About 70% of Latinos, 5 years old or older, in the United States speak Spanish at home. Measurement equivalence of the PROMIS ® pain interference (PI) item bank by language of administration (English versus Spanish) has not been evaluated. A sample of 527 adult Spanish-speaking Latinos completed the Spanish version of the 41-item PROMIS ® pain interference item bank. We evaluate dimensionality, monotonicity and local independence of the Spanish-language items. Then we evaluate differential item functioning (DIF) using ordinal logistic regression with item response theory scores estimated from DIF-free "anchor" items. One of the 41 items in the Spanish version of the PROMIS ® PI item bank was identified as having significant uniform DIF. English- and Spanish-speaking subjects with the same level of pain interference responded differently to 1 of the 41 items in the PROMIS ® PI item bank. This item was not retained due to proprietary issues. The original English language item parameters can be used when estimating PROMIS ® PI scores.
Yoza, Yoshiyasu; Ariyoshi, Koya; Honda, Sumihisa; Taniguchi, Hiroyuki; Senjyu, Hideaki
2009-10-01
Patients with COPD often experience restriction in their activities of daily living (ADL) due to dyspnea. This type of restriction is unique to patients with COPD and cannot be adequately evaluated by the generic ADL scales. This study developed an ADL scale (the Activity of Daily Living Dyspnea scale [ADL-D scale]) for patients with COPD and investigated its validity and internal consistency. Patients with stable COPD were recruited and completed a pilot 26-item questionnaire. Patients also performed the Incremental Shuttle Walk Test (ISWT), and completed the St George's Respiratory Questionnaire (SGRQ), and Medical Research Council (MRC) dyspnea grade. There were 83 male participants who completed the pilot questionnaire. Following the pilot, 8 items that were not undertaken by the majority of subjects, and 3 items judged to be of low clinical importance by physical therapists were removed from the pilot questionnaire. The final ADL-D scale contained 15 items. Scores obtained with the ADL-D scale were significantly correlated with the MRC dyspnea grades, distance walked on the ISWT and SGRQ scores. The ADL-D scores were significantly different across the five grades of the MRC dyspnea grade. The ADL-D scale showed high consistency (Chronbach's alpha coefficient of 0.96). The ADL-D scale is a useful scale for assessing impairments in ADL in Japanese male patients with COPD.
Honda, Yukiko; Meguro, Kenichi; Meguro, Mitsue; Akanuma, Kyoko
2013-01-01
Patients with vascular dementia (VaD) are often isolated, withdrawn from society because of negative symptoms and functional disabilities. The aim of this study was to detect factors associated with social withdrawal in patients with VaD. The participants were 36 institutionalized patients with VaD. Social withdrawal was assessed with the social withdrawal of the Multidimensional Observation Scale for Elderly Subjects (MOSES). Possible explanatory variables were the MOSES items depression and self-care, Cognitive Abilities Screening Instrument (CASI), apathy evaluation scale (AES), and Behavioral Pathology in Alzheimer's Disease Frequency-Weighted Severity Scale (BEHAVE-AD-FW). Multiple regression analyses were conducted for two groups: Analysis 1 was performed in all patients (N = 36) and Analysis 2 was performed in the patients with the ability to move by themselves (i.e., independent walking or independent movement with a cane or a wheelchair; n = 28). In Analysis 1, MOSES item social withdrawal was correlated with AES and MOSES item self-care. In Analysis 2, MOSES item social withdrawal was correlated with AES and CASI domain abstraction and judgment. Decreased social activities of VaD were not related to general cognitive function or depression. Disturbed activities of daily living (ADLs) for self-care may involve decreased frontal lobe function, indicating that comprehensive rehabilitation for both ADL and dementia are needed to improve the social activities of patients with VaD.
Crins, Martine H. P.; Roorda, Leo D.; Smits, Niels; de Vet, Henrica C. W.; Westhovens, Rene; Cella, David; Cook, Karon F.; Revicki, Dennis; van Leeuwen, Jaap; Boers, Maarten; Dekker, Joost; Terwee, Caroline B.
2015-01-01
The Dutch-Flemish PROMIS Group translated the adult PROMIS Pain Interference item bank into Dutch-Flemish. The aims of the current study were to calibrate the parameters of these items using an item response theory (IRT) model, to evaluate the cross-cultural validity of the Dutch-Flemish translations compared to the original English items, and to evaluate their reliability and construct validity. The 40 items in the bank were completed by 1085 Dutch chronic pain patients. Before calibrating the items, IRT model assumptions were evaluated using confirmatory factor analysis (CFA). Items were calibrated using the graded response model (GRM), an IRT model appropriate for items with more than two response options. To evaluate cross-cultural validity, differential item functioning (DIF) for language (Dutch vs. English) was examined. Reliability was evaluated based on standard errors and Cronbach’s alpha. To evaluate construct validity correlations with scores on legacy instruments (e.g., the Disabilities of the Arm, Shoulder and Hand Questionnaire) were calculated. Unidimensionality of the Dutch-Flemish PROMIS Pain Interference item bank was supported by CFA tests of model fit (CFI = 0.986, TLI = 0.986). Furthermore, the data fit the GRM and showed good coverage across the pain interference continuum (threshold-parameters range: -3.04 to 3.44). The Dutch-Flemish PROMIS Pain Interference item bank has good cross-cultural validity (only two out of 40 items showing DIF), good reliability (Cronbach’s alpha = 0.98), and good construct validity (Pearson correlations between 0.62 and 0.75). A computer adaptive test (CAT) and Dutch-Flemish PROMIS short forms of the Dutch-Flemish PROMIS Pain Interference item bank can now be developed. PMID:26214178
Crins, Martine H P; Roorda, Leo D; Smits, Niels; de Vet, Henrica C W; Westhovens, Rene; Cella, David; Cook, Karon F; Revicki, Dennis; van Leeuwen, Jaap; Boers, Maarten; Dekker, Joost; Terwee, Caroline B
2015-01-01
The Dutch-Flemish PROMIS Group translated the adult PROMIS Pain Interference item bank into Dutch-Flemish. The aims of the current study were to calibrate the parameters of these items using an item response theory (IRT) model, to evaluate the cross-cultural validity of the Dutch-Flemish translations compared to the original English items, and to evaluate their reliability and construct validity. The 40 items in the bank were completed by 1085 Dutch chronic pain patients. Before calibrating the items, IRT model assumptions were evaluated using confirmatory factor analysis (CFA). Items were calibrated using the graded response model (GRM), an IRT model appropriate for items with more than two response options. To evaluate cross-cultural validity, differential item functioning (DIF) for language (Dutch vs. English) was examined. Reliability was evaluated based on standard errors and Cronbach's alpha. To evaluate construct validity correlations with scores on legacy instruments (e.g., the Disabilities of the Arm, Shoulder and Hand Questionnaire) were calculated. Unidimensionality of the Dutch-Flemish PROMIS Pain Interference item bank was supported by CFA tests of model fit (CFI = 0.986, TLI = 0.986). Furthermore, the data fit the GRM and showed good coverage across the pain interference continuum (threshold-parameters range: -3.04 to 3.44). The Dutch-Flemish PROMIS Pain Interference item bank has good cross-cultural validity (only two out of 40 items showing DIF), good reliability (Cronbach's alpha = 0.98), and good construct validity (Pearson correlations between 0.62 and 0.75). A computer adaptive test (CAT) and Dutch-Flemish PROMIS short forms of the Dutch-Flemish PROMIS Pain Interference item bank can now be developed.
Castle, Cameron; Gray, Andrew; Neehoff, Shona; Glue, Paul
2017-10-01
Patients receiving ketamine for refractory depression and anxiety report dissociative symptoms in the first 60 min post-dose. The most commonly used instrument to assess this is the Clinician-Administered Dissociative States Scale (CADSS), developed based on the assessment of patients with dissociative symptoms. Its psychometric properties for ketamine-induced dissociation have not been reported. We evaluated these from a study using 0.25-1 mg/kg ketamine and midazolam (as an active control) in 18 patients with treatment-resistant anxiety. Dissociation ratings were increased by ketamine in a dose-dependent manner. In contrast, midazolam showed no effect on ratings of dissociation. For individual CADSS items, the magnitude of change and the ketamine dose at which changes were observed were not homogenous. The Cronbach alpha for the total scale was high (0.937), with acceptable item-rest correlations for almost all individual items. Purposefully removing items to maximise alpha did not lead to meaningful improvements. Acceptable internal consistency was still observed after removing items which lacked evidence of responsiveness at lower doses. The high Cronbach alpha values identified in this study suggests that the CADSS is an internally consistent instrument for evaluating ketamine-induced dissociation in clinical trials in anxiety, although it does not capture symptoms such as thought disorder.
Assessment of the U937 cell line for the detection of contact allergens
DOE Office of Scientific and Technical Information (OSTI.GOV)
Python, Francois; Goebel, Carsten; Aeby, Pierre
2007-04-15
The human myeloid cell line U937 was evaluated as an in vitro test system to identify contact sensitizers in order to develop alternatives to animal tests for the cosmetic industry. Specific culture conditions (i.e., presence of interleukin-4, IL-4) were applied to obtain a dendritic cell-like phenotype. In the described test protocol, these cells were exposed to test chemicals and then analyzed by flow cytometry for CD86 expression and by quantitative real-time reverse transcriptase-polymerase chain reaction for IL-1{beta} and IL-8 gene expressions. Eight sensitizers, three non-sensitizers and five oxidative hair dye precursors were examined after 24-, 48- and 72-h exposure times.more » Test item-specific modulations of the chosen activation markers (CD86, IL-1{beta} and IL-8) suggest that this U937 activation test could discriminate test items classified as contact sensitizers or non-sensitizers in the local lymph node assay in mice (LLNA). More specifically, a test item can be considered as a potential sensitizer when it significantly induced the upregulation of the expression of at least two markers. Using this approach, we could correctly evaluate the dendritic cell (DC) activation potential for 15 out of 16 tested chemicals. We conclude that the U937 activation test may represent an useful tool in a future in vitro test battery for predicting sensitizing properties of chemicals.« less
Characterizing and modeling the dynamics of activity and popularity.
Zhang, Peng; Li, Menghui; Gao, Liang; Fan, Ying; Di, Zengru
2014-01-01
Social media, regarded as two-layer networks consisting of users and items, turn out to be the most important channels for access to massive information in the era of Web 2.0. The dynamics of human activity and item popularity is a crucial issue in social media networks. In this paper, by analyzing the growth of user activity and item popularity in four empirical social media networks, i.e., Amazon, Flickr, Delicious and Wikipedia, it is found that cross links between users and items are more likely to be created by active users and to be acquired by popular items, where user activity and item popularity are measured by the number of cross links associated with users and items. This indicates that users generally trace popular items, overall. However, it is found that the inactive users more severely trace popular items than the active users. Inspired by empirical analysis, we propose an evolving model for such networks, in which the evolution is driven only by two-step random walk. Numerical experiments verified that the model can qualitatively reproduce the distributions of user activity and item popularity observed in empirical networks. These results might shed light on the understandings of micro dynamics of activity and popularity in social media networks.
Characterizing and Modeling the Dynamics of Activity and Popularity
Zhang, Peng; Li, Menghui; Gao, Liang; Fan, Ying; Di, Zengru
2014-01-01
Social media, regarded as two-layer networks consisting of users and items, turn out to be the most important channels for access to massive information in the era of Web 2.0. The dynamics of human activity and item popularity is a crucial issue in social media networks. In this paper, by analyzing the growth of user activity and item popularity in four empirical social media networks, i.e., Amazon, Flickr, Delicious and Wikipedia, it is found that cross links between users and items are more likely to be created by active users and to be acquired by popular items, where user activity and item popularity are measured by the number of cross links associated with users and items. This indicates that users generally trace popular items, overall. However, it is found that the inactive users more severely trace popular items than the active users. Inspired by empirical analysis, we propose an evolving model for such networks, in which the evolution is driven only by two-step random walk. Numerical experiments verified that the model can qualitatively reproduce the distributions of user activity and item popularity observed in empirical networks. These results might shed light on the understandings of micro dynamics of activity and popularity in social media networks. PMID:24586586
Validity of Computer Adaptive Tests of Daily Routines for Youth with Spinal Cord Injury
Haley, Stephen M.
2013-01-01
Objective: To evaluate the accuracy of computer adaptive tests (CATs) of daily routines for child- and parent-reported outcomes following pediatric spinal cord injury (SCI) and to evaluate the validity of the scales. Methods: One hundred ninety-six daily routine items were administered to 381 youths and 322 parents. Pearson correlations, intraclass correlation coefficients (ICC), and 95% confidence intervals (CI) were calculated to evaluate the accuracy of simulated 5-item, 10-item, and 15-item CATs against the full-item banks and to evaluate concurrent validity. Independent samples t tests and analysis of variance were used to evaluate the ability of the daily routine scales to discriminate between children with tetraplegia and paraplegia and among 5 motor groups. Results: ICC and 95% CI demonstrated that simulated 5-, 10-, and 15-item CATs accurately represented the full-item banks for both child- and parent-report scales. The daily routine scales demonstrated discriminative validity, except between 2 motor groups of children with paraplegia. Concurrent validity of the daily routine scales was demonstrated through significant relationships with the FIM scores. Conclusion: Child- and parent-reported outcomes of daily routines can be obtained using CATs with the same relative precision of a full-item bank. Five-item, 10-item, and 15-item CATs have discriminative and concurrent validity. PMID:23671380
Evaluation of mercury in the liquid waste processing facilities
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jain, Vijay; Shah, Hasmukh; Occhipinti, John E.
2015-08-13
This report provides a summary of Phase I activities conducted to support an Integrated Evaluation of Mercury in Liquid Waste System (LWS) Processing Facilities. Phase I activities included a review and assessment of the liquid waste inventory and chemical processing behavior of mercury using a system by system review methodology approach. Gaps in understanding mercury behavior as well as action items from the structured reviews are being tracked. 64% of the gaps and actions have been resolved.
Validation of the Modified Fatigue Impact Scale in mild to moderate traumatic brain injury.
Schiehser, Dawn M; Delano-Wood, Lisa; Jak, Amy J; Matthews, Scott C; Simmons, Alan N; Jacobson, Mark W; Filoteo, J Vincent; Bondi, Mark W; Orff, Henry J; Liu, Lin
2015-01-01
To evaluate the validity of the Modified Fatigue Impact Scale (MFIS) in veterans with a history of mild to moderate traumatic brain injury (TBI). Veterans (N = 106) with mild (92%) or moderate (8%) TBI. Veterans Administration Health System. Factor structure, internal consistency, convergent validity, sensitivity, and specificity of the MFIS were examined. Principal component analysis identified 2 viable MFIS factors: a Cognitive subscale and a Physical/Activities subscale. Item analysis revealed high internal consistency of the MFIS Total scale and subscale items. Strong convergent validity of the MFIS scales was established with 2 Beck Depression Inventory II fatigue items. Receiver operating characteristic curve analysis revealed good to excellent accuracy of the MFIS in classifying fatigued versus nonfatigued individuals. The MFIS is a valid multidimensional measure that can be used to evaluate the impact of fatigue on cognitive and physical functioning in individuals with mild to moderate TBI. The psychometric properties of the MFIS make it useful for evaluating fatigue and provide the potential for improving research on fatigue in this population.
An Initial evaluation of law enforcement overdose training in Rhode Island.
Saucier, Cory D; Zaller, Nickolas; Macmadu, Alexandria; Green, Traci C
2016-05-01
To assess initial change in knowledge, self-efficacy, and anticipated behaviors among Rhode Island law enforcement officers on drug overdose response and prevention. Law enforcement officers (N=316) voluntarily completed a pre-post evaluation immediately before and after taking part in overdose prevention and response trainings. Assessment items included measures of knowledge (Brief Overdose Recognition and Response Assessment (BORRA)), self-efficacy, attitudes toward drugs and overdose prevention, awareness of the Good Samaritan Law, and open-ended items pertaining to overdose knowledge and response behaviors. Non-parametric tests measured within-group and between-group differences. Wilcoxon Signed Rank tests and Kruskal-Wallis tests evaluated changes in BORRA scores and self-efficacy items. McNemar's tests assessed changes regarding the Good Samaritan law and open-ended items. Wilcoxon Signed Rank tests measured post-training change in attitudes. Law enforcement officers demonstrated statistically significant improvements in self-efficacy (identifying signs of opioid overdose, naloxone indication, counseling witnesses in overdose prevention, and referring witnesses for more information), overdose identification knowledge (BORRA mean increased from 7.00 to 10.39), naloxone administration knowledge (BORRA mean increased from 10.15 to 12.59), Good Samaritan Law awareness (17.9% increase after training), and anticipated behaviors in response to future observed overdose (65.7% changed from passive to active response post training). Harm reduction programs can provide law enforcement officers with the knowledge and skills necessary to intervene and reduce overdose mortality. Given the statistically significant improvements in self-efficacy, attitudinal changes, and Good Samaritan law awareness, law enforcement officers are more prepared to actively interact with drug users during a drug-involved emergency. Copyright © 2016. Published by Elsevier Ireland Ltd.
Launch Deployment Assembly Extravehicular Activity Neutral Buoyancy Development Test Report
NASA Technical Reports Server (NTRS)
Loughead, T.
1996-01-01
This test evaluated the Launch Deployment Assembly (LDA) design for Extravehicular Activity (EVA) work sites (setup, igress, egress), reach and visual access, and translation required for cargo item removal. As part of the LDA design, this document describes the method and results of the LDA EVA Neutral Buoyancy Development Test to ensure that the LDA hardware support the deployment of the cargo items from the pallet. This document includes the test objectives, flight and mockup hardware description, descriptions of procedures and data collection used in the testing, and the results of the development test at the National Aeronautics and Space Administrations (NASA) Marshall Space Flight Center (MSFC) Neutral Buoyancy Simulator (NBS).
Johnson, Ray; Henkell, Heather; Simon, Elizabeth; Zhu, John
2008-01-01
This study sought to extend previous results regarding deceptions about specific memories by investigating the role of executive processes in deceptions about evaluative judgments. In addition, given that previous studies of deception have not included valence manipulations, we also wanted to determine whether the goodness/badness aspect of the items would affect the processes used during deception. Thus, we compared behavioral and event-related potential (ERP) activity while participants made truthful and directed lie (i.e., press opposite of the truth) responses about attitude items with which they either strongly agreed or disagreed. Consistent with previous results, deceptive responses required greater cognitive control as indicated by slower RTs, larger medial frontal negativities (MFN) and smaller late positive components than truthful responses. Furthermore, the magnitude of these deception-related effects was dependent on the valence that participants assigned to the items (i.e., agree/disagree). Directed lie responses about attitudes also resulted in greatly reduced pre-response positivities, an indication that participants strategically monitored their responses even in the absence of explicit task demands. Item valence also differentially affected the amplitude of three ERP components in a 650 ms pre-response interval, independently of whether truthful or deceptive responses were made. Analyses using dipole locations based on results from fMRI studies of evaluative judgments and deception indicated a high degree of overlap between the ERP and fMRI results and revealed the possible temporal characteristics of the hemodynamic activations.
Shipping: The World Connection. Student Guide and Teacher Guide. OEAGLS Investigation 12.
ERIC Educational Resources Information Center
Fortner, Rosanne; Pauken, Ray
This unit investigates through three activities the importance of the Great Lakes in international trade. A student workbook and a teaching guide are provided. Included in the teacher's manual are an overview of the unit, a materials list, objectives, teaching suggestions, evaluation items, and answer keys to student activities. In the first…
Wang, Guang-zhi; Li, Wei-guang; He, Wen-jie; Han, Hong-da; Ding, Chi; Ma, Xiao-na; Qu, Yan-ming
2006-10-01
By means of immobilizing five kinds of activated carbon, we studied the influence between the chief activated carbon property items and immobilized bioactivated carbon (IBAC) purification effect with the correlation analysis. The result shows that the activated carbon property items which the correlation coefficient is up 0.7 include molasses, abrasion number, hardness, tannin, uniform coefficient, mean particle diameter and effective particle diameter; the activated carbon property items which the correlation coefficient is up 0.5 include pH, iodine, butane and tetrachloride. In succession, the partial correlation analysis shows that activated carbon property items mostly influencing on IBAC purification effect include molasses, hardness, abrasion number, uniform coefficient, mean particle diameter and effective particle diameter. The causation of these property items bringing influence on IBAC purification is that the activated carbon holes distribution (representative activated carbon property item is molasses) provides inhabitable location and adjust food for the dominance bacteria; the mechanical resist-crash property of activated carbon (representative activated carbon property items: abrasion number and hardness) have influence on the stability of biofilm; and the particle diameter size and distribution of activated carbon (representative activated carbon property items: uniform coefficient, mean particle diameter and effective particle diameter) can directly affect the force of water in IBAC filter bed, which brings influence on the dominance bacteria immobilizing on activated carbon.
Leidy, Nancy Kline; Hamilton, Alan; Becker, Karin
2012-01-01
The performance of daily activities is a major challenge for people with chronic obstructive pulmonary disease (COPD). The Functional Performance Inventory (FPI) was developed based on an analytical framework of functional status and qualitative interviews with COPD patients describing these difficulties. The 65-item FPI was reduced to a 32-item short form (SF) through a systematic process of qualitative and quantitative item reduction and formatted for greater clarity and ease of use. This study examined the content validity of the reduced, reformatted form of the instrument, the FPI-SF. Qualitative cognitive interviews were conducted with COPD patients recruited from three geographically diverse pulmonary clinics in the United States. Interviews were designed to assess respondent interpretation of the instrument, evaluate clarity and ease of completion, and identify any new activities participants found important and difficult to perform that were not represented by the existing items. Twenty subjects comprised the sample; 12 (60%) were male, 14 (70%) were Caucasian, the mean age was 63.0 ± 11.3 years, 12 (60%) were retired, the mean forced expiratory volume in 1 second (FEV(1)) was 1.5 ± 0.5 L, and the mean percent predicted FEV(1) was 48.4% ± 13.1%. Participants understood the FPI-SF as intended, including instructions, items, and response options. Two minor formatting changes were suggested to improve clarity of presentation. Participants found the content of the FPI-SF to be comprehensive, with items covering activities they felt were important and often difficult to perform. These results, together with its development history and previously tested quantitative properties, suggest that the FPI-SF is content valid for use in clinical studies of COPD.
Leidy, Nancy Kline; Hamilton, Alan; Becker, Karin
2012-01-01
Purpose The performance of daily activities is a major challenge for people with chronic obstructive pulmonary disease (COPD). The Functional Performance Inventory (FPI) was developed based on an analytical framework of functional status and qualitative interviews with COPD patients describing these difficulties. The 65-item FPI was reduced to a 32-item short form (SF) through a systematic process of qualitative and quantitative item reduction and formatted for greater clarity and ease of use. This study examined the content validity of the reduced, reformatted form of the instrument, the FPI-SF. Patients and methods Qualitative cognitive interviews were conducted with COPD patients recruited from three geographically diverse pulmonary clinics in the United States. Interviews were designed to assess respondent interpretation of the instrument, evaluate clarity and ease of completion, and identify any new activities participants found important and difficult to perform that were not represented by the existing items. Results Twenty subjects comprised the sample; 12 (60%) were male, 14 (70%) were Caucasian, the mean age was 63.0 ± 11.3 years, 12 (60%) were retired, the mean forced expiratory volume in 1 second (FEV1) was 1.5 ± 0.5 L, and the mean percent predicted FEV1 was 48.4% ± 13.1%. Participants understood the FPI-SF as intended, including instructions, items, and response options. Two minor formatting changes were suggested to improve clarity of presentation. Participants found the content of the FPI-SF to be comprehensive, with items covering activities they felt were important and often difficult to perform. Conclusion These results, together with its development history and previously tested quantitative properties, suggest that the FPI-SF is content valid for use in clinical studies of COPD. PMID:22969295
Weinfurt, Kevin P; Lin, Li; Bruner, Deborah Watkins; Cyranowski, Jill M; Dombeck, Carrie B; Hahn, Elizabeth A; Jeffery, Diana D; Luecht, Richard M; Magasi, Susan; Porter, Laura S; Reese, Jennifer Barsky; Reeve, Bryce B; Shelby, Rebecca A; Smith, Ashley Wilder; Willse, John T; Flynn, Kathryn E
2015-09-01
The Patient-Reported Outcomes Measurement Information System (PROMIS)(®) Sexual Function and Satisfaction measure (SexFS) version 1.0 was developed with cancer populations. There is a need to expand the SexFS and provide evidence of its validity in diverse populations. The aim of this study was to describe the development of the SexFS v2.0 and present preliminary evidence for its validity. Development built on version 1.0, plus additional review of extant items, discussions with 15 clinical experts, 11 patient focus groups (including individuals with diabetes, heart disease, anxiety, depression, and/or are lesbian, gay, bisexual, or aged 65 or older), 48 cognitive interviews, and psychometric evaluation in a random sample of U.S. adults plus an oversample for specific sexual problems (2281 men, 1686 women). We examined differential item functioning (DIF) by gender and sexual activity. We examined convergent and known-groups validity. The final set of domains includes 11 scored scales (interest in sexual activity, lubrication, vaginal discomfort, clitoral discomfort, labial discomfort, erectile function, orgasm ability, orgasm pleasure, oral dryness, oral discomfort, satisfaction), and six nonscored item pools (screeners, sexual activities, anal discomfort, therapeutic aids, factors interfering with sexual satisfaction, bother). Domains from version 1.0 were reevaluated and improved. Domains considered applicable across gender and sexual activity status, namely interest, orgasm, and satisfaction, were found to have significant DIF. We identified subsets of items in each domain that provided consistent measurement across these important respondent groups. Convergent and known-groups validity was supported. The SexFS version 2.0 has several improvements and enhancements over version 1.0 and other extant measures, including expanded evidence for validity, scores centered around norms for sexually active U.S. adults, new domains, and a final set of items applicable for both men and women and those sexually active with a partner and without. The SexFS is customizable, allowing users to select relevant domains and items for their study. © 2015 International Society for Sexual Medicine.
Coyne, Karin S; Sexton, Chris C; Thompson, Christine; Bavendam, Tamara; Brubaker, Linda
2015-03-01
Urinary urgency is the cardinal symptom of overactive bladder (OAB). However, there is no single instrument that assesses the context, severity, intensity, and daily life impact of urinary urgency. The purpose of this manuscript is to describe the methods and results of the qualitative and quantitative research conducted to develop a new tool for this purpose, the Urgency Questionnaire (UQ). Qualitative data from interviews with patients with urinary urgency were used to develop and refine the items and response options of the UQ. Three studies were used to evaluate psychometric properties: a clinical trial of tolterodine (Detrol; n = 974); a psychometric validation study (n = 163); and a test-retest validation study (n = 47). Item and exploratory factor analysis (EFA) were performed to assess the subscale structure, and the psychometric performance of the resulting scales was evaluated. Fifteen Likert-scale items and four VAS questions were retained. A four-factor solution was shown to best fit the data, with the subscales: Impact on Daily Activities, Time to Control Urgency, Nocturia, and Fear of Incontinence. All subscales and VAS items demonstrated good reliability (Cronbach's α 0.79-0.94), convergent and discriminant validity, and responsiveness to change. The UQ differentiated between OAB patients and controls. The results provide quantitative evidence that urinary urgency, as assessed by the UQ, is a pathological sensation distinctive from the normal urge to void and suggest that the UQ might be a reliable, valid, and responsive instrument for evaluating the severity and HRQL impact of urinary urgency in OAB.
Evaluation of Mercury in Liquid Waste Processing Facilities - Phase I Report
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jain, V.; Occhipinti, J.; Shah, H.
2015-07-01
This report provides a summary of Phase I activities conducted to support an Integrated Evaluation of Mercury in Liquid Waste System (LWS) Processing Facilities. Phase I activities included a review and assessment of the liquid waste inventory and chemical processing behavior of mercury using a system by system review methodology approach. Gaps in understanding mercury behavior as well as action items from the structured reviews are being tracked. 64% of the gaps and actions have been resolved.
Evaluation of mercury in liquid waste processing facilities - Phase I report
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jain, V.; Occhipinti, J. E.; Shah, H.
2015-07-01
This report provides a summary of Phase I activities conducted to support an Integrated Evaluation of Mercury in Liquid Waste System (LWS) Processing Facilities. Phase I activities included a review and assessment of the liquid waste inventory and chemical processing behavior of mercury using a system by system review methodology approach. Gaps in understanding mercury behavior as well as action items from the structured reviews are being tracked. 64% of the gaps and actions have been resolved.
Assessment of visiting activities for young children using the UNAWE Evaluation Guide
NASA Astrophysics Data System (ADS)
Tomita, Akihiko
2015-08-01
When the target is young children and the activity type is play, the assessment of the activity is not easy. The table of domains of active learning shown in the EU Universe Awareness Programme Evaluation Guide is useful for the assessment; the Guide shows the four domains; motivation, scientific skills, universe knowledge, and intercultural attitudes, and many items of objectives in each domains. The Guide can be a basic format and the items can be modified so as to fit each activity. Taking my activity as an example, I will present an assessment using the Guide. The activity I will present is "Uchu no O-hanashi," a visiting activity which includeds slide show, story telling, and enjoying pictures on large sheets for children at nursery, kindergarten, preschool and other sites. In order to obtain the data, I have recorded the voice of children. The analysis method is a kind of qualitative one. I picked up "motivation" and "scientific skills" words from the record when they muttered about and asked each other what they felt, what they found, and what they got excited about. Among the items in the "scientific skills domain," looking at carefully, asking, exchanging opinions, interpreting or trying to interpret, and trying were frequently appeared. Other skills such as devising and confirming were not frequently appeared but they would sometimes appear later at home or at school after the activity. I also picked up the words of children obtaining scientific way of view and attitude through the activity. One example is "It seems that stars float in the sky and do not move. Do they really set like the Sun, our nearest star? I never saw stars set!" A boy was trying to make a new framework for his understanding. This kind of thinking will enrich his or her future "universe knowledge" and "intercultural attitudes."
Calibration of the Dutch-Flemish PROMIS Pain Behavior item bank in patients with chronic pain.
Crins, M H P; Roorda, L D; Smits, N; de Vet, H C W; Westhovens, R; Cella, D; Cook, K F; Revicki, D; van Leeuwen, J; Boers, M; Dekker, J; Terwee, C B
2016-02-01
The aims of the current study were to calibrate the item parameters of the Dutch-Flemish PROMIS Pain Behavior item bank using a sample of Dutch patients with chronic pain and to evaluate cross-cultural validity between the Dutch-Flemish and the US PROMIS Pain Behavior item banks. Furthermore, reliability and construct validity of the Dutch-Flemish PROMIS Pain Behavior item bank were evaluated. The 39 items in the bank were completed by 1042 Dutch patients with chronic pain. To evaluate unidimensionality, a one-factor confirmatory factor analysis (CFA) was performed. A graded response model (GRM) was used to calibrate the items. To evaluate cross-cultural validity, Differential item functioning (DIF) for language (Dutch vs. English) was evaluated. Reliability of the item bank was also examined and construct validity was studied using several legacy instruments, e.g. the Roland Morris Disability Questionnaire. CFA supported the unidimensionality of the Dutch-Flemish PROMIS Pain Behavior item bank (CFI = 0.960, TLI = 0.958), the data also fit the GRM, and demonstrated good coverage across the pain behavior construct (threshold parameters range: -3.42 to 3.54). Analysis showed good cross-cultural validity (only six DIF items), reliability (Cronbach's α = 0.95) and construct validity (all correlations ≥0.53). The Dutch-Flemish PROMIS Pain Behavior item bank was found to have good cross-cultural validity, reliability and construct validity. The development of the Dutch-Flemish PROMIS Pain Behavior item bank will serve as the basis for Dutch-Flemish PROMIS short forms and computer adaptive testing (CAT). © 2015 European Pain Federation - EFIC®
Cain, Kelli L; Gavand, Kavita A; Conway, Terry L; Geremia, Carrie M; Millstein, Rachel A; Frank, Lawrence D; Saelens, Brian E; Adams, Marc A; Glanz, Karen; King, Abby C; Sallis, James F
2017-06-01
Macroscale built environment factors (e.g., street connectivity) are correlated with physical activity. Less-studied but more modifiable microscale elements (e.g., sidewalks) may also influence physical activity, but shorter audit measures of microscale elements are needed to promote wider use. This study evaluated the relation of an abbreviated 54-item streetscape audit tool with multiple measures of physical activity in four age groups. We developed a 54-item version from the original 120-item Microscale Audit of Pedestrian Streetscapes (MAPS). Audits were conducted on 0.25-0.45 mile routes from participant residences toward the nearest nonresidential destination for children (N=758), adolescents (N=897), younger adults (N=1,655), and older adults (N=367). Active transport and leisure physical activity were measured with surveys, and objective physical activity was measured with accelerometers. Items to retain from original MAPS were selected primarily by correlations with physical activity. Mixed linear regression analyses were conducted for MAPS-Abbreviated summary scores, adjusting for demographics, participant clustering, and macroscale walkability. MAPS-Abbreviated and original MAPS total scores correlated r=.94 The MAPS-Abbreviated tool was related similarly to physical activity outcomes as the original MAPS. Destinations and land use, streetscape and walking path characteristics, and overall total scores were significantly related to active transport in all age groups. Street crossing characteristics were related to active transport in children and older adults. Aesthetics and social characteristics were related to leisure physical activity in children and younger adults, and cul-de-sacs were related with physical activity in youth. Total scores were related to accelerometer-measured physical activity in children and older adults. MAPS-Abbreviated is a validated observational measure for use in research. The length and related cost of implementation has been cited as a barrier to use of microscale instruments, so availability of this shorter validated measure could lead to more widespread use of streetscape audits in health research.
Cain, Kelli L.; Gavand, Kavita A.; Conway, Terry L.; Geremia, Carrie M.; Millstein, Rachel A.; Frank, Lawrence D.; Saelens, Brian E.; Adams, Marc A.; Glanz, Karen; King, Abby C.; Sallis, James F.
2017-01-01
Purpose Macroscale built environment factors (e.g., street connectivity) are correlated with physical activity. Less-studied but more modifiable microscale elements (e.g., sidewalks) may also influence physical activity, but shorter audit measures of microscale elements are needed to promote wider use. This study evaluated the relation of an abbreviated 54-item streetscape audit tool with multiple measures of physical activity in four age groups. Methods We developed a 54-item version from the original 120-item Microscale Audit of Pedestrian Streetscapes (MAPS). Audits were conducted on 0.25-0.45 mile routes from participant residences toward the nearest nonresidential destination for children (N=758), adolescents (N=897), younger adults (N=1,655), and older adults (N=367). Active transport and leisure physical activity were measured with surveys, and objective physical activity was measured with accelerometers. Items to retain from original MAPS were selected primarily by correlations with physical activity. Mixed linear regression analyses were conducted for MAPS-Abbreviated summary scores, adjusting for demographics, participant clustering, and macroscale walkability. Results MAPS-Abbreviated and original MAPS total scores correlated r=.94 The MAPS-Abbreviated tool was related similarly to physical activity outcomes as the original MAPS. Destinations and land use, streetscape and walking path characteristics, and overall total scores were significantly related to active transport in all age groups. Street crossing characteristics were related to active transport in children and older adults. Aesthetics and social characteristics were related to leisure physical activity in children and younger adults, and cul-de-sacs were related with physical activity in youth. Total scores were related to accelerometer-measured physical activity in children and older adults. Conclusion MAPS-Abbreviated is a validated observational measure for use in research. The length and related cost of implementation has been cited as a barrier to use of microscale instruments, so availability of this shorter validated measure could lead to more widespread use of streetscape audits in health research. PMID:29270361
Meyer-Bahlburg, Heino F L; Dolezal, Curtis; Zucker, Kenneth J; Kessler, Suzanna J; Schober, Justine M; New, Maria I
2006-11-01
We administered the 18-item Recalled Childhood Gender Questionnaire-Revised (RCGQ-R), female version, to 147 adult women with congenital adrenal hyperplasia (CAH) representing three different degrees of prenatal androgenization due to 21-hydroxylase deficiency and to non-CAH controls. A principal components analysis generated three components accounting for 46%, 9%, and 6% of the variance, respectively. Corresponding unit-weighted scales (high scores = feminine) were labeled Gender Role (13 items; Cronbach alpha = .91), Physical Activity (3 items; alpha = .64), and Cross-Gender Desire (2 items; alpha = .47). Discriminant validity was demonstrated in terms of highly significant comparisons across the four groups. We conclude that the first 2 RCGQ-R scales show good psychometric qualities, but that the third scale needs to be further evaluated in a sample that includes women with gender identity disorder.
Shillingsburg, M Alice; Powell, Nicole M; Bowen, Crystal N
2013-01-01
Mand training is often a primary focus in early language instruction and typically includes mands that are positively reinforced. However, mands maintained by negative reinforcement are also important skills to teach. These include mands to escape aversive demands or unwanted items. Another type of negatively reinforced mand important to teach involves the removal of a stimulus that prevents access to a preferred activity. We taught 5 participants diagnosed with autism spectrum disorders to mand for the removal of a stimulus in order to access a preferred item that had been blocked. An evaluation was conducted to determine if participants responded differentially when the establishing operations for the preferred item were present versus absent. All participants learned to mand for the removal of the stimulus exclusively under conditions when the establishing operation was present.
Establishing generative yes/no responses in developmentally disabled children.
Neef, N A; Walters, J; Egel, A L
1984-01-01
We evaluated the effects of two procedures for teaching four developmentally disabled children to respond yes/no appropriately. During baseline, tutoring was conducted in which five known items were individually presented with the question, "Is this a ----?", followed either by access to requested items or by remedial prompting contingent on responding. When tutoring did not improve performance, instruction was embedded in the regular classroom activities. In this condition, items requested by students were either presented or withheld on the basis of their response to the question, "Do you want ----?". Increases in correct responding were confirmed by a multiple-baseline design across all four students and were maintained with the introduction of new items. However, generalization to "Is this a ----?" questions did not occur in the tutoring setting until specifically programmed. Subsequently, students also demonstrated appropriate yes/no responding to questions involving actions, possession, and spatial relations. PMID:6526766
MAHR, ALFRED D.; NEOGI, TUHINA; LAVALLEY, MICHAEL P.; DAVIS, JOHN C.; HOFFMAN, GARY S.; MCCUNE, W. JOSEPH; SPECKS, ULRICH; SPIERA, ROBERT F.; ST.CLAIR, E. WILLIAM; STONE, JOHN H.; MERKEL, PETER A.
2013-01-01
Objective To assess the Birmingham Vasculitis Activity Score for Wegener's Granulomatosis (BVAS/WG) with respect to its selection and weighting of items. Methods This study used the BVAS/WG data from the Wegener's Granulomatosis Etanercept Trial. The scoring frequencies of the 34 predefined items and any “other” items added by clinicians were calculated. Using linear regression with generalized estimating equations in which the physician global assessment (PGA) of disease activity was the dependent variable, we computed weights for all predefined items. We also created variables for clinical manifestations frequently added as other items, and computed weights for these as well. We searched for the model that included the items and their generated weights yielding an activity score with the highest R2 to predict the PGA. Results We analyzed 2,044 BVAS/WG assessments from 180 patients; 734 assessments were scored during active disease. The highest R2 with the PGA was obtained by scoring WG activity based on the following items: the 25 predefined items rated on ≥5 visits, the 2 newly created fatigue and weight loss variables, the remaining minor other and major other items, and a variable that signified whether new or worse items were present at a specific visit. The weights assigned to the items ranged from 1 to 21. Compared with the original BVAS/WG, this modified score correlated significantly more strongly with the PGA. Conclusion This study suggests possibilities to enhance the item selection and weighting of the BVAS/WG. These changes may increase this instrument's ability to capture the continuum of disease activity in WG. PMID:18512722
Measurement of self-evaluative motives: a shopping scenario.
Wajda, Theresa A; Kolbe, Richard; Hu, Michael Y; Cui, Annie Peng
2008-08-01
To develop measures of consumers' self-evaluative motives of Self-verification, Self-enhancement, and Self-improvement within the context of a mall shopping environment, an initial set of 49 items was generated by conducting three focus-group sessions. These items were subsequently converted into shopping-dependent motive statements. 250 undergraduate college students responded on a 7-point scale to each statement as these related to the acquisition of recent personal shopping goods. An exploratory factor analysis yielded five factors, accounting for 57.7% of the variance, three of which corresponded to the Self-verification motive (five items), Self-enhancement motive (three items), and Self-improvement motive (six items). These 14 items, along with 9 reconstructed items, yielded 23 items retained and subjected to additional testing. In a final round of data collection, 169 college students provided data for exploratory factor analysis. 11 items were used in confirmatory factor analysis. Analysis indicated that the 11-item scale adequately captured measures of the three self-evaluative motives. However, further data reduction produced a 9-item scale with marked improvement in statistical fit over the 11-item scale.
Schmitt, Andreas; Gahr, Annika; Hermanns, Norbert; Kulzer, Bernhard; Huber, Jörg; Haak, Thomas
2013-08-13
Though several questionnaires on self-care and regimen adherence have been introduced, the evaluations do not always report consistent and substantial correlations with measures of glycaemic control. Small ability to explain variance in HbA1c constitutes a significant limitation of an instrument's use for scientific purposes as well as clinical practice. In order to assess self-care activities which can predict glycaemic control, the Diabetes Self-Management Questionnaire (DSMQ) was designed. A 16 item questionnaire to assess self-care activities associated with glycaemic control was developed, based on theoretical considerations and a process of empirical improvements. Four subscales, 'Glucose Management' (GM), 'Dietary Control' (DC), 'Physical Activity' (PA), and 'Health-Care Use' (HU), as well as a 'Sum Scale' (SS) as a global measure of self-care were derived. To evaluate its psychometric quality, 261 patients with type 1 or 2 diabetes were assessed with the DSMQ and an established analogous scale, the Summary of Diabetes Self-Care Activities Measure (SDSCA). The DSMQ's item and scale characteristics as well as factorial and convergent validity were analysed, and its convergence with HbA1c was compared to the SDSCA. The items showed appropriate characteristics (mean item-total-correlation: 0.46 ± 0.12; mean correlation with HbA1c: -0.23 ± 0.09). Overall internal consistency (Cronbach's alpha) was good (0.84), consistencies of the subscales were acceptable (GM: 0.77; DC: 0.77; PA: 0.76; HU: 0.60). Principal component analysis indicated a four factor structure and confirmed the designed scale structure. Confirmatory factor analysis indicated appropriate fit of the four factor model. The DSMQ scales showed significant convergent correlations with their parallel SDSCA scales (GM: 0.57; DC: 0.52; PA: 0.58; HU: n/a; SS: 0.57) and HbA1c (GM: -0.39; DC: -0.30; PA: -0.15; HU: -0.22; SS: -0.40). All correlations with HbA1c were significantly stronger than those obtained with the SDSCA. This study provides preliminary evidence that the DSMQ is a reliable and valid instrument and enables an efficient assessment of self-care behaviours associated with glycaemic control. The questionnaire should be valuable for scientific analyses as well as clinical use in both type 1 and type 2 diabetes patients.
Vohs, Kathleen D.; Luciana, Monica; Cuthbert, Bruce N.; MacDonald, Angus W.
2013-01-01
Perceived message effectiveness is often used as a diagnostic tool to determine whether a health message is likely to be successful or needs modification before use in an intervention. Yet, published research on the antecedents of perceived effectiveness is scarce and, consequently, little is known about why a message is perceived to be effective or ineffective. The present study’s aim was to identify and test the affective antecedents of perceived effectiveness of antidrug television messages in a sample of 190 adolescents in the 15–19 year age range. Factor-analytical tests of retrospective message evaluation items suggested two dimensions of perceived effectiveness, one that contained items such as convincingness whereas the other contained pleasantness items. Using retrospective data as well as real time valence and arousal ratings, we found that arousal underlies perceived convincingness and valence underlies perceived pleasantness. The results indicated activation of appetitive and defensive motivational systems, which suggests a clear motivational component to the concept of perceived message effectiveness. PMID:21499729
Marsh, Herbert W; Vallerand, Robert J; Lafrenière, Marc-André K; Parker, Philip; Morin, Alexandre J S; Carbonneau, Noémie; Jowett, Sophia; Bureau, Julien S; Fernet, Claude; Guay, Frédéric; Salah Abduljabbar, Adel; Paquet, Yvan
2013-09-01
The passion scale, based on the dualistic model of passion, measures 2 distinct types of passion: Harmonious and obsessive passions are predictive of adaptive and less adaptive outcomes, respectively. In a substantive-methodological synergy, we evaluate the construct validity (factor structure, reliability, convergent and discriminant validity) of Passion Scale responses (N = 3,571). The exploratory structural equation model fit to the data was substantially better than the confirmatory factor analysis solution, and resulted in better differentiated (less correlated) factors. Results from a 13-model taxonomy of measurement invariance supported complete invariance (factor loadings, factor correlations, item uniquenesses, item intercepts, and latent means) over language (French vs. English; the instrument was originally devised in French, then translated into English) and gender. Strong measurement partial invariance over 5 passion activity groups (leisure, sport, social, work, education) indicates that the same set of items is appropriate for assessing passion across a wide variety of activities--a previously untested, implicit assumption that greatly enhances practical utility. Support was found for the convergent and discriminant validity of the harmonious and obsessive passion scales, based on a set of validity correlates: life satisfaction, rumination, conflict, time investment, activity liking and valuation, and perceiving the activity as a passion.
Procurement engineering - the productivity factor
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bargerstock, S.B.
1993-01-01
The industry is several years on the road to implementation of the Nuclear Management and Resources Council (NUMARC) initiatives on commercial-grade item dedication and procurement. Utilities have taken several approaches to involve engineering in the procurement process. A common result for the approaches is the additional operations and maintenance (O M) cost imposed by the added resource requirements. Procurement engineering productivity is a key element in controlling this business area. Experience shows that 400 to 500% improvements in productivity are possible with a 2-yr period. Improving the productivity of the procurement engineering function is important in today's competitive utility environment.more » Procurement engineering typically involves four distinct technical evaluation responsibilities along with several administrative areas. Technical evaluations include the functionally based safety classification of replacement components and parts (lacking a master parts list), the determination of dedication requirements for safety-related commercial-grade items, the preparation of a procurement specification to maintain the licensed design bases, and the equivalency evaluation of alternate items not requiring the design-change process. Administrative duties include obtaining technical review of vendor-supplied documentation, identifying obsolete parts and components, resolving material nonconformances, initiating the design-change process for replacement items (as needed), and providing technical support to O M. Although most utilities may not perform or require all the noted activities, a large percentage will apply to each utility station.« less
ERIC Educational Resources Information Center
Haley, Stephen M.; Coster, Wendy J.; Dumas, Helene M.; Fragala-Pinkham, Maria A.; Kramer, Jessica; Ni, Pengsheng; Tian, Feng; Kao, Ying-Chia; Moed, Rich; Ludlow, Larry H.
2011-01-01
Aim: The aims of the study were to: (1) build new item banks for a revised version of the Pediatric Evaluation of Disability Inventory (PEDI) with four content domains: daily activities, mobility, social/cognitive, and responsibility; and (2) use post-hoc simulations based on the combined normative and disability calibration samples to assess the…
Development of an item bank for computerized adaptive test (CAT) measurement of pain.
Petersen, Morten Aa; Aaronson, Neil K; Chie, Wei-Chu; Conroy, Thierry; Costantini, Anna; Hammerlid, Eva; Hjermstad, Marianne J; Kaasa, Stein; Loge, Jon H; Velikova, Galina; Young, Teresa; Groenvold, Mogens
2016-01-01
Patient-reported outcomes should ideally be adapted to the individual patient while maintaining comparability of scores across patients. This is achievable using computerized adaptive testing (CAT). The aim here was to develop an item bank for CAT measurement of the pain domain as measured by the EORTC QLQ-C30 questionnaire. The development process consisted of four steps: (1) literature search, (2) formulation of new items and expert evaluations, (3) pretesting and (4) field-testing and psychometric analyses for the final selection of items. In step 1, we identified 337 pain items from the literature. Twenty-nine new items fitting the QLQ-C30 item style were formulated in step 2 that were reduced to 26 items by expert evaluations. Based on interviews with 31 patients from Denmark, France and the UK, the list was further reduced to 21 items in step 3. In phase 4, responses were obtained from 1103 cancer patients from five countries. Psychometric evaluations showed that 16 items could be retained in a unidimensional item bank. Evaluations indicated that use of the CAT measure may reduce sample size requirements with 15-25% compared to using the QLQ-C30 pain scale. We have established an item bank of 16 items suitable for CAT measurement of pain. While being backward compatible with the QLQ-C30, the new item bank will significantly improve measurement precision of pain. We recommend initiating CAT measurement by screening for pain using the two original QLQ-C30 pain items. The EORTC pain CAT is currently available for "experimental" purposes.
Methodology for Developing and Evaluating the PROMIS® Smoking Item Banks
Cai, Li; Stucky, Brian D.; Tucker, Joan S.; Shadel, William G.; Edelen, Maria Orlando
2014-01-01
Introduction: This article describes the procedures used in the PROMIS® Smoking Initiative for the development and evaluation of item banks, short forms (SFs), and computerized adaptive tests (CATs) for the assessment of 6 constructs related to cigarette smoking: nicotine dependence, coping expectancies, emotional and sensory expectancies, health expectancies, psychosocial expectancies, and social motivations for smoking. Methods: Analyses were conducted using response data from a large national sample of smokers. Items related to each construct were subjected to extensive item factor analyses and evaluation of differential item functioning (DIF). Final item banks were calibrated, and SF assessments were developed for each construct. The performance of the SFs and the potential use of the item banks for CAT administration were examined through simulation study. Results: Item selection based on dimensionality assessment and DIF analyses produced item banks that were essentially unidimensional in structure and free of bias. Simulation studies demonstrated that the constructs could be accurately measured with a relatively small number of carefully selected items, either through fixed SFs or CAT-based assessment. Illustrative results are presented, and subsequent articles provide detailed discussion of each item bank in turn. Conclusions: The development of the PROMIS smoking item banks provides researchers with new tools for measuring smoking-related constructs. The use of the calibrated item banks and suggested SF assessments will enhance the quality of score estimates, thus advancing smoking research. Moreover, the methods used in the current study, including innovative approaches to item selection and SF construction, may have general relevance to item bank development and evaluation. PMID:23943843
Kramer, Jessica M; Coster, Wendy J; Kao, Ying-Chia; Snow, Anne; Orsmond, Gael I
2012-02-01
The use of current adaptive behavior measures in practice and research is limited by their length and need for a professional interviewer. There is a need for alternative measures that more efficiently assess adaptive behavior in children and youth with autism spectrum disorders (ASDs). The Pediatric Evaluation of Disability Inventory-Computer Adaptive Test (PEDI-CAT) is a computer-based assessment of a child's ability to perform activities required for personal self-sufficiency and engagement in the community. This study evaluated the applicability, representativeness, and comprehensiveness of the Daily Activity, Social/Cognitive, and Responsibility domains for children and youth with an ASD. Twenty professionals and 18 parents provided feedback via in-person or virtual focus groups and cognitive interviews. Items were perceived to represent relevant functional activities within each domain. Child factors and assessment characteristics influenced parents' ratings. In response to feedback, 15 items and additional directions were added to ensure the PEDI-CAT is a meaningful measure when used with this population.
2014-01-01
Background Latino preschoolers (3-5 year old children) have among the highest rates of obesity. Low levels of physical activity (PA) are a risk factor for obesity. Characterizing what Latino parents do to encourage or discourage their preschooler to be physically active can help inform interventions to increase their PA. The objective was therefore to develop and assess the psychometrics of a new instrument: the Preschooler Physical Activity Parenting Practices (PPAPP) among a Latino sample, to assess parenting practices used to encourage or discourage PA among preschool-aged children. Methods Cross-sectional study of 240 Latino parents who reported the frequency of using PA parenting practices. 95% of respondents were mothers; 42% had more than a high school education. Child mean age was 4.5 (±0.9) years (52% male). Test-retest reliability was assessed in 20%, 2 weeks later. We assessed the fit of a priori models using Confirmatory factor analyses (CFA). In a separate sub-sample (35%), preschool-aged children wore accelerometers to assess associations with their PA and PPAPP subscales. Results The a-priori models showed poor fit to the data. A modified factor structure for encouraging PPAPP had one multiple-item scale: engagement (15 items), and two single-items (have outdoor toys; not enroll in sport-reverse coded). The final factor structure for discouraging PPAPP had 4 subscales: promote inactive transport (3 items), promote screen time (3 items), psychological control (4 items) and restricting for safety (4 items). Test-retest reliability (ICC) for the two scales ranged from 0.56-0.85. Cronbach’s alphas ranged from 0.5-0.9. Several sub-factors correlated in the expected direction with children’s objectively measured PA. Conclusion The final models for encouraging and discouraging PPAPP had moderate to good fit, with moderate to excellent test-retest reliabilities. The PPAPP should be further evaluated to better assess its associations with children’s PA and offers a new tool for measuring PPAPP among Latino families with preschool-aged children. PMID:24428935
Evaluation of the Psychometric Properties of the Parents' Proxy MPAQ-C in Chinese Population
ERIC Educational Resources Information Center
Leung, Ka Man; Chung, Pak-Kwong; Ransdell, Lynda B.; Gao, Yong
2016-01-01
We examined psychometric properties of a Modified Physical Activity Questionnaire for Children (MPAQ-C). Thirty-two parents (Study 1), 40 students (6-9 years) and one of each student's parents (Study 2), and 625 parents (Study 3) completed the MPAQ-C. The MPAQ-C (six items) measured children's physical activity (PA) after school, and during…
Evaluating Common Item Block Options When Faced with Practical Constraints
ERIC Educational Resources Information Center
Wolkowitz, Amanda; Davis-Becker, Susan
2015-01-01
This study evaluates the impact of common item characteristics on the outcome of equating in credentialing examinations when traditionally recommended representation is not possible. This research used real data sets from several credentialing exams to test the impact of content representation, item statistics, and number of common items on…
ERIC Educational Resources Information Center
Quaigrain, Kennedy; Arhin, Ato Kwamina
2017-01-01
Item analysis is essential in improving items which will be used again in later tests; it can also be used to eliminate misleading items in a test. The study focused on item and test quality and explored the relationship between difficulty index (p-value) and discrimination index (DI) with distractor efficiency (DE). The study was conducted among…
Development of the Oxford Participation and Activities Questionnaire: constructing an item pool
Kelly, Laura; Jenkinson, Crispin; Dummett, Sarah; Dawson, Jill; Fitzpatrick, Ray; Morley, David
2015-01-01
Purpose The Oxford Participation and Activities Questionnaire is a patient-reported outcome measure in development that is grounded on the World Health Organization International Classification of Functioning, Disability, and Health (ICF). The study reported here aimed to inform and generate an item pool for the new measure, which is specifically designed for the assessment of participation and activity in patients experiencing a range of health conditions. Methods Items were informed through in-depth interviews conducted with 37 participants spanning a range of conditions. Interviews aimed to identify how their condition impacted their ability to participate in meaningful activities. Conditions included arthritis, cancer, chronic back pain, diabetes, motor neuron disease, multiple sclerosis, Parkinson’s disease, and spinal cord injury. Transcripts were analyzed using the framework method. Statements relating to ICF themes were recast as questionnaire items and shown for review to an expert panel. Cognitive debrief interviews (n=13) were used to assess items for face and content validity. Results ICF themes relevant to activities and participation in everyday life were explored, and a total of 222 items formed the initial item pool. This item pool was refined by the research team and 28 generic items were mapped onto all nine chapters of the ICF construct, detailing activity and participation. Cognitive interviewing confirmed the questionnaire instructions, items, and response options were acceptable to participants. Conclusion Using a clear conceptual basis to inform item generation, 28 items have been identified as suitable to undergo further psychometric testing. A large-scale postal survey will follow in order to refine the instrument further and to assess its psychometric properties. The final instrument is intended for use in clinical trials and interventions targeted at maintaining or improving activity and participation. PMID:26056503
Development of the Oxford Participation and Activities Questionnaire: constructing an item pool.
Kelly, Laura; Jenkinson, Crispin; Dummett, Sarah; Dawson, Jill; Fitzpatrick, Ray; Morley, David
2015-01-01
The Oxford Participation and Activities Questionnaire is a patient-reported outcome measure in development that is grounded on the World Health Organization International Classification of Functioning, Disability, and Health (ICF). The study reported here aimed to inform and generate an item pool for the new measure, which is specifically designed for the assessment of participation and activity in patients experiencing a range of health conditions. Items were informed through in-depth interviews conducted with 37 participants spanning a range of conditions. Interviews aimed to identify how their condition impacted their ability to participate in meaningful activities. Conditions included arthritis, cancer, chronic back pain, diabetes, motor neuron disease, multiple sclerosis, Parkinson's disease, and spinal cord injury. Transcripts were analyzed using the framework method. Statements relating to ICF themes were recast as questionnaire items and shown for review to an expert panel. Cognitive debrief interviews (n=13) were used to assess items for face and content validity. ICF themes relevant to activities and participation in everyday life were explored, and a total of 222 items formed the initial item pool. This item pool was refined by the research team and 28 generic items were mapped onto all nine chapters of the ICF construct, detailing activity and participation. Cognitive interviewing confirmed the questionnaire instructions, items, and response options were acceptable to participants. Using a clear conceptual basis to inform item generation, 28 items have been identified as suitable to undergo further psychometric testing. A large-scale postal survey will follow in order to refine the instrument further and to assess its psychometric properties. The final instrument is intended for use in clinical trials and interventions targeted at maintaining or improving activity and participation.
Prakash, V; Ganesan, Mohan; Vasanthan, R; Hariohm, K
2017-04-01
In India, post-stroke outcomes are determined using functional outcome measures (FOMs), the contents of which have not been validated for their relevance to the Indian population. In this study, we aimed to evaluate the cultural validity of five frequently used stroke-specific FOMs by comparing their contents with the problems reported by patients with stroke in India. Face-to-face structured interviews were conducted with 152 patients diagnosed with stroke in India. Problems and goals identified by the patients were compared to each item included in the FOMs used in stroke rehabilitation. The Stroke Impact Scale (SIS) and the Frenchay Activities Index (FAI) include items related to the most frequently identified problems. However, neither covers problems related to the need for squatting and sitting on the floor. Use of public transport and community walking are not included in the SIS. Leisure and recreational activities (e.g. gardening, reading books), cognitive and speech functions (e.g. memory, thinking) and bowel and bladder dysfunctions were the common items identified as "not a problem" or "not relevant" by the patients. Our findings suggest that the SIS and FAI are the most appropriate FOMs for patients with stroke in India as they include items related to the majority of problems identified by study participants. Many items on both measures, however, were identified as not a problem or not relevant. There is a need for developing culture-specific FOMs that incorporate all major concerns expressed by patients with stroke in India.
Validation of psychosocial scales for physical activity in university students
Tassitano, Rafael Miranda; de Farias, José Cazuza; Rech, Cassiano Ricardo; Tenório, Maria Cecília Marinho; Cabral, Poliana Coelho; da Silva, Giselia Alves Pontes
2015-01-01
OBJECTIVE Translate the Patient-centered Assessment and Counseling for Exercise questionnaire, adapt it cross-culturally and identify the psychometric properties of the psychosocial scales for physical activity in young university students. METHODS The Patient-centered Assessment and Counseling for Exercise questionnaire is made up of 39 items divided into constructs based on the social cognitive theory and the transtheoretical model. The analyzed constructs were, as follows: behavior change strategy (15 items), decision-making process (10), self-efficacy (6), support from family (4), and support from friends (4). The validation procedures were conceptual, semantic, operational, and functional equivalences, in addition to the equivalence of the items and of measurements. The conceptual, of items and semantic equivalences were performed by a specialized committee. During measurement equivalence, the instrument was applied to 717 university students. Exploratory factor analysis was used to verify the loading of each item, explained variance and internal consistency of the constructs. Reproducibility was measured by means of intraclass correlation coefficient. RESULTS The two translations were equivalent and back-translation was similar to the original version, with few adaptations. The layout, presentation order of the constructs and items from the original version were kept in the same form as the original instrument. The sample size was adequate and was evaluated by the Kaiser-Meyer-Olkin test, with values between 0.72 and 0.91. The correlation matrix of the items presented r < 0.8 (p < 0.05). The factor loadings of the items from all the constructs were satisfactory (> 0.40), varying between 0.43 and 0.80, which explained between 45.4% and 59.0% of the variance. Internal consistency was satisfactory (α ≥ 0.70), with support from friends being 0.70 and 0.92 for self-efficacy. Most items (74.3%) presented values above 0.70 for the reproducibility test. CONCLUSIONS The validation process steps were considered satisfactory and adequate for applying to the population. PMID:26270013
Validation of psychosocial scales for physical activity in university students.
Tassitano, Rafael Miranda; de Farias Júnior, José Cazuza; Rech, Cassiano Ricardo; Tenório, Maria Cecília Marinho; Cabral, Poliana Coelho; da Silva, Giselia Alves Pontes
2015-01-01
OBJECTIVE Translate the Patient-centered Assessment and Counseling for Exercise questionnaire, adapt it cross-culturally and identify the psychometric properties of the psychosocial scales for physical activity in young university students. METHODS The Patient-centered Assessment and Counseling for Exercise questionnaire is made up of 39 items divided into constructs based on the social cognitive theory and the transtheoretical model. The analyzed constructs were, as follows: behavior change strategy (15 items), decision-making process (10), self-efficacy (6), support from family (4), and support from friends (4). The validation procedures were conceptual, semantic, operational, and functional equivalences, in addition to the equivalence of the items and of measurements. The conceptual, of items and semantic equivalences were performed by a specialized committee. During measurement equivalence, the instrument was applied to 717 university students. Exploratory factor analysis was used to verify the loading of each item, explained variance and internal consistency of the constructs. Reproducibility was measured by means of intraclass correlation coefficient. RESULTS The two translations were equivalent and back-translation was similar to the original version, with few adaptations. The layout, presentation order of the constructs and items from the original version were kept in the same form as the original instrument. The sample size was adequate and was evaluated by the Kaiser-Meyer-Olkin test, with values between 0.72 and 0.91. The correlation matrix of the items presented r < 0.8 (p < 0.05). The factor loadings of the items from all the constructs were satisfactory (> 0.40), varying between 0.43 and 0.80, which explained between 45.4% and 59.0% of the variance. Internal consistency was satisfactory (α ≥ 0.70), with support from friends being 0.70 and 0.92 for self-efficacy. Most items (74.3%) presented values above 0.70 for the reproducibility test. CONCLUSIONS The validation process steps were considered satisfactory and adequate for applying to the population.
Twiss, J; McKenna, S P; Graham, J; Swetz, K; Sloan, J; Gomberg-Maitland, M
2016-04-09
Electronic formats of patient-reported outcome (PRO) measures are now routinely used in clinical research studies. When changing from a validated paper and pen to electronic administration it is necessary to establish their equivalence. This study reports on the value of Rasch analysis in this process. Three groups of US pulmonary hypertension (PH) patients participated. The first completed an electronic version of the CAMPHOR Activity Limitation scale (e-sample) and this was compared with two pen and paper administrated samples (pp1 and pp2). The three databases were combined and analysed for fit to the Rasch model. Equivalence was evaluated by differential item functioning (DIF) analyses. The three datasets were matched randomly in terms of sample size (n = 147). Mean age (years) and percentage of male respondents were as follows: e-sample (51.7, 16.0 %); pp1 (50.0, 14.0 %); pp2 (55.5, 40.4 %). The combined dataset achieved fit to the Rasch model. Two items showed evidence of borderline DIF. Further analyses showed the inclusion of these items had little impact on Rasch estimates indicating the DIF identified was unimportant. Differences between the performance of the electronic and pen and paper administrations of the CAMPHOR Activity Limitation scale were minor. The results were successful in showing how the Rasch model can be used to determine the equivalence of alternative formats of PRO measures.
Methodology for developing and evaluating the PROMIS smoking item banks.
Hansen, Mark; Cai, Li; Stucky, Brian D; Tucker, Joan S; Shadel, William G; Edelen, Maria Orlando
2014-09-01
This article describes the procedures used in the PROMIS Smoking Initiative for the development and evaluation of item banks, short forms (SFs), and computerized adaptive tests (CATs) for the assessment of 6 constructs related to cigarette smoking: nicotine dependence, coping expectancies, emotional and sensory expectancies, health expectancies, psychosocial expectancies, and social motivations for smoking. Analyses were conducted using response data from a large national sample of smokers. Items related to each construct were subjected to extensive item factor analyses and evaluation of differential item functioning (DIF). Final item banks were calibrated, and SF assessments were developed for each construct. The performance of the SFs and the potential use of the item banks for CAT administration were examined through simulation study. Item selection based on dimensionality assessment and DIF analyses produced item banks that were essentially unidimensional in structure and free of bias. Simulation studies demonstrated that the constructs could be accurately measured with a relatively small number of carefully selected items, either through fixed SFs or CAT-based assessment. Illustrative results are presented, and subsequent articles provide detailed discussion of each item bank in turn. The development of the PROMIS smoking item banks provides researchers with new tools for measuring smoking-related constructs. The use of the calibrated item banks and suggested SF assessments will enhance the quality of score estimates, thus advancing smoking research. Moreover, the methods used in the current study, including innovative approaches to item selection and SF construction, may have general relevance to item bank development and evaluation. © The Author 2013. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Wu, Tzu-Yi; Lin, Chung-Ying; Årestedt, Kristofer; Griffiths, Mark D.; Broström, Anders; Pakpour, Amir H.
2017-01-01
Background and aims The nine-item Internet Gaming Disorder Scale – Short Form (IGDS-SF9) is brief and effective to evaluate Internet Gaming Disorder (IGD) severity. Although its scores show promising psychometric properties, less is known about whether different groups of gamers interpret the items similarly. This study aimed to verify the construct validity of the Persian IGDS-SF9 and examine the scores in relation to gender and hours spent online gaming among 2,363 Iranian adolescents. Methods Confirmatory factor analysis (CFA) and Rasch analysis were used to examine the construct validity of the IGDS-SF9. The effects of gender and time spent online gaming per week were investigated by multigroup CFA and Rasch differential item functioning (DIF). Results The unidimensionality of the IGDS-SF9 was supported in both CFA and Rasch. However, Item 4 (fail to control or cease gaming activities) displayed DIF (DIF contrast = 0.55) slightly over the recommended cutoff in Rasch but was invariant in multigroup CFA across gender. Items 4 (DIF contrast = −0.67) and 9 (jeopardize or lose an important thing because of gaming activity; DIF contrast = 0.61) displayed DIF in Rasch and were non-invariant in multigroup CFA across time spent online gaming. Conclusions Given the Persian IGDS-SF9 was unidimensional, it is concluded that the instrument can be used to assess IGD severity. However, users of the instrument are cautioned concerning the comparisons of the sum scores of the IGDS-SF9 across gender and across adolescents spending different amounts of time online gaming. PMID:28571474
Wu, Tzu-Yi; Lin, Chung-Ying; Årestedt, Kristofer; Griffiths, Mark D; Broström, Anders; Pakpour, Amir H
2017-06-01
Background and aims The nine-item Internet Gaming Disorder Scale - Short Form (IGDS-SF9) is brief and effective to evaluate Internet Gaming Disorder (IGD) severity. Although its scores show promising psychometric properties, less is known about whether different groups of gamers interpret the items similarly. This study aimed to verify the construct validity of the Persian IGDS-SF9 and examine the scores in relation to gender and hours spent online gaming among 2,363 Iranian adolescents. Methods Confirmatory factor analysis (CFA) and Rasch analysis were used to examine the construct validity of the IGDS-SF9. The effects of gender and time spent online gaming per week were investigated by multigroup CFA and Rasch differential item functioning (DIF). Results The unidimensionality of the IGDS-SF9 was supported in both CFA and Rasch. However, Item 4 (fail to control or cease gaming activities) displayed DIF (DIF contrast = 0.55) slightly over the recommended cutoff in Rasch but was invariant in multigroup CFA across gender. Items 4 (DIF contrast = -0.67) and 9 (jeopardize or lose an important thing because of gaming activity; DIF contrast = 0.61) displayed DIF in Rasch and were non-invariant in multigroup CFA across time spent online gaming. Conclusions Given the Persian IGDS-SF9 was unidimensional, it is concluded that the instrument can be used to assess IGD severity. However, users of the instrument are cautioned concerning the comparisons of the sum scores of the IGDS-SF9 across gender and across adolescents spending different amounts of time online gaming.
Writing, Evaluating and Assessing Data Response Items in Economics.
ERIC Educational Resources Information Center
Trotman-Dickenson, D. I.
1989-01-01
Describes some of the problems in writing data response items in economics for use by A Level and General Certificate of Secondary Education (GCSE) students. Examines the experience of two series of workshops on writing items, evaluating them and assessing responses from schools. Offers suggestions for producing packages of data response items as…
Evaluating Item Fit for Multidimensional Item Response Models
ERIC Educational Resources Information Center
Zhang, Bo; Stone, Clement A.
2008-01-01
This research examines the utility of the s-x[superscript 2] statistic proposed by Orlando and Thissen (2000) in evaluating item fit for multidimensional item response models. Monte Carlo simulation was conducted to investigate both the Type I error and statistical power of this fit statistic in analyzing two kinds of multidimensional test…
Choi, Mona; Ahn, Sangwoo; Jung, Dukyoo
2015-01-01
We evaluated the psychometric properties of the Korean version of the Self-Efficacy for Exercise Scale (SEE-K). The SEE-K consists of nine items and was translated into Korean using the forward-backward translation method. We administered it to 212 community-dwelling older adults along with measures of outcome expectation for exercise, quality of life, and physical activity. The validity was determined using confirmatory factor analysis and Rasch analysis with INFIT and OUTFIT statistics, which showed acceptable model fit. The concurrent validity was confirmed according to positive correlations between the SEE-K, outcome expectation for exercise, and quality of life. Furthermore, the high physical activity group had higher SEE-K scores. Finally, the reliability of the SEE-K was deemed acceptable based on Cronbach's alpha, coefficients of determination, and person and item separation indices with reliability. Thus, the SEE-K appears to have satisfactory validity and reliability among older adults in South Korea. Copyright © 2015 Elsevier Inc. All rights reserved.
Development and Testing of the Church Environment Audit Tool.
Kaczynski, Andrew T; Jake-Schoffman, Danielle E; Peters, Nathan A; Dunn, Caroline G; Wilcox, Sara; Forthofer, Melinda
2018-05-01
In this paper, we describe development and reliability testing of a novel tool to evaluate the physical environment of faith-based settings pertaining to opportunities for physical activity (PA) and healthy eating (HE). Tool development was a multistage process including a review of similar tools, stakeholder review, expert feedback, and pilot testing. Final tool sections included indoor opportunities for PA, outdoor opportunities for PA, food preparation equipment, kitchen type, food for purchase, beverages for purchase, and media. Two independent audits were completed at 54 churches. Interrater reliability (IRR) was determined with Kappa and percent agreement. Of 218 items, 102 were assessed for IRR and 116 could not be assessed because they were not present at enough churches. Percent agreement for all 102 items was over 80%. For 42 items, the sample was too homogeneous to assess Kappa. Forty-six of the remaining items had Kappas greater than 0.60 (25 items 0.80-1.00; 21 items 0.60-0.79), indicating substantial to almost perfect agreement. The tool proved reliable and efficient for assessing church environments and identifying potential intervention points. Future work can focus on applications within faith-based partnerships to understand how church environments influence diverse health outcomes.
Cruciani, Fernanda; Adami, Fernando; Assunção, Nathalia Antiqueira; Bergamaschi, Denise Pimentel
2011-01-01
There is a lack of Brazilian questionnaires to assess physical activity in children. The Physical Activity Checklist Interview (PACI) was originally developed for North American children and allows assessing physical activity during the previous day. The objectives of this study were: i) to describe procedures for choosing the PACI for cross-cultural adaptation and ii) to assess conceptual, item, and semantic equivalence of the Brazilian version to be used with 7-to-10-year-old children. PACI was identified from a systematic review of 18 questionnaires. The process of choosing the instrument involved discussions with researchers. The PACI allows assessing the construct and its dimensions. Some kinds of physical activity that are uncommon in the Brazilian population had to be eliminated. The following steps were taken to evaluate semantic equivalence: translation, retranslation, connotative and referential meaning assessment, and a pretest with 24 children aged 7 to 10 years. We present the PACI in its Brazilian adapted version, called Lista de Atividades Físicas (LAF).
15 CFR 752.2 - Eligible activities.
Code of Federal Regulations, 2013 CFR
2013-01-01
...) Service activities. Exporting items subject to the EAR as spare and replacement parts for servicing or stocking. (2) End-user activities. Exporting and reexporting items subject to the EAR for use as capital equipment. (3) Distribution activities. Exporting and reexporting items subject to the EAR for the purpose...
15 CFR 752.2 - Eligible activities.
Code of Federal Regulations, 2014 CFR
2014-01-01
...) Service activities. Exporting items subject to the EAR as spare and replacement parts for servicing or stocking. (2) End-user activities. Exporting and reexporting items subject to the EAR for use as capital equipment. (3) Distribution activities. Exporting and reexporting items subject to the EAR for the purpose...
15 CFR 752.2 - Eligible activities.
Code of Federal Regulations, 2010 CFR
2010-01-01
...) Service activities. Exporting items subject to the EAR as spare and replacement parts for servicing or stocking. (2) End-user activities. Exporting and reexporting items subject to the EAR for use as capital equipment. (3) Distribution activities. Exporting and reexporting items subject to the EAR for the purpose...
15 CFR 752.2 - Eligible activities.
Code of Federal Regulations, 2012 CFR
2012-01-01
...) Service activities. Exporting items subject to the EAR as spare and replacement parts for servicing or stocking. (2) End-user activities. Exporting and reexporting items subject to the EAR for use as capital equipment. (3) Distribution activities. Exporting and reexporting items subject to the EAR for the purpose...
15 CFR 752.2 - Eligible activities.
Code of Federal Regulations, 2011 CFR
2011-01-01
...) Service activities. Exporting items subject to the EAR as spare and replacement parts for servicing or stocking. (2) End-user activities. Exporting and reexporting items subject to the EAR for use as capital equipment. (3) Distribution activities. Exporting and reexporting items subject to the EAR for the purpose...
Dirven, Linda; Groenvold, Mogens; Taphoorn, Martin J B; Conroy, Thierry; Tomaszewski, Krzysztof A; Young, Teresa; Petersen, Morten Aa
2017-11-01
The European Organisation of Research and Treatment of Cancer (EORTC) Quality of Life Group is developing computerized adaptive testing (CAT) versions of all EORTC Quality of Life Questionnaire (QLQ-C30) scales with the aim to enhance measurement precision. Here we present the results on the field-testing and psychometric evaluation of the item bank for cognitive functioning (CF). In previous phases (I-III), 44 candidate items were developed measuring CF in cancer patients. In phase IV, these items were psychometrically evaluated in a large sample of international cancer patients. This evaluation included an assessment of dimensionality, fit to the item response theory (IRT) model, differential item functioning (DIF), and measurement properties. A total of 1030 cancer patients completed the 44 candidate items on CF. Of these, 34 items could be included in a unidimensional IRT model, showing an acceptable fit. Although several items showed DIF, these had a negligible impact on CF estimation. Measurement precision of the item bank was much higher than the two original QLQ-C30 CF items alone, across the whole continuum. Moreover, CAT measurement may on average reduce study sample sizes with about 35-40% compared to the original QLQ-C30 CF scale, without loss of power. A CF item bank for CAT measurement consisting of 34 items was established, applicable to various cancer patients across countries. This CAT measurement system will facilitate precise and efficient assessment of HRQOL of cancer patients, without loss of comparability of results.
Bohman, Benjamin; Rasmussen, Finn; Ghaderi, Ata
2016-10-20
Parental self-efficacy (PSE) refers to beliefs of parents to effectively engage in behaviors that result in desired outcomes for their children. There are several instruments of PSE for promoting healthy dietary or physical activity (PA) behaviors in children. These measures typically assess PSE in relation to some quantity or frequency of behavior, for example, number of servings or times per week. However, measuring PSE in relation to contextual circumstances, for example, psychological states and situational demands, may be a more informative approach. The purpose of the present study was to develop and psychometrically evaluate a context-based PSE instrument. Swedish mothers of five-year-old children (n = 698) responded to the Parental Self-Efficacy for Healthy Dietary and Physical Activity Behaviors in Preschoolers Scale (PDAP) and a questionnaire on dietary and PA behaviors in children. Interviews were conducted to explore participant perceptions of the quality of the PDAP items. Psychometric evaluation was conducted using exploratory and confirmatory factor analyses. Spearman correlations between PSE and child behaviors were examined. Twenty-seven interviews were conducted with participants, who perceived the items as highly comprehensible, relevant and acceptable. A four-factor model of a revised 21-item version of the PDAP fitted the data, with different factors of PSE for promoting healthy dietary or PA behaviors in children depending on whether circumstances were facilitating or impeding successful performance. Internal consistency was excellent for total scale (Cronbach's α = .94), and good for factors (α = .84-.88). Correlations were in the expected direction: positive correlations between PSE and healthy behaviors, and negative correlations between PSE and unhealthy behaviors (all r s s ≤ .32). Psychometric evaluation of the PDAP provided preliminary support of construct validity and internal consistency.
NASA Astrophysics Data System (ADS)
Campbell, Chad Edward
Over the past decade, hundreds of studies have introduced genomics and bioinformatics (GB) curricula and laboratory activities at the undergraduate level. While these publications have facilitated the teaching and learning of cutting-edge content, there has yet to be an evaluation of these assessment tools to determine if they are meeting the quality control benchmarks set forth by the educational research community. An analysis of these assessment tools indicated that <10% referenced any quality control criteria and that none of the assessments met more than one of the quality control benchmarks. In the absence of evidence that these benchmarks had been met, it is unclear whether these assessment tools are capable of generating valid and reliable inferences about student learning. To remedy this situation the development of a robust GB assessment aligned with the quality control benchmarks was undertaken in order to ensure evidence-based evaluation of student learning outcomes. Content validity is a central piece of construct validity, and it must be used to guide instrument and item development. This study reports on: (1) the correspondence of content validity evidence gathered from independent sources; (2) the process of item development using this evidence; (3) the results from a pilot administration of the assessment; (4) the subsequent modification of the assessment based on the pilot administration results and; (5) the results from the second administration of the assessment. Twenty-nine different subtopics within GB (Appendix B: Genomics and Bioinformatics Expert Survey) were developed based on preliminary GB textbook analyses. These subtopics were analyzed using two methods designed to gather content validity evidence: (1) a survey of GB experts (n=61) and (2) a detailed content analyses of GB textbooks (n=6). By including only the subtopics that were shown to have robust support across these sources, 22 GB subtopics were established for inclusion in the assessment. An expert panel subsequently developed, evaluated, and revised two multiple-choice items to align with each of the 22 subtopics, producing a final item pool of 44 items. These items were piloted with student samples of varying content exposure levels. Both Classical Test Theory (CTT) and Item Response Theory (IRT) methodologies were used to evaluate the assessment's validity, reliability and ability inferences, and its ability to differentiate students with different magnitudes of content exposure. A total of 18 items were subsequently modified and reevaluated by an expert panel. The 26 original and 18 modified items were once again piloted with student samples of varying content exposure levels. Both CTT and IRT methodologies were once again used to evaluate student responses in order to evaluate the assessment's validity and reliability inferences as well as its ability to differentiate students with different magnitudes of content exposure. Interviews with students from different content exposure levels were also performed in order to gather convergent validity evidence (external validity evidence) as well as substantive validity evidence. Also included are the limitations of the assessment and a set of guidelines on how the assessment can best be used.
Household Food Security in Isfahan Based on Current Population Survey Adapted Questionnaire
Rafiei, Morteza; Rastegari, Hosein Ali; Ghiasi, Mojdeh; Shahsanaie, Vahid
2013-01-01
Background: Food security is a state in which all people at every time have physical and economic access to adequate food to obviate nutritional needs and live a healthy and active life. Therefore, this study was performed to quantitatively evaluate the household food security in Esfahan using the localized version of US Household Food Security Survey Module (US HFSSM). Methods: This descriptive cross-sectional study was performed in year 2006 on 3000 households of Esfahan. The study instrument used in this work is 18-item US food security module, which is developed into a localized 15-item questionnaire. This study is performed in two stages of families with no children (under 18 years old) and families with children over 18 years old. Results: The results showed that item severity coefficient, ratio of responses given by households and item infit and outfit coefficient in adult's and children's questionnaire respectively. According to obtained data, scale score of +3 in adults group is described as determination limit of slight food insecurity and +6 is stated as the limit for severe food insecurity. For children's group, scale score of +2 is defined to be the limit of slight food insecurity and +5 is the determination limit of severe food insecurity. Conclusions: The main hypothesis of this survey analysis is based on the raw scale score of USFSSM The item of “lack of enough money for buying food” (item 2) and the item of “lack of balanced meal” (3rd item) have the lowest severity coefficient. Then, the ascending rate of item severity continues in first item, 4th item and keeps increasing into 10th item. PMID:24498498
Kosteniuk, Julie G; Wilson, Erin C; Penz, Kelly L; MacLeod, Martha L P; Stewart, Norma J; Kulig, Judith C; Karunanayake, Chandima P; Kilpatrick, Kelley
2016-01-01
To report the development and psychometric evaluation of a scale to measure rural and remote (rural/remote) nurses' perceptions of the engagement of their workplaces in key dimensions of primary health care (PHC). Amidst ongoing PHC reforms, a comprehensive instrument is needed to evaluate the degree to which rural/remote health care settings are involved in the key dimensions that characterize PHC delivery, particularly from the perspective of professionals delivering care. This study followed a three-phase process of instrument development and psychometric evaluation. A literature review and expert consultation informed instrument development in the first phase, followed by an iterative process of content evaluation in the second phase. In the final phase, a pilot survey was undertaken and item discrimination analysis employed to evaluate the internal consistency reliability of each subscale in the preliminary 60-item Primary Health Care Engagement (PHCE) Scale. The 60-item scale was subsequently refined to a 40-item instrument. The pilot survey sample included 89 nurses in current practice who had experience in rural/remote practice settings. Participants completed either a web-based or paper survey from September to December, 2013. Following item discrimination analysis, the 60-item instrument was refined to a 40-item PHCE Scale consisting of 10 subscales, each including three to five items. Alpha estimates of the 10 refined subscales ranged from 0.61 to 0.83, with seven of the subscales demonstrating acceptable reliability (α ⩾ 0.70). The refined 40-item instrument exhibited good internal consistency reliability (α=0.91). The 40-item PHCE Scale may be considered for use in future studies regardless of locale, to measure the extent to which health care professionals perceive their workplaces to be engaged in key dimensions of PHC.
Jones, Salene M W; Ziebell, Rebecca; Walker, Rod; Nekhlyudov, Larissa; Rabin, Borsika A; Nutt, Stephanie; Fujii, Monica; Chubak, Jessica
2016-02-01
Benefit finding has been shown to be beneficial for people with cancer and may be an indication that one is coping adequately with the stress of cancer. This study evaluated the psychometric properties of a four-item benefit finding measure from the cancer survivorship supplement of the Medical Expenditure Panel Survey (MEPS). Long-term survivors (5-10 years post-diagnosis) of breast, prostate, colorectal or lung cancer or melanoma (n = 594) completed the MEPS cancer supplement survey in 2013. Four items asked about benefit finding after the cancer: stronger person, coping better, positive changes and having healthier habits. Information on sociodemographics, disease and activity limitations after the cancer was also collected. We examined factor structure, reliability (Kuder-Richardson 20) and validity. The four benefit finding items did not appear to measure one factor. Three of the benefit finding items (stronger person, coping better, positive changes) were related to gender, receipt of chemotherapy and activity limitations but not cancer stage, time since diagnosis or income. Having healthier habits was unrelated to any sociodemographic or disease variable. Three of the items (stronger person, coping better, positive changes) appeared to have validity as they were related to variables that literature has shown are related to benefit finding. However, having healthier habits is likely measuring a separate but related construct. This short instrument may be used in future studies assessing benefit finding post cancer; however, the four items should be analyzed separately. Copyright © 2015 Elsevier Ltd. All rights reserved.
Adams, Marc A; Ryan, Sherry; Kerr, Jacqueline; Sallis, James F; Patrick, Kevin; Frank, Lawrence D; Norman, Gregory J
2009-01-01
Concurrent validity of Neighborhood Environment Walkability Scale (NEWS) items was evaluated with objective measures of the built environment using geographic information systems (GIS). A sample of 878 parents of children 10 to 16 years old (mean age 43.5 years, SD = 6.8, 34.8% non-White, 63.8% overweight) completed NEWS and the International Physical Activity Questionnaire. GIS was used to develop 1-mile street network buffers around participants' residences. GIS measures of the built environment within participants' buffers included percent of commercial and institutional land uses; number of schools and colleges, recreational facilities, parks, transit stops, and trees; land topography; and traffic congestion. Except for trees and traffic, concordance between the NEWS and GIS measures were significant, with weak to moderate effect sizes (r = -0.09 to -0.36, all P < or = 01). After participants were stratified by physical activity level, stronger concordance was observed among active participants for some measures. A sensitivity analysis of self-reported distance to 15 neighborhood destinations found a 20-minute (compared with 10- or 30-minute) walking threshold generally had the strongest correlations with GIS measures. These findings provide evidence of the concurrent validity of self-reported built environment items with objective measures. Physically active adults may be more knowledgeable about their neighborhood characteristics.
ERIC Educational Resources Information Center
Gattamorta, Karina A.; Penfield, Randall D.; Myers, Nicholas D.
2012-01-01
Measurement invariance is a common consideration in the evaluation of the validity and fairness of test scores when the tested population contains distinct groups of examinees, such as examinees receiving different forms of a translated test. Measurement invariance in polytomous items has traditionally been evaluated at the item-level,…
Mathysen, Danny G P; Aclimandos, Wagih; Roelant, Ella; Wouters, Kristien; Creuzot-Garcher, Catherine; Ringens, Peter J; Hawlina, Marko; Tassignon, Marie-José
2013-11-01
To investigate whether introduction of item-response theory (IRT) analysis, in parallel to the 'traditional' statistical analysis methods available for performance evaluation of multiple T/F items as used in the European Board of Ophthalmology Diploma (EBOD) examination, has proved beneficial, and secondly, to study whether the overall assessment performance of the current written part of EBOD is sufficiently high (KR-20≥ 0.90) to be kept as examination format in future EBOD editions. 'Traditional' analysis methods for individual MCQ item performance comprise P-statistics, Rit-statistics and item discrimination, while overall reliability is evaluated through KR-20 for multiple T/F items. The additional set of statistical analysis methods for the evaluation of EBOD comprises mainly IRT analysis. These analysis techniques are used to monitor whether the introduction of negative marking for incorrect answers (since EBOD 2010) has a positive influence on the statistical performance of EBOD as a whole and its individual test items in particular. Item-response theory analysis demonstrated that item performance parameters should not be evaluated individually, but should be related to one another. Before the introduction of negative marking, the overall EBOD reliability (KR-20) was good though with room for improvement (EBOD 2008: 0.81; EBOD 2009: 0.78). After the introduction of negative marking, the overall reliability of EBOD improved significantly (EBOD 2010: 0.92; EBOD 2011:0.91; EBOD 2012: 0.91). Although many statistical performance parameters are available to evaluate individual items, our study demonstrates that the overall reliability assessment remains the only crucial parameter to be evaluated allowing comparison. While individual item performance analysis is worthwhile to undertake as secondary analysis, drawing final conclusions seems to be more difficult. Performance parameters need to be related, as shown by IRT analysis. Therefore, IRT analysis has proved beneficial for the statistical analysis of EBOD. Introduction of negative marking has led to a significant increase in the reliability (KR-20 > 0.90), indicating that the current examination format can be kept for future EBOD examinations. © 2013 Acta Ophthalmologica Scandinavica Foundation. Published by John Wiley & Sons Ltd.
2010-01-01
Background Measure Yourself Medical Outcome Profile (MYMOP) is a patient generated outcome instrument applicable in the evaluation of both allopathic and complementary medicine treatment. This study aims to adapt MYMOP into Chinese, and to assess its validity, responsiveness and minimally important change values in a sample of patients using Chinese medicine (CM) services. Methods A Chinese version of MYMOP (CMYMOP) is developed by forward-backward-forward translation strategy, expert panel assessment and pilot testing amongst patients. 272 patients aged 18 or above with subjective symptoms in the past 2 weeks were recruited at a CM clinic, and were invited to complete a set of questionnaire containing CMYMOP and SF-36. Follow ups were performed at 2nd and 4th week after consultation, using the same set of questionnaire plus a global rating of change question. Criterion validity of CMYMOP was assessed by its correlation with SF-36 at baseline, and responsiveness was evaluated by calculating the Cohen effect size (ES) of change at two follow ups. Minimally important difference (MID) values were estimated via anchor based method, while minimally detectable difference (MDC) figures were calculated by distribution based method. Results Criterion validity of CMYMOP was demonstrated by negative correlation between CMYMOP Profile scores and all SF-36 domain and summary scores at baseline. For responsiveness between baseline and 4th week follow up, ES of CMYMOP Symptom 1, Activity and Profile reached the moderate change threshold (ES>0.5), while Symptom 2 and Wellbeing reached the weak change threshold (ES>0.2). None of the SF-36 scores reached the moderate change threshold, implying CMYMOP's stronger responsiveness in CM setting. At 2nd week follow up, MID values for Symptom 1, Symptom 2, Wellbeing and Profile items were 0.894, 0.580, 0.263 and 0.516 respectively. For Activity item, MDC figure of 0.808 was adopted to estimate MID. Conclusions The findings support the validity and responsiveness of CMYMOP for capturing patient centred clinical changes within 2 weeks in a CM clinical setting. Further researches are warranted (1) to estimate Activity item MID, (2) to assess the test-retest reliability of CMYMOP, and (3) to perform further MID evaluation using multiple, item specific anchor questions. PMID:20920284
36 CFR 1225.12 - How are records schedules developed?
Code of Federal Regulations, 2010 CFR
2010-07-01
... activity to identify records series, systems, and nonrecord materials. (c) Determine the appropriate scope of the records schedule items, e.g., individual series/system component, work process, group of related work processes, or broad program area. (d) Evaluate the period of time the agency needs each...
36 CFR 1225.12 - How are records schedules developed?
Code of Federal Regulations, 2011 CFR
2011-07-01
... activity to identify records series, systems, and nonrecord materials. (c) Determine the appropriate scope of the records schedule items, e.g., individual series/system component, work process, group of related work processes, or broad program area. (d) Evaluate the period of time the agency needs each...
36 CFR 1225.12 - How are records schedules developed?
Code of Federal Regulations, 2012 CFR
2012-07-01
... activity to identify records series, systems, and nonrecord materials. (c) Determine the appropriate scope of the records schedule items, e.g., individual series/system component, work process, group of related work processes, or broad program area. (d) Evaluate the period of time the agency needs each...
Franco, Marcia Rodrigues; Pinto, Rafael Zambelli; Delbaere, Kim; Eto, Bianca Yumie; Faria, Maíra Sgobbi; Aoyagi, Giovana Ayumi; Steffens, Daniel; Pastre, Carlos Marcelo
2018-02-14
The Iconographical Falls Efficacy Scale (Icon-FES) is an innovative tool to assess concern of falling that uses pictures as visual cues to provide more complete environmental contexts. Advantages of Icon-FES over previous scales include the addition of more demanding balance-related activities, ability to assess concern about falling in highly functioning older people, and its normal distribution. To perform a cross-cultural adaptation and to assess the measurement properties of the 30-item and 10-item Icon-FES in a community-dwelling Brazilian older population. The cross-cultural adaptation followed the recommendations of international guidelines. We evaluated the measurement properties (i.e. internal consistency, test-retest reproducibility, standard error of the measurement, minimal detectable change, construct validity, ceiling/floor effect, data distribution and discriminative validity), in 100 community-dwelling people aged ≥60 years. The 30-item and 10-item Icon-FES-Brazil showed good internal consistency (alpha and omega >0.70) and excellent intra-rater reproducibility (ICC 2,1 =0.96 and 0.93, respectively). According to the standard error of the measurement and minimal detectable change, the magnitude of change needed to exceed the measurement error and variability were 7.2 and 3.4 points for the 30-item and 10-item Icon-FES, respectively. We observed an excellent correlation between both versions of the Icon-FES and Falls Efficacy Scale - International (rho=0.83, p<0.001 [30-item version]; 0.76, p<0.001 [10-item version]). Icon-FES versions showed normal distribution, no floor/ceiling effects and were able to discriminate between groups relating to fall risk factors. Icon-FES-Brazil is a semantically and linguistically appropriate tool with acceptable measurement properties to evaluate concern about falling among the community-dwelling older population. Copyright © 2018 Associação Brasileira de Pesquisa e Pós-Graduação em Fisioterapia. Publicado por Elsevier Editora Ltda. All rights reserved.
Perceived barriers to walking for physical activity.
Dunton, Genevieve F; Schneider, Margaret
2006-10-01
Although the health benefits of walking for physical activity have received increasing research attention, barriers specific to walking are not well understood. In this study, questions to measure barriers to walking for physical activity were developed and tested among college students. The factor structure, test-retest and internal consistency reliability, and discriminant and criterion validity of the perceived barriers were evaluated. A total of 305 undergraduate students participated. Participants had a mean age (+/- SD) of 20.6 (+/- 3.02) years, and 70.3% were female. Participants responded to a questionnaire assessing barriers specific to walking for physical activity. Perceived barriers to vigorous exercise, walking for transportation and recreation, and participation in lifestyle activities (such as taking the stairs instead of the elevator) were also assessed. Subsamples completed the walking barriers instrument a second time after 5 days in order to determine test-retest reliability (n = 104) and wore an accelerometer to measure moderate-intensity physical activity (n = 85). Factor analyses confirmed the existence of three factors underlying the perceived barriers to walking questions: appearance (four items), footwear (three items), and situation (three items). Appearance and situational barriers demonstrated acceptable reliability, discriminant validity, and relations with physical activity criteria. After we controlled for barriers to vigorous exercise, appearance and situational barriers to walking explained additional variation in objectively-measured moderate physical activity. The prediction of walking for physical activity, especially walking that is unstructured and spontaneous, may be improved by considering appearance and situational barriers. Assessing barriers specific to walking may have important implications for interventions targeting walking as means for engaging in physical activity.
Active neutron and gamma-ray imaging of highly enriched uranium for treaty verification
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hamel, Michael C.; Polack, J. Kyle; Ruch, Marc L.
The detection and characterization of highly enriched uranium (HEU) presents a large challenge in the non-proliferation field. HEU has a low neutron emission rate and most gamma rays are low energy and easily shielded. To address this challenge, an instrument known as the dual-particle imager (DPI) was used with a portable deuterium-tritium (DT) neutron generator to detect neutrons and gamma rays from induced fission in HEU. We evaluated system response using a 13.7-kg HEU sphere in several configurations with no moderation, high-density polyethylene (HDPE) moderation, and tungsten moderation. A hollow tungsten sphere was interrogated to evaluate the response to amore » possible hoax item. First, localization capabilities were demonstrated by reconstructing neutron and gamma-ray images. Once localized, additional properties such as fast neutron energy spectra and time-dependent neutron count rates were attributed to the items. For the interrogated configurations containing HEU, the reconstructed neutron spectra resembled Watt spectra, which gave confidence that the interrogated items were undergoing induced fission. The time-dependent neutron count rate was also compared for each configuration and shown to be dependent on the neutron multiplication of the item. This result showed that the DPI is a viable tool for localizing and confirming fissile mass and multiplication.« less
Active neutron and gamma-ray imaging of highly enriched uranium for treaty verification
Hamel, Michael C.; Polack, J. Kyle; Ruch, Marc L.; ...
2017-08-11
The detection and characterization of highly enriched uranium (HEU) presents a large challenge in the non-proliferation field. HEU has a low neutron emission rate and most gamma rays are low energy and easily shielded. To address this challenge, an instrument known as the dual-particle imager (DPI) was used with a portable deuterium-tritium (DT) neutron generator to detect neutrons and gamma rays from induced fission in HEU. We evaluated system response using a 13.7-kg HEU sphere in several configurations with no moderation, high-density polyethylene (HDPE) moderation, and tungsten moderation. A hollow tungsten sphere was interrogated to evaluate the response to amore » possible hoax item. First, localization capabilities were demonstrated by reconstructing neutron and gamma-ray images. Once localized, additional properties such as fast neutron energy spectra and time-dependent neutron count rates were attributed to the items. For the interrogated configurations containing HEU, the reconstructed neutron spectra resembled Watt spectra, which gave confidence that the interrogated items were undergoing induced fission. The time-dependent neutron count rate was also compared for each configuration and shown to be dependent on the neutron multiplication of the item. This result showed that the DPI is a viable tool for localizing and confirming fissile mass and multiplication.« less
The physical activity scale for individuals with physical disabilities: development and evaluation.
Washburn, Richard A; Zhu, Weimo; McAuley, Edward; Frogley, Michael; Figoni, Stephen F
2002-02-01
To evaluate the construct validity of a new 13-item physical activity survey designed to assess physical activity in individuals with physical disabilities. Mail survey requesting information on physical activity, basic demographic characteristics, self-rated health, and self-rated physical activity. In February 2000, surveys were sent to 1176 individuals who had used rehabilitative services at a major midwestern university between 1950 and 1999. Two hundred twenty-seven men and 145 women with disabilities responded to the mail survey (80%, spinal cord or other locomotor injuries; 13%, visual and auditory injuries; 7%, other; 92%, white; mean age +/- standard deviation, 49.8 +/- 12.9y; mean length of disability, 36.9 +/- 14.9y). Not applicable. Physical activity was assessed with the Physical Activity Scale for Individuals with Physical Disabilities (PASIPD). The PASIPD requests the number of days a week and hours daily (categories) of participation in recreational, household, and occupational activities over the past 7 days. Total scores were calculated as the average hours daily times a metabolic equivalent value and summed over items. Pearson correlations between each survey item and the total PASIPD score were all statistically significant (P < .05) and >or= .20 (range, .20- .67). Factor analysis with principal component extraction and varimax orthogonal rotations revealed 5 latent factors (eigenvalues >or= 1, factor loadings >or= .40): home repair and lawn and garden, housework, vigorous sport and recreation, light sport and recreation, and occupation and transportation. These 5 factors accounted for 63% of the total variance. Cronbach alpha coefficients ranged from.37 to.65, indicating low-to-moderate internal consistency within factors. Those who reported being "active/highly active" had higher total and subcategory scores compared with those "not active at all." Those in "excellent" health had higher total, vigorous sport and recreation, and occupation and transportation subcategory scores compared with those who rated their health "fair/poor" (all P < .05). These results provide preliminary support for the construct validity of the PASIPD. Additional validation studies using an external criterion and in more generalizable samples are warranted. Copyright 2002 by the American Congress of Rehabilitation Medicine and the American Academy of Physical Medicine and Rehabilitation
Marom, Batia S; Carel, Rafael S; Sharabi, Moshe; Ratzon, Navah Z
2017-06-01
The World Health Organization Disability Assessment Schedule 2.0 (WHODAS 2.0) questionnaire is used internationally to assess function and disability. The instrument has been translated into several languages, but no Hebrew version exists. The objective of this study was to evaluate the use of the 12-item WHODAS 2.0 questionnaire among Hebrew speakers with and without hand injuries (HI). The translated questionnaire was conducted among 155 uninjured subjects (UI) and 77 male workers with HI. Internal consistency was assessed using Cronbach's alpha. Test-retest reliability was assessed in UI subjects and calculated using the intraclass correlation coefficient (ICC agreement ). Validity was evaluated by correlating the 12-item WHODAS 2.0 to the short-form of health survey (SF-12) in UI subjects and comparing the 12-item WHODAS 2.0 scores and the Quick Disability of Arm, Shoulder, and Hand (QDASH) Outcome Measure in the HI group. The Cronbach's alpha of the WHODAS 2.0 for the entire sample was α = 0.83. The ICC agreement for test-retest reliability was 0.88. A positive significant correlation was found between the 12-item WHODAS 2.0 and the QDASH (r s = 0.53, p < .005). The results support the reliability and validity of this Hebrew translation of the 12-item WHODAS 2.0. IMPLICATIONS FOR REHABILITATION Measurement tools that assess activities and participation after HI are an essential part of the rehabilitation process. The 12-item WHODAS 2.0 is a useful tool, since it addresses a broader range of activity and participation domains compared to the DASH and enables better implementation of the ICF model. Since the WHODAS 2.0 does not target a specific disease (as oppose to the DASH), it can be used to compare disabilities caused by different diseases or traumas. The WHODAS 2.0 measures both the function and disability in general populations as well as clinical situations; therefore, the instrument is useful for assessing both health and disability.
Bergbom, Ingegerd; Karlsson, Veronika; Ringdal, Mona
2018-01-01
Measuring and evaluating patients' recovery, following intensive care, is essential for assessing their recovery process. By using a questionnaire, which includes spiritual and existential aspects, possibilities for identifying appropriate nursing care activities may be facilitated. The study describes the development and evaluation of a recovery questionnaire and its validity and reliability. A questionnaire consisting of 30 items on a 5-point Likert scale was completed by 169 patients (103 men, 66 women), 18 years or older (m=69, SD 12.5) at 2, 6, 12 or 24 months following discharge from an ICU. An exploratory factor analysis, including a principal component analysis with orthogonal varimax rotation, was conducted. Ten initial items, with loadings below 0.40, were removed. The internal item/scale structure obtained in the principal component analysis was tested in relation to convergent and discrimination validity with a multi-trait analysis. Items consistency and reliability were assessed by Cronbach's alpha and internal item consistency. Test of scale quality, the proportion of missing values and respondents' scoring at maximum and minimum levels were also conducted. A total of 20 items in six factors - forward looking, supporting relations, existential ruminations, revaluation of life, physical and mental strength and need of social support were extracted with eigen values above one. Together, they explained 75% of the variance. The half-scale criterion showed that the proportion of incomplete scale scores ranged from 0% to 4.3%. When testing the scale's ability to differentiate between levels of the assessed concept, we found that the observed range of scale scores covered the theoretical range. Substantial proportions of respondents, who scored at the ceiling for forward looking and supporting relations and at floor for the need of social support, were found. These findings should be further investigated. The factor analysis, including discriminant validity and the mean value for the item correlations, was found to be excellent. The RAIN instrument could be used to assess recovery following intensive care. It could provide post-ICU clinics and community/primary healthcare nurses with valuable information on which areas patients may need more support.
Cost effective nuclear commercial grade dedication
DOE Office of Scientific and Technical Information (OSTI.GOV)
Maletz, J.J.; Marston, M.J.
1991-01-01
This paper describes a new computerized database method to create/edit/view specification technical data sheets (mini-specifications) for procurement of spare parts for nuclear facility maintenance and to develop information that could support possible future facility life extension efforts. This method may reduce cost when compared with current manual methods. The use of standardized technical data sheets (mini-specifications) for items of the same category improves efficiency. This method can be used for a variety of tasks, including: Nuclear safety-related procurement; Non-safety related procurement; Commercial grade item procurement/dedication; Evaluation of replacement items. This program will assist the nuclear facility in upgrading its procurementmore » activities consistent with the recent NUMARC Procurement Initiative. Proper utilization of the program will assist the user in assuring that the procured items are correct for the applications, provide data to assist in detecting fraudulent materials, minimize human error in withdrawing database information, improve data retrievability, improve traceability, and reduce long-term procurement costs.« less
Il'in, V K; Starkov, L V; Kostrov, S V; Belikodvorskaia, G A; Chuvil'skaia, N A; Mukhamedieva, L N; Mikos, K N
2004-01-01
Cellulose-containing wastes are one of the heaviest and biggest ingredients of solid domestic wastes piling up during spaceflight. For the most part these are disposable personal hygiene items used in large quantities in the absence of shower. These wastes contain human body products which are very dangerous from the sanitary-epidemiological standpoint. The purpose was to explore potentiality of microbial biodegradation of cellulose-containing hygiene items anaerobically with dry mass transformation into liquid and biogas. Among specific objectives were test cultivation of active strains of reference cultures of cellulose-fermenting anaerobic thermophilic bacteria on hygiene items as the only source of carbon, evaluation of ways and need of pretreatment of gauze pads to stimulate biodegradation, and chemical analysis of resulting biogas. From the investigation it was concluded that gauze pads are susceptible to biodegradation by anaerobic bacteria producing a low toxicity gas fraction. Therefore, the proposed technology can be considered as a candidate for integration into the spacecrew life support system.
This document describes the updated draft testing protocol recommended by the EPA to support the registration of copper-containing surface products (such as door knobs, or other items that are not intended for food contact) that bear sanitizer claims.
The Birth and Slow Death of the Ontario Assessment Instrument Pool.
ERIC Educational Resources Information Center
Raphael, Dennis
1993-01-01
Describes the development of the Ontario Assessment Instrument Pool (OAIP), a curriculum-based item bank for use in Ontario schools. The nearly $10,000,000 project, lacking implementation and evaluation activities, resulted in limited classroom use. The objective-based assessment also contradicted a child-centered educational philosophy. (KS)
The Effects of Reinforcer Pairing and Fading on Preschoolers' Snack Selections
ERIC Educational Resources Information Center
Solberg, Katherine M.; Hanley, Gregory P.; Layer, Stacy A.; Ingvarsson, Einar T.
2007-01-01
The effects of reinforcement pairing and fading on preschoolers' snack selections were evaluated in a multiple baseline design. Baseline preferences for snack options were assessed via repeated paired-item preference assessments. Edible, social, and activity-based reinforcers were then exclusively paired with a less preferred snack option. Once…
Evaluating innovative items for the NCLEX, part I: usability and pilot testing.
Wendt, Anne; Harmes, J Christine
2009-01-01
National Council of State Boards of Nursing (NCSBN) has recently conducted preliminary research on the feasibility of including various types of innovative test questions (items) on the NCLEX. This article focuses on the participants' reactions to and their strategies for interacting with various types of innovative items. Part 2 in the May/June issue will focus on the innovative item templates and evaluation of the statistical characteristics and the level of cognitive processing required to answer the examination items.
Restricted interests and teacher presentation of items.
Stocco, Corey S; Thompson, Rachel H; Rodriguez, Nicole M
2011-01-01
Restricted and repetitive behavior (RRB) is more pervasive, prevalent, frequent, and severe in individuals with autism spectrum disorders (ASDs) than in their typical peers. One subtype of RRB is restricted interests in items or activities, which is evident in the manner in which individuals engage with items (e.g., repetitious wheel spinning), the types of items or activities they select (e.g., preoccupation with a phone book), or the range of items or activities they select (i.e., narrow range of items). We sought to describe the relation between restricted interests and teacher presentation of items. Overall, we observed 5 teachers interacting with 2 pairs of students diagnosed with an ASD. Each pair included 1 student with restricted interests. During these observations, teachers were free to present any items from an array of 4 stimuli selected by experimenters. We recorded student responses to teacher presentation of items and analyzed the data to determine the relation between teacher presentation of items and the consequences for presentation provided by the students. Teacher presentation of items corresponded with differential responses provided by students with ASD, and those with restricted preferences experienced a narrower array of items.
Investigation and comprehensive evaluation of the litter pollution on the Heishijiao beach in Dalian
NASA Astrophysics Data System (ADS)
Han, Mengdi; Zhao, Kaiyuan; Zhang, Yan; Sui, Chuanguo
2018-02-01
From November 2015 to August 2016, this paper conducted an investigation into the classification of the litter on the Heishijiao beach in Dalian, and made a comprehensive evaluation of the litter pollution on the beach in different seasons. According to the results, the litter on the Heishijiao beach in Dalian mainly come from human’s offshore activities and other wastes, and spring is the season which witnesses the largest quantity of litter resulting from the activities. Most of the fragmental wastes are glass, plastic and paper, while there is a little metal, rubber and wooden products. On the Heishijiao beach, most of the fragmental litter are small, followed by medium and large ones; outsized wastes are rare. The quantitative density of litter is highest in winter (9.0items/m2), with the average quantitative density of 4.6 items/m2; the qualitative density of litter is highest in spring (8 g/m2), with the average qualitative density of 6.0 g/m2. The results of the comprehensive evaluation show that the litter pollution on the Heishijiao beach stays between “Average” and “Unsatisfactory”.
Item Difficulty in the Evaluation of Computer-Based Instruction: An Example from Neuroanatomy
Chariker, Julia H.; Naaz, Farah; Pani, John R.
2012-01-01
This article reports large item effects in a study of computer-based learning of neuroanatomy. Outcome measures of the efficiency of learning, transfer of learning, and generalization of knowledge diverged by a wide margin across test items, with certain sets of items emerging as particularly difficult to master. In addition, the outcomes of comparisons between instructional methods changed with the difficulty of the items to be learned. More challenging items better differentiated between instructional methods. This set of results is important for two reasons. First, it suggests that instruction may be more efficient if sets of consistently difficult items are the targets of instructional methods particularly suited to them. Second, there is wide variation in the published literature regarding the outcomes of empirical evaluations of computer-based instruction. As a consequence, many questions arise as to the factors that may affect such evaluations. The present paper demonstrates that the level of challenge in the material that is presented to learners is an important factor to consider in the evaluation of a computer-based instructional system. PMID:22231801
Item difficulty in the evaluation of computer-based instruction: an example from neuroanatomy.
Chariker, Julia H; Naaz, Farah; Pani, John R
2012-01-01
This article reports large item effects in a study of computer-based learning of neuroanatomy. Outcome measures of the efficiency of learning, transfer of learning, and generalization of knowledge diverged by a wide margin across test items, with certain sets of items emerging as particularly difficult to master. In addition, the outcomes of comparisons between instructional methods changed with the difficulty of the items to be learned. More challenging items better differentiated between instructional methods. This set of results is important for two reasons. First, it suggests that instruction may be more efficient if sets of consistently difficult items are the targets of instructional methods particularly suited to them. Second, there is wide variation in the published literature regarding the outcomes of empirical evaluations of computer-based instruction. As a consequence, many questions arise as to the factors that may affect such evaluations. The present article demonstrates that the level of challenge in the material that is presented to learners is an important factor to consider in the evaluation of a computer-based instructional system. Copyright © 2011 American Association of Anatomists.
Neural Correlates of Learning from Induced Insight: A Case for Reward-Based Episodic Encoding.
Kizilirmak, Jasmin M; Thuerich, Hannes; Folta-Schoofs, Kristian; Schott, Björn H; Richardson-Klavehn, Alan
2016-01-01
Experiencing insight when solving problems can improve memory formation for both the problem and its solution. The underlying neural processes involved in this kind of learning are, however, thus far insufficiently understood. Here, we conceptualized insight as the sudden understanding of a novel relationship between known stimuli that fits into existing knowledge and is accompanied by a positive emotional response. Hence, insight is thought to comprise associative novelty, schema congruency, and intrinsic reward, all of which are separately known to enhance memory performance. We examined the neural correlates of learning from induced insight with functional magnetic resonance imaging (fMRI) using our own version of the compound-remote-associates-task (CRAT) in which each item consists of three clue words and a solution word. (Pseudo-)Solution words were presented after a brief period of problem-solving attempts to induce either sudden comprehension (CRA items) or continued incomprehension (control items) at a specific time point. By comparing processing of the solution words of CRA with control items, we found induced insight to elicit activation of the rostral anterior cingulate cortex/medial prefrontal cortex (rACC/mPFC) and left hippocampus. This pattern of results lends support to the role of schema congruency (rACC/mPFC) and associative novelty (hippocampus) in the processing of induced insight. We propose that (1) the mPFC not only responds to schema-congruent information, but also to the detection of novel schemata, and (2) that the hippocampus responds to a form of associative novelty that is not just a novel constellation of familiar items, but rather comprises a novel meaningful relationship between the items-which was the only difference between our insight and no insight conditions. To investigate episodic long-term memory encoding, we compared CRA items whose solution word was recognized 24 h after encoding to those with forgotten solutions. We found activation in the left striatum and parts of the left amygdala, pointing to a potential role of brain reward circuitry in the encoding of the solution words. We propose that learning from induced insight mainly relies on the amygdala evaluating the internal value (as an affective evaluation) of the suddenly comprehended information, and striatum-dependent reward-based learning.
Miyata, T
1990-06-01
The purpose of this study is to reveal the relation between stomatognathic system and the systemic condition. In the present study, experimental occlusal interference was given to the first molar on main mastication side of 6 healthy subjects and the influence on the upright posture was evaluated through simultaneous measurements of changes in activity of antigravity muscles via electromyography, other than the measurement of loci of the gravity fluctuation for stabilograph before and after the interference was provided. The following results were obtained, 1. Loci of gravity fluctuation 1) All parameters tended increase 24 hours after the interference was provided. 2) The decreasing trend was noted 24 hours after the interference was removed. 3) At one week after the interference was removed all analysis items tended to restore to the normal range. 2. Activity of antigravity muscles In some of the subjects, the muscular activity showed the same trend as the changes of analysis items of gravity fluctuation. 3. The above results suggest that the evaluation of the loci of the gravity fluctuation may be helpful to assess the therapeutic effect of malocclusion.
Creating adaptive web recommendation system based on user behavior
NASA Astrophysics Data System (ADS)
Walek, Bogdan
2018-01-01
The paper proposes adaptive web recommendation system based on user behavior. The proposed system uses expert system to evaluating and recommending suitable items of content. Relevant items are subsequently evaluated and filtered based on history of visited items and user´s preferred categories of items. Main parts of the proposed system are presented and described. The proposed recommendation system is verified on specific example.
A Five-Year Evaluation of Examination Structure in a Cardiovascular Pharmacotherapy Course
Kolar, Claire; Janke, Kristin K.
2015-01-01
Objective. To evaluate the composition and effectiveness as an assessment tool of a criterion-referenced examination comprised of clinical cases tied to practice decisions, to examine the effect of varying audience response system (ARS) questions on student examination preparation, and to articulate guidelines for structuring examinations to maximize evaluation of student learning. Design. Multiple-choice items developed over 5 years were evaluated using Bloom’s Taxonomy classification, point biserial correlation, item difficulty, and grade distribution. In addition, examination items were classified into categories based on similarity to items used in ARS preparation. Assessment. As the number of items directly tied to clinical practice rose, Bloom’s Taxonomy level and item difficulty also rose. In examination years where Bloom’s levels were high but preparation was minimal, average grade distribution was lower compared with years in which student preparation was higher. Conclusion. Criterion-referenced examinations can benefit from systematic evaluation of their composition and effectiveness as assessment tools. Calculated design and delivery of classroom preparation is an asset in improving examination performance on rigorous, practice-relevant examinations. PMID:27168611
Austvoll-Dahlgren, Astrid; Guttersrud, Øystein; Nsangi, Allen; Semakula, Daniel; Oxman, Andrew D
2017-01-01
Background The Claim Evaluation Tools database contains multiple-choice items for measuring people’s ability to apply the key concepts they need to know to be able to assess treatment claims. We assessed items from the database using Rasch analysis to develop an outcome measure to be used in two randomised trials in Uganda. Rasch analysis is a form of psychometric testing relying on Item Response Theory. It is a dynamic way of developing outcome measures that are valid and reliable. Objectives To assess the validity, reliability and responsiveness of 88 items addressing 22 key concepts using Rasch analysis. Participants We administrated four sets of multiple-choice items in English to 1114 people in Uganda and Norway, of which 685 were children and 429 were adults (including 171 health professionals). We scored all items dichotomously. We explored summary and individual fit statistics using the RUMM2030 analysis package. We used SPSS to perform distractor analysis. Results Most items conformed well to the Rasch model, but some items needed revision. Overall, the four item sets had satisfactory reliability. We did not identify significant response dependence between any pairs of items and, overall, the magnitude of multidimensionality in the data was acceptable. The items had a high level of difficulty. Conclusion Most of the items conformed well to the Rasch model’s expectations. Following revision of some items, we concluded that most of the items were suitable for use in an outcome measure for evaluating the ability of children or adults to assess treatment claims. PMID:28550019
Cordier, Reinie; Speyer, Renée; Schindler, Antonio; Michou, Emilia; Heijnen, Bas Joris; Baijens, Laura; Karaduman, Ayşe; Swan, Katina; Clavé, Pere; Joosten, Annette Veronica
2018-02-01
The Swallowing Quality of Life questionnaire (SWAL-QOL) is widely used clinically and in research to evaluate quality of life related to swallowing difficulties. It has been described as a valid and reliable tool, but was developed and tested using classic test theory. This study describes the reliability and validity of the SWAL-QOL using item response theory (IRT; Rasch analysis). SWAL-QOL data were gathered from 507 participants at risk of oropharyngeal dysphagia (OD) across four European countries. OD was confirmed in 75.7% of participants via videofluoroscopy and/or fiberoptic endoscopic evaluation, or a clinical diagnosis based on meeting selected criteria. Patients with esophageal dysphagia were excluded. Data were analysed using Rasch analysis. Item and person reliability was good for all the items combined. However, person reliability was poor for 8 subscales and item reliability was poor for one subscale. Eight subscales exhibited poor person separation and two exhibited poor item separation. Overall item and person fit statistics were acceptable. However, at an individual item fit level results indicated unpredictable item responses for 28 items, and item redundancy for 10 items. The item-person dimensionality map confirmed these findings. Results from the overall Rasch model fit and Principal Component Analysis were suggestive of a second dimension. For all the items combined, none of the item categories were 'category', 'threshold' or 'step' disordered; however, all subscales demonstrated category disordered functioning. Findings suggest an urgent need to further investigate the underlying structure of the SWAL-QOL and its psychometric characteristics using IRT.
75 FR 8751 - Records Schedules; Availability and Request for Comments
Federal Register 2010, 2011, 2012, 2013, 2014
2010-02-25
..., Administration on Aging (N1-439-09-2, 14 items, 14 temporary items). Schedules of daily activities and files of... Services, Administration on Aging (N1-439-09-4, 9 items, 8 temporary items). Master data files containing... associated with the death of active duty military personnel. Proposed for permanent retention are such...
Verkoeijen, Peter P J L; Rikers, Remy M J P; Schmidt, Henk G
2005-01-01
In this study, the authors examined the influence of prior knowledge activation on information processing by means of a prior knowledge activation procedure adopted from the read-generate paradigm. On the basis of cue-target pairs, participants in the experimental groups generated two different sets of items before studying a relevant list. Subsequently, participants were informed that they had to study the items in the list and that they should try to remember as many items as possible. The authors assessed the processing time allocated to the items in the list and free recall of those items. The results revealed that the experimental groups spent less time on items that had already been activated. In addition, the experimental groups outperformed the control group in overall free recall and in free recall of the activated items. Between-group comparisons did not demonstrate significant effects with respect to the processing time and free recall of nonactivated items. The authors interpreted these results in terms of the discrepancy reduction model of regulating the amount of processing time allocated to different parts of the list.
Kirsch, Monika; Mitchell, Sandra A; Dobbels, Fabienne; Stussi, Georg; Basch, Ethan; Halter, Jorg P; De Geest, Sabina
2015-02-01
The aim of this sequential mixed methods study was to develop a PRO-CTCAE (Patient-Reported Outcomes version of the Common Terminology Criteria for Adverse Events)-based measure of the symptom experience of late effects in German speaking long-term survivors of allogeneic stem cell transplantation (SCT), and to examine its content validity. The US National Cancer Institute's PRO-CTAE item library was translated into German and linguistically validated. PRO-CTCAE symptoms prevalent in ≥50% of survivors (n = 15) and recognized in its importance by SCT experts (n = 9) were identified. Additional concepts relevant to the symptom experience and its consequences were elicited. Content validity of the PROVIVO (Patient-Reported Outcomes of long-term survivors after allogeneic SCT) instrument was assessed through an additional round of cognitive debriefing in 15 patients, and item and scale content validity indices by 9 experts. PROVIVO is comprised of a total of 49 items capturing the experience of physical, emotional and cognitive symptoms. To improve the instrument's utility for clinical decision-making, questions soliciting limitations in activities of daily living, frequent infections, and overall well-being were added. Cognitive debriefings demonstrated that items were well understood and relevant to the SCT survivor experience. Scale Content Validity Index (CVI) (0.94) and item CVI (median = 1; range 0.75-1) were very high. Qualitative and quantitative data provide preliminary evidence supporting the content validity of PROVIVO and identify a PRO-CTCAE item bundle for use in SCT survivors. A study to evaluate the measurement properties of PROVIVO and to examine its capacity to improve survivorship care planning is underway. Copyright © 2014 Elsevier Ltd. All rights reserved.
Action and Valence Modulate Choice and Choice-Induced Preference Change
Koster, Raphael; Duzel, Emrah; Dolan, Raymond J.
2015-01-01
Choices are not only communicated via explicit actions but also passively through inaction. In this study we investigated how active or passive choice impacts upon the choice process itself as well as a preference change induced by choice. Subjects were tasked to select a preference for unfamiliar photographs by action or inaction, before and after they gave valuation ratings for all photographs. We replicate a finding that valuation increases for chosen items and decreases for unchosen items compared to a control condition in which the choice was made post re-evaluation. Whether choice was expressed actively or passively affected the dynamics of revaluation differently for positive and negatively valenced items. Additionally, the choice itself was biased towards action such that subjects tended to choose a photograph obtained by action more often than a photographed obtained through inaction. These results highlight intrinsic biases consistent with a tight coupling of action and reward and add to an emerging understanding of how the mode of action itself, and not just an associated outcome, modulates the decision making process. PMID:25747703
Psychometric Principles in Measurement for Geoscience Education Research: A Climate Change Example
NASA Astrophysics Data System (ADS)
Libarkin, J. C.; Gold, A. U.; Harris, S. E.; McNeal, K.; Bowles, R.
2015-12-01
Understanding learning in geoscience classrooms requires that we use valid and reliable instruments aligned with intended learning outcomes. Nearly one hundred instruments assessing conceptual understanding in undergraduate science and engineering classrooms (often called concept inventories) have been published and are actively being used to investigate learning. The techniques used to develop these instruments vary widely, often with little attention to psychometric principles of measurement. This paper will discuss the importance of using psychometric principles to design, evaluate, and revise research instruments, with particular attention to the validity and reliability steps that must be undertaken to ensure that research instruments are providing meaningful measurement. An example from a climate change inventory developed by the authors will be used to exemplify the importance of validity and reliability, including the value of item response theory for instrument development. A 24-item instrument was developed based on published items, conceptions research, and instructor experience. Rasch analysis of over 1000 responses provided evidence for the removal of 5 items for misfit and one item for potential bias as measured via differential item functioning. The resulting 18-item instrument can be considered a valid and reliable measure based on pre- and post-implementation metrics. Consideration of the relationship between respondent demographics and concept inventory scores provides unique insight into the relationship between gender, religiosity, values and climate change understanding.
Pololi, Linda H; Evans, Arthur T; Civian, Janet T; Gibbs, Brian K; Gillum, Linda H; Brennan, Robert T
2016-01-01
Despite the well-recognized benefits of mentoring in academic medicine, there is a lack of clarity regarding what constitutes effective mentoring. We developed a tool to assess mentoring activities experienced by faculty and evaluated evidence for its validity. The National Initiative on Gender, Culture, and Leadership in Medicine-"C-Change"-previously developed the C-Change Faculty Survey to assess the culture of academic medicine. After intensive review, we added six items representing six components of mentoring to the survey-receiving help with career and personal goals, learning skills, sponsorship, and resources. We tested the items in four academic health centers during 2013 to 2014. We estimated reliability of the new items and tested the correlation of the new items with a mentoring composite variable representing faculty mentoring experiences as positive, neutral, or inadequate and with other C-Change dimensions of culture. Among the 1520 responding faculty (response rate 61-63%), there was a positive association between each of the six mentoring activities and satisfaction with both the amount and quality of mentoring received. There was no difference by sex. Cronbach α coefficients ranged from 0.89 to 0.95 across subgroups of faculty (by sex, race, and principal roles). The mentoring responses were associated most closely with dimensions of Institutional Support (r = 0.58, P < .001), Institutional Change Efforts for Faculty Support (r = 0.52, P < .001), Values Alignment (r = 0.58, P < .001), Self-efficacy (r = 0.43; P < .001), and Relationships/Inclusion/Trust (r = 0.41; P < .001). Data demonstrated that the Mentoring scale is a valid instrument to assess mentoring. Survey results could facilitate mentoring program development and evaluation.
Evaluation of Item Candidates: The PROMIS Qualitative Item Review
DeWalt, Darren A.; Rothrock, Nan; Yount, Susan; Stone, Arthur A.
2009-01-01
One of the PROMIS (Patient-Reported Outcome Measurement Information System) network's primary goals is the development of a comprehensive item bank for patient-reported outcomes of chronic diseases. For its first set of item banks, PROMIS chose to focus on pain, fatigue, emotional distress, physical function, and social function. An essential step for the development of an item pool is the identification, evaluation, and revision of extant questionnaire items for the core item pool. In this work, we also describe the systematic process wherein items are classified for subsequent statistical processing by the PROMIS investigators. Six phases of item development are documented: identification of extant items, item classification and selection, item review and revision, focus group input on domain coverage, cognitive interviews with individual items, and final revision before field testing. Identification of items refers to the systematic search for existing items in currently available scales. Expert item review and revision was conducted by trained professionals who reviewed the wording of each item and revised as appropriate for conventions adopted by the PROMIS network. Focus groups were used to confirm domain definitions and to identify new areas of item development for future PROMIS item banks. Cognitive interviews were used to examine individual items. Items successfully screened through this process were sent to field testing and will be subjected to innovative scale construction procedures. PMID:17443114
Paz, Sylvia H; Spritzer, Karen L; Morales, Leo S; Hays, Ron D
2013-03-29
To evaluate the equivalence of the PROMIS® wave 1 physical functioning item bank, by age (50 years or older versus 18-49). A total of 114 physical functioning items with 5 response choices were administered to English- (n=1504) and Spanish-language (n=640) adults. Item frequencies, means and standard deviations, item-scale correlations, and internal consistency reliability were estimated. Differential Item Functioning (DIF) by age was evaluated. Thirty of the 114 items were fagged for DIF based on an R-squared of 0.02 or above criterion. The expected total score was higher for those respondents who were 18-49 than those who were 50 or older. Those who were 50 years or older versus 18-49 years old with the same level of physical functioning responded differently to 30 of the 114 items in the PROMIS® physical functioning item bank. This study yields essential information about the equivalence of the physical functioning items in older versus younger individuals.
Acculturation and the Center For Epidemiological Studies-Depression Scale for Hispanic women.
McCabe, Brian E; Vermeesch, Amber L; Hall, Rosemary F; Peragallo, Nilda P; Mitrani, Victoria B
2011-01-01
Culturally valid measures of depression for Spanish-speaking Hispanic women are important for developing and implementing effective interventions to reduce health disparities. The Center for Epidemiological Studies-Depression Scale (CES-D) is a widely used measure of depression. Differential item functioning has been studied using language preference as a proxy for acculturation, but it is unknown if the results were due to acculturation or the language of administration. The aim of this study was to evaluate the relationship of acculturation, defined with a dimensional measure, to Spanish CES-D item responses. Spanish-speaking Hispanic women (n = 504) were recruited for a randomized controlled trial of Salud, Educación, Prevención y Autocuidado (Health, Education, Prevention, and Self-Care). Acculturation, an important dimension of variation within the diverse U.S. Hispanic community, was defined by high or low scores on the Americanism subscale of the Bidimensional Acculturation Scale. Differential item functioning for each of the 20 CES-D items between more acculturated and less acculturated women was tested using ordinal logistic regression. No items on the Depressed Affect, Somatic Activity, or Positive Affect subscales showed meaningful differential item functioning, but 1 item ("People were unfriendly") on the Interpersonal subscale had small results (R = 1.1%). The majority of CES-D items performed similarly for Spanish-speaking Hispanic women with high and low acculturation. Less acculturated women responded more positively to "People were unfriendly," despite having an equivalent level of depression, than did more acculturated women. Possibilities for improving this item are proposed.
He, Wei; Xie, Yanming; Wang, Yongyan
2011-10-01
The purpose of post-marketing Chinese medicine re-evaluation is to identify Chinese medicine clinical indications, while designing scientific and rational of Chinese medicine symptoms items are important to the result of symptoms re-evaluation. This study give screening of traditional Chinese medicine(TCM) symptoms item of post-marketing medicine Xuezhikang re-evaluation as example that reference to principle dyslipidemia clinical research, academic dissertations, Xuezhikang directions, clinical expert practice experience etc. while standardization those symptom names and screening 41 dyslipidemia common symptoms. Furthermore, this paper discuss about the accoerdance and announcements when screening symptoms item, so as to providing a research thread to manufacture PRO chart for post-marketing medicine re-evaluation.
Factor- and Item-Level Analyses of the 38-Item Activities Scale for Kids-Performance
ERIC Educational Resources Information Center
Bagley, Anita M.; Gorton, George E.; Bjornson, Kristie; Bevans, Katherine; Stout, Jean L.; Narayanan, Unni; Tucker, Carole A.
2011-01-01
Aim: Children and adolescents highly value their ability to participate in relevant daily life and recreational activities. The Activities Scale for Kids-performance (ASKp) instrument measures the frequency of performance of 30 common childhood activities, and has been shown to be valid and reliable. A revised and expanded 38-item ASKp (ASKp38)…
Park, In Sook; Suh, Yeon Ok; Park, Hae Sook; Kang, So Young; Kim, Kwang Sung; Kim, Gyung Hee; Choi, Yeon-Hee; Kim, Hyun-Ju
2017-01-01
The purpose of this study was to improve the quality of items on the Korean Nursing Licensing Examination by developing and evaluating case-based items that reflect integrated nursing knowledge. We conducted a cross-sectional observational study to develop new case-based items. The methods for developing test items included expert workshops, brainstorming, and verification of content validity. After a mock examination of undergraduate nursing students using the newly developed case-based items, we evaluated the appropriateness of the items through classical test theory and item response theory. A total of 50 case-based items were developed for the mock examination, and content validity was evaluated. The question items integrated 34 discrete elements of integrated nursing knowledge. The mock examination was taken by 741 baccalaureate students in their fourth year of study at 13 universities. Their average score on the mock examination was 57.4, and the examination showed a reliability of 0.40. According to classical test theory, the average level of item difficulty of the items was 57.4% (80%-100% for 12 items; 60%-80% for 13 items; and less than 60% for 25 items). The mean discrimination index was 0.19, and was above 0.30 for 11 items and 0.20 to 0.29 for 15 items. According to item response theory, the item discrimination parameter (in the logistic model) was none for 10 items (0.00), very low for 20 items (0.01 to 0.34), low for 12 items (0.35 to 0.64), moderate for 6 items (0.65 to 1.34), high for 1 item (1.35 to 1.69), and very high for 1 item (above 1.70). The item difficulty was very easy for 24 items (below -2.0), easy for 8 items (-2.0 to -0.5), medium for 6 items (-0.5 to 0.5), hard for 3 items (0.5 to 2.0), and very hard for 9 items (2.0 or above). The goodness-of-fit test in terms of the 2-parameter item response model between the range of 2.0 to 0.5 revealed that 12 items had an ideal correct answer rate. We surmised that the low reliability of the mock examination was influenced by the timing of the test for the examinees and the inappropriate difficulty of the items. Our study suggested a methodology for the development of future case-based items for the Korean Nursing Licensing Examination.
Karnoe, Astrid; Furstrand, Dorthe; Batterham, Roy; Christensen, Karl Bang; Elsworth, Gerald; Osborne, Richard H
2018-01-01
Background For people to be able to access, understand, and benefit from the increasing digitalization of health services, it is critical that services are provided in a way that meets the user’s needs, resources, and competence. Objective The objective of the study was to develop a questionnaire that captures the 7-dimensional eHealth Literacy Framework (eHLF). Methods Draft items were created in parallel in English and Danish. The items were generated from 450 statements collected during the conceptual development of eHLF. In all, 57 items (7 to 9 items per scale) were generated and adjusted after cognitive testing. Items were tested in 475 people recruited from settings in which the scale was intended to be used (community and health care settings) and including people with a range of chronic conditions. Measurement properties were assessed using approaches from item response theory (IRT) and classical test theory (CTT) such as confirmatory factor analysis (CFA) and reliability using composite scale reliability (CSR); potential bias due to age and sex was evaluated using differential item functioning (DIF). Results CFA confirmed the presence of the 7 a priori dimensions of eHLF. Following item analysis, a 35-item 7-scale questionnaire was constructed, covering (1) using technology to process health information (5 items, CSR=.84), (2) understanding of health concepts and language (5 items, CSR=.75), (3) ability to actively engage with digital services (5 items, CSR=.86), (4) feel safe and in control (5 items, CSR=.87), (5) motivated to engage with digital services (5 items, CSR=.84), (6) access to digital services that work (6 items, CSR=.77), and (7) digital services that suit individual needs (4 items, CSR=.85). A 7-factor CFA model, using small-variance priors for cross-loadings and residual correlations, had a satisfactory fit (posterior productive P value: .27, 95% CI for the difference between the observed and replicated chi-square values: −63.7 to 133.8). The CFA showed that all items loaded strongly on their respective factors. The IRT analysis showed that no items were found to have disordered thresholds. For most scales, discriminant validity was acceptable; however, 2 pairs of dimensions were highly correlated; dimensions 1 and 5 (r=.95), and dimensions 6 and 7 (r=.96). All dimensions were retained because of strong content differentiation and potential causal relationships between these dimensions. There is no evidence of DIF. Conclusions The eHealth Literacy Questionnaire (eHLQ) is a multidimensional tool based on a well-defined a priori eHLF framework with robust properties. It has satisfactory evidence of construct validity and reliable measurement across a broad range of concepts (using both CTT and IRT traditions) in various groups. It is designed to be used to understand and evaluate people’s interaction with digital health services. PMID:29434011
Differential Item Functioning Analysis of the 2003-04 NHANES Physical Activity Questionnaire
ERIC Educational Resources Information Center
Gao, Yong; Zhu, Weimo
2011-01-01
Using differential item functioning (DIF) analyses, this study examined whether there were any DIF items in the National Health and Nutrition Examination Survey (NHANES) physical activity (PA) questionnaire. A subset of adult data from the 2003-04 NHANES study (n = 3,083) was used. PA items related to respondents' occupational, transportation,…
Salathé, Cornelia Rolli; Trippolini, Maurizio Alen; Terribilini, Livio Claudio; Oliveri, Michael; Elfering, Achim
2018-06-01
Purpose To develop a multidimensional scale to asses psychosocial beliefs-the Yellow Flag Questionnaire (YFQ)-aimed at guiding interventions for workers with chronic musculoskeletal (MSK) pain. Methods Phase 1 consisted of item selection based on literature search, item development and expert consensus rounds. In phase 2, items were reduced with calculating a quality-score per item, using structure equation modeling and confirmatory factor analysis on data from 666 workers. In phase 3, Cronbach's α, and Pearson correlations coefficients were computed to compare YFQ with disability, anxiety, depression and self-efficacy and the YFQ score based on data from 253 injured workers. Regressions of YFQ total score on disability, anxiety, depression and self-efficacy were calculated. Results After phase 1, the YFQ included 116 items and 15 domains. Further reductions of items in phase 2 by applying the item quality criteria reduced the total to 48 items. Phase factor analysis with structural equation modeling confirmed 32 items in seven domains: activity, work, emotions, harm & blame, diagnosis beliefs, co-morbidity and control. Cronbach α was 0.91 for the total score, between 0.49 and 0.81 for the 7 distinct scores of each domain, respectively. Correlations between YFQ total score ranged with disability, anxiety, depression and self-efficacy was .58, .66, .73, -.51, respectively. After controlling for age and gender the YFQ total score explained between R2 27% and R2 53% variance of disability, anxiety, depression and self-efficacy. Conclusions The YFQ, a multidimensional screening scale is recommended for use to assess psychosocial beliefs of workers with chronic MSK pain. Further evaluation of the measurement properties such as the test-retest reliability, responsiveness and prognostic validity is warranted.
Austvoll-Dahlgren, Astrid; Guttersrud, Øystein; Nsangi, Allen; Semakula, Daniel; Oxman, Andrew D
2017-05-25
The Claim Evaluation Tools database contains multiple-choice items for measuring people's ability to apply the key concepts they need to know to be able to assess treatment claims. We assessed items from the database using Rasch analysis to develop an outcome measure to be used in two randomised trials in Uganda. Rasch analysis is a form of psychometric testing relying on Item Response Theory. It is a dynamic way of developing outcome measures that are valid and reliable. To assess the validity, reliability and responsiveness of 88 items addressing 22 key concepts using Rasch analysis. We administrated four sets of multiple-choice items in English to 1114 people in Uganda and Norway, of which 685 were children and 429 were adults (including 171 health professionals). We scored all items dichotomously. We explored summary and individual fit statistics using the RUMM2030 analysis package. We used SPSS to perform distractor analysis. Most items conformed well to the Rasch model, but some items needed revision. Overall, the four item sets had satisfactory reliability. We did not identify significant response dependence between any pairs of items and, overall, the magnitude of multidimensionality in the data was acceptable. The items had a high level of difficulty. Most of the items conformed well to the Rasch model's expectations. Following revision of some items, we concluded that most of the items were suitable for use in an outcome measure for evaluating the ability of children or adults to assess treatment claims. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Ontology-Based Multiple Choice Question Generation
Al-Yahya, Maha
2014-01-01
With recent advancements in Semantic Web technologies, a new trend in MCQ item generation has emerged through the use of ontologies. Ontologies are knowledge representation structures that formally describe entities in a domain and their relationships, thus enabling automated inference and reasoning. Ontology-based MCQ item generation is still in its infancy, but substantial research efforts are being made in the field. However, the applicability of these models for use in an educational setting has not been thoroughly evaluated. In this paper, we present an experimental evaluation of an ontology-based MCQ item generation system known as OntoQue. The evaluation was conducted using two different domain ontologies. The findings of this study show that ontology-based MCQ generation systems produce satisfactory MCQ items to a certain extent. However, the evaluation also revealed a number of shortcomings with current ontology-based MCQ item generation systems with regard to the educational significance of an automatically constructed MCQ item, the knowledge level it addresses, and its language structure. Furthermore, for the task to be successful in producing high-quality MCQ items for learning assessments, this study suggests a novel, holistic view that incorporates learning content, learning objectives, lexical knowledge, and scenarios into a single cohesive framework. PMID:24982937
Evaluating the healthiness of chain-restaurant menu items using crowdsourcing: a new method.
Lesser, Lenard I; Wu, Leslie; Matthiessen, Timothy B; Luft, Harold S
2017-01-01
To develop a technology-based method for evaluating the nutritional quality of chain-restaurant menus to increase the efficiency and lower the cost of large-scale data analysis of food items. Using a Modified Nutrient Profiling Index (MNPI), we assessed chain-restaurant items from the MenuStat database with a process involving three steps: (i) testing 'extreme' scores; (ii) crowdsourcing to analyse fruit, nut and vegetable (FNV) amounts; and (iii) analysis of the ambiguous items by a registered dietitian. In applying the approach to assess 22 422 foods, only 3566 could not be scored automatically based on MenuStat data and required further evaluation to determine healthiness. Items for which there was low agreement between trusted crowd workers, or where the FNV amount was estimated to be >40 %, were sent to a registered dietitian. Crowdsourcing was able to evaluate 3199, leaving only 367 to be reviewed by the registered dietitian. Overall, 7 % of items were categorized as healthy. The healthiest category was soups (26 % healthy), while desserts were the least healthy (2 % healthy). An algorithm incorporating crowdsourcing and a dietitian can quickly and efficiently analyse restaurant menus, allowing public health researchers to analyse the healthiness of menu items.
Moonseong, Heo; Erica, Irvin; Natania, Ostrovsky; Carmen, Isasi; Shawn, Hayes; Judith, Wylie-Rosett
2015-01-01
BACKGROUND HealthCorps provides school wellness programming using curricula to promote changes in nutrition, mental health and physical activity behaviors. The research objective was to evaluate effects of implementing its curricula on nutrition, mental health and physical activity knowledge and behavior. METHODS Pre- and post-survey data were collected (N = 2255) during the 2012-13 academic year from 14 New York City public high schools. An 18-item knowledge questionnaire addressed 3 domains; 26 behavioral items were analyzed by factor analysis to identify 6 behavior domains, breakfast being a seventh one-item domain. We examined the effects stratified by sex, applying mixed-effects models to take into account clustering effects of schools and participants adjusted for age. RESULTS The HealthCorps program significantly increased all 3 knowledge domains (p < .05), and significantly changed several key behavioral domains. Boys significantly increased fruits/vegetables intake (p = .03). Girls increased acceptance of new fruits/vegetables (p = .03) and breakfast consumption (p = .04), and decreased sugar-sweetened beverages and energy dense food intake (p = .03). The associations between knowledge and behavior were stronger in boys than girls. CONCLUSION The HealthCorps program significantly increased participants’ knowledge on nutrition, mental health and physical activity. It also improved several key behavioral domains, which are targets of the 2010 Dietary Guidelines to address obesity in youth. PMID:26762819
Heo, Moonseong; Irvin, Erica; Ostrovsky, Natania; Isasi, Carmen; Blank, Arthur E; Lounsbury, David W; Fredericks, Lynn; Yom, Tiana; Ginsberg, Mindy; Hayes, Shawn; Wylie-Rosett, Judith
2016-02-01
HealthCorps provides school wellness programming using curricula to promote changes in nutrition, mental health, and physical activity behaviors. The research objective was to evaluate effects of implementing its curricula on nutrition, mental health, and physical activity knowledge and behavior. Pre- and postsurvey data were collected (N = 2255) during the 2012-2013 academic year from 14 New York City public high schools. An 18-item knowledge questionnaire addressed 3 domains; 26 behavioral items were analyzed by factor analysis to identify 6 behavior domains, breakfast being a seventh 1-item domain. We examined the effects stratified by sex, applying mixed-effects models to take into account clustering effects of schools and participants adjusted for age. The HealthCorps program significantly increased all 3 knowledge domains (p < .05), and significantly changed several key behavioral domains. Boys significantly increased fruits/vegetables intake (p = .03). Girls increased acceptance of new fruits/vegetables (p = .03) and breakfast consumption (p = .04), and decreased sugar-sweetened beverages and energy dense food intake (p = .03). The associations between knowledge and behavior were stronger in boys than girls. The HealthCorps program significantly increased participants' knowledge on nutrition, mental health, and physical activity. It also improved several key behavioral domains, which are targets of the 2010 Dietary Guidelines to address obesity in youth. © 2016, American School Health Association.
Lassetter, Jane H; Macintosh, Christopher I; Williams, Mary; Driessnack, Martha; Ray, Gaye; Wisco, Jonathan J
2018-04-01
The purpose of this study was to develop and assess the psychometric properties for two related questionnaires: the Healthy Eating and Physical Activity Self-Efficacy Questionnaire for Children (HEPASEQ-C) and the Healthy Eating and Physical Activity Behavior Recall Questionnaire for Children (HEPABRQ-C). HEPASEQ-C and HEPABRQ-C were administered to 517 participating children with 492 completing. Data were analyzed to evaluate for reliability and validity of the questionnaires. Content validity was established through a 10-person expert panel. For the HEPASEQ-C, item content validity index (CVI) ranged from 0.80 to 1.00. The CVI for the total questionnaire was 1.0. All HEPASEQ-C items loaded on a single factor. Cronbach's alpha was deemed acceptable (.749). For the HEPABRQ-C, item CVI ranged from 0.88 to 1.00. CVI for the total questionnaire was 1.0. Pearson product moment correlation between HEPASEQ-C and HEPABRQ-C scores was significant (r = .501, p = .000). The HEPASEQ-C and HEPABRQ-C are easily administered and provide helpful insights into children's self-efficacy and behavior recall. They are easy to use and applicable for upper elementary school settings, in clinical settings for individual patients, and in health promotion settings. © 2018 Wiley Periodicals, Inc.
The Development and Validation of a Behaviorally Defined Interest Instrument for Science.
ERIC Educational Resources Information Center
Butzow, John W., Jr.
A semantic differential (SD) instrument, modified by replacing words or noun phrases with phrases describing a behavior, was administered to male freshmen students. Six items discriminated between two groups, 97 science majors and 161 non-science majors, on three axes, labelled as evaluation, potency, and activity. To test whether the instrument…
Extension Professionals' Strengths and Needs Related to Nutrition and Health Programs
ERIC Educational Resources Information Center
Peña-Purcell, Ninfa; Bowen, Elaine; Zoumenou, Virginie; Schuster, Ellen R.; Boggess, May; Manore, Melinda M.; Gerrior, Shirley A.
2012-01-01
We report results of a Web-based nationwide survey of nutrition and health Extension specialists representing 42 states. Survey items (n = 36) assessed five areas: curriculum review, nutrition and physical activity, professional training, communication, and evaluation. An internal curriculum review was common, but few states shared their criteria…
Monetary and affective judgments of consumer goods: modes of evaluation matter.
Seta, John J; Seta, Catherine E; McCormick, Michael; Gallagher, Ashleigh H
2014-01-01
Participants who evaluated 2 positively valued items separately reported more positive attraction (using affective and monetary measures) than those who evaluated the same two items as a unit. In Experiments 1-3, this separate/unitary evaluation effect was obtained when participants evaluated products that they were purchasing for a friend. Similar findings were obtained in Experiments 4 and 5 when we considered the amount participants were willing to spend to purchase insurance for items that they currently owned. The averaging/summation model was contrasted with several theoretical perspectives and implicated averaging and summation integration processes in how items are evaluated. The procedural and theoretical similarities and differences between this work and related research on unpacking, comparison processes, public goods, and price bundling are discussed. Overall, the results support the operation of integration processes and contribute to an understanding of how these processes influence the evaluation and valuation of private goods.
The hippocampus supports both recollection and familiarity when memories are strong
Smith, Christine N.; Wixted, John T.; Squire, Larry R.
2011-01-01
Recognition memory is thought to consist of two component processes – recollection and familiarity. It has been suggested that the hippocampus supports recollection, while adjacent cortex supports familiarity. However, the qualitative experiences of recollection and familiarity are typically confounded with a quantitative difference in memory strength (recollection > familiarity). Thus, the question remains whether the hippocampus might in fact support familiarity-based memories whenever they are as strong as recollection-based memories. We addressed this problem in a novel way using the Remember/Know procedure where we could explicitly match the confidence and accuracy of Remember and Know decisions. As in earlier studies, recollected items had higher accuracy and confidence than familiar items, and hippocampal activity was higher for recollected items than for familiar items. Furthermore hippocampal activity was similar for familiar items, misses, and correct rejections. When the accuracy and confidence of recollected and familiar items were matched, the findings were dramatically different. Hippocampal activity was now similar for recollected and familiar items. Importantly, hippocampal activity was also greater for familiar items than for misses or correct rejections (as well as for recollected items vs. misses or correct rejections). Our findings suggest that the hippocampus supports both recollection and familiarity when memories are strong. PMID:22049412
Khoiriyah, Umatul; Roberts, Chris; Jorm, Christine; Van der Vleuten, C P M
2015-08-26
Problem based learning (PBL) is a powerful learning activity but fidelity to intended models may slip and student engagement wane, negatively impacting learning processes, and outcomes. One potential solution to solve this degradation is by encouraging self-assessment in the PBL tutorial. Self-assessment is a central component of the self-regulation of student learning behaviours. There are few measures to investigate self-assessment relevant to PBL processes. We developed a Self-assessment Scale on Active Learning and Critical Thinking (SSACT) to address this gap. We wished to demonstrated evidence of its validity in the context of PBL by exploring its internal structure. We used a mixed methods approach to scale development. We developed scale items from a qualitative investigation, literature review, and consideration of previous existing tools used for study of the PBL process. Expert review panels evaluated its content; a process of validation subsequently reduced the pool of items. We used structural equation modelling to undertake a confirmatory factor analysis (CFA) of the SSACT and coefficient alpha. The 14 item SSACT consisted of two domains "active learning" and "critical thinking." The factorial validity of SSACT was evidenced by all items loading significantly on their expected factors, a good model fit for the data, and good stability across two independent samples. Each subscale had good internal reliability (>0.8) and strongly correlated with each other. The SSACT has sufficient evidence of its validity to support its use in the PBL process to encourage students to self-assess. The implementation of the SSACT may assist students to improve the quality of their learning in achieving PBL goals such as critical thinking and self-directed learning.
ERIC Educational Resources Information Center
Lu, Ru; Haberman, Shelby; Guo, Hongwen; Liu, Jinghua
2015-01-01
In this study, we apply jackknifing to anchor items to evaluate the impact of anchor selection on equating stability. In an ideal world, the choice of anchor items should have little impact on equating results. When this ideal does not correspond to reality, selection of anchor items can strongly influence equating results. This influence does not…
Meta-analytic guidelines for evaluating single-item reliabilities of personality instruments.
Spörrle, Matthias; Bekk, Magdalena
2014-06-01
Personality is an important predictor of various outcomes in many social science disciplines. However, when personality traits are not the principal focus of research, for example, in global comparative surveys, it is often not possible to assess them extensively. In this article, we first provide an overview of the advantages and challenges of single-item measures of personality, a rationale for their construction, and a summary of alternative ways of assessing their reliability. Second, using seven diverse samples (Ntotal = 4,263) we develop the SIMP-G, the German adaptation of the Single-Item Measures of Personality, an instrument assessing the Big Five with one item per trait, and evaluate its validity and reliability. Third, we integrate previous research and our data into a first meta-analysis of single-item reliabilities of personality measures, and provide researchers with guidelines and recommendations for the evaluation of single-item reliabilities. © The Author(s) 2013.
Psychometric Evaluation of the Hypogonadism Impact of Symptoms Questionnaire Short Form (HIS-Q-SF).
Gelhorn, Heather L; Roberts, Laurie J; Khandelwal, Nikhil; Revicki, Dennis A; DeRogatis, Leonard R; Dobs, Adrian; Hepp, Zsolt; Miller, Michael G
2017-08-01
The Hypogonadism Impact of Symptoms Questionnaire Short Form (HIS-Q-SF) is a patient-reported outcome measurement designed to evaluate the symptoms of hypogonadism. The HIS-Q-SF is an abbreviated version including17 items from the original 28-item HIS-Q. To conduct item analyses and reduction, evaluate the psychometric properties of the HIS-Q-SF, and provide guidance on score interpretation. A 12-week observational longitudinal study of hypogonadal men was conducted as part of the original HIS-Q psychometric evaluation. Participants completed the original HIS-Q every 2 weeks. Blood samples were collected to evaluate testosterone levels. Participants completed the Aging Male's Symptoms Scale, the International Index of Erectile Function, the Short Form-12, and the PROMIS Sexual Activity, Satisfaction with Sex Life, Sleep Disturbance, and Applied Cognition Scales (baseline and weeks 6 and 12). Clinicians completed the Clinical Global Impression of Severity and Change scales and a clinical form. Item performance was evaluated using descriptive statistics and Rasch analyses. Reliability (internal consistency and test-retest), validity (concurrent and know groups), and responsiveness were assessed. One hundred seventy-seven men participated (mean age = 54.1 years, range = 23-83). Similar to the full HIS-Q, the final abbreviated HIS-Q-SF instrument includes five domains (sexual, energy, sleep, cognition, and mood) with two sexual subdomains (libido and sexual function). For key domains, test-retest reliability was very good, and construct validity was good for all domains. Known-groups validity was demonstrated for all domain scores, subdomain scores, and total score based on the Clinical Global Impression-Severity. All domains and subdomains were responsive to change based on patient-rated anchor questions. The HIS-Q-SF could be a useful tool in clinical practice, epidemiologic studies, and other academic research settings. Careful consideration was given to the selection of the final HIS-Q-SF items based on quantitative data and clinical expert feedback. Overall, the reduced set of items demonstrated strong psychometric properties. Testosterone levels for the participating men were not as low as anticipated, which could have limited the ability to examine the relations between the HIS-Q-SF and testosterone levels. Further, the analyses used data collected through administration of the full HIS-Q, and future studies should administer the standalone HIS-Q-SF to replicate the psychometric analyses reported in the present study. Similar to the original HIS-Q, the HIS-Q-SF has evidence supporting reliability, validity, and responsiveness. The short form includes a smaller set of items that might be more suitable for use in clinical practice or academic research settings. Gelhorn HL, Roberts LJ, Khandelwal N, et al. Psychometric Evaluation of the Hypogonadism Impact of Symptoms Questionnaire Short Form (HIS-Q-SF). J Sex Med 2017;14:1046-1058. Copyright © 2017 International Society for Sexual Medicine. Published by Elsevier Inc. All rights reserved.
Consensus on Quality Indicators of Postgraduate Medical E-Learning: Delphi Study
Walsh, Kieran; Westerman, Michiel; Scheele, Fedde
2018-01-01
Background The progressive use of e-learning in postgraduate medical education calls for useful quality indicators. Many evaluation tools exist. However, these are diversely used and their empirical foundation is often lacking. Objective We aimed to identify an empirically founded set of quality indicators to set the bar for “good enough” e-learning. Methods We performed a Delphi procedure with a group of 13 international education experts and 10 experienced users of e-learning. The questionnaire started with 57 items. These items were the result of a previous literature review and focus group study performed with experts and users. Consensus was met when a rate of agreement of more than two-thirds was achieved. Results In the first round, the participants accepted 37 items of the 57 as important, reached no consensus on 20, and added 15 new items. In the second round, we added the comments from the first round to the items on which there was no consensus and added the 15 new items. After this round, a total of 72 items were addressed and, of these, 37 items were accepted and 34 were rejected due to lack of consensus. Conclusions This study produced a list of 37 items that can form the basis of an evaluation tool to evaluate postgraduate medical e-learning. This is, to our knowledge, the first time that quality indicators for postgraduate medical e-learning have been defined and validated. The next step is to create and validate an e-learning evaluation tool from these items. PMID:29699970
Edelbring, Samuel
2012-08-15
The degree of learners' self-regulated learning and dependence on external regulation influence learning processes in higher education. These regulation strategies are commonly measured by questionnaires developed in other settings than in which they are being used, thereby requiring renewed validation. The aim of this study was to psychometrically evaluate the learning regulation strategy scales from the Inventory of Learning Styles with Swedish medical students (N = 206). The regulation scales were evaluated regarding their reliability, scale dimensionality and interrelations. The primary evaluation focused on dimensionality and was performed with Mokken scale analysis. To assist future scale refinement, additional item analysis, such as item-to-scale correlations, was performed. Scale scores in the Swedish sample displayed good reliability in relation to published results: Cronbach's alpha: 0.82, 0.72, and 0.65 for self-regulation, external regulation and lack of regulation scales respectively. The dimensionalities in scales were adequate for self-regulation and its subscales, whereas external regulation and lack of regulation displayed less unidimensionality. The established theoretical scales were largely replicated in the exploratory analysis. The item analysis identified two items that contributed little to their respective scales. The results indicate that these scales have an adequate capacity for detecting the three theoretically proposed learning regulation strategies in the medical education sample. Further construct validity should be sought by interpreting scale scores in relation to specific learning activities. Using established scales for measuring students' regulation strategies enables a broad empirical base for increasing knowledge on regulation strategies in relation to different disciplinary settings and contributes to theoretical development.
Development and validation of the patient evaluation scale (PES) for primary health care in Nigeria.
Ogaji, Daprim S; Giles, Sally; Daker-White, Gavin; Bower, Peter
2017-03-01
Questionnaires developed for patient evaluation of the quality of primary care are often focussed on primary care systems in developed countries. Aim To report the development and validation of the patient evaluation scale (PES) designed for use in the Nigerian primary health care context. An iterative process was used to develop and validate the questionnaire using patients attending 28 primary health centres across eight states in Nigeria. The development involved literature review, patient interviews, expert reviews, cognitive testing with patients and waves of quantitative cross-sectional surveys. The questionnaire's content validity, internal structures, acceptability, reliability and construct validity are reported. Findings The full and shortened version of PES with 27 and 18 items, respectively, were developed through these process. The low item non-response from the serial cross-sectional surveys depicts questionnaire's acceptability among the local population. PES-short form (SF) has Cronbach's α of 0.87 and three domains (codenamed 'facility', 'organisation' and 'health care') with Cronbach's αs of 0.78, 0.79 and 0.81, respectively. Items in the multi-dimensional questionnaire demonstrated adequate convergent and discriminant properties. PES-SF scores show significant positive correlation with scores of the full PES and also discriminated population groups in support of a priori hypotheses. The PES and PES-SF contain items that are relevant to the needs of patients in Nigeria. The good measurement properties of the questionnaire demonstrates its potential usefulness for patient-focussed quality improvement activities in Nigeria. There is still need to translate these questionnaires into major languages in Nigeria and assess their validity against external quality criteria.
Murray, Aileen; Hall, Amanda; Williams, Geoffrey C; McDonough, Suzanne M; Ntoumanis, Nikos; Taylor, Ian; Jackson, Ben; Copsey, Bethan; Hurley, Deirdre A; Matthews, James
2018-02-27
To assess the inter-rater reliability and concurrent validity of the Communication Evaluation in Rehabilitation Tool, which aims to externally assess physiotherapists competency in using Self-Determination Theory-based communication strategies in practice. Audio recordings of initial consultations between 24 physiotherapists and 24 patients with chronic low back pain in four hospitals in Ireland were obtained as part of a larger randomised controlled trial. Three raters, all of whom had Ph.Ds in psychology and expertise in motivation and physical activity, independently listened to the 24 audio recordings and completed the 18-item Communication Evaluation in Rehabilitation Tool. Inter-rater reliability between all three raters was assessed using intraclass correlation coefficients. Concurrent validity was assessed using Pearson's r correlations with a reference standard, the Health Care Climate Questionnaire. The total score for the Communication Evaluation in Rehabilitation Tool is an average of all 18 items. Total scores demonstrated good inter-rater reliability (Intraclass Correlation Coefficient (ICC) = 0.8) and concurrent validity with the Health Care Climate Questionnaire total score (range: r = 0.7-0.88). Item-level scores of the Communication Evaluation in Rehabilitation Tool identified five items that need improvement. Results provide preliminary evidence to support future use and testing of the Communication Evaluation in Rehabilitation Tool. Implications for Rehabilitation Promoting patient autonomy is a learned skill and while interventions exist to train clinicians in these skills there are no tools to assess how well clinicians use these skills when interacting with a patient. The lack of robust assessment has severe implications regarding both the fidelity of clinician training packages and resulting outcomes for promoting patient autonomy. This study has developed a novel measurement tool Communication Evaluation in Rehabilitation Tool and a comprehensive user manual to assess how well health care providers use autonomy-supportive communication strategies in real world-clinical settings. This tool has demonstrated good inter-rater reliability and concurrent validity in its initial testing phase. The Communication Evaluation in Rehabilitation Tool can be used in future studies to assess autonomy-supportive communication and undergo further measurement property testing as per our recommendations.
Sabari, Joyce S.; Woodbury, Michelle; Velozo, Craig A.
2014-01-01
Objectives. (1) To develop two independent measurement scales for use as items assessing hand movements and hand activities within the Motor Assessment Scale (MAS), an existing instrument used for clinical assessment of motor performance in stroke survivors; (2) To examine the psychometric properties of these new measurement scales. Design. Scale development, followed by a multicenter observational study. Setting. Inpatient and outpatient occupational therapy programs in eight hospital and rehabilitation facilities in the United States and Canada. Participants. Patients (N = 332) receiving stroke rehabilitation following left (52%) or right (48%) cerebrovascular accident; mean age 64.2 years (sd 15); median 1 month since stroke onset. Intervention. Not applicable. Main Outcome Measures. Data were tested for unidimensionality and reliability, and behavioral criteria were ordered according to difficulty level with Rasch analysis. Results. The new scales assessing hand movements and hand activities met Rasch expectations of unidimensionality and reliability. Conclusion. Following a multistep process of test development, analysis, and refinement, we have redesigned the two scales that comprise the hand function items on the MAS. The hand movement scale contains an empirically validated 10-behavior hierarchy and the hand activities item contains an empirically validated 8-behavior hierarchy. PMID:25177513
Elmore, Kim; Flanagan, Barry; Jones, Nicholas F; Heitgerd, Janet L
2010-04-01
In 2008, CDC convened an expert panel to gather input on the use of geospatial science in surveillance, research and program activities focused on CDC's Healthy Communities Goal. The panel suggested six priorities: spatially enable and strengthen public health surveillance infrastructure; develop metrics for geospatial categorization of community health and health inequity; evaluate the feasibility and validity of standard metrics of community health and health inequities; support and develop GIScience and geospatial analysis; provide geospatial capacity building, training and education; and, engage non-traditional partners. Following the meeting, the strategies and action items suggested by the expert panel were reviewed by a CDC subcommittee to determine priorities relative to ongoing CDC geospatial activities, recognizing that many activities may need to occur either in parallel, or occur multiple times across phases. Phase A of the action items centers on developing leadership support. Phase B focuses on developing internal and external capacity in both physical (e.g., software and hardware) and intellectual infrastructure. Phase C of the action items plan concerns the development and integration of geospatial methods. In summary, the panel members provided critical input to the development of CDC's strategic thinking on integrating geospatial methods and research issues across program efforts in support of its Healthy Communities Goal.
A Psychometric Evaluation of the Threadgold Communication Tool for Persons with Dementia
Strøm, Benedicte Sørensen; Engedal, Knut; Grov, Ellen-Karine
2016-01-01
Background The objective of this study was to investigate the psychometric properties of the Threadgold Communication Tool (TCT). Method Internal consistency reliability was measured using Cronbach's α coefficient and inter-item correlation. Test-retest was performed to examine the instrument's stability. Exploratory principal component analysis (PCA) with oblimin rotation was carried out to evaluate construct validity. Finally, the score on each item of the TCT was correlated with the person's Mini Mental State Examination (MMSE) and Barthel Index of activities of daily living scores. Results A total of 51 persons participated, with a mean age of 86.7 (SD 6.6) years, of whom 46 were women with moderate-to-severe dementia [mean MMSE score 7.5 (SD 6.7)]. There were two measurement points 2 weeks apart. The results showed a satisfactory level for internal consistency and a high test-retest reliability (r = 0.76). The corrected item-total correlation ranged between 0.50 and 0.87, and a two-factor structure was revealed at the PCA. ‘Vocalizing’ seemed to measure another aspect of communication and was the only item which was negatively loaded. Conclusion Despite the low sample size in this study, the results revealed the TCT as a reliable and valid instrument, suitable for measuring communication among people with dementia. We suggest clarifying the understanding of ‘vocalizing’ before considering removing it from the scale. PMID:27239188
Gender Differences in Perception of Romance in Chinese College Students
Yin, Jie; Zhang, John X.; Xie, Jing; Zou, Zhiling; Huang, Xiting
2013-01-01
Women often complain that their partners are not romantic enough. This raises the question: how romance is recognized and evaluated in a love relationship? However, there has been essentially no empirical research bearing on this issue. The present set of studies examined possible gender differences in perceptions of romance and the associated neural mechanisms in Chinese college students. In Study 1, 303 participants (198 women, 105 men) were administrated a questionnaire consisting of 60 sentences and required to rate the romance level of each sentence. Results showed higher rating scores in males than females for low romance items, but not for high or medium romance items. In Study 2, 69 participants (37 women, 32 men) were recruited to judge the degree of romance in sentences presented on a computer screen one by one. Compared with females, males again showed higher scores and responded more slowly only to low romance items. In Study 3, 36 participants (18 women, 18 men) currently in love with someone were scanned with functional MRI while they did the romance judgment task from Study 2. Compared with females, greater brain activation was found for males in the frontal lobe, precentral gyrus, precuneus and parahippocampal gyrus for low romance items. The results provide the first piece of evidence for gender differences in romance perception, suggesting enhanced cognitive processing in males when evaluating the degree of romance in romantic scenes. PMID:24146853
Hassett, Afton L; Li, Tracy; Buyske, Steven; Savage, Shantal V; Gignac, Monique A M
2008-05-01
To consider the feasibility of assessing multiple facets of independence in rheumatoid arthritis (RA) using a measure developed from existing items and examining its face validity, construct validity and responsiveness to change. The ATTAIN (Abatacept Trial in Treatment of Anti-tumor necrosis factor [TNF] Inadequate responders) database was used. Patients with RA were randomized 2:1, abatacept (n = 258) and placebo (n = 133). A multi-faceted scale to measure physical and psychosocial independence was constructed using items from the Health Assessment Questionnaire (HAQ) and Short Form 36 Health Survey (SF-36). Questions assessing activity limitations and need for outside caregiver help were also examined. Interviews with 20 RA patients assessed face validity. Item Response Theory analysis yielded two traits - 'Psychosocial Independence', derived from the number of days with activity limitations plus the Role Emotional, Social Functioning and Role Physical subscale items from the SF-36; and 'Physical Independence', derived from 15 HAQ items assessing need for help from another. The two traits showed no significant differential item functioning for age or gender and demonstrated good face validity. Changes over 169 days on Psychosocial Independence were greater (mean 0.46 units, 95% confidence interval [CI]: 0.17-0.75) for the abatacept group than for placebo (p = 0.002). Changes in Physical Independence were greater (mean 0.59 units, 95% CI: 0.35-0.82) for the abatacept group than for placebo (p < 0.001). The multi-faceted assessment of independence in RA based on items from commonly used instruments is feasible suggesting promise for evaluating independence in future clinical trials. This approach demonstrated good face and construct validity and responsiveness in RA patients who had previously failed anti-TNF therapy. However, we caution against an interpretation that these data suggest that abatacept improves independence because the component parts of this assessment came from instruments used in the ATTAIN trial where data had been previously analyzed.
Development and reliability testing of the Worksite and Energy Balance Survey.
Hoehner, Christine M; Budd, Elizabeth L; Marx, Christine M; Dodson, Elizabeth A; Brownson, Ross C
2013-01-01
Worksites represent important venues for health promotion. Development of psychometrically sound measures of worksite environments and policy supports for physical activity and healthy eating are needed for use in public health research and practice. Assess the test-retest reliability of the Worksite and Energy Balance Survey (WEBS), a self-report instrument for assessing perceptions of worksite supports for physical activity and healthy eating. The WEBS included items adapted from existing surveys or new items on the basis of a review of the literature and expert review. Cognitive interviews among 12 individuals were used to test the clarity of items and further refine the instrument. A targeted random-digit-dial telephone survey was administered on 2 occasions to assess test-retest reliability (mean days between time periods = 8; minimum = 5; maximum = 14). Five Missouri census tracts that varied by racial-ethnic composition and walkability. Respondents included 104 employed adults (67% white, 64% women, mean age = 48.6 years). Sixty-three percent were employed at worksites with less than 100 employees, approximately one-third supervised other people, and the majority worked a regular daytime shift (75%). Test-retest reliability was assessed using Spearman correlations for continuous variables, Cohen's κ statistics for nonordinal categorical variables, and 1-way random intraclass correlation coefficients for ordinal categorical variables. Test-retest coefficients ranged from 0.41 to 0.97, with 80% of items having reliability coefficients of more than 0.6. Items that assessed participation in or use of worksite programs/facilities tended to have lower reliability. Reliability of some items varied by gender, obesity status, and worksite size. Test-retest reliability and internal consistency for the 5 scales ranged from 0.84 to 0.94 and 0.63 to 0.84, respectively. The WEBS items and scales exhibited sound test-retest reliability and may be useful for research and surveillance. Further evaluation is needed to document the validity of the WEBS and associations with energy balance outcomes.
ERIC Educational Resources Information Center
Li, Yanmei
2012-01-01
In a common-item (anchor) equating design, the common items should be evaluated for item parameter drift. Drifted items are often removed. For a test that contains mostly dichotomous items and only a small number of polytomous items, removing some drifted polytomous anchor items may result in anchor sets that no longer resemble mini-versions of…
Evaluating the Psychometric Characteristics of Generated Multiple-Choice Test Items
ERIC Educational Resources Information Center
Gierl, Mark J.; Lai, Hollis; Pugh, Debra; Touchie, Claire; Boulais, André-Philippe; De Champlain, André
2016-01-01
Item development is a time- and resource-intensive process. Automatic item generation integrates cognitive modeling with computer technology to systematically generate test items. To date, however, items generated using cognitive modeling procedures have received limited use in operational testing situations. As a result, the psychometric…
Using a Linear Regression Method to Detect Outliers in IRT Common Item Equating
ERIC Educational Resources Information Center
He, Yong; Cui, Zhongmin; Fang, Yu; Chen, Hanwei
2013-01-01
Common test items play an important role in equating alternate test forms under the common item nonequivalent groups design. When the item response theory (IRT) method is applied in equating, inconsistent item parameter estimates among common items can lead to large bias in equated scores. It is prudent to evaluate inconsistency in parameter…
ERIC Educational Resources Information Center
Fukuhara, Hirotaka; Kamata, Akihito
2011-01-01
A differential item functioning (DIF) detection method for testlet-based data was proposed and evaluated in this study. The proposed DIF model is an extension of a bifactor multidimensional item response theory (MIRT) model for testlets. Unlike traditional item response theory (IRT) DIF models, the proposed model takes testlet effects into…
ERIC Educational Resources Information Center
Scheuneman, Janice Dowd; Gerritz, Kalle
1990-01-01
Differential item functioning (DIF) methodology for revealing sources of item difficulty and performance characteristics of different groups was explored. A total of 150 Scholastic Aptitude Test items and 132 Graduate Record Examination general test items were analyzed. DIF was evaluated for males and females and Blacks and Whites. (SLD)
Classical Item Analysis Using Latent Variable Modeling: A Note on a Direct Evaluation Procedure
ERIC Educational Resources Information Center
Raykov, Tenko; Marcoulides, George A.
2011-01-01
A directly applicable latent variable modeling procedure for classical item analysis is outlined. The method allows one to point and interval estimate item difficulty, item correlations, and item-total correlations for composites consisting of categorical items. The approach is readily employed in empirical research and as a by-product permits…
ERIC Educational Resources Information Center
Tay, Louis; Vermunt, Jeroen K.; Wang, Chun
2013-01-01
We evaluate the item response theory with covariates (IRT-C) procedure for assessing differential item functioning (DIF) without preknowledge of anchor items (Tay, Newman, & Vermunt, 2011). This procedure begins with a fully constrained baseline model, and candidate items are tested for uniform and/or nonuniform DIF using the Wald statistic.…
Smolen, Josef S; Breedveld, Ferdinand C; Burmester, Gerd R; Bykerk, Vivian; Dougados, Maxime; Emery, Paul; Kvien, Tore K; Navarro-Compán, M Victoria; Oliver, Susan; Schoels, Monika; Scholte-Voshaar, Marieke; Stamm, Tanja; Stoffer, Michaela; Takeuchi, Tsutomu; Aletaha, Daniel; Andreu, Jose Louis; Aringer, Martin; Bergman, Martin; Betteridge, Neil; Bijlsma, Hans; Burkhardt, Harald; Combe, Bernard; Durez, Patrick; Fonseca, Joao Eurico; Gibofsky, Alan; Gomez-Reino, Juan J; Graninger, Winfried; Hannonen, Pekka; Haraoui, Boulos; Kouloumas, Marios; Landewe, Robert; Martin-Mola, Emilio; Nash, Peter; Ostergaard, Mikkel; Östör, Andrew; Richards, Pam; Sokka-Isler, Tuulikki; Thorne, Carter; Tzioufas, Athanasios G; van Vollenhoven, Ronald; de Wit, Martinus
2016-01-01
Background Reaching the therapeutic target of remission or low-disease activity has improved outcomes in patients with rheumatoid arthritis (RA) significantly. The treat-to-target recommendations, formulated in 2010, have provided a basis for implementation of a strategic approach towards this therapeutic goal in routine clinical practice, but these recommendations need to be re-evaluated for appropriateness and practicability in the light of new insights. Objective To update the 2010 treat-to-target recommendations based on systematic literature reviews (SLR) and expert opinion. Methods A task force of rheumatologists, patients and a nurse specialist assessed the SLR results and evaluated the individual items of the 2010 recommendations accordingly, reformulating many of the items. These were subsequently discussed, amended and voted upon by >40 experts, including 5 patients, from various regions of the world. Levels of evidence, strengths of recommendations and levels of agreement were derived. Results The update resulted in 4 overarching principles and 10 recommendations. The previous recommendations were partly adapted and their order changed as deemed appropriate in terms of importance in the view of the experts. The SLR had now provided also data for the effectiveness of targeting low-disease activity or remission in established rather than only early disease. The role of comorbidities, including their potential to preclude treatment intensification, was highlighted more strongly than before. The treatment aim was again defined as remission with low-disease activity being an alternative goal especially in patients with long-standing disease. Regular follow-up (every 1–3 months during active disease) with according therapeutic adaptations to reach the desired state was recommended. Follow-up examinations ought to employ composite measures of disease activity that include joint counts. Additional items provide further details for particular aspects of the disease, especially comorbidity and shared decision-making with the patient. Levels of evidence had increased for many items compared with the 2010 recommendations, and levels of agreement were very high for most of the individual recommendations (≥9/10). Conclusions The 4 overarching principles and 10 recommendations are based on stronger evidence than before and are supposed to inform patients, rheumatologists and other stakeholders about strategies to reach optimal outcomes of RA. PMID:25969430
Doralp, Samantha; Bartlett, Doreen
2014-08-01
This paper highlights the development and testing of the Infant Movement Motivation Questionnaire (IMMQ), an instrument designed to evaluate qualities of infant characteristics that relate specifically to early motor development. The measurement development process included three phases: item generation, pilot testing and evaluation of acceptability and feasibility for parents and exploratory factor analysis. The resultant 27-item questionnaire is designed for completion by parents and contains four factors including Activity, Exploration, Motivation and Adaptability. Overall, the internal consistency of the IMMQ is 0.89 (Cronbach's alpha), with test-retest reliability measured at 0.92 (ICC, with 95% CI 0.83-0.96). Further work could be done to strengthen the individual factors; however it is adequate for use in its full form. The IMMQ can be used for clinical or research purposes, as well as an educational tool for parents. Copyright © 2014 Elsevier Inc. All rights reserved.
Calibration of the Spanish PROMIS Smoking Item Banks.
Huang, Wenjing; Stucky, Brian D; Edelen, Maria O; Tucker, Joan S; Shadel, William G; Hansen, Mark; Cai, Li
2016-07-01
The Patient-Reported Outcomes Measurement Information System (PROMIS) Smoking Initiative has developed item banks for assessing six smoking behaviors and biopsychosocial correlates of smoking among adult cigarette smokers. The goal of this study is to evaluate the performance of the Spanish version of the PROMIS smoking item banks as compared to the original banks developed in English. The six PROMIS banks for daily smokers were translated into Spanish and administered to a sample of Spanish-speaking adult daily smokers in the United States (N = 302). We first evaluated the unidimensionality of each bank using confirmatory factor analysis. We then conducted a two-group item response theory calibration, including an item response theory-based Differential Item Functioning (DIF) analysis by language of administration (Spanish vs. English). Finally, we generated full bank and short form scores for the translated banks and evaluated their psychometric performance. Unidimensionality of the Spanish smoking item banks was supported by confirmatory factor analysis results. Out of a total of 109 items that were evaluated for language DIF, seven items in three of the six banks were identified as having levels of DIF that exceeded an established criterion. The psychometric performance of the Spanish daily smoker banks is largely comparable to that of the English versions. The Spanish PROMIS smoking item banks are highly similar, but not entirely equivalent, to the original English versions. The parameters from these two-group calibrations can be used to generate comparable bank scores across the two language versions. In this study, we developed a Spanish version of the PROMIS smoking toolkit, which was originally designed and developed for English speakers. With the growing Spanish-speaking population, it is important to make the toolkit more accessible by translating the items and calibrating the Spanish version to be comparable with English-language scores. This study provided the translated item banks and short forms, comparable unbiased scores for Spanish speakers and evaluations of the psychometric properties of the new Spanish toolkit. © The Author 2016. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Petrillo, Jennifer; Cano, Stefan J; McLeod, Lori D; Coon, Cheryl D
2015-01-01
To provide comparisons and a worked example of item- and scale-level evaluations based on three psychometric methods used in patient-reported outcome development-classical test theory (CTT), item response theory (IRT), and Rasch measurement theory (RMT)-in an analysis of the National Eye Institute Visual Functioning Questionnaire (VFQ-25). Baseline VFQ-25 data from 240 participants with diabetic macular edema from a randomized, double-masked, multicenter clinical trial were used to evaluate the VFQ at the total score level. CTT, RMT, and IRT evaluations were conducted, and results were assessed in a head-to-head comparison. Results were similar across the three methods, with IRT and RMT providing more detailed diagnostic information on how to improve the scale. CTT led to the identification of two problematic items that threaten the validity of the overall scale score, sets of redundant items, and skewed response categories. IRT and RMT additionally identified poor fit for one item, many locally dependent items, poor targeting, and disordering of over half the response categories. Selection of a psychometric approach depends on many factors. Researchers should justify their evaluation method and consider the intended audience. If the instrument is being developed for descriptive purposes and on a restricted budget, a cursory examination of the CTT-based psychometric properties may be all that is possible. In a high-stakes situation, such as the development of a patient-reported outcome instrument for consideration in pharmaceutical labeling, however, a thorough psychometric evaluation including IRT or RMT should be considered, with final item-level decisions made on the basis of both quantitative and qualitative results. Copyright © 2015. Published by Elsevier Inc.
Psychometrics of the self-report safe driving behavior measure for older adults.
Classen, Sherrilene; Wen, Pey-Shan; Velozo, Craig A; Bédard, Michel; Winter, Sandra M; Brumback, Babette; Lanford, Desiree N
2012-01-01
We investigated the psychometric properties of the 68-item Safe Driving Behavior Measure (SDBM) with 80 older drivers, 80 caregivers, and 2 evaluators from two sites. Using Rasch analysis, we examined unidimensionality and local dependence; rating scale; item- and person-level psychometrics; and item hierarchy of older drivers, caregivers, and driving evaluators who had completed the SDBM. The evidence suggested the SDBM is unidimensional, but pairs of items showed local dependency. Across the three rater groups, the data showed good person (≥3.4) and item (≥3.6) separation as well as good person (≥.93) and item reliability (≥.92). Cronbach's α was ≥.96, and few items were misfitting. Some of the items did not follow the hypothesized order of item difficulty. The SDBM classified the older drivers into six ability levels, but to fully calibrate the instrument it must be refined in terms of its items (e.g., item exclusion) and then tested among participants of lesser ability. Copyright © 2012 by the American Occupational Therapy Association, Inc.
An Evaluation of Three Approximate Item Response Theory Models for Equating Test Scores.
ERIC Educational Resources Information Center
Marco, Gary L.; And Others
Three item response models were evaluated for estimating item parameters and equating test scores. The models, which approximated the traditional three-parameter model, included: (1) the Rasch one-parameter model, operationalized in the BICAL computer program; (2) an approximate three-parameter logistic model based on coarse group data divided…
2011-01-01
Background For hospital accreditation and health promotion reasons, we examined whether the 22-item Job Content Questionnaire (JCQ) could be applied to evaluate job strain of individual hospital employees and to determine the number of factors extracted from JCQ. Additionally, we developed an Excel module of self-evaluation diagnostic system for consultation with experts. Methods To develop an Excel-based self-evaluation diagnostic system for consultation to experts to make job strain assessment easier and quicker than ever, Rasch rating scale model was used to analyze data from 1,644 hospital employees who enrolled in 2008 for a job strain survey. We determined whether the 22-item Job Content Questionnaire (JCQ) could evaluate job strain of individual employees in work sites. The respective item responding to specific groups' occupational hazards causing job stress was investigated by using skewness coefficient with its 95% CI through item-by-item analyses. Results Each of those 22 items on the questionnaire was examined to have five factors. The prevalence rate of Chinese hospital workers with high job strain was 16.5%. Conclusions Graphical representations of four quadrants, item-by-item bar chart plots and skewness 95% CI comparison generated in Excel can help employers and consultants of an organization focusing on a small number of key areas of concern for each worker in job strain. PMID:21682912
Chien, Tsair-Wei; Lai, Wen-Pin; Wang, Hsien-Yi; Hsu, Sen-Yen; Castillo, Roberto Vasquez; Guo, How-Ran; Chen, Shih-Chung; Su, Shih-Bin
2011-06-18
For hospital accreditation and health promotion reasons, we examined whether the 22-item Job Content Questionnaire (JCQ) could be applied to evaluate job strain of individual hospital employees and to determine the number of factors extracted from JCQ. Additionally, we developed an Excel module of self-evaluation diagnostic system for consultation with experts. To develop an Excel-based self-evaluation diagnostic system for consultation to experts to make job strain assessment easier and quicker than ever, Rasch rating scale model was used to analyze data from 1,644 hospital employees who enrolled in 2008 for a job strain survey. We determined whether the 22-item Job Content Questionnaire (JCQ) could evaluate job strain of individual employees in work sites. The respective item responding to specific groups' occupational hazards causing job stress was investigated by using skewness coefficient with its 95% CI through item-by-item analyses. Each of those 22 items on the questionnaire was examined to have five factors. The prevalence rate of Chinese hospital workers with high job strain was 16.5%. Graphical representations of four quadrants, item-by-item bar chart plots and skewness 95% CI comparison generated in Excel can help employers and consultants of an organization focusing on a small number of key areas of concern for each worker in job strain.
An evaluation of the "TrEAT Yourself Well" restaurant nutrition campaign.
Acharya, Ram N; Patterson, Paul M; Hill, Esther P; Schmitz, Troy G; Bohm, Erica
2006-06-01
This study examined the effect of the "TrEAT Yourself Well" campaign on diners'menu choices using data from four restaurant chains in California. Within each chain, two locations in the greater San Diego area were selected as experimental sites and either one or two locations outside the greater San Diego area were selected as control sites. Various promotional activities, including in-restaurant promotions, community events, and paid media advertising, were conducted in the experimental region to promote healthy menu entrées. The results show that the campaign was successful in reaching diners and had positive effects on their beliefs and attitudes toward healthy dining. The campaign directly increased the probability of a consumer purchasing a healthy menu item by 3.7% (p = .05). By improving consumer attitudes toward healthy menu items, the campaign indirectly increased purchases of these items by 4.4%.
Guerranti, Cristiana; Cannas, Susanna; Scopetani, Costanza; Fastelli, Paolo; Cincinelli, Alessandra; Renzi, Monia
2017-04-15
During two surveys in 2015 and 2016, sediments samples were collected along the Ombrone river (Maremma Regional Park, province of Grosseto, Italy), in particular at its mouth and in the marine area in front of it, in order to quantify, identify and categorize plastic items (macro, meso and micro-plastics and colour, material etc.) and evaluate their potential sources. The Albegna and Osa rivers were identified as external areas of comparison. The results of the analysis showed different situations, especially as regards fluvial inputs, in addition to evidencing local provisions of plastic material derived from agricultural activities. The microplastics values per kg of sediment and the prevailing type of items found largely varied between the investigated sites (45-1069items/kg dry sample). Copyright © 2017 Elsevier Ltd. All rights reserved.
Gattinger, Heidrun; Senn, Beate; Hantikainen, Virpi; Köpke, Sascha; Ott, Stefan; Leino-Kilpi, Helena
2017-01-01
Impaired mobility is a prevalent condition among care-dependent persons living in nursing homes. Therefore, competence development of nursing staff in mobility care is important. This study aimed to develop and initially test the Kinaesthetics Competence Self-Evaluation (KCSE) scale for assessing nursing staff's competence in mobility care. The KCSE scale was developed based on an analysis of the concept of nurses' competence in kinaesthetics. Kinaesthetics is a training concept that provides theory and practice about movement foundations that comprise activities of daily living. The scale contains 28 items and four subscales (attitude, dynamic state, knowledge and skills). Content validity was assessed by determining the content validity index within two expert panels. Internal consistency and construct validity were tested within a cross-sectional study in three nursing homes in the German-speaking region of Switzerland between September and November 2015. The content validity index for the entire scale was good (0.93). Based on a sample of nursing staff ( n = 180) the internal consistency results were good for the whole scale (Cronbach's alpha = 0.91) and for the subscales knowledge and skills (α = 0.91, 0.86), acceptable for the subscale attitude (α = 0.63) and weak for the subscale dynamic state (α = 0.54). Most items showed acceptable inter-item and item-total correlations. Based on the exploratory factor analysis, four factors explaining 52% of the variance were extracted. The newly developed KCSE scale is a promising instrument for measuring nursing staff's attitude, dynamic state, knowledge, and skills in mobility care based on kinaesthetics. Despite the need for further psychometric evaluation, the KCSE scale can be used in clinical practice to evaluate competence in mobility care based on kinaesthetics and to identify educational needs for nursing staff.
Ogunyemi, Dotun; Eno, Michelle; Rad, Steve; Fong, Alex; Alexander, Carolyn; Azziz, Ricardo
2010-01-01
Objective The purpose of this article was to develop and determine the utility of a compliance form in evaluating and teaching the Accreditation Council for Graduate Medical Education competencies of professionalism, practice-based learning and improvement, and systems-based practice. Methods In 2006, we introduced a 17-item compliance form in an obstetrics and gynecology residency program. The form prospectively monitored residents on attendance at required activities (5 items), accountability of required obligations (9 items), and completion of assigned projects (3 items). Scores were compared to faculty evaluations of residents, resident status as a contributor or a concerning resident, and to the residents' conflict styles, using the Thomas-Kilmann Conflict MODE Instrument. Results Our analysis of 18 residents for academic year 2007–2008 showed a mean (standard error of mean) of 577 (65.3) for postgraduate year (PGY)-1, 692 (42.4) for PGY-2, 535 (23.3) for PGY-3, and 651.6 (37.4) for PGY-4. Non-Hispanic white residents had significantly higher scores on compliance, faculty evaluations on interpersonal and communication skills, and competence in systems-based practice. Contributing residents had significantly higher scores on compliance compared with concerning residents. Senior residents had significantly higher accountability scores compared with junior residents, and junior residents had increased project completion scores. Attendance scores increased and accountability scores decreased significantly between the first and second 6 months of the academic year. There were positive correlations between compliance scores with competing and collaborating conflict styles, and significant negative correlations between compliance with avoiding and accommodating conflict styles. Conclusions Maintaining a compliance form allows residents and residency programs to focus on issues that affect performance and facilitate assessment of the ACGME competencies. Postgraduate year, behavior, and conflict styles appear to be associated with compliance. A lack of association with faculty evaluations suggests measurement of different perceptions of residents' behavior. PMID:21976093
Ogunyemi, Dotun; Eno, Michelle; Rad, Steve; Fong, Alex; Alexander, Carolyn; Azziz, Ricardo
2010-09-01
The purpose of this article was to develop and determine the utility of a compliance form in evaluating and teaching the Accreditation Council for Graduate Medical Education competencies of professionalism, practice-based learning and improvement, and systems-based practice. In 2006, we introduced a 17-item compliance form in an obstetrics and gynecology residency program. The form prospectively monitored residents on attendance at required activities (5 items), accountability of required obligations (9 items), and completion of assigned projects (3 items). Scores were compared to faculty evaluations of residents, resident status as a contributor or a concerning resident, and to the residents' conflict styles, using the Thomas-Kilmann Conflict MODE Instrument. Our analysis of 18 residents for academic year 2007-2008 showed a mean (standard error of mean) of 577 (65.3) for postgraduate year (PGY)-1, 692 (42.4) for PGY-2, 535 (23.3) for PGY-3, and 651.6 (37.4) for PGY-4. Non-Hispanic white residents had significantly higher scores on compliance, faculty evaluations on interpersonal and communication skills, and competence in systems-based practice. Contributing residents had significantly higher scores on compliance compared with concerning residents. Senior residents had significantly higher accountability scores compared with junior residents, and junior residents had increased project completion scores. Attendance scores increased and accountability scores decreased significantly between the first and second 6 months of the academic year. There were positive correlations between compliance scores with competing and collaborating conflict styles, and significant negative correlations between compliance with avoiding and accommodating conflict styles. Maintaining a compliance form allows residents and residency programs to focus on issues that affect performance and facilitate assessment of the ACGME competencies. Postgraduate year, behavior, and conflict styles appear to be associated with compliance. A lack of association with faculty evaluations suggests measurement of different perceptions of residents' behavior.
Sexual behaviors among club drug users: prevalence and reliability
Shacham, Enbal; Cottler, Linda B.
2013-01-01
HIV prevention efforts require a focus on reducing high risk sexual behavior. Because these are self-reported, assessments that reduce memory bias and improve elicitation of data are needed. As part of a multi-site psychometric study of club drug use, abuse, and dependence, data were collected with a test-retest design that measured the reliability of the Washington University Risk Behavior Assessment for Club Drugs (WU-RBA-CD). Reliability was assessed separately by sex via kappa coefficients and intraclass correlation coefficients (ICC); z tests compared coefficients by sex. A total of 603 participants were interviewed by independent assessors with 5 days in between interviews. Reliability for all 51 items of the sexual activity section of the WU-RBA-CD ranged from .23 to 1.00; 71% (n = 36) of items resulted in moderate to high reliability (.55–1.00). Number of lifetime sex partners was consistently reported for same-sex partners for both men and women and opposite-sex partners. Items with high reliability included reporting ever being under the influence of ecstasy (.87) or GHB (.87) while having sex. Items with lower reliability included those that queried the determinants of condom use (.45–.82) and about behaviors and attitudes experienced while using drugs (.23–.87). Very few sex differences were revealed in the reliability of reported sexual activities. Overall, the WU-RBA-CD performed with fairly high reliability rates. Assessing situations of when, how, and why individuals use condoms may offer the clearest evaluation of determinants of sexual behaviors, yet those items are not as reliable. PMID:19757011
Validation of an instrument to evaluate health promotion at schools
Pinto, Raquel Oliveira; Pattussi, Marcos Pascoal; Fontoura, Larissa do Prado; Poletto, Simone; Grapiglia, Valenca Lemes; Balbinot, Alexandre Didó; Teixeira, Vanessa Andina; Horta, Rogério Lessa
2016-01-01
ABSTRACT OBJECTIVE To validate an instrument designed to assess health promotion in the school environment. METHODS A questionnaire, based on guidelines from the World Health Organization and in line with the Brazilian school health context, was developed to validate the research instrument. There were 60 items in the instrument that included 40 questions for the school manager and 20 items with direct observations made by the interviewer. The items’ content validation was performed using the Delphi technique, with the instrument being applied in 53 schools from two medium-sized cities in the South region of Brazil. Reliability (Cronbach’s alpha and split-half) and validity (principal component analysis) analyses were performed. RESULTS The final instrument remained composed of 28 items, distributed into three dimensions: pedagogical, structural and relational. The resulting components showed good factorial loads (> 0.4) and acceptable reliability (> 0.6) for most items. The pedagogical dimension identifies educational activities regarding drugs and sexuality, violence and prejudice, auto care and peace and quality of life. The structural dimension is comprised of access, sanitary structure, and conservation and equipment. The relational dimension includes relationships within the school and with the community. CONCLUSIONS The proposed instrument presents satisfactory validity and reliability values, which include aspects relevant to promote health in schools. Its use allows the description of the health promotion conditions to which students from each educational institution are exposed. Because this instrument includes items directly observed by the investigator, it should only be used during periods when there are full and regular activities at the school in question. PMID:26982958
A confirmative clinimetric analysis of the 36-item Family Assessment Device.
Timmerby, Nina; Cosci, Fiammetta; Watson, Maggie; Csillag, Claudio; Schmitt, Florence; Steck, Barbara; Bech, Per; Thastum, Mikael
2018-02-07
The Family Assessment Device (FAD) is a 60-item questionnaire widely used to evaluate self-reported family functioning. However, the factor structure as well as the number of items has been questioned. A shorter and more user-friendly version of the original FAD-scale, the 36-item FAD, has therefore previously been proposed, based on findings in a nonclinical population of adults. We aimed in this study to evaluate the brief 36-item version of the FAD in a clinical population. Data from a European multinational study, examining factors associated with levels of family functioning in adult cancer patients' families, were used. Both healthy and ill parents completed the 60-item version FAD. The psychometric analyses conducted were Principal Component Analysis and Mokken-analysis. A total of 564 participants were included. Based on the psychometric analysis we confirmed that the 36-item version of the FAD has robust psychometric properties and can be used in clinical populations. The present analysis confirmed that the 36-item version of the FAD (18 items assessing 'well-being' and 18 items assessing 'dysfunctional' family function) is a brief scale where the summed total score is a valid measure of the dimensions of family functioning. This shorter version of the FAD is, in accordance with the concept of 'measurement-based care', an easy to use scale that could be considered when the aim is to evaluate self-reported family functioning.
Forslin, Mia; Kottorp, Anders; Kierkegaard, Marie; Johansson, Sverker
2016-11-11
To translate and culturally adapt the Acceptance of Chronic Health Conditions (ACHC) Scale for people with multiple sclerosis into Swedish, and to analyse the psychometric properties of the Swedish version. Ten people with multiple sclerosis participated in translation and cultural adaptation of the ACHC Scale; 148 people with multiple sclerosis were included in evaluation of the psychometric properties of the scale. Translation and cultural adaptation were carried out through translation and back-translation, by expert committee evaluation and pre-test with cognitive interviews in people with multiple sclerosis. The psychometric properties of the Swedish version were evaluated using Rasch analysis. The Swedish version of the ACHC Scale was an acceptable equivalent to the original version. Seven of the original 10 items fitted the Rasch model and demonstrated ability to separate between groups. A 5-item version, including 2 items and 3 super-items, demonstrated better psychometric properties, but lower ability to separate between groups. The Swedish version of the ACHC Scale with the original 10 items did not fit the Rasch model. Two solutions, either with 7 items (ACHC-7) or with 2 items and 3 super-items (ACHC-5), demonstrated acceptable psychometric properties. Use of the ACHC-5 Scale with super-items is recommended, since this solution adjusts for local dependency among items.
Henry, Beverly W; Smith, Thomas J; Ahmad, Saadia
2014-05-01
To assess parents' perspectives of their home environments to establish the validity of scores from the Behavior and Attitudes Questionnaire for Healthy Habits (BAQ-HH). In the present descriptive study, we surveyed a cross-sectional sample of parents of pre-school children. Questionnaire items developed in an iterative process with community-based programming addressed parents' knowledge/awareness, attitudes/concerns and behaviours about healthy foods and physical activity habits with 6-point rating scales. Exploratory and confirmatory factor analyses were used to psychometrically evaluate scores from the scales. English and Spanish versions of the BAQ-HH were administered at parent-teacher conferences for pre-school children at ten Head Start centres across a five-county agency in autumn 2010. From 672 families with pre-school children, 532 parents provided responses to the BAQ-HH (79 % response rate). The majority was female (83 %), Hispanic (66 %) or white (16 %), and ages ranged from 20 to 39 years (85 %). Exploratory and confirmatory analyses revealed a knowledge scale (seven items), an attitude scale (four items) and three behaviour subscales (three items each). Correlations were identified between parents' perceptions of home activities and reports of children's habits. Differences were identified by gender and ethnicity groupings. As a first step in psychometric testing, the dimensionality of each of the three scales (Knowledge, Attitudes and Behaviours) was identified and scale scores were related to other indicators of child behaviours and parents' demographic characteristics. This questionnaire offers a method to measure parents' views to inform planning and monitoring of obesity-prevention education programmes.
Crins, Martine H P; Terwee, Caroline B; Klausch, Thomas; Smits, Niels; de Vet, Henrica C W; Westhovens, Rene; Cella, David; Cook, Karon F; Revicki, Dennis A; van Leeuwen, Jaap; Boers, Maarten; Dekker, Joost; Roorda, Leo D
2017-07-01
The objective of this study was to assess the psychometric properties of the Dutch-Flemish Patient-Reported Outcomes Measurement Information System (PROMIS) Physical Function item bank in Dutch patients with chronic pain. A bank of 121 items was administered to 1,247 Dutch patients with chronic pain. Unidimensionality was assessed by fitting a one-factor confirmatory factor analysis and evaluating resulting fit statistics. Items were calibrated with the graded response model and its fit was evaluated. Cross-cultural validity was assessed by testing items for differential item functioning (DIF) based on language (Dutch vs. English). Construct validity was evaluated by calculation correlations between scores on the Dutch-Flemish PROMIS Physical Function measure and scores on generic and disease-specific measures. Results supported the Dutch-Flemish PROMIS Physical Function item bank's unidimensionality (Comparative Fit Index = 0.976, Tucker Lewis Index = 0.976) and model fit. Item thresholds targeted a wide range of physical function construct (threshold-parameters range: -4.2 to 5.6). Cross-cultural validity was good as four items only showed DIF for language and their impact on item scores was minimal. Physical Function scores were strongly associated with scores on all other measures (all correlations ≤ -0.60 as expected). The Dutch-Flemish PROMIS Physical Function item bank exhibited good psychometric properties. Development of a computer adaptive test based on the large bank is warranted. Copyright © 2017 Elsevier Inc. All rights reserved.
Huang, Wenhao; Chapman-Novakofski, Karen M
2017-01-01
Background The extensive availability and increasing use of mobile apps for nutrition-based health interventions makes evaluation of the quality of these apps crucial for integration of apps into nutritional counseling. Objective The goal of this research was the development, validation, and reliability testing of the app quality evaluation (AQEL) tool, an instrument for evaluating apps’ educational quality and technical functionality. Methods Items for evaluating app quality were adapted from website evaluations, with additional items added to evaluate the specific characteristics of apps, resulting in 79 initial items. Expert panels of nutrition and technology professionals and app users reviewed items for face and content validation. After recommended revisions, nutrition experts completed a second AQEL review to ensure clarity. On the basis of 150 sets of responses using the revised AQEL, principal component analysis was completed, reducing AQEL into 5 factors that underwent reliability testing, including internal consistency, split-half reliability, test-retest reliability, and interrater reliability (IRR). Two additional modifiable constructs for evaluating apps based on the age and needs of the target audience as selected by the evaluator were also tested for construct reliability. IRR testing using intraclass correlations (ICC) with all 7 constructs was conducted, with 15 dietitians evaluating one app. Results Development and validation resulted in the 51-item AQEL. These were reduced to 25 items in 5 factors after principal component analysis, plus 9 modifiable items in two constructs that were not included in principal component analysis. Internal consistency and split-half reliability of the following constructs derived from principal components analysis was good (Cronbach alpha >.80, Spearman-Brown coefficient >.80): behavior change potential, support of knowledge acquisition, app function, and skill development. App purpose split half-reliability was .65. Test-retest reliability showed no significant change over time (P>.05) for all but skill development (P=.001). Construct reliability was good for items assessing age appropriateness of apps for children, teens, and a general audience. In addition, construct reliability was acceptable for assessing app appropriateness for various target audiences (Cronbach alpha >.70). For the 5 main factors, ICC (1,k) was >.80, with a P value of <.05. When 15 nutrition professionals evaluated one app, ICC (2,15) was .98, with a P value of <.001 for all 7 constructs when the modifiable items were specified for adults seeking weight loss support. Conclusions Our preliminary effort shows that AQEL is a valid, reliable instrument for evaluating nutrition apps’ qualities for clinical interventions by nutrition clinicians, educators, and researchers. Further efforts in validating AQEL in various contexts are needed. PMID:29079554
Bjorner, Jakob Bue; Pejtersen, Jan Hyld
2010-02-01
To evaluate the construct validity of the Copenhagen Psychosocial Questionnaire II (COPSOQ II) by means of tests for differential item functioning (DIF) and differential item effect (DIE). We used a Danish general population postal survey (n = 4,732 with 3,517 wage earners) with a one-year register based follow up for long-term sickness absence. DIF was evaluated against age, gender, education, social class, public/private sector employment, and job type using ordinal logistic regression. DIE was evaluated against job satisfaction and self-rated health (using ordinal logistic regression), against depressive symptoms, burnout, and stress (using multiple linear regression), and against long-term sick leave (using a proportional hazards model). We used a cross-validation approach to counter the risk of significant results due to multiple testing. Out of 1,052 tests, we found 599 significant instances of DIF/DIE, 69 of which showed both practical and statistical significance across two independent samples. Most DIF occurred for job type (in 20 cases), while we found little DIF for age, gender, education, social class and sector. DIE seemed to pertain to particular items, which showed DIE in the same direction for several outcome variables. The results allowed a preliminary identification of items that have a positive impact on construct validity and items that have negative impact on construct validity. These results can be used to develop better shortform measures and to improve the conceptual framework, items and scales of the COPSOQ II. We conclude that tests of DIF and DIE are useful for evaluating construct validity.
Earned print media in advancing tobacco control in Himachal Pradesh, India: a descriptive study.
Sharma, Renu; Shewade, Hemant Deepak; Gopalan, Balasubramaniam; Badrel, Ramesh Kumar; Rana, Jugdeep Singh
2017-01-01
The Union-Bloomberg Initiative tobacco control projects were implemented in Himachal Pradesh (a hilly state in North India) from 2007 to 2014. The project focused on the establishment of an administrative framework; increasing the capacity of stakeholders; enforcement of legislation; coalition and networking with multiple stakeholders; awareness generation with focus on earned media and monitoring and evaluation with policy-focussed research. This study aimed to systematically analyse all earned print news items related to the projects. In this cross-sectional descriptive study, quantitative content analysis of earned print news items was carried out using predetermined codes related to areas of tobacco control policies. We also carried out a cost description of the hypothetical value of this earned media. The area of the news item in cm 2 was multiplied by the average rate of space for the paid news item in that particular newspaper. There were 6348 news items: the numbers steadily increased with time. Focus on Monitoring tobacco use, Protecting people from tobacco smoke, Offering help to quit, Warning about dangers of tobacco, Enforcing a ban on tobacco advertising and promotion, Raising tax on tobacco products was seen in 24, 17, 9, 23, 22 and 3% of news items, respectively. Press releases were highest at 44% and report by correspondents at 24%. Further, 55, 23 and 21% news items focused on smoking, smokeless and both forms of tobacco use, respectively. Sixty-six per cent and 34% news items, respectively, were focused on youth and women. The news items had a hypothetical value of US$1503 628.3, which was three times more than the funds spent on all project activities. In the absence of funding for paid media, the project strategically used earned media to promote tobacco control policies in the state.
2013-01-01
Background Though several questionnaires on self-care and regimen adherence have been introduced, the evaluations do not always report consistent and substantial correlations with measures of glycaemic control. Small ability to explain variance in HbA1c constitutes a significant limitation of an instrument’s use for scientific purposes as well as clinical practice. In order to assess self-care activities which can predict glycaemic control, the Diabetes Self-Management Questionnaire (DSMQ) was designed. Methods A 16 item questionnaire to assess self-care activities associated with glycaemic control was developed, based on theoretical considerations and a process of empirical improvements. Four subscales, ‘Glucose Management’ (GM), ‘Dietary Control’ (DC), ‘Physical Activity’ (PA), and ‘Health-Care Use’ (HU), as well as a ‘Sum Scale’ (SS) as a global measure of self-care were derived. To evaluate its psychometric quality, 261 patients with type 1 or 2 diabetes were assessed with the DSMQ and an established analogous scale, the Summary of Diabetes Self-Care Activities Measure (SDSCA). The DSMQ’s item and scale characteristics as well as factorial and convergent validity were analysed, and its convergence with HbA1c was compared to the SDSCA. Results The items showed appropriate characteristics (mean item-total-correlation: 0.46 ± 0.12; mean correlation with HbA1c: -0.23 ± 0.09). Overall internal consistency (Cronbach’s alpha) was good (0.84), consistencies of the subscales were acceptable (GM: 0.77; DC: 0.77; PA: 0.76; HU: 0.60). Principal component analysis indicated a four factor structure and confirmed the designed scale structure. Confirmatory factor analysis indicated appropriate fit of the four factor model. The DSMQ scales showed significant convergent correlations with their parallel SDSCA scales (GM: 0.57; DC: 0.52; PA: 0.58; HU: n/a; SS: 0.57) and HbA1c (GM: -0.39; DC: -0.30; PA: -0.15; HU: -0.22; SS: -0.40). All correlations with HbA1c were significantly stronger than those obtained with the SDSCA. Conclusions This study provides preliminary evidence that the DSMQ is a reliable and valid instrument and enables an efficient assessment of self-care behaviours associated with glycaemic control. The questionnaire should be valuable for scientific analyses as well as clinical use in both type 1 and type 2 diabetes patients. PMID:23937988
Bookbinder, Marilyn; Hugodot, Amandine; Freeman, Katherine; Homel, Peter; Santiago, Elisabeth; Riggs, Alexa; Gavin, Maggie; Chu, Alice; Brady, Ellen; Lesage, Pauline; Portenoy, Russell K
2018-02-01
Quality improvement in end-of-life care generally acquires data from charts or caregivers. "Tracer" methodology, which assesses real-time information from multiple sources, may provide complementary information. The objective of this study was to develop a valid brief audit tool that can guide assessment and rate care when used in a clinician tracer to evaluate the quality of care for the dying patient. To identify items for a brief audit tool, 248 items were created to evaluate overall quality, quality in specific content areas (e.g., symptom management), and specific practices. Collected into three instruments, these items were used to interview professional caregivers and evaluate the charts of hospitalized patients who died. Evidence that this information could be validly captured using a small number of items was obtained through factor analyses, canonical correlations, and group comparisons. A nurse manager field tested tracer methodology using candidate items to evaluate the care provided to other patients who died. The survey of 145 deaths provided chart data and data from 445 interviews (26 physicians, 108 nurses, 18 social workers, and nine chaplains). The analyses yielded evidence of construct validity for a small number of items, demonstrating significant correlations between these items and content areas identified as latent variables in factor analyses. Criterion validity was suggested by significant differences in the ratings on these items between the palliative care unit and other units. The field test evaluated 127 deaths, demonstrated the feasibility of tracer methodology, and informed reworking of the candidate items into the 14-item Tracer EoLC v1. The Tracer EoLC v1 can be used with tracer methodology to guide the assessment and rate the quality of end-of-life care. Copyright © 2017 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.
Assessing psychological well-being: self-report instruments for the NIH Toolbox.
Salsman, John M; Lai, Jin-Shei; Hendrie, Hugh C; Butt, Zeeshan; Zill, Nicholas; Pilkonis, Paul A; Peterson, Christopher; Stoney, Catherine M; Brouwers, Pim; Cella, David
2014-02-01
Psychological well-being (PWB) has a significant relationship with physical and mental health. As a part of the NIH Toolbox for the Assessment of Neurological and Behavioral Function, we developed self-report item banks and short forms to assess PWB. Expert feedback and literature review informed the selection of PWB concepts and the development of item pools for positive affect, life satisfaction, and meaning and purpose. Items were tested with a community-dwelling US Internet panel sample of adults aged 18 and above (N = 552). Classical and item response theory (IRT) approaches were used to evaluate unidimensionality, fit of items to the overall measure, and calibrations of those items, including differential item function (DIF). IRT-calibrated item banks were produced for positive affect (34 items), life satisfaction (16 items), and meaning and purpose (18 items). Their psychometric properties were supported based on the results of factor analysis, fit statistics, and DIF evaluation. All banks measured the concepts precisely (reliability ≥0.90) for more than 98% of participants. These adult scales and item banks for PWB provide the flexibility, efficiency, and precision necessary to promote future epidemiological, observational, and intervention research on the relationship of PWB with physical and mental health.
Assessing the Evaluative Content of Personality Questionnaires Using Bifactor Models.
Biderman, Michael D; McAbee, Samuel T; Job Chen, Zhuo; Hendy, Nhung T
2018-01-01
Exploratory bifactor models with keying factors were applied to item response data for the NEO-FFI-3 and HEXACO-PI-R questionnaires. Loadings on a general factor and positive and negative keying factors correlated with independent estimates of item valence, suggesting that item valence influences responses to these questionnaires. Correlations between personality domain scores and measures of self-esteem, depression, and positive and negative affect were all reduced significantly when the influence of evaluative content represented by the general and keying factors was removed. Findings support the need to model personality inventories in ways that capture reactions to evaluative item content.
Primacy Versus Recency in a Quantitative Model: Activity Is the Critical Distinction
Greene, Anthony J.; Prepscius, Colin; Levy, William B.
2000-01-01
Behavioral and neurobiological evidence shows that primacy and recency are subserved by memory systems for intermediate- and short-term memory, respectively. A widely accepted explanation of recency is that in short-term memory, new learning overwrites old learning. Primacy is not as well understood, but many hypotheses contend that initial items are better encoded into long-term memory because they have had more opportunity to be rehearsed. A simple, biologically motivated neural network model supports an alternative hypothesis of the distinct processing requirements for primacy and recency given single-trial learning without rehearsal. Simulations of the model exhibit either primacy or recency, but not both simultaneously. The incompatibility of primacy and recency clarifies possible reasons for two neurologically distinct systems. Inhibition, and its control of activity, determines those list items that are acquired and retained. Activity levels that are too low do not provide sufficient connections for learning to occur, while higher activity diminishes capacity. High recurrent inhibition, and progressively diminishing activity, allows acquisition and retention of early items, while later items are never acquired. Conversely, low recurrent inhibition, and the resulting high activity, allows continuous acquisition such that acquisition of later items eventually interferes with the retention of early items. PMID:10706602
Cappelleri, J C; Althof, S E; Siegel, R L; Shpilsky, A; Bell, S S; Duttagupta, S
2004-02-01
Development and validation of a patient-reported measure of psychosocial variables in men with erectile dysfunction (ED) is described. Literature review, focus groups, and medical specialists identified 86 potential items. Redundant, ambiguous, or low item-to-total correlation items were removed. Data from 98 men reporting diagnosed ED and 94 controls assisted in final item selection and psychometric evaluation. Treatment responsiveness was evaluated in 93 men with ED in a 10-week open-label trial of sildenafil citrate (Viagra). The 14 chosen items resolved into two domains: Sexual Relationship (eight items) and Confidence (six items), the latter comprising Self-Esteem (four items) and Overall Relationship (two items) subscales. The resulting Self-Esteem And Relationship (SEAR) questionnaire demonstrated validity and reliability. The intervention study demonstrated responsiveness to beneficial treatment with significant improvement in scores (P=0.0001). The SEAR questionnaire possesses strong psychometric properties that support its validity and reliability for measuring sexual relationship, confidence, and particularly self-esteem.
Developing an Assessment Method of Active Aging: University of Jyvaskyla Active Aging Scale.
Rantanen, Taina; Portegijs, Erja; Kokko, Katja; Rantakokko, Merja; Törmäkangas, Timo; Saajanaho, Milla
2018-01-01
To develop an assessment method of active aging for research on older people. A multiphase process that included drafting by an expert panel, a pilot study for item analysis and scale validity, a feedback study with focus groups and questionnaire respondents, and a test-retest study. Altogether 235 people aged 60 to 94 years provided responses and/or feedback. We developed a 17-item University of Jyvaskyla Active Aging Scale with four aspects in each item (goals, ability, opportunity, and activity; range 0-272). The psychometric and item properties are good and the scale assesses a unidimensional latent construct of active aging. Our scale assesses older people's striving for well-being through activities pertaining to their goals, abilities, and opportunities. The University of Jyvaskyla Active Aging Scale provides a quantifiable measure of active aging that may be used in postal questionnaires or interviews in research and practice.
Hakimian, Pantea; Lak, Azadeh
2016-01-01
Background: In spite of the increased range of inactivity and obesity among Iranian adults, insufficient research has been done on environmental factors influencing physical activity. As a result adapting a subjective (self-report) measurement tool for assessment of physical environment in Iran is critical. Accordingly, in this study Neighborhood Environment Walkability Scale (NEWS) was adapted for Iran and also its reliability was evaluated. Methods: This study was conducted using a systematic adaptation method consisting of 3 steps: translate-back translation procedures, revision by a multidisciplinary panel of local experts and a cognitive study. Then NEWS-Iran was completed among adults aged 18 to 65 years (N=19) with an interval of 15 days. Intra-Class Coefficient (ICC) was used to evaluate the reliability of the adapted questionnaire. Results: NEWS-Iran is an adapted version of NEWS-A (abbreviated) and in the adaptation process five items were added from other versions of NEWS, two subscales were significantly modified for a shorter and more effective questionnaire, and five new items were added about climate factors and site-specific uses. NEWS-Iran showed almost perfect reliability (ICCs: more than 0.8) for all subscales, with items having moderate to almost perfect reliability scores (ICCs: 0.56-0.96). Conclusion: This study introduced NEWS-Iran, which is a reliable version of NEWS for measuring environmental perceptions related to physical activity behavior adapted for Iran. It is the first adapted version of NEWS which demonstrates a systematic adaptation process used by earlier studies. It can be used for other developing countries with similar environmental, social and cultural context. PMID:28210592
Hakimian, Pantea; Lak, Azadeh
2016-01-01
Background: In spite of the increased range of inactivity and obesity among Iranian adults, insufficient research has been done on environmental factors influencing physical activity. As a result adapting a subjective (self-report) measurement tool for assessment of physical environment in Iran is critical. Accordingly, in this study Neighborhood Environment Walkability Scale (NEWS) was adapted for Iran and also its reliability was evaluated. Methods: This study was conducted using a systematic adaptation method consisting of 3 steps: translate-back translation procedures, revision by a multidisciplinary panel of local experts and a cognitive study. Then NEWS-Iran was completed among adults aged 18 to 65 years (N=19) with an interval of 15 days. Intra-Class Coefficient (ICC) was used to evaluate the reliability of the adapted questionnaire. Results: NEWS-Iran is an adapted version of NEWS-A (abbreviated) and in the adaptation process five items were added from other versions of NEWS, two subscales were significantly modified for a shorter and more effective questionnaire, and five new items were added about climate factors and site-specific uses. NEWS-Iran showed almost perfect reliability (ICCs: more than 0.8) for all subscales, with items having moderate to almost perfect reliability scores (ICCs: 0.56-0.96). Conclusion: This study introduced NEWS-Iran, which is a reliable version of NEWS for measuring environmental perceptions related to physical activity behavior adapted for Iran. It is the first adapted version of NEWS which demonstrates a systematic adaptation process used by earlier studies. It can be used for other developing countries with similar environmental, social and cultural context.
São-João, Thaís Moreira; Rodrigues, Roberta Cunha Matheus; Gallani, Maria Cecilia Bueno Jayme; Miura, Cinthya Tamie de Passos; Domingues, Gabriela de Barros Leite; Godin, Gaston
2013-06-01
To conduct the cultural adaptation of the Brazilian version of the Godin-Shephard Leisure-Time Physical Activity Questionnaire (GSLTPAQ) and to assess its content validity, practicability, acceptability and reliability. The stages of translation, synthesis, back translation, expert committee review and pre-test were carried out, followed by the evaluation of the practicability, acceptability and reliability (test-retest). The judges assessed its semantic, idiomatic, conceptual, cultural and metabolic equivalences. The adapted version was submitted to the pre-test (n = 20), and test-retest (n = 80), in healthy individuals and in those suffering from cardiovascular disease in Limeira, SP, Southeastern Brazil, between 2010 and 2011. The proportion of agreement of the committee of judges was assessed using the Content Validity Index. Reliability was assessed by the criterion of stability, with 15 days between applications. Practicability was evaluated by the time spent interviewing and acceptability was estimated as the percentage of unanswered items and the proportion of patients who responded to all items. The translated version of the questionnaire showed evidence of appropriate semantic-idiomatic, conceptual, cultural and metabolic equivalence, with substitutions of several physical activities more appropriate to the Brazilian population. The practicability analysis showed short time needed for the application of the instrument (mean 3.0 minutes). As for acceptability, all patients answered 100% of the items. The test-retest analysis suggested that stability was good (Intraclass Correlation Coefficient value of 0.84). The Brazilian version of the questionnaire showed satisfactory measures of the qualities in question. Its application to diverse populations in future studies is recommended in order to provide robust measures of these qualities.
Developing Item Response Theory-Based Short Forms to Measure the Social Impact of Burn Injuries.
Marino, Molly E; Dore, Emily C; Ni, Pengsheng; Ryan, Colleen M; Schneider, Jeffrey C; Acton, Amy; Jette, Alan M; Kazis, Lewis E
2018-03-01
To develop self-reported short forms for the Life Impact Burn Recovery Evaluation (LIBRE) Profile. Short forms based on the item parameters of discrimination and average difficulty. A support network for burn survivors, peer support networks, social media, and mailings. Burn survivors (N=601) older than 18 years. Not applicable. The LIBRE Profile. Ten-item short forms were developed to cover the 6 LIBRE Profile scales: Relationships with Family & Friends, Social Interactions, Social Activities, Work & Employment, Romantic Relationships, and Sexual Relationships. Ceiling effects were ≤15% for all scales; floor effects were <1% for all scales. The marginal reliability of the short forms ranged from .85 to .89. The LIBRE Profile-Short Forms demonstrated credible psychometric properties. The short form version provides a viable alternative to administering the LIBRE Profile when resources do not allow computer or Internet access. The full item bank, computerized adaptive test, and short forms are all scored along the same metric, and therefore scores are comparable regardless of the mode of administration. Copyright © 2017 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Construction of an efficient evaluative instrument for myasthenia gravis: the MG composite.
Burns, Ted M; Conaway, Mark R; Cutter, Gary R; Sanders, Donald B
2008-12-01
We assessed the performance of items from the Quantitative Myasthenia Gravis (QMG), MMT (Manual Muscle Test), and MG-ADL (Myasthenia Gravis - Activities of Daily Living) scales, using data from two recently completed treatment trials of generalized MG. Items were selected that were relevant to manifestations of MG, meaningful to both the physician and the patient, and responsive to clinical change. After the 10 items were chosen, they were weighted based on input from MG experts from around the world, considering factors such as quality of life, disease severity, risk, prognosis, validity, and reliability. The MG Composite is easy to administer, takes less than 5 minutes to complete, and requires no equipment. Weighting of the response options of the 10 items should result in ordinal scores that better represent MG status and are more responsive to meaningful clinical change. To better determine its suitability for clinical use and for treatment trials, the MG Composite will be tested prospectively at several academic medical centers and will be used as a secondary measure of efficacy in pending clinical trials of MG.
Consensus on Quality Indicators of Postgraduate Medical E-Learning: Delphi Study.
de Leeuw, Robert Adrianus; Walsh, Kieran; Westerman, Michiel; Scheele, Fedde
2018-04-26
The progressive use of e-learning in postgraduate medical education calls for useful quality indicators. Many evaluation tools exist. However, these are diversely used and their empirical foundation is often lacking. We aimed to identify an empirically founded set of quality indicators to set the bar for “good enough” e-learning. We performed a Delphi procedure with a group of 13 international education experts and 10 experienced users of e-learning. The questionnaire started with 57 items. These items were the result of a previous literature review and focus group study performed with experts and users. Consensus was met when a rate of agreement of more than two-thirds was achieved. In the first round, the participants accepted 37 items of the 57 as important, reached no consensus on 20, and added 15 new items. In the second round, we added the comments from the first round to the items on which there was no consensus and added the 15 new items. After this round, a total of 72 items were addressed and, of these, 37 items were accepted and 34 were rejected due to lack of consensus. This study produced a list of 37 items that can form the basis of an evaluation tool to evaluate postgraduate medical e-learning. This is, to our knowledge, the first time that quality indicators for postgraduate medical e-learning have been defined and validated. The next step is to create and validate an e-learning evaluation tool from these items. ©Robert Adrianus de Leeuw, Kieran Walsh, Michiel Westerman, Fedde Scheele. Originally published in JMIR Medical Education (http://mededu.jmir.org), 26.04.2018.
Swanson, Mark W; Bodner, Eric; Sawyer, Patricia; Allman, Richard
2013-01-01
Little is known about the affect of reduced vision on physical activity in older adults. This study evaluates the association of visual acuity level, self-reported vision and ocular disease conditions with leisure-time physical activity and calculated caloric expenditure. A cross sectional study of 911 subjects 65 yr and older from the University of Alabama at Birmingham Study of Aging (SOA) cohort was conducted evaluating the association of vision-related variables to weekly kilocalorie expenditure calculated from the 17-item Leisure Time Physical Activity Questionnaire. Ordinal logistic regression was used to evaluate possible associations controlling for potential confounders. In multivariate analyses, each lower step in visual acuity category below 20/50 was significantly associated with reduced odds of having a higher level of physical activity OR 0.81, 95% CI 0.67, 0.97. Reduced visual acuity appears to be independently associated with lower levels of physical activity among community-dwelling adults. PMID:21945888
Correlates of energy intake and body mass index among homeless children in Minnesota.
Richards, Rickelle; Smith, Chery; Eggett, Dennis L
2013-06-01
This study evaluated environmental, personal, and behavioral correlates of BMI-for-age percentiles, dietary intake (kilocalories, carbohydrates, protein, fat, and Food Guide Pyramid food groups), and physical activity variables among homeless children. A 74-item survey, using social cognitive theory as the theoretical framework, height, weight, and one 24-hour recall were collected from homeless children aged 9-13 (n=159) at two shelters in Minneapolis, MN. Principal component analysis was performed on the subsections of the survey. Independent t-tests, Fisher exact tests, and chi-squared statistics evaluated sociodemographic and BMI percentile variables. Nonparametric tests evaluated dietary data. Stepwise regression models evaluated correlates of BMI percentiles, physical activity, and dietary intake variables. Approximately 45% were overweight or obese (≥85(th) percentile). Dietary data represented intake on a given day, with children consuming a median 1.2 servings from the fruits and vegetables food group, 17.3 servings from the fats and sweets food group (one serving=grams in 1 Tbsp. fat/1 tsp. sugar), and the percent of calories from fat varying significantly between shelter 1 (S1) versus shelter 2 (S2) boys (37.1% vs. 31.7%, p<0.001). Factors identified from survey items and sociodemographic variables accounted for between 6% and 14% of the variance in energy intake and other dietary and physical activity variables (p range, 0.008 to <0.001). Parental role modeling of eating behaviors and getting enough food were associated with less favorable food choices among homeless children. Policy interventions and program initiatives in the homeless environment could promote healthier food choices among children.
van der Maas, Nico Arie
2017-03-16
The Multiple Sclerosis Questionnaire for Physical Therapists (MSQPT) is a patient-rated outcome questionnaire for evaluating the rehabilitation of persons with multiple sclerosis (MS). Responsiveness was evaluated, and minimal important difference (MID) estimates were calculated to provide thresholds for clinical change for four items, three sections and the total score of the MSQPT. This multicentre study used a combined distribution- and anchor-based approach with multiple anchors and multiple rating of change questions. Responsiveness was evaluated using effect size, standardized response mean (SRM), modified SRM and relative efficiency. For distribution-based MID estimates, 0.2 and 0.33 standard deviations (SD), standard error of measurement (SEM) and minimal detectable change were used . Triangulation of anchor- and distribution-based MID estimates provided a range of MID values for each of the four items, the three sections and the total score of the MSQPT. The MID values were tested for their sensitivity and specificity for amelioration and deterioration for each of the four items, the three sections and the total score of the MSQPT. The MID values of each item and section and of the total score with the best sensitivity and specificity were selected as thresholds for clinical change. The outcome measures were the MSQPT, Hamburg Quality of Life Questionnaire for Multiple Sclerosis (HAQUAMS), rating of change questionnaires, Expanded Disability Status Scale, 6-metre timed walking test, Berg Balance Scale and 6-minute walking test. The effect size ranged from 0.46 to 1.49. The SRM data showed comparable results. The modified SRM ranged from 0.00 to 0.60. Anchor-based MID estimates were very low and were comparable with SD- and SEM-based estimates. The MSQPT was more responsive than the HAQUAMS in detecting improvement but less responsive in finding deterioration. The best MID estimates of the items, sections and total score, expressed in percentage of their maximum score, were between 5.4% (activity) and 22% (item 10) change for improvement and between 5.7% (total score) and 22% (item 10) change for deterioration. The MSQPT is a responsive questionnaire with an adequate MID that may be used as threshold for change during rehabilitation of MS patients. This trial was retrospectively (01/24/2015) registered in ClinicalTrials.gov as NCT02346279.
Sidi, Avner; Gravenstein, Nikolaus; Vasilopoulos, Terrie; Lampotang, Samsun
2017-06-02
We describe observed improvements in nontechnical or "higher-order" deficiencies and cognitive performance skills in an anesthesia residency cohort for a 1-year time interval. Our main objectives were to evaluate higher-order, cognitive performance and to demonstrate that simulation can effectively serve as an assessment of cognitive skills and can help detect "higher-order" deficiencies, which are not as well identified through more traditional assessment tools. We hypothesized that simulation can identify longitudinal changes in cognitive skills and that cognitive performance deficiencies can then be remediated over time. We used 50 scenarios evaluating 35 residents during 2 subsequent years, and 18 of those 35 residents were evaluated in both years (post graduate years 3 then 4) in the same or similar scenarios. Individual basic knowledge and cognitive performance during simulation-based scenarios were assessed using a 20- to 27-item scenario-specific checklist. Items were labeled as basic knowledge/technical (lower-order cognition) or advanced cognitive/nontechnical (higher-order cognition). Identical or similar scenarios were repeated annually by a subset of 18 residents during 2 successive academic years. For every scenario and item, we calculated group error scenario rate (frequency) and individual (resident) item success. Grouped individuals' success rates are calculated as mean (SD), and item success grade and group error rates are calculated and presented as proportions. For all analyses, α level is 0.05. Overall PGY4 residents' error rates were lower and success rates higher for the cognitive items compared with technical item performance in the operating room and resuscitation domains. In all 3 clinical domains, the cognitive error rate by PGY4 residents was fairly low (0.00-0.22) and the cognitive success rate by PGY4 residents was high (0.83-1.00) and significantly better compared with previous annual assessments (P < 0.05). Overall, there was an annual decrease in error rates for 2 years, primarily driven by decreases in cognitive errors. The most commonly observed cognitive error types remained anchoring, availability bias, premature closure, and confirmation bias. Simulation-based assessments can highlight cognitive performance areas of relative strength, weakness, and progress in a resident or resident cohort. We believe that they can therefore be used to inform curriculum development including activities that require higher-level cognitive processing.
Using the Item Response Theory (IRT) for Educational Evaluation through Games
ERIC Educational Resources Information Center
Euzébio Batista, Marcelo Henrique; Victória Barbosa, Jorge Luis; da Rosa Tavares, João Elison; Hackenhaar, Jonathan Luis
2013-01-01
This article shows the application of Item Response Theory (IRT) for educational evaluation using games. The article proposes a computational model to create user profiles, called Psychometric Profile Generator (PPG). PPG uses the IRT mathematical model for exploring the levels of skills and behaviors in the form of items and/or stimuli. The model…
School Self-Evaluation Instruments and Cognitive Validity. Do Items Capture What They Intend to?
ERIC Educational Resources Information Center
Faddar, Jerich; Vanhoof, Jan; De Maeyer, Sven
2017-01-01
School self-evaluation (SSE) often makes use of questionnaires in order to sketch a picture of the school. How respondents cognitively process questionnaire items determines the validity of SSE results. Still, one readily assumes that respondents interpret and answer items as intended by the instrument developer (referred to as cognitive…
Pharmacy students' opinions of direct-to-consumer advertising: a pilot study at one university.
Harrington, Amanda R; Desselle, Shane P; Apgar, David A; Hesselbacher, Elizabeth; Pié, Aaron; Quesnel, Aimee; Warholak, Terri L
2013-01-01
Direct-to-consumer advertisement (DTCA) of prescription medications has become an important informational source for health care consumers. As future health care professionals on the front line of potential communication and dispensing of products emerging from DTCA, it is important to elicit the attitudes of student-pharmacists. This study aims to (1) evaluate the validity of the DTCA attitudinal questionnaire using Rasch rating scale analysis and (2) investigate the attitudes of pharmacy students toward DTCA and determine whether these attitudes were associated with years of pharmacy education and demographic characteristics. This investigation used a cross-sectional print-based questionnaire to evaluate the attitudes of pharmacy students toward DTCA of prescription medications. The 16-item questionnaire included items addressing the attitudes of pharmacy students toward DTCA with respect to patients' knowledge of medications, pharmacists' interaction with patients, and overall consumer judgment of medical prescriptions. Analyses included Rasch analysis and a multiple linear regression. A total of 243 students submitted usable questionnaires (85% response rate). Item response categories were collapsed from 5 categories to 3, and 4 items were removed to achieve acceptable Rasch model fit. Pharmacy students demonstrated little difficulty in agreeing with the statements suggesting that DTCA helps patients take a more active role in health care and had the most difficulty in agreeing with items suggesting that DTCA may lead to inappropriate prescribing to satisfy patient requests. Students' overall support for DTCA was the only variable that predicted the questionnaire score (P<.001). In conclusion, the Rasch analysis evaluated the psychometric properties of the instrument and identified the necessity to adapt the questionnaire from previous iterations to adequately fit the student population. Future research should examine factors that contribute to the variance in attitudes toward DTCA among a larger and more heterogeneous population. Copyright © 2013 Elsevier Inc. All rights reserved.
Galindo-Garre, Francisca; Hidalgo, María Dolores; Guilera, Georgina; Pino, Oscar; Rojo, J Emilio; Gómez-Benito, Juana
2015-03-01
The World Health Organization Disability Assessment Schedule II (WHO-DAS II) is a multidimensional instrument developed for measuring disability. It comprises six domains (getting around, self-care, getting along with others, life activities and participation in society). The main purpose of this paper is the evaluation of the psychometric properties for each domain of the WHO-DAS II with parametric and non-parametric Item Response Theory (IRT) models. A secondary objective is to assess whether the WHO-DAS II items within each domain form a hierarchy of invariantly ordered severity indicators of disability. A sample of 352 patients with a schizophrenia spectrum disorder is used in this study. The 36 items WHO-DAS II was administered during the consultation. Partial Credit and Mokken scale models are used to study the psychometric properties of the questionnaire. The psychometric properties of the WHO-DAS II scale are satisfactory for all the domains. However, we identify a few items that do not discriminate satisfactorily between different levels of disability and cannot be invariantly ordered in the scale. In conclusion the WHO-DAS II can be used to assess overall disability in patients with schizophrenia, but some domains are too general to assess functionality in these patients because they contain items that are not applicable to this pathology. Copyright © 2014 John Wiley & Sons, Ltd.
Stetson, Barbara; Schlundt, David; Rothschild, Chelsea; Floyd, Jennifer E; Rogers, Whitney; Mokshagundam, Sri Prakash
2011-03-01
To develop and evaluate the validity and reliability of The Personal Diabetes Questionnaire (PDQ), a brief, yet comprehensive measure of diabetes self-care behaviors, perceptions and barriers. To examine individual items to provide descriptive and normative information and provide data on scale reliability and associations between PDQ scales and concurrently assessed HBA(1c) and BMI. Items were written to address nutritional management, medication utilization, blood glucose monitoring, and physical activity. The initial instrument was reviewed by multidisciplinary diabetes care providers and items subsequently revised until the measure provided complete coverage of the diabetes care domains using as few items as possible. The scoring scheme was generated rationally. Subjects were 790 adults (205 with type 1 and 585 with type 2 diabetes) who completed the PDQ while waiting for clinic appointments. Item completion rates were high, with few items skipped by participants. Subscales demonstrated good internal consistency (Cronbach α=.650-.834) and demonstrated significant associations with BMI (p ≤.001) and HbA(1c) (p ≤.001). The PDQ is a useful measure of diabetes self-care behaviors and related perceptions and barriers that is reliable and valid and feasible to administer in a clinic setting. This measure may be used to obtain data for assessing diabetes self-management and barriers and to guide patient care. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.
Torrejón, Antonio; Oltra, Lorena; Hernández-Sampelayo, Paloma; Marín, Laura; García-Sánchez, Valle; Casellas, Francesc; Alfaro, Noelia; Lázaro, Pablo; Vera, María Isabel
2013-01-01
nursing management of inflammatory bowel disease (IBD) is highly relevant for patient care and outcomes. However, there is evidence of substantial variability in clinical practices. The objectives of this study were to develop standards of healthcare quality for nursing management of IBD and elaborate the evaluation tool "Nursing Care Quality in IBD Assessment" (NCQ-IBD) based on these standards. a 178-item healthcare quality questionnaire was developed based on a systematic review of IBD nursing management literature. The questionnaire was used to perform two 2-round Delphi studies: Delphi A included 27 IBD healthcare professionals and Delphi B involved 12 patients. The NCQ-IBD was developed from the list of items resulting from both Delphi studies combined with the Scientific Committee´s expert opinion. the final NCQ-IBD consists of 90 items, organized in13 sections measuring the following aspects of nursing management of IBD: infrastructure, services, human resources, type of organization, nursing responsibilities, nurse-provided information to the patient, nurses training, annual audits of nursing activities, and nursing research in IBD. Using the NCQ-IBD to evaluate these components allows the rating of healthcare quality for nursing management of IBD into 4 categories: A (highest quality) through D (lowest quality). the use of the NCQ-IBD tool to evaluate nursing management quality of IBD identifies areas in need of improvement and thus contribute to an enhancement of care quality and reduction in clinical practice variations.
Distractor devaluation requires visual working memory.
Goolsby, Brian A; Shapiro, Kimron L; Raymond, Jane E
2009-02-01
Visual stimuli seen previously as distractors in a visual search task are subsequently evaluated more negatively than those seen as targets. An attentional inhibition account for this distractor-devaluation effect posits that associative links between attentional inhibition and to-be-ignored stimuli are established during search, stored, and then later reinstantiated, implying that distractor devaluation may require visual working memory (WM) resources. To assess this, we measured distractor devaluation with and without a concurrent visual WM load. Participants viewed a memory array, performed a simple search task, evaluated one of the search items (or a novel item), and then viewed a memory test array. Although distractor devaluation was observed with low (and no) WM load, it was absent when WM load was increased. This result supports the notions that active association of current attentional states with stimuli requires WM and that memory for these associations plays a role in affective response.
Kayser, Lars; Karnoe, Astrid; Furstrand, Dorthe; Batterham, Roy; Christensen, Karl Bang; Elsworth, Gerald; Osborne, Richard H
2018-02-12
For people to be able to access, understand, and benefit from the increasing digitalization of health services, it is critical that services are provided in a way that meets the user's needs, resources, and competence. The objective of the study was to develop a questionnaire that captures the 7-dimensional eHealth Literacy Framework (eHLF). Draft items were created in parallel in English and Danish. The items were generated from 450 statements collected during the conceptual development of eHLF. In all, 57 items (7 to 9 items per scale) were generated and adjusted after cognitive testing. Items were tested in 475 people recruited from settings in which the scale was intended to be used (community and health care settings) and including people with a range of chronic conditions. Measurement properties were assessed using approaches from item response theory (IRT) and classical test theory (CTT) such as confirmatory factor analysis (CFA) and reliability using composite scale reliability (CSR); potential bias due to age and sex was evaluated using differential item functioning (DIF). CFA confirmed the presence of the 7 a priori dimensions of eHLF. Following item analysis, a 35-item 7-scale questionnaire was constructed, covering (1) using technology to process health information (5 items, CSR=.84), (2) understanding of health concepts and language (5 items, CSR=.75), (3) ability to actively engage with digital services (5 items, CSR=.86), (4) feel safe and in control (5 items, CSR=.87), (5) motivated to engage with digital services (5 items, CSR=.84), (6) access to digital services that work (6 items, CSR=.77), and (7) digital services that suit individual needs (4 items, CSR=.85). A 7-factor CFA model, using small-variance priors for cross-loadings and residual correlations, had a satisfactory fit (posterior productive P value: .27, 95% CI for the difference between the observed and replicated chi-square values: -63.7 to 133.8). The CFA showed that all items loaded strongly on their respective factors. The IRT analysis showed that no items were found to have disordered thresholds. For most scales, discriminant validity was acceptable; however, 2 pairs of dimensions were highly correlated; dimensions 1 and 5 (r=.95), and dimensions 6 and 7 (r=.96). All dimensions were retained because of strong content differentiation and potential causal relationships between these dimensions. There is no evidence of DIF. The eHealth Literacy Questionnaire (eHLQ) is a multidimensional tool based on a well-defined a priori eHLF framework with robust properties. It has satisfactory evidence of construct validity and reliable measurement across a broad range of concepts (using both CTT and IRT traditions) in various groups. It is designed to be used to understand and evaluate people's interaction with digital health services. ©Lars Kayser, Astrid Karnoe, Dorthe Furstrand, Roy Batterham, Karl Bang Christensen, Gerald Elsworth, Richard H Osborne. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 12.02.2018.
A Review of Classical Methods of Item Analysis.
ERIC Educational Resources Information Center
French, Christine L.
Item analysis is a very important consideration in the test development process. It is a statistical procedure to analyze test items that combines methods used to evaluate the important characteristics of test items, such as difficulty, discrimination, and distractibility of the items in a test. This paper reviews some of the classical methods for…
ERIC Educational Resources Information Center
Dedrick, Robert F.; Greenbaum, Paul E.
2011-01-01
Multilevel confirmatory factor analysis was used to evaluate the factor structure underlying the 12-item, three-factor "Interagency Collaboration Activities Scale" (ICAS) at the informant level and at the agency level. Results from 378 professionals (104 administrators, 201 service providers, and 73 case managers) from 32 children's mental health…
ERIC Educational Resources Information Center
Latimer, Lara; Walker, Lorraine O.; Kim, Sunghun; Pasch, Keryn E.; Sterling, Bobbie Sue
2011-01-01
Objective: This study examined test-retest reliability, internal consistency, and construct and predictive validity of the Physical Activity and Nutrition Self-Efficacy (PANSE) scale, an 11-item instrument to assess weight-loss self-efficacy among postpartum women of lower income. Methods: Seventy-one women completed the PANSE scale and…
ERIC Educational Resources Information Center
Nissim, Yonit; Weissblueth, Eyal; Scott-Webber, Lennie; Amar, Shimon
2016-01-01
We investigated the effect of an innovative technology-supported learning environment on pre-service student teachers' motivation and 21st century skills. Students and instructors filled-in the Active Learning Post Occupancy Evaluation (AL-POE) questionnaire. Analysis included tests for individual items and a comparison of the overall mean,…
Better assessment of physical function: item improvement is neglected but essential
2009-01-01
Introduction Physical function is a key component of patient-reported outcome (PRO) assessment in rheumatology. Modern psychometric methods, such as Item Response Theory (IRT) and Computerized Adaptive Testing, can materially improve measurement precision at the item level. We present the qualitative and quantitative item-evaluation process for developing the Patient Reported Outcomes Measurement Information System (PROMIS) Physical Function item bank. Methods The process was stepwise: we searched extensively to identify extant Physical Function items and then classified and selectively reduced the item pool. We evaluated retained items for content, clarity, relevance and comprehension, reading level, and translation ease by experts and patient surveys, focus groups, and cognitive interviews. We then assessed items by using classic test theory and IRT, used confirmatory factor analyses to estimate item parameters, and graded response modeling for parameter estimation. We retained the 20 Legacy (original) Health Assessment Questionnaire Disability Index (HAQ-DI) and the 10 SF-36's PF-10 items for comparison. Subjects were from rheumatoid arthritis, osteoarthritis, and healthy aging cohorts (n = 1,100) and a national Internet sample of 21,133 subjects. Results We identified 1,860 items. After qualitative and quantitative evaluation, 124 newly developed PROMIS items composed the PROMIS item bank, which included revised Legacy items with good fit that met IRT model assumptions. Results showed that the clearest and best-understood items were simple, in the present tense, and straightforward. Basic tasks (like dressing) were more relevant and important versus complex ones (like dancing). Revised HAQ-DI and PF-10 items with five response options had higher item-information content than did comparable original Legacy items with fewer response options. IRT analyses showed that the Physical Function domain satisfied general criteria for unidimensionality with one-, two-, three-, and four-factor models having comparable model fits. Correlations between factors in the test data sets were > 0.90. Conclusions Item improvement must underlie attempts to improve outcome assessment. The clear, personally important and relevant, ability-framed items in the PROMIS Physical Function item bank perform well in PRO assessment. They will benefit from further study and application in a wider variety of rheumatic diseases in diverse clinical groups, including those at the extremes of physical functioning, and in different administration modes. PMID:20015354
Better assessment of physical function: item improvement is neglected but essential.
Bruce, Bonnie; Fries, James F; Ambrosini, Debbie; Lingala, Bharathi; Gandek, Barbara; Rose, Matthias; Ware, John E
2009-01-01
Physical function is a key component of patient-reported outcome (PRO) assessment in rheumatology. Modern psychometric methods, such as Item Response Theory (IRT) and Computerized Adaptive Testing, can materially improve measurement precision at the item level. We present the qualitative and quantitative item-evaluation process for developing the Patient Reported Outcomes Measurement Information System (PROMIS) Physical Function item bank. The process was stepwise: we searched extensively to identify extant Physical Function items and then classified and selectively reduced the item pool. We evaluated retained items for content, clarity, relevance and comprehension, reading level, and translation ease by experts and patient surveys, focus groups, and cognitive interviews. We then assessed items by using classic test theory and IRT, used confirmatory factor analyses to estimate item parameters, and graded response modeling for parameter estimation. We retained the 20 Legacy (original) Health Assessment Questionnaire Disability Index (HAQ-DI) and the 10 SF-36's PF-10 items for comparison. Subjects were from rheumatoid arthritis, osteoarthritis, and healthy aging cohorts (n = 1,100) and a national Internet sample of 21,133 subjects. We identified 1,860 items. After qualitative and quantitative evaluation, 124 newly developed PROMIS items composed the PROMIS item bank, which included revised Legacy items with good fit that met IRT model assumptions. Results showed that the clearest and best-understood items were simple, in the present tense, and straightforward. Basic tasks (like dressing) were more relevant and important versus complex ones (like dancing). Revised HAQ-DI and PF-10 items with five response options had higher item-information content than did comparable original Legacy items with fewer response options. IRT analyses showed that the Physical Function domain satisfied general criteria for unidimensionality with one-, two-, three-, and four-factor models having comparable model fits. Correlations between factors in the test data sets were > 0.90. Item improvement must underlie attempts to improve outcome assessment. The clear, personally important and relevant, ability-framed items in the PROMIS Physical Function item bank perform well in PRO assessment. They will benefit from further study and application in a wider variety of rheumatic diseases in diverse clinical groups, including those at the extremes of physical functioning, and in different administration modes.
Hughes, Jane; Wilson, Wayne J; MacBean, Naomi; Hill, Anne E
2016-12-01
To develop a tool for assessing audiology students taking a case history and giving feedback with simulated patients (SP). Single observation, single group design. Twenty-four first-year audiology students, five simulated patients, two clinical educators, and three evaluators. The Audiology Simulated Patient Interview Rating Scale (ASPIRS) was developed consisting of six items assessing specific clinical skills, non-verbal communication, verbal communication, interpersonal skills, interviewing skills, and professional practice skills. These items are applied once for taking a case history and again for giving feedback. The ASPIRS showed very high internal consistency (α = 0.91-0.97; mean inter-item r = 0.64-0.85) and fair-to-moderate agreement between evaluators (29.2-54.2% exact and 79.2-100% near agreement; κ weighted up to 0.60). It also showed fair-to-moderate absolute agreement amongst evaluators for single evaluator scores (intraclass correlation coefficient [ICC] r = 0.35-0.59) and substantial consistency of agreement amongst evaluators for three-evaluator averaged scores (ICC r = 0.62-0.81). Factor analysis showed the ASPIRS' 12 items fell into two components, one containing all feedback items and one containing all case history items. The ASPIRS shows promise as the first published tool for assessing audiology students taking a case history and giving feedback with an SP.
Gamblers seeking online help are active help-seekers: Time to support autonomy and competence.
Rodda, S N; Dowling, N A; Lubman, D I
2018-06-05
Research investigating rates of help-seeking for problem gambling has traditionally focused on the uptake of face-to-face gambling services alone, despite the World Health Organisation defining help-seeking as any action or activity undertaken to improve or resolve emotional, psychological or behavioural problems. The primary aim of this study is to examine the full range of help-seeking options utilised by gamblers, and to determine whether administering a comprehensive list of help options yields higher help-seeking rates than a single item measure. A one-item and expanded 14-item help-seeking Questionnaire (the Help-Seeking Questionnaire; HSQ) were administered to 277 problem gamblers seeking help online. We found the 14-item HSQ yielded a significantly higher level of lifetime professional help-seeking (70%) compared to the one-item measure (22%). When we included self-directed activities, 93% of gamblers reported they had previously attempted at least one activity to reduce their gambling. Current measurement of help-seeking appears to underestimate the range of activities currently undertaken by gamblers to reduce their gambling. Surveys need to include the one-item HSQ (over the past 12 months have you sought professional help or advice (online, by phone, or in person), support from family or friends, or did something by yourself to limit or reduce your gambling?) or the three-item HSQ which measures engagement of face-to-face services (i.e., counselling, advice, groups), distance-based (i.e., anonymous telephone, online) and self-directed (i.e., activities not involving professional oversight) activities separately. The full 14-item screen can be administered when brief screens are positive to ensure accurate measurement of help-seeking. Copyright © 2018 Elsevier Ltd. All rights reserved.
Neural Correlates of Learning from Induced Insight: A Case for Reward-Based Episodic Encoding
Kizilirmak, Jasmin M.; Thuerich, Hannes; Folta-Schoofs, Kristian; Schott, Björn H.; Richardson-Klavehn, Alan
2016-01-01
Experiencing insight when solving problems can improve memory formation for both the problem and its solution. The underlying neural processes involved in this kind of learning are, however, thus far insufficiently understood. Here, we conceptualized insight as the sudden understanding of a novel relationship between known stimuli that fits into existing knowledge and is accompanied by a positive emotional response. Hence, insight is thought to comprise associative novelty, schema congruency, and intrinsic reward, all of which are separately known to enhance memory performance. We examined the neural correlates of learning from induced insight with functional magnetic resonance imaging (fMRI) using our own version of the compound-remote-associates-task (CRAT) in which each item consists of three clue words and a solution word. (Pseudo-)Solution words were presented after a brief period of problem-solving attempts to induce either sudden comprehension (CRA items) or continued incomprehension (control items) at a specific time point. By comparing processing of the solution words of CRA with control items, we found induced insight to elicit activation of the rostral anterior cingulate cortex/medial prefrontal cortex (rACC/mPFC) and left hippocampus. This pattern of results lends support to the role of schema congruency (rACC/mPFC) and associative novelty (hippocampus) in the processing of induced insight. We propose that (1) the mPFC not only responds to schema-congruent information, but also to the detection of novel schemata, and (2) that the hippocampus responds to a form of associative novelty that is not just a novel constellation of familiar items, but rather comprises a novel meaningful relationship between the items—which was the only difference between our insight and no insight conditions. To investigate episodic long-term memory encoding, we compared CRA items whose solution word was recognized 24 h after encoding to those with forgotten solutions. We found activation in the left striatum and parts of the left amygdala, pointing to a potential role of brain reward circuitry in the encoding of the solution words. We propose that learning from induced insight mainly relies on the amygdala evaluating the internal value (as an affective evaluation) of the suddenly comprehended information, and striatum-dependent reward-based learning. PMID:27847490
ERIC Educational Resources Information Center
Racsmany, Mihaly; Conway, Martin A.
2006-01-01
Six experiments examined the proposal that an item of long-term knowledge can be simultaneously inhibited and activated. In 2 directed forgetting experiments items to-be-forgotten were found to be inhibited in list-cued recall but activated in lexical decision tasks. In 3 retrieval practice experiments, unpracticed items from practiced categories…
Sexual Health and Positive Subjective Well-Being in Partnered Older Men and Women
Vanhoutte, Bram; Nazroo, James; Pendleton, Neil
2016-01-01
Objectives: We examine the associations between different patterns of sexual behavior and function and three indicators of subjective well-being (SWB) covering eudemonic, evaluative, and affective well-being in a representative sample of partnered older people. Method: Using data from a Sexual Relationships and Activities Questionnaire (SRA-Q) in Wave 6 of the English Longitudinal Study of Ageing, latent class analysis identified groups characterized by distinctive patterns of sexual behavior and function and then examined their link to SWB. Eudemonic SWB was measured using a revised 15-item version of the CASP-19, evaluative SWB using the Satisfaction With Life Scale, and affective SWB using the 8-item version of the Centre for Epidemiologic Studies-Depression scale. Results: Sexual behavior and function was best described by six classes among men and five classes among women. These ranged from high sexual desire, frequent partnered sexual activities, and few sexual problems (Class 1) to low sexual desire, infrequent/no sexual activity, and problems with sexual function (Class 5[women]/6[men]). Men and women who reported either infrequent/no sexual activity, or were sexually active but reported sexual problems, generally had lower SWB than those individuals identified in Class 1. Poorer SWB in men was more strongly associated with sexual function difficulties, whereas in women desire and frequency of partnered activities appeared more important in relation to SWB. Discussion: Within the context of a partnered relationship continuing sexual desire, activity and functioning are associated with higher SWB, with distinctive patterns for women and men. PMID:26993519
ITEM SELECTION TECHNIQUES AND EVALUATION OF INSTRUCTIONAL OBJECTIVES.
ERIC Educational Resources Information Center
COX, RICHARD C.
THE VALIDITY OF AN EDUCATIONAL ACHIEVEMENT TEST DEPENDS UPON THE CORRESPONDENCE BETWEEN SPECIFIED EDUCATIONAL OBJECTIVES AND THE EXTENT TO WHICH THESE OBJECTIVES ARE MEASURED BY THE EVALUATION INSTRUMENT. THIS STUDY IS DESIGNED TO EVALUATE THE EFFECT OF STATISTICAL ITEM SELECTION ON THE STRUCTURE OF THE FINAL EVALUATION INSTRUMENT AS COMPARED WITH…
Reliability and validity of a short form household food security scale in a Caribbean community.
Gulliford, Martin C; Mahabir, Deepak; Rocke, Brian
2004-06-16
We evaluated the reliability and validity of the short form household food security scale in a different setting from the one in which it was developed. The scale was interview administered to 531 subjects from 286 households in north central Trinidad in Trinidad and Tobago, West Indies. We evaluated the six items by fitting item response theory models to estimate item thresholds, estimating agreement among respondents in the same households and estimating the slope index of income-related inequality (SII) after adjusting for age, sex and ethnicity. Item-score correlations ranged from 0.52 to 0.79 and Cronbach's alpha was 0.87. Item responses gave within-household correlation coefficients ranging from 0.70 to 0.78. Estimated item thresholds (standard errors) from the Rasch model ranged from -2.027 (0.063) for the 'balanced meal' item to 2.251 (0.116) for the 'hungry' item. The 'balanced meal' item had the lowest threshold in each ethnic group even though there was evidence of differential functioning for this item by ethnicity. Relative thresholds of other items were generally consistent with US data. Estimation of the SII, comparing those at the bottom with those at the top of the income scale, gave relative odds for an affirmative response of 3.77 (95% confidence interval 1.40 to 10.2) for the lowest severity item, and 20.8 (2.67 to 162.5) for highest severity item. Food insecurity was associated with reduced consumption of green vegetables after additionally adjusting for income and education (0.52, 0.28 to 0.96). The household food security scale gives reliable and valid responses in this setting. Differing relative item thresholds compared with US data do not require alteration to the cut-points for classification of 'food insecurity without hunger' or 'food insecurity with hunger'. The data provide further evidence that re-evaluation of the 'balanced meal' item is required.
ERIC Educational Resources Information Center
Jackson, Allen W.; Morrow, James R., Jr.; Bowles, Heather R.; FitzGerald, Shannon J.; Blair, Steven N.
2007-01-01
Valid measurement of physical activity is important for studying the risks for morbidity and mortality. The purpose of this study was to examine evidence of construct validity of two similar single-response items assessing physical activity via self-report. Both items are based on the stages of change model. The sample was 687 participants (men =…
Independent Orbiter Assessment (IOA): Analysis of the active thermal control subsystem
NASA Technical Reports Server (NTRS)
Sinclair, S. K.; Parkman, W. E.
1987-01-01
The results of the Independent Orbiter Assessment (IOA) of the Failure Modes and Effects Analysis (FMEA) and Critical Items List (CIL) are presented. The IOA approach features a top-down analysis of the hardware to determine failure modes, criticality, and potential critical (PCIs) items. To preserve independence, this analysis was accomplished without reliance upon the results contained within the NASA FMEA/CIL documentation. The independent analysis results corresponding to the Orbiter Active Thermal Control Subsystem (ATCS) are documented. The major purpose of the ATCS is to remove the heat, generated during normal Shuttle operations from the Orbiter systems and subsystems. The four major components of the ATCS contributing to the heat removal are: Freon Coolant Loops; Radiator and Flow Control Assembly; Flash Evaporator System; and Ammonia Boiler System. In order to perform the analysis, the IOA process utilized available ATCS hardware drawings and schematics for defining hardware assemblies, components, and hardware items. Each level of hardware was evaluated and analyzed for possible failure modes and effects. Criticality was assigned based upon the severity of the effect for each failure mode. Of the 310 failure modes analyzed, 101 were determined to be PCIs.
The PROactive innovative conceptual framework on physical activity.
Dobbels, Fabienne; de Jong, Corina; Drost, Ellen; Elberse, Janneke; Feridou, Chryssoula; Jacobs, Laura; Rabinovich, Roberto; Frei, Anja; Puhan, Milo A; de Boer, Willem I; van der Molen, Thys; Williams, Kate; Pinnock, Hillary; Troosters, Thierry; Karlsson, Niklas; Kulich, Karoly; Rüdell, Katja
2014-11-01
Although physical activity is considered an important therapeutic target in chronic obstructive pulmonary disease (COPD), what "physical activity" means to COPD patients and how their perspective is best measured is poorly understood. We designed a conceptual framework, guiding the development and content validation of two patient reported outcome (PRO) instruments on physical activity (PROactive PRO instruments). 116 patients from four European countries with diverse demographics and COPD phenotypes participated in three consecutive qualitative studies (63% male, age mean±sd 66±9 years, 35% Global Initiative for Chronic Obstructive Lung Disease stage III-IV). 23 interviews and eight focus groups (n = 54) identified the main themes and candidate items of the framework. 39 cognitive debriefings allowed the clarity of the items and instructions to be optimised. Three themes emerged, i.e. impact of COPD on amount of physical activity, symptoms experienced during physical activity, and adaptations made to facilitate physical activity. The themes were similar irrespective of country, demographic or disease characteristics. Iterative rounds of appraisal and refinement of candidate items resulted in 30 items with a daily recall period and 34 items with a 7-day recall period. For the first time, our approach provides comprehensive insight on physical activity from the COPD patients' perspective. The PROactive PRO instruments' content validity represents the pivotal basis for empirically based item reduction and validation. ©ERS 2014.
Mokken scaling of the Myocardial Infarction Dimensional Assessment Scale (MIDAS).
Thompson, David R; Watson, Roger
2011-02-01
The purpose of this study was to examine the hierarchical and cumulative nature of the 35 items of the Myocardial Infarction Dimensional Assessment Scale (MIDAS), a disease-specific health-related quality of life measure. Data from 668 participants who completed the MIDAS were analysed using the Mokken Scaling Procedure, which is a computer program that searches polychotomous data for hierarchical and cumulative scales on the basis of a range of diagnostic criteria. Fourteen MIDAS items were retained in a Mokken scale and these items included physical activity, insecurity, emotional reaction and dependency items but excluded items related to diet, medication or side-effects. Item difficulty, in item response theory terms, ran from physical activity items (low difficulty) to insecurity, suggesting that the most severe quality of life effect of myocardial infarction is loneliness and isolation. Items from the MIDAS form a strong and reliable Mokken scale, which provides new insight into the relationship between items in the MIDAS and the measurement of quality of life after myocardial infarction. © 2010 Blackwell Publishing Ltd.
Teresi, Jeanne A.; Ocepek-Welikson, Katja; Kleinman, Marjorie; Ramirez, Mildred; Kim, Giyeon
2017-01-01
Short form measures from the Patient Reported Outcomes Measurement Information System® (PROMIS®) are used widely. The present study was among the first to examine differential item functioning (DIF) in the PROMIS Depression short form scales in a sample of over 5000 racially/ethnically diverse patients with cancer. DIF analyses were conducted across different racial/ethnic, educational, age, gender and language groups. Methods DIF hypotheses, generated by content experts, informed the evaluation of the DIF analyses. The graded item response theory (IRT) model was used to evaluate the five-level ordinal items. The primary tests of DIF were Wald tests; sensitivity analyses were conducted using the IRT ordinal logistic regression procedure. Magnitude was evaluated using expected item score functions, and the non-compensatory differential item functioning (NCDIF) and T1 indexes, both based on group differences in the item curves. Aggregate impact was evaluated with expected scale score (test) response functions; individual impact was assessed through examination of differences in DIF adjusted and unadjusted depression estimates. Results Many items evidenced DIF; however, only a few had slightly elevated magnitude. No items evidenced salient DIF with respect to NCDIF and the scale-level impact was minimal for all group comparisons. The following short form items might be targeted for further study because they were also hypothesized to evidence DIF. One item showed slightly higher magnitude of DIF for age: nothing to look forward to; conditional on depression, this item was more likely to be endorsed in the depressed direction by individuals in older groups as contrasted with the cohort aged 21 to 49. This item was also hypothesized to show age DIF. Only one item (failure) showed DIF of slightly higher magnitude (just above threshold) for Whites vs. Asians/Pacific Islanders in the direction of higher likelihood of endorsement for Asians/Pacific Islanders. This item was also hypothesized to show DIF for minority groups. The impact of DIF was negligible. Conditional on depression, the items, worthless and hopeless were more likely to be endorsed in the depressed direction by respondents with less than high school education vs. those with a graduate degree; the magnitude of DIF was slightly above the T1 threshold, but not that of NCDIF. These items were also hypothesized to show DIF in the direction of more feelings of worthlessness by groups with lower education. While the magnitude and aggregate impact of DIF was small, in a few instances, individual impact was observed. Information provided was relatively high, particularly in the middle upper (depressed) tail of the distribution. Reliability estimates were high (> 0.90) across all studied groups, regardless of estimation method. Conclusions This was the first study to evaluate measurement equivalence of the PROMIS Depression short forms across large samples of ethnically diverse groups. There were few items with DIF, and none of high magnitude, thus supporting the use of PROMIS Depression short form measures across such groups. These results could be informative for those using the short forms in minority populations or clinicians evaluating individuals with the depression short forms. PMID:28553573
User embracement with risk classification in an emergency care unit: an evaluative study.
Hermida, Patrícia Madalena Vieira; Nascimento, Eliane Regina Pereira do; Echevarría-Guanilo, Maria Elena; Brüggemann, Odaléa Maria; Malfussi, Luciana Bihain Hagemann de
2018-01-01
Objective Describing the evaluation of the Structure, Process and Outcome of User Embracement with Risk Classification of an Emergency Care Unit from the perspective of physicians and nurses. Method An evaluative, descriptive, quantitative study developed in Santa Catarina. Data were collected using a validated and adapted instrument consisting of 21 items distributed in the dimensions of Structure (facilities), Process (activities and relationships in providing care) and Outcome (care effects). In the analysis, descriptive statistics and the Mean Ranking and Mean Score calculations were applied. Results The sample consisted of 37 participants. From the 21 evaluated items, 11 (52.4%) had a Mean Ranking between 3 and 4, and none of them reached the maximum ranking (5 points). "Prioritization of severe cases" and "Primary care according to the severity of the case" reached a higher Mean Ranking (4.5), while "Flowchart discussion" had the lowest Ranking (2.1). The dimensions of Structure, Process and Outcome reached mean scores of 23.9, 21.9 and 25.5, respectively, indicating a Precarious evaluation (17.5 to 26.1 points). Conclusion User Embracement with Risk Classification is precarious, especially regarding the Process which obtained a lower satisfaction level from the participants.
Kiernan, Michaela; Schoffman, Danielle E.; Lee, Katherine; Brown, Susan D.; Fair, Joan M.; Perri, Michael G.; Haskell, William L.
2015-01-01
Background Physical activity is essential for chronic disease prevention, yet <40% of overweight/obese adults meet national activity recommendations. For time-efficient counseling, clinicians need a brief easy-to-use tool that reliably and validly assesses a full range of activity levels, and most importantly, is sensitive to clinically meaningful changes in activity. The Stanford Leisure-Time Activity Categorical Item (L-Cat) is a single item comprised of six descriptive categories ranging from inactive to very active. This novel methodological approach assesses national activity recommendations as well as multiple clinically relevant categories below and above recommendations, and incorporates critical methodological principles that enhance psychometrics (reliability, validity, sensitivity to change). Methods We evaluated the L-Cat’s psychometrics among 267 overweight/obese women asked to meet national activity recommendations in a randomized behavioral weight-loss trial. Results The L-Cat had excellent test-retest reliability (κ=0.64, P<.001) and adequate concurrent criterion validity; each L-Cat category at 6 months was associated with 1059 more daily pedometer steps (95% CI 712–1407, β=0.38, P<.001) and 1.9% greater initial weight loss at 6 months (95% CI −2.4 to −1.3, β=−0.38, P<.001). Of interest, L-Cat categories differentiated from each other in a dose-response gradient for steps and weight loss (Ps<.05) with excellent face validity. The L-Cat was sensitive to change in response to the trial’s activity component. Women increased one L-Cat category at 6 months (M=1.0±1.4, P<.001); 55.8% met recommendations at 6 months whereas 20.6% did at baseline (P<.001). Even among women not meeting recommendations at both baseline and 6 months (n=106), women who moved ≥1 L-Cat categories at 6 months lost more weight than those who did not (M=−4.6%, 95% CI −6.7 to −2.5, P<.001). Conclusions Given strong psychometrics, the L-Cat has timely potential for clinical use such as tracking activity changes via electronic medical records especially among overweight/obese populations unable or unlikely to reach national recommendations. PMID:23588625
Kiernan, M; Schoffman, D E; Lee, K; Brown, S D; Fair, J M; Perri, M G; Haskell, W L
2013-12-01
Physical activity is essential for chronic disease prevention, yet <40% of overweight/obese adults meet the national activity recommendations. For time-efficient counseling, clinicians need a brief, easy-to-use tool that reliably and validly assesses a full range of activity levels, and, most importantly, is sensitive to clinically meaningful changes in activity. The Stanford Leisure-Time Activity Categorical Item (L-Cat) is a single item comprising six descriptive categories ranging from inactive to very active. This novel methodological approach assesses national activity recommendations as well as multiple clinically relevant categories below and above the recommendations, and incorporates critical methodological principles that enhance psychometrics (reliability, validity and sensitivity to change). We evaluated the L-Cat's psychometrics among 267 overweight/obese women who were asked to meet the national activity recommendations in a randomized behavioral weight-loss trial. The L-Cat had excellent test-retest reliability (κ=0.64, P<0.001) and adequate concurrent criterion validity; each L-Cat category at 6 months was associated with 1059 more daily pedometer steps (95% CI 712-1407, β=0.38, P<0.001) and 1.9% greater initial weight loss at 6 months (95% CI -2.4 to -1.3, β=-0.38, P<0.001). Of interest, L-Cat categories differentiated from each other in a dose-response gradient for steps and weight loss (Ps<0.05) with excellent face validity. The L-Cat was sensitive to change in response to the trial's activity component. Women increased one L-Cat category at 6 months (M=1.0±1.4, P<0.001); 55.8% met the recommendations at 6 months whereas 20.6% did at baseline (P<0.001). Even among women not meeting the recommendations at both baseline and 6 months (n=106), women who moved 1 L-Cat categories at 6 months lost more weight than those who did not (M=-4.6%, 95% CI -6.7 to -2.5, P<0.001). Given strong psychometrics, the L-Cat has timely potential for clinical use such as tracking activity changes via electronic medical records, especially among overweight/obese populations who are unable or unlikely to reach national recommendations.
2013-01-01
Background Assessing the risk of bias of randomized controlled trials (RCTs) is crucial to understand how biases affect treatment effect estimates. A number of tools have been developed to evaluate risk of bias of RCTs; however, it is unknown how these tools compare to each other in the items included. The main objective of this study was to describe which individual items are included in RCT quality tools used in general health and physical therapy (PT) research, and how these items compare to those of the Cochrane Risk of Bias (RoB) tool. Methods We used comprehensive literature searches and a systematic approach to identify tools that evaluated the methodological quality or risk of bias of RCTs in general health and PT research. We extracted individual items from all quality tools. We calculated the frequency of quality items used across tools and compared them to those in the RoB tool. Comparisons were made between general health and PT quality tools using Chi-squared tests. Results In addition to the RoB tool, 26 quality tools were identified, with 19 being used in general health and seven in PT research. The total number of quality items included in general health research tools was 130, compared with 48 items across PT tools and seven items in the RoB tool. The most frequently included items in general health research tools (14/19, 74%) were inclusion and exclusion criteria, and appropriate statistical analysis. In contrast, the most frequent items included in PT tools (86%, 6/7) were: baseline comparability, blinding of investigator/assessor, and use of intention-to-treat analysis. Key items of the RoB tool (sequence generation and allocation concealment) were included in 71% (5/7) of PT tools, and 63% (12/19) and 37% (7/19) of general health research tools, respectively. Conclusions There is extensive item variation across tools that evaluate the risk of bias of RCTs in health research. Results call for an in-depth analysis of items that should be used to assess risk of bias of RCTs. Further empirical evidence on the use of individual items and the psychometric properties of risk of bias tools is needed. PMID:24044807
Armijo-Olivo, Susan; Fuentes, Jorge; Ospina, Maria; Saltaji, Humam; Hartling, Lisa
2013-09-17
Assessing the risk of bias of randomized controlled trials (RCTs) is crucial to understand how biases affect treatment effect estimates. A number of tools have been developed to evaluate risk of bias of RCTs; however, it is unknown how these tools compare to each other in the items included. The main objective of this study was to describe which individual items are included in RCT quality tools used in general health and physical therapy (PT) research, and how these items compare to those of the Cochrane Risk of Bias (RoB) tool. We used comprehensive literature searches and a systematic approach to identify tools that evaluated the methodological quality or risk of bias of RCTs in general health and PT research. We extracted individual items from all quality tools. We calculated the frequency of quality items used across tools and compared them to those in the RoB tool. Comparisons were made between general health and PT quality tools using Chi-squared tests. In addition to the RoB tool, 26 quality tools were identified, with 19 being used in general health and seven in PT research. The total number of quality items included in general health research tools was 130, compared with 48 items across PT tools and seven items in the RoB tool. The most frequently included items in general health research tools (14/19, 74%) were inclusion and exclusion criteria, and appropriate statistical analysis. In contrast, the most frequent items included in PT tools (86%, 6/7) were: baseline comparability, blinding of investigator/assessor, and use of intention-to-treat analysis. Key items of the RoB tool (sequence generation and allocation concealment) were included in 71% (5/7) of PT tools, and 63% (12/19) and 37% (7/19) of general health research tools, respectively. There is extensive item variation across tools that evaluate the risk of bias of RCTs in health research. Results call for an in-depth analysis of items that should be used to assess risk of bias of RCTs. Further empirical evidence on the use of individual items and the psychometric properties of risk of bias tools is needed.
Parietal cortex and episodic memory retrieval in schizophrenia.
Lepage, Martin; Pelletier, Marc; Achim, Amélie; Montoya, Alonso; Menear, Matthew; Lal, Sam
2010-06-30
People with schizophrenia consistently show memory impairment on varying tasks including item recognition memory. Relative to the correct rejection of distracter items, the correct recognition of studied items consistently produces an effect termed the old/new effect that is characterized by increased activity in parietal and frontal cortical regions. This effect has received only scant attention in schizophrenia. We examined the old/new effect in 15 people with schizophrenia and 18 controls during an item recognition test, and neural activity was examined with event-related functional magnetic resonance imaging. Both groups performed equally well during the recognition test and showed increased activity in a left dorsolateral prefrontal region and in the precuneus bilaterally during the successful recognition of old items relative to the correct rejection of new items. The control group also exhibited increased activity in the dorsal left parietal cortex. This region has been implicated in the top-down modulation of memory which involves control processes that support memory-retrieval search, monitoring and verification. Although these processes may not be of paramount importance in item recognition memory performance, the present findings suggest that people with schizophrenia may have difficulty with such top-down modulation, a finding consistent with many other studies in information processing.
Vegetable parenting practices scale: Item response modeling analyses
USDA-ARS?s Scientific Manuscript database
Our objective was to evaluate the psychometric properties of a vegetable parenting practices scale using multidimensional polytomous item response modeling which enables assessing item fit to latent variables and the distributional characteristics of the items in comparison to the respondents. We al...
Semon, Natalie L.; Lating, Jeffrey M.; Everly, George S.; Perry, Charlene J.; Moore, Suzanne Straub; Mosley, Adrian M.; Thompson, Carol B.; Links, Jonathan M.
2014-01-01
Objectives Faculty and affiliates of the Johns Hopkins Preparedness and Emergency Response Research Center partnered with local health departments and faith-based organizations to develop a dual-intervention model of capacity-building for public mental health preparedness and community resilience. Project objectives included (1) determining the feasibility of the tri-partite collaborative concept; (2) designing, delivering, and evaluating psychological first aid (PFA) training and guided preparedness planning (GPP); and (3) documenting preliminary evidence of the sustainability and impact of the model. Methods We evaluated intervention effectiveness by analyzing pre- and post-training changes in participant responses on knowledge-acquisition tests administered to three urban and four rural community cohorts. Changes in percent of correct items and mean total correct items were evaluated. Criteria for model sustainability and impact were, respectively, observations of nonacademic partners engaging in efforts to advance post-project preparedness alliances, and project-attributable changes in preparedness-related practices of local or state governments. Results The majority (11 of 14) test items addressing technical or practical PFA content showed significant improvement; we observed comparable testing results for GPP training. Government and faith partners developed ideas and tools for sustaining preparedness activities, and numerous project-driven changes in local and state government policies were documented. Conclusions Results suggest that the model could be an effective approach to promoting public health preparedness and community resilience. PMID:25355980
McCabe, O Lee; Semon, Natalie L; Lating, Jeffrey M; Everly, George S; Perry, Charlene J; Moore, Suzanne Straub; Mosley, Adrian M; Thompson, Carol B; Links, Jonathan M
2014-01-01
Faculty and affiliates of the Johns Hopkins Preparedness and Emergency Response Research Center partnered with local health departments and faith-based organizations to develop a dual-intervention model of capacity-building for public mental health preparedness and community resilience. Project objectives included (1) determining the feasibility of the tri-partite collaborative concept; (2) designing, delivering, and evaluating psychological first aid (PFA) training and guided preparedness planning (GPP); and (3) documenting preliminary evidence of the sustainability and impact of the model. We evaluated intervention effectiveness by analyzing pre- and post-training changes in participant responses on knowledge-acquisition tests administered to three urban and four rural community cohorts. Changes in percent of correct items and mean total correct items were evaluated. Criteria for model sustainability and impact were, respectively, observations of nonacademic partners engaging in efforts to advance post-project preparedness alliances, and project-attributable changes in preparedness-related practices of local or state governments. The majority (11 of 14) test items addressing technical or practical PFA content showed significant improvement; we observed comparable testing results for GPP training. Government and faith partners developed ideas and tools for sustaining preparedness activities, and numerous project-driven changes in local and state government policies were documented. Results suggest that the model could be an effective approach to promoting public health preparedness and community resilience.
Forrest, Christopher B; Devine, Janine; Bevans, Katherine B; Becker, Brandon D; Carle, Adam C; Teneralli, Rachel E; Moon, JeanHee; Tucker, Carole A; Ravens-Sieberer, Ulrike
2018-01-01
To describe the psychometric evaluation and item response theory calibration of the PROMIS Pediatric Life Satisfaction item banks, child-report, and parent-proxy editions. A pool of 55 life satisfaction items was administered to 1992 children 8-17 years old and 964 parents of children 5-17 years old. Analyses included descriptive statistics, reliability, factor analysis, differential item functioning, and assessment of construct validity. Thirteen items were deleted because of poor psychometric performance. An 8-item short form was administered to a national sample of 996 children 8-17 years old, and 1294 parents of children 5-17 years old. The combined sample (2988 children and 2258 parents) was used in item response theory (IRT) calibration analyses. The final item banks were unidimensional, the items were locally independent, and the items were free from impactful differential item functioning. The 8-item and 4-item short form scales showed excellent reliability, convergent validity, and discriminant validity. Life satisfaction decreased with declining socio-economic status, presence of a special health care need, and increasing age for girls, but not boys. After IRT calibration, we found that 4- and 8-item short forms had a high degree of precision (reliability) across a wide range (>4 SD units) of the latent variable. The PROMIS Pediatric Life Satisfaction item banks and their short forms provide efficient, precise, and valid assessments of life satisfaction in children and youth.
Forrest, Christopher B; Ravens-Sieberer, Ulrike; Devine, Janine; Becker, Brandon D; Teneralli, Rachel; Moon, JeanHee; Carle, Adam; Tucker, Carole A; Bevans, Katherine B
2018-03-01
The purpose of this study is to describe the psychometric evaluation and item response theory calibration of the PROMIS Pediatric Positive Affect item bank, child-report and parent-proxy editions. The initial item pool comprising 53 items, previously developed using qualitative methods, was administered to 1,874 children 8-17 years old and 909 parents of children 5-17 years old. Analyses included descriptive statistics, reliability, factor analysis, differential item functioning, and construct validity. A total of 14 items were deleted, because of poor psychometric performance, and an 8-item short form constructed from the remaining 39 items was administered to a national sample of 1,004 children 8-17 years old, and 1,306 parents of children 5-17 years old. The combined sample was used in item response theory (IRT) calibration analyses. The final item bank appeared unidimensional, the items appeared locally independent, and the items were free from differential item functioning. The scales showed excellent reliability and convergent and discriminant validity. Positive affect decreased with children's age and was lower for those with a special health care need. After IRT calibration, we found that 4 and 8 item short forms had a high degree of precision (reliability) across a wide range of the latent trait (>4 SD units). The PROMIS Pediatric Positive Affect item bank and its short forms provide an efficient, precise, and valid assessment of positive affect in children and youth.
NASA Astrophysics Data System (ADS)
Arif, W.; Suhandi, A.; Kaniawati, I.; Setiawan, A.
2017-02-01
The development of scaffolding for evaluation instrument construction training program on the cognitive domain for senior high school physics teacher and the same level that is specified in the test instrument has been done. This development was motivated by the low ability of the majority of physics teachers in constructing the physics learning achievement test. This situation not in accordance with the demands of Permendiknas RI no. 16 tahun 2007 concerning the standard of academic qualifications and competence of teachers, stating that teachers should have a good ability to develop instruments for assessment and evaluation of process and learning outcomes. Based on the preliminary study results, it can be seen that the main cause of the inability of teachers in developing physics achievement test is because they do not good understand of the indicators for each aspect of cognitive domains. Scaffolding development is done by using the research and development methods formulated by Thiagarajan which includes define, design and develope steps. Develop step includes build the scaffolding, validation of scaffolding by experts and the limited pilot implementations on the training activities. From the build scaffolding step, resulted the scaffolding for the construction of test instruments training program which include the process steps; description of indicators, operationalization of indicators, construction the itemsframework (items scenarios), construction the items stem, construction the items and checking the items. The results of the validation by three validator indicates that the built scaffolding are suitable for use in the construction of physics achievement test training program, especially for novice. The limited pilot implementation of the built scaffolding conducted in training activities attended by 10 senior high school physics teachers in Garut district. The results of the limited pilot implementation shows that the built scaffolding have a medium effectiveness in improving the ability of senior high school physics teachers in constructing the physic achievement test instrument that is characterized by more than 70% of trainees achieve scores of test instruments construction of about 80 or more.
Development and Validation of the Consumer Health Activation Index.
Wolf, Michael S; Smith, Samuel G; Pandit, Anjali U; Condon, David M; Curtis, Laura M; Griffith, James; O'Conor, Rachel; Rush, Steven; Bailey, Stacy C; Kaplan, Gordon; Haufle, Vincent; Martin, David
2018-04-01
Although there has been increasing interest in patient engagement, few measures are publicly available and suitable for patients with limited health literacy. We sought to develop a Consumer Health Activation Index (CHAI) for use among diverse patients. Expert opinion, a systematic literature review, focus groups, and cognitive interviews with patients were used to create and revise a potential set of items. Psychometric testing guided by item response theory was then conducted among 301 English-speaking, community-dwelling adults. This included differential item functioning analyses to evaluate item performance across participant health literacy levels. To determine construct validity, CHAI scores were compared to scales measuring similar personality constructs. Associations between the CHAI and physical and mental health established predictive validity. A second study among 9,478 adults was used to confirm CHAI associations with health outcomes. Exploratory factor analyses revealed a single-factor solution with a 10-item scale. The CHAI showed good internal consistency (alpha = 0.81) and moderate test-retest reliability (ICC = 0.53). Reading grade level was found to be at the 6 th grade. Moderate to strong correlations were found with similar constructs (Multidimensional Health Locus of Control, r = 0.38, P < 0.001; Conscientiousness, r = 0.41, P < 0.001). Predictive validity was demonstrated through associations with functional health status measures (depression, r = -0.28, P < 0.001; anxiety, r = -0.22, P < 0.001; and physical functioning, r = 0.22, P < 0.001). In the validation sample, the CHAI was significantly associated with self-reported physical and mental health ( r = 0.31 and 0.32 respectively; both P < 0.001). The CHAI appears to be a valid, reliable, and easily administered tool that can be used to assess health activation among adults, including those with limited health literacy. Future studies should test the tool in actual use and explore further applications.
ERIC Educational Resources Information Center
Lynch, Mervin D.; Chaves, John
Items from Peirs-Harris and Coopersmith self-concept tests were evaluated against independent measures on three self-constructs, idealized, empathic, and worth. Construct measurements were obtained with the semantic differential and D statistic. Ratings were obtained from 381 children, grades 4-6. For each test, item ratings and construct measures…
Item Bank Development for a Revised Pediatric Evaluation of Disability Inventory (PEDI)
ERIC Educational Resources Information Center
Dumas, Helene; Fragala-Pinkham, Maria; Haley, Stephen; Coster, Wendy; Kramer, Jessica; Kao, Ying-Chia; Moed, Richard
2010-01-01
The Pediatric Evaluation of Disability Inventory (PEDI) is a useful clinical and research assessment, but it has limitations in content, age range, and efficiency. The purpose of this article is to describe the development of the item bank for a new computer adaptive testing version of the PEDI (PEDI-CAT). An expanded item set and response options…
ERIC Educational Resources Information Center
Pawade, Yogesh R.; Diwase, Dipti S.
2016-01-01
Item analysis of Multiple Choice Questions (MCQs) is the process of collecting, summarizing and utilizing information from students' responses to evaluate the quality of test items. Difficulty Index (p-value), Discrimination Index (DI) and Distractor Efficiency (DE) are the parameters which help to evaluate the quality of MCQs used in an…
The structure of coping among older adults living with HIV/AIDS and depressive symptoms
Hansen, Nathan B; Harrison, Blair; Fambro, Stacy; Bodnar, Sara; Heckman, Timothy G; Sikkema, Kathleen J
2013-01-01
One-third of adults living with HIV/AIDS are over the age of 50. This study evaluated the structure of coping among 307 older adults living with HIV/AIDS. Participants completed 61 coping items and measures of anxiety, depression, loneliness, and coping self-efficacy. Exploratory factor analyses retained 40 coping items loading on five specific first order factors (Distancing Avoidance, Social Support Seeking, Self-Destructive Avoidance, Spiritual Coping, and Solution-Focused Coping) and two general second order factors (Active and Avoidant Coping). Factors demonstrated good reliability and validity. Results suggest that general coping factors should be considered with specific factors when measuring coping among older adults. PMID:22453164
Validity and Reliability of General Nutrition Knowledge Questionnaire for Adults in Uganda
Bukenya, Richard; Ahmed, Abhiya; Andrade, Jeanette M.; Grigsby-Toussaint, Diana S.; Muyonga, John; Andrade, Juan E.
2017-01-01
This study sought to develop and validate a general nutrition knowledge questionnaire (GNKQ) for Ugandan adults. The initial draft consisted of 133 items on five constructs associated with nutrition knowledge; expert recommendations (16 items), food groups (70 items), selecting food (10 items), nutrition and disease relationship (23 items), and food fortification in Uganda (14 items). The questionnaire validity was evaluated in three studies. For the content validity (study 1), a panel of five content matter nutrition experts reviewed the GNKQ draft before and after face validity. For the face validity (study 2), head teachers and health workers (n = 27) completed the questionnaire before attending one of three focus groups to review the clarity of the items. For the construct and test-rest reliability (study 3), head teachers (n = 40) from private and public primary schools and nutrition (n = 52) and engineering (n = 49) students from Makerere University took the questionnaire twice (two weeks apart). Experts agreed (content validity index, CVI > 0.9; reliability, Gwet’s AC1 > 0.85) that all constructs were relevant to evaluate nutrition knowledge. After the focus groups, 29 items were identified as unclear, requiring major (n = 5) and minor (n = 24) reviews. The final questionnaire had acceptable internal consistency (Cronbach α > 0.95), test-retest reliability (r = 0.89), and differentiated (p < 0.001) nutrition knowledge scores between nutrition (67 ± 5) and engineering (39 ± 11) students. Only the construct on nutrition recommendations was unreliable (Cronbach α = 0.51, test-retest r = 0.55), which requires further optimization. The final questionnaire included topics on food groups (41 items), selecting food (2 items), nutrition and disease relationship (14 items), and food fortification in Uganda (22 items) and had good content, construct, and test-retest reliability to evaluate nutrition knowledge among Ugandan adults. PMID:28230779
Ramsay-Curve Item Response Theory for the Three-Parameter Logistic Item Response Model
ERIC Educational Resources Information Center
Woods, Carol M.
2008-01-01
In Ramsay-curve item response theory (RC-IRT), the latent variable distribution is estimated simultaneously with the item parameters of a unidimensional item response model using marginal maximum likelihood estimation. This study evaluates RC-IRT for the three-parameter logistic (3PL) model with comparisons to the normal model and to the empirical…
ERIC Educational Resources Information Center
Preston, Kathleen; Reise, Steven; Cai, Li; Hays, Ron D.
2011-01-01
The authors used a nominal response item response theory model to estimate category boundary discrimination (CBD) parameters for items drawn from the Emotional Distress item pools (Depression, Anxiety, and Anger) developed in the Patient-Reported Outcomes Measurement Information Systems (PROMIS) project. For polytomous items with ordered response…
Evaluation of Northwest University, Kano Post-UTME Test Items Using Item Response Theory
ERIC Educational Resources Information Center
Bichi, Ado Abdu; Hafiz, Hadiza; Bello, Samira Abdullahi
2016-01-01
High-stakes testing is used for the purposes of providing results that have important consequences. Validity is the cornerstone upon which all measurement systems are built. This study applied the Item Response Theory principles to analyse Northwest University Kano Post-UTME Economics test items. The developed fifty (50) economics test items was…
Weighted Maximum-a-Posteriori Estimation in Tests Composed of Dichotomous and Polytomous Items
ERIC Educational Resources Information Center
Sun, Shan-Shan; Tao, Jian; Chang, Hua-Hua; Shi, Ning-Zhong
2012-01-01
For mixed-type tests composed of dichotomous and polytomous items, polytomous items often yield more information than dichotomous items. To reflect the difference between the two types of items and to improve the precision of ability estimation, an adaptive weighted maximum-a-posteriori (WMAP) estimation is proposed. To evaluate the performance of…
A Process for Reviewing and Evaluating Generated Test Items
ERIC Educational Resources Information Center
Gierl, Mark J.; Lai, Hollis
2016-01-01
Testing organization needs large numbers of high-quality items due to the proliferation of alternative test administration methods and modern test designs. But the current demand for items far exceeds the supply. Test items, as they are currently written, evoke a process that is both time-consuming and expensive because each item is written,…
Rios, Sebastian; Perlman, Christopher M
2017-04-24
Social withdrawal is a symptom experienced by individuals with an array of mental health conditions, particularly those with schizophrenia and mood disorders. Assessments of social withdrawal are often lengthy and may not be routinely integrated within the comprehensive clinical assessment of the individual. This study utilized item response and classical test theory methods to derive a Social Withdrawal Scale (SWS) using items embedded within a routine clinical assessment, the RAI-Mental Health (RAI-MH). Using data from 60,571 inpatients in Ontario, Canada, a common factor analysis identified seven items from the RAI-MH that measure social withdrawal. A graded response model found that six items had acceptable discrimination parameters: lack of motivation, reduced interaction, decreased energy, flat affect, anhedonia, and loss of interest. Summing these items, the SWS was found to have strong internal consistency (Cronbach's alpha = 0.82) and showed a medium to large effect size (d = 0.77) from admission to discharge. Fewer individuals with high SWS scores participated in social activity or reported having a confidant compared to those with lower scores. Since the RAI-MH is available across clinical subgroups in several jurisdictions, the SWS is a useful tool for screening, clinical decision support, and evaluation.
Development and validation of the Myasthenia Gravis Impairment Index.
Barnett, Carolina; Bril, Vera; Kapral, Moira; Kulkarni, Abhaya; Davis, Aileen M
2016-08-30
We aimed to develop a measure of myasthenia gravis impairment using a previously developed framework and to evaluate reliability and validity, specifically face, content, and construct validity. The first draft of the Myasthenia Gravis Impairment Index (MGII) included examination items from available measures enriched with newly developed, patient-reported items, modified after patient input. International neuromuscular specialists evaluated face and content validity via an e-mail survey. Test-retest reliability was assessed in stable patients at a 3-week interval and interrater reliability was evaluated in the same day. Construct validity was assessed through correlations between the MGII and other measures and by comparing scores in different patient groups. The first draft was assessed by 18 patients, and 72 specialists answered the survey. The second draft had 7 examination and 22 patient-reported items. Field testing included 200 patients, with 54 patients completing the reliability studies. Test-retest reliability of the total score was good (intraclass correlation coefficient 0.92; 95% confidence interval 0.79-0.94), as was interrater reliability of the examination component (intraclass correlation coefficient 0.81; 95% confidence interval 0.79-0.94). The MGII correlated well with comparison measures, with higher correlations with the MG-activities of daily living (r = 0.91) and MG-specific quality of life 15-item scale (r = 0.78). When assessing different patient groups, the scores followed expected patterns. The MGII was developed using a patient-centered framework of myasthenia-related impairments and incorporating patient input throughout the development process. It is reliable in an outpatient setting and has demonstrated construct validity. Responsiveness studies are under way. © 2016 American Academy of Neurology.
Development and validation of the Myasthenia Gravis Impairment Index
Bril, Vera; Kapral, Moira; Kulkarni, Abhaya; Davis, Aileen M.
2016-01-01
Objective: We aimed to develop a measure of myasthenia gravis impairment using a previously developed framework and to evaluate reliability and validity, specifically face, content, and construct validity. Methods: The first draft of the Myasthenia Gravis Impairment Index (MGII) included examination items from available measures enriched with newly developed, patient-reported items, modified after patient input. International neuromuscular specialists evaluated face and content validity via an e-mail survey. Test–retest reliability was assessed in stable patients at a 3-week interval and interrater reliability was evaluated in the same day. Construct validity was assessed through correlations between the MGII and other measures and by comparing scores in different patient groups. Results: The first draft was assessed by 18 patients, and 72 specialists answered the survey. The second draft had 7 examination and 22 patient-reported items. Field testing included 200 patients, with 54 patients completing the reliability studies. Test–retest reliability of the total score was good (intraclass correlation coefficient 0.92; 95% confidence interval 0.79–0.94), as was interrater reliability of the examination component (intraclass correlation coefficient 0.81; 95% confidence interval 0.79–0.94). The MGII correlated well with comparison measures, with higher correlations with the MG–activities of daily living (r = 0.91) and MG-specific quality of life 15-item scale (r = 0.78). When assessing different patient groups, the scores followed expected patterns. Conclusions: The MGII was developed using a patient-centered framework of myasthenia-related impairments and incorporating patient input throughout the development process. It is reliable in an outpatient setting and has demonstrated construct validity. Responsiveness studies are under way. PMID:27402891
Development and assessment of floor and ceiling items for the PROMIS physical function item bank
2013-01-01
Introduction Disability and Physical Function (PF) outcome assessment has had limited ability to measure functional status at the floor (very poor functional abilities) or the ceiling (very high functional abilities). We sought to identify, develop and evaluate new floor and ceiling items to enable broader and more precise assessment of PF outcomes for the NIH Patient-Reported-Outcomes Measurement Information System (PROMIS). Methods We conducted two cross-sectional studies using NIH PROMIS item improvement protocols with expert review, participant survey and focus group methods. In Study 1, respondents with low PF abilities evaluated new floor items, and those with high PF abilities evaluated new ceiling items for clarity, importance and relevance. In Study 2, we compared difficulty ratings of new floor items by low functioning respondents and ceiling items by high functioning respondents to reference PROMIS PF-10 items. We used frequencies, percentages, means and standard deviations to analyze the data. Results In Study 1, low (n = 84) and high (n = 90) functioning respondents were mostly White, women, 70 years old, with some college, and disability scores of 0.62 and 0.30. More than 90% of the 31 new floor and 31 new ceiling items were rated as clear, important and relevant, leaving 26 ceiling and 30 floor items for Study 2. Low (n = 246) and high (n = 637) functioning Study 2 respondents were mostly White, women, 70 years old, with some college, and Health Assessment Questionnaire (HAQ) scores of 1.62 and 0.003. Compared to difficulty ratings of reference items, ceiling items were rated to be 10% more to greater than 40% more difficult to do, and floor items were rated to be about 12% to nearly 90% less difficult to do. Conclusions These new floor and ceiling items considerably extend the measurable range of physical function at either extreme. They will help improve instrument performance in populations with broad functional ranges and those concentrated at one or the other extreme ends of functioning. Optimal use of these new items will be assisted by computerized adaptive testing (CAT), reducing questionnaire burden and insuring item administration to appropriate individuals. PMID:24286166
Social desirability in personality inventories: Symptoms, diagnosis and prescribed cure
Bäckström, Martin; Björklund, Fredrik
2013-01-01
An analysis of social desirability in personality assessment is presented. Starting with the symptoms, Study 1 showed that mean ratings of graded personality items are moderately to strongly linearly related to social desirability (Self Deception, Impression formation, and the first Principal Component), suggesting that item popularity may be a useful heuristic tool for identifying items which elicit socially desirable responding. We diagnose the cause of socially desirable responding as an interaction between the evaluative content of the item and enhancement motivation in the rater. Study 2 introduced a possible cure; evaluative neutralization of items. To test the feasibility of the method lay psychometricians (undergraduates) reformulated existing personality test items according to written instructions. The new items were indeed lower in social desirability while essentially retaining the five factor structure and reliability of the inventory. We conclude that although neutralization is no miracle cure, it is simple and has beneficial effects. PMID:23252410
Bervoets, Liene; Van Noten, Caroline; Van Roosbroeck, Sofie; Hansen, Dominique; Van Hoorenbeeck, Kim; Verheyen, Els; Van Hal, Guido; Vankerckhoven, Vanessa
2014-01-01
This study was designed to validate the Dutch Physical Activity Questionnaires for Children (PAQ-C) and Adolescents (PAQ-A). After adjustment of the original Canadian PAQ-C and PAQ-A (i.e. translation/back-translation and evaluation by expert committee), content validity of both PAQs was assessed and calculated using item-level (I-CVI) and scale-level (S-CVI) content validity indexes. Inter-item and inter-rater reliability of 196 PAQ-C and 95 PAQ-A filled in by both children or adolescents and their parent, were evaluated. Inter-item reliability was calculated by Cronbach's alpha (α) and inter-rater reliability was examined by percent observed agreement and weighted kappa (κ). Concurrent validity of PAQ-A was examined in a subsample of 28 obese and 16 normal-weight children by comparing it with concurrently measured physical activity using a maximal cardiopulmonary exercise test for the assessment of peak oxygen uptake (VO2 peak). For both PAQs, I-CVI ranged 0.67-1.00. S-CVI was 0.89 for PAQ-C and 0.90 for PAQ-A. A total of 192 PAQ-C and 94 PAQ-A were fully completed by both child and parent. Cronbach's α was 0.777 for PAQ-C and 0.758 for PAQ-A. Percent agreement ranged 59.9-74.0% for PAQ-C and 51.1-77.7% for PAQ-A, and weighted κ ranged 0.48-0.69 for PAQ-C and 0.51-0.68 for PAQ-A. The correlation between total PAQ-A score and VO2 peak - corrected for age, gender, height and weight - was 0.516 (p = 0.001). Both PAQs have an excellent content validity, an acceptable inter-item reliability and a moderate to good strength of inter-rater agreement. In addition, total PAQ-A score showed a moderate positive correlation with VO2 peak. Both PAQs have an acceptable to good reliability and validity, however, further validity testing is recommended to provide a more complete assessment of both PAQs.
Dinkel, Danae; Dev, Dipti; Guo, Yage; Hulse, Emily; Rida, Zainab; Sedani, Ami; Coyle, Brian
2018-05-09
The purpose of this study was to determine if the Go Nutrition and Physical Activity Self-Assessment in Child Care (Go NAP SACC) intervention was effective in improving best practices in the areas of infant and child physical activity and outdoor play and learning in family child care homes (FCCHs) in Nebraska. FCCHs (n = 201) participated in a pre-post evaluation using the Infant and Child Physical Activity and Outdoor Play and Learning assessments from the Go NAP SACC validated measure to assess compliance with best practices. At post, FCCHs demonstrated significant differences in 85% of the Infant and Child Physical Activity items (17 of 20) and 80% of the Outdoor Play and Learning items (12 of 15). Significant differences in best practices between urban and rural FCCH providers were also found. Go NAP SACC appears to be an effective intervention in Nebraska as, after participation in the initiative, providers were improving child care physical activity best practices. Additional research is needed to objectively determine if these changes resulted in objective improvements in children's physical activity levels. Further, efforts are needed to develop and/or identify geographic-specific resources for continued improvement.
Doig, Emmah; Prescott, Sarah; Fleming, Jennifer; Cornwell, Petrea; Kuipers, Pim
2016-01-01
To examine the internal reliability and test-retest reliability of the Client-Centeredness of Goal Setting (C-COGS) scale. The C-COGS scale was administered to 42 participants with acquired brain injury after completion of multidisciplinary goal planning. Internal reliability of scale items was examined using item-partial total correlations and Cronbach's α coefficient. The scale was readministered within a 1-mo period to a subsample of 12 participants to examine test-retest reliability by calculating exact and close percentage agreement for each item. After examination of item-partial total correlations, test items were revised. The revised items demonstrated stronger internal consistency than the original items. Preliminary evaluation of test-retest reliability was fair, with an average exact percent agreement across all test items of 67%. Findings support the preliminary reliability of the C-COGS scale as a tool to evaluate and promote client-centered goal planning in brain injury rehabilitation. Copyright © 2016 by the American Occupational Therapy Association, Inc.
Chinese Mobile Health APPs for Hypertension Management: A Systematic Evaluation of Usefulness.
Liang, Jun; He, Xiaojun; Jia, Yuxi; Zhu, Wei; Lei, Jianbo
2018-01-01
To analyze and compare the usefulness of hypertension management APPs released in the Chinese market; to understand the general situations, characteristics, problems, and trends in hypertension management mHealth APPs; and to identify the gaps between mainland China products and non-mainland China products with the aim to provide recommendations for developers in industry and assist hypertensive patients in selecting suitable APPs. The hypertension management APPs available by October 2016 in China were analyzed from the perspective of data items and function usefulness. Sample sets were determined through PRISMA. An evaluation item set was developed based on the usability framework of TURF and the Chinese Guideline for the Management of Hypertension and used to quantitatively analyze the functionalities and data items collected from the sample APPs from the perspective of designers, users, and activity models. Among the 73 Chinese-supported APPs, none of the hypertension management APPs could fully cover the usefulness item set (mean = 37.4%). Regarding the use of mobile terminal hardware, only cameras and positioning sensors are commonly used in information collection. Regarding the data items and services provided, the most commonly collected data are "demographic information" (88% versus 100%) and "vital signs" (76% versus 100%), but APPs developed in mainland China and non-mainland China provided significantly different services and profit-making patterns. Regarding data security and privacy protection, the APPs from mainland China provided far lower usefulness (31% versus 56%). mHealth APPs can promptly and efficiently acquire sign-related data by improving the professionality and scientificity of data about healthy living habits. APPs also improve the preventive usefulness of the collected data and bring about new opportunities for the management and control of hypertension. Other important research trends include privacy protection and data security.
Chinese Mobile Health APPs for Hypertension Management: A Systematic Evaluation of Usefulness
Jia, Yuxi; Zhu, Wei
2018-01-01
Objective To analyze and compare the usefulness of hypertension management APPs released in the Chinese market; to understand the general situations, characteristics, problems, and trends in hypertension management mHealth APPs; and to identify the gaps between mainland China products and non-mainland China products with the aim to provide recommendations for developers in industry and assist hypertensive patients in selecting suitable APPs. Methods The hypertension management APPs available by October 2016 in China were analyzed from the perspective of data items and function usefulness. Sample sets were determined through PRISMA. An evaluation item set was developed based on the usability framework of TURF and the Chinese Guideline for the Management of Hypertension and used to quantitatively analyze the functionalities and data items collected from the sample APPs from the perspective of designers, users, and activity models. Results Among the 73 Chinese-supported APPs, none of the hypertension management APPs could fully cover the usefulness item set (mean = 37.4%). Regarding the use of mobile terminal hardware, only cameras and positioning sensors are commonly used in information collection. Regarding the data items and services provided, the most commonly collected data are “demographic information” (88% versus 100%) and “vital signs” (76% versus 100%), but APPs developed in mainland China and non-mainland China provided significantly different services and profit-making patterns. Regarding data security and privacy protection, the APPs from mainland China provided far lower usefulness (31% versus 56%). Conclusions mHealth APPs can promptly and efficiently acquire sign-related data by improving the professionality and scientificity of data about healthy living habits. APPs also improve the preventive usefulness of the collected data and bring about new opportunities for the management and control of hypertension. Other important research trends include privacy protection and data security. PMID:29744027
2009-12-31
Active Engagement, Protective Buffering, and Overprotection questionnaire and Stephen Lepore’s 15-item Social Constraints Scale have been added to the...questionnaires (the Active Engagement, Protective Buffering, and Overprotection questionnaire and Stephen Lepore’s 15- item Social Constraints Scale) is still...this end we included the Active Engagement, Protective Buffering and Overprotection questionnaire and Stephen Lepore’s 15-item Social Constraints
Disentangling the roles of arousal and amygdala activation in emotional declarative memory
Fernández, Guillén; Hermans, Erno J.
2016-01-01
A large body of evidence in animals and humans implicates the amygdala in promoting memory for arousing experiences. Although the amygdala can trigger threat-related noradrenergic-sympathetic arousal, in humans amygdala activation and noradrenergic-sympathetic arousal do not always concur. This raises the question how these two processes play a role in enhancing emotional declarative memory. This study was designed to disentangle these processes in a combined subsequent-memory/fear-conditioning paradigm with neutral items belonging to two conceptual categories as conditioned stimuli. Functional MRI, skin conductance (index of sympathetic activity), and pupil dilation (indirect index of central noradrenergic activity) were acquired throughout procedures. Recognition memory for individual items was tested 24 h later. We found that pupil dilation and skin conductance responses were higher on CS+ (associated with a shock) compared with CS− trials, irrespective of later memory for those items. By contrast, amygdala activity was only higher for CS+ items that were later confidently remembered compared with CS+ items that were later forgotten. Thus, amygdala activity and not noradrenergic-sympathetic arousal, predicted enhanced declarative item memory. This dissociation is in line with animal models stating that the amygdala integrates arousal-related neuromodulatory changes to alter mnemonic processes elsewhere in the brain. PMID:27217115
DiFilippo, Kristen Nicole; Huang, Wenhao; Chapman-Novakofski, Karen M
2017-10-27
The extensive availability and increasing use of mobile apps for nutrition-based health interventions makes evaluation of the quality of these apps crucial for integration of apps into nutritional counseling. The goal of this research was the development, validation, and reliability testing of the app quality evaluation (AQEL) tool, an instrument for evaluating apps' educational quality and technical functionality. Items for evaluating app quality were adapted from website evaluations, with additional items added to evaluate the specific characteristics of apps, resulting in 79 initial items. Expert panels of nutrition and technology professionals and app users reviewed items for face and content validation. After recommended revisions, nutrition experts completed a second AQEL review to ensure clarity. On the basis of 150 sets of responses using the revised AQEL, principal component analysis was completed, reducing AQEL into 5 factors that underwent reliability testing, including internal consistency, split-half reliability, test-retest reliability, and interrater reliability (IRR). Two additional modifiable constructs for evaluating apps based on the age and needs of the target audience as selected by the evaluator were also tested for construct reliability. IRR testing using intraclass correlations (ICC) with all 7 constructs was conducted, with 15 dietitians evaluating one app. Development and validation resulted in the 51-item AQEL. These were reduced to 25 items in 5 factors after principal component analysis, plus 9 modifiable items in two constructs that were not included in principal component analysis. Internal consistency and split-half reliability of the following constructs derived from principal components analysis was good (Cronbach alpha >.80, Spearman-Brown coefficient >.80): behavior change potential, support of knowledge acquisition, app function, and skill development. App purpose split half-reliability was .65. Test-retest reliability showed no significant change over time (P>.05) for all but skill development (P=.001). Construct reliability was good for items assessing age appropriateness of apps for children, teens, and a general audience. In addition, construct reliability was acceptable for assessing app appropriateness for various target audiences (Cronbach alpha >.70). For the 5 main factors, ICC (1,k) was >.80, with a P value of <.05. When 15 nutrition professionals evaluated one app, ICC (2,15) was .98, with a P value of <.001 for all 7 constructs when the modifiable items were specified for adults seeking weight loss support. Our preliminary effort shows that AQEL is a valid, reliable instrument for evaluating nutrition apps' qualities for clinical interventions by nutrition clinicians, educators, and researchers. Further efforts in validating AQEL in various contexts are needed. ©Kristen Nicole DiFilippo, Wenhao Huang, Karen M. Chapman-Novakofski. Originally published in JMIR Mhealth and Uhealth (http://mhealth.jmir.org), 27.10.2017.
Statistical evaluation of synchronous spike patterns extracted by frequent item set mining
Torre, Emiliano; Picado-Muiño, David; Denker, Michael; Borgelt, Christian; Grün, Sonja
2013-01-01
We recently proposed frequent itemset mining (FIM) as a method to perform an optimized search for patterns of synchronous spikes (item sets) in massively parallel spike trains. This search outputs the occurrence count (support) of individual patterns that are not trivially explained by the counts of any superset (closed frequent item sets). The number of patterns found by FIM makes direct statistical tests infeasible due to severe multiple testing. To overcome this issue, we proposed to test the significance not of individual patterns, but instead of their signatures, defined as the pairs of pattern size z and support c. Here, we derive in detail a statistical test for the significance of the signatures under the null hypothesis of full independence (pattern spectrum filtering, PSF) by means of surrogate data. As a result, injected spike patterns that mimic assembly activity are well detected, yielding a low false negative rate. However, this approach is prone to additionally classify patterns resulting from chance overlap of real assembly activity and background spiking as significant. These patterns represent false positives with respect to the null hypothesis of having one assembly of given signature embedded in otherwise independent spiking activity. We propose the additional method of pattern set reduction (PSR) to remove these false positives by conditional filtering. By employing stochastic simulations of parallel spike trains with correlated activity in form of injected spike synchrony in subsets of the neurons, we demonstrate for a range of parameter settings that the analysis scheme composed of FIM, PSF and PSR allows to reliably detect active assemblies in massively parallel spike trains. PMID:24167487
Distinct regions of the hippocampus are associated with memory for different spatial locations.
Jeye, Brittany M; MacEvoy, Sean P; Karanian, Jessica M; Slotnick, Scott D
2018-05-15
In the present functional magnetic resonance imaging (fMRI) study, we aimed to evaluate whether distinct regions of the hippocampus were associated with spatial memory for items presented in different locations of the visual field. In Experiment 1, during the study phase, participants viewed abstract shapes in the left or right visual field while maintaining central fixation. At test, old shapes were presented at fixation and participants classified each shape as previously in the "left" or "right" visual field followed by an "unsure"-"sure"-"very sure" confidence rating. Accurate spatial memory for shapes in the left visual field was isolated by contrasting accurate versus inaccurate spatial location responses. This contrast produced one hippocampal activation in which the interaction between item type and accuracy was significant. The analogous contrast for right visual field shapes did not produce activity in the hippocampus; however, the contrast of high confidence versus low confidence right-hits produced one hippocampal activation in which the interaction between item type and confidence was significant. In Experiment 2, the same paradigm was used but shapes were presented in each quadrant of the visual field during the study phase. Accurate memory for shapes in each quadrant, exclusively masked by accurate memory for shapes in the other quadrants, produced a distinct activation in the hippocampus. A multi-voxel pattern analysis (MVPA) of hippocampal activity revealed a significant correlation between behavioral spatial location accuracy and hippocampal MVPA accuracy across participants. The findings of both experiments indicate that distinct hippocampal regions are associated with memory for different visual field locations. Copyright © 2018 Elsevier B.V. All rights reserved.
Stochl, Jan; Böhnke, Jan R; Pickett, Kate E; Croudace, Tim J
2016-05-20
Recent developments in psychometric modeling and technology allow pooling well-validated items from existing instruments into larger item banks and their deployment through methods of computerized adaptive testing (CAT). Use of item response theory-based bifactor methods and integrative data analysis overcomes barriers in cross-instrument comparison. This paper presents the joint calibration of an item bank for researchers keen to investigate population variations in general psychological distress (GPD). Multidimensional item response theory was used on existing health survey data from the Scottish Health Education Population Survey (n = 766) to calibrate an item bank consisting of pooled items from the short common mental disorder screen (GHQ-12) and the Affectometer-2 (a measure of "general happiness"). Computer simulation was used to evaluate usefulness and efficacy of its adaptive administration. A bifactor model capturing variation across a continuum of population distress (while controlling for artefacts due to item wording) was supported. The numbers of items for different required reliabilities in adaptive administration demonstrated promising efficacy of the proposed item bank. Psychometric modeling of the common dimension captured by more than one instrument offers the potential of adaptive testing for GPD using individually sequenced combinations of existing survey items. The potential for linking other item sets with alternative candidate measures of positive mental health is discussed since an optimal item bank may require even more items than these.
Psychological distress in cancer survivors: the further development of an item bank.
Smith, Adam B; Armes, Jo; Richardson, Alison; Stark, Dan P
2013-02-01
Assessment of psychological distress by patient report is necessary to meet patients' needs throughout the cancer journey. We have previously developed an item bank to assess psychological distress but not evaluated it for cancer survivors. Our first aim in this study was to test whether we could extend our item bank to include cancer survivors. The second aim was to examine whether the item bank could assess positive affect as a single construct alongside negative psychological symptoms. Responses from 1315 cancer survivors to the Hospital Anxiety and Depression Scale (HADS) and the Positive and Negative Affect Scale (PANAS) were considered for inclusion in a pre-existing item bank created from a heterogeneous sample of 4914 cancer patients. Differential item functioning (DIF) was used to assess whether HADS responses drawn from the two samples were equivalent. Common-item equating was used to anchor the shared (HADS) items, whilst the PANAS items were added. Item fit was evaluated at each stage, and misfitting items were removed. Unidimensionality was assessed with a principal components factor analysis. The DIF analysis did not reveal any differences between the HADS item locations from the two samples. Three misfitting PANAS items were removed, resulting in a final unidimensional bank of 80 items with good internal reliability (α = 0.85). The new item bank is valid for use across the cancer journey, including cancer survivors, and modestly improves the assessment of all levels of psychological distress and positive psychological function. Copyright © 2011 John Wiley & Sons, Ltd.
Bimodal Bilinguals Co-activate Both Languages during Spoken Comprehension
Shook, Anthony; Marian, Viorica
2012-01-01
Bilinguals have been shown to activate their two languages in parallel, and this process can often be attributed to overlap in input between the two languages. The present study examines whether two languages that do not overlap in input structure, and that have distinct phonological systems, such as American Sign Language (ASL) and English, are also activated in parallel. Hearing ASL-English bimodal bilinguals’ and English monolinguals’ eye-movements were recorded during a visual world paradigm, in which participants were instructed, in English, to select objects from a display. In critical trials, the target item appeared with a competing item that overlapped with the target in ASL phonology. Bimodal bilinguals looked more at competing items than at phonologically unrelated items, and looked more at competing items relative to monolinguals, indicating activation of the sign-language during spoken English comprehension. The findings suggest that language co-activation is not modality specific, and provide insight into the mechanisms that may underlie cross-modal language co-activation in bimodal bilinguals, including the role that top-down and lateral connections between levels of processing may play in language comprehension. PMID:22770677
Williams, Karen Patricia; Templin, Thomas N.
2013-01-01
Objective This research describes the development and evaluation of a new scale for assessing functional cervical cancer health literacy, the Cervical Cancer Literacy Assessment Tool (C-CLAT). Methods In Phase 1, 35 items in English, Spanish and Arabic, for C-CLAT were generated, taking into account three content domains-Awareness, Knowledge, and Prevention/Control. After content validation, 24 items were retained for psychometric evaluation. In Phase 2, the 24-item C-CLAT was evaluated in three racial/ethnic populations of urban women (N =543). Psychometric methods included item analysis, multifactor Item Response Theory modeling, and concurrent correlations. Results The final C-CLAT consisted of 16 items, with an internal consistency reliability of .72. C-CLAT reliabilities in Black, Latina, and Arab women were .73, .76, and .60, respectively. The rank order correlations of item difficulties across racial/ethnic groups was high (r’s = .97 to .98). The C-CLAT was positively related to educational level, and Arab women scored significantly higher than the Black and Latina participants. Conclusions This study presents a psychometrically sound instrument that measures health literacy related to cervical cancer. Practice Implications The C-CLAT is a tool that can be orally administered by a lay person and used in a community-based health promotion intervention. PMID:24072456
Active Learning with Irrelevant Examples
NASA Technical Reports Server (NTRS)
Wagstaff, Kiri; Mazzoni, Dominic
2009-01-01
An improved active learning method has been devised for training data classifiers. One example of a data classifier is the algorithm used by the United States Postal Service since the 1960s to recognize scans of handwritten digits for processing zip codes. Active learning algorithms enable rapid training with minimal investment of time on the part of human experts to provide training examples consisting of correctly classified (labeled) input data. They function by identifying which examples would be most profitable for a human expert to label. The goal is to maximize classifier accuracy while minimizing the number of examples the expert must label. Although there are several well-established methods for active learning, they may not operate well when irrelevant examples are present in the data set. That is, they may select an item for labeling that the expert simply cannot assign to any of the valid classes. In the context of classifying handwritten digits, the irrelevant items may include stray marks, smudges, and mis-scans. Querying the expert about these items results in wasted time or erroneous labels, if the expert is forced to assign the item to one of the valid classes. In contrast, the new algorithm provides a specific mechanism for avoiding querying the irrelevant items. This algorithm has two components: an active learner (which could be a conventional active learning algorithm) and a relevance classifier. The combination of these components yields a method, denoted Relevance Bias, that enables the active learner to avoid querying irrelevant data so as to increase its learning rate and efficiency when irrelevant items are present. The algorithm collects irrelevant data in a set of rejected examples, then trains the relevance classifier to distinguish between labeled (relevant) training examples and the rejected ones. The active learner combines its ranking of the items with the probability that they are relevant to yield a final decision about which item to present to the expert for labeling. Experiments on several data sets have demonstrated that the Relevance Bias approach significantly decreases the number of irrelevant items queried and also accelerates learning speed.
The PROactive innovative conceptual framework on physical activity
Dobbels, Fabienne; de Jong, Corina; Drost, Ellen; Elberse, Janneke; Feridou, Chryssoula; Jacobs, Laura; Rabinovich, Roberto; Frei, Anja; Puhan, Milo A.; de Boer, Willem I.; van der Molen, Thys; Williams, Kate; Pinnock, Hillary; Troosters, Thierry; Karlsson, Niklas; Kulich, Karoly; Rüdell, Katja; Brindicci, Caterina; Higenbottam, Tim; Troosters, Thierry; Dobbels, Fabienne; Decramer, Marc; Tabberer, Margaret; Rabinovich, Roberto A; MacNee, William; Vogiatzis, Ioannis; Polkey, Michael; Hopkinson, Nick; Garcia-Aymerich, Judith; Puhan, Milo; Frei, Anja; van der Molen, Thys; de Jong, Corina; de Boer, Pim; Jarrod, Ian; McBride, Paul; Kamel, Nadia; Rudell, Katja; Wilson, Frederick J.; Ivanoff, Nathalie; Kulich, Karoly; Glendenning, Alistair; Karlsson, Niklas X.; Corriol-Rohou, Solange; Nikai, Enkeleida; Erzen, Damijan
2014-01-01
Although physical activity is considered an important therapeutic target in chronic obstructive pulmonary disease (COPD), what “physical activity” means to COPD patients and how their perspective is best measured is poorly understood. We designed a conceptual framework, guiding the development and content validation of two patient reported outcome (PRO) instruments on physical activity (PROactive PRO instruments). 116 patients from four European countries with diverse demographics and COPD phenotypes participated in three consecutive qualitative studies (63% male, age mean±sd 66±9 years, 35% Global Initiative for Chronic Obstructive Lung Disease stage III–IV). 23 interviews and eight focus groups (n = 54) identified the main themes and candidate items of the framework. 39 cognitive debriefings allowed the clarity of the items and instructions to be optimised. Three themes emerged, i.e. impact of COPD on amount of physical activity, symptoms experienced during physical activity, and adaptations made to facilitate physical activity. The themes were similar irrespective of country, demographic or disease characteristics. Iterative rounds of appraisal and refinement of candidate items resulted in 30 items with a daily recall period and 34 items with a 7-day recall period. For the first time, our approach provides comprehensive insight on physical activity from the COPD patients’ perspective. The PROactive PRO instruments’ content validity represents the pivotal basis for empirically based item reduction and validation. PMID:25034563
1985-12-01
PECI program sets aside funds in the annual budget and makes them available to managers and personnel for a 10 %[ wide range of cost and labor- saving ... money to fund PECI projects of particular concern to the individual services. [Ref. 4:pp. 3-4] The OSD-sponsored projects include two programs that... money is used by Navy non-industrial funded activities. Although - the line item amount is approved by Congress, individual project approval is the
Development and Evaluation of the Quality of Life for Obesity Surgery (QOLOS) Questionnaire.
Müller, Astrid; Crosby, Ross D; Selle, Janine; Osterhus, Alexandra; Köhler, Hinrich; Mall, Julian W; Meyer, Thorsten; de Zwaan, Martina
2018-02-01
Even though health-related quality of life (HRQOL) is considered an important component of bariatric surgery outcome, there is a lack of HRQOL measures relevant for preoperative and postoperative patients. The objective of the current study was to develop a new instrument assessing HRQOL prior to and following bariatric surgery, entitled Quality of Life for Obesity Surgery (QOLOS) Questionnaire. Topics for the QOLOS were initially generated via open-ended interviews and focus groups with 19 postoperative bariatric surgery patients. Qualitative analysis resulted in 250 items, which were rated by patients (n = 101) and experts (n = 69) in terms of their importance. A total of 120 items were retained for further evaluation and administered to 220 preoperative patients and 219 postoperative patients. They also completed a battery of other assessments to analyze issues of construct validity. Analyses resulted in a 36-item section 1 QOLOS form targeting both preoperative and postoperative aspects across seven domains (eating disturbances, physical functioning, body satisfaction, family support, social discrimination, positive activities, partnership) and a 20-item section 2 QOLOS form focusing on postoperative concerns only (domains: excess skin, eating adjustment, dumping, satisfaction with surgery). Subscales of both sections showed acceptable to excellent internal consistency (Cronbach's α 0.72 to 0.95) and good convergent and discriminant validity. The QOLOS represents a reliable and valid instrument to assess HRQOL in preoperative and postoperative patients. Future studies should test the questionnaire in larger samples consisting of patients undergoing different types of surgery.
Sheldon, Signy; Levine, Brian
2015-12-01
During autobiographical memory retrieval, the medial temporal lobes (MTL) relate together multiple event elements, including object (within-item relations) and context (item-context relations) information, to create a cohesive memory. There is consistent support for a functional specialization within the MTL according to these relational processes, much of which comes from recognition memory experiments. In this study, we compared brain activation patterns associated with retrieving within-item relations (i.e., associating conceptual and sensory-perceptual object features) and item-context relations (i.e., spatial relations among objects) with respect to naturalistic autobiographical retrieval. We developed a novel paradigm that cued participants to retrieve information about past autobiographical events, non-episodic within-item relations, and non-episodic item-context relations with the perceptuomotor aspects of retrieval equated across these conditions. We used multivariate analysis techniques to extract common and distinct patterns of activity among these conditions within the MTL and across the whole brain, both in terms of spatial and temporal patterns of activity. The anterior MTL (perirhinal cortex and anterior hippocampus) was preferentially recruited for generating within-item relations later in retrieval whereas the posterior MTL (posterior parahippocampal cortex and posterior hippocampus) was preferentially recruited for generating item-context relations across the retrieval phase. These findings provide novel evidence for functional specialization within the MTL with respect to naturalistic memory retrieval. © 2015 Wiley Periodicals, Inc.
Crossley, Kay M; Macri, Erin M; Cowan, Sallie M; Collins, Natalie J; Roos, Ewa M
2017-03-03
Patellofemoral pain and osteoarthritis are prevalent and associated with substantial pain and functional impairments. Patient-reported outcome measures (PROMs) are recommended for research and clinical use, but no PROMs are specific for patellofemoral osteoarthritis, and existing PROMs for patellofemoral pain have methodological limitations. This study aimed to develop a new subscale of the Knee injury and Osteoarthritis Outcome Score for patellofemoral pain and osteoarthritis (KOOS-PF), and evaluate its measurement properties. Items were generated using input from 50 patients with patellofemoral pain and/or osteoarthritis and 14 health and medical clinicians. Item reduction was performed using data from patellofemoral cohorts (n=138). We used the COnsesus-based Standards for the selection of health Measurements INstruments guidelines to evaluate reliability, validity, responsiveness and interpretability of the final version of KOOS-PF and other KOOS subscales. From an initial 80 generated items, the final subscale included 11 items. KOOS-PF items loaded predominantly on one factor, pain during activities that load the patellofemoral joint. KOOS-PF had good internal consistency (Cronbach's α 0.86) and adequate test-retest reliability (intraclass correlation coefficient 0.86). Hypothesis testing supported convergent, divergent and known-groups validity. Responsiveness was confirmed, with KOOS-PF demonstrating a moderate correlation with Global Rating of Change scores (r 0.52) and large effect size (Cohen's d 0.89). Minimal detectable change was 2.3 (groups) and 16 (individuals), while minimal important change was 16.4. There were no floor or ceiling effects. The 11-item KOOS-PF, developed in consultation with patients and clinicians, demonstrated adequate measurement properties, and is recommended for clinical and research use in patients with patellofemoral pain and osteoarthritis. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Choi, Bongsam
2018-01-01
[Purpose] This study aimed to cross-cultural adapt and validate the Korean version of an physical activity measure (K-PAM) for community-dwelling elderly. [Subjects and Methods] One hundred and thirty eight community-dwelling elderlies, 32 males and 106 female, participated in the study. All participants were asked to fill out a fifty-one item questionnaire measuring perceived difficulty in the activities of daily living (ADL) for the elderly. One-parameter model of item response theory (Rasch analysis) was applied to determine the construct validity and to inspect item-level psychometric properties of 51 ADL items of the K-PAM. [Results] Person separation reliability (analogous to Cronbach's alpha) for internal consistency was ranging 0.93 to 0.94. A total of 16 items was misfit to the Rasch model. After misfit item deletion, 35 ADL items of the K-PAM were placed in an empirically meaningful hierarchy from easy to hard. The item-person map analysis delineated that the item difficulty was well matched for the elderlies with moderate and low ability except for high ceilings. [Conclusion] Cross-cultural adapted K-PAM was shown to be sufficient for establishing construct validity and stable psychometric properties confirmed by person separation reliability and fit statistics.
Lima-Serrano, Marta; Lima-Rodríguez, Joaquín Salvador; Sáez-Bueno, Africa
2012-01-01
Different authors suggest that attitude is a mediator in behavior change, so it is a predictor of behavior practice. The main of this study was to design and to validate two scales for measure adolescent attitude toward healthy eating and adolescent attitude toward healthy physical activity. Scales were design based on a literature review. After, they were validated using an on-line Delphi Panel with eighteen experts, a pretest, and a pilot test with a sample of 188 high school students. Comprehensibility, content validity, adequacy, as well as the reliability (alpha of Cronbach test), and construct validity (exploratory factor analysis) of scales were tested. Scales validated by experts were considered appropriate in the pretest. In the pilot test, the ten-item Attitude to Eating Scale obtained α=0.72. The eight-item Attitude to Physical Activity Scale obtained α=0.86. They showed evidence of one-dimensional interpretation after factor analysis, a) all items got weights r>0.30 in first factor before rotations, b) the first factor explained a significant proportion of variance before rotations, and c) the total variance explained by the main factors extracted was greater than 50%. The Scales showed their reliability and validity. They could be employed to assess attitude to these priority intervention areas in Spanish adolescents, and to evaluate this intermediate result of health interventions and health programs.
Evidence against global attention filters selective for absolute bar-orientation in human vision.
Inverso, Matthew; Sun, Peng; Chubb, Charles; Wright, Charles E; Sperling, George
2016-01-01
The finding that an item of type A pops out from an array of distractors of type B typically is taken to support the inference that human vision contains a neural mechanism that is activated by items of type A but not by items of type B. Such a mechanism might be expected to yield a neural image in which items of type A produce high activation and items of type B low (or zero) activation. Access to such a neural image might further be expected to enable accurate estimation of the centroid of an ensemble of items of type A intermixed with to-be-ignored items of type B. Here, it is shown that as the number of items in stimulus displays is increased, performance in estimating the centroids of horizontal (vertical) items amid vertical (horizontal) distractors degrades much more quickly and dramatically than does performance in estimating the centroids of white (black) items among black (white) distractors. Together with previous findings, these results suggest that, although human vision does possess bottom-up neural mechanisms sensitive to abrupt local changes in bar-orientation, and although human vision does possess and utilize top-down global attention filters capable of selecting multiple items of one brightness or of one color from among others, it cannot use a top-down global attention filter capable of selecting multiple bars of a given absolute orientation and filtering bars of the opposite orientation in a centroid task.
Code of Federal Regulations, 2010 CFR
2010-10-01
...— (1) Before purchasing an item of supply listed in the FPI Schedule, conduct market research to... item to supplies available from the private sector; (3) If the FPI item is comparable, purchase the... for award in accordance with the item description or specifications, and evaluation factors in the...
Item Analysis in Introductory Economics Testing.
ERIC Educational Resources Information Center
Tinari, Frank D.
1979-01-01
Computerized analysis of multiple choice test items is explained. Examples of item analysis applications in the introductory economics course are discussed with respect to three objectives: to evaluate learning; to improve test items; and to help improve classroom instruction. Problems, costs and benefits of the procedures are identified. (JMD)
Phillips, Steven; Niki, Kazuhisa
2002-10-01
Working memory is affected by items stored and the relations between them. However, separating these factors has been difficult, because increased items usually accompany increased associations/relations. Hence, some have argued, relational effects are reducible to item effects. We overcome this problem by manipulating index length: the fewest number of item positions at which there is a unique item, or tuple of items (if length >1), for every instance in the relational (memory) set. Longer indexes imply greater similarity (number of shared items) between instances and higher load on encoding processes. Subjects were given lists of study pairs and asked to make a recognition judgement. The number of unique items and index length in the three list conditions were: (1) AB, CD: four/one; (2) AB, CD, EF: six/one; and (3) AB, AD, CB: four/two, respectively. Japanese letters were used in Experiments 1 (kanji-ideograms) and 2 (hiragana-phonograms); numbers in Experiment 3; and shapes generated from Fourier descriptors in Experiment 4. Across all materials, right dominant temporoparietal and middle frontal gyral activity was found with increased index length, but not items during study. In Experiment 5, a longer delay was used to isolate retention effects in the absence of visual stimuli. Increased left hemispheric activity was observed in the precuneus, middle frontal gyrus, and superior temporal gyrus with increased index length for the delay period. These results show that relational load is not reducible to item load.
ERIC Educational Resources Information Center
Tian, Wei; Cai, Li; Thissen, David; Xin, Tao
2013-01-01
In item response theory (IRT) modeling, the item parameter error covariance matrix plays a critical role in statistical inference procedures. When item parameters are estimated using the EM algorithm, the parameter error covariance matrix is not an automatic by-product of item calibration. Cai proposed the use of Supplemented EM algorithm for…
Muratov, Sergei; Podbielski, Dominik W; Kennedy, Kevin; Jack, Susan M; Pemberton, Julia; Ahmed, Iqbal Ike K; Baltaziak, Monika; Xie, Feng
2018-05-12
To develop a descriptive system for a glaucoma-specific preference-based health-related quality of life (HRQoL) instrument: the Health Utility for Glaucoma (HUG-5). The descriptive system was developed in two stages: item identification and item selection. A systematic literature review of HRQoL assessment of glaucoma was conducted using a comprehensive search strategy. Purposeful sampling was used to recruit patients with different clinical characteristics. Relevant items were presented to glaucoma patients through face-to-face, semi-structured interviews. Framework methodology was applied to analyze interview content. The recurring themes identified through an iterative content analysis represented topics of most importance and relevance to patients. These themes formed the domains of the HUG-5 descriptive system. Three versions of the descriptive system, differing in explanatory detail, were pilot tested using a focus group. The literature review identified 19 articles which contained 266 items. These items were included for the full text review and were used to develop an interview guide. From twelve patient interviews, 22 themes were identified and grouped into five domains that informed the five questions of the descriptive system. The HUG-5 measures visual discomfort, mobility, daily life activities, emotional well-being, and social activities. Each question has five response levels that range from "no problem" to "severe problem". The focus group comprised of seven additional patients unanimously preferred the version that contained detailed, specific examples to support each question. A 5-domain descriptive system of a glaucoma-specific preference-based instrument, the HUG-5, was developed and remains to be evaluated for validity and reliability in the glaucoma patient population.
Reeve, Bryce B.; Mitchell, Sandra A.; Clauser, Steven B.; Minasian, Lori M.; Dueck, Amylou C.; Mendoza, Tito R.; Hay, Jennifer; Atkinson, Thomas M.; Abernethy, Amy P.; Bruner, Deborah W.; Cleeland, Charles S.; Sloan, Jeff A.; Chilukuri, Ram; Baumgartner, Paul; Denicoff, Andrea; St. Germain, Diane; O’Mara, Ann M.; Chen, Alice; Kelaghan, Joseph; Bennett, Antonia V.; Sit, Laura; Rogak, Lauren; Barz, Allison; Paul, Diane B.; Schrag, Deborah
2014-01-01
The standard approach for documenting symptomatic adverse events (AEs) in cancer clinical trials involves investigator reporting using the National Cancer Institute’s (NCI’s) Common Terminology Criteria for Adverse Events (CTCAE). Because this approach underdetects symptomatic AEs, the NCI issued two contracts to create a patient-reported outcome (PRO) measurement system as a companion to the CTCAE, called the PRO-CTCAE. This Commentary describes development of the PRO-CTCAE by a group of multidisciplinary investigators and patient representatives and provides an overview of qualitative and quantitative studies of its measurement properties. A systematic evaluation of all 790 AEs listed in the CTCAE identified 78 appropriate for patient self-reporting. For each of these, a PRO-CTCAE plain language term in English and one to three items characterizing the frequency, severity, and/or activity interference of the AE were created, rendering a library of 124 PRO-CTCAE items. These items were refined in a cognitive interviewing study among patients on active cancer treatment with diverse educational, racial, and geographic backgrounds. Favorable measurement properties of the items, including construct validity, reliability, responsiveness, and between-mode equivalence, were determined prospectively in a demographically diverse population of patients receiving treatments for many different tumor types. A software platform was built to administer PRO-CTCAE items to clinical trial participants via the internet or telephone interactive voice response and was refined through usability testing. Work is ongoing to translate the PRO-CTCAE into multiple languages and to determine the optimal approach for integrating the PRO-CTCAE into clinical trial workflow and AE analyses. It is envisioned that the PRO-CTCAE will enhance the precision and patient-centeredness of adverse event reporting in cancer clinical research. PMID:25265940
ERIC Educational Resources Information Center
Festini, Sara B.; Reuter-Lorenz, Patricia A.
2017-01-01
Directed forgetting tasks instruct people to forget targeted memoranda. In the context of working memory, people attempt to forget representations that are currently held in mind. Here, we evaluated candidate mechanisms of directed forgetting within working memory, by (a) testing the influence of articulatory suppression, a rehearsal-reducing and…
Gambling-Related Cognition Scale (GRCS): Are skills-based games at a disadvantage?
Lévesque, David; Sévigny, Serge; Giroux, Isabelle; Jacques, Christian
2017-09-01
The Gambling-Related Cognition Scale (GRCS; Raylu & Oei, 2004) was developed to evaluate gambling-related cognitive distortions for all types of gamblers, regardless of their gambling activities (poker, slot machine, etc.). It is therefore imperative to ascertain the validity of its interpretation across different types of gamblers; however, some skills-related items endorsed by players could be interpreted as a cognitive distortion despite the fact that they play skills-related games. Using an intergroup (168 poker players and 73 video lottery terminal [VLT] players) differential item functioning (DIF) analysis, this study examined the possible manifestation of item biases associated with the GRCS. DIF was analyzed with ordinal logistic regressions (OLRs) and Ramsay's (1991) nonparametric kernel smoothing approach with TestGraf. Results show that half of the items display at least moderate DIF between groups and, depending on the type of analysis used, 3 to 7 items displayed large DIF. The 5 items with the most DIF were more significantly endorsed by poker players (uniform DIF) and were all related to skills, knowledge, learning, or probabilities. Poker players' interpretations of some skills-related items may lead to an overestimation of their cognitive distortions due to their total score increased by measurement artifact. Findings indicate that the current structure of the GRCS contains potential biases to be considered when poker players are surveyed. The present study conveys new and important information on bias issues to ponder carefully before using and interpreting the GRCS and other similar wide-range instruments with poker players. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Suzukamo, Yoshimi; Oshika, Tetsuro; Yuzawa, Mitsuko; Tokuda, Yoshihiro; Tomidokoro, Atsuo; Oki, Kotaro; Mangione, Carol M; Green, Joseph; Fukuhara, Shunichi
2005-10-26
The importance of evaluating the outcomes of health care from the standpoint of the patient is now widely recognized. The purpose of this study is to develop and test a Japanese version of the National Eye Institute Visual Function Questionnaire (NEI VFQ-25). A Japanese version was developed with a previously standardized method. The questionnaire and optional items were completed by 245 patients with cataracts, glaucoma, or age-related macular degeneration, by 110 others before and after cataract surgery, and by a reference group (n = 31). We computed rates of missing data, measured reproducibility and internal consistency reliability, and tested for convergent and discriminant validity, concurrent validity, known-groups validity, factor structure, and responsiveness to change. Based on information from the participants, some items were changed to 2-step items (asking if an activity was done, and if it was done, then asking how difficult it was). The near-vision and distance-vision subscales each had 1 item that was endorsed by very few participants, so these items were replaced with items that were optional in the English version. For example, more than 60% of participants did not drive, so the driving question was excluded. Reliability and validity were adequate for all subscales except driving, ocular pain, color vision, and peripheral vision. With cataract surgery, most scores improved by at least 20 points. With minor modifications from the English version, the Japanese NEI VFQ-25 can give reliable, valid, responsive data on vision-related quality of life, for group-level comparisons or for tracking therapeutic outcomes.
Development of Logistics for Building Radiation Storm Shelters and Their Operational Evaluation
NASA Technical Reports Server (NTRS)
Cerro, Jeffrey A.
2015-01-01
Over the past three years NASA has been studying the operational effectiveness and astronaut protection efficacy of numerous radiation protection shelters for use in space exploration activities outside of earth's magnetosphere. The work presented was part of NASA's Advanced Exploration Systems (AES) RadWorks Storm Shelter project. This paper is a summary of the concept development activities of this third year. Fabricated items were integrated into mock up deep space habitat vehicle sections for operational evaluations. Two full scale human-in-loop simulations were designed, fabricated, and implemented through an Institutional Review Board approved solicited participant assessment process. Fabricated items are described, along with usage scenarios of two protection approaches. Existing ISS type logistics along with proposed variations of those logistics were used. Preliminary Discrete Event Simulation (DES) work is noted to be useful in quantifying and documenting operational performance measures for the two primary shelter methods, including some characterization of radiation dose accumulation over a mission timeline. The project also performed correlation analyses between effective radiation dose and the Risk of Exposure Induced Death (REID) to show that concept level work may be able to include such a performance metric in early stages of mission scenario habitat design trade space investigation.
Kramer, Jessica M; Schwartz, Ariel
2017-10-01
This study examined the item interpretability and rating scale use of the Pediatric Evaluation of Disability Inventory-Patient-Reported Outcome (PEDI-PRO) by young people with developmental disabilities. The PEDI-PRO assesses the functional performance of discrete functional tasks in the context of everyday life situations. A two-phase cognitive interview design was implemented with a convenience sample of 37 young people (mean age 19y, SD 2y 5mo; 13 males and 24 females; 68% with intellectual disability) with developmental disabilities. In phase I, 182 item candidates were each reviewed by an average of four young people. In phase II, 103 items were carried forward or revised and each reviewed by an average of seven additional young people. Two raters coded responses for intended item interpretation and performance quality; codes were analysed using descriptive statistics. Qualitative analysis explored young people's self-evaluation process. Items were interpreted as intended by most young people (mean 86%). Young people can use PEDI-PRO response categories appropriately to describe their performance: 94% of positive performance descriptions coincided with a positive response category choice; 73% of negative descriptions coincided with a negative response category choice. Young people interpreted items in a literal manner, and their self-evaluation incorporated the use of supports that facilitate functional performance. The PEDI-PRO's measurement framework appears to support the self-evaluation of functional performance of young people with developmental disabilities. © 2017 Mac Keith Press.
West, Courtney; Landry, Karen; Graham, Anna; Graham, Lori; Cianciolo, Anna T; Kalet, Adina; Rosen, Michael; Sherman, Deborah Witt
2015-01-01
SGEA 2015 CONFERENCE ABSTRACT (EDITED). Evaluating Interprofessional Teamwork During a Large-Scale Simulation. Courtney West, Karen Landry, Anna Graham, and Lori Graham. CONSTRUCT: This study investigated the multidimensional measurement of interprofessional (IPE) teamwork as part of large-scale simulation training. Healthcare team function has a direct impact on patient safety and quality of care. However, IPE team training has not been the norm. Recognizing the importance of developing team-based collaborative care, our College of Nursing implemented an IPE simulation activity called Disaster Day and invited other professions to participate. The exercise consists of two sessions: one in the morning and another in the afternoon. The disaster scenario is announced just prior to each session, which consists of team building, a 90-minute simulation, and debriefing. Approximately 300 Nursing, Medicine, Pharmacy, Emergency Medical Technicians, and Radiology students and over 500 standardized and volunteer patients participated in the Disaster Day event. To improve student learning outcomes, we created 3 competency-based instruments to evaluate collaborative practice in multidimensional fashion during this exercise. A 20-item IPE Team Observation Instrument designed to assess interprofessional team's attainment of Interprofessional Education Collaborative (IPEC) competencies was completed by 20 faculty and staff observing the Disaster Day simulation. One hundred sixty-six standardized patients completed a 10-item Standardized Patient IPE Team Evaluation Instrument developed from the IPEC competencies and adapted items from the 2014 Henry et al. PIVOT Questionnaire. This instrument assessed the standardized or volunteer patient's perception of the team's collaborative performance. A 29-item IPE Team's Perception of Collaborative Care Questionnaire, also created from the IPEC competencies and divided into 5 categories of Values/Ethics, Roles and Responsibilities, Communication, Teamwork, and Self-Evaluation, was completed by 188 students including 99 from Nursing, 43 from Medicine, 6 from Pharmacy, and 40 participants who belonged to more than one component, were students at another institution, or did not indicate their institution. The team instrument was designed to assess each team member's perception of how well the team and him- or herself met the competencies. Five of the items on the team perceptions questionnaire mirrored items on the standardized patient evaluation: demonstrated leadership practices that led to effective teamwork, discussed care and decisions about that care with patient, described roles and responsibilities clearly, worked well together to coordinate care, and good/effective communication. Internal consistency reliability of the IPE Team Observation Instrument was 0.80. In 18 of the 20 items, more than 50% of observers indicated the item was demonstrated. Of those, 6 of the items were observed by 50% to 75% of the observers, and the remaining 12 were observed by more than 80% of the observers. Internal consistency reliability of the IPE Team's Perception of Collaborative Care Instrument was 0.95. The mean response score-1 (strongly disagree) to 4 (strongly agree)-was calculated for each section of the instrument. The overall mean score was 3.57 (SD = .11). Internal consistency reliability of the Standardized Patient IPE Team Evaluation Instrument was 0.87. The overall mean score was 3.28 (SD = .17). The ratings for the 5 items shared by the standardized patient and team perception instruments were compared using independent sample t tests. Statistically significant differences (p < .05) were present in each case, with the students rating themselves higher on average than the standardized patients did (mean differences between 0.2 and 0.6 on a scale of 1-4). Multidimensional, competency-based instruments appear to provide a robust view of IPE teamwork; however, challenges remain. Due to the large scale of the simulation exercise, observation-based assessment did not function as well as self- and standardized patient-based assessment. To promote greater variation in observer assessments during future Disaster Day simulations, we plan to adjust the rating scale from "not observed," "observed," and "not applicable" to a 4-point scale and reexamine interrater reliability.
Using the Entrustable Professional Activities Framework in the Assessment of Procedural Skills.
Pugh, Debra; Cavalcanti, Rodrigo B; Halman, Samantha; Ma, Irene W Y; Mylopoulos, Maria; Shanks, David; Stroud, Lynfa
2017-04-01
The entrustable professional activity (EPA) framework has been identified as a useful approach to assessment in competency-based education. To apply an EPA framework for assessment, essential skills necessary for entrustment to occur must first be identified. Using an EPA framework, our study sought to (1) define the essential skills required for entrustment for 7 bedside procedures expected of graduates of Canadian internal medicine (IM) residency programs, and (2) develop rubrics for the assessment of these procedural skills. An initial list of essential skills was defined for each procedural EPA by focus groups of experts at 4 academic centers using the nominal group technique. These lists were subsequently vetted by representatives from all Canadian IM training programs through a web-based survey. Consensus (more than 80% agreement) about inclusion of each item was sought using a modified Delphi exercise. Qualitative survey data were analyzed using a framework approach to inform final assessment rubrics for each procedure. Initial lists of essential skills for procedural EPAs ranged from 10 to 24 items. A total of 111 experts completed the national survey. After 2 iterations, consensus was reached on all items. Following qualitative analysis, final rubrics were created, which included 6 to 10 items per procedure. These EPA-based assessment rubrics represent a national consensus by Canadian IM clinician educators. They provide a practical guide for the assessment of procedural skills in a competency-based education model, and a robust foundation for future research on their implementation and evaluation.
Ortiz, Glorimar; Schacht, Lucille
2012-01-01
Measurement of consumers' satisfaction in psychiatric settings is important because it has been correlated with improved clinical outcomes and administrative measures of high-quality care. These consumer satisfaction measurements are actively used as performance measures required by the accreditation process and for quality improvement activities. Our objectives were (i) to re-evaluate, through exploratory factor analysis (EFA) and confirmatory factor analysis (CFA), the structure of an instrument intended to measure consumers' satisfaction with care in psychiatric settings and (ii) to examine and publish the psychometric characteristics, validity and reliability, of the Inpatient Consumer Survey (ICS). To psychometrically test the structure of the ICS, 34 878 survey results, submitted by 90 psychiatric hospitals in 2008, were extracted from the Behavioral Healthcare Performance Measurement System (BHPMS). Basic descriptive item-response and correlation analyses were performed for total surveys. Two datasets were randomly created for analysis. A random sample of 8229 survey results was used for EFA. Another random sample of 8261 consumer survey results was used for CFA. This same sample was used to perform validity and reliability analyses. The item-response analysis showed that the mean range for a disagree/agree five-point scale was 3.10-3.94. Correlation analysis showed a strong relationship between items. Six domains (dignity, rights, environment, empowerment, participation, and outcome) with internal reliabilities between good to moderate (0.87-0.73) were shown to be related to overall care satisfaction. Overall reliability for the instrument was excellent (0.94). Results from CFA provided support for the domains structure of the ICS proposed through EFA. The overall findings from this study provide evidence that the ICS is a reliable measure of consumer satisfaction in psychiatric inpatient settings. The analysis has shown the ICS to provide valid and reliable results and to focus on the specific concerns of consumers of psychiatric inpatient care. Scores by item indicate that opportunity for improvement exists across healthcare organizations.
McDuff, Susan G. R.; Frankel, Hillary C.; Norman, Kenneth A.
2009-01-01
We used multi-voxel pattern analysis (MVPA) of fMRI data to gain insight into how subjects’ retrieval agendas influence source memory judgments (was item X studied using source Y?). In Experiment 1, we used a single-agenda test where subjects judged whether items were studied with the targeted source or not. In Experiment 2, we used a multi-agenda test where subjects judged whether items were studied using the targeted source, studied using a different source, or nonstudied. To evaluate the differences between single- and multi-agenda source monitoring, we trained a classifier to detect source-specific fMRI activity at study, and then we applied the classifier to data from the test phase. We focused on trials where the targeted source and the actual source differed, so we could use MVPA to track neural activity associated with both the targeted source and the actual source. Our results indicate that single-agenda monitoring was associated with increased focus on the targeted source (as evidenced by increased targeted-source activity, relative to baseline) and reduced use of information relating to the actual, non-target source. In the multi-agenda experiment, high-levels of actual-source activity were associated with increased correct rejections, suggesting that subjects were using recollection of actual-source information to avoid source memory errors. In the single-agenda experiment, there were comparable levels of actual-source activity (suggesting that recollection was taking place), but the relationship between actual-source activity and behavior was absent (suggesting that subjects were failing to make proper use of this information). PMID:19144851
41 CFR 101-30.302 - Types of items excluded from cataloging.
Code of Federal Regulations, 2014 CFR
2014-07-01
... Catalog System except when an agency determines that Federal item identification data will be of value in...-FEDERAL CATALOG SYSTEM 30.3-Cataloging Items of Supply § 101-30.302 Types of items excluded from...) Items procured in foreign markets for use in overseas activities of Federal agencies. (e) Printed forms...
41 CFR 101-30.302 - Types of items excluded from cataloging.
Code of Federal Regulations, 2012 CFR
2012-07-01
... Catalog System except when an agency determines that Federal item identification data will be of value in...-FEDERAL CATALOG SYSTEM 30.3-Cataloging Items of Supply § 101-30.302 Types of items excluded from...) Items procured in foreign markets for use in overseas activities of Federal agencies. (e) Printed forms...
41 CFR 101-30.302 - Types of items excluded from cataloging.
Code of Federal Regulations, 2011 CFR
2011-07-01
... Catalog System except when an agency determines that Federal item identification data will be of value in...-FEDERAL CATALOG SYSTEM 30.3-Cataloging Items of Supply § 101-30.302 Types of items excluded from...) Items procured in foreign markets for use in overseas activities of Federal agencies. (e) Printed forms...
41 CFR 101-30.302 - Types of items excluded from cataloging.
Code of Federal Regulations, 2010 CFR
2010-07-01
... Catalog System except when an agency determines that Federal item identification data will be of value in...-FEDERAL CATALOG SYSTEM 30.3-Cataloging Items of Supply § 101-30.302 Types of items excluded from...) Items procured in foreign markets for use in overseas activities of Federal agencies. (e) Printed forms...
41 CFR 101-30.302 - Types of items excluded from cataloging.
Code of Federal Regulations, 2013 CFR
2013-07-01
... Catalog System except when an agency determines that Federal item identification data will be of value in...-FEDERAL CATALOG SYSTEM 30.3-Cataloging Items of Supply § 101-30.302 Types of items excluded from...) Items procured in foreign markets for use in overseas activities of Federal agencies. (e) Printed forms...
Tepe, Rodger; Tepe, Chabha
2015-03-01
To develop and psychometrically evaluate an information literacy (IL) self-efficacy survey and an IL knowledge test. In this test-retest reliability study, a 25-item IL self-efficacy survey and a 50-item IL knowledge test were developed and administered to a convenience sample of 53 chiropractic students. Item analyses were performed on all questions. The IL self-efficacy survey demonstrated good reliability (test-retest correlation = 0.81) and good/very good internal consistency (mean κ = .56 and Cronbach's α = .92). A total of 25 questions with the best item analysis characteristics were chosen from the 50-item IL knowledge test, resulting in a 25-item IL knowledge test that demonstrated good reliability (test-retest correlation = 0.87), very good internal consistency (mean κ = .69, KR20 = 0.85), and good item discrimination (mean point-biserial = 0.48). This study resulted in the development of three instruments: a 25-item IL self-efficacy survey, a 50-item IL knowledge test, and a 25-item IL knowledge test. The information literacy self-efficacy survey and the 25-item version of the information literacy knowledge test have shown preliminary evidence of adequate reliability and validity to justify continuing study with these instruments.
Tepe, Rodger; Tepe, Chabha
2015-01-01
Objective To develop and psychometrically evaluate an information literacy (IL) self-efficacy survey and an IL knowledge test. Methods In this test–retest reliability study, a 25-item IL self-efficacy survey and a 50-item IL knowledge test were developed and administered to a convenience sample of 53 chiropractic students. Item analyses were performed on all questions. Results The IL self-efficacy survey demonstrated good reliability (test–retest correlation = 0.81) and good/very good internal consistency (mean κ = .56 and Cronbach's α = .92). A total of 25 questions with the best item analysis characteristics were chosen from the 50-item IL knowledge test, resulting in a 25-item IL knowledge test that demonstrated good reliability (test–retest correlation = 0.87), very good internal consistency (mean κ = .69, KR20 = 0.85), and good item discrimination (mean point-biserial = 0.48). Conclusions This study resulted in the development of three instruments: a 25-item IL self-efficacy survey, a 50-item IL knowledge test, and a 25-item IL knowledge test. The information literacy self-efficacy survey and the 25-item version of the information literacy knowledge test have shown preliminary evidence of adequate reliability and validity to justify continuing study with these instruments. PMID:25517736
Development and initial evaluation of the SCI-FI/AT
Jette, Alan M.; Slavin, Mary D.; Ni, Pengsheng; Kisala, Pamela A.; Tulsky, David S.; Heinemann, Allen W.; Charlifue, Susie; Tate, Denise G.; Fyffe, Denise; Morse, Leslie; Marino, Ralph; Smith, Ian; Williams, Steve
2015-01-01
Objectives To describe the domain structure and calibration of the Spinal Cord Injury Functional Index for samples using Assistive Technology (SCI-FI/AT) and report the initial psychometric properties of each domain. Design Cross sectional survey followed by computerized adaptive test (CAT) simulations. Setting Inpatient and community settings. Participants A sample of 460 adults with traumatic spinal cord injury (SCI) stratified by level of injury, completeness of injury, and time since injury. Interventions None Main outcome measure SCI-FI/AT Results Confirmatory factor analysis (CFA) and Item response theory (IRT) analyses identified 4 unidimensional SCI-FI/AT domains: Basic Mobility (41 items) Self-care (71 items), Fine Motor Function (35 items), and Ambulation (29 items). High correlations of full item banks with 10-item simulated CATs indicated high accuracy of each CAT in estimating a person's function, and there was high measurement reliability for the simulated CAT scales compared with the full item bank. SCI-FI/AT item difficulties in the domains of Self-care, Fine Motor Function, and Ambulation were less difficult than the same items in the original SCI-FI item banks. Conclusion With the development of the SCI-FI/AT, clinicians and investigators have available multidimensional assessment scales that evaluate function for users of AT to complement the scales available in the original SCI-FI. PMID:26010975
Development and initial evaluation of the SCI-FI/AT.
Jette, Alan M; Slavin, Mary D; Ni, Pengsheng; Kisala, Pamela A; Tulsky, David S; Heinemann, Allen W; Charlifue, Susie; Tate, Denise G; Fyffe, Denise; Morse, Leslie; Marino, Ralph; Smith, Ian; Williams, Steve
2015-05-01
To describe the domain structure and calibration of the Spinal Cord Injury Functional Index for samples using Assistive Technology (SCI-FI/AT) and report the initial psychometric properties of each domain. Cross sectional survey followed by computerized adaptive test (CAT) simulations. Inpatient and community settings. A sample of 460 adults with traumatic spinal cord injury (SCI) stratified by level of injury, completeness of injury, and time since injury. None SCI-FI/AT RESULTS: Confirmatory factor analysis (CFA) and Item response theory (IRT) analyses identified 4 unidimensional SCI-FI/AT domains: Basic Mobility (41 items) Self-care (71 items), Fine Motor Function (35 items), and Ambulation (29 items). High correlations of full item banks with 10-item simulated CATs indicated high accuracy of each CAT in estimating a person's function, and there was high measurement reliability for the simulated CAT scales compared with the full item bank. SCI-FI/AT item difficulties in the domains of Self-care, Fine Motor Function, and Ambulation were less difficult than the same items in the original SCI-FI item banks. With the development of the SCI-FI/AT, clinicians and investigators have available multidimensional assessment scales that evaluate function for users of AT to complement the scales available in the original SCI-FI.
Remijn, L; Speyer, R; Groen, B E; Holtus, P C M; van Limbeek, J; Nijhuis-van der Sanden, M W G
2013-05-01
The aim of this study was to develop the Mastication Observation and Evaluation instrument for observing and assessing the chewing ability of children eating solid and lumpy foods. This study describes the process of item definition and item selection and reports the content validity, reproducibility and consistency of the instrument. In the developmental phase, 15 experienced speech therapists assessed item relevance and descriptions over three Delphi rounds. Potential items were selected based on the results from a literature review. At the initial Delphi round, 17 potential items were included. After three Delphi rounds, 14 items that regarded as providing distinctive value in assessment of mastication (consensus >75%) were included in the Mastication Observation and Evaluation instrument. To test item reproducibility and consistency, two experts and five students evaluated video recordings of 20 children (10 children with cerebral palsy aged 29-65 months and 10 healthy children aged 11-42 months) eating bread and a biscuit. Reproducibility was estimated by means of the intraclass correlation coefficient (ICC). With the exception of one item concerning chewing duration, all items showed good to excellent intra-observer agreement (ICC students: 0.73-1.0). With the exception of chewing duration and number of swallows, inter-observer agreement was fair to excellent for all items (ICC experts: 0.68-1.0 and ICC students: 0.42-1.0). Results indicate that this tool is a feasible instrument and could be used in clinical practice after further research is completed on the reliability of the tool. © 2013 Blackwell Publishing Ltd.
Development of an instrument for the evaluation of advanced life support performance.
Peltonen, L-M; Peltonen, V; Salanterä, S; Tommila, M
2017-10-01
Assessing advanced life support (ALS) competence requires validated instruments. Existing instruments include aspects of technical skills (TS), non-technical skills (NTS) or both, but one instrument for detailed assessment that suits all resuscitation situations is lacking. This study aimed to develop an instrument for the evaluation of the overall ALS performance of the whole team. This instrument development study had four phases. First, we reviewed literature and resuscitation guidelines to explore items to include in the instrument. Thereafter, we interviewed resuscitation team professionals (n = 66), using the critical incident technique, to determine possible additional aspects associated with the performance of ALS. Second, we developed an instrument based on the findings. Third, we used an expert panel (n = 20) to assess the validity of the developed instrument. Finally, we revised the instrument based on the experts' comments and tested it with six experts who evaluated 22 video recorded resuscitations. The final version of the developed instrument had 69 items divided into adherence to guidelines (28 items), clinical decision-making (5 items), workload management (12 items), team behaviour (8 items), information management (6 items), patient integrity and consideration of laymen (4 items) and work routines (6 items). The Cronbach's α values were good, and strong correlations between the overall performance and the instrument were observed. The instrument may be useful for detailed assessment of the team's overall performance, but the numerous items make the use demanding. The instrument is still under development, and more research is needed to determine its psychometric properties. © 2017 The Acta Anaesthesiologica Scandinavica Foundation. Published by John Wiley & Sons Ltd.
The special role of item-context associations in the direct-access region of working memory.
Campoy, Guillermo
2017-09-01
The three-embedded-component model of working memory (WM) distinguishes three representational states corresponding to three WM regions: activated long-term memory, direct-access region (DAR), and focus of attention. Recent neuroimaging research has revealed that access to the DAR is associated with enhanced hippocampal activity. Because the hippocampus mediates the encoding and retrieval of item-context associations, it has been suggested that this hippocampal activation is a consequence of the fact that item-context associations are particularly strong and accessible in the DAR. This study provides behavioral evidence for this view using an item-recognition task to assess the effect of non-intentional encoding and maintenance of item-location associations across WM regions. Five pictures of human faces were sequentially presented in different screen locations followed by a recognition probe. Visual cues immediately preceding the probe indicated the location thereof. When probe stimuli appeared in the same location that they had been presented within the memory set, the presentation of the cue was expected to elicit the activation of the corresponding WM representation through the just-established item-location association, resulting in faster recognition. Results showed this same-location effect, but only for items that, according to their serial position within the memory set, were held in the DAR.
Sol, Marleen Elisabeth; Verschuren, Olaf; de Groot, Laura; de Groot, Janke Frederike
2017-02-13
Wheelchair mobility skills (WMS) training is regarded by children using a manual wheelchair and their parents as an important factor to improve participation and daily physical activity. Currently, there is no outcome measure available for the evaluation of WMS in children. Several wheelchair mobility outcome measures have been developed for adults, but none of these have been validated in children. Therefore the objective of this study is to develop a WMS outcome measure for children using the current knowledge from literature in combination with the clinical expertise of health care professionals, children and their parents. Mixed methods approach. Phase 1: Item identification of WMS items through a systematic review using the 'COnsensus-based Standards for the selection of health Measurement Instruments' (COSMIN) recommendations. Phase 2: Item selection and validation of relevant WMS items for children, using a focus group and interviews with children using a manual wheelchair, their parents and health care professionals. Phase 3: Feasibility of the newly developed Utrecht Pediatric Wheelchair Mobility Skills Test (UP-WMST) through pilot testing. Phase 1: Data analysis and synthesis of nine WMS related outcome measures showed there is no widely used outcome measure with levels of evidence across all measurement properties. However, four outcome measures showed some levels of evidence on reliability and validity for adults. Twenty-two WMS items with the best clinimetric properties were selected for further analysis in phase 2. Phase 2: Fifteen items were deemed as relevant for children, one item needed adaptation and six items were considered not relevant for assessing WMS in children. Phase 3: Two health care professionals administered the UP-WMST in eight children. The instructions of the UP-WMST were clear, but the scoring method of the height difference items needed adaptation. The outdoor items for rolling over soft surface and the side slope item were excluded in the final version of the UP-WMST due to logistic reasons. The newly developed 15 item UP-WMST is a validated outcome measure which is easy to administer in children using a manual wheelchair. More research regarding reliability, construct validity and responsiveness is warranted before the UP-WMST can be used in practice.
RhinAsthma patient perspective: A Rasch validation study.
Molinengo, Giorgia; Baiardini, Ilaria; Braido, Fulvio; Loera, Barbara
2018-02-01
In daily practice, Health-Related Quality of Life (HRQoL) tools are useful for supplementing clinical data with the patient's perspective. To encourage their use by clinicians, the availability of tools that can quickly provide valid results is crucial. A new HRQoL tool has been proposed for patients with asthma and rhinitis: the RhinAsthma Patient Perspective-RAPP. The aim of this study was to evaluate the psychometric robustness of the RAPP using the Item Response Theory (IRT) approach, to evaluate the scalability of items and test whether or not patients use the items response scale correctly. 155 patients (53.5% women, mean age 39.1, range 16-76) were recruited during a multicenter study. RAPP metric properties were investigated using IRT models. Differential item functioning (DIF) was used for gender, age, and asthma control test (ACT). The RAPP adequately fitted the Rating Scale model, demonstrating the equality of the rating scale structure for all items. All statistics on items were satisfactory. The RAPP had adequate internal reliability and showed good ability to discriminate among different groups of participants. DIF analysis indicated that there were no differential item functioning issues for gender. One item showed a DIF by age and four items by ACT. The psychometric evaluation performed using IRT models demonstrated that the RAPP met all the criteria to be considered a reliable and valid method of measurement. From a clinical perspective, this will allow physicians to confidently interpret scores as good indicators of Quality of Life of patients with asthma.
Psychometric properties of a revised version of the Assisting Hand Assessment (Kids-AHA 5.0).
Holmefur, Marie M; Krumlinde-Sundholm, Lena
2016-06-01
The aim of this study was to scrutinize the Assisting Hand Assessment (AHA) version 4.4 for possible improvements and to evaluate the psychometric properties regarding internal scale validity and aspects of reliability of a revised version of the AHA. In collaboration with experts, scoring criteria were changed for four items, and one fully new item was constructed. Twenty-two original, one new, and four revised items were scored for 164 assessments of children with unilateral cerebral palsy aged 18 months to 12 years. Rasch measurement analysis was used to evaluate internal scale validity by exploring rating-scale functioning, item and person goodness-of-fit, and principal component analysis. Targeting and scale reliability were also evaluated. After removal of misfitting items, a 20-item scale showed satisfactory goodness-of-fit. Unidimensionality was confirmed by principal component analysis. The rating scale functioned well for the 20 items, and the item difficulty was well suited to the ability level of the sample. The person reliability coefficient was 0.98, indicating high separation ability of the scale. A conversion table of AHA scores between the previous version (4.4) and the new version (5.0) was constructed. The new, 20-item version of the Kids-AHA (version 5.0), demonstrated excellent internal scale validity, suggesting improved responsiveness to changes and shortened scoring time. For comparison of scores from version 4.4 to 5.0, a transformation table is presented. © 2015 Mac Keith Press.
Schünemann, Holger J; Wiercioch, Wojtek; Etxeandia, Itziar; Falavigna, Maicon; Santesso, Nancy; Mustafa, Reem; Ventresca, Matthew; Brignardello-Petersen, Romina; Laisaar, Kaja-Triin; Kowalski, Sérgio; Baldeh, Tejan; Zhang, Yuan; Raid, Ulla; Neumann, Ignacio; Norris, Susan L; Thornton, Judith; Harbour, Robin; Treweek, Shaun; Guyatt, Gordon; Alonso-Coello, Pablo; Reinap, Marge; Brozek, Jan; Oxman, Andrew; Akl, Elie A
2014-02-18
Although several tools to evaluate the credibility of health care guidelines exist, guidance on practical steps for developing guidelines is lacking. We systematically compiled a comprehensive checklist of items linked to relevant resources and tools that guideline developers could consider, without the expectation that every guideline would address each item. We searched data sources, including manuals of international guideline developers, literature on guidelines for guidelines (with a focus on methodology reports from international and national agencies, and professional societies) and recent articles providing systematic guidance. We reviewed these sources in duplicate, extracted items for the checklist using a sensitive approach and developed overarching topics relevant to guidelines. In an iterative process, we reviewed items for duplication and omissions and involved experts in guideline development for revisions and suggestions for items to be added. We developed a checklist with 18 topics and 146 items and a webpage to facilitate its use by guideline developers. The topics and included items cover all stages of the guideline enterprise, from the planning and formulation of guidelines, to their implementation and evaluation. The final checklist includes links to training materials as well as resources with suggested methodology for applying the items. The checklist will serve as a resource for guideline developers. Consideration of items on the checklist will support the development, implementation and evaluation of guidelines. We will use crowdsourcing to revise the checklist and keep it up to date.
Schünemann, Holger J.; Wiercioch, Wojtek; Etxeandia, Itziar; Falavigna, Maicon; Santesso, Nancy; Mustafa, Reem; Ventresca, Matthew; Brignardello-Petersen, Romina; Laisaar, Kaja-Triin; Kowalski, Sérgio; Baldeh, Tejan; Zhang, Yuan; Raid, Ulla; Neumann, Ignacio; Norris, Susan L.; Thornton, Judith; Harbour, Robin; Treweek, Shaun; Guyatt, Gordon; Alonso-Coello, Pablo; Reinap, Marge; Brožek, Jan; Oxman, Andrew; Akl, Elie A.
2014-01-01
Background: Although several tools to evaluate the credibility of health care guidelines exist, guidance on practical steps for developing guidelines is lacking. We systematically compiled a comprehensive checklist of items linked to relevant resources and tools that guideline developers could consider, without the expectation that every guideline would address each item. Methods: We searched data sources, including manuals of international guideline developers, literature on guidelines for guidelines (with a focus on methodology reports from international and national agencies, and professional societies) and recent articles providing systematic guidance. We reviewed these sources in duplicate, extracted items for the checklist using a sensitive approach and developed overarching topics relevant to guidelines. In an iterative process, we reviewed items for duplication and omissions and involved experts in guideline development for revisions and suggestions for items to be added. Results: We developed a checklist with 18 topics and 146 items and a webpage to facilitate its use by guideline developers. The topics and included items cover all stages of the guideline enterprise, from the planning and formulation of guidelines, to their implementation and evaluation. The final checklist includes links to training materials as well as resources with suggested methodology for applying the items. Interpretation: The checklist will serve as a resource for guideline developers. Consideration of items on the checklist will support the development, implementation and evaluation of guidelines. We will use crowdsourcing to revise the checklist and keep it up to date. PMID:24344144
"Up Means Good": The Effect of Screen Position on Evaluative Ratings in Web Surveys.
Tourangeau, Roger; Couper, Mick P; Conrad, Frederick G
2013-01-01
This paper presents results from six experiments that examine the effect of the position of an item on the screen on the evaluative ratings it receives. The experiments are based on the idea that respondents expect "good" things-those they view positively-to be higher up on the screen than "bad" things. The experiments use items on different topics (Congress and HMOs, a variety of foods, and six physician specialties) and different methods for varying their vertical position on the screen. A meta-analysis of all six experiments demonstrates a small but reliable effect of the item's screen position on mean ratings of the item; the ratings are significantly more positive when the item appears in a higher position on the screen than when it appears farther down. These results are consistent with the hypothesis that respondents follow the "Up means good" heuristic, using the vertical position of the item as a cue in evaluating it. Respondents seem to rely on heuristics both in interpreting response scales and in forming judgments.
Cacchio, Angelo; De Paulis, Fosco; Maffulli, Nicola
2014-03-01
There is a need for a patient-reported outcome (PRO) questionnaire to evaluate patients with proximal hamstring tendinopathy (PHT). To develop a PRO questionnaire based on VISA questionnaire forms for patients with PHT. Item generation, item reduction, item scaling and evaluation of the psychometric properties were used to develop a questionnaire to assess the severity of symptoms, function and ability to play sports in patients with PHT and healthy subjects. The final version, named Victorian Institute of Sport Assessment-Proximal Hamstring Tendons (VISA-H), consisted of eight questions that measured the domains of pain, function and sporting activity. The psychometric properties of a questionnaire were estimated in a population of non-surgical (n=20) and surgical (n=10) patients, as well as in healthy subjects (n=30). The VISA-H questionnaire displayed a high degree of internal consistency, with a Cronbach α of 0.84. (The test-retest reliability was high for all groups of participants with an intraclass correlation coefficient ranging from 0.90 to 0.95.) The VISA-H exhibited a high correlation with the Nirschl phase rating scale (r ranging from -0.75 to -0.89) and a generic tendon grading system proposed by Curwin and Stanish (r ranging from -0.70 to -0.88). Also, the responsiveness was higher for the VISA-H questionnaire with an area under the curve of 0.90 and a minimum clinically important difference of 22 points. The VISA-H is a PRO questionnaire with high psychometric properties for measuring pain, function and sporting activity in patients with PHT.
17 CFR 229.1205 - (Item 1205) Drilling and other exploratory and development activities.
Code of Federal Regulations, 2011 CFR
2011-04-01
... 17 Commodity and Securities Exchanges 2 2011-04-01 2011-04-01 false (Item 1205) Drilling and other... Registrants Engaged in Oil and Gas Producing Activities § 229.1205 (Item 1205) Drilling and other exploratory..., disclose: (1) The number of net productive and dry exploratory wells drilled; and (2) The number of net...
17 CFR 229.1205 - (Item 1205) Drilling and other exploratory and development activities.
Code of Federal Regulations, 2014 CFR
2014-04-01
... Registrants Engaged in Oil and Gas Producing Activities § 229.1205 (Item 1205) Drilling and other exploratory... 17 Commodity and Securities Exchanges 3 2014-04-01 2014-04-01 false (Item 1205) Drilling and other..., disclose: (1) The number of net productive and dry exploratory wells drilled; and (2) The number of net...
17 CFR 229.1205 - (Item 1205) Drilling and other exploratory and development activities.
Code of Federal Regulations, 2013 CFR
2013-04-01
... Registrants Engaged in Oil and Gas Producing Activities § 229.1205 (Item 1205) Drilling and other exploratory... 17 Commodity and Securities Exchanges 2 2013-04-01 2013-04-01 false (Item 1205) Drilling and other..., disclose: (1) The number of net productive and dry exploratory wells drilled; and (2) The number of net...
17 CFR 229.1205 - (Item 1205) Drilling and other exploratory and development activities.
Code of Federal Regulations, 2012 CFR
2012-04-01
... Registrants Engaged in Oil and Gas Producing Activities § 229.1205 (Item 1205) Drilling and other exploratory... 17 Commodity and Securities Exchanges 2 2012-04-01 2012-04-01 false (Item 1205) Drilling and other..., disclose: (1) The number of net productive and dry exploratory wells drilled; and (2) The number of net...
Building an Evaluation Scale using Item Response Theory.
Lalor, John P; Wu, Hao; Yu, Hong
2016-11-01
Evaluation of NLP methods requires testing against a previously vetted gold-standard test set and reporting standard metrics (accuracy/precision/recall/F1). The current assumption is that all items in a given test set are equal with regards to difficulty and discriminating power. We propose Item Response Theory (IRT) from psychometrics as an alternative means for gold-standard test-set generation and NLP system evaluation. IRT is able to describe characteristics of individual items - their difficulty and discriminating power - and can account for these characteristics in its estimation of human intelligence or ability for an NLP task. In this paper, we demonstrate IRT by generating a gold-standard test set for Recognizing Textual Entailment. By collecting a large number of human responses and fitting our IRT model, we show that our IRT model compares NLP systems with the performance in a human population and is able to provide more insight into system performance than standard evaluation metrics. We show that a high accuracy score does not always imply a high IRT score, which depends on the item characteristics and the response pattern.
Building an Evaluation Scale using Item Response Theory
Lalor, John P.; Wu, Hao; Yu, Hong
2016-01-01
Evaluation of NLP methods requires testing against a previously vetted gold-standard test set and reporting standard metrics (accuracy/precision/recall/F1). The current assumption is that all items in a given test set are equal with regards to difficulty and discriminating power. We propose Item Response Theory (IRT) from psychometrics as an alternative means for gold-standard test-set generation and NLP system evaluation. IRT is able to describe characteristics of individual items - their difficulty and discriminating power - and can account for these characteristics in its estimation of human intelligence or ability for an NLP task. In this paper, we demonstrate IRT by generating a gold-standard test set for Recognizing Textual Entailment. By collecting a large number of human responses and fitting our IRT model, we show that our IRT model compares NLP systems with the performance in a human population and is able to provide more insight into system performance than standard evaluation metrics. We show that a high accuracy score does not always imply a high IRT score, which depends on the item characteristics and the response pattern.1 PMID:28004039
Dorn, Barry C; Savoia, Elena; Testa, Marcia A; Stoto, Michael A; Marcus, Leonard J
2007-01-01
Survey instruments for evaluating public health preparedness have focused on measuring the structure and capacity of local, state, and federal agencies, rather than linkages among structure, process, and outcomes. To focus evaluation on the latter, we evaluated the linkages among individuals, organizations, and systems using the construct of "connectivity" and developed a measurement instrument. Results from focus groups of emergency preparedness first responders generated 62 items used in the development sample of 187 respondents. Item reduction and factors analyses were conducted to confirm the scale's components. The 62 items were reduced to 28. Five scales explained 70% of the total variance (number of items, percent variance explained, Cronbach's alpha) including connectivity with the system (8, 45%, 0.94), coworkers (7, 7%, 0.91), organization (7, 12%, 0.93), and perceptions (6, 6%, 0.90). Discriminant validity was found to be consistent with the factor structure. We developed a Connectivity Measurement Tool for the public health workforce consisting of a 34-item questionnaire found to be a reliable measure of connectivity with preliminary evidence of construct validity.
Assessing fear-avoidance beliefs in patients with cervical radiculopathy.
Dedering, Asa; Börjesson, Tina
2013-12-01
The study sought to evaluate validity and reliability of the Fear Avoidance Beliefs Questionnaire and the Tampa Scale for Kinesiophobia in patients with cervical radiculopathy. A test-retest design was used to test stability over time in 46 patients with cervical radiculopathy. Differences between patients and healthy subjects were also evaluated comparing the patients with 41 physically active and healthy subjects. The patients answered the Fear Avoidance Beliefs Questionnaire and the Tampa Scale for Kinesiophobia twice. To test for differences between the patients and the healthy subjects, the latter answered the same questionnaires once. Questionnaires about activity, personal factors and health were also used. The test-retest reliability assessed with weighted kappa was 0.68 for the Fear Avoidance Beliefs Questionnaire and 0.45 for the Tampa Scale for Kinesiophobia. Only six of the 11 single items of the Fear Avoidance Beliefs Questionnaire and none of the single items of the Tampa Scale of Kinesiophobia showed kappa coefficients exceeding 0.60 (good reliability). Patients with cervical radiculopathy rated significantly worse on the Fear Avoidance Beliefs Questionnaire and the Tampa Scale for Kinesiophobia than the healthy subjects did. The Fear Avoidance Beliefs Questionnaire may be recommended for test-retest evaluations because 'good' reliability was found. The Tampa Scale for Kinesiophobia had only 'moderate' test-retest reliability, and this should be considered when using this scale in test-retest evaluations. Both questionnaires can discriminate between patients with cervical radiculopathy and healthy subjects. Copyright © 2012 John Wiley & Sons, Ltd.
ERIC Educational Resources Information Center
New South Wales Dept. of Education, Sydney (Australia).
As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…
ERIC Educational Resources Information Center
New South Wales Dept. of Education, Sydney (Australia).
As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…
ERIC Educational Resources Information Center
New South Wales Dept. of Education, Sydney (Australia).
As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…
ERIC Educational Resources Information Center
Rizavi, Saba; Way, Walter D.; Lu, Ying; Pitoniak, Mary; Steffen, Manfred
2004-01-01
The purpose of this study was to use realistically simulated data to evaluate various CAT designs for use with the verbal reasoning measure of the Medical College Admissions Test (MCAT). Factors such as item pool depth, content constraints, and item formats often cause repeated adaptive administrations of an item at ability levels that are not…
Parent's confidence as a caregiver.
Raines, Deborah A; Brustad, Judith
2012-06-01
The purpose of this study was to describe the parent's self-reported confidence as a caregiver. The specific research questions were as follows: • What is the parent's perceived level of confidence when performing infant caregiving activities in the neonatal intensive care unit (NICU)? • What is the parent's projected level of confidence about performing infant caregiving activities on the first day at home? Participants were parents of infants with an anticipated discharge date within 5 days. Inclusion criteria were as follows: parent at least 18 years of age, infant's discharge destination is home with the parent, parent will have primary responsibility for the infant after discharge, and the infant's length of stay in the NICU was a minimum of 10 days. Descriptive, survey research. Participants perceived themselves to be confident in all but 2 caregiving activities when caring for their infants in the NICU, but parents projected a change in their level of confidence in their ability to independently complete infant care activities at home. When comparing the self-reported level of confidence in the NICU and the projected level of confidence at home, the levels of confidence decreased for 5 items, increased for 8 items, and remained unchanged for 2 items. All of the items with a decrease in score were the items with the lowest score when performed in the NICU. All of these low-scoring items are caregiving activities that are unique to the post-NICU status of the infant. Interestingly, the parent's projected level of confidence increased for the 8 items focused on handling and interacting with the infant. The findings of this research provide evidence that nurses may need to rethink when parents become active participants in their infant's medical-based caregiving activities.
Aazami, Sanaz; Mozafari, Mosayeb
2015-01-01
The patients’ rights status is one of the essential elements in defining norms related to the concept of clinical governance system. In addition, the patients’ rights status is an important index for quality of care offered in the health care system. However, the lack of a coherent instrument makes it difficult to evaluate patients’ rights status in hospitals and clinics. The aim of this study was to develop an instrument for the evaluation of patients’ rights prerequisites at educational hospitals in Iran. This study was conducted using the modified Delphi technique. In this study, 36 experts in the fields of law, medicine, and professional ethics were participated. The panel of experts participated in 3 rounds. First, experts were asked to judge some pre-identified items, and then, excluded items were judged again in the second round. At the end of the third round, all of the agreed items were included in the final list to form an evaluative scale on practice of patients’ rights. Experts were asked to judge a total 171 items in 3 rounds. Around 31% (n = 53) of items obtained the panel’s approval to be included in the final version of the scale. The experts’ opinions were collected using face-to-face interviews and electronic email during a 6-month period of data collection from October 2013 to February 2014. This study developed a 53-item scale for evaluation of patients’ rights prerequisites in educational hospitals in Iran. This scale was developed in 7 areas of commitments including university education, research, supervision, process management, physical structure, organizational policy, and human resources management. This study developed an evaluative scale to assess the practice of patients’ rights in educational hospitals. The items in the final version of this scale were obtained from a consensus of experts and the instrument can be used to evaluate the context and prerequisites for practice of patients’ rights in Iranian educational hospitals. PMID:27354900
Shikata, Satoru; Nakayama, Takeo; Yamagishi, Hisakazu
2008-01-01
In this study, we conducted a limited survey of reports of surgical randomized controlled trials, using the consolidated standards of reporting trials (CONSORT) statement and additional check items to clarify problems in the evaluation of surgical reports. A total of 13 randomized trials were selected from two latest review articles on biliary surgery. Each randomized trial was evaluated according to 28 quality measures that comprised items from the CONSORT statement plus additional items. Analysis focused on relationships between the quality of each study and the estimated effect gap ("pooled estimate in meta-analysis" -- "estimated effect of each study"). No definite relationships were found between individual study quality and the estimated effect gap. The following items could have been described but were not provided in almost all the surgical RCT reports: "clearly defined outcomes"; "details of randomization"; "participant flow charts"; "intention-to-treat analysis"; "ancillary analyses"; and "financial conflicts of interest". The item, "participation of a trial methodologist in the study" was not found in any of the reports. Although the quality of reporting trials is not always related to a biased estimation of treatment effect, the items used for quality measures must be described to enable readers to evaluate the quality and applicability of the reporting. Further development of an assessment tool is needed for items specific to surgical randomized controlled trials.
Ruiz-Sánchez de León, José M; Pedrero-Pérez, Eduardo J; Lozoya-Delgado, Paz; Llanero-Luque, Marcos; Rojo-Mota, Gloria; Puerta-García, Carmen
2012-06-01
Research has provided evidence of the presence of prefrontal symptoms in addicts, although they are usually evaluated using questionnaires that were created for acquired brain injury. To produce a specific instrument for evaluating those symptoms in subjects with addictions. For the study, 1624 participants were recruited (445 addicts and 1179 from the general population) and were given a 100-item inventory to complete based on the three spheres of human activity (cognition, emotion and behaviour) in relation to the three great prefrontal syndromes (dorsolateral, ventromedial and orbital). The preliminary analyses ruled out those that did not prove to have sufficient discriminating power, which resulted in the Prefrontal Symptoms Inventory (PSI) consisting of 46 items. The Dysexecutive Questionnaire (DEX-Sp) and the Perceived Stress Scale (PSS) were administered in order to study the convergent validity. The data show the three-factor structure of the questionnaire: problems with executive control (with three sub-factors: problems with motivation, control and attention), problems with social behaviour and problems with emotional control. The relationships between the scores on the PSI and sociodemographic and consumption variables, as well as with the DEX-Sp and the PSS were analysed. A reduced 20-item version is provided for screening. The PSI relates the ('subject-centred') self-evaluation of persons with the a priori ('brain-centred') theoretical formulation, the results showing adequate psychometric properties. We recommend its use when it comes to exploring the prefrontal symptoms of addicts, as well as other clinical or subclinical populations with similar cognitive profiles.
[Measurement of shoulder disability in the athlete: a systematic review].
Fayad, F; Mace, Y; Lefevre-Colau, M M; Poiraudeau, S; Rannou, F; Revel, M
2004-08-01
To identify all available shoulder disability questionnaires and to examine those that could be used for athlete. We systematically reviewed the literature in Medline using the keywords shoulder, function, scale, index, score, questionnaire, disability, quality of life, assessment, and evaluation. We searched for scales used for athletes with the keywords scale name AND (sport OR athlete). Data were completed by using the "Guide des Outils de Mesure et d'Evaluation en Médecine Physique et de Réadaptation" textbook. Analysis took into account the clinimetric quality of the instruments and the number of items specifically related to sports. A total of 37 instruments have been developed to measure disease-, shoulder-specific or upper extremity specific outcome. Older instruments were developed before the advent of modern measurement methods. They usually combined objective and subjective measures. Recent instruments were designed with use of more advanced methods. Most are self-administered questionnaires. Fourteen scales included items assessing sport activity. Four of these scales have been used to assess shoulder disability in athlete. Six scales have been used to assess such disability but do not have specific items related to sports. There is no gold standard for assessing shoulder outcome in the general population and no validated outcome instruments specifically for athletes. We suggest the use of ASES, WOSI and WORC scales for evaluating shoulder function in the recreational athletes. The DASH scale should be evaluated in this population. The principal criterion in evaluating shoulder function in the high level athlete is a return to the same level of sport performance. Further studies are required to identify measurement tools for shoulder disability that have a high predictive value for return to sport.
Disentangling the roles of arousal and amygdala activation in emotional declarative memory.
de Voogd, Lycia D; Fernández, Guillén; Hermans, Erno J
2016-09-01
A large body of evidence in animals and humans implicates the amygdala in promoting memory for arousing experiences. Although the amygdala can trigger threat-related noradrenergic-sympathetic arousal, in humans amygdala activation and noradrenergic-sympathetic arousal do not always concur. This raises the question how these two processes play a role in enhancing emotional declarative memory. This study was designed to disentangle these processes in a combined subsequent-memory/fear-conditioning paradigm with neutral items belonging to two conceptual categories as conditioned stimuli. Functional MRI, skin conductance (index of sympathetic activity), and pupil dilation (indirect index of central noradrenergic activity) were acquired throughout procedures. Recognition memory for individual items was tested 24 h later. We found that pupil dilation and skin conductance responses were higher on CS+ (associated with a shock) compared with CS- trials, irrespective of later memory for those items. By contrast, amygdala activity was only higher for CS+ items that were later confidently remembered compared with CS+ items that were later forgotten. Thus, amygdala activity and not noradrenergic-sympathetic arousal, predicted enhanced declarative item memory. This dissociation is in line with animal models stating that the amygdala integrates arousal-related neuromodulatory changes to alter mnemonic processes elsewhere in the brain. © The Author (2016). Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.
Hand function evaluation: a factor analysis study.
Jarus, T; Poremba, R
1993-05-01
The purpose of this study was to investigate hand function evaluations. Factor analysis with varimax rotation was used to assess the fundamental characteristics of the items included in the Jebsen Hand Function Test and the Smith Hand Function Evaluation. The study sample consisted of 144 subjects without disabilities and 22 subjects with Colles fracture. Results suggest a four factor solution: Factor I--pinch movement; Factor II--grasp; Factor III--target accuracy; and Factor IV--activities of daily living. These categories differentiated the subjects without Colles fracture from the subjects with Colles fracture. A hand function evaluation consisting of these four factors would be useful. Such an evaluation that can be used for current clinical purposes is provided.
The role of the hippocampus in transitive inference
Zalesak, Martin; Heckers, Stephan
2009-01-01
Transitive inference (TI) is the ability to infer the relationship between items (e.g., A>C) after having learned a set of premise pairs (e.g., A>B and B>C). Previous studies in humans have identified a distributed neural network, including cortex, hippocampus, and thalamus, during TI judgments. We studied two aspects of TI using fMRI of subjects who had acquired the 6-item sequence (A>B>C>D>E>F) of visual stimuli. First, the identification of novel pairs not containing end items (i.e., B>D, C>E, B>E) was associated with greater left hippocampal activation when compared to the identification of novel pairs containing end items A and F. This demonstrates that the identification of stimulus pairs requiring the flexible representation of a sequence is associated with hippocampal activation. Second, for the three novel pairs devoid of end items we found greater right hippocampal activation for pairs B>D and C>E compared with pair B>E. This indicates that TI decisions on pairs derived from more adjacent items in the sequence are associated with greater hippocampal activation. Hippocampal activation thus scales with the degree of relational processing necessary for TI judgments. Both findings confirm a role of the hippocampus in transitive inference in humans. PMID:19216061
Usefulness of a KT Event to Address Practice and Policy Gaps Related to Integrated Care.
Jackson, Karen; Boakye, Omenaa; Wallace, Nicole
2016-02-01
There are limited evaluations of the impact of knowledge translation (KT) activities aimed at addressing practice and policy gaps. We report on the impact of an interactive, end-of-grant KT event. Although action items were developed and key stakeholder support attained, minimal follow-through had occurred three months after the KT event. Several organizational obstacles to transitioning knowledge into action were identified: leadership, program policies, infrastructure, changing priorities, workload and physician engagement. Key messages include: (1) ensure ongoing and facilitated networking opportunities, (2) invest in building implementation capacity, (3) target multi-level implementation activities and (4) focus further research on KT evaluation. Copyright © 2016 Longwoods Publishing.
Sexual Health and Positive Subjective Well-Being in Partnered Older Men and Women.
Lee, David M; Vanhoutte, Bram; Nazroo, James; Pendleton, Neil
2016-07-01
We examine the associations between different patterns of sexual behavior and function and three indicators of subjective well-being (SWB) covering eudemonic, evaluative, and affective well-being in a representative sample of partnered older people. Using data from a Sexual Relationships and Activities Questionnaire (SRA-Q) in Wave 6 of the English Longitudinal Study of Ageing, latent class analysis identified groups characterized by distinctive patterns of sexual behavior and function and then examined their link to SWB. Eudemonic SWB was measured using a revised 15-item version of the CASP-19, evaluative SWB using the Satisfaction With Life Scale, and affective SWB using the 8-item version of the Centre for Epidemiologic Studies-Depression scale. Sexual behavior and function was best described by six classes among men and five classes among women. These ranged from high sexual desire, frequent partnered sexual activities, and few sexual problems (Class 1) to low sexual desire, infrequent/no sexual activity, and problems with sexual function (Class 5([women])/6([men])). Men and women who reported either infrequent/no sexual activity, or were sexually active but reported sexual problems, generally had lower SWB than those individuals identified in Class 1. Poorer SWB in men was more strongly associated with sexual function difficulties, whereas in women desire and frequency of partnered activities appeared more important in relation to SWB. Within the context of a partnered relationship continuing sexual desire, activity and functioning are associated with higher SWB, with distinctive patterns for women and men. © The Author 2016. Published by Oxford University Press on behalf of The Gerontological Society of America. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Kerner, Matthew S; Kalinski, Michael I
2002-08-01
Using the Theory of Planned Behavior as a framework, the Attitude to Leisure-time Physical Activity, Expectations of Others, Perceived Control, and Intention of Engage in Leisure-time Physical Activity scales were developed for use among high school students. The study population included 20 boys and 68 girls 13 to 17 years of age (for boys, M = 15.1 yr., SD = 1.0; for girls, M = 15.0 yr., SD = 1.1). Generation of items and the establishment of content validity were performed by professionals in exercise physiology, physical education, and clinical psychology. Each scale item was phrased in a Likert-type format. Both unipolar and bipolar scales with seven response choices were developed. Following the pilot testing and subsequent revisions, 32 items were retained in the Attitude to Leisure-time Physical Activity scale, 10 items were retained in the Expectations of Others scale, 3 items were retained in the Perceived Control Scale, and 24 items were retained in the Intention to Engage in Leisure-time Physical Activity scale. Coefficients indicated adequate stability and internal consistency with alpha ranging from .81 to .96. Studies of validities are underway, after which scales would be made available to those interested in intervention techniques for promoting positive attitudes toward physical fitness, perception of control over engaging in leisure-time physical activities, and good intentions to engage in leisure-time physical activities. The present results are encouraging.
Pollard, Beth; Dixon, Diane; Dieppe, Paul; Johnston, Marie
2009-01-01
Background The International Classification of Functioning, Disability and Health (ICF) proposes three main health outcomes, Impairment (I), Activity Limitation (A) and Participation Restriction (P), but good measures of these constructs are needed The aim of this study was to use both Classical Test Theory (CTT) and Item Response Theory (IRT) methods to carry out an item analysis to improve measurement of these three components in patients having joint replacement surgery mainly for osteoarthritis (OA). Methods A geographical cohort of patients about to undergo lower limb joint replacement was invited to participate. Five hundred and twenty four patients completed ICF items that had been previously identified as measuring only a single ICF construct in patients with osteoarthritis. There were 13 I, 26 A and 20 P items. The SF-36 was used to explore the construct validity of the resultant I, A and P measures. The CTT and IRT analyses were run separately to identify items for inclusion or exclusion in the measurement of each construct. The results from both analyses were compared and contrasted. Results Overall, the item analysis resulted in the removal of 4 I items, 9 A items and 11 P items. CTT and IRT identified the same 14 items for removal, with CTT additionally excluding 3 items, and IRT a further 7 items. In a preliminary exploration of reliability and validity, the new measures appeared acceptable. Conclusion New measures were developed that reflect the ICF components of Impairment, Activity Limitation and Participation Restriction for patients with advanced arthritis. The resulting Aberdeen IAP measures (Ab-IAP) comprising I (Ab-I, 9 items), A (Ab-A, 17 items), and P (Ab-P, 9 items) met the criteria of conventional psychometric (CTT) analyses and the additional criteria (information and discrimination) of IRT. The use of both methods was more informative than the use of only one of these methods. Thus combining CTT and IRT appears to be a valuable tool in the development of measures. PMID:19422677
Neural correlates of differential retrieval orientation: Sustained and item-related components.
Woodruff, C Chad; Uncapher, Melina R; Rugg, Michael D
2006-01-01
Retrieval orientation refers to a cognitive state that biases processing of retrieval cues in service of a specific goal. The present study used a mixed fMRI design to investigate whether adoption of different retrieval orientations - as indexed by differences in the activity elicited by retrieval cues corresponding to unstudied items - is associated with differences in the state-related activity sustained across a block of test trials sharing a common retrieval goal. Subjects studied mixed lists comprising visually presented words and pictures. They then undertook a series of short test blocks in which all test items were visually presented words. The blocks varied according to whether the test items were used to cue retrieval of studied words or studied pictures. In several regions, neural activity elicited by correctly classified new items differed according to whether words or pictures were the targeted material. The loci of these effects suggest that one factor driving differential cue processing is modulation of the degree of overlap between cue and targeted memory representations. In addition to these item-related effects, neural activity sustained throughout the test blocks also differed according to the nature of the targeted material. These findings indicate that the adoption of different retrieval orientations is associated with distinct neural states. The loci of these sustained effects were distinct from those where new item activity varied, suggesting that the effects may play a role in biasing retrieval cue processing in favor of the current retrieval goal.
Setting Priorities: A Handbook of Alternative Techniques.
ERIC Educational Resources Information Center
Price, Nelson C.
Six models for setting priorities are presented in a workbook format with exercises for evaluating or practicing five techniques. In the San Mateo model one sets priorities, clarifies priority purpose, lists items, determines criteria, lists items and criteria on a rating sheet, studies all information on items, rates each item, tallies results,…
Differential Item Functioning: Its Consequences. Research Report. ETS RR-10-01
ERIC Educational Resources Information Center
Lee, Yi-Hsuan; Zhang, Jinming
2010-01-01
This report examines the consequences of differential item functioning (DIF) using simulated data. Its impact on total score, item response theory (IRT) ability estimate, and test reliability was evaluated in various testing scenarios created by manipulating the following four factors: test length, percentage of DIF items per form, sample sizes of…
ASCAL: A Microcomputer Program for Estimating Logistic IRT Item Parameters.
ERIC Educational Resources Information Center
Vale, C. David; Gialluca, Kathleen A.
ASCAL is a microcomputer-based program for calibrating items according to the three-parameter logistic model of item response theory. It uses a modified multivariate Newton-Raphson procedure for estimating item parameters. This study evaluated this procedure using Monte Carlo Simulation Techniques. The current version of ASCAL was then compared to…
Maindal, Helle Terkildsen; Sokolowski, Ineta; Vedsted, Peter
2009-06-29
The Patient Activation Measure (PAM) is a measure that assesses patient knowledge, skill, and confidence for self-management. This study validates the Danish translation of the 13-item Patient Activation Measure (PAM13) in a Danish population with dysglycaemia. 358 people with screen-detected dysglycaemia participating in a primary care health education study responded to PAM13. The PAM13 was translated into Danish by a standardised forward-backward translation. Data quality was assessed by mean, median, item response, missing values, floor and ceiling effects, internal consistency (Cronbach's alpha and average inter-item correlation) and item-rest correlations. Scale properties were assessed by Rasch Rating Scale models. The item response was high with a small number of missing values (0.8-4.2%). Floor effect was small (range 0.6-3.6%), but the ceiling effect was above 15% for all items (range 18.6-62.7%). The alpha-coefficient was 0.89 and the average inter-item correlation 0.38. The Danish version formed a unidimensional, probabilistic Guttman-like scale explaining 43.2% of the variance. We did however, find a different item sequence compared to the original scale. A Danish version of PAM13 with acceptable validity and reliability is now available. Further development should focus on single items, response categories in relation to ceiling effects and further validation of reproducibility and responsiveness.
Tolley, Elizabeth E; Guthrie, Kate Morrow; Zissette, Seth; Fava, Joseph L; Gill, Katherine; Louw, Cheryl E; Kotze, Philip; Reddy, Krishnaveni; MacQueen, Kathleen
2018-01-01
Low adherence in recent HIV prevention clinical trials highlights the need to better understand, measure, and support product use within clinical trials. Conventional self-reported adherence instruments within HIV prevention trials, often relying on single-item questions, have proven ineffective. While objective adherence measures are desirable, none currently exist that apply to both active and placebo arms. Scales are composed of multiple items in the form of questions or statements that, when combined, measure a more complex construct that may not be directly observable. When psychometrically validated, such measures may better assess the multiple factors contributing to adherence/non-adherence. This study aimed to develop and psychometrically evaluate tools to screen and monitor trial participants' adherence to HIV prevention products within the context of clinical trial research. Based on an extensive literature review and conceptual framework, we identified and refined 86 items assessing potential predictors of adherence and 48 items assessing adherence experience. A structured survey, including adherence items and other variables, was administered to former ASPIRE and Ring Study participants and similar non-trial participants (n = 709). We conducted exploratory factor analyses (EFA) to identify a reduced set of constructs and items that could be used at screening to predict potential adherence, and at follow-up to monitor and intervene on adherence. We examined associations with other variables to assess content and construct validity. The EFA of screener items resulted in a 6-factor solution with acceptable to very good internal reliability (α: .62-.84). Similar to our conceptual framework, factors represent trial-related commitment (Distrust of Research and Commitment to Research); alignment with trial requirements (Visit Adherence and Trial Incompatibility); Belief in Trial Benefits and Partner Disclosure. The EFA on monitoring items resulted in 4 Product-specific factors that represent Vaginal Ring Doubts, Vaginal Ring Benefits, Ring Removal, and Side Effects with good to very good internal reliability (α = .71-.82). Evidence of content and construct validity was found; relationship to social desirability bias was examined. These scales are easy and inexpensive to administer, available in several languages, and are applicable regardless of randomization. Once validated prospectively, they could (1) screen for propensity to adhere, (2) target adherence support/counselling, and (3) complement biomarker measures in determining true efficacy of the experimental product.
The sensory timecourses associated with conscious visual item memory and source memory.
Thakral, Preston P; Slotnick, Scott D
2015-09-01
Previous event-related potential (ERP) findings have suggested that during visual item and source memory, nonconscious and conscious sensory (occipital-temporal) activity onsets may be restricted to early (0-800 ms) and late (800-1600 ms) temporal epochs, respectively. In an ERP experiment, we tested this hypothesis by separately assessing whether the onset of conscious sensory activity was restricted to the late epoch during source (location) memory and item (shape) memory. We found that conscious sensory activity had a late (>800 ms) onset during source memory and an early (<200 ms) onset during item memory. In a follow-up fMRI experiment, conscious sensory activity was localized to BA17, BA18, and BA19. Of primary importance, the distinct source memory and item memory ERP onsets contradict the hypothesis that there is a fixed temporal boundary separating nonconscious and conscious processing during all forms of visual conscious retrieval. Copyright © 2015 Elsevier B.V. All rights reserved.
Stevenson, Katherine; Busch, Angela; Scott, Darlene J.; Henry, Carol; Wall, Patricia A.
2009-01-01
Objectives To develop and evaluate a classroom-based curriculum designed to promote interprofessional competencies by having undergraduate students from various health professions work together on system-based problems using quality improvement (QI) methods and tools to improve patient-centered care. Design Students from 4 health care programs (nursing, nutrition, pharmacy, and physical therapy) participated in an interprofessional QI activity. In groups of 6 or 7, students completed pre-intervention and post-intervention reflection tools on attitudes relating to interprofessio nal teams, and a tool designed to evaluate group process. Assessment One hundred thirty-four students (76.6%) completed both self-reflection instruments, and 132 (74.2%) completed the post-course group evaluation instrument. Although already high prior to the activity, students' mean post-intervention reflection scores increased for 12 of 16 items. Post-intervention group evaluation scores reflected a high level of satisfaction with the experience. Conclusion Use of a quality-based case study and QI methodology were an effective approach to enhancing interprofessional experiences among students. PMID:19657497
Madanat, Hala; Merrill, Ray M
2006-01-01
The purpose of this study was to investigate physical activity levels across the five stages of change for physical activity and to identify motivational factors for physical activity according to these stages of change among college students in Amman, Jordan. Analyses were based on a cross-sectional survey of 431 students, with a mean age of 21.1 (SD=0.16) and 67.5% female. Based on the recommendation that physical activity requires at least 30 minutes of physical activity 3 or more days per week, men were more likely than women to classify themselves in later stages: 7.3% vs. 9.5% in the precontemplation stage, 17.4% vs. 14.7% in the contemplation stage, 50.0% vs. 63.5% in the preparation stage, 9.4% vs. 5.6% in the action stage, and 15.9% vs. 6.7% in the maintenance stage [X2(4) = 14.04, p = 0.0072]. Seven potential motivational items for physical activity were assessed using factor analysis: experience better self-worth, prevent chronic disease, relieve stress, stay in shape, longevity, recreation/fun, and social benefits. Two factor groupings were identified from these items. The first factor included the first five items, labeled as "Physical and Mental". The second factor included the last two items, labeled as "Social and Recreational." "Physical and Mental" items compared with "Social and Recreational" items were most likely to motivate physical activity across the stages of change for physical activity. The strongest motivator of physical activity was to stay in shape. The weakest motivator of physical activity was for social reasons. The influence of the intermediate motivational factors was slightly affected by the students' stage of change for physical activity. Motivators for physical activity did not differ according to sex. These results provide important information about the motivational factors for physical activity for college-aged students in Jordan that can be useful in developing effective physical activity intervention programs.
Designing an evaluation framework for WFME basic standards for medical education.
Tackett, Sean; Grant, Janet; Mmari, Kristin
2016-01-01
To create an evaluation plan for the World Federation for Medical Education (WFME) accreditation standards for basic medical education. We conceptualized the 100 basic standards from "Basic Medical Education: WFME Global Standards for Quality Improvement: The 2012 Revision" as medical education program objectives. Standards were simplified into evaluable items, which were then categorized as inputs, processes, outputs and/or outcomes to generate a logic model and corresponding plan for data collection. WFME standards posed significant challenges to evaluation due to complex wording, inconsistent formatting and lack of existing assessment tools. Our resulting logic model contained 244 items. Standard B 5.1.1 separated into 24 items, the most for any single standard. A large proportion of items (40%) required evaluation of more than one input, process, output and/or outcome. Only one standard (B 3.2.2) was interpreted as requiring evaluation of a program outcome. Current WFME standards are difficult to use for evaluation planning. Our analysis may guide adaptation and revision of standards to make them more evaluable. Our logic model and data collection plan may be useful to medical schools planning an institutional self-review and to accrediting authorities wanting to provide guidance to schools under their purview.
The development of the lunchtime enjoyment of activity and play questionnaire.
Hyndman, Brendon; Telford, Amanda; Finch, Caroline; Ullah, Shahid; Benson, Amanda C
2013-04-01
Enjoyment of physical activity is as an important determinant of children's participation in physical activity. Despite this, there is an absence of reliable measures for assessing children's enjoyment of play activities during school lunchtime. The purpose of this study was to develop and assess the reliability of the Lunchtime Enjoyment of Activity and Play (LEAP) Questionnaire. Questionnaire items were categorized employing a social-ecological framework including intrapersonal (20 items), interpersonal (2 items), and physical environment/policy (17 items) components to identify the broader influences on children's enjoyment. An identical questionnaire was administered on 2 occasions, 10 days apart, to 176 children aged 8-12 years, attending a government elementary school in regional Victoria, Australia. Test-retest reliability confirmed that 35 of 39 LEAP Questionnaire items had at least moderate kappa agreement ranging from .44 to .78. Although 4 individual kappa values were low, median kappa scores for each aggregated social-ecological component reached at least moderate agreement (.44-.60). This study confirms the LEAP Questionnaire to be a reliable, context-specific instrument with sound content, and face validity that employs a social-ecological framework to assess children's enjoyment of school play and lunchtime activities. © 2013, American School Health Association.
Sadeh, Talya; Maril, Anat; Bitan, Tali; Goshen-Gottstein, Yonatan
2012-03-01
A remarkable act of memory entails binding different forms of information. We focus on the timeless question of how the bound engram is accessed such that its component features-item and context-are extracted. To shed light on this question, we investigate the dynamics between brain structures that together mediate the binding and extraction of item and context. Converging evidence has implicated the Parahippocampal cortex (PHc) in contextual processing, the Perirhinal cortex (PRc) in item processing, and the hippocampus in item-context binding. Effective connectivity analysis was conducted on fMRI data gathered during retrieval on tests that differ with regard to the to-be-extracted information. Results revealed that recall is initiated by context-related PHc activity, followed by hippocampal item-context engram activation, and completed with retrieval of the study-item by the PRc. The reverse path was found for recognition. We thus provide novel evidence for dissociative patterns of item-context unbinding during retrieval. Copyright © 2011 Elsevier Inc. All rights reserved.
[Role of creative discussion in the learning of critical reading of scientific articles].
Cobos-Aguilar, Héctor; Viniegra-Velázquez, Leonardo; Pérez-Cortés, Patricia
2011-01-01
To compare two active educational strategies on critical reading (two and three stages) for research learning in medical students. Four groups were conformed in a quasi-experimental design. The medical student group, related to three stages (critical reading guide resolution, creative discussion, group discussion) g1, n = 9 with school marks > 90 and g2, n = 19 with a < 90, respectively. The two-stage groups (guide resolution and group discussion) were conformed by pre-graduate interns, g3, n = 17 and g4, n = 12, who attended social security general hospitals. A validated and consistent survey with 144 items was applied to the four groups before and after educational strategies. Critical reading with its subcomponents: interpretation, judgment and proposal were evaluated with 47, 49 and 48 items, respectively. The case control studies, cohort studies, diagnostic test and clinical trial designs were evaluated. Nonparametric significance tests were performed to compare the groups and their results. A bias calculation was performed for each group. The highest median was obtained by the three-stage groups (g1 and g2) and so were the medians in interpretation, judgment and proposal. The several research design results were higher in the same groups. An active educational strategy with three stages is superior to another with two stages in medical students. It is advisable to perform these activities in goal of better learning in our students.
ERIC Educational Resources Information Center
New South Wales Dept. of Education, Sydney (Australia).
As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…
ERIC Educational Resources Information Center
New South Wales Dept. of Education, Sydney (Australia).
As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…
ERIC Educational Resources Information Center
New South Wales Dept. of Education, Sydney (Australia).
As one in a series of test item collections developed by the Assessment and Evaluation Unit of the Directorate of Studies, items are made available to teachers for the construction of unit tests or term examinations or as a basis for class discussion. Each collection was reviewed for content validity and reliability. The test items meet syllabus…
ERIC Educational Resources Information Center
Ito, Kyoko; Sykes, Robert C.
This study investigated the practice of weighting a type of test item, such as constructed response, more than other types of items, such as selected response, to compute student scores for a mixed-item type of test. The study used data from statewide writing field tests in grades 3, 5, and 8 and considered two contexts, that in which a single…
Avila, M L; Brandão, L R; Williams, S; Ward, L C; Montoya, M I; Stinson, J; Kiss, A; Lara-Corrales, I; Feldman, B M
2016-08-01
Our goal was to conduct the item generation and piloting phases of a new discriminative and evaluative tool for pediatric post-thrombotic syndrome. We followed a formative model for the development of the tool, focusing on the signs/symptoms (items) that define post-thrombotic syndrome. For item generation, pediatric thrombosis experts and subjects diagnosed with extremity post-thrombotic syndrome during childhood nominated items. In the piloting phase, items were cross-sectionally measured in children with limb deep vein thrombosis to examine item performance. Twenty-three experts and 16 subjects listed 34 items, which were then measured in 140 subjects with previous diagnosis of limb deep vein thrombosis (70 upper extremity and 70 lower extremity). The items with strongest correlation with post-thrombotic syndrome severity and largest area under the curve were pain (in older children), paresthesia, and swollen limb for the upper extremity group, and pain (in older children), tired limb, heaviness, tightness and paresthesia for the lower extremity group. The diagnostic properties of the items and their correlations with post-thrombotic syndrome severity varied according to the assessed venous territory. The information gathered in this study will help experts decide which item should be considered for inclusion in the new tool. Copyright © 2016 Elsevier Ltd. All rights reserved.
Baylor, Carolyn R.; Birch, Kristen; Yorkston, Kathryn M.
2017-01-01
Purpose The Communicative Participation Item Bank (CPIB) was developed to evaluate participation restrictions in communication situations for individuals with speech and language disorders. This study evaluated the potential relevance of CPIB items for individuals with hearing loss. Method Cognitive interviews were conducted with 17 adults with a range of treated and untreated hearing loss, who responded to 46 items. Interviews were continued until saturation was reached and prevalent trends emerged. A focus group was also conducted with 3 experienced audiologists to seek their views on the CPIB. Analysis of data included qualitative and quantitative approaches. Results The majority of the items were applicable to individuals with hearing loss; however, 12 items were identified as potentially not relevant. This was largely attributed to the items' focus on speech production rather than hearing. The results from the focus group were in agreement for a majority of items. Conclusions The next step in validating the CPIB for individuals with hearing loss is a psychometric analysis on a large sample. Possible outcomes could be that the CPIB is considered valid in its entirety or the creation of a new questionnaire or a hearing loss–specific short form with a subset of items is necessary. PMID:28114665
The Color Red Supports Avoidance Reactions to Unhealthy Food.
Rohr, Michaela; Kamm, Friederike; Koenigstorfer, Joerg; Groeppel-Klein, Andrea; Wentura, Dirk
2015-01-01
Empirical evidence suggests that the color red acts like an implicit avoidance cue in food contexts. Thus specific colors seem to guide the implicit evaluation of food items. We built upon this research by investigating the implicit meaning of color (red vs. green) in an approach-avoidance task with healthy and unhealthy food items. Thus, we examined the joint evaluative effects of color and food: Participants had to categorize food items by approach-avoidance reactions, according to their healthfulness. Items were surrounded by task-irrelevant red or green circles. We found that the implicit meaning of the traffic light colors influenced participants' reactions to the food items. The color red (compared to green) facilitated automatic avoidance reactions to unhealthy foods. By contrast, approach behavior toward healthy food items was not moderated by color. Our findings suggest that traffic light colors can act as implicit cues that guide automatic behavioral reactions to food.
Core Items for a Standardized Resource Use Measure: Expert Delphi Consensus Survey.
Thorn, Joanna C; Brookes, Sara T; Ridyard, Colin; Riley, Ruth; Hughes, Dyfrig A; Wordsworth, Sarah; Noble, Sian M; Thornton, Gail; Hollingworth, William
2018-06-01
Resource use measurement by patient recall is characterized by inconsistent methods and a lack of validation. A validated standardized resource use measure could increase data quality, improve comparability between studies, and reduce research burden. To identify a minimum set of core resource use items that should be included in a standardized adult instrument for UK health economic evaluation from a provider perspective. Health economists with experience of UK-based economic evaluations were recruited to participate in an electronic Delphi survey. Respondents were asked to rate 60 resource use items (e.g., medication names) on a scale of 1 to 9 according to the importance of the item in a generic context. Items considered less important according to predefined consensus criteria were dropped and a second survey was developed. In the second round, respondents received the median score and their own score from round 1 for each item alongside summarized comments and were asked to rerate items. A final project team meeting was held to determine the recommended core set. Forty-five participants completed round 1. Twenty-six items were considered less important and were dropped, 34 items were retained for the second round, and no new items were added. Forty-two respondents (93.3%) completed round 2, and greater consensus was observed. After the final meeting, 10 core items were selected, with further items identified as suitable for "bolt-on" questionnaire modules. The consensus on 10 items considered important in a generic context suggests that a standardized instrument for core resource use items is feasible. Copyright © 2018. Published by Elsevier Inc.
Ohashi, Y; Tashiro, K; Itoyama, Y; Nakano, I; Sobue, G; Nakamura, S; Sumino, S; Yanagisawa, N
2001-04-01
Amyotrophic lateral sclerosis(ALS) is progressive, degenerative, fatal disease of the motor neuron. No efficacious therapy is available to slow the progressive loss of function, but several new approaches including neurotrophic factors, antioxidants and glutamate antagonists, are currently being evaluated as potential therapies. Mortality, and/or time to tracheostomy, muscle strength and pulmonary function are used as primary endpoints in clinical trials for treatment of ALS. The effect of new therapies on the quality of patients' lives are also important, so we sought to develop a rating scale to measure it. The revised ALS Functional Rating Scale(ALSFRS-R), which has addition of items to ALSFRS to enhance the ability to assess respiratory symptoms, is an assessment determining the degree of impairment in ALS patients' abilities to function independently in activities of daily living. It consists of 12 items to evaluate bulbar function, motor function and respiratory function and each item is scored from 0(unable) to 4(normal). We translated the English score into Japanese one with minor modification considering the inter cultural difference. And we examined reliability of the translated scale. As a measure of reliability, the intraclass correlation coefficient(ICC) was evaluated for total score and the Kappa coefficient proposed by Cohen and Kraemer was calculated for each item. Moreover, we examined sensitivity to clinical change over time and carried out the factor analysis to analyze the factorial structure. The subjects were 27 ALS patients and each was scored twice for reliability or three times for sensitivity by 2 to 5 neurologists and if possible, nurses. The ICC for total score was 0.97(95% C. I.; 0.94-0.98). Extension of the Kappa coefficients were 0.48 to 1.00 for inter-rater reliability and the averaged Kappa coefficients were 0.63 to 1.00 for intra rater reliability, respectively. Concerning the factorial structure, the contribution of the first factor(the first principal component) were 53.5% principal factor solution. The factor loadings of items were 0.52-0.91 except "salivation" and this factor almost equal to the simple sum of all items was interpreted as the general degree of deterioration. The promax votation revealed the riginally supposed factor structure with 3 factors(groups of items): neuromuscuclar function, respiratory function and bulbar function. The rating scale correlated with Global clinical impression of change(GCIC) scored by neurologists and declined with time, indicating its sensitivity to change. On the bases of these results, ALSFRS-R(Japanese version) is considered to be highly reliable enough for clinical use.
Active Learning with Irrelevant Examples
NASA Technical Reports Server (NTRS)
Mazzoni, Dominic; Wagstaff, Kiri L.; Burl, Michael
2006-01-01
Active learning algorithms attempt to accelerate the learning process by requesting labels for the most informative items first. In real-world problems, however, there may exist unlabeled items that are irrelevant to the user's classification goals. Queries about these points slow down learning because they provide no information about the problem of interest. We have observed that when irrelevant items are present, active learning can perform worse than random selection, requiring more time (queries) to achieve the same level of accuracy. Therefore, we propose a novel approach, Relevance Bias, in which the active learner combines its default selection heuristic with the output of a simultaneously trained relevance classifier to favor items that are likely to be both informative and relevant. In our experiments on a real-world problem and two benchmark datasets, the Relevance Bias approach significantly improved the learning rate of three different active learning approaches.
The Time-Course of Lexical Activation During Sentence Comprehension in People With Aphasia
Ferrill, Michelle; Love, Tracy; Walenski, Matthew; Shapiro, Lewis P.
2012-01-01
Purpose To investigate the time-course of processing of lexical items in auditorily presented canonical (subject–verb–object) constructions in young, neurologically unimpaired control participants and participants with left-hemisphere damage and agrammatic aphasia. Method A cross modal picture priming (CMPP) paradigm was used to test 114 control participants and 8 participants with agrammatic aphasia for priming of a lexical item (direct object noun) immediately after it is initially encountered in the ongoing auditory stream and at 3 additional time points at 400-ms intervals. Results The control participants demonstrated immediate activation of the lexical item, followed by a rapid loss (decay). The participants with aphasia demonstrated delayed activation of the lexical item. Conclusion This evidence supports the hypothesis of a delay in lexical activation in people with agrammatic aphasia. The delay in lexical activation feeds syntactic processing too slowly, contributing to comprehension deficits in people with agrammatic aphasia. PMID:22355007
Jacobson, C Jeffrey; Kashikar-Zuck, Susmita; Farrell, Jennifer; Barnett, Kimberly; Goldschneider, Ken; Dampier, Carlton; Cunningham, Natoshia; Crosby, Lori; DeWitt, Esi Morgan
2015-12-01
As initial steps in a broader effort to develop and test pediatric pain behavior and pain quality item banks for the Patient-Reported Outcomes Measurement Information System (PROMIS), we used qualitative interview and item review methods to 1) evaluate the overall conceptual scope and content validity of the PROMIS pain domain framework among children with chronic/recurrent pain conditions, and 2) develop item candidates for further psychometric testing. To elicit the experiential and conceptual scope of pain outcomes across a variety of pediatric recurrent/chronic pain conditions, we conducted 32 semi-structured individual and 2 focus-group interviews with children and adolescents (8-17 years), and 32 individual and 2 focus-group interviews with parents of children with pain. Interviews with pain experts (10) explored the operational limits of pain measurement in children. For item bank development, we identified existing items from measures in the literature, grouped them by concept, removed redundancies, and modified the remaining items to match PROMIS formatting. New items were written as needed and cognitive debriefing was completed with the children and their parents, resulting in 98 pain behavior (47 self, 51 proxy), 54 quality, and 4 intensity items for further testing. Qualitative content analyses suggest that reportable pain outcomes that matter to children with pain are captured within and consistent with the pain domain framework in PROMIS. PROMIS pediatric pain behavior, quality, and intensity items were developed based on a theoretical framework of pain that was evaluated by multiple stakeholders in the measurement of pediatric pain, including researchers, clinicians, and children with pain and their parents, and the appropriateness of the framework was verified. Copyright © 2015 American Pain Society. Published by Elsevier Inc. All rights reserved.
Earned print media in advancing tobacco control in Himachal Pradesh, India: a descriptive study
Sharma, Renu; Shewade, Hemant Deepak; Gopalan, Balasubramaniam; Badrel, Ramesh Kumar; Rana, Jugdeep Singh
2017-01-01
Background The Union-Bloomberg Initiative tobacco control projects were implemented in Himachal Pradesh (a hilly state in North India) from 2007 to 2014. The project focused on the establishment of an administrative framework; increasing the capacity of stakeholders; enforcement of legislation; coalition and networking with multiple stakeholders; awareness generation with focus on earned media and monitoring and evaluation with policy-focussed research. This study aimed to systematically analyse all earned print news items related to the projects. Methods In this cross-sectional descriptive study, quantitative content analysis of earned print news items was carried out using predetermined codes related to areas of tobacco control policies. We also carried out a cost description of the hypothetical value of this earned media. The area of the news item in cm2 was multiplied by the average rate of space for the paid news item in that particular newspaper. Results There were 6348 news items: the numbers steadily increased with time. Focus on Monitoring tobacco use, Protecting people from tobacco smoke, Offering help to quit, Warning about dangers of tobacco, Enforcing a ban on tobacco advertising and promotion, Raising tax on tobacco products was seen in 24, 17, 9, 23, 22 and 3% of news items, respectively. Press releases were highest at 44% and report by correspondents at 24%. Further, 55, 23 and 21% news items focused on smoking, smokeless and both forms of tobacco use, respectively. Sixty-six per cent and 34% news items, respectively, were focused on youth and women. The news items had a hypothetical value of US$1503 628.3, which was three times more than the funds spent on all project activities. Conclusions In the absence of funding for paid media, the project strategically used earned media to promote tobacco control policies in the state. PMID:28589021
Dür, Mona; Steiner, Günter; Fialka-Moser, Veronika; Kautzky-Willer, Alexandra; Dejaco, Clemens; Prodinger, Birgit; Stoffer, Michaela Alexandra; Binder, Alexa; Smolen, Josef; Stamm, Tanja Alexandra
2014-04-05
Self-reported outcome instruments in health research have become increasingly important over the last decades. Occupational therapy interventions often focus on occupational balance. However, instruments to measure occupational balance are scarce. The aim of the study was therefore to develop a generic self-reported outcome instrument to assess occupational balance based on the experiences of patients and healthy people including an examination of its psychometric properties. We conducted a qualitative analysis of the life stories of 90 people with and without chronic autoimmune diseases to identify components of occupational balance. Based on these components, the Occupational Balance-Questionnaire (OB-Quest) was developed. Construct validity and internal consistency of the OB-Quest were examined in quantitative data. We used Rasch analyses to determine overall fit of the items to the Rasch model, person separation index and potential differential item functioning. Dimensionality testing was conducted by the use of t-tests and Cronbach's alpha. The following components emerged from the qualitative analyses: challenging and relaxing activities, activities with acknowledgement by the individual and by the sociocultural context, impact of health condition on activities, involvement in stressful activities and fewer stressing activities, rest and sleep, variety of activities, adaptation of activities according to changed living conditions and activities intended to care for oneself and for others. Based on these, the seven items of the questionnaire (OB-Quest) were developed. 251 people (132 with rheumatoid arthritis, 43 with systematic lupus erythematous and 76 healthy) filled in the OB-Quest. Dimensionality testing indicated multidimensionality of the questionnaire (t = 0.58, and 1.66 after item reduction, non-significant). The item on the component rest and sleep showed differential item functioning (health condition and age). Person separation index was 0.51. Cronbach's alpha changed from 0.38 to 0.57 after deleting two items. This questionnaire includes new items addressing components of occupational balance meaningful to patients and healthy people which have not been measured so far. The reduction of two items of the OB-Quest showed improved internal consistency. The multidimensionality of the questionnaire indicates the need for a summary of several components into subscales.
2014-01-01
Background Self-reported outcome instruments in health research have become increasingly important over the last decades. Occupational therapy interventions often focus on occupational balance. However, instruments to measure occupational balance are scarce. The aim of the study was therefore to develop a generic self-reported outcome instrument to assess occupational balance based on the experiences of patients and healthy people including an examination of its psychometric properties. Methods We conducted a qualitative analysis of the life stories of 90 people with and without chronic autoimmune diseases to identify components of occupational balance. Based on these components, the Occupational Balance-Questionnaire (OB-Quest) was developed. Construct validity and internal consistency of the OB-Quest were examined in quantitative data. We used Rasch analyses to determine overall fit of the items to the Rasch model, person separation index and potential differential item functioning. Dimensionality testing was conducted by the use of t-tests and Cronbach’s alpha. Results The following components emerged from the qualitative analyses: challenging and relaxing activities, activities with acknowledgement by the individual and by the sociocultural context, impact of health condition on activities, involvement in stressful activities and fewer stressing activities, rest and sleep, variety of activities, adaptation of activities according to changed living conditions and activities intended to care for oneself and for others. Based on these, the seven items of the questionnaire (OB-Quest) were developed. 251 people (132 with rheumatoid arthritis, 43 with systematic lupus erythematous and 76 healthy) filled in the OB-Quest. Dimensionality testing indicated multidimensionality of the questionnaire (t = 0.58, and 1.66 after item reduction, non-significant). The item on the component rest and sleep showed differential item functioning (health condition and age). Person separation index was 0.51. Cronbach’s alpha changed from 0.38 to 0.57 after deleting two items. Conclusions This questionnaire includes new items addressing components of occupational balance meaningful to patients and healthy people which have not been measured so far. The reduction of two items of the OB-Quest showed improved internal consistency. The multidimensionality of the questionnaire indicates the need for a summary of several components into subscales. PMID:24708642
Practicing universal design to actual hand tool design process.
Lin, Kai-Chieh; Wu, Chih-Fu
2015-09-01
UD evaluation principles are difficult to implement in product design. This study proposes a methodology for implementing UD in the design process through user participation. The original UD principles and user experience are used to develop the evaluation items. Difference of product types was considered. Factor analysis and Quantification theory type I were used to eliminate considered inappropriate evaluation items and to examine the relationship between evaluation items and product design factors. Product design specifications were established for verification. The results showed that converting user evaluation into crucial design verification factors by the generalized evaluation scale based on product attributes as well as the design factors applications in product design can improve users' UD evaluation. The design process of this study is expected to contribute to user-centered UD application. Copyright © 2015 Elsevier Ltd and The Ergonomics Society. All rights reserved.
Thompson, Laura R; Leung, Cynthia G; Green, Brad; Lipps, Jonathan; Schaffernocker, Troy; Ledford, Cynthia; Davis, John; Way, David P; Kman, Nicholas E
2017-01-01
Medical schools in the United States are encouraged to prepare and certify the entrustment of medical students to perform 13 core entrustable professional activities (EPAs) prior to graduation. Entrustment is defined as the informed belief that the learner is qualified to autonomously perform specific patient-care activities. Core EPA-10 is the entrustment of a graduate to care for the emergent patient. The purpose of this project was to design a realistic performance assessment method for evaluating fourth-year medical students on EPA-10. First, we wrote five emergent patient case-scenarios that a medical trainee would likely confront in an acute care setting. Furthermore, we developed high-fidelity simulations to realistically portray these patient case scenarios. Finally, we designed a performance assessment instrument to evaluate the medical student's performance on executing critical actions related to EPA-10 competencies. Critical actions included the following: triage skills, mustering the medical team, identifying causes of patient decompensation, and initiating care. Up to four students were involved with each case scenario; however, only the team leader was evaluated using the assessment instruments developed for each case. A total of 114 students participated in the EPA-10 assessment during their final year of medical school. Most students demonstrated competence in recognizing unstable vital signs (97%), engaging the team (93%), and making appropriate dispositions (92%). Almost 87% of the students were rated as having reached entrustment to manage the care of an emergent patient (99 of 114). Inter-rater reliability varied by case scenario, ranging from moderate to near-perfect agreement. Three of five case-scenario assessment instruments contained items that were internally consistent at measuring student performance. Additionally, the individual item scores for these case scenarios were highly correlated with the global entrustment decision. High-fidelity simulation showed good potential for effective assessment of medical student entrustment of caring for the emergent patient. Preliminary evidence from this pilot project suggests content validity of most cases and associated checklist items. The assessments also demonstrated moderately strong faculty inter-rater reliability.
Database of Standardized Questionnaires About Walking & Bicycling
This database contains questionnaire items and a list of validation studies for standardized items related to walking and biking. The items come from multiple national and international physical activity questionnaires.
Clinton-McHarg, Tara; Carey, Mariko; Sanson-Fisher, Rob; D'Este, Catherine; Shakeshaft, Anthony
2012-01-30
Adolescents and young adult (AYA) cancer survivors may have unique physical, psychological and social needs due to their cancer occurring at a critical phase of development. The aim of this study was to develop a psychometrically rigorous measure of unmet need to capture the specific needs of this group. Items were developed following a comprehensive literature review, focus groups with AYAs, and feedback from health care providers, researchers and other professionals. The measure was pilot tested with 32 AYA cancer survivors recruited through a state-based cancer registry to establish face and content validity. A main sample of 139 AYA cancer patients and survivors were recruited through seven treatment centres and invited to complete the questionnaire. To establish test-retest reliability, a sub-sample of 34 participants completed the measure a second time. Exploratory factor analysis was performed and the measure was assessed for internal consistency, discriminative validity, potential responsiveness and acceptability. The Cancer Needs Questionnaire - Young People (CNQ-YP) has established face and content validity, and acceptability. The final measure has 70 items and six factors: Treatment Environment and Care (33 items); Feelings and Relationships (14 items); Daily Life (12 items); Information and Activities (5 items); Education (3 items); and Work (3 items). All domains achieved Cronbach's alpha values greater than 0.80. Item-to-item test-retest reliability was also high, with all but four items reaching weighted kappa values above 0.60. The CNQ-YP is the first multi-dimensional measure of unmet need which has been developed specifically for AYA cancer patients and survivors. The measure displays a strong factor structure, and excellent internal consistency and test-retest reliability. However, the small sample size has implications for the reliability of the statistical analyses undertaken, particularly the exploratory factor analysis. Future studies with a larger sample are recommended to confirm the factor structure of the measure. Longitudinal studies to establish responsiveness and predictive validity should also be undertaken.
2012-01-01
Background Adolescents and young adult (AYA) cancer survivors may have unique physical, psychological and social needs due to their cancer occurring at a critical phase of development. The aim of this study was to develop a psychometrically rigorous measure of unmet need to capture the specific needs of this group. Methods Items were developed following a comprehensive literature review, focus groups with AYAs, and feedback from health care providers, researchers and other professionals. The measure was pilot tested with 32 AYA cancer survivors recruited through a state-based cancer registry to establish face and content validity. A main sample of 139 AYA cancer patients and survivors were recruited through seven treatment centres and invited to complete the questionnaire. To establish test-retest reliability, a sub-sample of 34 participants completed the measure a second time. Exploratory factor analysis was performed and the measure was assessed for internal consistency, discriminative validity, potential responsiveness and acceptability. Results The Cancer Needs Questionnaire - Young People (CNQ-YP) has established face and content validity, and acceptability. The final measure has 70 items and six factors: Treatment Environment and Care (33 items); Feelings and Relationships (14 items); Daily Life (12 items); Information and Activities (5 items); Education (3 items); and Work (3 items). All domains achieved Cronbach's alpha values greater than 0.80. Item-to-item test-retest reliability was also high, with all but four items reaching weighted kappa values above 0.60. Conclusions The CNQ-YP is the first multi-dimensional measure of unmet need which has been developed specifically for AYA cancer patients and survivors. The measure displays a strong factor structure, and excellent internal consistency and test-retest reliability. However, the small sample size has implications for the reliability of the statistical analyses undertaken, particularly the exploratory factor analysis. Future studies with a larger sample are recommended to confirm the factor structure of the measure. Longitudinal studies to establish responsiveness and predictive validity should also be undertaken. PMID:22284545
CTTITEM: SAS macro and SPSS syntax for classical item analysis.
Lei, Pui-Wa; Wu, Qiong
2007-08-01
This article describes the functions of a SAS macro and an SPSS syntax that produce common statistics for conventional item analysis including Cronbach's alpha, item difficulty index (p-value or item mean), and item discrimination indices (D-index, point biserial and biserial correlations for dichotomous items and item-total correlation for polytomous items). These programs represent an improvement over the existing SAS and SPSS item analysis routines in terms of completeness and user-friendliness. To promote routine evaluations of item qualities in instrument development of any scale, the programs are available at no charge for interested users. The program codes along with a brief user's manual that contains instructions and examples are downloadable from suen.ed.psu.edu/-pwlei/plei.htm.
Bode, Rita K.; Heinemann, Allen W.; Butt, Zeeshan; Stallings, Jena; Taylor, Caitlin; Rowe, Morgan; Roth, Elliot J.
2013-01-01
Bode RK, Heinemann AW, Butt Z, Stallings J, Taylor C, Rowe M, Roth EJ. Development and validation of participation and positive psychologic function measures for stroke survivors. Objective To evaluate the reliability and validity of Neurologic Quality of Life (NeuroQOL) item banks that assess quality-of-life (QOL) domains not typically included in poststroke measures. Design Secondary analysis of item responses to selected NeuroQOL domains. Setting Community. Participants Community-dwelling stroke survivors (n=111) who were at least 12 months poststroke. Interventions Not applicable. Main Outcome Measures Five measures developed for 3 NeuroQoL domains: ability to participate in social activities, satisfaction with participation in social activities, and positive psychologic function. Results A single bank was developed for the positive psychologic function domain, but 2 banks each were developed for the ability-to-participate and satisfaction-with-participation domains. The resulting item banks showed good psychometric properties and external construct validity with correlations with the legacy instruments, ranging from .53 to .71. Using these measures, stroke survivors in this sample reported an overall high level of QOL. Conclusions The NeuroQoL-derived measures are promising and valid methods for assessing aspects of QOL not typically measured in this population. PMID:20801251
Studer, Joseph; Baggio, Stéphanie; Mohler-Kuo, Meichun; Daeppen, Jean-Bernard; Gmel, Gerhard
2016-03-01
The Behavioural Inhibition System/Behavioural Activation System scales (BIS/BAS scales) constitute one of the most prominent questionnaires to assess individual differences in sensitivity to punishment and reward. However, some studies questioned its validity, especially that of the French and German translations. The aim of the present study was to re-evaluate the psychometric characteristics of the BIS/BAS scales in a large sample of French- and German-speaking young Swiss men (N = 5872). Results showed that factor structures previously found in the literature did not meet the standards of fit. Nine items had to be removed to achieve adequate fit statistics in confirmatory factor analysis, yielding a shortened version with four factors: one BIS factor comprising five items and three BAS factors, namely Reward Reactivity, Drive and Fun Seeking, each comprising two items. Convergent validity and group invariance analyses suggest that the shortened BIS/BAS scales constitute a valid and reliable instrument. Researchers interested in assessing individual differences in BIS and BAS reactivity in French- and German-speaking individuals should avoid using the BIS/BAS scales as originally specified. The shortened version may be a sound alternative at least in samples of young adults. Its shorter format may be particularly suited for surveys with constraints on questionnaire length.
Marfeo, Elizabeth E.; Ni, Pengsheng; Bogusz, Kara; Meterko, Mark; McDonough, Christine M.; Chan, Leighton; Rasch, Elizabeth K.; Brandt, Diane E.; Jette, Alan M.
2014-01-01
Objectives To use item response theory (IRT) data simulations to construct and perform initial psychometric testing of a newly developed instrument, the Social Security Administration Behavioral Health Function (SSA-BH) instrument, that aims to assess behavioral health functioning relevant to the context of work. Design Cross-sectional survey followed by item response theory (IRT) calibration data simulations Setting Community Participants A sample of individuals applying for SSA disability benefits, claimants (N=1015), and a normative comparative sample of US adults (N=1000) Interventions None. Main Outcome Measure Social Security Administration Behavioral Health Function (SSA-BH) measurement instrument Results Item response theory analyses supported the unidimensionality of four SSA-BH scales: Mood and Emotions (35 items), Self-Efficacy (23 items), Social Interactions (6 items), and Behavioral Control (15 items). All SSA-BH scales demonstrated strong psychometric properties including reliability, accuracy, and breadth of coverage. High correlations of the simulated 5- or 10- item CATs with the full item bank indicated robust ability of the CAT approach to comprehensively characterize behavioral health function along four distinct dimensions. Conclusions Initial testing and evaluation of the SSA-BH instrument demonstrated good accuracy, reliability, and content coverage along all four scales. Behavioral function profiles of SSA claimants were generated and compared to age and sex matched norms along four scales: Mood and Emotions, Behavioral Control, Social Interactions, and Self-Efficacy. Utilizing the CAT based approach offers the ability to collect standardized, comprehensive functional information about claimants in an efficient way, which may prove useful in the context of the SSA’s work disability programs. PMID:23542404
Forrest, Christopher B; Meltzer, Lisa J; Marcus, Carole L; de la Motte, Anna; Kratchman, Amy; Buysse, Daniel J; Pilkonis, Paul A; Becker, Brandon D; Bevans, Katherine B
2018-03-13
To develop and evaluate the measurement properties of child-report and parent-proxy versions of the PROMIS ® Pediatric Sleep Disturbance and Sleep-Related Impairment item banks. A national sample of 1,104 children (8-17 years-old) and 1,477 parents of children 5-17 years-old was recruited from an internet panel to evaluate the psychometric properties of 43 sleep health items. A convenience sample of children and parents recruited from a pediatric sleep clinic was obtained to provide evidence of the measures' validity; polysomnography data were collected from a subgroup of these children. Factor analyses suggested two dimensions: sleep disturbance and daytime sleep-related impairment. The final item banks included 15 items for Sleep Disturbance and 13 for Sleep-Related Impairment. Items were calibrated using the graded response model from item response theory. Of the 28 items, 16 are included in the parallel PROMIS adult sleep health measures. Reliability of the measures exceeded 0.90. Validity was supported by correlations with existing measures of pediatric sleep health and higher sleep disturbance and sleep-related impairment scores for children with sleep problems and those with chronic and neurodevelopmental disorders. The sleep health measures were not correlated with results from polysomnography. The PROMIS Pediatric Sleep Disturbance and Sleep-Related Impairment item banks provide subjective assessments of a child's difficulties falling and staying asleep as well as daytime sleepiness and its impact on functioning. They may prove useful in the future for clinical research and practice. Future research should evaluate their responsiveness to clinical change in diverse patient populations.
Combining item response theory with multiple imputation to equate health assessment questionnaires.
Gu, Chenyang; Gutman, Roee
2017-09-01
The assessment of patients' functional status across the continuum of care requires a common patient assessment tool. However, assessment tools that are used in various health care settings differ and cannot be easily contrasted. For example, the Functional Independence Measure (FIM) is used to evaluate the functional status of patients who stay in inpatient rehabilitation facilities, the Minimum Data Set (MDS) is collected for all patients who stay in skilled nursing facilities, and the Outcome and Assessment Information Set (OASIS) is collected if they choose home health care provided by home health agencies. All three instruments or questionnaires include functional status items, but the specific items, rating scales, and instructions for scoring different activities vary between the different settings. We consider equating different health assessment questionnaires as a missing data problem, and propose a variant of predictive mean matching method that relies on Item Response Theory (IRT) models to impute unmeasured item responses. Using real data sets, we simulated missing measurements and compared our proposed approach to existing methods for missing data imputation. We show that, for all of the estimands considered, and in most of the experimental conditions that were examined, the proposed approach provides valid inferences, and generally has better coverages, relatively smaller biases, and shorter interval estimates. The proposed method is further illustrated using a real data set. © 2016, The International Biometric Society.
The neurocognitive basis of borrowed context information.
O'Neill, Meagan; Diana, Rachel A
2017-06-01
Falsely remembered items can be accompanied by episodic context retrieval. This finding is difficult to explain because there is no episode that binds the remembered item to the experimenter-controlled context features. The current study examines the neural correlates of false context retrieval when the context features can be traced to encoding episodes of semantically-similar items. Our neuroimaging results support a "dissociated source" mechanism for context borrowing in false memory. We found that parahippocampal cortex (PHc) activation, thought to indicate context retrieval, was greater during trials that involved context borrowing (an incorrect, but plausible source decision) than during baseline correct context retrieval. In contrast, hippocampal activation, thought to indicate retrieval of an episodic binding, was stronger during correct source retrieval than during context borrowing. Vivid context retrieval during false recollection experiences was also indicated by increased activation in visual perceptual regions for context borrowing as compared to other incorrect source judgments. The pattern of findings suggests that context borrowing can arise when unusually strong activation of a semantically-related item's contextual features drives relatively weak retrieval of the associated episodic binding with failure to confirm the item information within that binding. This dissociated source retrieval mechanism suggests that context-driven episodic retrieval does not necessarily lead to retrieval of specific item details. That is, source information can be retrieved in the absence of item memory. Copyright © 2017 Elsevier Ltd. All rights reserved.
Yu, Dan-Dan; Xie, Yan-Ming; Liao, Xing; Zhi, Ying-Jie; Jiang, Jun-Jie; Chen, Wei
2018-02-01
To evaluate the methodological quality and reporting quality of randomized controlled trials(RCTs) published in China Journal of Chinese Materia Medica, we searched CNKI and China Journal of Chinese Materia webpage to collect RCTs since the establishment of the magazine. The Cochrane risk of bias assessment tool was used to evaluate the methodological quality of RCTs. The CONSORT 2010 list was adopted as reporting quality evaluating tool. Finally, 184 RCTs were included and evaluated methodologically, of which 97 RCTs were evaluated with reporting quality. For the methodological evaluating, 62 trials(33.70%) reported the random sequence generation; 9(4.89%) trials reported the allocation concealment; 25(13.59%) trials adopted the method of blinding; 30(16.30%) trials reported the number of patients withdrawing, dropping out and those lost to follow-up;2 trials (1.09%) reported trial registration and none of the trial reported the trial protocol; only 8(4.35%) trials reported the sample size estimation in details. For reporting quality appraising, 3 reporting items of 25 items were evaluated with high-quality,including: abstract, participants qualified criteria, and statistical methods; 4 reporting items with medium-quality, including purpose, intervention, random sequence method, and data collection of sites and locations; 9 items with low-quality reporting items including title, backgrounds, random sequence types, allocation concealment, blindness, recruitment of subjects, baseline data, harms, and funding;the rest of items were of extremely low quality(the compliance rate of reporting item<10%). On the whole, the methodological and reporting quality of RCTs published in the magazine are generally low. Further improvement in both methodological and reporting quality for RCTs of traditional Chinese medicine are warranted. It is recommended that the international standards and procedures for RCT design should be strictly followed to conduct high-quality trials. At the same time, in order to improve the reporting quality of randomized controlled trials, CONSORT standards should be adopted in the preparation of research reports and submissions. Copyright© by the Chinese Pharmaceutical Association.
Development of the PROMIS health expectancies of smoking item banks.
Edelen, Maria Orlando; Tucker, Joan S; Shadel, William G; Stucky, Brian D; Cerully, Jennifer; Li, Zhen; Hansen, Mark; Cai, Li
2014-09-01
Smokers' health-related outcome expectancies are associated with a number of important constructs in smoking research, yet there are no measures currently available that focus exclusively on this domain. This paper describes the development and evaluation of item banks for assessing the health expectancies of smoking. Using data from a sample of daily (N = 4,201) and nondaily (N = 1,183) smokers, we conducted a series of item factor analyses, item response theory analyses, and differential item functioning analyses (according to gender, age, and race/ethnicity) to arrive at a unidimensional set of health expectancies items for daily and nondaily smokers. We also evaluated the performance of short forms (SFs) and computer adaptive tests (CATs) to efficiently assess health expectancies. A total of 24 items were included in the Health Expectancies item banks; 13 items are common across daily and nondaily smokers, 6 are unique to daily, and 5 are unique to nondaily. For both daily and nondaily smokers, the Health Expectancies item banks are unidimensional, reliable (reliability = 0.95 and 0.96, respectively), and perform similarly across gender, age, and race/ethnicity groups. A SF common to daily and nondaily smokers consists of 6 items (reliability = 0.87). Results from simulated CATs showed that health expectancies can be assessed with good precision with an average of 5-6 items adaptively selected from the item banks. Health expectancies of smoking can be assessed on the basis of these item banks via SFs, CATs, or through a tailored set of items selected for a specific research purpose. © The Author 2014. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Development of the PROMIS nicotine dependence item banks.
Shadel, William G; Edelen, Maria Orlando; Tucker, Joan S; Stucky, Brian D; Hansen, Mark; Cai, Li
2014-09-01
Nicotine dependence is a core construct important for understanding cigarette smoking and smoking cessation behavior. This article describes analyses conducted to develop and evaluate item banks for assessing nicotine dependence among daily and nondaily smokers. Using data from a sample of daily (N = 4,201) and nondaily (N =1,183) smokers, we conducted a series of item factor analyses, item response theory analyses, and differential item functioning analyses (according to gender, age, and race/ethnicity) to arrive at a unidimensional set of nicotine dependence items for daily and nondaily smokers. We also evaluated performance of short forms (SFs) and computer adaptive tests (CATs) to efficiently assess dependence. A total of 32 items were included in the Nicotine Dependence item banks; 22 items are common across daily and nondaily smokers, 5 are unique to daily smokers, and 5 are unique to nondaily smokers. For both daily and nondaily smokers, the Nicotine Dependence item banks are strongly unidimensional, highly reliable (reliability = 0.97 and 0.97, respectively), and perform similarly across gender, age, and race/ethnicity groups. SFs common to daily and nondaily smokers consist of 8 and 4 items (reliability = 0.91 and 0.81, respectively). Results from simulated CATs showed that dependence can be assessed with very good precision for most respondents using fewer than 6 items adaptively selected from the item banks. Nicotine dependence on cigarettes can be assessed on the basis of these item banks via one of the SFs, by using CATs, or through a tailored set of items selected for a specific research purpose. © The Author 2014. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Development of the PROMIS negative psychosocial expectancies of smoking item banks.
Stucky, Brian D; Edelen, Maria Orlando; Tucker, Joan S; Shadel, William G; Cerully, Jennifer; Kuhfeld, Megan; Hansen, Mark; Cai, Li
2014-09-01
Negative psychosocial expectancies of smoking include aspects of social disapproval and disappointment in oneself. This paper describes analyses conducted to develop and evaluate item banks for assessing psychosocial expectancies among daily and nondaily smokers. Using data from a sample of daily (N = 4,201) and nondaily (N =1,183) smokers, we conducted a series of item factor analyses, item response theory analyses, and differential item functioning analyses (according to gender, age, and race/ethnicity) to arrive at a unidimensional set of psychosocial expectancies items for daily and nondaily smokers. We also evaluated performance of short forms (SFs) and computer adaptive tests (CATs) to efficiently assess psychosocial expectancies. A total of 21 items were included in the Psychosocial Expectancies item banks: 14 items are common across daily and nondaily smokers, 6 are unique to daily, and 1 is unique to nondaily. For both daily and nondaily smokers, the Psychosocial Expectancies item banks are strongly unidimensional, highly reliable (reliability = 0.95 and 0.93, respectively), and perform similarly across gender, age, and race/ethnicity groups. A SF common to daily and nondaily smokers consists of 6 items (reliability = 0.85). Results from simulated CATs showed that, on average, fewer than 8 items are needed to assess psychosocial expectancies with adequate precision when using the item banks. Psychosocial expectancies of smoking can be assessed on the basis of these item banks via the SF, by using CAT, or through a tailored set of items selected for a specific research purpose. © The Author 2014. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Goh, Rachel L Z; Kong, Yu Xiang George; McAlinden, Colm; Liu, John; Crowston, Jonathan G; Skalicky, Simon E
2018-01-01
To evaluate the use of smartphone-based virtual reality to objectively assess activity limitation in glaucoma. Cross-sectional study of 93 patients (54 mild, 22 moderate, 17 severe glaucoma). Sociodemographics, visual parameters, Glaucoma Activity Limitation-9 and Visual Function Questionnaire - Utility Index (VFQ-UI) were collected. Mean age was 67.4 ± 13.2 years; 52.7% were male; 65.6% were driving. A smartphone placed inside virtual reality goggles was used to administer the Virtual Reality Glaucoma Visual Function Test (VR-GVFT) to participants, consisting of three parts: stationary, moving ball, driving. Rasch analysis and classical validity tests were conducted to assess performance of VR-GVFT. Twenty-four of 28 stationary test items showed acceptable fit to the Rasch model (person separation 3.02, targeting 0). Eleven of 12 moving ball test items showed acceptable fit (person separation 3.05, targeting 0). No driving test items showed acceptable fit. Stationary test person scores showed good criterion validity, differentiating between glaucoma severity groups ( P = 0.014); modest convergence validity, with mild to moderate correlation with VFQ-UI, better eye (BE) mean deviation, BE pattern deviation, BE central scotoma, worse eye (WE) visual acuity, and contrast sensitivity (CS) in both eyes ( R = 0.243-0.381); and suboptimal divergent validity. Multivariate analysis showed that lower WE CS ( P = 0.044) and greater age ( P = 0.009) were associated with worse stationary test person scores. Smartphone-based virtual reality may be a portable objective simulation test of activity limitation related to glaucomatous visual loss. The use of simulated virtual environments could help better understand the activity limitations that affect patients with glaucoma.
Improving care coordination in primary care.
Wagner, Edward H; Sandhu, Nirmala; Coleman, Katie; Phillips, Kathryn E; Sugarman, Jonathan R
2014-11-01
Although coordinating care is a defining characteristic of primary care, evidence suggests that both patients and providers perceive failures in communication and care when care is received from multiple sources. To examine the utility of a newly developed Care Coordination Model in improving care coordination among participating practices in the Safety Net Medical Home Initiative (SNMHI). In this paper, we used correlation analysis to evaluate whether application of the elements of the Care Coordination Model by SNMHI sites, as measured by the Key Activities Checklist (KAC), was associated with more effective care coordination as measured by another instrument, the PCMH-A. SNMHI measures are practice self-assessments based on the 8 change concepts that define a PCMH, one of which is Care Coordination. For this study, we correlated 12 KAC items that describe activities felt to improve coordination of care with 5 PCMH-A items that indicate the extent to which a practice has developed the capability to effectively coordinate care. Practice staff indicated whether any of the KAC activities were being test, implemented, sustained, or not on 4 occasions. The Care Coordination Model elements-assume accountability, build relationships with care partners, support patients through the referral or transition process, and create connections to support information exchange-were positively correlated with some PCMH-A care coordination items but not others. Activities related to the model were most strongly correlated with following up patients seen in the Emergency Department or discharged from hospital. The analysis provides suggestive evidence that activities consistent with the 4 elements of the Care Coordination Model may enable safety net primary care to better coordinate care for its patients, but further study is clearly needed.
Goh, Rachel L. Z.; McAlinden, Colm; Liu, John; Crowston, Jonathan G.; Skalicky, Simon E.
2018-01-01
Purpose To evaluate the use of smartphone-based virtual reality to objectively assess activity limitation in glaucoma. Methods Cross-sectional study of 93 patients (54 mild, 22 moderate, 17 severe glaucoma). Sociodemographics, visual parameters, Glaucoma Activity Limitation-9 and Visual Function Questionnaire – Utility Index (VFQ-UI) were collected. Mean age was 67.4 ± 13.2 years; 52.7% were male; 65.6% were driving. A smartphone placed inside virtual reality goggles was used to administer the Virtual Reality Glaucoma Visual Function Test (VR-GVFT) to participants, consisting of three parts: stationary, moving ball, driving. Rasch analysis and classical validity tests were conducted to assess performance of VR-GVFT. Results Twenty-four of 28 stationary test items showed acceptable fit to the Rasch model (person separation 3.02, targeting 0). Eleven of 12 moving ball test items showed acceptable fit (person separation 3.05, targeting 0). No driving test items showed acceptable fit. Stationary test person scores showed good criterion validity, differentiating between glaucoma severity groups (P = 0.014); modest convergence validity, with mild to moderate correlation with VFQ-UI, better eye (BE) mean deviation, BE pattern deviation, BE central scotoma, worse eye (WE) visual acuity, and contrast sensitivity (CS) in both eyes (R = 0.243–0.381); and suboptimal divergent validity. Multivariate analysis showed that lower WE CS (P = 0.044) and greater age (P = 0.009) were associated with worse stationary test person scores. Conclusions Smartphone-based virtual reality may be a portable objective simulation test of activity limitation related to glaucomatous visual loss. Translational Relevance The use of simulated virtual environments could help better understand the activity limitations that affect patients with glaucoma. PMID:29372112
Maurer, Marcus; Mathias, Susan D; Crosby, Ross D; Rajput, Yamina; Zazzali, James L
2018-03-19
Chronic spontaneous urticaria (CSU), also known as chronic idiopathic urticaria (CIU), may produce hives, itch, and angioedema. The Urticaria Activity and Impact Measure (U-AIM) is a newly developed 9-item patient-reported measure designed for use in routine clinical practice to assess CSU activity and impact over the previous 7 days. To evaluate validity, responsiveness, and clinically meaningful change of the U-AIM. Data from a 24-week open-label single-arm period of a randomized, placebo-controlled study of omalizumab were used to assess the psychometric properties of U-AIM items for itch, hives, and angioedema. 206 patients (75% female, mean age 44.6 years) were enrolled. At baseline, U-AIM results included prevalent severe itch (55%) and >12 hives (67%), angioedema (15%), and bother by itch (84%), hives (84%), and angioedema (49%). Urticaria Patient Daily Diary (UPDD) mean weekly scores were 15.4 (itch severity), 16.8 (number of hives), and 32.2 (Urticaria Activity Score [UAS7]). At baseline, Weeks 12 and 24, U-AIM itch and hives items and UAS7 proxy scores (the sum of itch severity and number of hives over 7 days) demonstrated strong correlation coefficients with their corresponding measures from the UPDD (itch severity: 0.634-0.806; hives number: 0.735-0.843; UAS7 proxy: 0.724-0.852). Changes in U-AIM scores differentiated patients by their perspective of symptom improvement. Meaningful change thresholds were established for itch severity and number of hives scores (0.8-1.0 for both) and the UAS7 proxy score (10.5-12.5). The U-AIM is valid and responsive to change, and may help clinicians monitor CSU activity and track treatment effectiveness. Copyright © 2018. Published by Elsevier Inc.
Stratified and Maximum Information Item Selection Procedures in Computer Adaptive Testing
ERIC Educational Resources Information Center
Deng, Hui; Ansley, Timothy; Chang, Hua-Hua
2010-01-01
In this study we evaluated and compared three item selection procedures: the maximum Fisher information procedure (F), the a-stratified multistage computer adaptive testing (CAT) (STR), and a refined stratification procedure that allows more items to be selected from the high a strata and fewer items from the low a strata (USTR), along with…
Science Library of Test Items. Volume Two.
ERIC Educational Resources Information Center
New South Wales Dept. of Education, Sydney (Australia).
The second volume of test items in the Science Library of Test Items is intended as a resource to assist teachers in implementing and evaluating science courses in the first 4 years of Australian secondary school. The items were selected from questions submitted to the School Certificate Development Unit by teachers in New South Wales. Only the…
Item Difficulty in the Evaluation of Computer-Based Instruction: An Example from Neuroanatomy
ERIC Educational Resources Information Center
Chariker, Julia H.; Naaz, Farah; Pani, John R.
2012-01-01
This article reports large item effects in a study of computer-based learning of neuroanatomy. Outcome measures of the efficiency of learning, transfer of learning, and generalization of knowledge diverged by a wide margin across test items, with certain sets of items emerging as particularly difficult to master. In addition, the outcomes of…
Statistically Comparing the Performance of Multiple Automated Raters across Multiple Items
ERIC Educational Resources Information Center
Kieftenbeld, Vincent; Boyer, Michelle
2017-01-01
Automated scoring systems are typically evaluated by comparing the performance of a single automated rater item-by-item to human raters. This presents a challenge when the performance of multiple raters needs to be compared across multiple items. Rankings could depend on specifics of the ranking procedure; observed differences could be due to…
Identifying Differential Item Functioning in Multi-Stage Computer Adaptive Testing
ERIC Educational Resources Information Center
Gierl, Mark J.; Lai, Hollis; Li, Johnson
2013-01-01
The purpose of this study is to evaluate the performance of CATSIB (Computer Adaptive Testing-Simultaneous Item Bias Test) for detecting differential item functioning (DIF) when items in the matching and studied subtest are administered adaptively in the context of a realistic multi-stage adaptive test (MST). MST was simulated using a 4-item…
An Empirical Investigation of Methods for Assessing Item Fit for Mixed Format Tests
ERIC Educational Resources Information Center
Chon, Kyong Hee; Lee, Won-Chan; Ansley, Timothy N.
2013-01-01
Empirical information regarding performance of model-fit procedures has been a persistent need in measurement practice. Statistical procedures for evaluating item fit were applied to real test examples that consist of both dichotomously and polytomously scored items. The item fit statistics used in this study included the PARSCALE's G[squared],…
Pancreatitis Quality of Life Instrument: Development of a new instrument
Bova, Carol; Barton, Bruce; Hartigan, Celia
2014-01-01
Objectives: The goal of this project was to develop the first disease-specific instrument for the evaluation of quality of life in chronic pancreatitis. Methods: Focus groups and interview sessions were conducted, with chronic pancreatitis patients, to identify items felt to impact quality of life which were subsequently formatted into a paper-and-pencil instrument. This instrument was used to conduct an online survey by an expert panel of pancreatologists to evaluate its content validity. Finally, the modified instrument was presented to patients during precognitive testing interviews to evaluate its clarity and appropriateness. Results: In total, 10 patients were enrolled in the focus groups and interview sessions where they identified 50 items. Once redundant items were removed, the 40 remaining items were made into a paper-and-pencil instrument referred to as the Pancreatitis Quality of Life Instrument. Through the processes of content validation and precognitive testing, the number of items in the instrument was reduced to 24. Conclusions: This marks the development of the first disease-specific instrument to evaluate quality of life in chronic pancreatitis. It includes unique features not found in generic instruments (economic factors, stigma, and spiritual factors). Although this marks a giant step forward, psychometric evaluation is still needed prior to its clinical use. PMID:26770703
Saverino, Cristina; Fatima, Zainab; Sarraf, Saman; Oder, Anita; Strother, Stephen C.; Grady, Cheryl L.
2016-01-01
Human aging is characterized by reductions in the ability to remember associations between items, despite intact memory for single items. Older adults also show less selectivity in task-related brain activity, such that patterns of activation become less distinct across multiple experimental tasks. This reduced selectivity, or dedifferentiation, has been found for episodic memory, which is often reduced in older adults, but not for semantic memory, which is maintained with age. We used functional magnetic resonance imaging (fMRI) to investigate whether there is a specific reduction in selectivity of brain activity during associative encoding in older adults, but not during item encoding, and whether this reduction predicts associative memory performance. Healthy young and older adults were scanned while performing an incidental-encoding task for pictures of objects and houses under item or associative instructions. An old/new recognition test was administered outside the scanner. We used agnostic canonical variates analysis and split-half resampling to detect whole brain patterns of activation that predicted item vs. associative encoding for stimuli that were later correctly recognized. Older adults had poorer memory for associations than did younger adults, whereas item memory was comparable across groups. Associative encoding trials, but not item encoding trials, were predicted less successfully in older compared to young adults, indicating less distinct patterns of associative-related activity in the older group. Importantly, higher probability of predicting associative encoding trials was related to better associative memory after accounting for age and performance on a battery of neuropsychological tests. These results provide evidence that neural distinctiveness at encoding supports associative memory and that a specific reduction of selectivity in neural recruitment underlies age differences in associative memory. PMID:27082043
Caldieraro, Marco Antonio; Walsh, Samantha; Deckersbach, Thilo; Bobo, William V; Gao, Keming; Ketter, Terence A; Shelton, Richard C; Reilly-Harrington, Noreen A; Tohen, Mauricio; Calabrese, Joseph R; Thase, Michael E; Kocsis, James H; Sylvia, Louisa G; Nierenberg, Andrew A
2017-11-01
Activation encompasses energy and activity and is a central feature of bipolar disorder. However, the impact of activation on treatment response of bipolar depression requires further exploration. The aims of this study were to assess the association of decreased activation and sustained remission in bipolar depression and test for factors that could affect this association. We assessed participants with Diagnostic and Statistical Manual of Mental Disorders (4th ed) bipolar depression ( n = 303) included in a comparative effectiveness study of lithium- and quetiapine-based treatments (the Bipolar CHOICE study). Activation was evaluated using items from the Bipolar Inventory of Symptoms Scale. The selection of these items was based on a dimension of energy and interest symptoms associated with poorer treatment response in major depression. Decreased activation was associated with lower remission rates in the raw analyses and in a logistic regression model adjusted for baseline severity and subsyndromal manic symptoms (odds ratio = 0.899; p = 0.015). The manic features also predicted lower remission (odds ratio = 0.934; p < 0.001). Remission rates were similar in the two treatment groups. Decreased activation and subsyndromal manic symptoms predict lower remission rates in bipolar depression. Patients with these features may require specific treatment approaches, but new studies are necessary to identify treatments that could improve outcomes in this population.