Measuring Perceived Barriers to Physical Activity in Adolescents.
Gunnell, Katie E; Brunet, Jennifer; Wing, Erin K; Bélanger, Mathieu
2015-05-01
Perceived barriers to moderate-to-vigorous physical activity (PA) may contribute to the low rates of moderate-to-vigorous PA in adolescents. We examined the psychometric properties of scores from the perceived barriers to moderate-to-vigorous PA scale (PB-MVPA) by examining composite reliability and validity evidence based on the internal structure of the PB-MVPA and relations with other variables. This study was a cross-sectional analysis of data collected in 2013 from adolescents (N = 507; Mage = 12.40, SD = .62) via self-report scales. Using exploratory and confirmatory factor analyses, we found that perceived barriers were best represented as two factors representing internal (e.g., "I am not interested in physical activity") and external (e.g., "I need equipment I don't have") dimensions. Composite reliability was over .80. Using multiple regression to examine the relationship between perceived barriers and moderate-to-vigorous PA, we found that perceived internal barriers were inversely related to moderate-to-vigorous PA (β = -.32, p < .05). Based on the results of analyses of variance, there were no known-group sex differences for perceived internal and external barriers (p > .26). The PB-MVPA scale demonstrated evidence of score reliability and validity. To improve the understanding of the impact of perceived barriers on moderate-to-vigorous PA in adolescents, researchers should examine internal and external barriers separately.
[Validity and reliability of a scale to assess self-efficacy for physical activity in elderly].
Borges, Rossana Arruda; Rech, Cassiano Ricardo; Meurer, Simone Teresinha; Benedetti, Tânia Rosane Bertoldo
2015-04-01
This study aimed to analyze the confirmatory factor validity and reliability of a self-efficacy scale for physical activity in a sample of 118 elderly (78% women) from 60 to 90 years of age. Mplus 6.1 was used to evaluate the confirmatory factor analysis. Reliability was tested by internal consistency and temporal stability. The original scale consisted of five items with dichotomous answers (yes/no), independently for walking and moderate and vigorous physical activity. The analysis excluded the item related to confidence in performing physical activities when on vacation. Two constructs were identified, called "self-efficacy for walking" and "self-efficacy for moderate and vigorous physical activity", with a factor load ≥ 0.50. Internal consistency was adequate both for walking (> 0.70) and moderate and vigorous physical activity (> 0.80), and temporal stability was adequate for all the items. In conclusion, the self-efficacy scale for physical activity showed adequate validity, reliability, and internal consistency for evaluating this construct in elderly Brazilians.
Hellweg, Stephanie; Schuster-Amft, Corina
2016-07-19
Agitation is frequently observed during early recovery after traumatic brain injury (TBI). Agitated behaviour often interferes with goal-orientated rehabilitation and can be a substantial hindrance to therapy. Despite the relatively high occurrence of agitation in the TBI population, there is no objective assessment available in German (G). An existing scale with excellent psychometric properties is the "Agitated Behavior Scale (ABS)" developed by Corrigan in 1989. The aim of the study was to translate the Agitated Behavior Scale (ABS) into German (ABS-G) and investigate the inter- and intrarater reliability and internal consistency in patients with moderate to severe TBI. A formal nine-step translation and cross-cultural adaptation procedure (TCCA) was applied. Subsequently a prospective observational patient study was conducted. To examine the interrater reliability and internal consistency, two therapists rated 20 patients independently after a therapy session. This procedure was repeated twice on a weekly basis. The intrarater reliability was assessed through video recordings from three patients. Nine raters scored the demonstrated behaviour on the videotape with the ABS-G independently twice within one month. The inter- and intrarater reliability were evaluated with the Spearman rank correlation coefficient and the quadratic weighted kappa. The internal consistency was tested with Cronbach's alpha. Behaviour of 20 patients (18 males; mean age 41 ± 20.7; mean Functional Independence Measure (FIM) cognitive score on admission 7.1 ± 4.04; mean ABS-G score at first observation 17.3 ± 2.83) was assessed threefold. Interrater reliability yielded a correlation coefficient for ABS-G total score of all 60 paired observations of rs = 0.845 and a weighted kappa of 0.738. Intrarater reliability for ABS-G total score ranged between rs = 0.719 and 0.953 and showed a weighted kappa between 0.871 and 0.953. Cronbach's alpha indicated moderate internal consistency (0.661).
This study demonstrates that the ABS-G is a reliable instrument for evaluating agitation in patients with moderate to severe TBI. Hereby it would be possible to monitor agitation objectively and optimise the management of agitated patients according to international recommendations.
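The abstract above names the quadratic weighted kappa but does not show how it is computed. The following is a minimal sketch of the usual definition for two sets of ordinal ratings, assuming equally spaced categories; the function name `quadratic_weighted_kappa` and its arguments are hypothetical, and this is not the authors' analysis code.

```python
def quadratic_weighted_kappa(ratings_a, ratings_b, categories):
    """Quadratic weighted kappa for two equally long lists of ordinal ratings.

    `categories` lists every possible rating in ascending order (an
    assumption of this sketch: the categories are equally spaced).
    """
    k = len(categories)
    index = {c: i for i, c in enumerate(categories)}
    n = len(ratings_a)

    # Observed contingency table of rating pairs.
    observed = [[0] * k for _ in range(k)]
    for a, b in zip(ratings_a, ratings_b):
        observed[index[a]][index[b]] += 1

    row_totals = [sum(observed[i]) for i in range(k)]
    col_totals = [sum(observed[i][j] for i in range(k)) for j in range(k)]

    weighted_observed = 0.0
    weighted_expected = 0.0
    for i in range(k):
        for j in range(k):
            # Disagreement weight grows with the squared category distance.
            weight = (i - j) ** 2 / (k - 1) ** 2
            weighted_observed += weight * observed[i][j]
            # Expected count under independence of the two ratings.
            weighted_expected += weight * row_totals[i] * col_totals[j] / n

    return 1.0 - weighted_observed / weighted_expected
```

Because the weights penalise large disagreements more than near misses, two raters who differ by one scale point are treated far more leniently than raters at opposite ends of the scale.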
ERIC Educational Resources Information Center
Maiano, Christophe; Begarie, Jerome; Morin, Alexandre J. S.; Garbarino, Jean-Marie; Ninot, Gregory
2010-01-01
The purpose of this study was to test the reliability (i.e. internal consistency and test-retest reliability) and construct validity (i.e. content validity, factor validity, measurement invariance, and latent mean invariance) of the Nutrition and Activity Knowledge Scale (NAKS) in a sample of French adolescents with mild to moderate Intellectual…
Loeding, B L; Greenan, J P
1998-12-01
The study examined the validity and reliability of four assessments, with three instruments per domain. Domains included generalizable mathematics, communication, interpersonal relations, and reasoning skills. Participants were deaf, legally blind, or visually impaired students enrolled in vocational classes at residential secondary schools. The researchers estimated the internal consistency reliability, test-retest reliability, and construct validity correlations of three subinstruments: student self-ratings, teacher ratings, and performance assessments. The data suggest that these instruments are highly internally consistent measures of generalizable vocational skills. Four performance assessments have high-to-moderate test-retest reliability estimates, and were generally considered to possess acceptable validity and reliability.
Scapula fractures: interobserver reliability of classification and treatment.
Neuhaus, Valentin; Bot, Arjan G J; Guitton, Thierry G; Ring, David C; Abdel-Ghany, Mahmoud I; Abrams, Jeffrey; Abzug, Joshua M; Adolfsson, Lars E; Balfour, George W; Bamberger, H Brent; Barquet, Antonio; Baskies, Michael; Batson, W Arnold; Baxamusa, Taizoon; Bayne, Grant J; Begue, Thierry; Behrman, Michael; Beingessner, Daphne; Biert, Jan; Bishop, Julius; Alves, Mateus Borges Oliveira; Boyer, Martin; Brilej, Drago; Brink, Peter R G; Brunton, Lance M; Buckley, Richard; Cagnone, Juan Carlos; Calfee, Ryan P; Campinhos, Luiz Augusto B; Cassidy, Charles; Catalano, Louis; Chivers, Karel; Choudhari, Pradeep; Cimerman, Matej; Conflitti, Joseph M; Costanzo, Ralph M; Crist, Brett D; Cross, Brian J; Dantuluri, Phani; Darowish, Michael; de Bedout, Ramon; DeCoster, Thomas; Dennison, David G; DeNoble, Peter H; DeSilva, Gregory; Dienstknecht, Thomas; Duncan, Scott F; Duralde, Xavier A; Durchholz, Holger; Egol, Kenneth; Ekholm, Carl; Elias, Nelson; Erickson, John M; Esparza, J Daniel Espinosa; Fernandes, C H; Fischer, Thomas J; Fischmeister, Martin; Forigua Jaime, E; Getz, Charles L; Gilbert, Richard S; Giordano, Vincenzo; Glaser, David L; Gosens, Taco; Grafe, Michael W; Filho, Jose Eduardo Grandi Ribeiro; Gray, Robert R L; Gulotta, Lawrence V; Gummerson, Nigel William; Hammerberg, Eric Mark; Harvey, Edward; Haverlag, R; Henry, Patrick D G; Hobby, Jonathan L; Hofmeister, Eric P; Hughes, Thomas; Itamura, John; Jebson, Peter; Jenkinson, Richard; Jeray, Kyle; Jones, Christopher M; Jones, Jedediah; Jubel, Axel; Kaar, Scott G; Kabir, K; Kaplan, F Thomas D; Kennedy, Stephen A; Kessler, Michael W; Kimball, Hervey L; Kloen, Peter; Klostermann, Cyrus; Kohut, Georges; Kraan, G A; Kristan, Anze; Loebenberg, Mark I; Malone, Kevin J; Marsh, L; Martineau, Paul A; McAuliffe, John; McGraw, Iain; Mehta, Samir; Merchant, Milind; Metzger, Charles; Meylaerts, S A; Miller, Anna N; Wolf, Jennifer Moriatis; Murachovsky, Joel; Murthi, Anand; Nancollas, Michael; Nolan, Betsy M; Omara, Timothy; Omid, Reza; 
Ortiz, Jose A; Overbeck, Joachim P; Castillo, Alberto Pérez; Pesantez, Rodrigo; Polatsch, Daniel; Porcellini, G; Prayson, Michael; Quell, M; Ragsdell, Matthew M; Reid, James G; Reuver, J M; Richard, Marc J; Richardson, Martin; Rizzo, Marco; Rowinski, Sergio; Rubio, Jorge; Guerrero, Carlos G Sánchez; Satora, Wojciech; Schandelmaier, Peter; Scheer, Johan H; Schmidt, Andrew; Schubkegel, Todd A; Schulte, Leah M; Schumer, Evan D; Sears, Benjamin W; Shafritz, Adam B; Shortt, Nicholas L; Siff, Todd; Silva, Dario Mejia; Smith, Raymond Malcolm; Spruijt, Sander; Stein, Jason A; Pemovska, Emilija Stojkovska; Streubel, Philipp N; Swigart, Carrie; Swiontkowski, Marc; Thomas, George; Tolo, Eric T; Turina, Matthias; Tyllianakis, Minos; van den Bekerom, Michel P J; van der Heide, Huub; van de Sande, M A J; van Eerten, P V; Verbeek, Diederik O F; Hoffmann, David Victoria; Vochteloo, A J H; Wagenmakers, Robert; Wall, Christopher J; Wallensten, Richard; Wascher, Daniel C; Weiss, Lawrence; Wiater, J Michael; Wills, Brian P D; Wint, Jeffrey; Wright, Thomas; Young, Jason P; Zalavras, Charalampos; Zura, Robert D; Zyto, Karol
2014-03-01
There is substantial variation in the classification and management of scapula fractures. The first purpose of this study was to analyze the interobserver reliability of the OTA/AO classification and the New International Classification for Scapula Fractures. The second purpose was to assess the proportion of agreement among orthopaedic surgeons on operative or nonoperative treatment. Web-based reliability study. Independent orthopaedic surgeons from several countries were invited to classify scapular fractures in an online survey. One hundred three orthopaedic surgeons evaluated 35 movies of three-dimensional computerized tomography reconstruction of selected scapular fractures, representing a full spectrum of fracture patterns. Fleiss kappa (κ) was used to assess the reliability of agreement between the surgeons. The overall agreement on the OTA/AO classification was moderate for the types (A, B, and C, κ = 0.54) with a 71% proportion of rater agreement (PA) and for the 9 groups (A1 to C3, κ = 0.47) with a 57% PA. For the New International Classification, the agreement about the intraarticular extension of the fracture (Fossa (F), κ = 0.79) was substantial and the agreement about a fractured body (Body (B), κ = 0.57) or process was moderate (Process (P), κ = 0.53); however, PAs were more than 81%. The agreement on the treatment recommendation was moderate (κ = 0.57) with a 73% PA. The New International Classification was more reliable. Body and process fractures generated more disagreement than intraarticular fractures and need further clear definitions.
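Fleiss kappa, used above to summarise agreement among many surgeons, can be sketched as follows. The input layout (one row per fracture case, one column per category, each cell the number of raters choosing that category) and the function name `fleiss_kappa` are assumptions made for illustration; the study's own software is not described.

```python
def fleiss_kappa(counts):
    """Fleiss kappa from a subjects-by-categories table of rater counts.

    counts[i][j] is the number of raters who assigned subject i to
    category j. Every row must sum to the same number of raters.
    """
    n_subjects = len(counts)
    n_raters = sum(counts[0])
    n_categories = len(counts[0])

    # Overall proportion of all assignments falling in each category.
    p_category = [
        sum(row[j] for row in counts) / (n_subjects * n_raters)
        for j in range(n_categories)
    ]

    # Per-subject agreement: share of rater pairs that agree.
    p_subject = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]

    p_observed = sum(p_subject) / n_subjects
    p_expected = sum(p * p for p in p_category)
    return (p_observed - p_expected) / (1 - p_expected)
```

A value of 1 indicates complete agreement, while values near 0 indicate agreement no better than expected from the category frequencies alone.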
Vatan, Sevginar; Ertaş, Sedar; Lester, David
2011-04-01
In a sample of 100 Turkish psychiatric patients with diagnoses of anxiety disorders, Lester's Helplessness, Hopelessness, and Haplessness inventory had moderate estimates of internal consistency, test-retest reliability, and construct validity.
Aerts, Frank; Carrier, Kathy; Alwood, Becky
2016-01-01
Background: The assessment of clinical manifestation of muscle fatigue is an effective procedure in establishing therapeutic exercise dose. Few studies have evaluated physical therapist reliability in establishing muscle fatigue through detection of changes in quality of movement patterns in a live setting. Objective: The purpose of this study is to evaluate the inter-rater reliability of physical therapists’ ability to detect altered movement patterns due to muscle fatigue. Design: A reliability study in a live setting with multiple raters. Participants: Forty-four healthy individuals (ages 19-35) were evaluated by six physical therapists in a live setting. Methods: Participants were evaluated by physical therapists for altered movement patterns during resisted shoulder rotation. Each participant completed a total of four tests: right shoulder internal rotation, right shoulder external rotation, left shoulder internal rotation and left shoulder external rotation. Results: For all tests combined, the inter-rater reliability for a single rater, ICC (2,1), was .65 (95% CI .60 to .71). This corresponds to moderate inter-rater reliability between physical therapists. Limitations: The results of this study apply only to healthy participants and therefore cannot be generalized to a symptomatic population. Conclusion: Moderate inter-rater reliability was found between physical therapists in establishing muscle fatigue through the observation of sustained altered movement patterns during dynamic resistive shoulder internal and external rotation. PMID:27347241
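The ICC (2,1) reported above is the two-way random-effects, absolute-agreement, single-rater intraclass correlation. A minimal sketch of its calculation from the two-way ANOVA mean squares is given below; the function name `icc_2_1` and the subjects-by-raters input layout are assumptions for illustration, not the authors' code.

```python
def icc_2_1(ratings):
    """ICC(2,1) for a complete subjects-by-raters table of scores.

    ratings[i][j] is the score given to subject i by rater j.
    """
    n = len(ratings)        # number of subjects
    k = len(ratings[0])     # number of raters
    grand_mean = sum(sum(row) for row in ratings) / (n * k)

    subject_means = [sum(row) / k for row in ratings]
    rater_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    # Sums of squares for the two-way layout without replication.
    ss_subjects = k * sum((m - grand_mean) ** 2 for m in subject_means)
    ss_raters = n * sum((m - grand_mean) ** 2 for m in rater_means)
    ss_total = sum((x - grand_mean) ** 2 for row in ratings for x in row)
    ss_error = ss_total - ss_subjects - ss_raters

    ms_subjects = ss_subjects / (n - 1)
    ms_raters = ss_raters / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))

    return (ms_subjects - ms_error) / (
        ms_subjects + (k - 1) * ms_error + k * (ms_raters - ms_error) / n
    )
```

Because the rater mean square appears in the denominator, systematic differences between raters lower this coefficient, which is why it is described as an absolute-agreement index.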
Reliability of the Melbourne assessment of unilateral upper limb function.
Randall, M; Carlin, J B; Chondros, P; Reddihough, D
2001-11-01
This study examines the reliability of the Melbourne Assessment of Unilateral Upper Limb Function: a quantitative test of quality of movement in children with neurological impairment. The assessment was administered to 20 children aged from 5 to 16 years (mean age 9 years 10 months, SD 2 years 10 months) who had various types and degrees of cerebral palsy (CP). The performances of the 20 children during assessment were videotaped for subsequent scoring by 15 occupational therapists. Scores were analyzed for internal consistency of test items, inter- and intrarater reliability of scorings of the same videotapes, and test-retest reliability using repeat videotaping. Results revealed very high internal consistency of test items (alpha=0.96), moderate to high agreement both within and between raters for all test items (intraclass correlations of at least 0.7) apart from item 16 (hand to mouth and down), and high interrater reliability (0.95) and intrarater reliability (0.97) for total test scores. Test-retest results revealed moderate to high intrarater reliability for item totals (mean of 0.83 and 0.79) for each rater and high reliability for test totals (0.98 and 0.97). These findings indicate that the Melbourne Assessment of Unilateral Upper Limb Function is a reliable tool for measuring the quality of unilateral upper-limb movement in children with CP.
Reliability and Validity of Five Mental Health Scales in Older Persons.
ERIC Educational Resources Information Center
Himmelfarb, Samuel; Murrell, Stanley A.
1983-01-01
Assessed five scales as mental health measures for older persons (N=318). The internal consistency reliabilities for the anxiety, depression, and well-being scales were moderately high to high, but the reliabilities for the affect balance scale suggest some caution. Cutting points for the well-being and depression scales are suggested. (Author/JAC)
2012-01-01
Background: The purpose of this study was to examine the internal consistency, test-retest reliability, construct validity and predictive validity of a new German self-report instrument to assess the influence of social support and the physical environment on physical activity in adolescents. Methods: Based on theoretical consideration, the short scales on social support and physical environment were developed and cross-validated in two independent study samples of 9 to 17 year-old girls and boys. The longitudinal sample of Study I (n = 196) was recruited from a German comprehensive school, and subjects in this study completed the questionnaire twice with a between-test interval of seven days. Cronbach’s alphas were computed to determine the internal consistency of the factors. Test-retest reliability of the latent factors was assessed using intra-class coefficients. Factorial validity of the scales was assessed using principal components analysis. Construct validity was determined using a cross-validation technique by performing confirmatory factor analysis with the independent nationwide cross-sectional sample of Study II (n = 430). Correlations between factors and three measures of physical activity (objectively measured moderate-to-vigorous physical activity (MVPA), self-reported habitual MVPA and self-reported recent MVPA) were calculated to determine the predictive validity of the instrument. Results: Construct validity of the social support scale (two factors: parental support and peer support) and the physical environment scale (four factors: convenience, public recreation facilities, safety and private sport providers) was shown. Both scales had moderate test-retest reliability. The factors of the social support scale also had good internal consistency and predictive validity. Internal consistency and predictive validity of the physical environment scale were low to acceptable.
Conclusions: The results of this study indicate moderate to good reliability and construct validity of the social support scale and physical environment scale. Predictive validity was only confirmed for the social support scale but not for the physical environment scale. Hence, it remains unclear if a person’s physical environment has a direct or an indirect effect on physical activity behavior or a moderation function. PMID:22928865
Reimers, Anne K; Jekauc, Darko; Mess, Filip; Mewes, Nadine; Woll, Alexander
2012-08-29
The purpose of this study was to examine the internal consistency, test-retest reliability, construct validity and predictive validity of a new German self-report instrument to assess the influence of social support and the physical environment on physical activity in adolescents. Based on theoretical consideration, the short scales on social support and physical environment were developed and cross-validated in two independent study samples of 9 to 17 year-old girls and boys. The longitudinal sample of Study I (n = 196) was recruited from a German comprehensive school, and subjects in this study completed the questionnaire twice with a between-test interval of seven days. Cronbach's alphas were computed to determine the internal consistency of the factors. Test-retest reliability of the latent factors was assessed using intra-class coefficients. Factorial validity of the scales was assessed using principal components analysis. Construct validity was determined using a cross-validation technique by performing confirmatory factor analysis with the independent nationwide cross-sectional sample of Study II (n = 430). Correlations between factors and three measures of physical activity (objectively measured moderate-to-vigorous physical activity (MVPA), self-reported habitual MVPA and self-reported recent MVPA) were calculated to determine the predictive validity of the instrument. Construct validity of the social support scale (two factors: parental support and peer support) and the physical environment scale (four factors: convenience, public recreation facilities, safety and private sport providers) was shown. Both scales had moderate test-retest reliability. The factors of the social support scale also had good internal consistency and predictive validity. Internal consistency and predictive validity of the physical environment scale were low to acceptable.
The results of this study indicate moderate to good reliability and construct validity of the social support scale and physical environment scale. Predictive validity was only confirmed for the social support scale but not for the physical environment scale. Hence, it remains unclear if a person's physical environment has a direct or an indirect effect on physical activity behavior or a moderation function.
Martínez-Gómez, David; Martínez-de-Haro, Vicente; Pozo, Tamara; Welk, Gregory J; Villagra, Ariel; Calle, Marisa E; Marcos, Ascensión; Veiga, Oscar L
2009-01-01
Questionnaires are feasible instruments to assess physical activity (PA) in large samples. The aim of the current study was to evaluate the reliability and validity of the PAQ-A questionnaire in Spanish adolescents using the measurement of PA by accelerometer as criterion. In a sample of 82 adolescents, aged 12 to 17 years, a 1-week PAQ-A test-retest was administered. Reliability was analyzed by the Intraclass Correlation Coefficient (ICC) and the internal consistency by Cronbach's alpha coefficient. Two hundred thirty-two adolescents, aged 13-17 years, completed the PAQ-A and wore the ActiGraph GT1M accelerometer for 7 days. The PAQ-A was compared against total PA and moderate to vigorous PA (MVPA) obtained by the accelerometer. Test-retest reliability showed ICC = 0.71 for the final score of the PAQ-A. Internal consistency was alpha = 0.65 in the first self-report and alpha = 0.67 in the retest in the sample of 82 adolescents, and alpha = 0.74 in the sample of 232 adolescents. The PAQ-A was moderately correlated with total PA (rho = 0.39) and MVPA (rho = 0.34) assessed by the accelerometer. The PAQ-A showed significant moderate correlations with the accelerometer in boys but not in girls. The PAQ-A questionnaire shows adequate reliability and reasonable validity for assessing PA in Spanish adolescents.
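Several abstracts in this listing report Cronbach's alpha as the index of internal consistency. As a reminder of what that coefficient measures, here is a minimal sketch of the standard formula, alpha = k/(k-1) × (1 − sum of item variances / variance of total scores); the function name `cronbach_alpha` and the respondents-by-items input layout are assumptions for illustration.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items table of scores.

    scores[i][j] is the answer of respondent i to item j; every row
    must have the same number of items (at least two).
    """
    n_items = len(scores[0])

    # Sample variance of each item across respondents.
    item_columns = list(zip(*scores))
    sum_item_variances = sum(variance(column) for column in item_columns)

    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in scores])

    return n_items / (n_items - 1) * (1 - sum_item_variances / total_variance)
```

Alpha rises when items covary strongly relative to their individual variances, so a value such as 0.65 suggests that the items share only a moderate amount of common variance.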
Abma, Femke I; van der Klink, Jac J L; Bültmann, Ute
2013-03-01
The promotion of a sustainable, healthy and productive working life attracts more and more attention. Recently the Work Role Functioning Questionnaire (WRFQ) has been cross-culturally translated and adapted to Dutch. This questionnaire aims to measure the health-related work functioning of workers with health problems. The aim of this study is to evaluate the reliability, validity (including five new items) and responsiveness of the WRFQ 2.0 in the working population. A longitudinal study was conducted among workers. The reliability (internal consistency, test-retest reliability, measurement error), validity (structural validity by factor analysis, construct validity by hypothesis testing) and responsiveness of the WRFQ 2.0 were evaluated. A total of N = 553 workers completed the survey. The final WRFQ 2.0 has four subscales and showed very good internal consistency, moderate test-retest reliability, good construct validity and moderate responsiveness in the working population. The WRFQ was able to distinguish between groups with different levels of mental health, physical health, fatigue and need for recovery. A moderate correlation was found between the WRFQ and the related constructs of work ability and work productivity. A weak relationship was found with general self-rated health, work engagement and work involvement. The WRFQ 2.0 is a reliable and valid instrument to measure health-related work functioning in the working population. Further validation in larger samples is recommended, especially for test-retest reliability, responsiveness and the questionnaire's ability to predict the future course of health-related work functioning.
Vodanovich, Domagoj A; Bicknell, Thomas J; Holland, Anne E; Hill, Catherine J; Cecins, Nola; Jenkins, Sue; McDonald, Christine F; Burge, Angela T; Thompson, Philip; Stirling, Robert G; Lee, Annemarie L
2015-01-01
The chronic respiratory disease questionnaire (CRDQ) is designed to assess health-related quality of life (HRQOL) in chronic respiratory conditions, but its reliability, validity and responsiveness in individuals with mild to moderate non-cystic fibrosis (CF) bronchiectasis are unclear. This study aimed to determine measurement properties of the CRDQ in non-CF bronchiectasis. Participants with non-CF bronchiectasis involved in a randomised controlled trial of exercise training were recruited. Internal consistency was assessed using Cronbach's α. Over 8 weeks, reliability was evaluated using intra-class correlation coefficients and Bland-Altman analysis for measures of agreement. Convergent and divergent validity was assessed by correlations with the other HRQOL questionnaires and the Hospital Anxiety and Depression Scale (HADS). The responsiveness to exercise training was assessed using effect sizes and standardised response means. Eighty-five participants were included (mean age ± SD, 64 ± 13 years). Internal consistency was adequate (>0.7) for all CRDQ domains and the total score. Test-retest reliability ranged from 0.69 to 0.85 for each CRDQ domain and was 0.82 for the total score. Dyspnoea (CRDQ) was related to St George's respiratory questionnaire (SGRQ) symptoms only (r = 0.38), with no relationship to the Leicester cough questionnaire (LCQ) or HADS. Moderate correlations were found between the total score of the CRDQ, the SGRQ (rs = -0.49) and the LCQ score (rs = 0.51). Lower CRDQ scores were associated with higher anxiety and depression (rs = -0.46 to -0.56). The responsiveness of the CRDQ was small (effect size 0.1-0.24). The CRDQ is a valid and reliable measure of HRQOL in mild to moderate non-CF bronchiectasis, but responsiveness was limited. © 2015 S. Karger AG, Basel.
Malec, James F; Kragness, Miriam; Evans, Randall W; Finlay, Karen L; Kent, Ann; Lezak, Muriel D
2003-01-01
To evaluate the internal consistency of the Mayo-Portland Adaptability Inventory (MPAI), further refine the instrument, and provide reference data based on a large, geographically diverse sample of persons with acquired brain injury (ABI). 386 persons, most with moderate to severe ABI. Outpatient, community-based, and residential rehabilitation facilities for persons with ABI located in the United States: West, Midwest, and Southeast. Rasch, item cluster, principal components, and traditional psychometric analyses for internal consistency of MPAI data and subscales. With rescoring of rating scales for 4 items, a 29-item version of the MPAI showed satisfactory internal consistency by Rasch (Person Reliability = .88; Item Reliability = .99) and traditional psychometric indicators (Cronbach's alpha = .89). Three rationally derived subscales for Ability, Activity, and Participation demonstrated psychometric properties that were equivalent to subscales derived empirically through item cluster and factor analyses. For the 3 subscales, Person Reliability ranged from .78 to .79; Item Reliability, from .98 to .99; and Cronbach's alpha, from .76 to .83. Subscales correlated moderately (Pearson r = .49-.65) with each other and strongly with the overall scale (Pearson r = .82-.86). Outcome after ABI is represented by the unitary dimension described by the MPAI. MPAI subscales further define regions of this dimension that may be useful for evaluation of clinical cases and program evaluation.
Baker, Elizabeth A; Ledford, Cynthia H; Fogg, Louis; Way, David P; Park, Yoon Soo
2015-01-01
Construct: Clinical skills are used in the care of patients, including reporting, diagnostic reasoning, and decision-making skills. Written comprehensive new patient admission notes (H&Ps) are a ubiquitous part of student education but are underutilized in the assessment of clinical skills. The interpretive summary, differential diagnosis, explanation of reasoning, and alternatives (IDEA) assessment tool was developed to assess students' clinical skills using written comprehensive new patient admission notes. The validity evidence for assessment of clinical skills using clinical documentation following authentic patient encounters has not been well documented. Diagnostic justification tools and postencounter notes are described in the literature (1,2) but are based on standardized patient encounters. To our knowledge, the IDEA assessment tool is the first published tool that uses medical students' H&Ps to rate students' clinical skills. The IDEA assessment tool is a 15-item instrument that asks evaluators to rate students' reporting, diagnostic reasoning, and decision-making skills based on medical students' new patient admission notes. This study presents validity evidence in support of the IDEA assessment tool using Messick's unified framework, including content (theoretical framework), response process (interrater reliability), internal structure (factor analysis and internal-consistency reliability), and relationship to other variables. Validity evidence is based on results from four studies conducted between 2010 and 2013. First, the factor analysis (2010, n = 216) yielded a three-factor solution, measuring patient story, IDEA, and completeness, with reliabilities of .79, .88, and .79, respectively. Second, an initial interrater reliability study (2010) involving two raters demonstrated fair to moderate consensus (κ = .21-.56, ρ =.42-.79). 
Third, a second interrater reliability study (2011) with 22 trained raters also demonstrated fair to moderate agreement (intraclass correlations [ICCs] = .29-.67). There was moderate reliability for all three skill domains, including reporting skills (ICC = .53), diagnostic reasoning skills (ICC = .64), and decision-making skills (ICC = .63). Fourth, there was a significant correlation between IDEA rating scores (2010-2013) and final Internal Medicine clerkship grades (r = .24), 95% confidence interval (CI) [.15, .33]. The IDEA assessment tool is a novel tool with validity evidence to support its use in the assessment of students' reporting, diagnostic reasoning, and decision-making skills. The moderate reliability achieved supports formative or lower stakes summative uses rather than high-stakes summative judgments.
Kaminer, Y; Blitz, C; Burleson, J A; Kadden, R M; Rounsaville, B J
1998-07-01
The state of the art for treatment efficacy studies now requires manual-guided treatments and tests of therapist adherence. This report provides findings regarding adherence assessment of therapists participating in an investigation of treatment matching in adolescent substance abusers. The Group Sessions Rating Scale (GSRS), a group-therapy process measure, was studied to determine a) its appropriateness for assessing group treatment of adolescents with substance use disorders (SUD), b) interrater reliability, c) internal consistency, and d) ability to discriminate the active ingredients of cognitive-behavioral therapy (CBT) from interactional therapy (IT). Interrater reliabilities were moderate to high, with those for CBT generally higher than those for IT. Internal consistency of CBT items was moderate, whereas that of IT items was moderately high. Discriminability between the two treatment modalities was high. The frequency of active ingredients was generally therapy-specific: high for the relevant and low for the nonrelevant therapeutic modality items. The GSRS was found to be effective in the measurement of treatment process in adolescents with SUD.
A systematic review of the factor structure and reliability of the Spence Children's Anxiety Scale.
Orgilés, Mireia; Fernández-Martínez, Iván; Guillén-Riquelme, Alejandro; Espada, José P; Essau, Cecilia A
2016-01-15
The Spence Children's Anxiety Scale (SCAS) is a widely used instrument for assessing symptoms of anxiety disorders among children and adolescents. Previous studies have demonstrated its good reliability for children and adolescents from different backgrounds. However, remarkable variability in the reliability of the SCAS across studies and inconsistent results regarding its factor structure have been found. The present study aims to examine the SCAS factor structure by means of a systematic review with narrative synthesis, the mean reliability of the SCAS by means of a meta-analysis, and the influence of the moderators on the SCAS reliability. Databases searched for studies published since 1997 included Google Scholar, PsycARTICLES, PsycINFO, Web of Science, and Scopus. Twenty-nine and 32 studies, which examined the factor structure and the internal consistency of the SCAS, respectively, were included. The SCAS was found to have strong internal consistency, influenced by different moderators. The systematic review demonstrated that the original six-factor model was supported by most studies. Factorial invariance studies (across age, gender, country) and test-retest reliability of the SCAS were not examined in this study. It is concluded that the SCAS is a reliable instrument for cross-cultural use, and it is suggested that the original six-factor model is appropriate for cross-cultural application. Copyright © 2015 Elsevier B.V. All rights reserved.
Test-retest reliability of cardinal plane isokinetic hip torque and EMG.
Claiborne, Tina L; Timmons, Mark K; Pincivero, Danny M
2009-10-01
The objective of the present study was to establish test-retest reliability of isokinetic hip torque and prime mover electromyogram (EMG) through the three cardinal planes of motion. Thirteen healthy young adults participated in two experimental sessions, separated by approximately one week. During each session, isokinetic hip torque was evaluated on the Biodex Isokinetic Dynamometer at a velocity of 60 deg/s. Subjects performed three maximal-effort concentric and eccentric contractions, separately, for right and left hip abduction/adduction, flexion/extension, and internal/external rotation. Surface EMGs were sampled from the gluteus maximus, gluteus medius, adductor, medial and lateral hamstring, and rectus femoris muscles during all contractions. Intraclass correlation coefficients (ICC[2,1]) and standard errors of measurement (SEM) were calculated for peak torque for each movement direction and contraction mode, while ICCs were only computed for the EMG data. Motions that demonstrated high torque reliability included concentric hip abduction (right and left), flexion (right and left), extension (right) and internal rotation (right and left), and eccentric hip abduction (left), adduction (left), flexion (right), and extension (right and left) (ICC range=0.81-0.91). Motions with moderate torque reliability included concentric hip adduction (right), extension (left), internal rotation (left), and external rotation (right), and eccentric hip abduction and adduction (right), flexion (left), internal rotation (right and left), and external rotation (right and left) (ICC range=0.49-0.79). The majority of the EMG sampled muscles (n=12 and n=11 for concentric and eccentric contractions, respectively) demonstrated high reliability (ICC=0.81-0.95). Instances of low, or unacceptable, EMG reliability values occurred for the medial hamstring muscle of the left leg (both contraction modes) and the adductor muscle of the right leg during eccentric internal rotation.
The major finding revealed high and moderate levels of between-day reliability of isokinetic hip peak torque and prime mover EMG. It is recommended that the day-to-day variability estimates concomitant with acceptable levels of reliability be considered when attempting to objectify intervention effects on hip muscle performance.
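The ICC(2,1) and SEM statistics named in the abstract above can be illustrated with a short calculation. This is a minimal sketch, not the authors' analysis: it assumes the Shrout and Fleiss two-way random-effects, absolute-agreement, single-measure form of the ICC, and it assumes SEM = SD * sqrt(1 - ICC) with the SD taken over all pooled scores (the abstract does not state which SD was used). The function names and the torque values are hypothetical.

```python
from math import sqrt
from statistics import mean, stdev


def icc_2_1(scores):
    """ICC(2,1) for a table with one row per subject and one column per session.

    Assumed form (Shrout & Fleiss): two-way random effects, absolute
    agreement, single measurement.
    """
    n = len(scores)          # number of subjects
    k = len(scores[0])       # number of sessions (or raters)
    grand = mean(v for row in scores for v in row)
    row_means = [mean(row) for row in scores]
    col_means = [mean(row[j] for row in scores) for j in range(k)]

    # Two-way ANOVA sums of squares without replication.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((v - grand) ** 2 for row in scores for v in row)
    ss_err = ss_total - ss_rows - ss_cols

    msr = ss_rows / (n - 1)              # between-subjects mean square
    msc = ss_cols / (k - 1)              # between-sessions mean square
    mse = ss_err / ((n - 1) * (k - 1))   # residual mean square
    return (msr - mse) / (msr + (k - 1) * mse + k * (msc - mse) / n)


def sem_from_icc(sd, icc):
    """Standard error of measurement, assuming SEM = SD * sqrt(1 - ICC)."""
    return sd * sqrt(1.0 - icc)


# Hypothetical peak torque values (N*m) for five subjects on two test days.
torque = [[100.0, 104.0], [120.0, 118.0], [95.0, 99.0],
          [130.0, 127.0], [110.0, 113.0]]
icc = icc_2_1(torque)
pooled_sd = stdev([v for row in torque for v in row])
print(f"ICC(2,1) = {icc:.2f}, SEM = {sem_from_icc(pooled_sd, icc):.2f} N*m")
```

Because the SEM is expressed in the units of the measurement, it gives the day-to-day variability estimate that the abstract recommends weighing against any intervention effect.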
Milanović, Zoran; Pantelić, Saša; Trajković, Nebojša; Jorgić, Bojan; Sporiš, Goran; Bratić, Milovan
2014-01-01
The purpose of this study was to determine the test-retest reliability of the International Physical Activity Questionnaire (IPAQ) for older adults in Serbia. Six hundred and sixty older adults (352 men, 53%; 308 women, 47%; mean age 67.65±5.76 years) participated in the study. To examine test-retest reliability, the participants were asked to complete the IPAQ on two occasions 2 weeks apart. Moderate reliability was observed between the repeated IPAQ administrations, with intraclass correlation coefficients ranging from 0.53 to 0.91. Reliability was lowest for leisure-time activity (0.53) and highest for the transport domain (0.91). Men and women had similar intraclass correlation coefficients for total physical activity (0.71 versus 0.74, respectively), while the largest sex difference was observed for housework (0.68 in men versus 0.90 in women). Our study shows that the long version of the IPAQ is a reliable instrument for assessing physical activity levels in older adults and that it may be useful for generating internationally comparable data.
Validation of a new classification system for skin tears.
LeBlanc, Kimberly; Baranoski, Sharon; Holloway, Samantha; Langemo, Diane
2013-06-01
The aim of this study was to validate and establish reliability of the International Skin Tear classification system. A consensus panel of 12 internationally recognized key opinion leaders convened in 2011 to establish consensus statements on the prevention, prediction, assessment, and treatment of skin tears. Subsequently, a new skin tear classification system was proposed. The system was then tested for interrater and intrarater reliability between the experts before being tested more widely on a sample of 327 individuals from the United States, Canada, and Europe. The results of the study indicated a substantial level of agreement for the expert panel (Fleiss κ = 0.619; 2-month follow-up = 0.653). Intrarater reliability was high (Cohen κ = 0.877). Interrater reliability was moderate (Fleiss κ = 0.555) for healthcare professionals (n = 303) and fair for non-health professionals (Fleiss κ = 0.338; n = 24). This international study established the reliability and validity of a new classification system for skin tears.
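The kappa values and verbal labels in the abstract above can be made concrete with a small calculation. This is a minimal sketch under stated assumptions: it shows Cohen's kappa for two sets of ratings (the intrarater case), and it assumes the commonly cited Landis and Koch bands for the verbal labels, which appear consistent with the abstract's use of "fair", "moderate", and "substantial". The function names and the example ratings are hypothetical, and the Fleiss kappa used for the multi-rater results is not implemented here.

```python
from collections import Counter


def cohen_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two equally long lists of categorical ratings."""
    n = len(ratings_a)
    # Observed proportion of agreement.
    p_obs = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Agreement expected by chance from the marginal category frequencies.
    count_a, count_b = Counter(ratings_a), Counter(ratings_b)
    categories = set(ratings_a) | set(ratings_b)
    p_exp = sum(count_a[c] * count_b[c] for c in categories) / (n * n)
    # Undefined (division by zero) when both raters use a single category.
    return (p_obs - p_exp) / (1.0 - p_exp)


def landis_koch_label(kappa):
    """Verbal label for kappa, assuming the Landis and Koch bands."""
    if kappa < 0:
        return "poor"
    bands = [(0.20, "slight"), (0.40, "fair"),
             (0.60, "moderate"), (0.80, "substantial")]
    for upper, label in bands:
        if kappa <= upper:
            return label
    return "almost perfect"


# Hypothetical skin tear types assigned by one rater on two occasions.
first_pass = [1, 1, 2, 2, 3, 3, 1, 2]
second_pass = [1, 1, 2, 3, 3, 3, 1, 2]
k = cohen_kappa(first_pass, second_pass)
print(f"kappa = {k:.3f} ({landis_koch_label(k)})")

# Labels for the values reported in the abstract, under the assumed bands.
for reported in (0.619, 0.555, 0.338):
    print(reported, landis_koch_label(reported))
```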
Internal consistency and stability of the CANTAB neuropsychological test battery in children.
Syväoja, Heidi J; Tammelin, Tuija H; Ahonen, Timo; Räsänen, Pekka; Tolvanen, Asko; Kankaanpää, Anna; Kantomaa, Marko T
2015-06-01
The Cambridge Neuropsychological Test Automated Battery (CANTAB) is a computer-assessed test battery widely used in different populations. The internal consistency and 1-year stability of CANTAB tests were examined in school-age children. Two hundred thirty children (57% girls) from five schools in the Jyväskylä school district in Finland participated in the study in spring 2011. The children completed the following CANTAB tests: (a) visual memory (pattern recognition memory [PRM] and spatial recognition memory [SRM]), (b) executive function (spatial span [SSP], Stockings of Cambridge [SOC], and intra-extra dimensional set shift [IED]), and (c) attention (reaction time [RTI] and rapid visual information processing [RVP]). Seventy-four children participated in the follow-up measurements (64% girls) in spring 2012. Cronbach's alpha reliability coefficient was used to estimate the internal consistency of the nonhampering test, and structural equation models were applied to examine the stability of these tests. The reliability and the stability could not be determined for IED or SSP because of the nature of these tests. The internal consistency was acceptable only in the RTI task. The 1-year stability was moderate-to-good for the PRM, RTI, and RVP. The SSP and IED showed a moderate correlation between the two measurement points. The SRM and the SOC tasks were not reliable or stable measures in this study population. For research purposes, we recommend using structural equation modeling to improve reliability. The results suggest that the reliability and the stability of computer-based test batteries should be confirmed in the target population before using them for clinical or research purposes. (c) 2015 APA, all rights reserved.
Validity of selected physical activity questions in white Seventh-day Adventists and non-Adventists.
Singh, P N; Tonstad, S; Abbey, D E; Fraser, G E
1996-08-01
The validity and reliability of selected physical activity questions were assessed in both Seventh-day Adventist (N = 131) and non-Adventist (N = 101) study groups. Vigorous activity questions similar to those used by others and new questions that measured moderate and light activities were included. Validation was external, comparing questionnaire data with treadmill exercise time, resting heart rate, and body mass index (kg·m⁻²), and internal, comparing data with other similar questions. Both Adventist and non-Adventist males showed significant age-adjusted correlations between treadmill time and a "Run-Walk-Jog Index" (R = 0.28, R = 0.48, respectively). These correlations increased substantially when restricting analysis to exercise speeds exceeding 3 mph (R = 0.39, R = 0.71, respectively). Frequency of sweating and a vigorous physical activity index also correlated significantly with treadmill time in males. Correlations were generally weaker in females. Moderate- and light-intensity questions were not correlated with physical fitness. Internal correlations (R = 0.50-0.78) between the above three vigorous activity questions were significant in all groups, and correlations (R = 0.14-0.60) for light and moderate activity questions were also documented. Test-retest reliability coefficients were high for vigorous activity questions (R = 0.48-0.85) and for one set of moderate activity questions (R = 0.43-0.75). No important differences in validity and reliability were found between Adventists and non-Adventists, but the validity of vigorous activity measures was generally weaker in females.
Øhre, Beate; Saltnes, Hege; von Tetzchner, Stephen; Falkum, Erik
2014-05-22
There is a need for psychiatric assessment instruments that enable reliable diagnoses in persons with hearing loss who have sign language as their primary language. The objective of this study was to assess the validity of the Norwegian Sign Language (NSL) version of the Mini International Neuropsychiatric Interview (MINI). The MINI was translated into NSL. Forty-one signing patients consecutively referred to two specialised psychiatric units were assessed with a diagnostic interview by clinical experts and with the MINI. Inter-rater reliability was assessed with Cohen's kappa and "observed agreement". There was 65% agreement between MINI diagnoses and clinical expert diagnoses. Kappa values indicated fair to moderate agreement, and observed agreement was above 76% for all diagnoses. The MINI diagnosed more co-morbid conditions than did the clinical expert interview (mean diagnoses: 1.9 versus 1.2). Kappa values indicated moderate to substantial agreement, and "observed agreement" was above 88%. The NSL version performs similarly to other MINI versions and demonstrates adequate reliability and validity as a diagnostic instrument for assessing mental disorders in persons who have sign language as their primary and preferred language.
Trathitiphan, Warayos; Paholpak, Permsak; Sirichativapee, Winai; Wisanuyotin, Taweechok; Laupattarakasem, Pat; Sukhonthamarn, Kamolsak; Jeeravipoolvarn, Polasak; Kosuwon, Weerachai
2016-10-01
HOOS was developed as an extension of the Western Ontario and McMaster Universities' Osteoarthritis Index questionnaire for measuring symptoms and functional limitations related to the hip(s) of patients with osteoarthritis. To determine the validity and reliability of the Thai version of the Hip disability and Osteoarthritis Outcome Score (HOOS) vis-à-vis hip osteoarthritis, the original HOOS was translated into a Thai version of HOOS, according to international recommendations. Patients with hip osteoarthritis (n = 57; 25 males) were asked to complete the Thai version of HOOS twice: once, then again after a 3-week interval. The test-retest reliability was analyzed using the intraclass correlation coefficient (ICC). Internal consistencies were analyzed using Cronbach's alpha, while the construct validity was tested by comparing the Thai HOOS with the Thai modified SF-36 and calculating the Spearman's rank correlation coefficients. The Thai HOOS produced good reliability (i.e., the ICC was greater than 0.9 in all five subscales). Cronbach's alpha values indicated high internal consistency for the Thai HOOS (all greater than 0.8), especially for the pain and ADL subscales (0.89 and 0.90, respectively). All five subscales of the Thai HOOS correlated moderately (Spearman's rank correlation) with the Bodily Pain subscale of the Thai SF-36. The pain subscale of the Thai HOOS had a high correlation with the Vitality and Social Function subscales of the Thai SF-36 (r = 0.55 and 0.54), with which the symptom subscale had a moderate correlation. The Thai version of HOOS had excellent internal consistency, excellent test-retest reliability, and good construct validity. It can be used as a reliable tool for assessing quality of life for patients with hip osteoarthritis in Thailand.
Measurement properties of tools measuring mental health knowledge: a systematic review.
Wei, Yifeng; McGrath, Patrick J; Hayden, Jill; Kutcher, Stan
2016-08-23
Mental health literacy has received great attention recently to improve mental health knowledge, decrease stigma and enhance help-seeking behaviors. We conducted a systematic review to critically appraise the qualities of studies evaluating the measurement properties of mental health knowledge tools and the quality of included measurement properties. We searched PubMed, PsycINFO, EMBASE, CINAHL, the Cochrane Library, and ERIC for studies addressing psychometrics of mental health knowledge tools and published in English. We applied the COSMIN checklist to assess the methodological quality of each study as "excellent", "good", "fair", or "indeterminate". We ranked the level of evidence of the overall quality of each measurement property across studies as "strong", "moderate", "limited", "conflicting", or "unknown". We identified 16 mental health knowledge tools in 17 studies, addressing reliability, validity, responsiveness or measurement errors. The methodological quality of included studies ranged from "poor" to "excellent" including 6 studies addressing the content validity, internal consistency or structural validity demonstrating "excellent" quality. We found strong evidence of the content validity or internal consistency of 6 tools; moderate evidence of the internal consistency, the content validity or the reliability of 8 tools; and limited evidence of the reliability, the structural validity, the criterion validity, or the construct validity of 12 tools. Both the methodological qualities of included studies and the overall evidence of measurement properties are mixed. Based on the current evidence, we recommend that researchers consider using tools with measurement properties of strong or moderate evidence that also reached the threshold for positive ratings according to COSMIN checklist.
Validation of the Short Form of the Academic Procrastination Scale.
Yockey, Ronald D
2016-02-01
The factor structure, internal consistency reliability, and convergent validity of the five-item Academic Procrastination Scale-Short Form was investigated on an ethnically diverse sample of college students. The results provided support for the Academic Procrastination Scale-Short Form as a unidimensional measure of academic procrastination, which possessed good internal consistency reliability in this sample of 282 students. The scale also demonstrated good convergent validity, with moderate to large correlations with both the Procrastination Assessment Scale-Students and the Tuckman Procrastination Scale. Implications of the results are discussed and recommendations for future work provided.
Helou, Khalil; El Helou, Nour; Mahfouz, Maya; Mahfouz, Yara; Salameh, Pascale; Harmouche-Karaki, Mireille
2017-07-24
The International Physical Activity Questionnaire (IPAQ) is a validated tool for physical activity assessment used in many countries; however, no Arabic version of the long form of this questionnaire exists to date. Hence, the aim of this study was to cross-culturally adapt and validate an Arabic version of the long International Physical Activity Questionnaire (A-IPAQ) equivalent to the French version (F-IPAQ) in a Lebanese population. The guidelines for cross-cultural adaptation provided by the World Health Organization and the International Physical Activity Questionnaire committee were followed. One hundred fifty-nine students and staff members from Saint Joseph University of Beirut were randomly recruited to participate in the study. Items of the A-IPAQ were compared to those from the F-IPAQ for concurrent validity using Spearman's correlation coefficient. Content validity of the questionnaire was assessed using factor analysis for the A-IPAQ's items. The physical activity indicators derived from the A-IPAQ were compared with the body mass index (BMI) of the participants for construct validity. The instrument was also evaluated for internal consistency reliability using Cronbach's alpha and Intraclass Correlation Coefficient (ICC). Finally, thirty-one participants were asked to complete the A-IPAQ on two occasions three weeks apart to examine its test-retest reliability. Bland-Altman analyses were performed to evaluate the extent of agreement between the two versions of the questionnaire and its repeated administrations. A high correlation was observed between answers of the F-IPAQ and those of the A-IPAQ, with Spearman's correlation coefficients ranging from 0.91 to 1.00 (p < 0.05). Bland-Altman analysis showed a high level of agreement between the two versions with all values scattered around the mean for total physical activity (mean difference = 5.3 min/week, 95% limits of agreement = -145.2 to 155.8).
Negative correlations were observed between MET values and BMI, independent of age, gender or university campus. The A-IPAQ showed a high internal consistency reliability with Cronbach's alpha ranging from 0.769-1.00 (p < 0.001) and intraclass correlation coefficient (ICC) ranging from 0.625-0.999 (p < 0.001), except for a moderate agreement with the moderate garden/yard activity (alpha = 0.682; ICC = 0.518; p < 0.001). The A-IPAQ had moderate-to-good test-retest reliability for most of its items (ICC ranging from 0.66-0.96; p < 0.001) and the Bland-Altman analysis showed a satisfactory agreement between the two administrations of the A-IPAQ for total physical activity (mean difference = 99.8 min/week, 95% limits of agreement = -1105.3; 1304.9) and total vigorous and moderate physical activity (mean difference = -29.7 min/week, 95% limits of agreement = -777.6; 718.2). The modified Arabic version of the IPAQ showed acceptable validity and reliability for the assessment of physical activity among Lebanese adults. More studies are necessary in the future to assess its validity compared to a gold-standard criterion measure.
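The Bland-Altman figures reported above (a mean difference with 95% limits of agreement) follow a simple recipe that can be sketched in a few lines. This is a minimal illustration, not the authors' code: it assumes the conventional limits of mean difference ± 1.96 × SD of the paired differences, with the sample SD. The function name and the minutes-per-week values are hypothetical.

```python
from statistics import mean, stdev


def bland_altman(first, second, z=1.96):
    """Return (mean difference, lower limit, upper limit) for paired data.

    Assumes the conventional 95% limits of agreement:
    mean difference +/- z * sample SD of the paired differences.
    """
    diffs = [a - b for a, b in zip(first, second)]
    bias = mean(diffs)
    spread = z * stdev(diffs)
    return bias, bias - spread, bias + spread


# Hypothetical total physical activity (min/week) from two administrations.
week_0 = [420.0, 310.0, 600.0, 150.0, 480.0, 275.0]
week_3 = [400.0, 350.0, 560.0, 180.0, 470.0, 300.0]
bias, lower, upper = bland_altman(week_0, week_3)
print(f"mean difference = {bias:.1f}, limits = {lower:.1f} to {upper:.1f}")
```

Limits built this way are symmetric about the mean difference, and the reported values fit that pattern: for the comparison of the two language versions, (-145.2 + 155.8) / 2 = 5.3, which matches the reported mean difference of 5.3 min/week.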
Comparison of three instruments for measuring patient anxiety in a coronary care unit.
Elliott, D
1993-09-01
This paper compares the State-Trait Anxiety Inventory (STAI), Hospital Anxiety and Depression Scale (HAD Scale) and a Linear Analogue Anxiety Scale (LAAS) for evaluating anxiety in patients with acute ischaemic heart disease. The instruments were examined for correlation, reliability and internal consistency. Strong associations were demonstrated at pre-test between the STAI and the other scales. Moderate coefficients between HAD-A and HAD-D/LAAS were also apparent. Lower correlations were found at post-test than at pre-test. At post-test, strong inter-correlations occurred for STAI/LAAS. The HAD Scale demonstrated high test-retest reliability, while the STAI and LAAS were moderate in their reliability in this sample. The adequate correlation between the instruments suggests that each is a valid and appropriate measure of anxiety in this clinical sample.
Kyte, Derek; Cockwell, Paul; Marshall, Tom; Gheorghe, Adrian; Keeley, Thomas; Slade, Anita; Calvert, Melanie
2017-01-01
Background Patient-reported outcome measures (PROMs) can provide valuable information which may assist with the care of patients with chronic kidney disease (CKD). However, given the large number of measures available, it is unclear which PROMs are suitable for use in research or clinical practice. To address this we comprehensively evaluated studies that assessed the measurement properties of PROMs in adults with CKD. Methods Four databases were searched; reference list and citation searching of included studies was also conducted. The COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist was used to appraise the methodological quality of the included studies and to inform a best evidence synthesis for each PROM. Results The search strategy retrieved 3,702 titles/abstracts. After 288 duplicates were removed, 3,414 abstracts were screened and 71 full-text articles were retrieved for further review. Of these, 24 full-text articles were excluded as they did not meet the eligibility criteria. Following reference list and citation searching, 19 articles were retrieved bringing the total number of papers included in the final analysis to 66. There was strong evidence supporting internal consistency and moderate evidence supporting construct validity for the Kidney Disease Quality of Life-36 (KDQOL-36) in pre-dialysis patients. In the dialysis population, the KDQOL-Short Form (KDQOL-SF) had strong evidence for internal consistency and structural validity and moderate evidence for test-retest reliability and construct validity while the KDQOL-36 had moderate evidence of internal consistency, test-retest reliability and construct validity. The End Stage Renal Disease-Symptom Checklist Transplantation Module (ESRD-SCLTM) demonstrated strong evidence for internal consistency and moderate evidence for test-retest reliability, structural and construct validity in renal transplant recipients. 
Conclusions We suggest considering the KDQOL-36 for use in pre-dialysis patients; the KDQOL-SF or KDQOL-36 for dialysis patients and the ESRD-SCLTM for use in transplant recipients. However, further research is required to evaluate the measurement error, structural validity, responsiveness and patient acceptability of PROMs used in CKD. PMID:28636678
Keessen, Paul; Maaskant, Jolanda; Visser, Bart
2018-08-01
The standardized Mensendieck test (SMT) was developed to quantify posture, movement, gait, and respiration. In the hands of an experienced therapist, the SMT is proven to be a reliable tool. It is unclear whether posture, movement, gait, and respiration are related to the degree of functional disability in patients with chronic pain. The objective of this study was to assess the reliability and convergent validity of the SMT in a heterogeneous sample of 50 patients with chronic pain. Internal consistency was determined by Cronbach's α and interrater reliability by the intraclass correlation coefficient (ICC). Convergent validity was assessed by determining the Spearman rank correlation coefficient between the movement quality measured in the SMT and functional limitation measured on the disability rating index (DRI). The internal consistency was Cronbach's α 0.91. Substantial reliability was found for the items: movement (ICC = 0.68), gait (ICC = 0.69), sitting posture (ICC = 0.63), and respiration (ICC = 0.64). Insufficient reliability was found for standing posture (ICC = 0.23). A moderate correlation was found between average test score SMT and the DRI (r = -0.37) and respiration and DRI (r = -0.45). The SMT is a reasonably reliable tool to assess movement, gait, sitting posture, and respiration. None of the items in the domain standing posture has sufficient reliability. A thorough study of this domain should be considered. The results show little evidence for convergent validity. Several items of the SMT correlated moderately with functional limitation with the DRI. These items were global movement, hip flexion, pelvis rotation, and all respiration items.
Reliability and validity of television food advertising questionnaire in Malaysia.
Zalma, Abdul Razak; Safiah, Md Yusof; Ajau, Danis; Khairil Anuar, Md Isa
2015-09-01
Interventions to counter the influence of television food advertising amongst children are important. Thus, a reliable and valid instrument to assess its effect is needed. The objective of this study was to determine the reliability and validity of such a questionnaire. The questionnaire was administered twice on 32 primary schoolchildren aged 10-11 years in Selangor, Malaysia. The interval between the first and second administration was 2 weeks. Test-retest method was used to examine the reliability of the questionnaire. Intra-rater reliability was determined by kappa coefficient and internal consistency by Cronbach's alpha coefficient. Construct validity was evaluated using factor analysis. The test-retest correlation showed moderate-to-high reliability for all scores (r = 0.40*, p = 0.02 to r = 0.95**, p = 0.00), with one exception, consumption of fast foods (r = 0.24, p = 0.20). Kappa coefficient showed acceptable-to-strong intra-rater reliability (K = 0.40-0.92), except for two items under knowledge on television food advertising (K = 0.26 and K = 0.21) and one item under preference for healthier foods (K = 0.33). Cronbach's alpha coefficient indicated acceptable internal consistency for all scores (0.45-0.60). After deleting two items under Consumption of Commonly Advertised Food, the items showed moderate-to-high loading (0.52, 0.84, 0.42 and 0.42) with the Scree plot showing that there was only one factor. The Kaiser-Meyer-Olkin was 0.60, showing that the sample was adequate for factor analysis. The questionnaire on television food advertising is reliable and valid to assess the effect of media literacy education on television food advertising on schoolchildren. © The Author (2013). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Ashur, S T; Shamsuddin, K; Shah, S A; Bosseri, S; Morisky, D E
2015-12-13
No validation study has previously been made for the Arabic version of the 8-item Morisky Medication Adherence Scale (MMAS-8(©)) as a measure for medication adherence in diabetes. This study in 2013 tested the reliability and validity of the Arabic MMAS-8 for type 2 diabetes mellitus patients attending a referral centre in Tripoli, Libya. A convenience sample of 103 patients self-completed the questionnaire. Reliability was tested using Cronbach alpha, average inter-item correlation and Spearman-Brown coefficient. Known-group validity was tested by comparing MMAS-8 scores of patients grouped by glycaemic control. The Arabic version showed adequate internal consistency (α = 0.70) and moderate split-half reliability (r = 0.65). Known-group validity was supported as a significant association was found between medication adherence and glycaemic control, with a moderate effect size (ϕc = 0.34). The Arabic version displayed good psychometric properties and could support diabetes research and practice in Arab countries.
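The internal consistency statistics named in the abstract above (Cronbach's alpha and the Spearman-Brown coefficient) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' analysis: alpha is computed from sample variances of the items and of the total score, and the Spearman-Brown formula shown is the standard correction of a half-test correlation to full test length. The abstract does not say whether its reported r = 0.65 is a corrected or uncorrected value, so no attempt is made to reproduce it. The function names and item responses are hypothetical.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for rows of item scores (one row per respondent)."""
    k = len(responses[0])  # number of items
    item_vars = [variance([row[j] for row in responses]) for j in range(k)]
    total_var = variance([sum(row) for row in responses])
    return k / (k - 1) * (1.0 - sum(item_vars) / total_var)


def spearman_brown(r_half):
    """Full-length reliability predicted from a half-test correlation."""
    return 2.0 * r_half / (1.0 + r_half)


# Hypothetical yes/no answers (1 = adherent) from six patients on four items.
answers = [
    [1, 1, 1, 0],
    [1, 0, 1, 1],
    [0, 0, 0, 0],
    [1, 1, 1, 1],
    [0, 1, 0, 0],
    [1, 1, 0, 1],
]
print(f"alpha = {cronbach_alpha(answers):.2f}")
print(f"Spearman-Brown from r = 0.50: {spearman_brown(0.50):.2f}")
```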
Smith, Toby O; Clark, Allan; Neda, Sophia; Arendt, Elizabeth A; Post, William R; Grelsamer, Ronald P; Dejour, David; Almqvist, Karl Fredrik; Donell, Simon T
2012-08-01
An accurate physical examination of patients with patellar instability is an important aspect of the diagnosis and treatment. While previous studies have assessed the diagnostic accuracy of such physical examination tests, little has been undertaken to assess the inter- and intra-tester reliability of such techniques. The purpose of this study was to determine the inter- and intra-tester reliability of the physical examination tests used for patients with patellar instability. Five patients (10 knees) with bilateral recurrent patellar instability were assessed by five members of the International Patellofemoral Study Group. Each surgeon assessed each patient twice using 18 reported physical examination tests. The inter- and intra-observer reliability was assessed using weighted Kappa statistics with 95% confidence intervals. The findings of the study suggested that there was very poor inter-observer reliability for the majority of the physical tests, with only the assessments of patellofemoral crepitus, foot arch position and the J-sign presenting with fair to moderate agreement. The intra-observer reliability indicated largely moderate to substantial agreement between the first and second tests performed by each assessor, with the greatest agreement seen for the assessment of tibial torsion, popliteal angle and the Bassett's sign. For the common physical examination tests used in the management of patients with patellar instability, inter-observer reliability is poor, while intra-observer reliability is moderate. Standardization of physical exam assessments and further study of these results among different clinicians and more divergent patient groups is indicated. Copyright © 2011 Elsevier B.V. All rights reserved.
Children's Social Desirability and Dietary Reports.
Baxter, Suzanne Domel; Smith, Albert F; Litaker, Mark S; Baglio, Michelle L; Guinn, Caroline H; Shaffer, Nicole M
2004-01-01
We investigated telephone administration of the Children's Social Desirability (CSD) scale and our adaptation for children of the Social Desirability for Food scale (C-SDF). Each of 100 4th-graders completed 2 telephone interviews 28 days apart. CSD scores had adequate internal consistency and test-retest reliability, and a 14-item subset was identified that sufficiently measures the same construct. Our C-SDF scale performed less well in terms of internal consistency and test-retest reliability; factor analysis revealed 2 factors, 1 of which was moderately related to the CSD. The 14-item subset of the CSD scale may help researchers understand error in children's dietary reports.
Children's Social Desirability and Dietary Reports
Baxter, Suzanne Domel; Smith, Albert F.; Litaker, Mark S.; Baglio, Michelle L.; Guinn, Caroline H.; Shaffer, Nicole M.
2005-01-01
We investigated telephone administration of the Children's Social Desirability (CSD) scale and our adaptation for children of the Social Desirability for Food scale (C-SDF). Each of 100 4th-graders completed 2 telephone interviews 28 days apart. CSD scores had adequate internal consistency and test-retest reliability, and a 14-item subset was identified that sufficiently measures the same construct. Our C-SDF scale performed less well in terms of internal consistency and test-retest reliability; factor analysis revealed 2 factors, 1 of which was moderately related to the CSD. The 14-item subset of the CSD scale may help researchers understand error in children's dietary reports. PMID:15068757
Hales, M; Biros, E; Reznik, J E
2015-01-01
Since 1982, the International Standards for Neurological Classification of Spinal Cord Injury (ISNCSCI) has been used to classify sensation of spinal cord injury (SCI) through pinprick and light touch scores. The absence of proprioception, pain, and temperature within this scale creates questions about its validity and accuracy. To assess whether the sensory component of the ISNCSCI represents a reliable and valid measure of classification of SCI. A systematic review of studies examining the reliability and validity of the sensory component of the ISNCSCI published between 1982 and February 2013 was conducted. The electronic databases MEDLINE via Ovid, CINAHL, PEDro, and Scopus were searched for relevant articles. A secondary search of reference lists was also completed. Chosen articles were assessed according to the Oxford Centre for Evidence-Based Medicine hierarchy of evidence and critically appraised using the McMasters Critical Review Form. A statistical analysis was conducted to investigate the variability of the results given by reliability studies. Twelve studies were identified: 9 reviewed reliability and 3 reviewed validity. All studies demonstrated low levels of evidence and moderate critical appraisal scores. The majority of the articles (~67%; 6/9) assessing the reliability suggested that training was positively associated with better posttest results. The results of the 3 studies that assessed the validity of the ISNCSCI scale were confounding. Due to the low to moderate quality of the current literature, the sensory component of the ISNCSCI requires further revision and investigation if it is to be a useful tool in clinical trials.
Hales, M.; Biros, E.
2015-01-01
Background: Since 1982, the International Standards for Neurological Classification of Spinal Cord Injury (ISNCSCI) has been used to classify sensation of spinal cord injury (SCI) through pinprick and light touch scores. The absence of proprioception, pain, and temperature within this scale creates questions about its validity and accuracy. Objectives: To assess whether the sensory component of the ISNCSCI represents a reliable and valid measure of classification of SCI. Methods: A systematic review of studies examining the reliability and validity of the sensory component of the ISNCSCI published between 1982 and February 2013 was conducted. The electronic databases MEDLINE via Ovid, CINAHL, PEDro, and Scopus were searched for relevant articles. A secondary search of reference lists was also completed. Chosen articles were assessed according to the Oxford Centre for Evidence-Based Medicine hierarchy of evidence and critically appraised using the McMasters Critical Review Form. A statistical analysis was conducted to investigate the variability of the results given by reliability studies. Results: Twelve studies were identified: 9 reviewed reliability and 3 reviewed validity. All studies demonstrated low levels of evidence and moderate critical appraisal scores. The majority of the articles (~67%; 6/9) assessing the reliability suggested that training was positively associated with better posttest results. The results of the 3 studies that assessed the validity of the ISNCSCI scale were confounding. Conclusions: Due to the low to moderate quality of the current literature, the sensory component of the ISNCSCI requires further revision and investigation if it is to be a useful tool in clinical trials. PMID:26363591
Voigt-Radloff, S; Leonhart, R; Schützwohl, M; Jurjanz, L; Reuster, T; Gerner, A; Marschner, K; van Nes, F; Graff, M; Vernooij-Dassen, M; Rikkert, M O; Holthoff, V; Hüll, M
2012-03-01
To translate the Dementia quality of life instrument (DQoL) into German and assess its construct and concurrent validity in community-dwelling people with mild to moderate dementia. Dementia quality of life instrument data of two pooled samples (n=287) were analysed regarding ceiling and floor effects, internal consistency, factor reliability and correlations with corresponding scales on quality of life (Quality of Life in Alzheimer's Disease and SF-12), cognition (Mini-Mental State Examination, Alzheimer's Disease Assessment Scale - cognitive), depression (Cornell Scale for Depression in Dementia) and activities of daily living (Interview of Deterioration in Daily Living Activities in Dementia). We found no floor effects (<2%), minor ceiling effects (1-11%), moderate to good internal consistency (Cronbach's α: 0.6-0.8) and factor reliability (0.6-0.8), moderate correlations with self-rated scales of quality of life (Spearman coefficient: 0.3-0.6) and no or minor correlations with scores for cognition, depression or activities of daily living (r<0.3). The original five-factor model could not be confirmed. The DQoL can be used in dementia research for assessing positive and negative affect, feelings of belonging and self-esteem. The findings suggest that further research is needed to improve the structure of the aesthetics, feelings of belonging and self-esteem scales. © 2011 The Author(s). European Journal of Neurology © 2011 EFNS.
Hansen, Andreas Wolff; Dahl-Petersen, Inger; Helge, Jørn Wulff; Brage, Søren; Grønbæk, Morten; Flensborg-Madsen, Trine
2014-03-01
The International Physical Activity Questionnaire (IPAQ) is commonly used in surveys, but its reliability and validity have not been established in the Danish population. Among participants in the Danish Health Examination survey 2007-2008, 142 healthy participants (45% men) wore a unit that combined accelerometry and heart rate monitoring (Acc+HR) for 7 consecutive days and then completed the IPAQ. Background data were obtained from the survey. Physical activity energy expenditure (PAEE) and time in moderate, vigorous, and sedentary intensity levels were derived from the IPAQ and compared with estimates from Acc+HR using Spearman's correlation coefficients and Bland-Altman plots. Repeatability of the IPAQ was also assessed. PAEE from the 2 methods was significantly positively correlated (0.29 and 0.49; P = 0.02 and P < 0.001; for women and men, respectively). Men significantly overestimated PAEE by IPAQ (56.2 vs 45.3 kJ/kg/day, IPAQ: Acc+HR, P < .01), while the difference was nonsignificant for women (40.8 vs 44.4 kJ/kg/day). Bland-Altman plots showed that the IPAQ overestimated PAEE, moderate, and vigorous activity without systematic error. Reliability of the IPAQ was moderate to high for all domains and intensities (total PAEE intraclass correlation coefficient = 0.58). This Danish Internet-based version of the long IPAQ had modest validity and reliability when assessing PAEE at population level.
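The Bland-Altman comparison named in the abstract above can be illustrated with a minimal sketch. It assumes the conventional definition (mean difference as bias, with limits of agreement at bias ± 1.96 standard deviations of the differences); the function name `bland_altman` and the PAEE values are hypothetical and are not taken from the study.

```python
from statistics import mean, stdev


def bland_altman(method_a, method_b):
    """Return (bias, lower limit, upper limit) for paired measurements.

    Bias is the mean of the paired differences (a - b); the limits of
    agreement are bias +/- 1.96 sample standard deviations of the differences.
    """
    if len(method_a) != len(method_b) or len(method_a) < 2:
        raise ValueError("need two equally long series with at least 2 pairs")
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(diffs)
    sd = stdev(diffs)  # sample standard deviation (n - 1 denominator)
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


# Hypothetical PAEE values in kJ/kg/day (illustration only, not study data).
ipaq = [50.0, 62.0, 41.0, 58.0, 47.0]
acc_hr = [45.0, 50.0, 43.0, 49.0, 44.0]
bias, lower, upper = bland_altman(ipaq, acc_hr)
```

A positive bias in this sketch would mean that the first method reads higher on average than the second, which is the sense in which the abstract describes overestimation by the IPAQ.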
Bazo-Alvarez, Juan Carlos; Bazo-Alvarez, Oscar Alfredo; Aguila, Jeins; Peralta, Frank; Mormontoy, Wilfredo; Bennett, Ian M
2016-01-01
Our aim was to evaluate the psychometric properties of the FACES-III among Peruvian high school students. This is a psychometric cross-sectional study. Probabilistic sampling was applied in three stages: stratum one (school), stratum two (grade) and cluster (section). The participants were 910 adolescent students of both sexes, between 11 and 18 years of age. The instrument under study was Olson's FACES-III. The analysis included a review of the structure/construct validity of the measure by factor analysis and assessment of internal consistency (reliability). The real-cohesion scale had moderately high reliability (Ω=.85) while the real-flexibility scale had moderate reliability (Ω=.74). Reliability was moderately high for both the ideal-cohesion scale (Ω=.89) and the ideal-flexibility scale (Ω=.86). Construct validity was confirmed by the goodness of fit of a two-factor model (cohesion and flexibility) with 10 items each [Adjusted goodness of fit index (AGFI) = 0.96; Expected Cross Validation Index (ECVI) = 0.87; Normed fit index (NFI) = 0.93; Goodness of fit index (GFI) = 0.97; Root mean square error of approximation (RMSEA) = 0.06]. FACES-III has sufficient reliability and validity to be used in Peruvian adolescents for the purpose of group or individual assessment.
Utility of the Rosenberg self-esteem scale.
Davis, Clare; Kellett, Stephen; Beail, Nigel
2009-05-01
The Rosenberg Self-Esteem Scale (RSES) continues to be used to purportedly measure self-esteem of people with intellectual disabilities, despite the lack of sound evidence concerning its validity and reliability when employed with this population. The psychometric foundations of the RSES were analyzed here with a sample of 219 participants with intellectual disabilities. The factor analytic methods employed revealed two factors (Self-Worth and Self-Criticism) and more specific problems with RSES Items 5 and 8. Overall, this scale showed only moderate temporal and moderate internal reliability and poor aspects of criterion validity. Results are discussed with reference to either developing a new measure of self-esteem or redesigning and simplifying the RSES in order to increase its initial face validity in intellectual disability samples.
Ventura, Joseph; Cienfuegos, Angel; Boxer, Oren; Bilder, Robert
2008-11-01
Cognitive deficits are core features of schizophrenia that have been associated reliably with functional outcomes and now are a focus of treatment research. New rating scales are needed to complement current psychometric testing procedures, both to enable wider clinical use, and to serve as endpoints in clinical trials. Subjects were 35 schizophrenia patient-and-caregiver pairs recruited from the UCLA and West Los Angeles VA Outpatient Psychiatry Departments. Participants were assessed with the Clinical Global Impression of Cognition in Schizophrenia (CGI-CogS), an interview-based rating scale of cognitive functioning, on 3 occasions (baseline, 1 month, and 3 months). A computerized neurocognitive battery (Cogtest), an assessment of functioning, and symptom measures were administered at two occasions (baseline and one month). The CGI-CogS ratings generally showed a high level of internal consistency (Cronbach's alpha=.69 to .96), adequate levels of inter-rater reliability (ICC's=.71 to .80), and high test-retest stability (ICC's=.92 to .95). Correlations of caregiver and rater global (but not "patient only rating") CGI-CogS ratings with neurocognitive performance were in the moderate range (r's=-.27 to -.48), while most of the correlations with functional outcome were moderate to high (r's=-.41 to -.72). In fact, the CGI-CogS ratings were significantly more correlated with Social Functioning than were objective neurocognitive test scores (p=.02) and showed a trend in the same direction for predicting Instrumental Functioning (p=.06). We found moderate correlations between CGI-CogS global ratings and PANSS positive (r's=.36 to .49) and SANS negative symptoms (r=.41 to .61), but not with BPRS depression (r's=.11 to .13). An interview-based measure of cognition demonstrated high internal consistency, good inter-rater reliability, and high test-retest reliability. Caregiver ratings appear to add important clinical information over patient-only ratings. 
The CGI-CogS showed moderate validity with respect to neurocognitive performance and functional outcome, and correlations of CGI-CogS with functional outcomes were stronger than correlations of objective neurocognitive performance with functional outcomes. The CGI-CogS appears to offer a reliable and valid method for clinical rating of cognitive deficits and their impact on everyday functioning in schizophrenia.
[Reliability and validity of a Mexican version of the Pro Children Project questionnaire].
Ochoa-Meza, Gerardo; Sierra, Juan Carlos; Pérez-Rodrigo, Carmen; Aranceta Bartrina, Javier; Esparza-Del Villar, Óscar A
2014-08-01
To determine the test-retest reliability, the internal consistency, and the predictive validity of the constructs of the Mexican version of the Pro Children Project questionnaire (PCHP) for assessing personal and environmental factors related to fruit and vegetable intake in 10-12 year-old schoolchildren. Test-retest design with a 14-day interval. A sample of 957 children completed the 82-item questionnaire. The study was conducted at eight primary schools in 2012 in Ciudad Juarez, Chihuahua, Mexico. For all fruit constructs and vegetable constructs, the test-retest reliability was moderate (intraclass correlation coefficient (ICC) > 0.60). Cronbach's alpha values were moderate to high (range of 0.54 to 0.92), similar to those in the original study. Values for predictive validity ranged from moderate to good with Spearman correlations between 0.23 and 0.60 for personal factors and between 0.14 and 0.40 for environmental factors. The results of the Mexican version of the PCHP questionnaire indicate sufficient reliability and validity for assessing personal and environmental factors of fruit and vegetable intake in 10-12 year-old schoolchildren. Finally, implications for administering this instrument in school settings and guidelines for future studies are discussed. Copyright AULA MEDICA EDICIONES 2014. Published by AULA MEDICA. All rights reserved.
Citronberg, Jessica S; Wilkens, Lynne R; Lim, Unhee; Hullar, Meredith A J; White, Emily; Newcomb, Polly A; Le Marchand, Loïc; Lampe, Johanna W
2016-09-01
Plasma lipopolysaccharide-binding protein (LBP), a measure of internal exposure to bacterial lipopolysaccharide, has been associated with several chronic conditions and may be a marker of chronic inflammation; however, no studies have examined the reliability of this biomarker in a healthy population. We examined the temporal reliability of LBP measured in archived samples from participants in two studies. In Study one, 60 healthy participants had blood drawn at two time points: baseline and follow-up (either three, six, or nine months). In Study two, 24 individuals had blood drawn three to four times over a seven-month period. We measured LBP in archived plasma by ELISA. Test-retest reliability was estimated by calculating the intraclass correlation coefficient (ICC). Plasma LBP concentrations showed moderate reliability in Study one (ICC 0.60, 95 % CI 0.43-0.75) and Study two (ICC 0.46, 95 % CI 0.26-0.69). Restricting the follow-up period improved reliability. In Study one, the reliability of LBP over a three-month period was 0.68 (95 % CI: 0.41-0.87). In Study two, the ICC of samples taken ≤seven days apart was 0.61 (95 % CI 0.29-0.86). Plasma LBP concentrations demonstrated moderate test-retest reliability in healthy individuals with reliability improving over a shorter follow-up period.
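The abstract above reports test-retest reliability as an intraclass correlation coefficient but does not state which ICC form was used. As an illustration only, the sketch below assumes the one-way random-effects, single-measurement form, ICC(1,1) = (MSB - MSW) / (MSB + (k - 1) MSW), where MSB and MSW are the between-subject and within-subject mean squares and k is the number of repeated measurements per subject. The function name `icc_oneway` and the concentrations are hypothetical.

```python
def icc_oneway(scores):
    """One-way random-effects ICC(1,1).

    `scores` is a list of subjects, each a list of k repeated measurements
    (the same k for every subject, k >= 2, at least 2 subjects).
    """
    n = len(scores)
    k = len(scores[0])
    if n < 2 or k < 2 or any(len(row) != k for row in scores):
        raise ValueError("need >= 2 subjects with the same number (>= 2) of measurements")
    grand_mean = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    # Between-subject and within-subject sums of squares.
    ss_between = k * sum((m - grand_mean) ** 2 for m in row_means)
    ss_within = sum((x - m) ** 2 for row, m in zip(scores, row_means) for x in row)
    ms_between = ss_between / (n - 1)
    ms_within = ss_within / (n * (k - 1))
    return (ms_between - ms_within) / (ms_between + (k - 1) * ms_within)


# Hypothetical baseline and follow-up concentrations for four participants
# (illustration only, not data from either study).
lbp = [[10.0, 11.0], [14.0, 13.0], [9.0, 12.0], [16.0, 15.0]]
icc = icc_oneway(lbp)
```

Other ICC forms (two-way models, average measures) give different values on the same data, so a comparison with published figures would need the form the authors actually used.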
Quiroz, Viviana; Reinero, Daniela; Hernández, Patricia; Contreras, Johanna; Vernal, Rolando; Carvajal, Paola
2017-01-01
This study aimed to develop and assess the content validity and reliability of a cognitively adapted self-report questionnaire designed for surveillance of gingivitis in adolescents. Ten predetermined self-report questions evaluating early signs and symptoms of gingivitis were preliminarily assessed by a panel of clinical experts. Eight questions were selected and cognitively tested in 20 adolescents aged 12 to 18 years from Santiago de Chile. The questionnaire was then administered to 178 Chilean adolescents. Internal consistency was measured using Cronbach's alpha and temporal stability was calculated using the Kappa index. A reliable final self-report questionnaire consisting of 5 questions was obtained, with a total Cronbach's alpha of 0.73 and a Kappa index ranging from 0.41 to 0.77 between the different questions. The proposed questionnaire is reliable, with an acceptable internal consistency and a temporal stability from moderate to substantial, and it is promising for estimating the prevalence of gingivitis in adolescents.
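Cronbach's alpha, the internal-consistency statistic reported in the abstract above, follows the standard formula alpha = k / (k - 1) × (1 - sum of item variances / variance of total scores), where k is the number of items. The sketch below is a minimal illustration of that formula; the function name `cronbach_alpha` and the yes/no answers (coded 1/0) are hypothetical and are not the study's data.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(responses[0])
    if k < 2 or len(responses) < 2 or any(len(r) != k for r in responses):
        raise ValueError("need >= 2 respondents and the same number (>= 2) of items each")
    items = list(zip(*responses))                    # one tuple of scores per item
    item_variance_sum = sum(variance(col) for col in items)
    total_variance = variance([sum(r) for r in responses])
    if total_variance == 0:
        raise ValueError("total scores have zero variance; alpha is undefined")
    return k / (k - 1) * (1 - item_variance_sum / total_variance)


# Hypothetical answers of six respondents to a 5-question yes/no (1/0) form.
answers = [
    [1, 1, 0, 1, 1],
    [0, 0, 0, 0, 0],
    [1, 1, 1, 1, 1],
    [0, 1, 0, 0, 0],
    [1, 0, 1, 1, 1],
    [0, 0, 0, 1, 0],
]
alpha = cronbach_alpha(answers)
```

Sample variances (n - 1 denominator) are used for both the items and the totals; the ratio is the same with population variances as long as the choice is consistent.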
Rademakers, Jany; Nijman, Jessica; van der Hoek, Lucas; Heijmans, Monique; Rijken, Mieke
2012-07-31
The American short form Patient Activation Measure (PAM) is a 13-item instrument which assesses patient (or consumer) self-reported knowledge, skills and confidence for self-management of one's health or chronic condition. In this study the PAM was translated into a Dutch version; psychometric properties of the Dutch version were established and the instrument was validated in a panel of chronically ill patients. The translation was done according to WHO guidelines. The PAM 13-Dutch was sent to 4178 members of the Dutch National Panel of people with Chronic illness or Disability (NPCD) in April 2010 (study A) and again to a sub sample of this group (N = 973) in June 2010 (study B). Internal consistency, test-retest reliability and cross-validation with the SBSQ-D (a measure for Health literacy) were computed. The Dutch results were compared to similar Danish and American data. The psychometric properties of the PAM 13-Dutch were generally good. The level of internal consistency is good (α = 0.88) and item-rest correlations are moderate to strong. The Dutch mean PAM score (61.3) is comparable to the American (61.9) and lower than the Danish (64.2). The test-retest reliability was moderate. The association with Health literacy was weak to moderate. The PAM-13 Dutch is a reliable instrument to measure patient activation. More research is needed into the validity of the Patient Activation Measure, especially with respect to a more comprehensive measure of Health literacy.
Wang, Chao; Chen, Peijie; Zhuang, Jie
2013-12-01
The psychometric profiles of the widely used International Physical Activity Questionnaire-Short Form (IPAQ-SF) in Chinese youth have not been reported. The purpose of this study was to examine the validity and reliability of the IPAQ-SF using a sample of Chinese youth. One thousand and twenty-one youth (M(age) = 14.26 ± 1.63 years, 52.8% boys) from 11 cities in China wore accelerometers for 7 consecutive days and completed the IPAQ-SF on the 8th day to recall their physical activity (PA) during accelerometer-wearing days. A subsample of 92 youth (M(age) = 15.90 ± 1.35 years, 46.7% boys) completed the IPAQ-SF again a week later to recall their PA during accelerometer-wearing days. Differences in PA estimated by the IPAQ-SF and accelerometer were examined by paired-sample t test. Spearman correlation coefficients were used to examine the correlation between the IPAQ-SF and accelerometer. Test-retest reliability of the IPAQ-SF was determined by the intraclass correlation coefficient (ICC). Compared with accelerometer, the IPAQ-SF overestimated sedentary time, moderate PA (MPA), vigorous PA (VPA), and moderate-to-vigorous PA (MVPA). Correlations between PA (total PA, MPA, VPA, and MVPA) and sedentary time measured by the 2 instruments ranged from "none" to "low" (ρ = .08-.31). Test-retest ICC of the IPAQ-SF ranged from "moderate" to "high" (ICC = .43-.83), except for sitting in boys (ICC = .06), sitting for the whole sample (ICC = .32), and VPA in girls (ICC = .35). The IPAQ-SF was not a valid instrument for measuring PA and sedentary behavior in Chinese youth.
Oo, W M; Linklater, J M; Daniel, M; Saarakkala, S; Samuels, J; Conaghan, P G; Keen, H I; Deveza, L A; Hunter, D J
2018-05-01
The aims of this study were to systematically review clinimetrics of commonly assessed ultrasound pathologies in knee, hip and hand osteoarthritis (OA), and to conduct a meta-analysis for each clinimetric. Medline, Embase, and Cochrane Library databases were searched from their inceptions to September 2016. According to the Outcome Measures in Rheumatology (OMERACT) Instrument Selection Algorithm, data extraction focused on ultrasound technical features and performance metrics. Methodological quality was assessed with modified 19-item Downs and Black score and 11-item Quality Appraisal of Diagnostic Reliability (QAREL) score. Separate meta-analyses were performed for clinimetrics: (1) inter-rater/intra-rater reliability; (2) construct validity; (3) criteria validity; and (4) internal/external responsiveness. Statistical Package for the Social Sciences (SPSS), Excel and Comprehensive Meta-analysis were used. Our search identified 1126 records; of these, 100 were eligible, including a total of 8542 patients and 32,373 joints. The average Downs and Black score was 13.01, and average QAREL was 5.93. The stratified meta-analysis was performed only for knee OA, which demonstrated moderate to substantial reliability [minimum kappa > 0.44(0.15,0.74), minimum intraclass correlation coefficient (ICC) > 0.82(0.73-0.89)], weak construct validity against pain (r = 0.12 to 0.27), function (r = 0.15 to 0.23), and blood biomarkers (r = 0.01 to 0.21), but weak to strong correlation with plain radiography (r = 0.13 to 0.60), strong association with Magnetic Resonance Imaging (MRI) [minimum r = 0.60(0.52,0.67)] and strong discrimination against symptomatic patients (OR = 3.08 to 7.46). There was strong criterion validity against cartilage histology [r = 0.66(-0.05,0.93)], and small to moderate internal [standardized mean difference(SMD) = 0.20 to 0.58] and external (r = 0.35 to 0.43) responsiveness to interventions. 
Ultrasound demonstrated strong criterion validity with cartilage histology, poor to strong correlation with patient findings and MRI, moderate reliability, and low responsiveness to interventions. CRD42016039954. Copyright © 2018 Osteoarthritis Research Society International. All rights reserved.
Lenderking, William R; Wyrwich, Kathleen W; Stolar, Marilyn; Howard, Kellee A; Leibman, Chris; Buchanan, Jacqui; Lacey, Loretto; Kopp, Zoe; Stern, Yaakov
2013-12-01
The Dependence Scale (DS) was designed to measure dependence on others among patients with Alzheimer's disease (AD). The objectives of this research were primarily to strengthen the psychometric evidence for the use of the DS in AD studies. Patients with mild to moderately severe AD were examined in 3 study databases. Within each data set, internal consistency, validity, and responsiveness were examined, and structural equation models were fit. The DS has strong psychometric properties. The DS scores differed significantly across known groups and demonstrated moderate to strong correlations with measures hypothesized to be related to dependence (|r| ≥ .31). Structural equation modeling supported the validity of the DS concept. An anchor-based DS responder definition to interpret a treatment benefit over time was identified. The DS is a reliable, valid, and interpretable measure of dependence associated with AD and is shown to be related to--but provides information distinct from--cognition, functioning, and behavior.
Park, Hyeon Jin; Yang, Hyung Kook; Shin, Dong Wook; Kim, Yoon Yi; Kim, Young Ae; Yun, Young Ho; Nam, Byung Ho; Bhatia, Smita; Park, Byung Kiu; Ghim, Thad T; Kang, Hyoung Jin; Park, Kyung Duk; Shin, Hee Young; Ahn, Hyo Seop
2013-12-01
We verified the reliability and validity of the Korean version of the Minneapolis-Manchester Quality of Life Instrument-Adolescent Form (KMMQL-AF) among Korean childhood cancer survivors. A total of 107 childhood cancer patients undergoing cancer treatment and 98 childhood cancer survivors who completed cancer treatment were recruited. To assess the internal structure of the KMMQL-AF, we performed multi-trait scaling analyses and exploratory factor analysis. Additionally, we compared each domains of the KMMQL-AF with those of the Karnofsky Performance Status Scale and the Revised Children's Manifest Anxiety Scale (RCMAS). Internal consistency of the KMMQL-AF was sufficient (Cronbach's alpha: 0.78-0.92). In multi-trait scaling analyses, the KMMQL-AF showed sufficient construct validity. The "physical functioning" domain showed moderate correlation with Karnofsky scores and the "psychological functioning" domain showed moderate-to-high correlation with the RCMAS. The KMMQL-AF discriminated between subgroups of different adolescent cancer survivors depending on treatment completion. The KMMQL-AF is a sufficiently reliable and valid instrument for measuring quality of life among Korean childhood cancer survivors.
Tsuno, Kanami; Kawakami, Norito; Shimazu, Akihito; Shimada, Kyoko; Inoue, Akiomi; P Leiter, Michael
2017-05-25
Although incivility is a common interpersonal mistreatment and associated with poor mental health, there are few studies about it in Asian countries. The aim of this study was to develop the Japanese version of the modified Work Incivility Scale (J-MWIS), investigate its reliability and validity, and reveal the prevalence of incivility among Japanese employees in comparison with data on Canadian employees. A total of 2,191 Japanese and 1,071 Canadian employees were surveyed, using either the J-MWIS or MWIS. Japanese employees additionally answered questions on civility, worksite social support, workplace bullying, psychological distress, intention to leave, and work engagement to investigate construct validity. At least one form of workplace incivility was experienced by both Japanese (52.3%) and Canadian (86.0%) employees in the previous month. Internal consistency reliability of the J-MWIS was acceptable (α=0.71-0.81), and correlation analyses also confirmed its construct validity as expected. Workplace incivility was associated with lower workgroup civility, lower supervisor and coworker support, higher workplace bullying, higher psychological distress, higher intention to leave, and lower work engagement. Confirmatory factor analyses showed that the original three-factor model (supervisor incivility, coworker incivility, and instigated incivility) fitted moderately in both Japan and Canada data, though the privacy/overfamiliarity factor was additionally extracted from exploratory factor analysis for the J-MWIS. The results of this study suggested that the J-MWIS has moderate internal consistency reliability and good construct validity.
Tsuno, Kanami; Kawakami, Norito; Shimazu, Akihito; Shimada, Kyoko; Inoue, Akiomi; P. Leiter, Michael
2017-01-01
Objectives: Although incivility is a common interpersonal mistreatment and associated with poor mental health, there are few studies about it in Asian countries. The aim of this study was to develop the Japanese version of the modified Work Incivility Scale (J-MWIS), investigate its reliability and validity, and reveal the prevalence of incivility among Japanese employees in comparison with data on Canadian employees. Methods: A total of 2,191 Japanese and 1,071 Canadian employees were surveyed, using either the J-MWIS or MWIS. Japanese employees additionally answered questions on civility, worksite social support, workplace bullying, psychological distress, intention to leave, and work engagement to investigate construct validity. Results: At least one form of workplace incivility was experienced by both Japanese (52.3%) and Canadian (86.0%) employees in the previous month. Internal consistency reliability of the J-MWIS was acceptable (α=0.71-0.81), and correlation analyses also confirmed its construct validity as expected. Workplace incivility was associated with lower workgroup civility, lower supervisor and coworker support, higher workplace bullying, higher psychological distress, higher intention to leave, and lower work engagement. Confirmatory factor analyses showed that the original three-factor model (supervisor incivility, coworker incivility, and instigated incivility) fitted moderately in both Japan and Canada data, though the privacy/overfamiliarity factor was additionally extracted from exploratory factor analysis for the J-MWIS. Conclusions: The results of this study suggested that the J-MWIS has moderate internal consistency reliability and good construct validity. PMID:28302927
Uswatte, Gitendra; Taub, Edward; Morris, David; Vignolo, Mary; McCulloch, Karen
2005-11-01
In research on Constraint-Induced Movement (CI) therapy, a structured interview, the Motor Activity Log (MAL), is used to assess how stroke survivors use their more-impaired arm outside the laboratory. This article examines the psychometrics of the 14-item version of this instrument in 2 chronic stroke samples with mild-to-moderate upper-extremity hemiparesis. Participants (n=41) in the first study completed MALs before and after CI therapy or a placebo control procedure. In addition, caregivers independently completed a MAL on the participants. Participants (n=27) in the second study completed MALs and wore accelerometers that monitored their arm movements for 3 days outside the laboratory before and after an automated form of CI therapy. Validity of the participant MAL Quality of Movement (QOM) scale was supported. Correlations between pretreatment-to-posttreatment change scores on the participant QOM scale and caregiver MAL QOM scale, caregiver MAL amount of use (AOU) scale, and accelerometer recordings were 0.70, 0.73, and 0.91 (P<0.01), respectively. Internal consistency (alpha>0.81), test-retest reliability (r>0.91), stability, and responsiveness (ratio>3) of the participant QOM scale were also supported. The participant AOU and caregiver QOM and AOU scales were internally consistent, stable, and sensitive, but were not reliable. The participant MAL QOM scale can be used exclusively to reliably and validly measure real-world, upper-extremity rehabilitation outcome and functional status in chronic stroke patients with mild-to-moderate hemiparesis.
Hyperventilation in asthma: a validation study of the Nijmegen Questionnaire--NQ.
Grammatopoulou, Eirini P; Skordilis, Emmanouil K; Georgoudis, Georgios; Haniotou, Aikaterini; Evangelodimou, Afroditi; Fildissis, George; Katsoulas, Theodoros; Kalagiakos, Panagiotis
2014-10-01
The Nijmegen questionnaire (NQ) has previously been used for screening the hyperventilation syndrome (HVS) in asthmatics. However, no validity study has been reported so far. To examine the validity and reliability of the NQ in asthma patients and identify the prevalence of HVS. The NQ (n = 162) was examined for translation, construct, cross-sectional and discriminant validity as well as for internal consistency and test-retest reliability. Principal component analysis and exploratory factor analysis revealed a single factor solution with 11 items and 58.6% of explained variability. These 11 NQ items showed high internal consistency (Cronbach's alpha = 0.92) and test-retest reliability (IR = 0.98). Higher NQ scores were found in the following subgroups: women versus men (p < 0.01); participants with moderate versus mild asthma (p < 0.001) or uncontrolled versus controlled asthma (p < 0.001), and participants with breath-hold time (BHT) < 30 versus ≥ 30 s (p < 0.01) or end-tidal CO2 (ETCO2) ≤ 35 versus >35 mmHg (p < 0.001). A cut-off score of >17 discriminated the participants with regard to the presence of HVS. The NQ showed 92.73% sensitivity and 91.59% specificity. The total NQ score was significantly correlated with ETCO2 (r = -0.68), RR (r = 0.66) and BHT (r = -0.65). The prevalence of HVS was 34%. The NQ is a valid and reliable questionnaire for screening HVS in patients with stable mild-to-moderate asthma.
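The sensitivity and specificity reported above for a cut-off score of >17 follow the usual definitions: sensitivity = TP / (TP + FN) and specificity = TN / (TN + FP). The sketch below shows that arithmetic only; the function name `screening_metrics`, the scores, and the reference diagnoses are hypothetical and are not drawn from the study.

```python
def screening_metrics(scores, has_condition, cutoff=17):
    """Return (sensitivity, specificity) when a score above `cutoff` counts as positive."""
    tp = fp = tn = fn = 0
    for score, condition in zip(scores, has_condition):
        positive = score > cutoff          # strict inequality, as in ">17"
        if positive and condition:
            tp += 1
        elif positive and not condition:
            fp += 1
        elif not positive and condition:
            fn += 1
        else:
            tn += 1
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one case with and one without the condition")
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical questionnaire totals and reference diagnoses (illustration only).
nq_scores = [25, 19, 30, 12, 8, 18, 15, 22]
hvs_present = [True, True, True, True, False, False, False, False]
sensitivity, specificity = screening_metrics(nq_scores, hvs_present)
```

Moving the cut-off trades one metric against the other: a higher threshold tends to lower sensitivity and raise specificity.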
Life Participation for Parents: a tool for family-centered occupational therapy.
Fingerhut, Patricia E
2013-01-01
This study describes the continued development of the Life Participation for Parents (LPP), a measurement tool to facilitate family-centered pediatric practice. LPP questionnaires were completed by 162 parents of children with special needs receiving intervention at 15 pediatric private practice clinics. Results were analyzed to establish instrument reliability and validity. Good internal consistency (α = .90) and test-retest reliability (r = .89) were established. Construct validity was examined through assessment of internal structure and comparison of the instrument to related variables. A principal components analysis resulted in a two-factor model accounting for 43.81% of the variance. As hypothesized, the LPP correlated only moderately with the Parenting Stress Index-Short Form (r = .54). The variables of child's diagnoses, age, and time in therapy did not predict parental responses. The LPP is a reliable and valid instrument for measuring satisfaction with parental participation in life occupations. Copyright © 2013 by the American Occupational Therapy Association, Inc.
Rosa-Rizzotto, M; Visonà Dalla Pozza, L; Corlatti, A; Luparia, A; Marchi, A; Molteni, F; Facchin, P; Pagliano, E; Fedrizzi, E
2014-10-01
In hemiplegic children, the recognition of the activity limitation pattern and the possibility of grading its severity are relevant for clinicians when planning interventions, monitoring results, and predicting outcomes. The aim of the study was to examine the reliability and validity of the Besta scale, an instrument used to measure, in hemiplegic children from 18 months to 12 years of age, both grasp on request (capacity) and spontaneous use of the upper limb (performance) in bimanual play activities and in ADL. Psychometric analysis of the reliability and validity of the Besta scale was performed. Outpatient study sample. Reliability study: a sample of 39 patients was enrolled. The administration of the Besta scale was video-recorded in a standardized manner. All videos were scored by 20 independent raters on subsequent viewing. Three raters randomly selected from the 20-rater group rescored the same video two years later for intra-rater reliability. Intra- and inter-rater reliability were calculated using the Intraclass Correlation Coefficient (ICC) and Kendall's coefficient (K), respectively. Internal consistency reliability was assessed using Cronbach's alpha coefficient. Validity study: a sample of 105 children was assessed 5 times (at t0 and 2, 3, 6 and 12 months later) by 20 independent raters. Each patient underwent QUEST and Besta scale administration and assessment at the same time. Criterion validity was calculated using Pearson's correlation coefficient. Reliability study: the inter-rater reliability calculated with Kendall's coefficient was moderate (K=0.47). The intra-rater (or test-retest) reliability for 3 raters was excellent (ICC=0.927). The Cronbach's alpha for internal consistency was 0.972. Validity study: the Besta scale showed good criterion validity compared to QUEST, increasing with age and severity of impairment. Pearson's correlation coefficient r was 0.81 (P<0.0001). Limitations.
In infants, the Besta scale has difficulty distinguishing between mildly and moderately impaired hand function. The Besta scale scoring system is a valid and reliable tool that can be used in a clinical setting to monitor the evolution of unimanual and bimanual manipulation and to distinguish the hand's capacity from its performance.
Psychometrics of the MHSIP Adult Consumer Survey.
Jerrell, Jeanette M
2006-10-01
The reliability and validity of the Mental Health Statistics Improvement Program (MHSIP) Adult Consumer Survey were assessed in a statewide convenience sample of 459 persons with severe mental illness served through a public mental health system. Consistent with previous findings and the intent of its developers, three factors were identified that demonstrate good internal consistency, moderate test-retest reliability, and good convergent validity with consumer perceptions of other aspects of their care. The reliability and validity of the MHSIP Adult Consumer Survey documented in this study underscore its scientific and practical utility as an abbreviated tool for assessing access, quality and appropriateness, and outcome in mental health service systems.
Kepler, Christopher K; Vaccaro, Alexander R; Koerner, John D; Dvorak, Marcel F; Kandziora, Frank; Rajasekaran, Shanmuganathan; Aarabi, Bizhan; Vialle, Luiz R; Fehlings, Michael G; Schroeder, Gregory D; Reinhold, Maximilian; Schnake, Klaus John; Bellabarba, Carlo; Cumhur Öner, F
2016-04-01
The aims of this study were (1) to demonstrate that the AOSpine thoracolumbar spine injury classification system can be reliably applied by an international group of surgeons and (2) to delineate those injury types which are difficult for spine surgeons to classify reliably. A previously described classification system of thoracolumbar injuries, which consists of a morphologic classification of the fracture, a grading system for the neurologic status and relevant patient-specific modifiers, was applied to 25 cases by 100 spinal surgeons from across the world twice independently, in grading sessions 1 month apart. The results were analyzed for classification reliability using the Kappa coefficient (κ). The overall Kappa coefficient for all cases was 0.56, which represents moderate reliability. Kappa values describing interobserver agreement were 0.80 for type A injuries, 0.68 for type B injuries and 0.72 for type C injuries, all representing substantial reliability. The lowest level of agreement for specific subtypes was for fracture subtype A4 (Kappa = 0.19). Intraobserver analysis demonstrated an overall average Kappa statistic for subtype grading of 0.68, also representing substantial reproducibility. In a worldwide sample of spinal surgeons without previous exposure to the recently described AOSpine Thoracolumbar Spine Injury Classification System, we demonstrated moderate interobserver and substantial intraobserver reliability. These results suggest that most spine surgeons can apply this system to spine trauma patients as reliably as, or more reliably than, previously described systems.
Piqueras, Jose A; Martín-Vivar, María; Sandin, Bonifacio; San Luis, Concepción; Pineda, David
2017-08-15
Anxiety and depression are among the most common mental disorders during childhood and adolescence. Among the instruments for the brief screening assessment of symptoms of anxiety and depression, the Revised Child Anxiety and Depression Scale (RCADS) is one of the more widely used. Previous studies have demonstrated the reliability of the RCADS for different assessment settings and different versions. The aims of this study were to examine the mean reliability of the RCADS and the influence of the moderators on the RCADS reliability. We searched in EBSCO, PsycINFO, Google Scholar, Web of Science, and NCBI databases and other articles manually from lists of references of extracted articles. A total of 146 studies were included in our meta-analysis. The RCADS showed robust internal consistency reliability in different assessment settings, countries, and languages. We only found that reliability of the RCADS was significantly moderated by the version of RCADS. However, these differences in reliability between different versions of the RCADS were slight and can be due to the number of items. We did not examine factor structure, factorial invariance across gender, age, or country, and test-retest reliability of the RCADS. The RCADS is a reliable instrument for cross-cultural use, with the advantage of providing more information with a low number of items in the assessment of both anxiety and depression symptoms in children and adolescents. Copyright © 2017. Published by Elsevier B.V.
Cole, Jason C; Ito, Diane; Chen, Yaozhu J; Cheng, Rebecca; Bolognese, Jennifer; Li-McLeod, Josephine
2014-09-04
There is a lack of validated instruments to measure the level of burden of Alzheimer's disease (AD) on caregivers. The Impact of Alzheimer's Disease on Caregiver Questionnaire (IADCQ) is a 12-item instrument with a seven-day recall period that measures AD caregivers' burden across emotional, physical, social, financial, sleep, and time aspects. The primary objectives of this study were to evaluate the psychometric properties of the IADCQ administered on the Web and to determine the most appropriate scoring algorithm. A national sample of 200 unpaid AD caregivers participated in this study by completing the Web-based version of the IADCQ and the Short Form-12 Health Survey Version 2 (SF-12v2™). The SF-12v2 was used to measure convergent validity of IADCQ scores and to provide an understanding of the overall health-related quality of life of sampled AD caregivers. The IADCQ survey was also completed four weeks later by a randomly selected subgroup of 50 participants to assess test-retest reliability. Confirmatory factor analysis (CFA) was implemented to test the dimensionality of the IADCQ items. Classical item-level and scale-level psychometric analyses were conducted to estimate psychometric characteristics of the instrument. Test-retest reliability was assessed to evaluate the instrument's stability and consistency over time. Virtually none (2%) of the respondents had either floor or ceiling effects, indicating the IADCQ covers an ideal range of burden. A single-factor model obtained appropriate goodness of fit and provided evidence that a simple sum score of the 12 items of the IADCQ can be used to measure AD caregivers' burden. Scale-level reliability was supported with a coefficient alpha of 0.93 and an intra-class correlation coefficient (for test-retest reliability) of 0.68 (95% CI: 0.50-0.80). Low-moderate negative correlations were observed between the IADCQ and scales of the SF-12v2. 
The study findings suggest the IADCQ has appropriate psychometric characteristics as a unidimensional, Web-based measure of AD caregiver burden and is supported by strong model fit statistics from CFA, high degree of item-level reliability, good internal consistency, moderate test-retest reliability, and moderate convergent validity. Additional validation of the IADCQ is warranted to ensure invariance between the paper-based and Web-based administration and to determine an appropriate responder definition.
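The coefficient alpha reported for internal consistency in the entry above can be illustrated with a minimal sketch. It assumes the standard Cronbach's alpha formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores); the function name `cronbach_alpha` and the toy data are hypothetical and are not taken from the study.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    Illustrative helper (hypothetical name); uses sample variances throughout.
    """
    k = len(scores[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    if any(len(row) != k for row in scores):
        raise ValueError("every respondent must answer the same k items")
    # Variance of each item across respondents
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Variance of the simple sum score across respondents
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Toy data, not from any study: three respondents, three items
toy = [[1, 1, 1], [2, 2, 2], [3, 3, 3]]
print(round(cronbach_alpha(toy), 3))
```

With perfectly consistent items, as in the toy data, alpha reaches its maximum of 1; items that vary independently of one another push it toward 0.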
2011-01-01
Background Since stress is hypothesized to play a role in the etiology of obesity during adolescence, research on associations between adolescent stress and obesity-related parameters and behaviours is essential. Due to lack of a well-established recent stress checklist for use in European adolescents, the study investigated the reliability and validity of the Adolescent Stress Questionnaire (ASQ) for assessing perceived stress in European adolescents. Methods The ASQ was translated into the languages of the participating cities (Ghent, Stockholm, Vienna, Zaragoza, Pecs and Athens) and was implemented within the HELENA cross-sectional study. A total of 1140 European adolescents provided a valid ASQ, comprising 10 component scales, used for internal reliability (Cronbach α) and construct validity (confirmatory factor analysis or CFA). Contributions of socio-demographic (gender, age, pubertal stage, socio-economic status) characteristics to the ASQ score variances were investigated. Two-hundred adolescents also provided valid saliva samples for cortisol analysis to compare with the ASQ scores (criterion validity). Test-retest reliability was investigated using two ASQ assessments from 37 adolescents. Results Cronbach α-values of the ASQ scales (0.57 to 0.88) demonstrated a moderate internal reliability of the ASQ, and intraclass correlation coefficients (0.45 to 0.84) established an insufficient test-retest reliability of the ASQ. The adolescents' gender (girls had higher stress scores than boys) and pubertal stage (those in a post-pubertal development had higher stress scores than others) significantly contributed to the variance in ASQ scores, while their age and socio-economic status did not. CFA results showed that the original scale construct fitted moderately with the data in our European adolescent population. 
Only in boys, four out of 10 ASQ scale scores were a significant positive predictor for baseline wake-up salivary cortisol, suggesting a rather poor criterion validity of the ASQ, especially in girls. Conclusions In our European adolescent sample, the ASQ had an acceptable internal reliability and construct validity and the adolescents' gender and pubertal stage systematically contributed to the ASQ variance, but its test-retest reliability and criterion validity were rather poor. Overall, the utility of the ASQ for assessing perceived stress in adolescents across Europe is uncertain and some aspects require further examination. PMID:21943341
Edouard, Pascal; Junge, Astrid; Kiss-Polauf, Marianna; Ramirez, Christophe; Sousa, Monica; Timpka, Toomas; Branco, Pedro
2018-03-01
The quality of epidemiological injury data depends on the reliability of reporting to an injury surveillance system. Ascertaining whether all physicians/physiotherapists report the same information for the same injury case is of major interest to determine data validity. The aim of this study was therefore to analyse the reliability of data collection by means of the inter-rater reliability. Cross-sectional survey. During the 2016 European Athletics Advanced Athletics Medicine Course in Amsterdam, all national medical teams were asked to complete seven virtual case reports on a standardised injury report form using the same definitions and classifications of injuries as the international athletics championships injury surveillance protocol. The completeness of data and the Fleiss' kappa coefficients for the inter-rater reliability were calculated for: sex, age, event, circumstance, location, type, assumed cause and estimated time-loss. Forty-one team physicians and physiotherapists of national medical teams participated in the study (response rate 89.1%). Data completeness was 96.9%. The Fleiss' kappa coefficients were: almost perfect for sex (k=1), injury location (k=0.991), event (k=0.953), circumstance (k=0.942), and age (k=0.870), moderate for type (k=0.507), fair for assumed cause (k=0.394), and poor for estimated time-loss (k=0.155). The injury surveillance system used during international athletics championships provided reliable data for "sex", "location", "event", "circumstance", and "age". More caution should be taken for "assumed cause" and "type", and even more for "estimated time-loss". This injury surveillance system displays satisfactory data quality (reliable data and high data completeness), and thus can be recommended as a tool to collect epidemiological information on injuries during international athletics championships. Copyright © 2018 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
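As a rough illustration of the Fleiss' kappa statistic used in the entry above, the sketch below computes it from a table of category counts. It assumes the textbook definition (mean observed per-subject agreement corrected for chance agreement from the marginal category proportions); the function name `fleiss_kappa` and the toy counts are hypothetical and do not reproduce the study's data.

```python
def fleiss_kappa(counts):
    """Fleiss' kappa from counts[i][j] = number of raters who put subject i in category j.

    Illustrative helper (hypothetical name); every subject must be rated by
    the same number of raters. Undefined when all ratings fall in one category.
    """
    n_subjects = len(counts)
    n_raters = sum(counts[0])
    if any(sum(row) != n_raters for row in counts):
        raise ValueError("each subject needs the same number of ratings")
    n_categories = len(counts[0])
    total = n_subjects * n_raters
    # Proportion of all ratings that fall in each category
    p_j = [sum(row[j] for row in counts) / total for j in range(n_categories)]
    # Observed agreement among rater pairs for each subject
    p_i = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_bar = sum(p_i) / n_subjects   # mean observed agreement
    p_e = sum(p * p for p in p_j)   # agreement expected by chance
    return (p_bar - p_e) / (1 - p_e)


# Toy counts, not from the study: 4 cases, 4 raters, 2 categories, full agreement
print(fleiss_kappa([[4, 0], [0, 4], [4, 0], [0, 4]]))
```

A value of 1 means the raters agree on every case; values at or below 0 mean agreement no better than chance.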
Pruitt, Sandi L; Jeffe, Donna B; Yan, Yan; Schootman, Mario
2012-04-01
Limited psychometric research has examined the reliability of self-reported measures of neighbourhood conditions, the effect of measurement error on associations between neighbourhood conditions and health, and potential differences in the reliabilities between neighbourhood strata (urban vs rural and low vs high poverty). We assessed overall and stratified reliability of self-reported perceived neighbourhood conditions using five scales (social and physical disorder, social control, social cohesion, fear) and four single items (multidimensional neighbouring). We also assessed measurement error-corrected associations of these conditions with self-rated health. Using random-digit dialling, 367 women without breast cancer (matched controls from a larger study) were interviewed twice, 2-3 weeks apart. Test-retest (intraclass correlation coefficients (ICC)/weighted κ) and internal consistency reliability (Cronbach's α) were assessed. Differences in reliability across neighbourhood strata were tested using bootstrap methods. Regression calibration corrected estimates for measurement error. All measures demonstrated satisfactory internal consistency (α ≥ 0.70) and either moderate (ICC/κ=0.41-0.60) or substantial (ICC/κ=0.61-0.80) test-retest reliability in the full sample. Internal consistency did not differ by neighbourhood strata. Test-retest reliability was significantly lower among rural (vs urban) residents for two scales (social control, physical disorder) and two multidimensional neighbouring items; test-retest reliability was higher for physical disorder and lower for one multidimensional neighbouring item among the high (vs low) poverty strata. After measurement error correction, the magnitude of associations between neighbourhood conditions and self-rated health were larger, particularly in the rural population. Research is needed to develop and test reliable measures of perceived neighbourhood conditions relevant to the health of rural populations.
Aggio, Daniel; Fairclough, Stuart; Knowles, Zoe; Graves, Lee
2016-01-01
Adaptation of physical activity self-report questionnaires is sometimes required to reflect the activity behaviours of diverse populations. The processes used to modify self-report questionnaires though are typically underreported. This two-phased study used a formative approach to investigate the validity and reliability of the Physical Activity Questionnaire for Adolescents (PAQ-A) in English youth. Phase one examined test content and response process validity and subsequently informed a modified version of the PAQ-A. Phase two assessed the validity and reliability of the modified PAQ-A. In phase one, focus groups (n = 5) were conducted with adolescents (n = 20) to investigate test content and response processes of the original PAQ-A. Based on evidence gathered in phase one, a modified version of the questionnaire was administered to participants (n = 169, 14.5 ± 1.7 years) in phase two. Internal consistency and test-retest reliability were assessed using Cronbach's alpha and intra-class correlations, respectively. Spearman correlations were used to assess associations between modified PAQ-A scores and accelerometer-derived physical activity, self-reported fitness and physical activity self-efficacy. Phase one revealed that the original PAQ-A was unrepresentative for English youth and that item comprehension varied. Contextual and population/cultural-specific modifications were made to the PAQ-A for use in the subsequent phase. In phase two, modified PAQ-A scores had acceptable internal consistency (α = 0.72) and test-retest reliability (ICC = 0.78). Modified PAQ-A scores were significantly associated with objectively assessed moderate-to-vigorous physical activity (r = 0.39), total physical activity (r = 0.42), self-reported fitness (r = 0.35), and physical activity self-efficacy (r = 0.32) (p ≤ 0.01). The modified PAQ-A had acceptable internal consistency and test-retest reliability. 
Modified PAQ-A scores displayed weak-to-moderate correlations with objectively measured physical activity, self-reported fitness, and self-efficacy providing evidence of satisfactory criterion and construct validity, respectively. Further testing with more diverse English samples is recommended to provide a more complete assessment of the tool.
Hayes, Corey J.; Bhandari, Naleen Raj; Kathe, Niranjan; Payakachat, Nalin
2017-01-01
Limited evidence exists on how non-cancer pain (NCP) affects an individual’s health-related quality of life (HRQoL). This study aimed to validate the Medical Outcomes Study Short Form-12 Version 2 (SF-12v2), a generic measure of HRQoL, in a NCP cohort using the Medical Expenditure Panel Survey Longitudinal Files. The SF Mental Component Summary (MCS12) and SF Physical Component Summary (PCS12) were tested for reliability (internal consistency and test-retest reliability) and validity (construct: convergent and discriminant; criterion: concurrent and predictive). A total of 15,716 patients with NCP were included in the final analysis. The MCS12 and PCS12 demonstrated high internal consistency (Cronbach’s alpha and Mosier’s alpha > 0.8), and moderate and high test-retest reliability, respectively (MCS12 intraclass correlation coefficient (ICC): 0.64; PCS12 ICC: 0.73). Both scales were significantly associated with a number of chronic conditions (p < 0.05). The PCS12 was strongly correlated with perceived health (r = 0.52) but weakly correlated with perceived mental health (r = 0.25). The MCS12 was moderately correlated with perceived mental health (r = 0.42) and perceived health (r = 0.33). Increasing PCS12 and MCS12 scores were significantly associated with lower odds of reporting future physical and cognitive limitations (PCS12: OR = 0.90 95%CI: 0.89–0.90, MCS12: OR = 0.94 95%CI: 0.93–0.94). In summary, the SF-12v2 is a reliable and valid measure of HRQoL for patients with NCP. PMID:28445438
Valentim, Daniela Pereira; Sato, Tatiana de Oliveira; Comper, Maria Luiza Caíres; Silva, Anderson Martins da; Boas, Cristiana Villas; Padula, Rosimeire Simprini
There are very few observational methods for the analysis of biomechanical exposure available in Brazilian Portuguese. This study aimed to cross-culturally adapt and test the measurement properties of the Rapid Upper Limb Assessment (RULA) and Strain Index (SI). The cross-cultural adaptation and the testing of measurement properties followed the Beaton et al. and COSMIN guidelines, respectively. Several tasks that required static posture and/or repetitive motion of the upper limbs were evaluated (n>100). The intra-rater reliability for the RULA ranged from poor to almost perfect (k: 0.00-0.93), and for the SI from poor to excellent (ICC 2.1: 0.05-0.99). The inter-rater reliability was very poor for the RULA (k: -0.12 to 0.13) and ranged from very poor to moderate for the SI (ICC 2.1: 0.00-0.53). The agreement was good for the RULA (75-100% intra-rater, and 42.24-100% inter-rater) and for the SI (EPM: -1.03% to 1.97% intra-rater, and -0.17% to 1.51% inter-rater). The internal consistency was appropriate for the RULA (α=0.88) and low for the SI (α=0.65). Moderate construct validity was observed between the RULA and SI for wrist/hand-wrist posture (rho: 0.61) and strength/intensity of exertion (rho: 0.39). The adapted versions of the RULA and SI presented semantic and cultural equivalence for Brazilian Portuguese. Reliability estimates for the RULA and SI ranged from very poor to almost perfect. The internal consistency of the RULA was better than that of the SI. The correlation between methods was moderate only for muscle request/movement repetition. Previous training is mandatory for the use of observational methods for biomechanical exposure assessment, although it does not guarantee good reproducibility of these measures. Copyright © 2017 Associação Brasileira de Pesquisa e Pós-Graduação em Fisioterapia. Published by Elsevier Editora Ltda. All rights reserved.
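The "ICC 2.1" values in the entry above can be made concrete with a small sketch. It assumes that the notation refers to the two-way random-effects, absolute-agreement, single-rater coefficient commonly written ICC(2,1), computed from the two-way ANOVA mean squares; the function name `icc_2_1` and the ratings table are illustrative only and are not taken from the study.

```python
def icc_2_1(ratings):
    """ICC(2,1) for ratings[i][j] = score given to subject i by rater j.

    Illustrative helper (hypothetical name): two-way random effects,
    absolute agreement, single rater, from ANOVA mean squares.
    """
    n = len(ratings)      # subjects
    k = len(ratings[0])   # raters
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]
    # Sums of squares for subjects, raters, and total
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_error = ss_total - ss_rows - ss_cols
    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


# Illustrative ratings: 6 subjects scored by 4 raters
table = [
    [9, 2, 5, 8],
    [6, 1, 3, 2],
    [8, 4, 6, 8],
    [7, 1, 2, 6],
    [10, 5, 6, 9],
    [6, 2, 4, 7],
]
print(round(icc_2_1(table), 2))  # about 0.29
```

Because this form penalizes systematic differences between raters, a rater who scores consistently lower than the others (the second column in the table) pulls the coefficient down even when the rank order of subjects is similar.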
Voss, Christine; Dean, Paige H; Gardner, Ross F; Duncombe, Stephanie L; Harris, Kevin C
2017-01-01
To assess the criterion validity, internal consistency, reliability and cut-point for the Physical Activity Questionnaire for Children (PAQ-C) and Adolescents (PAQ-A) in children and adolescents with congenital heart disease, a special population at high cardiovascular risk in whom physical activity has not been extensively evaluated. We included 84 participants (13.6±2.9 yrs, 50% female) with simple (37%), moderate (31%), or severe congenital heart disease (27%), as well as cardiac transplant recipients (6%), from BC Children's Hospital, Canada. They completed the PAQ-C (≤11 yrs, n = 28) or PAQ-A (≥12 yrs, n = 56), and also wore a triaxial accelerometer (GT3X+ or GT9X) over the right hip for 7 days (n = 59 met valid wear time criteria). Median daily moderate-to-vigorous physical activity was 46.9 minutes per day (IQR 31.6-61.8) and 25% met physical activity guidelines, defined as ≥60 minutes of moderate-to-vigorous physical activity per day. Median PAQ-Score was 2.6 (IQR 1.9-3.0). PAQ-Scores were significantly related to accelerometry-derived metrics of physical activity (rho = 0.44-0.55, all p<0.01) and sedentary behaviour (rho = -0.53, p<0.001). Internal consistency was high (α = 0.837), as was reliability (stability) of PAQ-Scores over a 4-month period (ICC = 0.73, 95%CI 0.55-0.84; p<0.001). We identified that a PAQ-Score cut-point of 2.87 discriminates between those meeting physical activity guidelines and those who do not in the combined PAQ-C and PAQ-A samples (area under the curve = 0.80, 95%CI 0.67-0.92). Validity and reliability of the PAQ in children and adolescents with CHD were comparable to or stronger than those reported in previous studies of healthy children. Therefore, the PAQ may be used to estimate general levels of physical activity in children and adolescents with CHD.
Reliability and validity of the McDonald Play Inventory.
McDonald, Ann E; Vigen, Cheryl
2012-01-01
This study examined the ability of a two-part self-report instrument, the McDonald Play Inventory, to reliably and validly measure the play activities and play styles of 7- to 11-yr-old children and to discriminate between the play of neurotypical children and children with known learning and developmental disabilities. A total of 124 children ages 7-11 recruited from a sample of convenience and a subsample of 17 parents participated in this study. Reliability estimates yielded moderate correlations for internal consistency, total test intercorrelations, and test-retest reliability. Validity estimates were established for content and construct validity. The results suggest that a self-report instrument yields reliable and valid measures of a child's perceived play performance and discriminates between the play of children with and without disabilities. Copyright © 2012 by the American Occupational Therapy Association, Inc.
Oyeyemi, Adewale L; Oyeyemi, Adetoyeje Y; Adegoke, Babatunde O; Oyetoke, Fatima O; Aliyu, Habeeb N; Aliyu, Salamatu U; Rufai, Adamu A
2011-11-22
Accurate assessment of physical activity is important in determining the risk for chronic diseases such as cardiovascular disease, stroke, type 2 diabetes, cancer and obesity. The absence of culturally relevant measures in indigenous languages could pose challenges to epidemiological studies on physical activity in developing countries. The purpose of this study was to translate and cross-culturally adapt the Short International Physical Activity Questionnaire (IPAQ-SF) to the Hausa language, and to evaluate the validity and reliability of the Hausa version of IPAQ-SF in Nigeria. The English IPAQ-SF was translated into the Hausa language, synthesized, back translated, and subsequently subjected to expert committee review and pre-testing. The final product (Hausa IPAQ-SF) was tested in a cross-sectional study for concurrent (correlation with the English version) and construct validity, and test-retest reliability in a sample of 102 apparently healthy adults. The Hausa IPAQ-SF has good concurrent validity with Spearman correlation coefficients (ρ) ranging from 0.78 for vigorous activity (Min Week-1) to 0.92 for total physical activity (Metabolic Equivalent of Task [MET]-Min Week-1), but poor construct validity, with cardiorespiratory fitness (ρ = 0.21, p = 0.01) and body mass index (ρ = 0.22, p = 0.04) significantly correlated with only moderate activity and sitting time (Min Week-1), respectively. Reliability was good for vigorous (ICC = 0.73, 95% C.I = 0.55-0.84) and total physical activity (ICC = 0.61, 95% C.I = 0.47-0.72), but fair for moderate activity (ICC = 0.33, 95% C.I = 0.12-0.51), and few meaningful differences were found in the gender and socioeconomic status specific analyses. The Hausa IPAQ-SF has acceptable concurrent validity and test-retest reliability for vigorous-intensity activity, walking, sitting and total physical activity, but demonstrated only fair construct validity for moderate and sitting activities. 
The Hausa IPAQ-SF can be used for physical activity measurements in Nigeria, but further construct validity testing with objective measures such as an accelerometer is needed.
Shipley, Hilary; Guedes, Alonso; Graham, Lynelle; Goudie-DeAngelis, Elizabeth; Wendt-Hornickle, Erin
2018-05-01
Objectives The objective of this study was to determine the inter-rater reliability and convergent validity of the Colorado State University Feline Acute Pain Scale (CSU-FAPS) in a preliminary appraisal of its performance in a clinical teaching setting. Methods Sixty-eight female cats were assessed for pain after ovariohysterectomy. A cohort of 21 cats was examined independently by four raters (two board-certified anesthesiologists and two anesthesia residents) with the CSU-FAPS, and intra-class correlation coefficient (ICC) was used to determine inter-rater reliability. Weighted Cohen's kappa was used to determine inter-rater reliability centered on the 'need to reassess analgesic plan' (dichotomous scale). A separate cohort of 47 cats was evaluated independently by two raters (one board-certified anesthesiologist and one veterinary small animal rotating intern) using the CSU-FAPS and the Glasgow Composite Measure Pain Scale (CMPS-Feline), and Spearman rank-order correlation was determined to assess convergent validity. Reliability was interpreted using Altman's classification as very good, good, moderate, fair and poor. Validity was considered adequate if correlation coefficients were between 0.4 and 0.8. Results The ICC was 0.61 for anesthesiologists and 0.67 for residents, indicating good reliability. Weighted Cohen's kappa was 0.79 for anesthesiologists and 0.44 for residents, indicating moderate to good reliability. The Spearman rank correlation indicated a statistically significant ( P = 0.0003) positive correlation (0.31; 95% confidence interval 0.14-0.46) between the CSU-FAPS and the CMPS-Feline. Conclusions and relevance The CSU-FAPS showed moderate-to-good inter-rater reliability when used by veterinarians to assess pain level or need to reassess analgesic plan after ovariohysterectomy in cats. The validity fell short of current guidelines for correlation coefficients and further refinement and testing are warranted to improve its performance.
Development and validation of the brief esophageal dysphagia questionnaire.
Taft, T H; Riehl, M; Sodikoff, J B; Kahrilas, P J; Keefer, L; Doerfler, B; Pandolfino, J E
2016-12-01
Esophageal dysphagia is common in gastroenterology practice and has multiple etiologies. A complication for some patients with dysphagia is food impaction. A valid and reliable questionnaire to rapidly evaluate esophageal dysphagia and impaction symptoms can aid the gastroenterologist in gathering information to inform treatment approach and further evaluation, including endoscopy. 1638 patients participated over two study phases. 744 participants completed the Brief Esophageal Dysphagia Questionnaire (BEDQ) for phase 1; 869 completed the BEDQ, Visceral Sensitivity Index, Gastroesophageal Reflux Disease Questionnaire, and Hospital Anxiety and Depression Scale for phase 2. Demographic and clinical data were obtained via the electronic medical record. The BEDQ was evaluated for internal consistency, split-half reliability, ceiling and floor effects, and construct validity. The BEDQ demonstrated excellent internal consistency, reliability, and construct validity. The symptom frequency and severity scales scored above the standard acceptable cutoffs for reliability while the impaction subscale yielded poor internal consistency and split-half reliability; thus the impaction items were deemed qualifiers only and removed from the total score. No significant ceiling or floor effects were found with the exception of 1 item, and inter-item correlations fell within accepted ranges. Construct validity was supported by moderate yet significant correlations with other measures. The predictive ability of the BEDQ was small but significant. The BEDQ represents a rapid, reliable, and valid assessment tool for esophageal dysphagia with food impaction for clinical practice that differentiates between patients with major motor dysfunction and mechanical obstruction. © 2016 John Wiley & Sons Ltd.
Development and Validation of the Brief Esophageal Dysphagia Questionnaire
Taft, Tiffany H.; Riehl, Megan; Sodikoff, Jamie B.; Kahrilas, Peter J.; Keefer, Laurie; Doerfler, Bethany; Pandolfino, John E.
2017-01-01
Background Esophageal dysphagia is common in gastroenterology practice and has multiple etiologies. A complication for some patients with dysphagia is food impaction. A valid and reliable questionnaire to rapidly evaluate esophageal dysphagia and impaction symptoms can aid the gastroenterologist in gathering information to inform treatment approach and further evaluation, including endoscopy. Methods 1,638 patients participated over two study phases. 744 participants completed the Brief Esophageal Dysphagia Questionnaire (BEDQ) for phase 1; 869 completed the BEDQ, Visceral Sensitivity Index, Gastroesophageal Reflux Disease Questionnaire, and Hospital Anxiety and Depression Scale for phase 2. Demographic and clinical data were obtained via the electronic medical record. The BEDQ was evaluated for internal consistency, split-half reliability, ceiling and floor effects, and construct validity. Key Results The BEDQ demonstrated excellent internal consistency, reliability, and construct validity. The symptom frequency and severity scales scored above the standard acceptable cutoffs for reliability while the impaction subscale yielded poor internal consistency and split-half reliability; thus the impaction items were deemed qualifiers only and removed from the total score. No significant ceiling or floor effects were found with the exception of 1 item, and inter-item correlations fell within accepted ranges. Construct validity was supported by moderate yet significant correlations with other measures. The predictive ability of the BEDQ was small but significant. Conclusions & Inferences The BEDQ represents a rapid, reliable and valid assessment tool for esophageal dysphagia with food impaction for clinical practice that differentiates between patients with major motor dysfunction and mechanical obstruction. PMID:27380834
Farazdaghi, Mohammad Reza; Mansoori, Ali; Vosoughi, Omid; Kordi Yoosefinejad, Amin
2017-05-01
Elbow joint pathologies are highly prevalent in Persian-speaking countries. A reliable, low-cost method such as an appropriate questionnaire is needed for the early diagnosis of elbow joint disorders. Among existing questionnaires, the Patient-Rated Elbow Evaluation (PREE) is an accepted and commonly used scale for evaluating patients' pain and dysfunction. The aims of the study were to cross-culturally adapt the PREE and to identify the psychometric properties of the Persian PREE. The original version of the PREE was translated and cross-culturally adapted to Persian according to the guidelines by Beaton et al. Seventy-three patients and thirty-nine healthy people were enrolled in the study. Test-retest reliability was evaluated using the ICC, and internal consistency using Cronbach's alpha and item-total correlation. Construct validity was investigated using the Disabilities of the Arm, Shoulder, and Hand (DASH) questionnaire and the physical component scale of the SF-36 (PCS). To determine a cutoff point for discriminating patients from non-patients, a receiver operating characteristic curve was plotted. The Persian PREE displayed high internal consistency (Cronbach's alpha = 0.91) and had acceptable ICC values in the subscales and total score (ICC > 0.90). A positive moderate correlation with DASH (r = 0.66, P < 0.001) and a negative moderate correlation with the PCS of the SF-36 (r = -0.44, P < 0.001) were observed. A cutoff point of 13.16 was determined for the Persian PREE. The Persian PREE exhibited promising validity and reliability. The findings supported its applicability in clinical situations and were consistent with those for the original version.
Offenbächer, Martin; Sauer, Sebastian; Kohls, Niko; Waltz, Millard; Schoeps, Peter
2012-10-01
Our objectives were to translate the Quality of Life Scale (QOLS) into German and to evaluate its reliability and validity for the use in patients with fibromyalgia (FMS). Together with German versions of the Fibromyalgia Impact Questionnaire (FIQ), the SF-36, a tender point count (TPC) and other questionnaires, we administered the QOLS to 146 patients with FMS. Patients were asked about the severity of pain today (VAS) and the duration of symptoms. Test-retest reliability was assessed using Spearman's correlations. Internal consistency was evaluated with Cronbach's alpha. Construct validity of the QOLS was evaluated by correlating the QOLS with the FIQ, the SF-36, the Beck Depression Inventory (BDI), and the Symptom Checklist (SCL-90-R) as well as with the pain variables. An exploratory factor analysis (EFA) was also conducted. Mean age was 53.1 years. Means were for pain today 6.8 and for duration of symptoms 11.8 years. Test-retest reliability for the total QOLS was rho = .91. Internal consistency was α = .90. Low-to-moderate correlations were obtained between the QOLS and the total FIQ (rho = -.42), the SF-36 (e.g. physical functioning rho = .37; mental health rho = .56) as well as the pain variables (VAS rho = -.11 ns; TPC rho = -.20). Psychological variables were moderately to substantially correlated with the QOLS (e.g. BDI rho = -.61). An EFA suggested a three-factor solution. The QOLS-G is a reliable and valid instrument for measuring quality of life in German patients with FMS.
Lombarts, Kiki M J M H; Ferguson, Andrew; Hollmann, Markus W; Malling, Bente; Arah, Onyebuchi A
2016-11-01
Given the increasing international recognition of clinical teaching as a competency and regulation of residency training, evaluation of anesthesiology faculty teaching is needed. The System for Evaluating Teaching Qualities (SETQ) Smart questionnaires were developed for assessing teaching performance of faculty in residency training programs in different countries. This study investigated (1) the structure, (2) the psychometric qualities of the new tools, and (3) the number of residents' evaluations needed per anesthesiology faculty to use the instruments reliably. Two SETQ Smart questionnaires-for faculty self-evaluation and for resident evaluation of faculty-were developed. A multicenter survey was conducted among 399 anesthesiology faculty and 430 residents in six countries. Statistical analyses included exploratory factor analysis, reliability analysis using Cronbach α, item-total scale correlations, interscale correlations, comparison of composite scales to global ratings, and generalizability analysis to assess residents' evaluations needed per faculty. In total, 240 residents completed 1,622 evaluations of 247 faculty. The SETQ Smart questionnaires revealed six teaching qualities consisting of 25 items. Cronbach α's were very high (greater than 0.95) for the overall SETQ Smart questionnaires and high (greater than 0.80) for the separate teaching qualities. Interscale correlations were all within the acceptable range of moderate correlation. Overall, questionnaire and scale scores correlated moderately to highly with the global ratings. For reliable feedback to individual faculty, three to five resident evaluations are needed. The first internationally piloted questionnaires for evaluating individual anesthesiology faculty teaching performance can be reliably, validly, and feasibly used for formative purposes in residency training.
Solomon, Nadia; Fields, Paul J.; Tamarozzi, Francesca; Brunetti, Enrico; Macpherson, Calum N. L.
2017-01-01
Cystic echinococcosis (CE), a parasitic zoonosis, results in cyst formation in the viscera. Cyst morphology depends on developmental stage. In 2003, the World Health Organization (WHO) published a standardized ultrasound (US) classification for CE, for use among experts as a standard of comparison. This study examined the reliability of this classification. Eleven international CE and US experts completed an assessment of eight WHO classification images and 88 test images representing cyst stages. Inter- and intraobserver reliability and observer performance were assessed using Fleiss' and Cohen's kappa. Interobserver reliability was moderate for WHO images (κ = 0.600, P < 0.0001) and substantial for test images (κ = 0.644, P < 0.0001), with substantial to almost perfect interobserver reliability for stages with pathognomonic signs (CE1, CE2, and CE3) for WHO (0.618 < κ < 0.904) and test images (0.642 < κ < 0.768). Comparisons of expert performances against the majority classification for each image were significant for WHO (0.413 < κ < 1.000, P < 0.005) and test images (0.718 < κ < 0.905, P < 0.0001); and intraobserver reliability was significant for WHO (0.520 < κ < 1.000, P < 0.005) and test images (0.690 < κ < 0.896, P < 0.0001). Findings demonstrate moderate to substantial interobserver and substantial to almost perfect intraobserver reliability for the WHO classification, with substantial to almost perfect interobserver reliability for pathognomonic stages. This confirms experts' abilities to reliably identify WHO-defined pathognomonic signs of CE, demonstrating that the WHO classification provides a reproducible way of staging CE. PMID:28070008
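The agreement statistics above are kappa coefficients. A minimal sketch of Cohen's kappa for two sets of ratings of the same images follows, for example one observer's first and second reading; the function name and the stage labels in the toy data are assumptions for illustration, and the multi-rater Fleiss' kappa used for interobserver agreement is not shown:

```python
from collections import Counter


def cohen_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two equal-length sequences of category labels."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(ratings_a)
    # Observed proportion of exact agreement.
    p_observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    # Agreement expected by chance from each rater's marginal frequencies.
    counts_a, counts_b = Counter(ratings_a), Counter(ratings_b)
    categories = set(counts_a) | set(counts_b)
    p_expected = sum(counts_a[c] * counts_b[c] for c in categories) / (n * n)
    if p_expected == 1:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (p_observed - p_expected) / (1 - p_expected)


# Toy readings (hypothetical): identical readings give kappa = 1.
first_reading = ["CE1", "CE2", "CE1", "CE3"]
second_reading = ["CE1", "CE2", "CE1", "CE3"]
```

Kappa corrects raw agreement for the agreement expected by chance, which is why it can be 0 even when half of the ratings match.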
Rating scales for dystonia in cerebral palsy: reliability and validity.
Monbaliu, E; Ortibus, E; Roelens, F; Desloovere, K; Deklerck, J; Prinzie, P; de Cock, P; Feys, H
2010-06-01
This study investigated the reliability and validity of the Barry-Albright Dystonia Scale (BADS), the Burke-Fahn-Marsden Movement Scale (BFMMS), and the Unified Dystonia Rating Scale (UDRS) in patients with bilateral dystonic cerebral palsy (CP). Three raters independently scored videotapes of 10 patients (five males, five females; mean age 13 y 3 mo, SD 5 y 2 mo, range 5-22 y). One patient each was classified at levels I-IV in the Gross Motor Function Classification System and six patients were classified at level V. Reliability was measured by (1) intraclass correlation coefficient (ICC) for interrater reliability, (2) standard error of measurement (SEM) and smallest detectable difference (SDD), and (3) Cronbach's alpha for internal consistency. Validity was assessed by Pearson's correlations among the three scales used and by content analysis. Moderate to good interrater reliability was found for total scores of the three scales (ICC: BADS=0.87; BFMMS=0.86; UDRS=0.79). However, many subitems showed low reliability, in particular for the UDRS. SEM and SDD were respectively 6.36% and 17.72% for the BADS, 9.88% and 27.39% for the BFMMS, and 8.89% and 24.63% for the UDRS. High internal consistency was found. Pearson's correlations were high. Content validity showed insufficient accordance with the new CP definition and classification. Our results support the internal consistency and concurrent validity of the scales; however, taking into consideration the limitations in reliability, including the large SDD values and the content validity, further research on methods of assessment of dystonia is warranted.
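The abstract above does not state how SEM and SDD were computed. A sketch under one common convention, assumed here, is SEM = SD × sqrt(1 − ICC) and SDD = 1.96 × sqrt(2) × SEM; the function names and example numbers are illustrative only:

```python
import math


def sem_from_icc(sd, icc):
    """Standard error of measurement from a score SD and a reliability ICC."""
    if not 0 <= icc <= 1:
        raise ValueError("icc must lie between 0 and 1")
    return sd * math.sqrt(1 - icc)


def sdd_from_sem(sem, z=1.96):
    """Smallest detectable difference at the confidence level implied by z."""
    # sqrt(2) accounts for measurement error in both of the two measurements.
    return z * math.sqrt(2) * sem


# Hypothetical example: SD of 10 points and ICC of 0.75 give SEM = 5 points.
example_sem = sem_from_icc(10.0, 0.75)
example_sdd = sdd_from_sem(example_sem)
```

The SDD values reported above are roughly 2.77 to 2.79 times the corresponding SEM values, which is in line with this convention, although the formula actually used is not given in the abstract.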
Rathi, Sangeeta; Taylor, Nicholas F; Gee, Jamie; Green, Rodney A
2016-12-01
Ultrasonography is an economical and non-invasive method for measuring real-time joint movements. Although physiotherapists are increasingly using ultrasound imaging for rotator cuff disorders, there is a lack of evidence on their reliability in using ultrasonography to measure glenohumeral translation. The aim of this study was to evaluate the reliability of a physiotherapist in measuring anterior and posterior glenohumeral joint translation with ultrasound. Study design: within day reliability. Anterior and posterior glenohumeral translations were measured at rest, in response to passive accessory motion testing force, and with isometric internal and external rotation in 12 young healthy adults. All the measurements were made in real time by a physiotherapist and an experienced sonographer in two positions (neutral and abducted) and in two views (anterior and posterior). Intra-rater and inter-rater reliability were expressed using intraclass correlation coefficients (ICC) and measurement error (mm). Intra-rater reliability was good for both raters (ICC_P: 0.86-0.98; ICC_S: 0.85-0.96). The inter-rater reliability between the physiotherapist and sonographer was moderate to good for posterior measurements (ICC 0.50-0.75) and poor to moderate for anterior measurements (ICC 0.31-0.53). For both intra-rater and inter-rater measurements, posterior translation was more reliable than the anterior translation with smaller measurement errors (posterior: 0.1-0.2 mm, anterior: 0.2-0.3 mm). A physiotherapist with minimal training was reliable in measuring glenohumeral joint translations. The ultrasound method was reliable for repeated measurement of both anterior and posterior glenohumeral translations with posterior measurements being more reliable than anterior. This method is recommended for future research to investigate the stabilising role of rotator cuff muscles. Copyright © 2016 Elsevier Ltd. All rights reserved.
Álvarez-Gallardo, Inmaculada C; Soriano-Maldonado, Alberto; Segura-Jiménez, Víctor; Carbonell-Baeza, Ana; Estévez-López, Fernando; McVeigh, Joseph G; Delgado-Fernández, Manuel; Ortega, Francisco B
2016-03-01
Objective: To examine the construct validity of the International FItness Scale (IFIS) (ie, self-reported fitness) against objectively measured physical fitness in women with fibromyalgia and in healthy women; and to study the test-retest reliability of the IFIS in women with fibromyalgia. Design: Cross-sectional study. Setting: Fibromyalgia patient support groups. Participants: Women with fibromyalgia (n=413) and healthy women (controls) (n=195) for validity purposes and women with fibromyalgia (n=101) for the reliability study. The total sample was N=709. Interventions: Not applicable. Main Outcome Measures: Fitness level was both self-reported (IFIS) and measured using performance-based fitness tests. For the reliability study the IFIS was completed on 2 occasions, 1 week apart. Results: Women with fibromyalgia who reported average fitness had better measured fitness than those reporting very poor fitness (all P<.001, except 6-minute walk test where P<.05), with similar trends observed in healthy control women. The test-retest reliability of the IFIS, as measured by the average weighted κ, was .45. Conclusions: The IFIS was able to identify women with fibromyalgia who had very low fitness and distinguish them from those with higher fitness levels. Furthermore, the IFIS was moderately reliable in women with fibromyalgia. Copyright © 2016 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Beyhun, Nazim Ercument; Can, Gamze; Tiryaki, Ahmet; Karakullukcu, Serdar; Bulut, Bekir; Yesilbas, Sehbal; Kavgaci, Halil; Topbas, Murat
2016-01-01
Background Needs based biopsychosocial distress instrument for cancer patients (CANDI) is a scale based on needs arising due to the effects of cancer. Objectives The aim of this research was to determine the reliability and validity of the CANDI scale in the Turkish language. Patients and Methods The study was performed with the participation of 172 cancer patients aged 18 and over. Factor analysis (principal components analysis) was used to assess construct validity. Criterion validities were tested by computing Spearman correlation between CANDI and hospital anxiety depression scale (HADS), and brief symptom inventory (BSI) (convergent validity) and quality of life scales (FACT-G) (divergent validity). Test-retest reliabilities and internal consistencies were measured with intraclass correlation (ICC) and Cronbach-α. Results A three-factor solution (emotional, physical and social) was found with factor analysis. Internal reliability (α = 0.94) and test-retest reliability (ICC = 0.87) were significantly high. Correlations between CANDI and HADS (rs = 0.67), and BSI (rs = 0.69) and FACT-G (rs = -0.76) were moderate and significant in the expected direction. Conclusions CANDI is a valid and reliable scale in cancer patients with a three-factor structure (emotional, physical and social) in the Turkish language. PMID:27621931
Psychometric properties of a Dutch version of the behavior problems inventory-01 (BPI-01).
Dumont, Eric; Kroes, Diana; Korzilius, Hubert; Didden, Robert; Rojahn, Johannes
2014-03-01
There are only a limited number of Dutch validated measurement instruments for measuring behavioral problems in people with a moderate to profound intellectual disability. In this study, the psychometric properties of a Dutch version of the behavior Problems Inventory-01 (BPI-01; Rojahn et al., 2001) have been investigated among 195 people with a moderate to profound intellectual disability who live in a residential facility. The BPI-01 was completed by 42 informants (staff members) of 23 care units. The inter-rater reliability, intra-rater reliability and internal consistency turned out to be good. Factor analysis confirmed two of the three a priori factors and the third factor was a mix of self-injurious (SIB) behavior and stereotypic behavior. The BPI-01 was compared to the Aberrant Behavior Checklist (Aman et al., 1985a) and showed a good convergent validity. This study shows that a Dutch version of the BPI-01 has good psychometric properties for measuring behavior problems in individuals with moderate to profound intellectual disability. Copyright © 2014 Elsevier Ltd. All rights reserved.
Indrebø, Kirsten Lerum; Andersen, John Roger; Natvig, Gerd Karin
2014-01-01
The purpose of this study was to adapt the Ostomy Adjustment Scale to a Norwegian version and to assess its construct validity and 2 components of its reliability (internal consistency and test-retest reliability). One hundred fifty-eight of 217 patients (73%) with a colostomy, ileostomy, or urostomy participated in the study. Slightly more than half (56%) were men. Their mean age was 64 years (range, 26-91 years). All respondents had undergone ostomy surgery at least 3 months before participation in the study. The Ostomy Adjustment Scale was translated into Norwegian according to standard procedures for forward and backward translation. The questionnaire was sent to the participants via regular post. The Cronbach alpha and test-retest were computed to assess reliability. Construct validity was evaluated via correlations between each item and score sums; correlations were used to analyze relationships between the Ostomy Adjustment Scale and the 36-item Short Form Health Survey, the Quality of Life Scale, the Hospital Anxiety & Depression Scale, and the General Self-Efficacy Scale. The Cronbach alpha was 0.93, and test-retest reliability r was 0.69. The average correlation quotient item to sum score was 0.49 (range, 0.31-0.73). Results showed moderate negative correlations between the Ostomy Adjustment Scale and the Hospital Anxiety and Depression Scale (-0.37 and -0.40), and moderate positive correlations between the Ostomy Adjustment Scale and the 36-item Short Form Health Survey, the Quality of Life Scale, and the General Self-Efficacy Scale (0.30-0.45) with the exception of the pain domain in the Short Form 36 (0.28). Regression analysis showed linear associations between the Ostomy Adjustment Scale and sociodemographic and clinical variables with the exception of education. The Norwegian language version of the Ostomy Adjustment Scale was found to possess construct validity, along with internal consistency and test-retest reliability. 
The instrument is sensitive to sociodemographic and clinical variables pertinent to persons with urostomies, colostomies, and ileostomies.
Mau-Moeller, Anett; Gube, Martin; Felser, Sabine; Feldhege, Frank; Weippert, Matthias; Husmann, Florian; Tischer, Thomas; Bader, Rainer; Bruhn, Sven; Behrens, Martin
2017-08-17
To determine intrasession and intersession reliability of strength measurements and hamstrings to quadriceps strength imbalance ratios (H/Q ratios) using the new isoforce dynamometer. Repeated measures. Exercise science laboratory. Thirty healthy subjects (15 females, 15 males, 27.8 years). Coefficient of variation (CV) and intraclass correlation coefficients (ICC) were calculated for (1) strength parameters, that is peak torque, mean work, and mean power for concentric and eccentric maximal voluntary contractions; isometric maximal voluntary torque (IMVT); rate of torque development (RTD), and (2) H/Q ratios, that is conventional concentric, eccentric, and isometric H/Q ratios (Hcon/Qcon at 60 deg/s, 120 deg/s, and 180 deg/s, Hecc/Qecc at -60 deg/s and Hiso/Qiso) and functional eccentric antagonist to concentric agonist H/Q ratios (Hecc/Qcon and Hcon/Qecc). High reliability: CV <10%, ICC >0.90; moderate reliability: CV between 10% and 20%, ICC between 0.80 and 0.90; low reliability: CV >20%, ICC <0.80. (1) Strength parameters: (a) high intrasession reliability for concentric, eccentric, and isometric measurements, (b) moderate-to-high intersession reliability for concentric and eccentric measurements and IMVT, and (c) moderate-to-high intrasession reliability but low intersession reliability for RTD. (2) H/Q ratios: (a) moderate-to-high intrasession reliability for conventional ratios, (b) high intrasession reliability for functional ratios, (c) higher intersession reliability for Hcon/Qcon and Hiso/Qiso (moderate to high) than Hecc/Qecc (low to moderate), and (d) higher intersession reliability for conventional H/Q ratios (low to high) than functional H/Q ratios (low to moderate). The results have confirmed the reliability of strength parameters and the most frequently used H/Q ratios.
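The reliability grades in the abstract above are defined by thresholds on the coefficient of variation (CV) and the ICC. The sketch below encodes those thresholds; how the authors handled cases where CV and ICC fall in different bands is not stated, so taking the lower of the two grades is an assumption made here, as are the function names:

```python
def _grade_cv(cv_percent):
    """Grade from CV alone: 2 = high (<10%), 1 = moderate (10-20%), 0 = low (>20%)."""
    if cv_percent < 10:
        return 2
    return 1 if cv_percent <= 20 else 0


def _grade_icc(icc):
    """Grade from ICC alone: 2 = high (>0.90), 1 = moderate (0.80-0.90), 0 = low (<0.80)."""
    if icc > 0.90:
        return 2
    return 1 if icc >= 0.80 else 0


def classify_reliability(cv_percent, icc):
    """Combine both criteria, keeping the lower grade (an assumed tie-break rule)."""
    labels = ("low", "moderate", "high")
    return labels[min(_grade_cv(cv_percent), _grade_icc(icc))]
```

For example, `classify_reliability(8, 0.95)` returns `"high"`, while a CV of 15% with the same ICC is graded `"moderate"` under this conservative rule.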
Muhamad, Zailani; Ramli, Ayiesah; Amat, Salleh
2015-05-01
The aim of this study was to determine the content validity, internal consistency, test-retest reliability and inter-rater reliability of the Clinical Competency Evaluation Instrument (CCEVI) in assessing the clinical performance of physiotherapy students. This study was carried out between June and September 2013 at Universiti Kebangsaan Malaysia (UKM), Kuala Lumpur, Malaysia. A panel of 10 experts was identified to establish content validity by evaluating and rating each of the items used in the CCEVI with regard to their relevance in measuring students' clinical competency. A total of 50 UKM undergraduate physiotherapy students were assessed throughout their clinical placement to determine the construct validity of these items. The instrument's reliability was determined through a cross-sectional study involving a clinical performance assessment of 14 final-year undergraduate physiotherapy students. The content validity index of the entire CCEVI was 0.91, while the proportion of agreement on the content validity indices ranged from 0.83-1.00. The CCEVI construct validity was established with factor loading of ≥0.6, while internal consistency (Cronbach's alpha) overall was 0.97. Test-retest reliability of the CCEVI was confirmed with a Pearson's correlation range of 0.91-0.97 and an intraclass correlation coefficient range of 0.95-0.98. Inter-rater reliability of the CCEVI domains ranged from 0.59 to 0.97 on initial and subsequent assessments. This pilot study confirmed the content validity of the CCEVI. It also showed high internal consistency and moderate to excellent inter-rater reliability. However, additional refinement in the wording of the CCEVI items, particularly in the domains of safety and documentation, is recommended to further improve the validity and reliability of the instrument.
Apivatgaroon, Adinun; Angthong, Chayanin; Sanguanjit, Prakasit; Chernchujit, Bancha
2016-10-01
To develop a Thai version of the Kujala score and evaluate its validity and reliability. The Thai version of the Kujala score was developed using the forward-backward translation protocol. Forty-nine patients with patellofemoral pain syndrome (PFPS) completed Thai versions of the questionnaires, including the Kujala score, the Short Form-36 (SF-36) and the International Knee Documentation Committee (IKDC) Subjective Knee Form. Validity was tested by correlating the scores. Reliability was assessed using test-retest reliability and internal consistency. The Thai version of the Kujala score showed a good correlation with the Thai IKDC Subjective Knee Form (Pearson's correlation coefficient, r = 0.74; p < 0.01) and a moderate correlation with the Thai SF-36 subscales of physical component summary, total score and role physical (r = 0.586, 0.571 and 0.524, respectively; p < 0.01). The test-retest reliability was excellent with an intra-class correlation coefficient of 0.908 (p < 0.001; 95% CI [0.842-0.947]). The internal consistency was strong with Cronbach's alpha of 0.952 (p < 0.001). No floor and ceiling effects were observed. The Thai version of the Kujala score has shown good validity and reliability. This score can be effectively used for evaluating Thai patients with PFPS. Implications for Rehabilitation The Kujala score is a self-administered questionnaire for patients with PFPS. The validity and reliability of the Thai version of the Kujala score are comparable with those of other versions (Turkish, Chinese and Persian). The Thai version of the Kujala score has been shown to have validity and reliability in Thai PFPS patients and can be used for clinical evaluation and in research.
Proposing a Parkinson's disease-specific tremor scale from the MDS-UPDRS.
Forjaz, Maria João; Ayala, Alba; Testa, Claudia M; Bain, Peter G; Elble, Rodger; Haubenberger, Dietrich; Rodriguez-Blazquez, Carmen; Deuschl, Günther; Martinez-Martin, Pablo
2015-07-01
This article proposes an International Parkinson and Movement Disorder Society (MDS)-UPDRS tremor-based scale and describes its measurement properties, with a view to developing an improved scale for assessing tremor in Parkinson's disease (PD). This was a cross-sectional, multicenter study of 435 PD patients. Rasch analysis was performed on the 11 MDS-UPDRS tremor items. Construct validity, precision, and test-retest reliability were also analyzed. After some modifications, which included removal of an item owing to redundancy, the obtained MDS-UPDRS tremor scale showed moderate reliability, unidimensionality, absence of differential item functioning, satisfactory convergent validity with medication, and better precision than the raw sum score. However, the scale displayed a floor effect and a need for more items measuring lower levels of tremor. The MDS-UPDRS tremor scale provides linear scores that can be used to assess tremor in PD in a valid, reliable way. The scale might benefit from modifications and studies that analyze its responsiveness. © 2015 International Parkinson and Movement Disorder Society.
Kim, Ki-Hyun; Anthwal, A; Pandey, Sudhir Kumar; Kabir, Ehsanul; Sohn, Jong Ryeul
2010-11-01
In this study, a series of GC calibration experiments were conducted to examine the feasibility of the thermal desorption approach for the quantification of five carbonyl compounds (acetaldehyde, propionaldehyde, butyraldehyde, isovaleraldehyde, and valeraldehyde) in conjunction with two internal standard compounds. The gaseous working standards of carbonyls were calibrated with the aid of thermal desorption as a function of standard concentration and of loading volume. The detection properties were then compared against two types of external calibration data sets derived by fixed standard volume and fixed standard concentration approach. According to this comparison, the fixed standard volume-based calibration of carbonyls should be more sensitive and reliable than its fixed standard concentration counterpart. Moreover, the use of internal standard can improve the analytical reliability of aromatics and some carbonyls to a considerable extent. Our preliminary test on real samples, however, indicates that the performance of internal calibration, when tested using samples of varying dilution ranges, can be moderately different from that derivable from standard gases. It thus suggests that the reliability of calibration approaches should be examined carefully with the considerations on the interactive relationships between the compound-specific properties and the operation conditions of the instrumental setups.
Development and evaluation of an instrument for assessing brief behavioral change interventions.
Strayer, Scott M; Martindale, James R; Pelletier, Sandra L; Rais, Salehin; Powell, Jon; Schorling, John B
2011-04-01
To develop an observational coding instrument for evaluating the fidelity and quality of brief behavioral change interventions based on the behavioral theories of the 5 A's, Stages of Change and Motivational Interviewing. Content and face validity were assessed prior to an intervention where psychometric properties were evaluated with a prospective cohort of 116 medical students. Properties assessed included the inter-rater reliability of the instrument, internal consistency of the full scale and sub-scales and descriptive statistics of the instrument. Construct validity was assessed based on students' scores. Inter-rater reliability for the instrument was 0.82 (intraclass correlation). Internal consistency for the full scale was 0.70 (KR20). Internal consistencies for the sub-scales were as follows: MI intervention component (KR20=.7); stage-appropriate MI-based intervention (KR20=.55); MI spirit (KR20=.5); appropriate assessment (KR20=.45) and appropriate assisting (KR20=.56). The instrument demonstrated good inter-rater reliability and moderate overall internal consistency when used to assess brief behavioral change interventions performed by medical students. This practical instrument can be used with minimal training and demonstrates promising psychometric properties when evaluated with medical students counseling standardized patients. Further testing is required to evaluate its usefulness in clinical settings. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.
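Internal consistency above is reported as KR20, the Kuder-Richardson formula 20 for items scored 0 or 1. A minimal sketch follows; the function name, input layout and toy data are assumptions for illustration, and the population form of the total-score variance is used so that it matches the p(1 − p) item variances:

```python
def kr20(responses):
    """Kuder-Richardson 20 for a respondents x items table of 0/1 scores."""
    n = len(responses)
    k = len(responses[0])
    if n < 2 or k < 2:
        raise ValueError("need at least two respondents and two items")
    totals = [sum(row) for row in responses]
    mean_total = sum(totals) / n
    # Population variance of the total scores.
    var_total = sum((t - mean_total) ** 2 for t in totals) / n
    if var_total == 0:
        raise ValueError("total scores have zero variance")
    # Sum over items of p * (1 - p), where p is the proportion scoring 1.
    pq_sum = sum((sum(col) / n) * (1 - sum(col) / n) for col in zip(*responses))
    return (k / (k - 1)) * (1 - pq_sum / var_total)


# Toy data (hypothetical): items that always agree give KR-20 = 1.
all_or_nothing = [[1, 1, 1], [0, 0, 0]]
```

KR-20 is the special case of Cronbach's alpha for dichotomous items, so the sub-scale values above can be read on the same scale as the alpha values reported elsewhere in this listing.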
Hosford, Charles C; Siders, William A
2010-10-01
Strategies to facilitate learning include using knowledge of students' learning style preferences to inform students and their teachers. Aims of this study were to evaluate the factor structure, internal consistency, and temporal stability of medical student responses to the Index of Learning Styles (ILS) and determine its appropriateness as an instrument for medical education. The ILS assesses preferences on four dimensions: sensing/intuitive information perceiving, visual/verbal information receiving, active/reflective information processing, and sequential/global information understanding. Students entering the 2002-2007 classes completed the ILS; some completed the ILS again after 2 and 4 years. Analyses of responses supported the ILS's intended structure and moderate reliability. Students had moderate preferences for sensing and visual learning. This study provides evidence supporting the appropriateness of the ILS for assessing learning style preferences in medical students.
[Psychometric properties of a self-efficacy scale for physical activity in Brazilian adults].
Rech, Cassiano Ricardo; Sarabia, Tais Taiana; Fermino, Rogério César; Hallal, Pedro Curi; Reis, Rodrigo Siqueira
2011-04-01
To test the validity and reliability of a self-efficacy scale for physical activity (PA) in Brazilian adults. A self-efficacy scale was applied jointly with a multidimensional questionnaire through face-to-face interviews with 1,418 individuals (63.4% women) aged ≥ 18 years. The scale was submitted to validity (factorial and construct) and reliability analysis (internal consistency and temporal stability). A test-retest procedure was conducted with 74 individuals to evaluate temporal stability. Exploratory factor analyses revealed two independent factors: self-efficacy for walking and self-efficacy for moderate and vigorous PA (MVPA). Together, these two factors explained 65.4% of the total variance of the scale (20.9% and 44.5% for walking and MVPA, respectively). Cronbach's alpha values were 0.83 for walking and 0.90 for MVPA, indicating high internal consistency. Both factors were significantly and positively correlated (rho ≥ 0.17, P < 0.001) with quality of life indicators (health perception, self-satisfaction, and energy for daily activities), indicating an adequate construct validity. The scale's validity, internal consistency, and reliability were adequate to evaluate self-efficacy for PA in Brazilian adults.
An Update on the Clinical Utility of the Children's Post-Traumatic Cognitions Inventory.
McKinnon, Anna; Smith, Patrick; Bryant, Richard; Salmon, Karen; Yule, William; Dalgleish, Tim; Dixon, Clare; Nixon, Reginald D V; Meiser-Stedman, Richard
2016-06-01
The Children's Post-Traumatic Cognitions Inventory (CPTCI) is a self-report questionnaire that measures maladaptive cognitions in children and young people following exposure to trauma. In this study, the psychometric properties of the CPTCI were examined in further detail with the objective of furthering its utility as a clinical tool. Specifically, we investigated the CPTCI's discriminant validity, test-retest reliability, and the potential for the development of a short form of the measure. Three samples (London, East Anglia, Australia) of children and young people exposed to trauma (N = 535; 7-17 years old) completed the CPTCI and a structured clinical interview to measure posttraumatic stress disorder (PTSD) symptoms between 1 and 6 months following trauma. Test-retest reliability was investigated in a subsample of 203 cases. The results showed that a score in the range of 46 to 48 on the CPTCI was indicative of clinically significant appraisals as determined by the presence of PTSD. The measure also had moderate-to-high test-retest reliability (r = .78) over a 2-month period. The Children's Post-Traumatic Cognitions Inventory-Short Form (CPTCI-S) had excellent internal consistency (α = .92), and moderate-to-high test-retest reliability (r = .78). The examination of construct validity showed the model had an excellent fitting factor structure (Comparative Fit index = 0.95, Tucker-Lewis index = 0.91, Root Mean Square Error of Approximation = .07). A score ranging from 16 to 18 was the best cutoff point on the CPTCI-S, in that it was indicative of clinically significant appraisals as determined by the presence of PTSD. Based on these results, we concluded that the CPTCI is a useful tool to support the practice of clinicians and that the CPTCI-S has excellent psychometric properties. Copyright © 2016 International Society for Traumatic Stress Studies.
Vertical and Horizontal Jump Capacity in International Cerebral Palsy Football Players.
Reina, Raúl; Iturricastillo, Aitor; Sabido, Rafael; Campayo-Piernas, Maria; Yanci, Javier
2018-05-01
To evaluate the reliability and validity of vertical and horizontal jump tests in football players with cerebral palsy (FPCP) and to analyze the jump performance differences between current International Federation for Cerebral Palsy Football functional classes (ie, FT5-FT8). A total of 132 international parafootballers (25.8 [6.7] y; 70.0 [9.1] kg; 175.7 [7.3] cm; 22.8 [2.8] kg·m⁻²; and 10.7 [7.5] y training experience) participated in the study. The participants were classified according to the International Federation for Cerebral Palsy Football classification rules, and a group of 39 players without cerebral palsy was included in the study as a control group. Football players' vertical and horizontal jump performance was assessed. All the tests showed good to excellent relative intrasession reliability scores, both in FPCP and in the control group (intraclass correlation = .78-.97, SEM < 10.5%). Significant between-groups differences (P < .001) were obtained in the countermovement jump, standing broad jump, 4 bounds for distance, and triple hop for distance dominant leg and nondominant leg. The control group performed higher/farther jumps with regard to all the FPCP classes, obtaining significant differences and moderate to large effect sizes (ESs) (.85 < ES < 5.54, P < .01). Players in FT8 class (less severe impairments) had significantly higher scores in all the jump tests than players in the lower classes (ES = moderate to large, P < .01). The vertical and horizontal jump tests performed in this study could be applied to the classification procedures and protocols for FPCP.
Validity and Reliability of the Upper Extremity Work Demands Scale.
Jacobs, Nora W; Berduszek, Redmar J; Dijkstra, Pieter U; van der Sluis, Corry K
2017-12-01
Purpose To evaluate validity and reliability of the upper extremity work demands (UEWD) scale. Methods Participants from different levels of physical work demands, based on the Dictionary of Occupational Titles categories, were included. A historical database of 74 workers was added for factor analysis. Criterion validity was evaluated by comparing observed and self-reported UEWD scores. To assess structural validity, a factor analysis was executed. For reliability, the difference between two self-reported UEWD scores, the smallest detectable change (SDC), test-retest reliability and internal consistency were determined. Results Fifty-four participants were observed at work and 51 of them filled in the UEWD twice with a mean interval of 16.6 days (SD 3.3, range = 10-25 days). Criterion validity of the UEWD scale was moderate (r = .44, p = .001). Factor analysis revealed that 'force and posture' and 'repetition' subscales could be distinguished with Cronbach's alpha of .79 and .84, respectively. Reliability was good; there was no significant difference between repeated measurements. An SDC of 5.0 was found. Test-retest reliability was good (intraclass correlation coefficient for agreement = .84) and all item-total correlations were >.30. There were two pairs of highly related items. Conclusion Reliability of the UEWD scale was good, but criterion validity was moderate. Based on current results, a modified UEWD scale (2 items removed, 1 item reworded, divided into 2 subscales) was proposed. Since observation appeared to be an inappropriate gold standard, we advise to investigate other types of validity, such as construct validity, in further research.
Validation of the MISSCARE-BRASIL survey - A tool to assess missed nursing care.
Siqueira, Lillian Dias Castilho; Caliri, Maria Helena Larcher; Haas, Vanderlei José; Kalisch, Beatrice; Dantas, Rosana Aparecida Spadoti
2017-12-21
To analyze the metric validity and reliability properties of the MISSCARE-BRASIL survey. Methodological research was conducted by assessing construct validity and reliability via confirmatory factor analysis, known-groups validation, convergent construct validation, analysis of internal consistency and test-retest reliability. The sample consisted of 330 nursing professionals, of whom 86 participated in the retest phase. Of the 330 participants, 39.7% were aides, 33% technicians, 20.9% nurses, and 6.4% nurses with administrative roles. Confirmatory factor analysis demonstrated that the Brazilian Portuguese version of the instrument is adequately adjusted to the dimensional structure the scale authors originally proposed. The correlation between "satisfaction with position/role" and "satisfaction with teamwork" and the survey's missed care variables was moderate (Spearman's coefficient = 0.35; p < 0.001). The results of the Student's t-test indicated known-group validity. Professionals from closed units reported lower levels of missed care in comparison with the other units. The reliability showed a strong correlation, with the exception of "institutional management/leadership style" (intraclass correlation coefficient (ICC) = 0.15; p = 0.04). The internal consistency was adequate (Cronbach's alpha was greater than 0.70). The MISSCARE-BRASIL was valid and reliable in the group studied. The application of the MISSCARE-BRASIL can contribute to identifying solutions for missed nursing care.
Aguiar, A S; Bataglion, C; Visscher, C M; Bevilaqua Grossi, D; Chaves, T C
2017-07-01
Fear of movement (kinesiophobia) seems to play an important role in the development of chronic pain. However, for temporomandibular disorders (TMD), there is a scarcity of studies about this topic. The Tampa Scale for Kinesiophobia for TMD (TSK/TMD) is the most widely used instrument to measure fear of movement and it is not available in Brazilian Portuguese. The purpose of this study was to culturally adapt the TSK/TMD to Brazilian Portuguese and to assess its psychometric properties regarding internal consistency, reliability, and construct and structural validity. A total of 100 female patients with chronic TMD participated in the validation process of the TSK/TMD-Br. The intraclass correlation coefficient (ICC) was used for statistical analysis of reliability (test-retest), Cronbach's alpha for internal consistency, Spearman's rank correlation for construct validity and confirmatory factor analysis (CFA) for structural validity. CFA endorsed the pre-specified model with two domains and 12 items (Activity Avoidance - AA/Somatic Focus - SF) and all items obtained a factor loading greater than 0.4. Acceptable levels of reliability were found (ICC > 0.75) for all questions and domains of the TSK/TMD-Br. For internal consistency, Cronbach's α was 0.78 for both domains. Moderate correlations (0.40 < r < 0.60) were observed for 84% of the analyses conducted between TSK/TMD-Br scores versus catastrophising, depression and jaw functional limitation. The 12-item, two-factor TSK/TMD-Br demonstrated sound psychometric properties (transcultural validity, reliability, internal consistency and structural validity). The instrument can therefore be used in clinical settings and for research purposes.
Kahraman, Turhan; Genç, Arzu; Göz, Evrim
2016-10-01
The purpose of this study was to linguistically and culturally adapt the Nordic Musculoskeletal Questionnaire (NMQ) for use in Turkey, and to examine the psychometric properties of this adapted version. The cross-cultural adaptation was achieved by translating the items from the original version, with back-translation performed by independent mother-tongue translators, followed by committee review. Reliability (internal consistency and test-retest) was examined for 198 participants who completed the NMQ twice (with a 1-week interval). Construct validity was examined with data from 126 participants from the same population, who completed four further questionnaires related to the body regions described in the NMQ. The internal consistency was excellent (Cronbach's alpha = 0.896). The test-retest reliability was examined with the prevalence-adjusted bias-adjusted kappa (PABAK) and all items showed moderate to almost perfect reliability (PABAK = 0.57-0.90). Participants with a musculoskeletal problem in a related region had significantly more disability/pain, as assessed by the relevant questionnaires (p < 0.001), indicating that the NMQ had good construct validity. This study provided considerable evidence that the Turkish version of the NMQ has appropriate psychometric properties, including good test-retest reliability, internal consistency and construct validity. It can be used for screening and epidemiological investigations of musculoskeletal symptoms. Implications for Rehabilitation The Nordic Musculoskeletal Questionnaire (NMQ) can be used for the screening of musculoskeletal problems. The NMQ allows comparison of musculoskeletal problems in different body regions in epidemiological studies with large numbers of participants. The Turkish version of the NMQ can be used for rehabilitation due to its appropriate psychometric properties, including good test-retest reliability, internal consistency and construct validity.
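The prevalence-adjusted bias-adjusted kappa used above for test-retest agreement depends only on the observed proportion of agreement. For yes/no items it reduces to PABAK = 2 × Po − 1. The sketch below assumes binary items and uses made-up responses; the function name `pabak` is illustrative and this is not the authors' analysis code.

```python
def pabak(first, second):
    """PABAK for two administrations of one binary (yes/no) item.

    `first` and `second` are equal-length sequences of responses,
    one entry per participant, in the same participant order.
    """
    if len(first) != len(second) or len(first) == 0:
        raise ValueError("need two non-empty sequences of equal length")
    agreements = sum(a == b for a, b in zip(first, second))
    observed_agreement = agreements / len(first)
    # With two categories, chance agreement is fixed at 0.5,
    # so kappa simplifies to 2 * Po - 1.
    return 2 * observed_agreement - 1


# Hypothetical test and retest answers from ten participants.
test_answers = [1, 1, 0, 0, 1, 0, 1, 1, 0, 0]
retest_answers = [1, 1, 0, 0, 1, 0, 1, 0, 0, 0]
value = pabak(test_answers, retest_answers)
```

In this toy example nine of ten answers agree, so the observed agreement is 0.9.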
Development and validation of the Smartphone Addiction Inventory (SPAI).
Lin, Yu-Hsuan; Chang, Li-Ren; Lee, Yang-Han; Tseng, Hsien-Wei; Kuo, Terry B J; Chen, Sue-Huei
2014-01-01
The aim of this study was to develop a self-administered scale based on the special features of smartphones. The reliability and validity of the Smartphone Addiction Inventory (SPAI) were demonstrated. A total of 283 participants were recruited from Dec. 2012 to Jul. 2013 to complete a set of questionnaires, including a 26-item SPAI modified from the Chinese Internet Addiction Scale and a phantom vibration and ringing syndrome questionnaire. There were 260 males and 23 females, with a mean age of 22.9 ± 2.0 years. Exploratory factor analysis, internal-consistency test, test-retest, and correlation analysis were conducted to verify the reliability and validity of the SPAI. Correlations between each subscale and phantom vibration and ringing were also explored. Exploratory factor analysis yielded four factors: compulsive behavior, functional impairment, withdrawal and tolerance. Test-retest reliabilities (intraclass correlations = 0.74-0.91) and internal consistency (Cronbach's α = 0.94) were all satisfactory. The four subscales had moderate to high correlations (0.56-0.78), but had no or very low correlation to phantom vibration/ringing syndrome. This study provides evidence that the SPAI is a valid and reliable, self-administered screening tool to investigate smartphone addiction. Phantom vibration and ringing might be independent entities of smartphone addiction.
Kiltz, Uta; van der Heijde, Désirée; Boonen, Annelies; Akkoc, Nurullah; Bautista-Molano, Wilson; Burgos-Vargas, Ruben; Wei, James Cheng-Chung; Chiowchanwisawakit, Praveena; Dougados, Maxime; Duruoz, M Tuncay; Elzorkany, Bassel Kamal; Gaydukova, Inna; Gensler, Lianne S; Gilio, Michele; Grazio, Simeon; Gu, Jieruo; Inman, Robert D; Kim, Tae-Jong; Navarro-Compan, Victoria; Marzo-Ortega, Helena; Ozgocmen, Salih; Pimentel Dos Santos, Fernando; Schirmer, Michael; Stebbings, Simon; Van den Bosch, Filip E; van Tubergen, Astrid; Braun, Juergen
2018-06-01
To evaluate construct validity, interpretability, reliability and responsiveness as well as determination of cut-off points for good and poor health within the original English version and the 18 translations of the disease-specific Assessment of Spondyloarthritis international Society Health Index (ASAS HI) in 23 countries worldwide in patients with spondyloarthritis (SpA). A representative sample of patients with SpA fulfilling the ASAS classification criteria for axial (axSpA) or peripheral SpA was used. The construct validity of the ASAS HI was tested using Spearman correlation with several standard health outcomes for axSpA. Test-retest reliability was assessed by intraclass correlation coefficients (ICCs) in patients with stable disease (interval 4-7 days). In patients who required an escalation of therapy because of high disease activity, responsiveness was tested after 2-24 weeks using standardised response mean (SRM). Among the 1548 patients, 64.9% were men, with a mean (SD) age 42.0 (13.4) years. Construct validity ranged from low (age: 0.10) to high (Bath Ankylosing Spondylitis Functioning Index: 0.71). Internal consistency was high (Cronbach's α of 0.93). The reliability among 578 patients was good (ICC=0.87 (95% CI 0.84 to 0.89)). Responsiveness among 246 patients was moderate-large (SRM=-0.44 for non-steroidal anti-inflammatory drugs, -0.69 for conventional synthetic disease-modifying antirheumatic drug and -0.85 for tumour necrosis factor inhibitor). The smallest detectable change was 3.0. Values ≤5.0 have balanced specificity to distinguish good health as opposed to moderate health, and values ≥12.0 are specific to represent poor health as opposed to moderate health. The ASAS HI proved to be valid, reliable and responsive. It can be used to evaluate the impact of SpA and its treatment on functioning and health. Furthermore, comparison of disease impact between populations is possible.
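The standardised response mean (SRM) used above for responsiveness is conventionally the mean change score divided by the standard deviation of the change scores. The following is a minimal sketch under that conventional definition, with invented scores; the function name `standardised_response_mean` is an assumption and the abstract does not describe the computation in detail.

```python
from statistics import mean, stdev


def standardised_response_mean(baseline, follow_up):
    """SRM = mean(change) / SD(change), with change = follow_up - baseline."""
    if len(baseline) != len(follow_up) or len(baseline) < 2:
        raise ValueError("need two sequences of equal length (>= 2)")
    changes = [after - before for before, after in zip(baseline, follow_up)]
    spread = stdev(changes)  # sample SD of the change scores
    if spread == 0:
        raise ValueError("change scores have zero variance")
    return mean(changes) / spread


# Hypothetical index scores before and after treatment escalation.
before_scores = [10, 12, 14]
after_scores = [8, 8, 8]
srm = standardised_response_mean(before_scores, after_scores)
```

A negative SRM, as in the abstract, indicates that scores fell between the two assessments.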
Dean, Paige H.; Gardner, Ross F.; Duncombe, Stephanie L.; Harris, Kevin C.
2017-01-01
Objective To assess the criterion validity, internal consistency, reliability and cut-point for the Physical Activity Questionnaire for Children (PAQ-C) and Adolescents (PAQ-A) in children and adolescents with congenital heart disease, a special population at high cardiovascular risk in whom physical activity has not been extensively evaluated. Methods We included 84 participants (13.6±2.9 yrs, 50% female) with simple (37%), moderate (31%), or severe congenital heart disease (27%), as well as cardiac transplant recipients (6%), from BC Children’s Hospital, Canada. They completed the PAQ-C (≤11 yrs, n = 28) or PAQ-A (≥12 yrs, n = 56), and also wore a triaxial accelerometer (GT3X+ or GT9X) over the right hip for 7 days (n = 59 met valid wear time criteria). Results Median daily moderate-to-vigorous physical activity was 46.9 minutes per day (IQR 31.6–61.8) and 25% met physical activity guidelines defined as ≥60 minutes of moderate-to-vigorous physical activity per day. Median PAQ-score was 2.6 (IQR 1.9–3.0). PAQ-Scores were significantly related to accelerometry-derived metrics of physical activity (rho = 0.44–0.55, all p<0.01) and sedentary behaviour (rho = -0.53, p<0.001). Internal consistency was high (α = 0.837), as was reliability (stability) of PAQ-Scores over a 4-month period (ICC = 0.73, 95%CI 0.55–0.84; p<0.001). We identified that a PAQ-Score cut-point of 2.87 discriminates between those meeting physical activity guidelines and those that do not in the combined PAQ-C and PAQ-A samples (area under the curve = 0.80, 95%CI 0.67–0.92). Conclusion Validity and reliability of the PAQ in children and adolescents with CHD was comparable to or stronger than previous studies in healthy children. Therefore, the PAQ may be used to estimate general levels of physical activity in children and adolescents with CHD.
Reliability and Validity of the Work and Well-Being Inventory (WBI) for Employees.
Vendrig, A A; Schaafsma, F G
2018-06-01
Purpose The purpose of this study is to measure the psychometric properties of the Work and Wellbeing Inventory (WBI) (in Dutch: VAR-2), a screening tool that is used within occupational health care and rehabilitation. Our research question focused on the reliability and validity of this inventory. Methods Over the years seven different samples of workers, patients and sick-listed workers, varying in size between 89 and 912 participants (total: 2514), were used to measure the test-retest reliability, the internal consistency, the construct and concurrent validity, and the criterion and predictive validity. Results The 13 scales displayed good internal consistency and test-retest reliability. The construct validity of the WBI could clearly be demonstrated in both patients and healthy workers. Confirmatory factor analyses revealed a CFI >.90 for all scales. The depression scale predicted future work absenteeism (>6 weeks) because of a common mental disorder in healthy workers. The job strain scale and the illness behavior scale predicted long-term absenteeism (>3 months) in workers with short-term absenteeism. The illness behavior scale moderately predicted return to work in rehab patients attending an intensive multidisciplinary program. Conclusions The WBI is a valid and reliable tool for occupational health practitioners to screen for risk factors for prolonged or future sickness absence. With this tool they will have reliable indications for further advice and interventions to restore the work ability.
Öksüz, Çigdem; Alemdaroglu, Ipek; Kilinç, Muhammed; Abaoğlu, Hatice; Demirci, Cevher; Karahan, Sevilay; Yilmaz, Oznur; Yildirim, Sibel Aksu
2017-10-01
This study was performed to examine the reliability and validity of the Turkish version of ABILHAND-Kids questionnaire which assesses manual functions of children with neuromuscular diseases (NMDs). A cross sectional survey study design and Rasch analysis were used to assess the reliability and validity of the Turkish version of scale. Ninety-three children with different neuromuscular disorders and their parents were included in the study. The scale was applied to the parents with face-to-face interview twice; on their first visit and after an interval of 15 days. The test-retest reliability was assessed with intraclass correlation coefficient (ICC), and internal consistency of the multi-item subscales by calculating Cronbach alpha values. Brooke Upper Extremity Functional Classification (BUEFC) and Wee-Functional Independency Measurement (Wee-FIM) were correlated to determine the construct validity. The ICC value for the test/retest reliability was 0.94. The internal consistency was 0.81. Floor (1.1%) and ceiling (11.8%) effects were not significant. There were moderate correlations between the Turkish version of ABILHAND-Kids and Wee-FIM (0.67) and BUEFC (-0.37). Rasch analysis indicated good item fit, unidimensionality, and model fit. The Turkish version of ABILHAND-Kids questionnaire was found to be a reliable and valid scale for the assessment of the manual ability of children with NMDs.
Carvalho, Flávia A; Morelhão, Priscila K; Franco, Marcia R; Maher, Chris G; Smeets, Rob J E M; Oliveira, Crystian B; Freitas Júnior, Ismael F; Pinto, Rafael Z
2017-02-01
Although there is some evidence for reliability and validity of self-report physical activity (PA) questionnaires in the general adult population, it is unclear whether we can assume similar measurement properties in people with chronic low back pain (LBP). To determine the test-retest reliability of the International Physical Activity Questionnaire (IPAQ) long-version and the Baecke Physical Activity Questionnaire (BPAQ) and their criterion-related validity against data derived from accelerometers in patients with chronic LBP. Cross-sectional study. Patients with non-specific chronic LBP were recruited. Each participant attended the clinic twice (one week interval) and completed self-report PA. Accelerometer measures >7 days included time spent in moderate-and-vigorous physical activity, steps/day, counts/minute, and vector magnitude counts/minute. Intraclass Correlation Coefficients (ICC) and the Bland and Altman method were used to determine reliability, and Spearman's rho correlations were used for criterion-related validity. A total of 73 patients were included in our analyses. The reliability analyses revealed that the BPAQ and its subscales have moderate to excellent reliability (ICC(2,1): 0.61 to 0.81), whereas IPAQ and most IPAQ domains (except walking) showed poor reliability (ICC(2,1): 0.20 to 0.40). The Bland and Altman method revealed larger discrepancies for the IPAQ. For the validity analysis, questionnaire and accelerometer measures showed at best fair correlation (rho < 0.37). Although the BPAQ showed better reliability than the IPAQ long-version, neither questionnaire demonstrated acceptable validity against accelerometer data. These findings suggest that questionnaire and accelerometer PA measures should not be used interchangeably in this population.
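The Bland and Altman method named above summarises test-retest discrepancies as a mean difference (bias) with 95% limits of agreement, conventionally bias ± 1.96 × SD of the differences. The sketch below illustrates that convention on invented scores; the function name `bland_altman_limits` is an assumption and the study's exact settings are not given in the abstract.

```python
from statistics import mean, stdev


def bland_altman_limits(first, second, z=1.96):
    """Return (bias, lower limit, upper limit) for paired measurements.

    `first` and `second` are the two administrations of the same
    questionnaire, one entry per participant, in the same order.
    """
    if len(first) != len(second) or len(first) < 2:
        raise ValueError("need two sequences of equal length (>= 2)")
    differences = [a - b for a, b in zip(first, second)]
    bias = mean(differences)
    half_width = z * stdev(differences)  # sample SD of the differences
    return bias, bias - half_width, bias + half_width


# Hypothetical questionnaire scores at visit 1 and visit 2.
visit_one = [10, 12, 14]
visit_two = [9, 12, 15]
bias, lower, upper = bland_altman_limits(visit_one, visit_two)
```

Wider limits of agreement correspond to the "larger discrepancies" the abstract reports for the IPAQ.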
Psychometric evaluation of the Shared Decision-Making Instrument--Revised.
Bartlett, Jacqueline A; Peterson, Jane A
2013-02-01
The purpose of this study was to evaluate the psychometric properties of the Shared Decision-Making Inventory-Revised (SDMI-R) to measure four constructs (knowledge, attitudes, self-efficacy, and intent) theoretically defined as vital in discussing the human papillomavirus (HPV) disease and vaccine with clients. The SDMI-R was distributed to a sample (N = 1,525) of school nurses. Correlation matrices showed moderate to strong correlations, indicating adequate internal reliability. Reliability was satisfactory for the total instrument (α = .874) and for the Attitude, Self-Efficacy, and Intent subscales (α = .828, .917, and .891, respectively). Exploratory factor analysis revealed five components that explained 75.96% of the variance.
Development of a Chinese version of the Suicide Intent Scale.
Gau, Susan S F; Chen, Chin-Hung; Lee, Charles T C; Chang, Jung-Chen; Cheng, Andrew T A
2009-06-01
This study established the psychometric properties of the Chinese version of the Suicide Intent Scale (SIS) in a clinic- and community-based sample of 36 patients and 592 respondents, respectively. Results showed that the Chinese SIS demonstrated good inter-rater and test-retest reliability. Factor analysis generated three factors (Precautions, Planning, and Seriousness) explaining 92.9% of the total variance with high internal consistency. It was moderately correlated with depressive symptoms. Results suggest that the Chinese SIS is a reliable and valid instrument for use in assessing the extent of suicidal intention among subjects with deliberate self-harm in ethnic Chinese populations.
Harlan, E; Clark, L A
1999-06-01
Researchers and clinicians alike increasingly seek brief, reliable, and valid measures to obtain personality trait ratings from both selves and peers. We report the development of a paragraph-descriptor short form of a full-length personality assessment instrument, the Schedule for Nonadaptive and Adaptive Personality (SNAP) with both self- and other versions. Reliability and validity data were collected on a sample of 294 college students, from 90 of whom we also obtained parental ratings of their personality. Internal consistency reliability was good in both self- and parent data. The factorial structures of the self-report short and long forms were very similar. Convergence between parental ratings was moderately high. Self-parent convergence was variable, with lower agreement on scales assessing subjective distress than those assessing more observable behaviors; it also was stronger for higher order factors than for scales.
Alzyoud, Sukaina; Veeranki, Sreenivas P.; Kheirallah, Khalid A.; Shotar, Ali M.; Pbert, Lori
2016-01-01
Introduction: Waterpipe use among adolescents has been increasing progressively. Yet no studies have been reported that assess the validity and reliability of a nicotine dependence scale. The current study aims to assess the validity and reliability of an Arabic version of the modified Waterpipe Tolerance Questionnaire (WTQ) among school-going adolescent waterpipe users. Methods: In a cross-sectional study conducted in Jordan, information on waterpipe use among 333 school-going adolescents aged 11-18 years was obtained using the Arabic version of the WTQ. An exploratory factor analysis and correlation matrices were conducted to assess validity and reliability of the WTQ. Results: The WTQ had an internal consistency alpha of 0.73, indicating a moderate level of reliability. The scale showed multidimensionality with items loading on two factors, namely waterpipe consumption and morning smoking. Conclusion: This study reports nicotine dependence levels among school-going adolescents who identify themselves as waterpipe users using the WTQ.
Development and validation of the Pediatric Stroke Quality of Life Measure.
Fiume, Andrea; Deveber, Gabrielle; Jang, Shu-Hyun; Fuller, Colleen; Viner, Shani; Friefeld, Sharon
2018-06-01
To develop and validate a disease-specific parent proxy and child quality of life (QoL) measure for patients aged 2 to 18 years surviving cerebral sinovenous thrombosis (CSVT) and arterial ischaemic stroke (AIS). Utilizing qualitative and quantitative methods, we developed a 75-item Pediatric Stroke Quality of Life Measure (PSQLM) questionnaire. We mailed the PSQLM and a standardized generic QoL measure, Pediatric Quality of Life Inventory (PedsQL), to 353 families. Stroke type, age at stroke, and neurological outcome on the Pediatric Stroke Outcome Measure were documented. We calculated the internal consistency, validity, and reliability of the PSQLM. The response rate was 29%, yielding a sample of 101 patients (mean age 9y 9mo [SD 4.30]; 69 AIS [68.3%], 32 CSVT [31.7%]). The internal consistency of the PSQLM was high (Cronbach's α=0.94-0.97). Construct validity for the PSQLM was moderately strong (r=0.3-0.4; p<0.003) and, as expected, correlation with the PedsQL was moderate, suggesting the PSQLM operationalizes QoL distinct from the PedsQL. Test-retest reliability at 2 weeks was very good (intraclass correlation coefficient [ICC] 0.85-0.95; 95% confidence interval 0.83-0.97) and good agreement was established between parent and child report (ICC 0.63-0.76). The PSQLM demonstrates sound psychometric properties. Further research will seek to increase its clinical utility by reducing length and establishing responsiveness for descriptive and longitudinal evaluative assessment. A pediatric stroke-specific quality of life (QoL) measurement tool for assessments based on perceptions of importance and satisfaction. Moderate-to-high reliability and validity established for a new clinical scale evaluating QoL among children with stroke. Perceived QoL measured using the Pediatric Stroke Quality of Life Measure appears lower in children with neurological impairment.
Questionnaire for low back pain in the garment industry workers
Bindra, Supreet; Sinha, A. G. K.; Benjamin, A. I.
2013-01-01
Low back pain affects up to 90% of the world's population at some point in their lives. To date, no questionnaire has been designed for back pain in garment industry workers. Therefore, the objective of this study is to design a questionnaire to determine the prevalence, risk factors, impact, health care service utilization and back pain features in garment industry workers and gain preliminary experience of its use. The content validity and reliability of the questionnaire were established. Items showing acceptable internal consistency and moderate to high test-retest reliability were retained in the questionnaire. Items showing unacceptable internal consistency, low test-retest reliability or poor differentiation were reworded, redrafted and re-tested on the workers. It took 20 min to complete one interview schedule. Environmental factors, such as the absence of the garment industry owner/supervisor or co-workers at the time of the interview and interviewing during leisure hours, need to be standardized. Thus, the final questionnaire is ready for use after necessary amendments and will be used on a larger sample in the main study.
Questionnaire for low back pain in the garment industry workers.
Bindra, Supreet; Sinha, A G K; Benjamin, A I
2013-05-01
Low back pain affects up to 90% of the world's population at some point in their lives. To date, no questionnaire has been designed for back pain in garment industry workers. Therefore, the objective of this study is to design a questionnaire to determine the prevalence, risk factors, impact, health care service utilization and back pain features in garment industry workers and gain preliminary experience of its use. The content validity and reliability of the questionnaire were established. Items showing acceptable internal consistency and moderate to high test-retest reliability were retained in the questionnaire. Items showing unacceptable internal consistency, low test-retest reliability or poor differentiation were reworded, redrafted and re-tested on the workers. It took 20 min to complete one interview schedule. Environmental factors, such as the absence of the garment industry owner/supervisor or co-workers at the time of the interview and interviewing during leisure hours, need to be standardized. Thus, the final questionnaire is ready for use after necessary amendments and will be used on a larger sample in the main study.
Brief reasons for living inventory: a psychometric investigation.
Cwik, Jan Christopher; Siegmann, Paula; Willutzki, Ulrike; Nyhuis, Peter; Wolter, Marcus; Forkmann, Thomas; Glaesmer, Heide; Teismann, Tobias
2017-11-06
The present study aimed at validating the German version of the Brief Reasons for Living inventory (BRFL). Validity and reliability were established in a community (n = 339) and a clinical sample (n = 272). Convergent and discriminant validity were investigated, and confirmatory factor analyses were conducted for the complete BRFL as well as for a 10-item version excluding conditional items on child-related concerns. Furthermore, it was assessed how BRFL scores moderate the association between depression and suicide ideation. Results indicated an adequate fit of the data to the original factor structure. The total scale and the subscales of the German version of the BRFL had sufficient internal consistency, as well as good convergent and divergent validity. The BRFL demonstrated clinical utility by differentiating between participants with vs. without suicide ideation. Reasons for living proved to moderate the association between depression and suicide ideation. Results provide preliminary evidence that the BRFL may be a reliable and valid measure of adaptive reasons for living that can be used in clinic and research settings.
Tariq, Beenish; Mat, Nik Kamariah Nik
2017-10-01
The telecommunication sector of Pakistan is a significant contributor to the country's economic development. However, the sector underwent many regulatory and marketing changes in 2015, resulting in decreased cellular penetration, fewer cellular subscribers and decreased telecommunication revenue. Hence, this research paper is designed to validate the constructs used in addressing the moderating role of government regulations based on Oliver's four-stage loyalty model in the telecom sector of Pakistan. This preliminary study mainly employed a quantitative method (i.e. a survey questionnaire) consisting of a total of 72 items related to the eight constructs under study, rated on a 7-point Likert scale. The main analysis method used is the reliability test of the constructs. The results reveal that the Cronbach alpha readings were between 0.756 and 0.932, indicating internally consistent and reliable measures of the constructs used. This result enables the constructs to be included in the actual data collection without change.
Comparison of two methods of measuring physical activity in South African older adults.
Kolbe-Alexander, Tracy L; Lambert, Estelle V; Harkins, Judith Biletnikoff; Ekelund, Ulf
2006-01-01
The aim of this study was to assess the validity and reliability of the Yale Physical Activity Survey (YPAS) and the short version of the International Physical Activity Questionnaire (IPAQ) in older South African adults. The YPAS includes measures of weekly energy expenditure (EE) for housework, yard work, caregiving, exercise, and recreation. The IPAQ measures total time and EE during vigorous and moderate activity, walking, and sitting. The instruments were administered twice for test-retest reliability (men, n = 52, 68 ± 5.4 years, and women, n = 70, 66 ± 5.8 years). Data for criterion validity were obtained from accelerometers. YPAS reliability ranged from r = .44 to .80 for men and r = .59 to .99 for women (p < .0001). IPAQ reliability was lower for men (r = .29 to .76) than for women (r = .46 to .77). Criterion validity of the YPAS was .31 to .54 for men and .26 to .29 for women. The YPAS and short IPAQ had comparable results for reliability and criterion validity.
Benz, Thomas; Lehmann, Susanne; Gantenbein, Andreas R; Sandor, Peter S; Stewart, Walter F; Elfering, Achim; Aeschlimann, André G; Angst, Felix
2018-03-09
The Migraine Disability Assessment (MIDAS) is a brief questionnaire that measures headache-related disability. This study aimed to translate and cross-culturally adapt the original English version of the MIDAS to German and to test its reliability. The standardized translation process followed international guidelines. The pre-final version was tested for clarity and comprehensibility by 34 headache sufferers. Test-retest reliability of the final version was quantified by 36 headache patients completing the MIDAS twice with an interval of 48 h. Reliability was determined by intraclass correlation coefficients and internal consistency by Cronbach's α. All steps of the translation process were followed, documented and approved by the developer of the MIDAS. The expert committee discussed in detail the complex phrasing of the questions that refer to one another, especially the exclusion of headache days from one item to the next. The German version contains more active verb sentences and prefers the perfect to the imperfect tense. The intraclass correlation coefficients of the MIDAS scales ranged from 0.884 to 0.994, and the coefficient for the MIDAS total score was 0.991 (95% CI: 0.982-0.995). Cronbach's α for the MIDAS as a whole was 0.69 at test and 0.67 at retest. The translation process was challenged by the comprehensibility of the questionnaire. The German version of the MIDAS is a highly reliable instrument for assessing headache-related disability with moderate internal consistency. Provided validity testing of the German MIDAS is successful, it can be recommended for use in clinical practice as well as in research.
Extensive validation of the pain disability index in 3 groups of patients with musculoskeletal pain.
Soer, Remko; Köke, Albère J A; Vroomen, Patrick C A J; Stegeman, Patrick; Smeets, Rob J E M; Coppes, Maarten H; Reneman, Michiel F
2013-04-20
A cross-sectional study design was performed. To validate the pain disability index (PDI) extensively in 3 groups of patients with musculoskeletal pain. The PDI is a widely used and studied instrument for disability related to various pain syndromes, although there is conflicting evidence concerning factor structure, test-retest reliability, and missing items. Additionally, an official translation of the Dutch language version has never been performed. For reliability, internal consistency, factor structure, test-retest reliability and measurement error were calculated. Validity was tested with hypothesized correlations with pain intensity, kinesiophobia, Rand-36 subscales, Depression, Roland-Morris Disability Questionnaire, Quality of Life, and Work Status. Structural validity was tested with independent backward translation and approval from the original authors. One hundred seventy-eight patients with acute back pain, 425 patients with chronic low back pain and 365 with widespread pain were included. Internal consistency of the PDI was good. One factor was identified with factor analyses. Test-retest reliability was good for the PDI (intraclass correlation coefficient, 0.76). Standard error of measurement was 6.5 points and smallest detectable change was 17.9 points. The PDI showed little correlation with kinesiophobia and depression, fair correlations with pain intensity, work status, and vitality, and moderate correlations with the Rand-36 subscales and the Roland-Morris Disability Questionnaire. The PDI-Dutch language version is internally consistent as a 1-factor structure and shows test-retest reliability. Missing responses appear frequent for the sexual and professional items. Using the PDI as a 2-factor questionnaire has no additional value and is unreliable.
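The smallest detectable change (SDC) reported above is commonly derived from the standard error of measurement (SEM) as SDC = 1.96 × √2 × SEM. The abstract does not state which formula the authors applied, so the sketch below assumes this common convention; the function name `smallest_detectable_change` is illustrative.

```python
from math import sqrt


def smallest_detectable_change(sem, z=1.96):
    """SDC at the individual level under the common convention
    SDC = z * sqrt(2) * SEM (z = 1.96 for a 95% confidence level)."""
    if sem < 0:
        raise ValueError("SEM must be non-negative")
    return z * sqrt(2) * sem


# SEM of 6.5 points, as reported in the abstract.
sdc = smallest_detectable_change(6.5)
```

With the rounded SEM of 6.5 points this convention gives roughly 18.0 points, close to the reported 17.9; the small gap is consistent with the authors having used an unrounded SEM, although the abstract does not confirm this.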
Test-retest reliability of the Military Pre-training Questionnaire.
Robinson, M; Stokes, K; Bilzon, J; Standage, M; Brown, P; Thompson, D
2010-09-01
Musculoskeletal injuries are a significant cause of morbidity during military training. A brief, inexpensive and user-friendly tool that demonstrates reliability and validity is warranted to effectively monitor the relationship between multiple predictor variables and injury incidence in military populations. To examine the test-retest reliability of the Military Pre-training Questionnaire (MPQ), designed specifically to assess risk factors for injury among military trainees across five domains (physical activity, injury history, diet, alcohol and smoking). Analyses were based on a convenience sample of 58 male British Army trainees. Kappa (kappa), weighted kappa (kappa(w)) and intraclass correlation coefficients (ICC) were used to evaluate the 2-week test-retest reliability of the MPQ. For index measures constituting the assessment of a given construct, internal consistency was assessed by Cronbach's alpha (alpha) coefficients. Reliability of individual items ranged from poor to almost perfect (kappa range = 0.45-0.86; kappa(w) range = 0.11-0.91; ICC range = 0.34-0.86) with most items demonstrating moderate reliability. Overall scores related to physical activity, diet, alcohol and smoking constructs were reliable between both administrations (ICC = 0.63-0.85). Support for the internal consistency of the incorporated alcohol (alpha = 0.78) and cigarette (alpha = 0.75) scales was also provided. The MPQ is a reliable self-report instrument for assessing multiple injury-related risk factors during initial military training. Further assessment of the psychometric properties of the MPQ (e.g. different types of validity) with military populations/samples will support its interpretation and use in future surveillance and epidemiological studies.
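Item-level test-retest agreement of the kind reported above is typically summarized with Cohen's kappa, which corrects observed agreement for the agreement expected by chance. A minimal sketch of the unweighted statistic follows; the function name and the example responses are hypothetical and are not taken from the MPQ data.

```python
from collections import Counter


def cohen_kappa(first, second):
    """Unweighted Cohen's kappa for two sets of categorical responses
    (e.g. the same item answered at test and at retest)."""
    if len(first) != len(second) or not first:
        raise ValueError("need two non-empty sequences of equal length")
    n = len(first)
    # Proportion of respondents giving the same answer both times.
    observed = sum(a == b for a, b in zip(first, second)) / n
    # Agreement expected by chance from the marginal category frequencies.
    counts_first, counts_second = Counter(first), Counter(second)
    expected = sum(counts_first[c] * counts_second[c] for c in counts_first) / (n * n)
    if expected == 1.0:
        return float("nan")  # kappa is undefined when only one category is used
    return (observed - expected) / (1.0 - expected)


# Hypothetical yes/no answers from four trainees on two occasions.
test_answers = ["yes", "yes", "no", "no"]
retest_answers = ["yes", "no", "no", "no"]
print(cohen_kappa(test_answers, retest_answers))
```

Weighted kappa, used for ordinal items, additionally gives partial credit to near-agreement and is not covered by this sketch.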
Kesselheim, Jennifer C; Agrawal, Anurag K; Bhatia, Nita; Cronin, Angel; Jubran, Rima; Kent, Paul; Kersun, Leslie; Rao, Amulya Nageswara; Rose, Melissa; Savelli, Stephanie; Sharma, Mukta; Shereck, Evan; Twist, Clare J; Wang, Michael
2017-05-01
Educators in pediatric hematology-oncology lack rigorously developed instruments to assess fellows' skills in humanism and professionalism. We developed a novel 15-item self-assessment instrument to address this gap in fellowship training. Fellows (N = 122) were asked to assess their skills in five domains: balancing competing demands of fellowship, caring for the dying patient, confronting depression and burnout, responding to challenging relationships with patients, and practicing humanistic medicine. An expert focus group predefined threshold scores on the instrument that could be used as a cutoff to identify fellows who need support. Reliability and feasibility were assessed and concurrent validity was measured using three established instruments: Maslach Burnout Inventory (MBI), Flourishing Scale (FS), and Jefferson Scale of Physician Empathy (JSPE). For 90 participating fellows (74%), the self-assessment proved feasible to administer and had high internal consistency reliability (Cronbach's α = 0.81). It was moderately correlated with the FS and MBI (Pearson's r = 0.41 and 0.4, respectively) and weakly correlated with the JSPE (Pearson's r = 0.15). Twenty-eight fellows (31%) were identified as needing support. The self-assessment had a sensitivity of 50% (95% confidence interval [CI]: 31-69) and a specificity of 77% (95% CI: 65-87) for identifying fellows who scored poorly on at least one of the three established scales. We developed a novel assessment instrument for use in pediatric fellowship training. The new scale proved feasible and demonstrated internal consistency reliability. Its moderate correlation with other established instruments shows that the novel assessment instrument provides unique, nonredundant information as compared to existing scales. © 2016 Wiley Periodicals, Inc.
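Sensitivity and specificity such as those reported above come from a 2×2 table of the new instrument's classification against a reference classification. The sketch below uses hypothetical counts, chosen only to reproduce proportions of 50% and 77%, and a Wilson score interval; the abstract does not say which interval method the authors used, so the interval choice is an assumption.

```python
import math


def sensitivity_specificity(tp: int, fn: int, tn: int, fp: int):
    """Return (sensitivity, specificity) from 2x2 table counts."""
    return tp / (tp + fn), tn / (tn + fp)


def wilson_interval(successes: int, n: int, z: float = 1.96):
    """Wilson score confidence interval for a binomial proportion."""
    p = successes / n
    denom = 1.0 + z * z / n
    centre = (p + z * z / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return centre - half, centre + half


# Hypothetical counts, not the study's data.
sens, spec = sensitivity_specificity(tp=10, fn=10, tn=77, fp=23)
low, high = wilson_interval(10, 20)
print(f"sensitivity {sens:.2f} (95% CI {low:.2f}-{high:.2f}), specificity {spec:.2f}")
```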
Brosseau, Lucie; Laroche, Chantal; Guitard, Paulette; King, Judy; Poitras, Stéphane; Casimiro, Lynn; Barette, Julie Alexandra; Cardinal, Dominique; Cavallo, Sabrina; Laferrière, Lucie; Martini, Rose; Champoux, Nicholas; Taverne, Jennifer; Paquette, Chanyque; Tremblay, Sébastien; Sutton, Ann; Galipeau, Roseline; Tourigny, Jocelyne; Toupin-April, Karine; Loew, Laurianne; Demers, Catrine; Sauvé-Schenk, Katrine; Paquet, Nicole; Savard, Jacinthe; Lagacé, Josée; Pharand, Denyse; Vaillancourt, Véronique
2017-01-01
Objectives: The primary objective was to produce a French-Canadian translation of AMSTAR (a measurement tool to assess systematic reviews) and to examine the validity of the translation's contents. The secondary and tertiary objectives were to assess the inter-rater reliability and factorial construct validity of this French-Canadian version of AMSTAR. Methods: A modified approach to Vallerand's methodology (1989) for cross-cultural validation was used. First, a parallel back-translation of AMSTAR was performed, by both professionals and future professionals. Next, a first committee of experts (P1) examined the translations to create a first draft of the French-Canadian version of the AMSTAR tool. This draft was then evaluated and modified by a second committee of experts (P2). Following that, 18 future professionals (master's students in physiotherapy) rated this second draft of the instrument for clarity using a seven-point scale (1: very clear; 7: very ambiguous). Lastly, the principal co-investigators reviewed the problematic elements and proposed final changes. Four independent raters used this French-Canadian version of AMSTAR to assess 20 systematic reviews that were published in French after the year 2000. An intraclass correlation coefficient (ICC) and kappa coefficient were calculated to measure the tool's inter-rater reliability. A Cronbach's alpha coefficient was also calculated to measure internal consistency. In addition, factor analysis was used to evaluate construct validity in order to determine the number of dimensions. Results: The statements on the final version of the AMSTAR tool received an average ambiguity rating of between 1.0 and 1.4. No statement received an average rating above 1.4, which indicates a high level of clarity. Inter-rater reliability (n = 4) for the instrument's total score was moderate, with an intraclass correlation coefficient of 0.61 (95% confidence interval [CI]: 0.29, 0.97).
Inter-rater reliability for 82% of the individual items was good, according to the kappa values obtained. Internal consistency was excellent, with a Cronbach's alpha coefficient of 0.91 (95% CI: 0.83, 0.99). The French-Canadian version of AMSTAR is a unidimensional tool, as confirmed by factor analysis and communality values greater than 0.30. Conclusion: A valid French-Canadian version of AMSTAR was created using this rigorous five-step process. This version is unidimensional, with moderate inter-rater reliability for the elements overall, and with excellent internal consistency. This tool could be valuable to French-Canadian professionals and researchers, and could also be of interest to the international Francophone community.
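Cronbach's alpha, reported above as the measure of internal consistency, compares the sum of the item variances with the variance of the total score. The sketch below is a minimal illustration with hypothetical ratings; the function name and the data are assumptions, not values from the study.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of rows (one row per rated object or
    respondent, one column per item), using sample variances."""
    n_items = len(scores[0])
    if len(scores) < 2 or n_items < 2:
        raise ValueError("need at least two rows and two items")
    if any(len(row) != n_items for row in scores):
        raise ValueError("all rows must have the same number of items")
    # Variance of each item (column) across rows.
    item_variances = [variance([row[j] for row in scores]) for j in range(n_items)]
    # Variance of the summed score of each row.
    total_variance = variance([sum(row) for row in scores])
    return n_items / (n_items - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical scores: four rows, three items.
ratings = [[2, 3, 3], [4, 4, 5], [1, 2, 2], [5, 5, 4]]
print(round(cronbach_alpha(ratings), 2))
```

Alpha approaches 1 when the items rise and fall together across rows, and the calculation is undefined when every total score is identical.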
Lim, Chun Yi; Law, Mary; Khetani, Mary; Rosenbaum, Peter; Pollock, Nancy
2018-08-01
To estimate the psychometric properties of a culturally adapted version of the Young Children's Participation and Environment Measure (YC-PEM) for use among Singaporean families. This is a prospective cohort study. Caregivers of 151 Singaporean children with (n = 83) and without (n = 68) developmental disabilities, between 0 and 7 years, completed the YC-PEM (Singapore) questionnaire with 3 participation scales (frequency, involvement, and change desired) and 1 environment scale for three settings: home, childcare/preschool, and community. Setting-specific estimates of internal consistency, test-retest reliability, and construct validity were obtained. Internal consistency estimates varied from .59 to .92 for the participation scales and .73 to .79 for the environment scale. Test-retest reliability estimates from the YC-PEM conducted on two occasions, 2-3 weeks apart, varied from .39 to .89 for the participation scales and from .65 to .80 for the environment scale. Moderate to large differences were found in participation and perceived environmental support between children with and without a disability. YC-PEM (Singapore) scales have adequate psychometric properties except for low internal consistency for the childcare/preschool participation frequency scale and low test-retest reliability for home participation frequency scale. The YC-PEM (Singapore) may be used for population-level studies involving young children with and without developmental disabilities.
Strober, Bruce; Zhao, Yang; Tran, Mary Helen; Gnanasakthy, Ari; Nyirady, Judit; Papavassilis, Charis; Nelson, Lauren M; McLeod, Lori D; Mordin, Margaret; Gottlieb, Alice B; Elewski, Boni E; Lebwohl, Mark
2016-03-01
This analysis aimed to confirm the reliability, validity, and responsiveness of the Psoriasis Symptom Diary (PSD) using data from two Phase III studies in patients with moderate to severe chronic plaque psoriasis. Data from two randomized, double-blind, double-dummy, placebo-controlled, multicenter Phase III studies (n = 820) assessing the efficacy and safety of secukinumab were used. The PSD (24-h recall; 0-10 numeric rating scale) was electronically administered each evening. Test-retest reliability was determined using intraclass correlations. Construct validity hypotheses were evaluated via correlations with the Psoriasis Area and Severity Index (PASI), Investigator's Global Assessment (IGA), Dermatology Life Quality Index (DLQI), EuroQoL 5-Dimension Health Status Questionnaire, and Patient Global Impression of Change (PGIC). Discriminating ability and responsiveness were evaluated by estimating mean differences and effect sizes between known groups (using the PASI and IGA). Phase II-derived, anchor-based PGIC thresholds and cumulative distribution function (CDF) plots described meaningful change. Items on the PSD yielded high intraclass coefficients (>0.90). Correlations were in the anticipated direction and by week 12 were moderate to strong (0.41-0.73) in magnitude, demonstrating construct validity. Average PSD item scores differed predictably and significantly between known groups. Responsiveness effect size estimates were moderate to large (0.6-1.5), and CDF plots showed the percentage of responders to be consistently higher in treatment than in placebo arms across the range of change in PSD scores. The PSD is reliable, valid, and responsive, and represents a valid tool to enhance treatment decisions in patients with moderate to severe plaque psoriasis. © 2015 The International Society of Dermatology.
Kim, Ho-Joong; Ruscheweyh, Ruth; Yeo, Ji-Hyun; Cho, Hyeon-Guk; Yi, Je-Min; Chang, Bong-Soon; Lee, Choon-Ki; Yeom, Jin S
2014-11-01
The purpose of this study was to translate the Pain Sensitivity Questionnaire (PSQ) into the Korean language, perform a cross-cultural adaptation of the PSQ, and validate the Korean version of the PSQ in patients with degenerative spinal disease. The PSQ was translated forward and backward, cross-culturally adapted by 2 independent translators, and approved by an expert committee. The final Korean version of the PSQ was tested on 72 patients with degenerative spinal disease. Test-retest reliability was evaluated for 60 patients (83%) who completed the second assessment at an interval of 4 weeks. The mean PSQ-minor, PSQ-moderate, and PSQ-total (standard deviation [SD]) were 5.40 (2.02), 6.46 (1.98), and 5.93 (1.93), respectively. The PSQ-total, PSQ-minor, and PSQ-moderate of the Korean version showed very good internal consistency, with Cronbach's α of 0.926, 0.869, and 0.877, respectively. For convergent validity, the PSQ scores of the Korean version showed significant correlations with the pain catastrophizing scale (PCS) (r = 0.377, P = 0.002; r = 0.365, P = 0.003; r = 0.362, P = 0.003 for PSQ-total, PSQ-minor, and PSQ-moderate of the Korean version, respectively). For test-retest reliability, the intraclass correlation coefficients were 0.782 for PSQ-total, 0.752 for PSQ-minor, and 0.793 for PSQ-moderate. In conclusion, the validated Korean version of the PSQ is a transculturally equivalent, reliable, and valid tool to assess individual pain sensitivity. © 2013 World Institute of Pain.
Yao, Min; Yang, Long; Cao, Zuo-Yuan; Cheng, Shao-Dan; Tian, Shuang-Lin; Sun, Yue-Li; Wang, Jing; Xu, Bao-Ping; Hu, Xiao-Chun; Wang, Yong-Jun; Zhang, Ying; Cui, Xue-Jun
2017-09-18
Shoulder pain is a common musculoskeletal disorder in the Chinese population, which affects more than 1.3 billion individuals. To the best of our knowledge, there has been no available Chinese-language version of measurements of shoulder pain and disability so far. Moreover, the Constant-Murley score (CMS) questionnaire is a universally recognized patient-reported questionnaire for clinical practice and research. The present study was designed to evaluate a Chinese translational version of CMS and subsequently assess its reliability and validity. The Chinese translational version of CMS was formulated by means of forward-backward translation. A final review was carried out by an expert committee, followed by a test of the pre-final version. The reliability and validity of the Chinese translational version of CMS were then assessed using internal consistency, construct validity, factor analysis, reliability, and floor and ceiling effects. Specifically, the reliability was assessed by testing the internal consistency (Cronbach's α) and test-retest reliability (intraclass correlation coefficient [ICC]), while the construct validity was evaluated via comparison of the Chinese translational version of CMS with the visual analog scale (VAS) score and the 36-Item Short Form Health Survey (SF-36, Spearman correlation). The questionnaire was verified to be acceptable after distribution among 120 subjects with unilateral shoulder pain. Factor analysis revealed a two-factor and 10-item solution. Moreover, the assessment results indicated that the Chinese translational version of CMS questionnaire showed good internal consistency (Cronbach's α = 0.739) and test-retest reliability (ICC = 0.827). In addition, the Chinese translational version of CMS was moderately correlated with VAS score (r = 0.497) and SF-36 (r = 0.135). No obvious floor and ceiling effects were observed in the Chinese translational version of CMS questionnaire.
The Chinese translational version of CMS exhibited good reliability, is relatively acceptable, and is likely to be widely used in this population.
Developing the Person-Environment Apathy Rating for persons with dementia.
Jao, Ying-Ling; Algase, Donna L; Specht, Janet K; Williams, Kristine
2016-08-01
To develop the Person-Environment Apathy Rating (PEAR) scale that measures environmental stimulation and apathy in persons with dementia and to evaluate its psychometrics. The PEAR scale consists of the PEAR-Environment and PEAR-Apathy subscales. The items were developed via literature review, field testing, expert review, and pilot testing. The construct validity and reliability were examined through video observation. The parent study enrolled 185 institutionalized residents with dementia. For this study, 96 videos were selected from 24 participants. The PEAR-Environment subscale was validated using the Ambiance Scale and the Crowding Index. The PEAR-Apathy subscale was validated using the Neuropsychiatric Inventory (NPI)-Apathy, Passivity in Dementia Scale (PDS), and NPI-Depression. The PEAR-Environment and PEAR-Apathy subscales each consist of six items rated on a 1-4 scale. For validity, the Crowding Index slightly, yet significantly, correlated with the PEAR-Environment subscale total score and three of the individual scores. Ambiance Scale scores, both engaging and soothing, did not correlate with the PEAR-Environment subscale. The PEAR-Apathy highly correlated with the PDS and NPI-Apathy and moderately correlated with the NPI-Depression, suggesting good convergent validity and moderate discriminant validity. For reliability, both environment and apathy subscales demonstrated excellent internal consistency. Although facial expression and eye contact showed moderate inter-rater reliability, all other items showed good to excellent inter-rater and intra-rater reliability. This study has successfully developed the PEAR scale and established its psychometrics based on the compatible scales available. The PEAR scale is the first scale that concurrently assesses apathy and environmental stimulation, and is recommended for use in persons with dementia.
Badia, X; Mascaró, J M; Lozano, R
1999-10-01
The aim of this study was to assess the feasibility, validity, reliability and sensitivity to change of a Spanish version of the Dermatology Life Quality Index (DLQI) in patients with mild to moderate eczema and psoriasis who were treated with topical corticosteroids. The final study sample comprised 237 patients (48% eczema). Discriminant validity was tested by comparing patients' scores with those of a random sample of the general population (n = 100), and convergent validity by analysing correlations between DLQI scores, measures of clinical severity, and domain scores on the Nottingham Health Profile (NHP). Internal consistency and test-retest reliability were tested in clinically stable patients (n = 94), and responsiveness in a clinically unstable group (n = 143) initiating treatment with topical corticosteroids. Patient scores were significantly higher than general population scores (4.3 vs. 0.27, P < 0.001). Correlations with NHP domains ranged from 0.12 to 0.32, and there was significant correlation with clinical measures (r = 0.26, P < 0.001). Reliability was good (Cronbach's alpha = 0.83; intraclass correlation coefficient = 0.88), and the instrument proved responsive to change (effect size for the total group of de novo patients = 0.70), though the great majority of changes occurred in items 1 and 2. The NHP Emotional Reactions and Mobility domains were more responsive than some DLQI domains. In clinical trials of treatments for mild to moderate eczema and psoriasis, it is likely that only items 1 and 2 of the DLQI will be needed, and it is probably advisable to include generic instruments alongside the DLQI.
Reliability and validity of the range of motion scale (ROMS) in patients with abnormal postures.
van Rooijen, Diana E; Lalli, Stefania; Marinus, Johan; Maihöfner, Christian; McCabe, Candida S; Munts, Alex G; van der Plas, Anton A; Tijssen, Marina A J; van de Warrenburg, Bart P; Albanese, Alberto; van Hilten, Jacobus J
2015-03-01
Sustained abnormal postures (i.e., fixed dystonia) are the most frequently reported motor abnormalities in complex regional pain syndrome (CRPS), but these symptoms may also develop after peripheral trauma without CRPS. Currently, there is no valid and reliable measurement instrument available to measure the severity and distribution of these postures. The range of motion scale (ROMS) was therefore developed to assess the severity based on the possible active range of motion of all joints (arms, legs, trunk, and neck), and the present study evaluates its reliability and validity. Inter- and intra-rater reliability of the ROMS was determined in 16 patients with abnormal sustained postures, who were videotaped following a standard video protocol in a university hospital. The recordings were rated by a panel of international experts. In addition, 30 patients were clinically tested with both the Burke-Fahn-Marsden (BFM) scale as well as the ROMS to assess construct validity. Inter-rater reliability for total ROMS scores showed an intra-class correlation coefficient (ICC) of 0.85. The majority of the scores for the separate joints (13 out of 18) demonstrated an almost perfect agreement with ICCs ranging from 0.81 to 0.94; of the other items, one showed fair, one moderate, and three substantial agreement. The ICCs for the intra-rater reliability ranged from moderate to almost perfect (0.68-0.98). Spearman's correlation coefficients between corresponding body areas as measured with the ROMS or BFM were all above 0.82. The ROMS is a reliable and valid instrument to evaluate the severity and distribution of sustained abnormal postures. Wiley Periodicals, Inc.
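Inter-rater ICCs such as those reported above can be computed from a subjects-by-raters table through a two-way analysis of variance. The abstract does not say which ICC form was used, so the sketch below assumes ICC(2,1) (two-way random effects, absolute agreement, single rater); the function name and the ratings are illustrative only.

```python
def icc_2_1(ratings):
    """ICC(2,1) from a list of rows: one row per subject, one score per rater."""
    n = len(ratings)      # number of subjects
    k = len(ratings[0])   # number of raters
    if n < 2 or k < 2 or any(len(row) != k for row in ratings):
        raise ValueError("need a complete table with >= 2 subjects and >= 2 raters")
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]
    # Sums of squares for subjects, raters, and residual error.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_error = ss_total - ss_rows - ss_cols
    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


# Illustrative ratings: six subjects scored by four raters.
scores = [
    [9, 2, 5, 8],
    [6, 1, 3, 2],
    [8, 4, 6, 8],
    [7, 1, 2, 6],
    [10, 5, 6, 9],
    [6, 2, 4, 7],
]
print(round(icc_2_1(scores), 2))  # about 0.29
```

In this illustrative table the raters rank subjects similarly but differ systematically in level, which is why an absolute-agreement ICC comes out low; a consistency-type ICC would be higher for the same data.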
The Quality of Written Feedback by Attendings of Internal Medicine Residents.
Jackson, Jeffrey L; Kay, Cynthia; Jackson, Wilkins C; Frank, Michael
2015-07-01
Attending evaluations are commonly used to evaluate residents. This retrospective study evaluated the quality of written feedback given to internal medicine residents. Participants were internal medicine residents and faculty at the Medical College of Wisconsin from 2004 to 2012. From monthly evaluations of residents by attendings, a randomly selected sample of 500 written comments by attendings was qualitatively coded and rated as high-, moderate-, or low-quality feedback by two independent coders with good inter-rater reliability (kappa: 0.94). Small group exercises with residents and attendings also coded the utterances as high, moderate, or low quality and developed criteria for this categorization. In-service examination scores were correlated with written feedback. There were 228 internal medicine residents who had 6,603 evaluations by 334 attendings. Among 500 randomly selected written comments, there were 2,056 unique utterances: 29% were coded as nonspecific statements, 20% were comments about resident personality, 16% about patient care, 14% interpersonal communication, 7% medical knowledge, 6% professionalism, and 4% each on practice-based learning and systems-based practice. Based on criteria developed by group exercises, the majority of written comments were rated as moderate quality (65%); 22% were rated as high quality and 13% as low quality. Attendings who provided high-quality feedback rated residents significantly lower in all six of the Accreditation Council for Graduate Medical Education (ACGME) competencies (p < 0.0005 for all), and had a greater range of scores. Negative comments on medical knowledge were associated with lower in-service examination scores. Most attending written evaluation was of moderate or low quality. Attendings who provided high-quality feedback appeared to be more discriminating, providing significantly lower ratings of residents in all six ACGME core competencies, and across a greater range.
Attendings' negative written comments on medical knowledge correlated with lower in-service training scores.
Décary, Simon; Ouellet, Philippe; Vendittoli, Pascal-André; Desmeules, François
2016-12-01
Clinicians often rely on physical examination tests to guide them in the diagnostic process of knee disorders. However, reliability of these tests is often overlooked and may influence the consistency of results and overall diagnostic validity. Therefore, the objective of this study was to systematically review evidence on the reliability of physical examination tests for the diagnosis of knee disorders. A structured literature search was conducted in databases up to January 2016. Included studies needed to report reliability measures of at least one physical test for any knee disorder. Methodological quality was evaluated using the QAREL checklist. A qualitative synthesis of the evidence was performed. Thirty-three studies were included with a mean QAREL score of 5.5 ± 0.5. Based on low to moderate quality evidence, the Thessaly test for meniscal injuries reached moderate inter-rater reliability (k = 0.54). Based on moderate to excellent quality evidence, the Lachman for anterior cruciate ligament injuries reached moderate to excellent inter-rater reliability (k = 0.42 to 0.81). Based on low to moderate quality evidence, the Tibiofemoral Crepitus, Joint Line and Patellofemoral Pain/Tenderness, Bony Enlargement and Joint Pain on Movement tests for knee osteoarthritis reached fair to excellent inter-rater reliability (k = 0.29 to 0.93). Based on low to moderate quality evidence, the Lateral Glide, Lateral Tilt, Lateral Pull and Quality of Movement tests for patellofemoral pain reached moderate to good inter-rater reliability (k = 0.49 to 0.73). Many physical tests appear to reach good inter-rater reliability, but this is based on low-quality and conflicting evidence. High-quality research is required to evaluate the reliability of knee physical examination tests. Copyright © 2016 Elsevier Ltd. All rights reserved.
Baumeister, Sebastian E; Ricci, Cristian; Kohler, Simone; Fischer, Beate; Töpfer, Christine; Finger, Jonas D; Leitzmann, Michael F
2016-05-23
The current study examined the reliability and validity of the European Health Interview Survey-Physical Activity Questionnaire (EHIS-PAQ), a novel questionnaire for the surveillance of physical activity (PA) during work, transportation, leisure time, sports, health-enhancing and muscle-strengthening activities over a typical week. Reliability was assessed by administering the 8-item questionnaire twice to a population-based sample of 123 participants aged 15-79 years at a 30-day interval. Concurrent (inter-method) validity was examined in 140 participants by comparisons with self-report measures (International Physical Activity Questionnaire-Long Form [IPAQ-LF], 7-day Physical Activity Record [PAR]) and objective criterion measures (GT3X+ accelerometer, physical work capacity at 75% [PWC(75%)] from submaximal cycle ergometer test, hand grip strength). The EHIS-PAQ showed acceptable reliability, with a median intraclass correlation coefficient across PA domains of 0.55 (range 0.43-0.73). Compared to the GT3X+ (counts/minutes/day), the EHIS-PAQ underestimated moderate-to-vigorous PA (median difference -11.7, p-value = 0.054). Spearman correlation coefficients (ρ) for validity were moderate-to-strong (ρ's > 0.41) for work-related PA (IPAQ = 0.64, GT3X+ = 0.43, grip strength = 0.48), transportation-related PA (IPAQ = 0.62, GT3X+ = 0.43), walking (IPAQ = 0.58), and health-enhancing PA (IPAQ = 0.58, PAR = 0.64, GT3X+ = 0.44, PWC(75%) = 0.48), and fair-to-poor (ρ's < 0.41) for moderate-to-vigorous aerobic recreational and muscle-strengthening PA. The EHIS-PAQ showed good evidence for reliability and validity for the measurement of PA levels at work, during transportation and health-enhancing PA.
Evaluation of the neighborhood environment walkability scale in Nigeria.
Oyeyemi, Adewale L; Sallis, James F; Deforche, Benedicte; Oyeyemi, Adetoyeje Y; De Bourdeaudhuij, Ilse; Van Dyck, Delfien
2013-03-21
The development of reliable and culturally sensitive measures of attributes of the built and social environment is necessary for accurate analysis of environmental correlates of physical activity in low-income countries, which can inform international evidence-based policies and interventions in the worldwide prevention of physical inactivity epidemics. This study systematically adapted the Neighborhood Environment Walkability Scale (NEWS) for Nigeria and evaluated aspects of reliability and validity of the adapted version among Nigerian adults. The adaptation of the NEWS was conducted by African and international experts, and final items were selected for NEWS-Nigeria after a cross-validation of the confirmatory factor analysis structure of the original NEWS. Participants (N = 386; female = 47.2%) from two cities in Nigeria completed the adapted NEWS surveys regarding perceived residential density, land use mix - diversity, land use mix - access, street connectivity, infrastructure and safety for walking and cycling, aesthetics, traffic safety, and safety from crime. Self-reported activity for leisure, walking for different purposes, and overall physical activity were assessed with the validated International Physical Activity Questionnaire (long version). The adapted NEWS subscales had moderate to high test-retest reliability (ICC range 0.59-0.91). Construct validity was good, with residents of high-walkable neighborhoods reporting significantly higher residential density, more land use mix diversity, higher street connectivity, more traffic safety and more safety from crime, but lower infrastructure and safety for walking/cycling and aesthetics than residents of low-walkable neighborhoods. Concurrent validity correlations were low to moderate (r = 0.10-0.31) with residential density, land use mix diversity, and traffic safety significantly associated with most physical activity outcomes.
The NEWS-Nigeria demonstrated acceptable measurement properties among Nigerian adults and may be useful for evaluation of the built environment in Nigeria. Further adaptation and evaluation in other African countries is needed to create a version that could be used throughout the African region.
Development and Validation of the Smartphone Addiction Inventory (SPAI)
Lin, Yu-Hsuan; Chang, Li-Ren; Lee, Yang-Han; Tseng, Hsien-Wei; Kuo, Terry B. J.; Chen, Sue-Huei
2014-01-01
Objective: The aim of this study was to develop a self-administered scale based on the special features of smartphones. The reliability and validity of the Smartphone Addiction Inventory (SPAI) was demonstrated. Methods: A total of 283 participants were recruited from Dec. 2012 to Jul. 2013 to complete a set of questionnaires, including a 26-item SPAI modified from the Chinese Internet Addiction Scale and phantom vibration and ringing syndrome questionnaire. There were 260 males and 23 females, aged 22.9±2.0 years. Exploratory factor analysis, internal-consistency test, test-retest, and correlation analysis were conducted to verify the reliability and validity of the SPAI. Correlations between each subscale and phantom vibration and ringing were also explored. Results: Exploratory factor analysis yielded four factors: compulsive behavior, functional impairment, withdrawal and tolerance. Test–retest reliabilities (intraclass correlations = 0.74–0.91) and internal consistency (Cronbach's α = 0.94) were all satisfactory. The four subscales had moderate to high correlations (0.56–0.78), but had no or very low correlation to phantom vibration/ringing syndrome. Conclusion: This study provides evidence that the SPAI is a valid and reliable, self-administered screening tool to investigate smartphone addiction. Phantom vibration and ringing might be independent entities of smartphone addiction. PMID:24896252
Suen, Yi-Nam; Cerin, Ester; Barnett, Anthony; Huang, Wendy Y J; Mellecker, Robin R
2017-09-01
Valid instruments of parenting practices related to children's physical activity (PA) are essential to understand how parents affect preschoolers' PA. This study developed and validated a questionnaire of PA-related parenting practices for Chinese-speaking parents of preschoolers in Hong Kong. Parents (n = 394) completed a questionnaire developed using findings from formative qualitative research and literature searches. Test-retest reliability was determined on a subsample (n = 61). Factorial validity was assessed using confirmatory factor analysis. Subscale internal consistency was determined. The scale of parenting practices encouraging PA comprised 2 latent factors: Modeling, structure and participatory engagement in PA (23 items), and Provision of appropriate places for child's PA (4 items). The scale of parenting practices discouraging PA encompassed 4 latent factors: Safety concern/overprotection (6 items), Psychological/behavioral control (5 items), Promoting inactivity (4 items), and Promoting screen time (2 items). Test-retest reliabilities were moderate to excellent (0.58 to 0.82), and internal subscale reliabilities were acceptable (0.63 to 0.89). We developed a theory-based questionnaire for assessing PA-related parenting practices among Chinese-speaking parents of Hong Kong preschoolers. While some items were context and culture specific, many were similar to those previously found in other populations, indicating a degree of construct generalizability across cultures.
Adaptation and validation of the Spanish version of the graded chronic pain scale.
Ferrer-Peña, Raúl; Gil-Martínez, Alfonso; Pardo-Montero, Joaquín; Jiménez-Penick, Virginia; Gallego-Izquierdo, Tomás; La Touche, Roy
2016-01-01
To adapt the Graded Chronic Pain Scale for use in Primary Care patients in Spain, and to assess its psychometric properties. Clinical measures observational study investigating the severity of chronic pain. The methodology included a process of translation and back-translation following the international guidelines. Study participants were 75 patients who had experienced lower back pain for more than six months and were referred to Primary Care physiotherapy units. Internal consistency, construct validity, test-retest reliability, floor and ceiling effects, and answering capacity were analysed. The Spanish version of the Graded Chronic Pain Scale had a high internal consistency, with a Cronbach's alpha of 0.87 and intraclass correlation coefficient of 0.81. Regarding construct validity, two factors explained 72.37% of the variance. Convergent validity showed a moderate positive correlation with the Visual Analogue Scale, the activity avoidance subscale of the Tampa Scale of Kinesiophobia, the Pain Catastrophizing Scale, the Roland-Morris Low Back Pain and Disability Questionnaire, and the Fear-Avoidance Beliefs Questionnaire. A moderate negative correlation was identified with the Chronic Pain Self-Efficacy Scale. The mean time of questionnaire administration was 2 minutes and 28 seconds. The Spanish version of the Graded Chronic Pain Scale appears to be a valid, reliable, and useful tool for measuring chronic pain at an early stage in Primary Care settings in Spain. Copyright © 2015 Elsevier España, S.L.U. and Sociedad Española de Reumatología y Colegio Mexicano de Reumatología. All rights reserved.
Pelegrino, Flávia M; Dantas, Rosana A S; Corbi, Inaiara S A; da Silva Carvalho, Ariana R; Schmidt, André; Pazin Filho, Antônio
2012-09-01
The aim of this study was to evaluate the internal reliability and validity of the Brazilian-Portuguese version of the Duke Anticoagulation Satisfaction Scale (DASS) among cardiovascular patients. Oral anticoagulation is widely used to prevent and treat thromboembolic events in several conditions, especially in cardiovascular diseases; however, this therapy can induce dissatisfaction and reduce the quality of life. Methodological and cross-sectional research design. The cultural adaptation of the DASS included translation and back-translation, discussions with healthcare professionals and patients to ensure conceptual equivalence, semantic evaluation and instrument pretest. The Brazilian-Portuguese version of the DASS was tested among subjects followed in a university hospital anticoagulation outpatient clinic. The psychometric properties were assessed by construct validity (convergent, known groups and dimensionality) and internal consistency/reliability (Cronbach's alpha). A total of 180 subjects under oral anticoagulation formed the baseline validation population. DASS total score and SF-36 domain correlations were moderate for General health (r=-0.47, p<0.01), Vitality (r=-0.44, p<0.01) and Mental health (r=-0.42, p<0.01) (convergent). Age and length on oral anticoagulation therapy (in years) were weakly correlated with total DASS score and most of the subscales, except Limitation (r=-0.375, p<0.01) (known groups). The Cronbach's alpha coefficient was 0.79 for the total scale, and it ranged from 0.46 (psychological impact) to 0.76 (hassles and burdens) among the domains, confirming the internal consistency reliability. The Brazilian-Portuguese version of the DASS has shown levels of reliability and validity comparable with the original English version. Healthcare practitioners and researchers need internationally validated measurement tools to compare outcomes of interventions in clinical management and research tools in oral anticoagulation therapy.
© 2011 Blackwell Publishing Ltd.
Allen, Kate; Marlow, Ruth; Edwards, Vanessa; Parker, Claire; Rodgers, Lauren; Ukoumunne, Obioha C; Seem, Edward Chan; Hayes, Rachel; Price, Anna; Ford, Tamsin
2018-01-01
There is a growing focus on child wellbeing and happiness in schools, but we lack self-report measures for very young children. Three samples (N = 2345) were combined to assess the psychometric properties of the How I Feel About My School (HIFAMS) questionnaire, which was designed for children aged 4-8 years. Test-retest reliability was moderate (intraclass correlation coefficient = .62). HIFAMS assessed a single concept and had moderate internal consistency (Cronbach's alpha values from .62 to .67). There were low correlations between scores on the child-reported HIFAMS and parent and teacher reports. Children at risk of exclusion had significantly lower HIFAMS scores than the community sample (mean difference = 2.4; 95% confidence interval (CI) = [1.6, 3.2]; p < .001). Schools contributed only 4.5% of the variability in HIFAMS score, the remaining 95.5% reflecting pupil differences within schools. Girls' scores were 0.37 units (95% CI = [0.16, 0.57]; p < .001) higher than boys', while year group and deprivation did not predict HIFAMS score. HIFAMS is a promising measure that demonstrates moderate reliability and discriminates between groups even among very young children.
Guo, Jing; Lau, Ajax Hong Yin; Chau, Jack; Ng, Bobby Kin Wah; Lee, Kwong Man; Qiu, Yong; Cheng, Jack Chun Yiu; Lam, Tsz Ping
2016-10-01
"Simplified Chinese" version of Spinal Appearance Questionnaire (SC-SAQ) for patients with adolescent idiopathic scoliosis (AIS) was available but did not fit for communities using "Traditional Chinese" as their primary language. We developed a traditional Chinese version of SAQ (TC-SAQ) and evaluated its reliability and validity. TC-SAQ was administered to 112 AIS patients, of which 101 bilingual (English and Chinese) patients completed E-SAQ and the traditional Chinese version of Scoliosis Research Society-22 questionnaire (TC-SRS-22). Internal consistency and test-retest reliability were evaluated. Concurrent validity was evaluated by comparing TC-SAQ score with E-SAQ score, and convergent validity by comparing TC-SAQ score with TC-SRS-22 self-image domain score, and discriminant validity by analyzing the relationship between TC-SAQ score and patients' characteristics. Internal consistency of individual TC-SAQ domain was high (Cronbach's α = 0.785 to 0.940), except for general (Cronbach's α = 0.665) and shoulders (Cronbach's α = 0.421) domain. Test-retest reliability of TC-SAQ was good (ICCs of each domain from 0.798 to 0.865). Concurrent validity demonstrated an excellent correlation between TC-SAQ and E-SAQ scores (r = 0.820 to 0.954, P < 0.0001 for all domains). Correlation between TC-SAQ domains and TC-SRS-22 self-image domain was weak to moderate. TC-SAQ total score and individual domain scores (except waist and chest domains) were positively correlated to major curve magnitude. TC-SAQ had good internal consistency and test-retest reliability. Concurrent validity evaluated against the original English version was excellent. TC-SAQ was both reliable and valid for clinical use for AIS patients using traditional Chinese as their primary language.
Jeong, Yunwha; Law, Mary; Stratford, Paul; DeMatteo, Carol; Kim, Hwan
2016-11-01
To develop the Korean version of the Participation and Environment Measure for Children and Youth (KPEM-CY) and examine its psychometric properties. The PEM-CY was cross-culturally translated into Korean using a specific guideline: pre-review of participation items, forward/backward translation, expert committee review, pre-test of the KPEM-CY and final review. To establish internal consistency, test-retest reliability and construct validity of the KPEM-CY, 80 parents of children with disabilities aged 5-13 years were recruited in South Korea. Across the home, school and community settings, 76% of participation items and 29% of environment items were revised to improve their fit with Korean culture. Internal consistency was moderate to excellent (0.67-0.92) for different summary scores. Test-retest reliability was excellent (>0.75) in the summary scores of participation frequency and extent of involvement across the three settings and moderate to excellent (0.53-0.95) in all summary scores at home. Child's age, type of school and annual income were the factors that significantly influenced specific dimensions of participation and environment across all settings. Results indicated that the KPEM-CY is equivalent to the original PEM-CY and has initial evidence of reliability and validity for use with Korean children with disabilities. Implications for rehabilitation Because 'participation' is a key outcome of the rehabilitation, measuring comprehensive participation of children with disabilities is necessary. The PEM-CY is a parent-report survey measure to assess comprehensive participation of children and youth and environment, which affect their participation, at home, school and in the community. A cross-cultural adaptation process is mandatory to adapt the measurement tool to a new culture or country. The Korean PEM-CY has both reliability and validity and can therefore generate useful clinical data for Korean children with disabilities.
Shim, Sung Ryul; Sun, Hwa Yeon; Ko, Young Myoung; Chun, Dong-Il; Yang, Won Jae
2014-01-01
Background Smartphone-based assessment may be a useful diagnostic and monitoring tool for patients. There have been many attempts to create a smartphone diagnostic tool for clinical use in various medical fields but few have demonstrated scientific validity. Objective The purpose of this study was to develop a smartphone application of the International Prostate Symptom Score (IPSS) and to demonstrate its validity and reliability. Methods From June 2012 to May 2013, a total of 1581 male participants (≥40 years old), with or without lower urinary tract symptoms (LUTS), visited our urology clinic via the health improvement center at Soonchunhyang University Hospital (Republic of Korea) and were enrolled in this study. A randomized repeated measures crossover design was employed using a smartphone application of the IPSS and the conventional paper form of the IPSS. A paired t test was conducted under a non-inferiority hypothesis. For the reliability test, the intraclass correlation coefficient (ICC) was measured. Results The total score of the IPSS (P=.289) and each item of the IPSS (P=.157-1.000) showed no differences between the paper version and the smartphone version of the IPSS. The mild, moderate, and severe LUTS groups showed no differences between the two versions of the IPSS. A significant correlation was noted in the total group (ICC=.935, P<.001). The mild, moderate, and severe LUTS groups also showed significant correlations (ICC=.616, .549, and .548 respectively, all P<.001). There was selection bias in this study, as only participants who had smartphones could participate. Conclusions The validity and reliability of the smartphone application version were comparable to the conventional paper version of the IPSS. The smartphone application of the IPSS could be an effective method for measuring lower urinary tract symptoms. PMID:24513507
Wang, W; Liu, L; Chang, X; Jia, Z Y; Zhao, J Z; Xu, W D
2016-10-19
The Lysholm Knee Score (LKS) is widely used and is one of the most effective questionnaires employed to assess knee injuries. Although the LKS has been translated into multiple languages, there is no Chinese version even though China has the largest population of patients with knee-joint injuries. The objective of our study was to develop the Chinese version of the LKS (C-LKS) and assess its reliability, validity and responsiveness in Chinese patients with anterior cruciate ligament (ACL) injuries. Study participants were mainly recruited among patients with ACL injuries scheduled for arthroscopic ACL reconstruction at our hospital. First, we developed the C-LKS in a five-step translation and cross-cultural adaptation procedure. Next, we calculated the Cronbach's alpha, intraclass correlation coefficient (ICC), Pearson's correlation coefficient (r), effect size (ES), and standardized response mean (SRM) to evaluate the reliability, validity, and responsiveness of the C-LKS. Overall, 126 patients with ACL injuries successfully completed the questionnaires. Acceptable internal consistency (Cronbach's alpha = 0.726) as well as excellent test-retest reliability (ICC = 0.935) was found for the C-LKS. Good or moderate correlation (r = 0.514-0.837) was found between the C-LKS and the International Knee Documentation Committee Subjective Knee Form (IKDC), the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC), and the physical subscales of the SF-36; the C-LKS also had fair or moderate correlation (r = 0.207-0.462) with the other subscales of the SF-36, indicating good validity of the C-LKS. In addition, good responsiveness was also observed for the C-LKS (ES = 1.36, SRM = 1.26). We have shown that our C-LKS questionnaire is reliable, valid and responsive for the evaluation of Chinese-speaking patients with ACL injuries and that it would be an effective instrument.
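The abstract above reports responsiveness as an effect size (ES) and a standardized response mean (SRM) but does not spell out the formulas. The sketch below uses the common textbook definitions (ES = mean change / SD of baseline scores; SRM = mean change / SD of change scores), which is an assumption about what the authors computed. The function names and the pre/post scores are hypothetical and are not data from the study.

```python
from statistics import mean, stdev


def effect_size(baseline, follow_up):
    """ES: mean change divided by the sample SD of the baseline scores."""
    changes = [post - pre for pre, post in zip(baseline, follow_up)]
    return mean(changes) / stdev(baseline)


def standardized_response_mean(baseline, follow_up):
    """SRM: mean change divided by the sample SD of the change scores."""
    changes = [post - pre for pre, post in zip(baseline, follow_up)]
    return mean(changes) / stdev(changes)


# Hypothetical pre- and post-operative scores for five patients.
pre_scores = [50, 60, 55, 65, 70]
post_scores = [70, 75, 80, 85, 80]

es = effect_size(pre_scores, post_scores)
srm = standardized_response_mean(pre_scores, post_scores)
print(f"ES = {es:.2f}, SRM = {srm:.2f}")
```

Both indices share the same numerator; they differ only in which variability is used to scale it, which is why a study can report different ES and SRM values for the same sample.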
Measuring quality of life in patients with stress urinary incontinence: is the ICIQ-UI-SF adequate?
Kurzawa, Zuzanna; Sutherland, Jason M; Crump, Trafford; Liu, Guiping
2018-05-08
The International Consultation on Incontinence Questionnaire Short Form (ICIQ-UI-SF) is a widely used four-item patient-reported outcome (PRO) measure. Evaluations of this instrument are limited, restraining user's confidence in the instrument. This study conducts a comprehensive evaluation of the ICIQ-UI-SF on a sample of urological surgery patients in Canada. One hundred and seventy-seven surgical patients with stress urinary incontinence completed the ICIQ-UI-SF pre-operatively. Methods drawing from confirmatory factor analysis (CFA), measures of reliability, item response theory (IRT), and differential item functioning were applied. Ceiling effects were examined. Ceiling effects were identified. In the CFA, the factor loadings of items one and two differed significantly (p < 0.001) from item three indicating possible multidimensionality. The first two items reflect symptom severity not quality of life. Reliability was moderate as measured by Cronbach's alpha (0.63) and McDonald's coefficient (0.65). The IRT found the instrument does not discriminate between individuals with low incontinence-related quality of life. Due to low/moderate reliability, the ICIQ-UI-SF can be used as a complement to other data or used to report aggregated surgical outcomes among surgical patients. If the primary objective is to measure quality of life, other PROs should be considered.
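Cronbach's alpha, reported in the abstract above and in most of the other records here, can be computed directly from an item-score matrix. The following is a minimal sketch of the standard formula, alpha = k/(k-1) × (1 − Σ item variances / variance of total scores). The function name and the response data are hypothetical and are not taken from any of the studies.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(rows[0])
    # Transpose so that each tuple holds one item's scores across respondents.
    items = list(zip(*rows))
    item_variance_sum = sum(variance(item) for item in items)
    total_variance = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - item_variance_sum / total_variance)


# Hypothetical responses: five respondents, four items.
responses = [
    [1, 2, 2, 1],
    [2, 3, 3, 2],
    [3, 3, 4, 3],
    [4, 5, 4, 4],
    [5, 5, 5, 4],
]
print(f"alpha = {cronbach_alpha(responses):.3f}")
```

Because alpha depends on the number of items as well as their intercorrelation, short instruments tend to produce lower values than long ones with comparable item quality.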
Mehta, Saurabh P; MacDermid, Joy C; Richardson, Julie; MacIntyre, Norma J; Grewal, Ruby
2015-01-01
Clinical measurement. This study examined test-retest reliability and convergent/divergent construct validity of selected tests and measures that assess balance impairment, fear of falling (FOF), impaired physical activity (PA), and lower extremity muscle strength (LEMS) in females >45 years of age after distal radius fracture (DRF). Twenty-one female participants with DRF were assessed on two occasions. Timed Up and Go, Functional Reach, and One Leg Standing tests assessed balance impairment. Shortened Falls Efficacy Scale, Activity-specific Balance Confidence scale, and Fall Risk Perception Questionnaire assessed FOF. International Physical Activity Questionnaire and Rapid Assessment of Physical Activity were administered to assess PA level. Chair stand test and isometric muscle strength testing for hip and knee assessed LEMS. Intraclass correlation coefficients (ICC) examined the test-retest reliability of the measures. Pearson correlation coefficients (r) examined concurrent relationships between the measures. The results demonstrated fair to excellent test-retest reliability (ICC between 0.50 and 0.96) and low to moderate concordance between the measures (low if r ≤ 0.4; moderate if r = 0.4-0.7). The results provide preliminary estimates of test-retest reliability and convergent/divergent construct validity of selected measures associated with increased risk for falling in females >45 years of age after DRF. Further research directions to advance knowledge regarding fall risk assessment in the DRF population have been identified. Copyright © 2015 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
Parr, Jeremy R; De Jonge, Maretha V; Wallace, Simon; Pickles, Andrew; Rutter, Michael L; Le Couteur, Ann S; van Engeland, Herman; Wittemeyer, Kerstin; McConachie, Helen; Roge, Bernadette; Mantoulan, Carine; Pedersen, Lennart; Isager, Torben; Poustka, Fritz; Bolte, Sven; Bolton, Patrick; Weisblatt, Emma; Green, Jonathan; Papanikolaou, Katerina; Baird, Gillian; Bailey, Anthony J
2015-10-01
Clinical genetic studies confirm the broader autism phenotype (BAP) in some relatives of individuals with autism, but there are few standardized assessment measures. We developed three BAP measures (informant interview, self-report interview, and impression of interviewee observational scale) and describe the development strategy and findings from the interviews. International Molecular Genetic Study of Autism Consortium data were collected from families containing at least two individuals with autism. Comparison of the informant and self-report interviews was restricted to samples in which the interviews were undertaken by different researchers from that site (251 UK informants, 119 from the Netherlands). Researchers produced vignettes that were rated blind by others. Retest reliability was assessed in 45 participants. Agreement between live scoring and vignette ratings was very high. Retest stability for the interviews was high. Factor analysis indicated a first factor comprising social-communication items and rigidity (but not other repetitive domain items), and a second factor comprised mainly of reading and spelling impairments. Whole scale Cronbach's alphas were high for both interviews. The correlation between interviews for factor 1 was moderate (adult items 0.50; childhood items 0.43); Kappa values for between-interview agreement on individual items were mainly low. The correlations between individual items and total score were moderate. The inclusion of several factor 2 items lowered the overall Cronbach's alpha for the total set. Both interview measures showed good reliability and substantial stability over time, but the findings were better for factor 1 than factor 2. We recommend factor 1 scores be used for characterising the BAP. © 2015 The Authors Autism Research published by Wiley Periodicals, Inc. on behalf of International Society for Autism Research.
Assessing patient-centered care: one approach to health disparities education.
Wilkerson, LuAnn; Fung, Cha-Chi; May, Win; Elliott, Donna
2010-05-01
Patient-centered care has been described as one approach to cultural competency education that could reduce racial and ethnic health disparities by preparing providers to deliver care that is respectful and responsive to the preferences of each patient. In order to evaluate the effectiveness of a curriculum in teaching patient-centered care (PCC) behaviors to medical students, we drew on the work of Kleinman, Eisenberg, and Good to develop a scale that could be embedded across cases in an objective structured clinical examination (OSCE). To compare the reliability, validity, and feasibility of an embedded patient-centered care scale with the use of a single culturally challenging case in measuring students' use of PCC behaviors as part of a comprehensive OSCE. A total of 322 students from two California medical schools participated in the OSCE as beginning seniors. Cronbach's alpha was used to assess the internal consistency of each approach. Construct validity was addressed by establishing convergent and divergent validity using the cultural challenge case total score and OSCE component scores. Feasibility assessment considered cost and training needs for the standardized patients (SPs). Medical students demonstrated a moderate level of patient-centered skill (mean = 63%, SD = 11%). The PCC Scale demonstrated an acceptable level of internal consistency (alpha = 0.68) over the single case scale (alpha = 0.60). Both convergent and divergent validities were established through low to moderate correlation coefficients. The insertion of PCC items across multiple cases in a comprehensive OSCE can provide a reliable estimate of students' use of PCC behaviors without incurring extra costs associated with implementing a special cross-cultural OSCE. This approach is particularly feasible when an OSCE is already part of the standard assessment of clinical skills. Reliability may be increased with an additional investment in SP training.
Abdovic, Slaven; Mocic Pavic, Ana; Milosevic, Milan; Persic, Mladen; Senecic-Cala, Irena; Kolacek, Sanja
2013-12-01
To assess the reliability and validity of IMPACT-III (HR), a disease-specific, health-related quality of life instrument in Croatian children with inflammatory bowel disease. In a multicenter study, 104 children participated in a validation study of IMPACT-III (HR) cross-culturally adapted for Croatia. Factor analysis was used to determine optimal domain structure for this cohort, analysis of Cronbach's alpha coefficients to test internal reliability, ANOVA to assess discriminant validity, and correlation with Pediatric Quality of Life Inventory, Version 4.0 (PedsQL) using Pearson correlation coefficients to assess concurrent validity. Cronbach's alpha for the IMPACT-III (HR) total score was 0.92. The most robust factor solution was a 5-domain structure: Symptoms, Concerns, Socializing, Body Image, and Worry about Stool, all of which demonstrated good internal reliability (α=0.60-0.89), but two items were dropped to achieve this. Discriminant validity was demonstrated by significant differences (P<0.001) in mean IMPACT-III (HR) scores between quiescent and mild or moderate-severe disease activity groups for the total score (148 vs. 139 or 125) and the following factor scores: Symptoms (84 vs. 71 or 61), Socializing (91 vs. 83 or 76), and Worry about Stool (significant only between quiescent and moderate-severe groups, 90 vs. 62, respectively). Concurrent validity of IMPACT-III (HR) with PedsQL showed significant correlation, which was strongest when similar domains were compared. IMPACT-III (HR) appears to be a useful tool to measure health-related quality of life in Croatian children with Crohn's disease and ulcerative colitis. Copyright © 2012 European Crohn's and Colitis Organisation. Published by Elsevier B.V. All rights reserved.
Maïano, Christophe; Bégarie, Jérôme; Morin, Alexandre J S; Garbarino, Jean-Marie; Ninot, Grégory
2010-01-01
The purpose of this study was to test the reliability (i.e. internal consistency and test-retest reliability) and construct validity (i.e. content validity, factor validity, measurement invariance, and latent mean invariance) of the Nutrition and Activity Knowledge Scale (NAKS) in a sample of French adolescents with mild to moderate Intellectual Disability (ID). A total sample of 260 adolescents (144 boys and 116 girls), aged between 12 and 18 years old, with mild to moderate ID was involved in two studies. In the first study, analysis of items' content reveals that many words from the original version were not understood or induced confusion. These items were reworded and simplified while retaining their original meaning. In the second study, results provided support for: (i) the factor validity and reliability of a 15-item French version of the NAKS; (ii) the measurement invariance of the resulting NAKS across genders and ID levels; (iii) the partial measurement invariance of the resulting NAKS across age groups and type of school placement. In addition, the latent means of the 15-item French version of the NAKS proved to be invariant across gender, age categories, and ID levels, but to vary across type of school placement (with adolescents schooled in self-contained classes from regular schools presenting higher levels of NAK than adolescents placed in specialized establishments). The present results thus provide preliminary evidence regarding the construct validity of a 15-item French version of the NAKS in a sample of adolescents with ID.
Evaluation of constricted affect in chronic pain: an attempt using the Toronto Alexythymia Scale.
Millard, R W; Kinsler, B L
1992-09-01
The Toronto Alexythymia Scale (TAS) was applied as a potential measure of constricted affect among a sample of patients with chronic, non-malignant pain (n = 195). As previously demonstrated with non-clinical samples, the scale was found to possess moderate reliability with two principal internal factors. These factors seemed to reflect social introversion and a lack of proneness to fantasy. There was a moderate, negative association between them. The domain sampled by the TAS was apparently heterogeneous, with total scores showing no relationship to reported disability or pain intensity and a low relationship to reported distress. These results suggest potential limitations of the TAS and the alexythymia construct as means for evaluating constricted affect that accompanies chronic pain.
Development, scoring, and reliability of the Microscale Audit of Pedestrian Streetscapes (MAPS)
2013-01-01
Background Streetscape (microscale) features of the built environment can influence people’s perceptions of their neighborhoods’ suitability for physical activity. Many microscale audit tools have been developed, but few have published systematic scoring methods. We present the development, scoring, and reliability of the Microscale Audit of Pedestrian Streetscapes (MAPS) tool and its theoretically-based subscales. Methods MAPS was based on prior instruments and was developed to assess details of streetscapes considered relevant for physical activity. MAPS sections (route, segments, crossings, and cul-de-sacs) were scored by two independent raters for reliability analyses. There were 290 route pairs, 516 segment pairs, 319 crossing pairs, and 53 cul-de-sac pairs in the reliability sample. Individual inter-rater item reliability analyses were computed using Kappa, intra-class correlation coefficient (ICC), and percent agreement. A conceptual framework for subscale creation was developed using theory, expert consensus, and policy relevance. Items were grouped into subscales, and subscales were analyzed for inter-rater reliability at tiered levels of aggregation. Results There were 160 items included in the subscales (out of 201 items total). Of those included in the subscales, 80 items (50.0%) had good/excellent reliability, 41 items (25.6%) had moderate reliability, and 18 items (11.3%) had low reliability, with limited variability in the remaining 21 items (13.1%). Seventeen of the 20 route section subscales, valence (positive/negative) scores, and overall scores (85.0%) demonstrated good/excellent reliability and 3 demonstrated moderate reliability. Of the 16 segment subscales, valence scores, and overall scores, 12 (75.0%) demonstrated good/excellent reliability, three demonstrated moderate reliability, and one demonstrated poor reliability. Of the 8 crossing subscales, valence scores, and overall scores, 6 (75.0%) demonstrated good/excellent reliability, and 2 demonstrated moderate reliability. The cul-de-sac subscale demonstrated good/excellent reliability. Conclusions MAPS items and subscales predominantly demonstrated moderate to excellent reliability. The subscales and scoring system represent a theoretically based framework for using these complex microscale data and may be applicable to other similar instruments. PMID:23621947
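The MAPS abstract names Kappa and percent agreement as item-level inter-rater statistics. As a rough illustration of how the two differ, the sketch below computes both for two raters coding the same categorical item, using the standard unweighted Cohen's kappa. The function names and the ratings are hypothetical, and whether the authors used this exact variant is an assumption.

```python
from collections import Counter


def percent_agreement(rater_a, rater_b):
    """Proportion of observations on which the two raters gave the same code."""
    return sum(a == b for a, b in zip(rater_a, rater_b)) / len(rater_a)


def cohens_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa: agreement corrected for chance agreement."""
    n = len(rater_a)
    observed = percent_agreement(rater_a, rater_b)
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    categories = set(rater_a) | set(rater_b)
    # Chance agreement from each rater's marginal category proportions.
    expected = sum(counts_a[c] * counts_b[c] for c in categories) / n**2
    if expected == 1:
        # Both raters used one identical category throughout: kappa is undefined.
        return float("nan")
    return (observed - expected) / (1 - expected)


# Hypothetical presence/absence codes from two raters on ten street segments.
rater_a = [1, 1, 0, 0, 1, 0, 1, 1, 0, 0]
rater_b = [1, 1, 0, 1, 1, 0, 1, 0, 0, 0]
print(percent_agreement(rater_a, rater_b), cohens_kappa(rater_a, rater_b))
```

The undefined case in the sketch shows why items with little variability are hard to summarize with kappa alone, which may be one reason such items are often reported separately alongside percent agreement.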
Turkish Version of Kolcaba's Immobilization Comfort Questionnaire: A Validity and Reliability Study.
Tosun, Betül; Aslan, Özlem; Tunay, Servet; Akyüz, Aygül; Özkan, Hüseyin; Bek, Doğan; Açıksöz, Semra
2015-12-01
The purpose of this study was to determine the validity and reliability of the Turkish version of the Immobilization Comfort Questionnaire (ICQ). The sample used in this methodological study consisted of 121 patients undergoing lower extremity arthroscopy in a training and research hospital. The validity study of the questionnaire assessed language validity, structural validity and criterion validity. Structural validity was evaluated via exploratory factor analysis. Criterion validity was evaluated by assessing the correlation between the visual analog scale (VAS) scores (i.e., the comfort and pain VAS scores) and the ICQ scores using Spearman's correlation test. The Kaiser-Meyer-Olkin coefficient and Bartlett's test of sphericity were used to determine the suitability of the data for factor analysis. Internal consistency was evaluated to determine reliability. The data were analyzed with SPSS version 15.00 for Windows. Descriptive statistics were presented as frequencies, percentages, means and standard deviations. A p value ≤ .05 was considered statistically significant. A moderate positive correlation was found between the ICQ scores and the VAS comfort scores; a moderate negative correlation was found between the ICQ and the VAS pain measures in the criterion validity analysis. Cronbach α values of .75 and .82 were found for the first and second measurements, respectively. The findings of this study reveal that the ICQ is a valid and reliable tool for assessing the comfort of patients in Turkey who are immobilized because of lower extremity orthopedic problems. Copyright © 2015. Published by Elsevier B.V.
Simões, Maria do Socorro Mp; Garcia, Isabel Ff; Costa, Lucíola da Cm; Lunardi, Adriana C
2018-05-01
The Life-Space Assessment (LSA) assesses mobility based on the spaces older adults go to, how often they go, and how independently they move. Despite its increased use, LSA measurement properties remain unclear. The aim of the present study was to analyze the content validity, reliability, construct validity and interpretability of the LSA for Brazilian community-dwelling older adults. In this clinimetric study we analyzed the measurement properties (content validity, reliability, construct validity and interpretability) of the LSA administered to 80 Brazilian community-dwelling older adults. Reliability was analyzed by Cronbach's alpha (internal consistency), intraclass correlation coefficients and 95% confidence interval (reproducibility), and standard error of measurement (measurement error). Construct validity was analyzed by Pearson's correlations between the LSA and accelerometry (time in inactivity and moderate-to-vigorous activities), and interpretability was analyzed by determination of the minimal detectable change, and floor and ceiling effects. The LSA met the criteria for content validity. The Cronbach's alpha was 0.92, intraclass correlation coefficient was 0.97 (95% confidence interval 0.95-0.98) and standard error of measurement was 4.12. The LSA showed convergence with accelerometry (negative correlation with time in inactivity and positive correlation with time in moderate to vigorous activities), the minimal detectable change was 0.36 and we observed no floor or ceiling effects. The LSA showed adequate reliability, validity and interpretability for life-space mobility assessment of Brazilian community-dwelling older adults. Geriatr Gerontol Int 2018; 18: 783-789. © 2018 Japan Geriatrics Society.
Measurement of sedentary behaviour in population health surveys: a review and recommendations
LeBlanc, Allana G.; Colley, Rachel C.; Saunders, Travis J.
2017-01-01
Background The purpose of this review was to determine the most valid and reliable questions for targeting key modes of sedentary behaviour (SB) in a broad range of national and international health surveillance surveys. This was done by reviewing the SB modules currently used in population health surveys, as well as examining SB questionnaires that have performed well in psychometric testing. Methods Health surveillance surveys were identified via scoping review and contact with experts in the field. Previous systematic reviews provided psychometric information on pediatric questionnaires. A comprehensive search of four bibliographic databases was used to identify studies reporting psychometric information for adult questionnaires. Only surveys/studies published/used in English or French were included. Results The review identified a total of 16 pediatric and 18 adult national/international surveys assessing SB, few of which have undergone psychometric testing. Fourteen pediatric and 35 adult questionnaires with psychometric information were included. While reliability was generally good to excellent for questions targeting key modes of SB, validity was poor to moderate, and reported much less frequently. The most valid and reliable questions targeting specific modes of SB were combined to create a single questionnaire targeting key modes of SB. Discussion Our results highlight the importance of including SB questions in survey modules that are adaptable, able to assess various modes of SB, and that exhibit adequate reliability and validity. Future research could investigate the psychometric properties of the module we have proposed in this paper, as well as other questionnaires currently used in national and international population health surveys. PMID:29250468
He, S L; Wang, J H; Ji, P
2018-03-01
To validate the Pain Resilience Scale (PRS) for use in Chinese patients with temporomandibular disorders (TMD) pain. According to international guidelines, the original PRS was first translated and cross-culturally adapted to formulate the Chinese version of the PRS (PRS-C). A total of 152 patients with TMD pain were recruited to complete a series of questionnaires. Reliability of the PRS-C was investigated using internal consistency and test-retest reliability. Validity of the PRS-C was assessed in terms of cross-cultural validity and convergent validity. Cross-cultural validity was evaluated using confirmatory factor analysis (CFA). Convergent validity was examined by correlating the PRS-C scores with scores of 2 commonly used pain-related measures (the Connor-Davidson Resilience Scale [CD-RISC] and the Tampa Scale for Kinesiophobia for Temporomandibular Disorders [TSK-TMD]). The PRS-C had a high internal consistency (Cronbach's alpha = 0.92) and good test-retest reliability (intra-class correlation coefficient [ICC] = 0.81). The CFA supported a 2-factor model for the PRS-C with acceptable fit to the data. The fit indices were chi-square/DF = 2.21, GFI = 0.91, TLI = 0.97, CFI = 0.98 and RMSEA = 0.08. As regards convergent validity, the PRS-C evidenced moderate-to-good relationships with the CD-RISC and the TSK-TMD. The PRS-C shows good psychometric properties and could be considered a reliable and valid measure to evaluate pain-related resilience in patients with TMD pain. © 2017 John Wiley & Sons Ltd.
Measurement of sedentary behaviour in population health surveys: a review and recommendations.
Prince, Stephanie A; LeBlanc, Allana G; Colley, Rachel C; Saunders, Travis J
2017-01-01
The purpose of this review was to determine the most valid and reliable questions for targeting key modes of sedentary behaviour (SB) in a broad range of national and international health surveillance surveys. This was done by reviewing the SB modules currently used in population health surveys, as well as examining SB questionnaires that have performed well in psychometric testing. Health surveillance surveys were identified via scoping review and contact with experts in the field. Previous systematic reviews provided psychometric information on pediatric questionnaires. A comprehensive search of four bibliographic databases was used to identify studies reporting psychometric information for adult questionnaires. Only surveys/studies published/used in English or French were included. The review identified a total of 16 pediatric and 18 adult national/international surveys assessing SB, few of which have undergone psychometric testing. Fourteen pediatric and 35 adult questionnaires with psychometric information were included. While reliability was generally good to excellent for questions targeting key modes of SB, validity was poor to moderate, and reported much less frequently. The most valid and reliable questions targeting specific modes of SB were combined to create a single questionnaire targeting key modes of SB. Our results highlight the importance of including SB questions in survey modules that are adaptable, able to assess various modes of SB, and that exhibit adequate reliability and validity. Future research could investigate the psychometric properties of the module we have proposed in this paper, as well as other questionnaires currently used in national and international population health surveys.
Ray, Midge N; Houston, Thomas K; Yu, Feliciano B; Menachemi, Nir; Maisiak, Richard S; Allison, Jeroan J; Berner, Eta S
2006-01-01
The authors developed and evaluated a rating scale, the Attitudes toward Handheld Decision Support Software Scale (H-DSS), to assess physician attitudes about handheld decision support systems. The authors conducted a prospective assessment of psychometric characteristics of the H-DSS including reliability, validity, and responsiveness. Participants were 82 Internal Medicine residents. A higher score on each of the 14 five-point Likert scale items reflected a more positive attitude about handheld DSS. The H-DSS score is the mean across the fourteen items. Attitudes toward the use of the handheld DSS were assessed prior to and six months after receiving the handheld device. Cronbach's Alpha was used to assess internal consistency reliability. Pearson correlations were used to estimate and detect significant associations between scale scores and other measures (validity). Paired sample t-tests were used to test for changes in the mean attitude scale score (responsiveness) and for differences between groups. Internal consistency reliability for the scale was alpha = 0.73. In testing validity, moderate correlations were noted between the attitude scale scores and self-reported Personal Digital Assistant (PDA) usage in the hospital (correlation coefficient = 0.55) and clinic (0.48), p < 0.05 for both. The scale was responsive, in that it detected the expected increase in scores between the two administrations (3.99 (s.d. = 0.35) vs. 4.08, (s.d. = 0.34), p < 0.005). The authors' evaluation showed that the H-DSS scale was reliable, valid, and responsive. The scale can be used to guide future handheld DSS development and implementation.
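Several abstracts in this listing, including the one above, report Cronbach's alpha as the measure of internal consistency. A minimal sketch of the standard formula, alpha = k/(k − 1) × (1 − sum of item variances / variance of total scores), is given below, together with a scale score taken as the mean across items as the H-DSS abstract describes. The response data and function names are hypothetical and serve only to illustrate the calculation.

```python
from statistics import mean, variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(responses[0])
    if k < 2 or len(responses) < 2:
        raise ValueError("need at least two items and two respondents")
    items = list(zip(*responses))  # transpose: one tuple of scores per item
    item_variance_sum = sum(variance(item) for item in items)
    total_variance = variance([sum(row) for row in responses])
    return (k / (k - 1)) * (1 - item_variance_sum / total_variance)


def scale_score(row):
    """Scale score as the mean across items (as described for the H-DSS)."""
    return mean(row)


# Hypothetical five-point Likert responses (4 items, 5 respondents).
data = [
    [4, 5, 4, 4],
    [3, 3, 4, 3],
    [5, 5, 5, 4],
    [2, 3, 2, 3],
    [4, 4, 5, 4],
]
print(f"alpha = {cronbach_alpha(data):.2f}")
print("scale scores:", [scale_score(row) for row in data])
```

Sample variances (n − 1 denominator) are used throughout; because alpha depends only on the ratio of variances, using population variances consistently would give the same value.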
El-Housseiny, Azza A; Alsadat, Farah A; Alamoudi, Najlaa M; El Derwi, Douaa A; Farsi, Najat M; Attar, Moaz H; Andijani, Basil M
2016-04-14
Early recognition of dental fear is essential for the effective delivery of dental care. This study aimed to test the reliability and validity of the Arabic version of the Children's Fear Survey Schedule-Dental Subscale (CFSS-DS). A school-based sample of 1546 children was randomly recruited. The Arabic version of the CFSS-DS was completed by children during class time. The scale was tested for internal consistency and test-retest reliability. To test criterion validity, children's behavior was assessed using the Frankl scale during dental examination, and results were compared with children's CFSS-DS scores. To test the scale's construct validity, scores on "fear of going to the dentist soon" were correlated with CFSS-DS scores. Factor analysis was also used. The Arabic version of the CFSS-DS showed high reliability regarding both test-retest reliability (intraclass correlation = 0.83, p < 0.001) and internal consistency (Cronbach's α = 0.88). It showed good criterion validity: children with negative behavior had significantly higher fear scores (t = 13.67, p < 0.001). It also showed moderate construct validity (Spearman's rho correlation, r = 0.53, p < 0.001). Factor analysis identified the following factors: "fear of invasive dental procedures," "fear of less invasive dental procedures" and "fear of strangers." The Arabic version of the CFSS-DS is a reliable and valid measure of dental fear in Arabic-speaking children. Pediatric dentists and researchers may use this validated version of the CFSS-DS to measure dental fear in Arabic-speaking children.
Arbab, Dariusch; Kuhlmann, Katharina; Schnurr, Christoph; Bouillon, Bertil; Lüring, Christian; König, Dietmar
2017-10-10
Patient-reported outcome measures are a critical tool in evaluating the efficacy of orthopedic procedures and are increasingly used in clinical trials to assess outcomes of health care. The intention of this study was to develop and culturally adapt a German version of the Self-reported Foot and Ankle Score (SEFAS) and to evaluate its reliability, validity and responsiveness. Forward and backward translation was performed according to the guidelines for cross-cultural adaptation of self-reported measures. The German SEFAS was investigated in 177 consecutive patients. All 177 patients completed the German SEFAS, Foot and Ankle Outcome Score (FAOS), Short-Form 36 and numeric scales for pain and disability (NRS) before foot or ankle surgery, and 118 patients completed them 6 months after surgery. Test-retest reliability, internal consistency, floor and ceiling effects, construct validity and minimal important change were analyzed. The German SEFAS demonstrated excellent test-retest reliability with ICC values of 0.97. A Cronbach's alpha (α) value of 0.89 demonstrated strong internal consistency. No floor or ceiling effects were observed for the German version of the SEFAS. As hypothesized, the SEFAS correlated strongly with FAOS and SF-36 domains. It showed moderate (ES/SRM > 0.5) responsiveness between preoperative assessment and postoperative follow-up. The German version of the SEFAS demonstrated good psychometric properties. It proved to be a valid and reliable instrument for use in foot and ankle patients. DRKS00007585.
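The responsiveness result above is expressed as an effect size (ES) and a standardized response mean (SRM), but the abstract does not define either. The sketch below assumes the commonly used definitions: ES is the mean change divided by the standard deviation of the baseline scores, and SRM is the mean change divided by the standard deviation of the change scores. The paired scores are hypothetical.

```python
from statistics import mean, stdev


def effect_size(pre, post):
    """ES = mean(post - pre) / SD(pre); assumed definition."""
    change = [b - a for a, b in zip(pre, post)]
    return mean(change) / stdev(pre)


def standardized_response_mean(pre, post):
    """SRM = mean(post - pre) / SD(post - pre); assumed definition."""
    change = [b - a for a, b in zip(pre, post)]
    return mean(change) / stdev(change)


# Hypothetical paired scores before and after an intervention.
pre_scores = [10, 20, 30]
post_scores = [20, 30, 46]
print(f"ES  = {effect_size(pre_scores, post_scores):.2f}")
print(f"SRM = {standardized_response_mean(pre_scores, post_scores):.2f}")
```

The two indices can diverge noticeably: ES is sensitive to how heterogeneous the sample was at baseline, whereas SRM reflects how consistently individual patients changed.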
Validity and reliability of the Turkish Migraine Disability Assessment (MIDAS) questionnaire.
Ertaş, Mustafa; Siva, Aksel; Dalkara, Turgay; Uzuner, Nevzat; Dora, Babür; Inan, Levent; Idiman, Fethi; Sarica, Yakup; Selçuki, Deniz; Sirin, Hadiye; Oğuzhanoğlu, Atilla; Irkeç, Ceyla; Ozmenoğlu, Mehmet; Ozbenli, Taner; Oztürk, Musa; Saip, Sabahattin; Neyal, Münife; Zarifoğlu, Mehmet
2004-09-01
The aim of this study is to assess the comprehensibility, internal consistency, patient-physician reliability, test-retest reliability, and validity of the Turkish version of the Migraine Disability Assessment (MIDAS) questionnaire in patients with headache. The MIDAS questionnaire was developed by Stewart et al and has been shown to be reliable and valid in determining the degree of disability caused by migraine. This study was designed as a national multicenter study to demonstrate the reliability and validity of the Turkish version of the MIDAS questionnaire. Patients applying to 17 Neurology Clinics in Turkey were evaluated at the baseline (visit 1), week 4 (visit 2), and week 12 (visit 3) visits in terms of disease severity and the comprehensibility, internal consistency, test-retest reliability, and validity of MIDAS. Since the severity of the disease was found to change significantly at visit 2 compared to visit 1, test-retest reliability was assessed using the MIDAS scores of a subgroup of patients whose disease severity remained unchanged (up to +/-3 days difference in the number of days with headache between visits 1 and 2). A total of 306 patients (86.2% female, mean age: 35.0 +/- 9.8 years) were enrolled into the study. A total of 65.7%, 77.5%, and 82.0% of patients reported that "they had fully understood the MIDAS questionnaire" in visits 1, 2, and 3, respectively. A highly positive correlation was found between the physician-applied and patient-applied total MIDAS scores in all three visits (Spearman correlation coefficients were R = 0.87, 0.83, and 0.90, respectively, P < .001). Internal consistency of MIDAS was assessed using Cronbach's alpha and was found to be at acceptable (>0.7) or excellent (>0.8) levels in patient- and physician-applied MIDAS scores, respectively. Total MIDAS score showed good test-retest reliability (R = 0.68). Both the number of days with headache and the total MIDAS scores were positively correlated at all visits with correlation coefficients between 0.47 and 0.63. There was also a moderate degree of correlation (R = 0.54) between the total MIDAS score at week 12 and the number of days with headache at visit 2 + visit 3, which quantify headache-related disability over a 3-month period similar to the MIDAS questionnaire. These findings demonstrated that the Turkish translation is equivalent to the English version of MIDAS in terms of internal consistency, test-retest reliability, and validity. Physicians can reliably use the Turkish translation of the MIDAS questionnaire in defining the severity of illness and its treatment strategy when applied as a self-administered report by migraine patients themselves.
Barbosa, Taís de Souza; Gavião, Maria Beatriz Duarte
2015-01-01
To test the validity and reliability of the Brazilian Portuguese version of the Parental-Caregiver Perceptions Questionnaire (P-CPQ) (Aim 1) and to assess the agreement between parents and children concerning the child's oral health-related quality of life (OHRQoL) (Aim 2). The P-CPQ and the Brazilian Portuguese versions of the Child Perceptions Questionnaires (CPQ8-10 and CPQ11-14) were used. Aim 1 was addressed in a study involving 210 (validity and internal reliability) and 20 (test-retest reliability) parents, and Aim 2 in a study involving 210 pairs of parents and children. Construct validity was calculated using Spearman's correlation and the Mann-Whitney/Kruskal-Wallis tests. Reliability was determined using Cronbach's alpha and the intraclass correlation coefficient (ICC). Agreement between overall and subscale scores derived from the P-CPQ and CPQ was assessed in comparison and correlation analyses. The P-CPQ discriminated among the categories of malocclusion and dmft. The P-CPQ showed good construct validity, good internal consistency reliability, and excellent test-retest reliability. There was systematic under- and overreporting in parents' assessments for younger and older children, respectively. However, the magnitude of the directional differences was small. At the individual level, agreement between parents and children was excellent. However, it ranged from excellent to moderate or substantial in subscales for the CPQ8-10 and CPQ11-14 groups, respectively. The Portuguese version of the P-CPQ is valid and reliable. Some parents have limited knowledge about child OHRQoL. Given that parental and child reports measure different realities concerning the child's OHRQoL, information provided by parents can complement the child's evaluation. © 2015 American Association of Public Health Dentistry.
van Ark, Mathijs; Zwerver, Johannes; Diercks, Ronald L; van den Akker-Scheek, Inge
2014-08-11
Lateral Epicondylalgia (LE) is a common injury for which no reliable and valid measure exists to determine severity in the Dutch language. The Patient-Rated Tennis Elbow Evaluation (PRTEE) is the first questionnaire specifically designed for LE, but it is in English. The aim of this study was to translate the PRTEE into Dutch, cross-culturally adapt it, and determine the reliability and validity of the PRTEE-D (Dutch version). The PRTEE was cross-culturally adapted according to international guidelines. Participants (n = 122) were asked to fill out the PRTEE-D twice with a one-week interval to assess test-retest reliability. Internal consistency of the PRTEE-D was determined by calculating Cronbach's alphas for the questionnaire and subscales. Intraclass Correlation Coefficients (ICC) were calculated for the overall PRTEE-D score, pain and function subscales and individual questions to determine test-retest reliability. Additionally, the Disabilities of the Arm, Shoulder and Hand questionnaire (DASH) and Visual Analogue Scale (VAS) pain scores were obtained from 30 patients to assess construct validity; Spearman's correlation coefficients were calculated between the PRTEE-D (subscales) and DASH and VAS-pain scores. The PRTEE was successfully cross-culturally adapted into Dutch (PRTEE-D). Cronbach's alpha for the first assessment of the PRTEE-D was 0.98; Cronbach's alpha was 0.93 for the pain subscale and 0.97 for the function subscale. ICC for the PRTEE-D was 0.98; subscales also showed excellent ICC values (pain scale 0.97 and function scale 0.97). A significant moderate correlation exists between PRTEE-D and DASH (0.65) and PRTEE-D and VAS pain (0.68). The PRTEE was successfully cross-culturally adapted and this study showed that the PRTEE-D is reliable and valid to obtain an indication of severity of LE. An easy-to-use instrument for practitioners is now available and this facilitates comparing Dutch and international research data.
Odetunde, Marufat Oluyemisi; Akinpelu, Aderonke Omobonike; Odole, Adesola Christiana
2017-10-19
Psychometric evidence is necessary to establish scientific integrity and clinical usefulness of translations and cultural adaptations of the Stroke-Specific Quality of Life (SS-QoL) scale. However, the limited evidence on the psychometrics of the Yoruba version of SS-QoL 2.0 (SS-QoL(Y)) is a significant shortcoming. This study assessed the test-retest reliability, internal consistency, convergent, divergent, discriminant and known-group validity of the SS-QoL(Y). The Yoruba version of the WHOQoL-BREF was used to test the convergent and divergent validity of the SS-QoL(Y) among 100 consenting stroke survivors. The WHOQoL-BREF and SS-QoL(Y) were administered in random order to eliminate bias. The test-retest reliability of the SS-QoL(Y) was assessed among 68 of the respondents within an interval of 7 days. All respondents were purposively recruited from selected secondary and tertiary health facilities in South-west Nigeria. Data were analysed using descriptive statistics of mean and standard deviation, and inferential statistics of Spearman correlation, Cronbach's alpha, Intra-class Correlation Coefficient (ICC), Independent t-test and One-way ANOVA. Alpha level was set at p < 0.05. The physical health, psychological health, social relationship and environment domains on the WHOQoL-BREF, with correlation coefficients that ranged from 0.214 to 0.360, showed significant correlation with similar domains on the SS-QoL(Y). Dissimilar domains between the two scales had r values from 0.035 to 0.366. Discriminant validity of the SS-QoL(Y) showed that items' r values ranged from 0.711 to 0.920 with their hypothesized domains. The scale demonstrated moderate to strong test-retest reliability with Intra-class correlation coefficient (ICC) for the domains and overall scores (r = 0.47 to 0.81) and moderate to high internal consistency (Cronbach's alpha = 0.61 to 0.82) for domain scores. These correlations were also significant for the domains and overall scores (p < 0.05).
There were no significant differences across different age groups or gender for the domains or overall scores of SS-QoL(Y). Discriminant and known-group validity, test-retest reliability and internal consistency of the Yoruba version of the Stroke Specific Quality of Life 2.0 are adequate while the convergent and divergent validity are low but acceptable. The SS-QoL(Y) is recommended for assessing health-related quality of life among Yoruba stroke survivors.
Validity and reliability assessment of the Brazilian version of the game addiction scale (GAS).
Lemos, Igor Lins; Cardoso, Adriana; Sougey, Everton Botelho
2016-05-01
The uncontrolled use of video games can be addictive. The Game Addiction Scale (GAS) is an instrument that was developed to assess this type of addiction. The GAS consists of 21 items that are divided into the following seven factors: salience, tolerance, mood modification, relapse, withdrawal, conflict and problems. This study assessed the convergent validity and reliability of the GAS according to measures of internal consistency and test-retest stability. Three hundred and eighty-four students completed the GAS, the Internet Addiction Test (IAT), the Liebowitz Social Anxiety Scale (LSAS), the Beck Depression Inventory (BDI) and the Video Game Addiction Test (VAT). A subgroup of the participants (n = 76) completed the GAS again after 30 days to determine test-retest stability. The GAS demonstrated excellent internal consistency (Cronbach's alpha = 0.92), was highly correlated with the VAT (r = 0.883) and was moderately correlated with the BDI (r = 0.358), the LSAS (r = 0.326) and the IAT (r = 0.454). In the Brazilian Portuguese population, the GAS shows good internal consistency. These data indicate that the GAS can be used to assess video game addiction due to its demonstrated psychometric validity. Copyright © 2016 Elsevier Inc. All rights reserved.
Navarro-Colom, M; Sendra-Lluis, M A; Castillo-Masa, A M; Robleda, G
2015-01-01
The Behavioral Pain Scale (BPS) is a pain assessment tool that often gives contradictory results when used by different raters. The aim was to assess the internal consistency and interrater reliability of the BPS in pain assessments performed by intensive care nurses. This was a prospective observational study in 34 mechanically ventilated patients, carried out in an Intensive Care Unit from April to June 2012. Variables analyzed included demographic characteristics, diagnosis of referral, clinical status, pain and sedation level. Pain was assessed by two nurses independently at rest (T1) and during a mobilization procedure (T2) using the BPS. Internal consistency was calculated by Cronbach's alpha, and interrater reliability was determined with the intraclass correlation coefficient (ICC), with a confidence interval (CI) of 95%. This study was approved by the Ethical Committee for Clinical Research. One hundred and twenty-eight pain assessments were performed. The Cronbach's alpha of the total BPS score was 0.66 (95%CI: 0.33 to 0.83) at rest and 0.73 (95%CI: 0.47 to 0.87) during mobilization. The ICC of the total BPS score was 0.50 (95%CI: 0.19 to 0.71) at rest and 0.58 (95%CI: 0.31 to 0.77) during mobilization. The internal consistency of the scale is appropriate and interrater agreement is moderate. For the BPS to be useful in clinical practice, it is imperative that nurses have prior experience with a regulated use of this tool. Copyright © 2014 Elsevier España, S.L.U. y SEEIUC. All rights reserved.
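The abstract above reports an intraclass correlation coefficient for two nurses rating the same patients but does not state which ICC form was used. Purely as an illustration, the sketch below computes one common choice, the two-way random-effects, absolute-agreement, single-rater form often written ICC(2,1). The choice of form, the function name and the ratings are all assumptions.

```python
def icc_2_1(ratings):
    """ICC(2,1) for a table of n subjects (rows) by k raters (columns)."""
    n, k = len(ratings), len(ratings[0])
    if n < 2 or k < 2:
        raise ValueError("need at least two subjects and two raters")
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    # Two-way ANOVA sums of squares: subjects, raters, residual.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


# Hypothetical scores from two raters for five patients.
scores = [[3, 4], [5, 5], [4, 6], [7, 8], [3, 3]]
print(f"ICC(2,1) = {icc_2_1(scores):.2f}")
```

Because this form measures absolute agreement, a rater who scores every patient consistently higher than a colleague lowers the coefficient even when the two rank the patients identically.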
The reliability of four widely used patellar height ratios.
van Duijvenbode, Dennis; Stavenuiter, Michel; Burger, Bart; van Dijke, Cees; Spermon, Jacco; Hoozemans, Marco
2016-03-01
The objective of this study was to evaluate the inter-observer reliability and the intra-observer reliability of four patellar height ratios: Insall-Salvati (IS), modified Insall-Salvati (MIS), Blackburne-Peel (BP) and Caton-Deschamps (CD). The patellar height ratios were assessed by four independent examiners using weight-bearing lateral knee radiographs in 30° flexion. Intra-class correlation coefficients and Fleiss' kappas were determined. The inter-observer reliability was excellent for the IS and moderate for the other ratios. When the ratio values were categorized, the inter-observer reliability was strong for the IS, moderate for the MIS and BP, and poor for the CD. The intra-observer reliability was excellent for the IS, MIS and CD, and strong for the BP. When the ratio values were categorized, the intra-observer reliability was strong for the IS and MIS, and moderate for the other ratios. Although the IS showed the best reliability, we advise using the MIS, as it showed the second-best reliability but is, according to the literature, associated with better validity.
Bajwa, Nadia M; Yudkowsky, Rachel; Belli, Dominique; Vu, Nu Viet; Park, Yoon Soo
2017-03-01
The purpose of this study was to provide validity and feasibility evidence in measuring professionalism using the Professionalism Mini-Evaluation Exercise (P-MEX) scores as part of a residency admissions process. In 2012 and 2013, three standardized-patient-based P-MEX encounters were administered to applicants invited for an interview at the University of Geneva Pediatrics Residency Program. Validity evidence was gathered for P-MEX content (item analysis); response process (qualitative feedback); internal structure (inter-rater reliability with intraclass correlation and Generalizability); relations to other variables (correlations); and consequences (logistic regression to predict admission). To improve reliability, Kane's formula was used to create an applicant composite score using P-MEX, structured letter of recommendation (SLR), and structured interview (SI) scores. Applicant rank lists using composite scores versus faculty global ratings were compared using the Wilcoxon signed-rank test. Seventy applicants were assessed. Moderate associations were found between pairwise correlations of P-MEX scores and SLR (r = 0.25, P = .036), SI (r = 0.34, P = .004), and global ratings (r = 0.48, P < .001). Generalizability of the P-MEX using three cases was moderate (G-coefficient = 0.45). P-MEX scores had the greatest correlation with acceptance (r = 0.56, P < .001), were the strongest predictor of acceptance (OR 4.37, P < .001), and increased pseudo R-squared by 0.20 points. Including P-MEX scores increased composite score reliability from 0.51 to 0.74. Rank lists of applicants using composite score versus global rating differed significantly (z = 5.41, P < .001). Validity evidence supports the use of P-MEX scores to improve the reliability of the residency admissions process by improving applicant composite score reliability.
Bergeron, Lise; Smolla, Nicole; Berthiaume, Claude; Renaud, Johanne; Breton, Jean-Jacques; St-Georges, Marie; Morin, Pauline; Zavaglia, Elissa; Labelle, Réal
2017-03-01
The Dominic Interactive for Adolescents-Revised (DIA-R) is a multimedia self-report screen for 9 mental disorders, borderline personality traits, and suicidality defined by the fifth edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM-5). This study aimed to examine the reliability and the validity of this instrument. French- and English-speaking adolescents aged 12 to 15 years (N = 447) were recruited from schools and clinical settings in Montreal and were evaluated twice. The internal consistency was estimated by Cronbach alpha coefficients and the test-retest reliability by intraclass correlation coefficients. Cutoff points on the DIA-R scales were determined by using clinically relevant measures for defining external validation criteria: the Schedule for Affective Disorders and Schizophrenia for School-Aged Children, the Beck Hopelessness Scale, and the Abbreviated-Diagnostic Interview for Borderlines. Receiver operating characteristic (ROC) analyses provided accuracy estimates (area under the ROC curve, sensitivity, specificity, likelihood ratio) to evaluate the ability of the DIA-R scales to predict external criteria. For most of the DIA-R scales, reliability coefficients were excellent or moderate. High or moderate accuracy estimates from ROC analyses demonstrated the ability of the DIA-R thresholds to predict psychopathological conditions. These thresholds were generally able to discriminate between clinical and school subsamples. However, the validity of the obsessions/compulsions scale was too low. Findings clearly support the reliability and the validity of the DIA-R. This instrument may be useful to assess a wide range of adolescents' mental health problems in the continuum of services. This conclusion applies to all scales, except the obsessions/compulsions one.
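The accuracy estimates named above (area under the ROC curve, sensitivity, specificity, likelihood ratio) can be illustrated with a short sketch. It assumes that a score at or above the cutoff counts as a positive screen and estimates the area under the curve as the proportion of case/non-case pairs ranked correctly, with ties counted as half. The scores, labels, cutoff and function names are hypothetical and are not taken from the DIA-R study.

```python
def screening_accuracy(scores, has_condition, cutoff):
    """Sensitivity, specificity and positive likelihood ratio at one cutoff."""
    tp = fp = tn = fn = 0
    for score, condition in zip(scores, has_condition):
        positive = score >= cutoff  # assumed rule: at or above cutoff screens positive
        if positive and condition:
            tp += 1
        elif positive and not condition:
            fp += 1
        elif not positive and condition:
            fn += 1
        else:
            tn += 1
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    lr_positive = sensitivity / (1 - specificity) if specificity < 1 else float("inf")
    return {"sensitivity": sensitivity, "specificity": specificity, "lr_positive": lr_positive}


def auc(scores, has_condition):
    """Area under the ROC curve as the share of correctly ordered case/non-case pairs."""
    cases = [s for s, c in zip(scores, has_condition) if c]
    controls = [s for s, c in zip(scores, has_condition) if not c]
    wins = sum(1.0 if a > b else 0.5 if a == b else 0.0 for a in cases for b in controls)
    return wins / (len(cases) * len(controls))


# Hypothetical scale scores and criterion diagnoses.
example_scores = [1, 2, 3, 4, 5, 6]
example_labels = [False, False, False, True, False, True]
print(screening_accuracy(example_scores, example_labels, cutoff=4))
print(f"AUC = {auc(example_scores, example_labels):.3f}")
```

Repeating `screening_accuracy` over every candidate cutoff traces out the ROC curve, which is one way a threshold balancing sensitivity against specificity can be chosen.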
[Design and Validation of a Questionnaire on Vaccination in Students of Health Sciences, Spain].
Fernández-Prada, María; Ramos-Martín, Pedro; Madroñal-Menéndez, Jaime; Martínez-Ortega, Carmen; González-Cabrera, Joaquín
2016-11-07
Immunization rates among medicine and nursing students (and among health professionals in general) during hospital training are low. It is necessary to investigate the causes of these low immunization rates. The objective of this study was to design and validate a questionnaire for exploring the attitudes and behaviours of medicine and nursing students toward immunization against vaccine-preventable diseases. This was an instrument validation study. The sample included 646 nursing and medicine students at the University of Oviedo, Spain, selected by non-random sampling. After the content validation process, a 24-item questionnaire was designed to assess attitudes and behaviours/behavioural intentions. Reliability (ordinal alpha), internal validity (exploratory factor analysis by parallel analysis), ANOVA and mediational model tests were performed. Exploratory factor analysis yielded two factors which accounted for 48.8% of total variance. Ordinal alpha for the total score was 0.92. Differences were observed across academic years in the dimensions of attitudes (F(5, 447) = 3.728) and knowledge (F(5, 448) = 65.59), but not in behaviours/behavioural intentions (F(5, 461) = 1.680). Attitudes proved to be a moderating variable between knowledge and behaviours/behavioural intentions (indirect effect B = 0.15; SD = 0.3; 95% CI: 0.09-0.19). We developed a questionnaire with sufficient evidence of reliability and internal validity. Scores on attitudes and knowledge increase with the academic year. Attitudes act as a moderating variable between knowledge and behaviours/behavioural intentions.
Rosenberg, David; Schön, Ulla-Karin; Nyholm, Maria; Grim, Katarina; Svedberg, Petra
2017-04-01
Despite the potential impact of shared decision making on users' satisfaction with care and quality in health care decisions, there is a lack of knowledge and skills regarding how to work with shared decision making among health care providers. The aim of this study was to evaluate the psychometric properties of three instruments that measure varied dimensions of shared decision making, based on self-reports by clients, in a Swedish community mental health context. The study sample consisted of 121 clients with experience of community mental health care who were involved in a wide range of decisions regarding both social support and treatment. The questionnaires were examined for face and content validity, internal consistency, test-retest reliability and construct validity. The instruments displayed good face and content validity, satisfactory internal consistency and a moderate to good level of stability in test-retest reliability with fair to moderate construct correlations, in a sample of clients with serious mental illness and experience of community mental health services in Sweden. The questionnaires are considered to be relevant to the decision making process, user-friendly and appropriate in a Swedish community mental health care context. They functioned well in settings where non-medical decisions, regarding social and support services, are the primary focus. The use of instruments that measure various dimensions of the self-reported experience of clients can be a key factor in developing knowledge of how best to implement shared decision making in mental health services.
Steenson, Sharalyn; Özcebe, Hilal; Arslan, Umut; Konşuk Ünlü, Hande; Araz, Özgür M; Yardim, Mahmut; Üner, Sarp; Bilir, Nazmi; Huang, Terry T-K
2018-01-01
Childhood obesity rates have been rising rapidly in developing countries. A better understanding of the risk factors and social context is necessary to inform public health interventions and policies. This paper describes the validation of several measurement scales for use in Turkey, which relate to child and parent perceptions of physical activity (PA) and enablers and barriers of physical activity in the home environment. The aim of this study was to assess the validity and reliability of several measurement scales in Turkey using a population sample across three socio-economic strata in the Turkish capital, Ankara. Surveys were conducted in Grade 4 children (mean age = 9.7 years for boys; 9.9 years for girls), and their parents, across 6 randomly selected schools, stratified by SES (n = 641 students, 483 parents). Construct validity of the scales was evaluated through exploratory and confirmatory factor analysis. Internal consistency of scales and test-retest reliability were assessed by Cronbach's alpha and intra-class correlation. The scales as a whole were found to have acceptable-to-good model fit statistics (PA Barriers: RMSEA = 0.076, SRMR = 0.0577, AGFI = 0.901; PA Outcome Expectancies: RMSEA = 0.054, SRMR = 0.0545, AGFI = 0.916, and PA Home Environment: RMSEA = 0.038, SRMR = 0.0233, AGFI = 0.976). The PA Barriers subscales showed good internal consistency and poor to fair test-retest reliability (personal α = 0.79, ICC = 0.29, environmental α = 0.73, ICC = 0.59). The PA Outcome Expectancies subscales showed good internal consistency and test-retest reliability (negative α = 0.77, ICC = 0.56; positive α = 0.74, ICC = 0.49). Only the PA Home Environment subscale on support for PA was validated in the final confirmatory model; it showed moderate internal consistency and test-retest reliability (α = 0.61, ICC = 0.48). This study is the first to validate measures of perceptions of physical activity and the physical activity home environment in Turkey. 
Our results support the originally hypothesized two-factor structures for Physical Activity Barriers and Physical Activity Outcome Expectancies. However, we found the one-factor rather than two-factor structure for Physical Activity Home Environment had the best model fit. This study provides general support for the use of these scales in Turkey in terms of validity, but test-retest reliability warrants further research.
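The internal-consistency figures reported above are Cronbach's alpha values. As a minimal sketch of how such a coefficient is computed from item-level responses, the function below applies the standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The function name `cronbach_alpha` and the response matrix are hypothetical illustrations and are not data from the study.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(rows[0])
    if k < 2:
        raise ValueError("need at least two items")
    # Transpose so that each tuple holds all responses to one item.
    items = list(zip(*rows))
    item_var_sum = sum(variance(col) for col in items)   # sample variances per item
    total_var = variance([sum(r) for r in rows])          # variance of total scores
    return k / (k - 1) * (1 - item_var_sum / total_var)


# Hypothetical responses: 5 respondents x 3 items (NOT data from the study).
scores = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 2],
    [3, 3, 4],
    [5, 4, 5],
]
alpha = cronbach_alpha(scores)
print(f"alpha = {alpha:.3f}")
```

Sample variances are used throughout; using population variances for both the items and the totals would give the same alpha, because the scaling factor cancels in the ratio.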
Lehotkay, R; Saraswathi Devi, T; Raju, M V R; Bada, P K; Nuti, S; Kempf, N; Carminati, G Galli
2015-03-01
In this study, carried out in collaboration with the Department of Psychology and Parapsychology of Andhra University, the Aberrant Behavior Checklist-Community (ABC-C) was validated in Telugu, the official language of Andhra Pradesh, one of India's 28 states. To assess the factor validity and reliability of this Telugu version, 120 participants with moderate to profound intellectual disability (94 men and 26 women, mean age 25.2, SD 7.1) were rated by the staff of the Lebenshilfe Institution for Mentally Handicapped in Visakhapatnam, Andhra Pradesh, India. Rating data were analysed with a confirmatory factor analysis. The internal consistency was estimated by Cronbach's alpha. To confirm the test-retest reliability, 50 participants were rated twice with an interval of 4 weeks, and 50 were rated by pairs of raters to assess inter-rater reliability. Confirmatory factor analysis revealed that the root mean square error of approximation (RMSEA) was equal to 0.06, the comparative fit index (CFI) was equal to 0.77, and the Tucker Lewis index (TLI) was equal to 0.77, which indicated that the model with five correlated factors had a good fit. Coefficient alpha ranged from 0.85 to 0.92 across the five subscales. Spearman's rank correlation coefficients for inter-rater reliability tests ranged from 0.65 to 0.75, and the correlations for test-retest reliability ranged from 0.58 to 0.76. All reliability coefficients were statistically significant (P < 0.01). The Telugu version of the ABC-C showed factor validity and reliability comparable to the original English version and appears to be useful for assessing behaviour disorders in Indian people with intellectual disabilities. © 2014 MENCAP and International Association of the Scientific Study of Intellectual and Developmental Disabilities and John Wiley & Sons Ltd.
Oyeyemi, Adewale L; Bello, Umar M; Philemon, Saratu T; Aliyu, Habeeb N; Majidadi, Rebecca W; Oyeyemi, Adetoyeje Y
2014-12-01
To investigate the reliability and an aspect of validity of a modified version of the long International Physical Activity Questionnaire (Hausa IPAQ-LF) in Nigeria. Cross-sectional study, examining the reliability and construct validity of the Hausa IPAQ-LF compared with anthropometric and biological variables. Metropolitan Maiduguri, the capital city of Borno State in Nigeria. 180 Nigerian adults (50% women) with a mean age of 35.6 (SD=10.3) years, recruited from neighbourhoods with diverse socioeconomic status and walkability. Domains (domestic physical activity (PA), occupational PA, leisure-time PA, active transportation and sitting time) and intensities of PA (vigorous, moderate and walking) were measured with the Hausa IPAQ-LF on two different occasions, 8 days apart. Outcomes for construct validity were measured body mass index (BMI), systolic blood pressure (SBP) and diastolic blood pressure (DBP). The Hausa IPAQ-LF demonstrated good test-retest reliability (intraclass correlation coefficient, ICC>0.75) for total PA (ICC=0.79, 95% CI 0.65 to 0.82), occupational PA (ICC=0.77, 95% CI 0.68 to 0.82), active transportation (ICC=0.82, 95% CI 0.75 to 0.87) and vigorous intensity activities (ICC=0.82, 95% CI 0.76 to 0.87). Reliability was substantially higher for total PA (ICC=0.80), occupational PA (ICC=0.78), leisure-time PA (ICC=0.75) and active transportation (ICC=0.80) in men than in women, but domestic PA (ICC=0.38) and sitting time (ICC=0.71) demonstrated more substantial reliability coefficients in women than in men. For the construct validity, domestic PA was significantly related mainly with SBP (r=-0.27) and DBP (r=-0.17), and leisure-time PA and total PA were significantly related only with SBP (r=-0.16) and BMI (r=-0.29), respectively. Similarly, moderate-intensity PA was mainly related with SBP (r=-0.16, p<0.05) and DBP (r=-0.21, p<0.01), but vigorous-intensity PA was only related with BMI (r=-0.11, p<0.05).
The modified Hausa IPAQ-LF demonstrated sufficient evidence of test-retest reliability and may be valid for assessing context-specific PA behaviours of adults in Nigeria. Published by the BMJ Publishing Group Limited.
Mills, Sarah D; Kwakkenbos, Linda; Carrier, Marie-Eve; Gholizadeh, Shadi; Fox, Rina S; Jewett, Lisa R; Gottesman, Karen; Roesch, Scott C; Thombs, Brett D; Malcarne, Vanessa L
2018-01-17
Systemic sclerosis (SSc) is an autoimmune disease that can cause disfiguring changes in appearance. This study examined the structural validity, internal consistency reliability, convergent validity, and measurement equivalence of the Social Appearance Anxiety Scale (SAAS) across SSc disease subtypes. Patients enrolled in the Scleroderma Patient-centered Intervention Network Cohort completed the SAAS and measures of appearance-related concerns and psychological distress. Confirmatory factor analysis (CFA) was used to examine the structural validity of the SAAS. Multiple-group CFA was used to determine if SAAS scores can be compared across patients with limited and diffuse disease subtypes. Cronbach's alpha was used to examine internal consistency reliability. Correlations of SAAS scores with measures of body image dissatisfaction, fear of negative evaluation, social anxiety, and depression were used to examine convergent validity. SAAS scores were hypothesized to be positively associated with all convergent validity measures, with correlations significant and moderate to large in size. A total of 938 patients with SSc were included. CFA supported a one-factor structure (CFI: .92; SRMR: .04; RMSEA: .08), and multiple-group CFA indicated that the scalar invariance model best fit the data. Internal consistency reliability was good in the total sample (α = .96) and in disease subgroups. Overall, evidence of convergent validity was found with measures of body image dissatisfaction, fear of negative evaluation, social anxiety, and depression. The SAAS can be reliably and validly used to assess fear of appearance evaluation in patients with SSc, and SAAS scores can be meaningfully compared across disease subtypes. This article is protected by copyright. All rights reserved.
Cheung, Kenneth M C; Senkoylu, Alpaslan; Alanay, Ahmet; Genc, Yasemin; Lau, Sarah; Luk, Keith D
2007-05-01
Validation study to define validity and reliability of an adapted and translated questionnaire. Assessment of the concurrent validity and reliability of a Chinese version of SRS-22 outcome instrument. No valid health-related quality of life (HRQL) outcome instrument exists for patients with spinal deformity in Chinese. The modified SRS-22 questionnaire was proven to be an appropriate outcome instrument in English, and has already been translated and validated in several other languages. The English version of the SRS-22 questionnaire was adapted to Chinese according to the International Quality of Life Assessment Project guidelines. To assess reliability, 48 subjects with adolescent idiopathic scoliosis (mean age, 16.5 years) filled in the questionnaire on 2 separate occasions (Group 1). To assess concurrent validity, 50 subjects (mean age, 21 years) filled in the same questionnaire and a previously validated Chinese version of the Short Form-36 (SF36) questionnaire (Group 2). Internal consistency, reproducibility and concurrent validity were determined with Cronbach's alpha coefficient, intraclass correlation coefficient and Pearson correlation coefficient, respectively. Cronbach's alpha coefficients for the 4 major domains (function/activity, pain, self-image/appearance and mental health) were high. Intraclass correlation was also excellent for all domains. For concurrent validity, excellent correlation was found in 1 domain, good in 12 domains, moderate in 3 domains, and poor in 1 domain of the 17 relevant domains. Both cultural adaptation and linguistic translation are essential in any attempt to use a HRQL questionnaire across cultures. The Chinese version of the SRS-22 outcome instrument has satisfactory internal consistency and excellent reproducibility. It is ready for use in clinical studies on idiopathic scoliosis in Chinese-speaking societies.
Smith, L
2001-01-01
Background—No published quantitative instrument exists to measure maternal satisfaction with the quality of different models of labour care in the UK. Methods—A quantitative psychometric multidimensional maternal satisfaction questionnaire, the Women's Views of Birth Labour Satisfaction Questionnaire (WOMBLSQ), was developed using principal components analysis with varimax rotation of successive versions. Internal reliability and content and construct validity were assessed. Results—Of 300 women sent the first version (WOMBLSQ1), 120 (40%) replied; of 300 sent WOMBLSQ2, 188 (62.7%) replied; of 500 women sent WOMBLSQ3, 319 (63.8%) replied; and of 2400 women sent WOMBLSQ4, 1683 (70.1%) replied. The latter two versions consisted of 10 dimensions in addition to general satisfaction. These were (Cronbach's alpha): professional support in labour (0.91), expectations of labour (0.90), home assessment in early labour (0.90), holding the baby (0.87), support from husband/partner (0.83), pain relief in labour (0.83), pain relief immediately after labour (0.65), knowing labour carers (0.82), labour environment (0.80), and control in labour (0.62). There were moderate correlations (range 0.16–0.73) between individual dimensions and the general satisfaction scale (0.75). Scores on individual dimensions were significantly related to a range of clinical and demographic variables. Conclusion—This multidimensional labour satisfaction instrument has good validity and internal reliability. It could be used to assess care in labour across different models of maternity care, or as a prelude to in depth exploration of specific areas of concern. Its external reliability and transferability to care outside the South West region needs further evaluation, particularly in terms of ethnicity and social class. Key Words: Women's Views of Birth Labour Satisfaction Questionnaire (WOMBLSQ); labour; questionnaire PMID:11239139
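The response rates quoted in the abstract above follow directly from the counts of questionnaires sent and returned. The short check below recomputes them; the counts are taken from the abstract, while the helper name `response_rate` is a hypothetical illustration.

```python
# (sent, replied) per questionnaire version, as reported in the abstract.
versions = {
    "WOMBLSQ1": (300, 120),
    "WOMBLSQ2": (300, 188),
    "WOMBLSQ3": (500, 319),
    "WOMBLSQ4": (2400, 1683),
}


def response_rate(sent, replied):
    """Response rate as a percentage, rounded to one decimal place."""
    return round(100 * replied / sent, 1)


rates = {name: response_rate(sent, replied) for name, (sent, replied) in versions.items()}
for name, rate in rates.items():
    print(f"{name}: {rate}%")
```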
Validation of the Brazilian Portuguese Version of Geriatric Anxiety Inventory--GAI-BR.
Massena, Patrícia Nitschke; de Araújo, Narahyana Bom; Pachana, Nancy; Laks, Jerson; de Pádua, Analuiza Camozzato
2015-07-01
The Geriatric Anxiety Inventory (GAI) is a recently developed scale aiming to evaluate symptoms of anxiety in later life. This 20-item scale uses dichotomous answers highlighting non-somatic anxiety complaints of elderly people. The present study aimed to evaluate the psychometric properties of the Brazilian Portuguese version GAI (GAI-BR) in a sample from community and outpatient psychogeriatric clinic. A mixed convenience sample of 72 subjects was recruited for answering the research protocol. The interview procedures were structured with questionnaires about sociodemographic data, clinical health status, anxiety, and depression previously validated instruments, Mini-Mental State Examination, Mini International Neuropsychiatric Interview, and GAI-BR. Twenty-two percent of the sample were interviewed twice for test-retest reliability. For internal consistency analyses, the Cronbach's α test was applied. The Spearman correlation test was applied to evaluate the test-retest GAI-BR reliability. A ROC (receiver operating characteristic) curve study was made to estimate the GAI-BR area under curve, cut-off points, sensitivity, and specificity for the Generalized Anxiety Disorder diagnosis. The GAI-BR version showed high internal consistency (Cronbach's α = 0.91) and strong and significant test-retest reliability (ρ = 0.85, p < 0.001). It also showed moderate and significant correlation with the Beck Anxiety Inventory (ρ = 0.68, p < 0.001) and the State-Trait Anxiety Inventory (ρ = 0.61, p < 0.001) showing evidence of concurrent validation. The cut-off point of 13 estimated by ROC curve analyses showed sensitivity of 83.3% and specificity of 84.6% to detect Generalized Anxiety Disorder (DSM-IV). GAI-BR has demonstrated very good psychometric properties and can be a reliable instrument to measure anxiety in Brazilian elderly people.
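Sensitivity and specificity at a cut-off, as reported above for the GAI-BR, can be illustrated with a small sketch. It assumes that a score at or above the cut-off counts as screen-positive, which the abstract does not state explicitly. The function name `sens_spec` and the scores and diagnoses are hypothetical and are not data from the study.

```python
def sens_spec(scores, has_condition, cutoff):
    """Return (sensitivity, specificity) for a 'score >= cutoff is positive' rule."""
    tp = fn = tn = fp = 0
    for score, diseased in zip(scores, has_condition):
        positive = score >= cutoff  # assumption: at or above the cut-off is positive
        if diseased and positive:
            tp += 1
        elif diseased:
            fn += 1
        elif positive:
            fp += 1
        else:
            tn += 1
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical scale scores and reference diagnoses (NOT data from the study).
scores = [15, 13, 18, 9, 4, 12, 14, 2]
has_condition = [True, True, True, True, False, False, False, False]

sensitivity, specificity = sens_spec(scores, has_condition, cutoff=13)
print(f"sensitivity = {sensitivity:.2f}, specificity = {specificity:.2f}")
```

A ROC analysis repeats this calculation over every candidate cut-off and selects the one that gives the preferred trade-off between the two quantities.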
Reliability and concurrent validity of the Dutch hip and knee replacement expectations surveys
2010-01-01
Background Preoperative expectations of outcome of total hip and knee arthroplasty are important determinants of patients' satisfaction and functional outcome. Aims of the study were (1) to translate the Hospital for Special Surgery Hip Replacement Expectations Survey and Knee Replacement Expectations Survey into Dutch and (2) to study test-retest reliability and concurrent validity. Methods Patients scheduled for total hip (N = 112) or knee replacement (N = 101) were sent the Dutch Expectations Surveys twice with a 2 week interval to determine test-retest reliability. To determine concurrent validity, the Expectation WOMAC was sent. Results The results for the Dutch Hip Replacement Expectations Survey revealed good test-retest reliability (ICC 0.87), no bias and good internal consistency (alpha 0.86) (N = 72). The correlation between the Hip Expectations Score and the Expectation WOMAC score was 0.59 (N = 86). The results for the Dutch Knee Replacement Expectations Survey revealed good test-retest reliability (ICC 0.79), no bias and good internal consistency (alpha 0.91) (N = 46). The correlation with the Expectation WOMAC score was 0.52 (N = 57). Conclusions Both Dutch Expectations Surveys are reliable instruments to determine patients' expectations before total hip or knee arthroplasty. As for concurrent validity, the correlation between both surveys and the Expectation WOMAC was moderate, confirming that the same construct was determined. However, patients scored systematically lower on the Expectation WOMAC compared to the Dutch Expectation Surveys. Research on patients' expectations before total hip and knee replacement has only been performed in a limited number of countries. With the Dutch Expectations Surveys it is now possible to determine patients' expectations in another culture and healthcare setting. PMID:20958990
Reliability and concurrent validity of the Dutch hip and knee replacement expectations surveys.
van den Akker-Scheek, Inge; van Raay, Jos J A M; Reininga, Inge H F; Bulstra, Sjoerd K; Zijlstra, Wiebren; Stevens, Martin
2010-10-19
Preoperative expectations of outcome of total hip and knee arthroplasty are important determinants of patients' satisfaction and functional outcome. Aims of the study were (1) to translate the Hospital for Special Surgery Hip Replacement Expectations Survey and Knee Replacement Expectations Survey into Dutch and (2) to study test-retest reliability and concurrent validity. Patients scheduled for total hip (N = 112) or knee replacement (N = 101) were sent the Dutch Expectations Surveys twice with a 2 week interval to determine test-retest reliability. To determine concurrent validity, the Expectation WOMAC was sent. The results for the Dutch Hip Replacement Expectations Survey revealed good test-retest reliability (ICC 0.87), no bias and good internal consistency (alpha 0.86) (N = 72). The correlation between the Hip Expectations Score and the Expectation WOMAC score was 0.59 (N = 86). The results for the Dutch Knee Replacement Expectations Survey revealed good test-retest reliability (ICC 0.79), no bias and good internal consistency (alpha 0.91) (N = 46). The correlation with the Expectation WOMAC score was 0.52 (N = 57). Both Dutch Expectations Surveys are reliable instruments to determine patients' expectations before total hip or knee arthroplasty. As for concurrent validity, the correlation between both surveys and the Expectation WOMAC was moderate, confirming that the same construct was determined. However, patients scored systematically lower on the Expectation WOMAC compared to the Dutch Expectation Surveys. Research on patients' expectations before total hip and knee replacement has only been performed in a limited number of countries. With the Dutch Expectations Surveys it is now possible to determine patients' expectations in another culture and healthcare setting.
[Validity and reliability of the CERAD-Col neuropsychological battery].
Aguirre-Acevedo, D C; Gómez, R D; Moreno, S; Henao-Arboleda, E; Motta, M; Muñoz, C; Arana, A; Pineda, D A; Lopera, F
Alzheimer's disease (AD) is an important public health problem due to its disabling character and high individual, familial and social costs. The CERAD neuropsychological battery has been widely used for evaluation and diagnosis of the cognitive deficit associated with AD. This instrument has been adapted to the Colombian culture (CERAD-Col) for the Neurosciences Group. A study was carried out to establish the validity and reliability of the CERAD-Col in Colombian, Spanish-speaking individuals aged 50 years or more. It included 151 controls and 151 AD patients. Controls were selected from a convenience sample of 848 adults aged 50 years or more. The construct validity was determined in three ways: 1) factorial analysis; 2) correlation with the functional scales FAST and GDS (convergent-type validity) and, 3) comparison between the two groups. Internal consistency was determined by means of Cronbach's alpha coefficient. Three factors -memory, language and praxis- explained 88% of the total variance. Moderate but statistically significant correlations were found between neuropsychological tests and functional scales. Internal consistency and test-retest reproducibility were high. The AD group exhibited significantly lower scores (p < 0.05) than the control one. CERAD-Col is valid and reliable for the diagnosis of AD in Colombian Spanish-speaking population aged 50 years or more.
Marcin, James P; Romano, Patrick S; Dharmar, Madan; Chamberlain, James M; Dudley, Nanette; Macias, Charles G; Nigrovic, Lise E; Powell, Elizabeth C; Rogers, Alexander J; Sonnett, Meridith; Tzimenatos, Leah; Alpern, Elizabeth R; Andrews-Dickert, Rebecca; Borgialli, Dominic A; Sidney, Erika; Casper, Charlie; Dean, Jonathan Michael; Kuppermann, Nathan
2018-06-01
To evaluate the consistency, reliability, and validity of an implicit review instrument that measures the quality of care provided to children in the emergency department (ED). Medical records of randomly selected children from 12 EDs in the Pediatric Emergency Care Applied Research Network (PECARN). Eight pediatric emergency medicine physicians applied the instrument to 620 medical records. We determined internal consistency using Cronbach's alpha and inter-rater reliability using the intraclass correlation coefficient (ICC). We evaluated the validity of the instrument by correlating scores with four condition-specific explicit review instruments. Individual reviewers' Cronbach's alpha had a mean of 0.85 with a range of 0.76-0.97; overall Cronbach's alpha was 0.90. The ICC was 0.49 for the summary score with a range from 0.40 to 0.46. Correlations between the quality of care score and the four condition-specific explicit review scores ranged from 0.24 to 0.38. The quality of care instrument demonstrated good internal consistency, moderate inter-rater reliability, high inter-rater agreement, and evidence supporting validity. The instrument could be useful for systems' assessment and research in evaluating the care delivered to children in the ED. © Health Research and Educational Trust.
AO Distal Radius Fracture Classification: Global Perspective on Observer Agreement.
Jayakumar, Prakash; Teunis, Teun; Giménez, Beatriz Bravo; Verstreken, Frederik; Di Mascio, Livio; Jupiter, Jesse B
2017-02-01
Background The primary objective of this study was to test interobserver reliability when classifying fractures by consensus by AO types and groups among a large international group of surgeons. Secondarily, we assessed the difference in inter- and intraobserver agreement of the AO classification in relation to geographical location, level of training, and subspecialty. Methods A randomized set of radiographic and computed tomographic images from a consecutive series of 96 distal radius fractures (DRFs), treated between October 2010 and April 2013, was classified using an electronic web-based portal by an invited group of participants on two occasions. Results Interobserver reliability was substantial when classifying AO type A fractures but fair and moderate for type B and C fractures, respectively. No difference was observed by location, except for an apparent difference between participants from India and Australia classifying type B fractures. No statistically significant associations were observed comparing interobserver agreement by level of training and no differences were shown comparing subspecialties. Intra-rater reproducibility was "substantial" for fracture types and "fair" for fracture groups with no difference accounting for location, training level, or specialty. Conclusion Improved definition of reliability and reproducibility of this classification may be achieved using large international groups of raters, empowering decision making on which system to utilize. Level of Evidence Level III.
AO Distal Radius Fracture Classification: Global Perspective on Observer Agreement
Jayakumar, Prakash; Teunis, Teun; Giménez, Beatriz Bravo; Verstreken, Frederik; Di Mascio, Livio; Jupiter, Jesse B.
2016-01-01
Background The primary objective of this study was to test interobserver reliability when classifying fractures by consensus by AO types and groups among a large international group of surgeons. Secondarily, we assessed the difference in inter- and intraobserver agreement of the AO classification in relation to geographical location, level of training, and subspecialty. Methods A randomized set of radiographic and computed tomographic images from a consecutive series of 96 distal radius fractures (DRFs), treated between October 2010 and April 2013, was classified using an electronic web-based portal by an invited group of participants on two occasions. Results Interobserver reliability was substantial when classifying AO type A fractures but fair and moderate for type B and C fractures, respectively. No difference was observed by location, except for an apparent difference between participants from India and Australia classifying type B fractures. No statistically significant associations were observed comparing interobserver agreement by level of training and no differences were shown comparing subspecialties. Intra-rater reproducibility was “substantial” for fracture types and “fair” for fracture groups with no difference accounting for location, training level, or specialty. Conclusion Improved definition of reliability and reproducibility of this classification may be achieved using large international groups of raters, empowering decision making on which system to utilize. Level of Evidence Level III PMID:28119795
Guo, Jing; Simon, James H; Sedghizadeh, Parish; Soliman, Osman N; Chapman, Travis; Enciso, Reyes
2013-12-01
The purpose of this study was to evaluate the reliability and accuracy of cone-beam computed tomographic (CBCT) imaging against the histopathologic diagnosis for the differential diagnosis of periapical cysts (cavitated lesions) from (solid) granulomas. Thirty-six periapical lesions were imaged using CBCT scans. Apicoectomy surgeries were conducted for histopathological examination. Evaluator 1 examined each CBCT scan for the presence of 6 radiologic characteristics of a cyst (ie, location, periphery, shape, internal structure, effects on surrounding structure, and perforation of the cortical plate). Not every cyst showed all radiologic features (eg, not all cysts perforate the cortical plate). For the purpose of finding the minimum number of diagnostic criteria present in a scan to diagnose a lesion as a cyst, we conducted 6 receiver operating characteristic curve analyses comparing CBCT diagnoses with the histopathologic diagnosis. Two other independent evaluators examined the CBCT lesions. Statistical tests were conducted to examine the accuracy, inter-rater reliability, and intrarater reliability of CBCT images. Findings showed that a score of ≥4 positive findings was the optimal scoring system. The accuracies of differential diagnoses of 3 evaluators were moderate (area under the curve = 0.76, 0.70, and 0.69 for evaluators 1, 2, and 3, respectively). The inter-rater agreement of the 3 evaluators was excellent (α = 0.87). The intrarater agreement was good to excellent (κ = 0.71, 0.76, and 0.77). CBCT images can provide a moderately accurate diagnosis between cysts and granulomas. Copyright © 2013 American Association of Endodontists. Published by Elsevier Inc. All rights reserved.
The Reliability and Validity of the Computerized Double Inclinometer in Measuring Lumbar Mobility
MacDermid, Joy Christine; Arumugam, Vanitha; Vincent, Joshua Israel; Carroll, Krista L
2014-01-01
Study Design: Repeated measures reliability/validity study. Objectives: To determine the concurrent validity, test-retest, inter-rater and intra-rater reliability of lumbar flexion and extension measurements using the Tracker M.E. computerized dual inclinometer (CDI) in comparison to the modified-modified Schober (MMS). Summary of Background: Numerous studies have evaluated the reliability and validity of the various methods of measuring spinal motion, but the results are inconsistent. Differences in equipment and techniques make it difficult to correlate results. Methods: Twenty subjects with back pain and twenty without back pain were selected through convenience sampling. Two examiners measured sagittal plane lumbar range of motion for each subject. Two separate tests with the CDI and one test with the MMS were conducted. Each test consisted of three trials. Instrument and examiner order was randomly assigned. Intra-class correlations (ICCs 2, 2 and 2, 2) and Pearson correlation coefficients (r) were used to calculate reliability and concurrent validity respectively. Results: Intra-trial reliability was high to very high for both the CDI (ICCs 0.85 - 0.96) and MMS (ICCs 0.84 - 0.98). However, the reliability was poor to moderate when the CDI unit had to be repositioned either by the same rater (ICCs 0.16 - 0.59) or a different rater (ICCs 0.45 - 0.52). Inter-rater reliability for the MMS was moderate to high (ICCs 0.75 - 0.82), which bettered the moderate correlation obtained for the CDI (ICCs 0.45 - 0.52). Correlations between the CDI and MMS were poor for flexion (0.32; p<0.05) and poor to moderate (-0.42 - -0.51; p<0.05) for extension measurements. Conclusion: When using the CDI, an average of subsequent tests is required to obtain moderate reliability. The MMS was more reliable than the CDI. The MMS and the CDI measure lumbar movement on different metrics that are not highly related to each other. PMID:25352928
van der Meulen, Ineke; van de Sandt-Koenderman, W Mieke E; Duivenvoorden, Hugo J; Ribbers, Gerard M
2010-01-01
This study explores the psychometric qualities of the Scenario Test, a new test to assess daily-life communication in severe aphasia. The test is innovative in that it: (1) examines the effectiveness of verbal and non-verbal communication; and (2) assesses patients' communication in an interactive setting, with a supportive communication partner. To determine the reliability, validity, and sensitivity to change of the Scenario Test and discuss its clinical value. The Scenario Test was administered to 122 persons with aphasia after stroke and to 25 non-aphasic controls. Analyses were performed for the entire group of persons with aphasia, as well as for a subgroup of persons unable to communicate verbally (n = 43). Reliability (internal consistency, test-retest reliability, inter-judge, and intra-judge reliability) and validity (internal validity, convergent validity, known-groups validity) and sensitivity to change were examined using standard psychometric methods. The Scenario Test showed high levels of reliability. Internal consistency (Cronbach's alpha = 0.96; item-rest correlations = 0.58-0.82) and test-retest reliability (ICC = 0.98) were high. Agreement between judges in total scores was good, as indicated by the high inter- and intra-judge reliability (ICC = 0.86-1.00). Agreement in scores on the individual items was also good (square-weighted kappa values 0.61-0.92). The test demonstrated good levels of validity. A principal component analysis for categorical data identified two dimensions, interpreted as general communication and communicative creativity. Correlations with three other instruments measuring communication in aphasia, that is, Spontaneous Speech interview from the Aachen Aphasia Test (AAT), Amsterdam-Nijmegen Everyday Language Test (ANELT), and Communicative Effectiveness Index (CETI), were moderate to strong (0.50-0.85) suggesting good convergent validity. 
Group differences were observed between persons with aphasia and non-aphasic controls, as well as between persons with aphasia unable to use speech to convey information and those able to communicate verbally; this indicates good known-groups validity. The test was sensitive to changes in performance, measured over a period of 6 months. The data support the reliability and validity of the Scenario Test as an instrument for examining daily-life communication in aphasia. The test focuses on multimodal communication; its psychometric qualities enable future studies on the effect of Alternative and Augmentative Communication (AAC) training in aphasia.
Interrater reliability of the new criteria for behavioral variant frontotemporal dementia.
Lamarre, Amanda K; Rascovsky, Katya; Bostrom, Alan; Toofanian, Parnian; Wilkins, Sarah; Sha, Sharon J; Perry, David C; Miller, Zachary A; Naasan, Georges; Laforce, Robert; Hagen, Jayne; Takada, Leonel T; Tartaglia, Maria Carmela; Kang, Gail; Galasko, Douglas; Salmon, David P; Farias, Sarah Tomaszewski; Kaur, Berneet; Olichney, John M; Quitania Park, Lovingly; Mendez, Mario F; Tsai, Po-Heng; Teng, Edmond; Dickerson, Bradford Clark; Domoto-Reilly, Kimiko; McGinnis, Scott; Miller, Bruce L; Kramer, Joel H
2013-05-21
To evaluate the interrater reliability of the new International Behavioural Variant FTD Criteria Consortium (FTDC) criteria for behavioral variant frontotemporal dementia (bvFTD). Twenty standardized clinical case modules were developed for patients with a range of neurodegenerative diagnoses, including bvFTD, primary progressive aphasia (nonfluent, semantic, and logopenic variant), Alzheimer disease, and Lewy body dementia. Eighteen blinded raters reviewed the modules and 1) rated the presence or absence of core diagnostic features for the FTDC criteria, and 2) provided an overall diagnostic rating. Interrater reliability was determined by κ statistics for multiple raters with categorical ratings. The mean κ value for diagnostic agreement was 0.81 for possible bvFTD and 0.82 for probable bvFTD ("almost perfect agreement"). Interrater reliability for 4 of the 6 core features had "substantial" agreement (behavioral disinhibition, perseverative/compulsive, sympathy/empathy, hyperorality; κ = 0.61-0.80), whereas 2 had "moderate" agreement (apathy/inertia, neuropsychological; κ = 0.41-0.6). Clinician years of experience did not significantly influence rater accuracy. The FTDC criteria show promise for improving the diagnostic accuracy and reliability of clinicians and researchers. As disease-altering therapies are developed, accurate differential diagnosis between bvFTD and other neurodegenerative diseases will become increasingly important.
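The verbal labels quoted above ("moderate", "substantial", "almost perfect agreement") match the widely used Landis and Koch bands for the κ statistic. The sketch below maps a κ value to those bands. The abstract does not name the convention, so treating it as Landis and Koch, including the lower bands that the abstract does not mention, is an assumption, and the function name `kappa_label` is hypothetical.

```python
def kappa_label(kappa):
    """Map a kappa value to a Landis-and-Koch-style agreement label (assumed convention)."""
    if kappa > 1.0:
        raise ValueError("kappa cannot exceed 1")
    if kappa < 0:
        return "poor"
    # Upper bound of each band, checked in ascending order.
    bands = [
        (0.20, "slight"),
        (0.40, "fair"),
        (0.60, "moderate"),
        (0.80, "substantial"),
        (1.00, "almost perfect"),
    ]
    for upper, label in bands:
        if kappa <= upper:
            return label


# Mean kappa values reported in the abstract for possible and probable bvFTD.
for k in (0.81, 0.82):
    print(k, kappa_label(k))
```

Values that fall between the published band edges (for example 0.605) are assigned to the next band up by this sketch; that tie-breaking choice is a design decision of the example, not part of the convention.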
Reliability and validity of the Wolfram Unified Rating Scale (WURS)
2012-01-01
Background Wolfram syndrome (WFS) is a rare, neurodegenerative disease that typically presents with childhood onset insulin dependent diabetes mellitus, followed by optic atrophy, diabetes insipidus, deafness, and neurological and psychiatric dysfunction. There is no cure for the disease, but recent advances in research have improved understanding of the disease course. Measuring disease severity and progression with reliable and validated tools is a prerequisite for clinical trials of any new intervention for neurodegenerative conditions. To this end, we developed the Wolfram Unified Rating Scale (WURS) to measure the severity and individual variability of WFS symptoms. The aim of this study is to develop and test the reliability and validity of the Wolfram Unified Rating Scale (WURS). Methods A rating scale of disease severity in WFS was developed by modifying a standardized assessment for another neurodegenerative condition (Batten disease). WFS experts scored the representativeness of WURS items for the disease. The WURS was administered to 13 individuals with WFS (6-25 years of age). Motor, balance, mood and quality of life were also evaluated with standard instruments. Inter-rater reliability, internal consistency reliability, concurrent, predictive and content validity of the WURS were calculated. Results The WURS had high inter-rater reliability (ICCs>.93), moderate to high internal consistency reliability (Cronbach’s α = 0.78-0.91) and demonstrated good concurrent and predictive validity. There were significant correlations between the WURS Physical Assessment and motor and balance tests (rs>.67, p<.03), between the WURS Behavioral Scale and reports of mood and behavior (rs>.76, p<.04) and between WURS Total scores and quality of life (rs=-.86, p=.001). The WURS demonstrated acceptable content validity (Scale-Content Validity Index=0.83). 
Conclusions These preliminary findings demonstrate that the WURS has acceptable reliability and validity and captures individual differences in disease severity in children and young adults with WFS. PMID:23148655
Hemke, Robert; Tzaribachev, Nikolay; Nusman, Charlotte M; van Rossum, Marion A J; Maas, Mario; Doria, Andrea S
2017-08-01
There is increasing evidence that early therapeutic intervention improves longterm joint outcome in juvenile idiopathic arthritis (JIA). Given the existence of highly effective treatments, there is an urgent need for reliable and accurate measures of disease activity and joint damage in JIA. Our objective was to assess the reliability of 2 magnetic resonance imaging (MRI) scoring methods: the Juvenile Arthritis MRI Scoring (JAMRIS) system and the International Prophylaxis Study Group (IPSG) consensus score, for evaluating disease status of the knee in patients with JIA. Four international readers independently scored an MRI dataset of 25 JIA patients with clinical knee involvement. Synovial thickening, joint effusion, bone marrow changes, cartilage lesions, bone erosions, and subchondral cysts were scored using the JAMRIS and IPSG systems. Further, synovial enhancement, infrapatellar fat pad heterogeneity, tendinopathy, and enthesopathy were scored. Interreader reliability was analyzed by using the generalized κ, ICC, and the smallest detectable difference (SDD). ICC regarding interreader reliability ranged from 0.33 (95% CI 0.12-0.52, SDD = 0.29) for enthesopathy up to 0.95 (95% CI 0.92-0.97, SDD = 3.19) for synovial thickening. Good interreader reliability was found concerning joint effusion (ICC 0.93, 95% CI 0.89-0.95, SDD = 0.51), synovial enhancement (ICC 0.90, 95% CI 0.85-0.94, SDD = 9.85), and bone marrow changes (ICC 0.87, 95% CI 0.80-0.92, SDD = 10.94). Moderate to substantial reliability was found concerning cartilage lesions and bone erosions (ICC 0.55-0.72, SDD 1.41-13.65). The preliminary results are promising for most of the scored JAMRIS and IPSG items. However, further refinement of the scoring system is warranted for unsatisfactorily reliable items such as bone erosions, cartilage lesions, and enthesopathy.
Kaux, Jean-François; Delvaux, François; Schaus, Jean; Demoulin, Christophe; Locquet, Médéa; Buckinx, Fanny; Beaudart, Charlotte; Dardenne, Nadia; Van Beveren, Julien; Croisier, Jean-Louis; Forthomme, Bénédicte; Bruyère, Olivier
Translation and validation of algo-functional questionnaire. Lateral elbow tendinopathy is a common injury in tennis players and physical workers. The Patient-Rated Tennis Elbow Evaluation (PRTEE) Questionnaire was specifically designed to measure pain and functional limitations in patients with lateral epicondylitis (tennis elbow). First developed in English, this questionnaire has since been translated into several languages. The aims of the study were to translate and cross-culturally adapt the PRTEE questionnaire into French and to evaluate the reliability and validity of this translated version of the questionnaire (PRTEE-F). The PRTEE was translated and cross-culturally adapted into French according to international guidelines. To assess the reliability and validity of the PRTEE-F, 115 participants were asked to fill in the PRTEE-F twice, and the Disabilities of Arm, Shoulder and Hand Questionnaire (DASH) and the Short Form Health Survey (SF-36) once. Internal consistency (using Cronbach's alpha), test-retest reliability (using intraclass correlation coefficient (ICC), standard error of measurement and minimal detectable change), and convergent and divergent validity (using Spearman's correlation coefficients with the DASH and with some subscales of the SF-36, respectively) were assessed. The PRTEE was translated into French without any problems. The PRTEE-F showed good test-retest reliability for the overall score (ICC 0.86) and for each item (ICC 0.8-0.96) and high internal consistency (Cronbach's alpha = 0.98). The correlation analyses revealed high correlation coefficients between PRTEE-F and DASH (convergent validity) and, as expected, a low or moderate correlation with the divergent subscales of the SF-36 (discriminant validity). There was no floor or ceiling effect. The PRTEE questionnaire was successfully cross-culturally adapted into French.
The PRTEE-F is reliable and valid for evaluating French-speaking patients with lateral elbow tendinopathy. Copyright © 2016 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
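The PRTEE-F abstract mentions the standard error of measurement (SEM) and minimal detectable change (MDC) without giving the formulas used. A sketch under the commonly used definitions, SEM = SD × sqrt(1 − ICC) and MDC95 = 1.96 × sqrt(2) × SEM, is shown below; these definitions are assumed rather than confirmed by the abstract, and the standard deviation in the usage lines is a made-up placeholder, since the abstract does not report one.

```python
import math


def sem_from_icc(sd, icc):
    """Standard error of measurement from a score SD and an ICC
    (assumed definition: SD * sqrt(1 - ICC))."""
    if not 0.0 <= icc <= 1.0:
        raise ValueError("icc must lie between 0 and 1")
    return sd * math.sqrt(1.0 - icc)


def mdc95(sem):
    """Minimal detectable change at the 95% level
    (assumed definition: 1.96 * sqrt(2) * SEM)."""
    return 1.96 * math.sqrt(2.0) * sem


# ICC 0.86 is the overall test-retest value quoted in the abstract;
# the SD of 20 points is hypothetical and only illustrates the arithmetic.
hypothetical_sd = 20.0
sem = sem_from_icc(hypothetical_sd, 0.86)
print(round(sem, 2), round(mdc95(sem), 2))
```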
Fatigue in children: reliability and validity of the Dutch PedsQL™ Multidimensional Fatigue Scale.
Gordijn, M Suzanne; Cremers, Eline M P; Kaspers, Gertjan J L; Gemke, Reinoud J B J
2011-09-01
The aim of the study was to report on the feasibility, reliability, validity, and norm-references of the Dutch version of the PedsQL™ Multidimensional Fatigue Scale. Study participants were 497 parents of children aged 2-18 years and 366 children aged 5-18 years from various day care facilities, elementary schools, and a high school who completed the Dutch version of the PedsQL™ Multidimensional Fatigue Scale. The number of missing items was minimal. All scales showed satisfactory internal consistency reliability, with Cronbach's coefficient alpha exceeding 0.70. Test-retest reliability was good to excellent (ICCs 0.68-0.84) and inter-observer reliability varied from moderate to excellent (ICCs 0.56-0.93) for total scores. Parent/child concordance for total scores was poor to good (ICCs 0.25-0.68). The PedsQL™ Multidimensional Fatigue Scale was able to distinguish between healthy children and children with an impaired health condition. The Dutch version of the PedsQL™ Multidimensional Fatigue Scale demonstrates adequate feasibility, reliability, and validity in another sociocultural context. With the obtained norm-references, it can be used as a tool in the evaluation of fatigue in healthy and chronically ill children aged 2-18 years.
Validation of the Walking Impairment Questionnaire for Spanish patients.
Lozano, Francisco S; March, José R; González-Porras, José R; Carrasco, Eduardo; Lobos, José M; Areitio-Aurtena, Alix
2013-09-01
The Walking Impairment Questionnaire (WIQ) is a short, easy to complete, disease-specific questionnaire to assess intermittent claudication. A Spanish version of the WIQ for Hispanic Americans has recently been validated in Texas, but it needs to be validated for European Spanish people. After translation and cultural adaptation of the WIQ, 920 patients with intermittent claudication (ankle brachial index < 0.9) completed two questionnaires (Spanish version of the WIQ and European Quality of Life 5 Dimension [EQ-5D]). The validity of the WIQ was determined by correlating WIQ and EQ-5D. Test-retest reliability and internal consistency were determined using the intra-class correlation coefficient (ICC) and Cronbach's alpha, respectively. The three domains of the WIQ were moderately correlated with the EQ-5D health outcome (r = 0.54 to 0.60; p < 0.001). Test-retest reliabilities ranged from ICC = 0.89 to 0.91 and internal consistency (Cronbach's alpha = 0.92) was high. The Spanish version of the WIQ for European Spanish patients was valid and reproducible, suggesting that it could be used in Spanish patients with intermittent claudication.
Psychometric Evaluation of Kingston Caregiver Stress Scale.
Sadak, Tatiana; Korpak, Anna; Wright, Jacob D; Lee, Mee Kyung; Noel, Margaret; Buckwalter, Kathleen; Borson, Soo
2017-01-01
Standardized measurement of caregiver stress is a component of Medicare's new health care benefit supporting care planning for people with dementia. In this article we identify existing measures of caregiver stress, strain and burden and propose specific criteria for choosing tools that may be suitable for wide use in primary care settings. We reviewed 22 measures and identified one, the Kingston Caregiver Stress Scale (KCSS), which met all the proposed criteria but had not been studied in a U.S. We conducted a psychometric evaluation of KCSS to determine its potential usefulness as a care planning tool with a U.S. We examined the internal consistency, test-retest reliability, component structure, and relationship to depression and anxiety in 227 dementia caregivers at two U.S. sites. The KCSS has high internal consistency and test-retest reliability, a strong factor structure, and moderate to high correlations with caregiver depression and anxiety. KCSS is a good candidate for use as part of comprehensive care planning for people with dementia and their caregivers. Routine assessment of caregiver stress in clinical care may facilitate timely intervention and potentially improve both patient and caregiver outcomes.
Low, Andrea Hsiu Ling; Xin, Xiaohui; Law, Weng Giap; Teng, Gim Gee; Santosa, Amelia; Lim, Anita; Chan, Grace; Ng, Swee Cheng; Thumboo, Julian
2017-07-01
The aim of this study was to (1) translate the Gastrointestinal Tract Instrument (GIT) 2.0 from English to Chinese and (2) validate both versions in a multi-ethnic systemic sclerosis cohort in Singapore (SCORE). The English GIT2.0 was translated to Chinese using a standard forward-backward translation approach. Psychometric evaluation of the GIT2.0 included internal consistency reliability (using Cronbach's alpha), test-retest reliability (using intra-class correlation coefficient (ICC)), scale level factor analysis, and construct validity (using Spearman correlation) against the modified Scleroderma Health Assessment Questionnaire (S-HAQ) and the SF-36 v2. Most of the patients were females (88.6%) and Chinese (78.2%), with mean (SD) age of 51.0 (13.0) years and median disease duration of 4.5 years. We administered English (n = 146) and Chinese (n = 74) GIT2.0. The mean (SD) total GIT score was 0.29 (0.37). There was good internal consistency (Cronbach's alpha >0.70 for all subscales) and good test-retest reliability for the scale and all subscales (ICC 0.71-0.92) except for "diarrhoea" (ICC = 0.54). Our hypothesised a priori construct validity was supported by moderate correlations between the total GIT score and S-HAQ GI subscale (r = 0.446), and the social functioning subscale and SF36v2 role-social domain (r = 0.337), and weak-to-moderate correlation between the emotional subscale and SF-36v2 role-emotional (r = 0.295) and mental health (r = 0.298) domains and mental component summary (r = 0.356). Exploratory factor analysis of the seven subscales yielded a two-factor solution explaining 69.63% of the total variance. This study provides evidence for the reliability and validity of the English and Chinese GIT2.0 to be used in Singapore for research and routine practice.
Addison, Clifton C.; Campbell-Jenkins, Brenda W.; Sarpong, Daniel F.; Kibler, Jeffery; Singh, Madhu; Dubbert, Patricia; Wilson, Gregory; Payne, Thomas; Taylor, Herman
2007-01-01
This study sought to establish the psychometric properties of a Coping Strategies Inventory Short Form (CSI-SF) by examining coping skills in the Jackson Heart Study cohort. We used exploratory and confirmatory factor analysis, Pearson’s correlation, and Cronbach’s alpha to examine the reliability and validity of the CSI-SF, which solicited responses from 5302 African American men and women between the ages of 35 and 84. One item was dropped from the 16-item CSI-SF, making it a 15-item survey. No significant effects were found for age and gender, strengthening the generalizability of the CSI-SF. The internal consistency reliability analysis yielded alpha values of 0.58–0.72 across the scales, and all of the fit indices used to examine the CSI-SF provided support for its use as an adequate measure of coping. This study provides empirical support for utilizing this instrument in future efforts to understand the role of coping in moderating health outcomes. PMID:18180539
QUIROZ, Viviana; REINERO, Daniela; HERNÁNDEZ, Patricia; CONTRERAS, Johanna; VERNAL, Rolando; CARVAJAL, Paola
2017-01-01
The major infectious diseases in Chile encompass the periodontal diseases, with a combined prevalence that rises up to 90% of the population. Thus, the population-based surveillance of periodontal diseases plays a central role for assessing their prevalence and for planning, implementing, and evaluating preventive and control programs. Self-report questionnaires have been proposed for the surveillance of periodontal diseases in adult populations world-wide. Objective This study aimed to develop and assess the content validity and reliability of a cognitively adapted self-report questionnaire designed for surveillance of gingivitis in adolescents. Material and Methods Ten predetermined self-report questions evaluating early signs and symptoms of gingivitis were preliminarily assessed by a panel of clinical experts. Eight questions were selected and cognitively tested in 20 adolescents aged 12 to 18 years from Santiago de Chile. The questionnaire was then administered to and answered by 178 Chilean adolescents. Internal consistency was measured using Cronbach’s alpha and temporal stability was calculated using the Kappa-index. Results A reliable final self-report questionnaire consisting of 5 questions was obtained, with a total Cronbach’s alpha of 0.73 and a Kappa-index ranging from 0.41 to 0.77 between the different questions. Conclusions The proposed questionnaire is reliable, with an acceptable internal consistency and a temporal stability from moderate to substantial, and it is promising for estimating the prevalence of gingivitis in adolescents. PMID:28877279
The Cardiff Acne Disability Index (CADI): linguistic and cultural validation in Serbian.
Jankovic, Slavenka; Vukicevic, Jelica; Djordjevic, Sanja; Jankovic, Janko; Marinkovic, Jelena; Basra, Mohammad K A
2013-02-01
The aims of this study were to translate the Cardiff Acne Disability Index (CADI) into Serbian and to assess its validity and reliability in Serbian acne patients. The CADI was translated and linguistically validated into Serbian according to published guidelines. This version of CADI, along with the Serbian version of Children's Dermatology Life Quality Index (CDLQI) and a short demographic questionnaire, was administered to a cohort of secondary school pupils. The Global Acne Grading Score was used to measure the clinical severity of acne. The internal consistency reliability of the Serbian version of CADI was assessed by Cronbach's alpha coefficient while its concurrent validity was assessed by Spearman's correlation coefficient. Construct validity was examined by factor analysis. A total of 465 pupils completed questionnaires. Self-reported acne was present in 76% of pupils (353/465). The Serbian version of CADI showed high internal consistency reliability (Cronbach's alpha coefficient = 0.79). The mean item-total correlation coefficient was 0.74 with a range of 0.53-0.81. The concurrent validity of the scale was supported by a moderate but highly significant correlation with the CDLQI (Spearman's rho = 0.66; P < 0.001). Factor analysis revealed the presence of two dimensions underlying the factor structure of the scale. The Serbian version of the CADI is a reliable, valid, and valuable tool for assessing the impact of acne on the quality of life of Serbian-speaking patients.
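Many of the abstracts in this listing summarize internal consistency with Cronbach's alpha. As a rough illustration of how that coefficient is usually computed from item-level responses, the sketch below uses the standard formula alpha = k/(k − 1) × (1 − Σ item variances / variance of total scores). The function name and the tiny response matrix are hypothetical and are not taken from any study here.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of
    k item scores (illustrative helper using sample variances)."""
    n_items = len(responses[0])
    if n_items < 2:
        raise ValueError("at least two items are required")
    if any(len(row) != n_items for row in responses):
        raise ValueError("every respondent needs a score for every item")

    # Variance of each item across respondents.
    item_variances = [variance(column) for column in zip(*responses)]
    # Variance of each respondent's summed score.
    total_variance = variance([sum(row) for row in responses])
    return n_items / (n_items - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical 5-item questionnaire answered by 4 respondents.
responses = [
    [2, 3, 3, 2, 3],
    [4, 4, 5, 4, 4],
    [1, 2, 1, 2, 1],
    [3, 3, 4, 3, 3],
]
print(round(cronbach_alpha(responses), 3))
```

Sample variances (n − 1 denominator) are used for both the items and the totals; using population variances throughout gives the same ratio and therefore the same alpha.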
Reliability and Validity of the Persian HIT-6 Questionnaire in Migraine and Tension-type Headache.
Zandifar, Alireza; Banihashemi, Mahboobeh; Haghdoost, Faraidoon; Masjedi, Samaneh S; Manouchehri, Navid; Asgari, Fatemeh; Najafi, Mohammad R; Ghorbani, Abbas; Zandifar, Samaneh; Saadatnia, Mohammad; White, Michelle K
2014-09-01
Headache Impact Test (HIT-6) measures the impact of headaches over a 1-month period. We validated the Persian translation of HIT-6, compared the HIT-6 psychometric analysis between migraine and tension-type headache (TTH) patients, and evaluated the capability of HIT-6 to differentiate between TTH, chronic migraine, and episodic migraine. Qualified participants, including 274 patients diagnosed with migraine or TTH, were required to complete HIT-6, SF-36v2, and a symptoms questionnaire on their first visit. At 3 and 8 weeks from first visit, participants completed HIT-6. Internal consistency (Cronbach's α) and test-retest reproducibility (Pearson's correlation coefficient) were used to assess reliability. Convergent validity was also assessed. Tension-type headache, episodic migraine, and chronic migraine accounted for 24.5%, 61.9%, and 13.6% of the participants, respectively. Internal consistency values among all patients, TTH patients, and migraine patients at the first visit were 0.74, 0.77, and 0.73, respectively. Test-retest reliability for HIT-6 between visits 1 and 2 showed a moderate level of correlation (r = 0.50). Convergent validity and item-total correlation were acceptable. There was no significant difference in HIT-6 total score between TTH and migraine. Persian HIT-6 is a valid and reliable questionnaire for the evaluation of headache. However, it cannot differentiate between chronic migraine, episodic migraine, and TTH in the Iranian population. © 2013 World Institute of Pain.
Healthy eating opinion survey for individuals at risk for cardiovascular disease.
Mark, Amy E; Riley, Dana L; McDonnell, Lisa A; Pipe, Andrew L; Reid, Robert D
2014-08-01
To develop and evaluate the validity and reliability of a questionnaire to measure intentions and beliefs about healthy eating in individuals at risk for coronary heart disease. The Healthy Eating Opinion Survey was developed using the theory of planned behavior. An open-ended elicitation questionnaire was administered to 21 participants, and a 46-item questionnaire was developed for further testing. Test-retest reliability of each question on the survey was assessed by calculating the correlation coefficients between the responses over a 2-week period in 17 participants. Internal consistency was assessed using Cronbach's alpha, and factor analysis was used to assess the construct validity of the questionnaire in a sample of 388 participants. The responses to the elicitation questions were used to develop behavioral beliefs, normative beliefs, and control beliefs questions for the final questionnaire. Test-retest reliability ranged from 0.22-0.90, with the majority (89%) of correlations being moderate to strong. Internal consistency was good, with Cronbach's alpha ranging from 0.74-0.92. All intentions questions loaded onto a single factor; attitude questions loaded onto two factors; subjective norm questions loaded onto two factors; perceived behavioral control questions loaded onto one factor; behavioral beliefs questions loaded onto one factor; normative beliefs questions loaded onto one factor; and control beliefs questions loaded onto one factor. The questionnaire was found to be a reliable, valid questionnaire to assess beliefs and intentions toward eating a healthy diet in individuals at risk for coronary heart disease.
Salcı, Yeliz; Fil, Ayla; Keklicek, Hilal; Çetin, Barış; Armutlu, Kadriye; Dolgun, Anıl; Tuncer, Aslı; Karabudak, Rana
2017-11-01
Ataxia is an extremely common problem in multiple sclerosis (MS) patients. Thus, appropriate scales are required for detailed assessment of this issue. The aim of our study was to investigate the reliability and validity of the Turkish version of the International Cooperative Ataxia Rating Scale (ICARS) and Scale for the Assessment and Rating of Ataxia (SARA), which are widely used in ataxia evaluation in the context of other cerebellar diseases. This cross-sectional study included 80 MS patients with Kurtzke cerebellar functional system score (C-FSS) greater than zero and slight pyramidal involvement. The Expanded Disability Status Scale (EDSS), C-FSS, and Berg Balance Scale (BBS) were administered. SARA and ICARS were assessed on first admission by two physical therapists. Seven days later, the assessments were repeated in the same way to evaluate reliability. Intra-rater and inter-rater reliability were found to be high for both ICARS and SARA (p < 0.001). The Cronbach's α coefficients were 0.922 and 0.921 for SARA (reviewer 1 and reviewer 2, respectively) and 0.952 and 0.952 for ICARS (reviewer 1 and reviewer 2, respectively). There were no floor or ceiling effects determined for either scale except for item 17 of ICARS (p = 0.055). The EDSS total score had significant correlations with both SARA and ICARS (rho: 0.557 and 0.707, respectively). C-FSS had moderate correlation with SARA and high correlation with ICARS (rho: 0.469 and 0.653, respectively). BBS had no significant correlation with SARA or ICARS (rho: -0.048 and -0.008, respectively). According to the area under the curve (AUC) value, ICARS is the best scale to discriminate mild and moderate ataxia (AUC: 0.875). Factor analyses of ICARS showed that the rating results were determined by five different factors that did not coincide with the ICARS sub-scales. Our study demonstrated that ICARS and SARA are both reliable in MS patients with ataxia.
Although ICARS has some structural problems, it seems to be more valid given its high correlations with EDSS and C-FSS. SARA also can be preferred as a brief assessment. Copyright © 2017 Elsevier B.V. All rights reserved.
Mathias, Susan D; Bussel, James B; George, James N; McMillan, Robert; Okano, Gary J; Nichol, Janet L
2007-02-22
No validated disease-specific measures are available to assess health-related quality of life (HRQoL) in adult subjects with immune thrombocytopenic purpura (ITP). Therefore, we sought to develop and validate the ITP-Patient Assessment Questionnaire (ITP-PAQ) for adult subjects with ITP. Information from literature reviews, focus groups with subjects, and clinicians were used to develop 50 ITP-PAQ items. Factor analyses were conducted to develop the scale structure and reduce the number of items. The final 44-item ITP-PAQ, which includes ten scales [Symptoms (S), Bother-Physical Health (B), Fatigue/Sleep (FT), Activity (A), Fear (FR), Psychological Health (PH), Work (W), Social Activity (SA), Women's Reproductive Health (RH), and Overall (QoL)], was self-administered to adult ITP subjects at baseline and 7-10 days later. Test-retest reliability, internal consistency reliability, construct and known groups validity of the final ITP-PAQ were evaluated. Seventy-three subjects with ITP completed the questionnaire twice. Test-retest reliability, as measured by the intra-class correlation, ranged from 0.52-0.90. Internal consistency reliability was demonstrated with Cronbach's alpha for all scales above the acceptable level of 0.70 (range: 0.71-0.92), except for RH (0.66). Construct validity, assessed by correlating ITP-PAQ scales with established measures (Short Form-36 v.1, SF-36 and Center for Epidemiologic Studies Depression Scale, CES-D), was demonstrated through moderate correlations between the ITP-PAQ SA and SF-36 Social Function scales (r = 0.67), and between ITP-PAQ PH and SF-36 Mental Health Scales (r = 0.63). Moderate to strong inter-scale correlations were reported between ITP-PAQ scales and the CES-D, except for the RH scale. Known groups validity was evaluated by comparing mean scores for groups that differed clinically. 
Statistically significant differences (p < 0.01) were observed when subjects were categorized by treatment status [S, FT, B, A, PH, and QoL], perceived effectiveness of ITP treatment [S], and time elapsed since ITP diagnosis [PH]. Results provide preliminary evidence of the reliability and validity of the ITP-PAQ in adult subjects with ITP. Further work should be conducted to assess the responsiveness and to estimate the minimal clinically important difference of the ITP-PAQ to more fully understand the impact of ITP and its treatments on HRQoL.
Neuro-QoL health-related quality of life measurement system: Validation in Parkinson's disease.
Nowinski, Cindy J; Siderowf, Andrew; Simuni, Tanya; Wortman, Catherine; Moy, Claudia; Cella, David
2016-05-01
Neuro-QoL is a multidimensional patient-reported outcome measurement system assessing aspects of physical, mental, and social health identified by neurology patients and caregivers as important. One of the first neurology-specific patient-reported outcome measure systems created using modern test development methods, Neuro-QoL enables brief, yet precise, assessment and the ability to conduct both PD-specific and cross-disease comparisons. We present results of Neuro-QoL clinical validation using a sample of PD patients. A total of 120 PD patients recruited from academic medical centers were assessed at baseline, 1 week, and 6 months. Assessments included Neuro-QoL and general and PD-specific validity measures. Participants were 62% male and 95% white (average age = 66); H & Y stages were 1 (16%), 2 (61%), 3 (18%), and 4 (5%). Internal consistency and test-retest reliability of Neuro-QoL ranged from Cronbach's alphas = 0.81 to 0.94 with intraclass correlation coefficients = 0.66 to 0.80. Pearson's correlations between Neuro-QoL and legacy measures were generally moderate and in expected directions. UPDRS Part 2 was moderately correlated with Neuro-QoL Upper Extremity and Mobility, respectively (r's = -0.44; -0.59). Parkinson's Disease Questionnaire-39 and Neuro-QoL measures of similar constructs showed strong-to-moderate correlations (r's = 0.70-0.44). Neuro-QoL measures of fatigue, mobility, positive emotion, and emotional/behavioral control showed responsiveness to self-reported change. Neuro-QoL is valid for use in PD clinical research. Reliability for all but two measures is sufficient for group comparisons, with some evidence supporting responsiveness to change. Neuro-QoL possesses characteristics, such as brevity, flexibility in administration, and suitability for cross-disease comparisons, that may be advantageous to users in a variety of settings. © 2016 International Parkinson and Movement Disorder Society.
Reliability and validity of the Youth Leisure-time Sedentary Behavior Questionnaire (YLSBQ).
Cabanas-Sánchez, Verónica; Martínez-Gómez, David; Esteban-Cornejo, Irene; Castro-Piñero, José; Conde-Caveda, Julio; Veiga, Óscar L
2018-01-01
To develop a questionnaire able to assess time spent by youth in a wide range of leisure-time sedentary behaviors (SB) and evaluate its test-retest reliability and criterion validity. Cross-sectional observational. The reliability sample included 194 youth, aged 10-18 years, who completed the questionnaire twice, separated by a one-week interval. The validity study comprised 1207 participants aged 8-18 years. Participants wore an accelerometer for 7 consecutive days. The questionnaire was designed to assess the amount of time spent in twelve different SB during weekdays and weekends, separately. In order to avoid the usual phenomenon of time over-reporting, values were adjusted to the real available leisure-time (LT) for each participant. Reliability was assessed by using Intraclass Correlation Coefficients (ICC) and weighted (quadratic) kappa (k), and validity was assessed by using Pearson correlation and Bland-Altman plots. The reliability of the questionnaire showed moderate-to-substantial agreement for most (91%) of the items (k=0.43-0.74; ICC=0.41-0.79) with three items (4%) reaching almost perfect agreement (ICC=0.82-0.83). Only 'sitting and talking' evidenced fair-to-moderate reliability (k=0.27-0.39; ICC=0.34-0.46). The relationship between average sedentary time assessed by the questionnaire and accelerometry was moderate (r=0.36; p<0.001). Systematic biases were not found between questionnaire and accelerometer sedentary time for average day (r=0.05; p=0.11) but Bland-Altman plots suggested moderate discrepancies between the two methods of SB measurement (mean=19.86; limits of agreement=-280.04 to 319.76). The questionnaire showed moderate to good test-retest reliability and a moderate level of validity for assessing SB in youth, similar to or slightly better than those previously published in this population. Copyright © 2017 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
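The Bland-Altman figures quoted above (mean difference 19.86; limits of agreement -280.04 to 319.76) are consistent with the conventional definition, limits = mean difference ± 1.96 × SD of the differences, although the abstract does not state the multiplier. The sketch below assumes that convention. It shows how the limits would be computed from paired measurements and how the SD of the differences can be back-calculated from reported limits; function names and the paired values are hypothetical.

```python
from statistics import mean, stdev

Z_95 = 1.96  # assumed multiplier for 95% limits of agreement


def limits_of_agreement(method_a, method_b):
    """Return (bias, lower limit, upper limit) for paired measurements."""
    if len(method_a) != len(method_b):
        raise ValueError("both methods need the same number of measurements")
    differences = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(differences)
    spread = Z_95 * stdev(differences)
    return bias, bias - spread, bias + spread


def sd_from_limits(lower, upper):
    """Back-calculate the SD of the differences from reported limits."""
    return (upper - lower) / (2 * Z_95)


# Limits reported in the abstract: their midpoint reproduces the mean
# difference, and their width implies the SD of the differences.
lower, upper = -280.04, 319.76
print(round((lower + upper) / 2, 2), round(sd_from_limits(lower, upper), 2))

# Hypothetical paired values, e.g. questionnaire versus accelerometer.
questionnaire = [410.0, 380.0, 455.0, 300.0, 520.0]
accelerometer = [395.0, 420.0, 430.0, 310.0, 470.0]
print(limits_of_agreement(questionnaire, accelerometer))
```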
Page, Stephen J; Hade, Erinn; Persch, Andrew
2015-01-01
There remains a need for a quickly administered, stroke-specific, bedside measure of active wrist and finger movement for the expanding stroke population. The wrist stability and hand mobility scales of the upper extremity Fugl-Meyer Assessment (w/h UE FM) constitute a valid, reliable measure of paretic UE impairment in patients with active wrist and finger movement. The aim of this study was to determine performance on the w/h UE FM in a stable cohort of survivors of stroke with only palpable movement in their paretic wrist flexors. A single-center cohort study was conducted. Thirty-two individuals exhibiting stable, moderate upper extremity hemiparesis (15 male, 17 female; mean age=56.6 years, SD=10.1; mean time since stroke=4.6 years, SD=5.8) participated in the study, which was conducted at an outpatient rehabilitation clinic in the midwestern United States. The w/h UE FM and Action Research Arm Test (ARAT) were administered twice. Intraclass correlation coefficients (ICCs), Cronbach alpha, and ordinal alpha were computed to determine reliability, and Spearman rank correlation coefficients and Bland-Altman plots were computed to establish validity. Intraclass correlation coefficients for the w/h UE FM and ARAT were .95 and .99, respectively. The w/h UE FM intrarater reliability and internal consistency were greater than .80, and concurrent validity was greater than .70. This also was the first stroke rehabilitative study to apply ordinal alpha to examine internal consistency values, revealing w/h UE FM levels greater than .85. Concurrent validity findings were corroborated by Bland-Altman plots. It appears that the w/h UE FM is a promising tool to measure distal upper extremity movement in patients with little active paretic wrist and finger movement. This finding widens the segment of patients on whom the w/h UE FM can be effectively used and addresses a gap, as commonly used measures necessitate active distal upper extremity movement. 
© 2015 American Physical Therapy Association.
Viswanathan, Hema N; Mutebi, Alex; Milmont, Cassandra E; Gordon, Kenneth; Wilson, Hilary; Zhang, Hao; Klekotka, Paul A; Revicki, Dennis A; Augustin, Matthias; Kricorian, Gregory; Nirula, Ajay; Strober, Bruce
2017-09-01
The Psoriasis Symptom Inventory (PSI) is a patient-reported outcome instrument that measures the severity of psoriasis signs and symptoms. This study evaluated measurement properties of the PSI in patients with moderate to severe plaque psoriasis. This secondary analysis used pooled data from a phase 3 brodalumab clinical trial (AMAGINE-1). Outcome measures included the PSI, Psoriasis Area and Severity Index (PASI), static Physician's Global Assessment (sPGA), psoriasis-affected body surface area, 36-item Short-Form Health Survey version 2, and the Dermatology Life Quality Index (DLQI). The PSI was evaluated for dimensionality, item performance, reliability (internal consistency and test-retest), construct validity, ability to detect change, and agreement between PSI response and response measures based on the PASI, sPGA, and DLQI. Results supported unidimensionality, good item fit, ordered responses, and PSI scoring. The PSI demonstrated reliability: baseline Cronbach's alpha ≥ 0.92 and intraclass correlation coefficients ≥ 0.95. Correlations between PSI total score and DLQI item 1 (r = 0.86), DLQI symptoms and feelings (r = 0.87), and 36-item Short-Form Health Survey version 2 bodily pain (r = -0.61) supported convergent validity. PSI scores differed significantly (P < 0.001) among severity groups based on the PASI (< 12/≥ 12), sPGA (0-1/2-3/4-5), body surface area (< 5%/5%-10%/> 10%), and DLQI (≤ 5/> 5) at weeks 8 and 12. At week 12, the PSI detected significant changes in severity based on PASI responses (< 50/50- < 75/≥ 75) and sPGA (0-1/≥ 2), and showed good agreement (k ≥ 0.66) between PSI response and PASI, sPGA, and DLQI responses. The PSI demonstrated excellent validity, reliability, and ability to detect change in the severity of psoriasis signs and symptoms. Copyright © 2017 International Society for Pharmacoeconomics and Outcomes Research (ISPOR). Published by Elsevier Inc. All rights reserved.
Shiovitz-Ezra, Sharon; Leitsch, Sara; Graber, Jessica; Karraker, Amelia
2009-11-01
The National Social Life, Health, and Aging Project (NSHAP) measures seven indicators of quality of life (QoL) and psychological health. The measures used for happiness, self-esteem, depression, and loneliness are well established in the literature. Conversely, measures of anxiety, stress, and self-reported emotional health were modified for their use in this unique project. The purpose of this paper is to provide (a) an overview of NSHAP's QoL assessment and (b) evidence for the adequacy of the modified measures. First, we examined the psychometric properties of the modified measures. Second, the established QoL measures were used to examine the concurrent validity of the modified measures. Finally, gender- and age-group differences were examined for each modified measure. The anxiety index exhibited good internal reliability and concurrent validity. Consistent with the literature, a single-factor structure best fit the data. Stress was satisfactory in terms of concurrent validity but with only fair internal consistency. Self-reported emotional health exhibited good concurrent validity and moderate external validity. The modified indices used in NSHAP tended to exhibit good internal reliability and concurrent validity. These measures can confidently be used in the exploration of QoL and psychological health in later life and its many correlates.
Engels, Leopold G J B; Klinkenberg-Knol, Elly C; Carlsson, Jonas; Halling, Katarina
2010-08-17
The Quality of Life in Reflux and Dyspepsia (QOLRAD) questionnaire is one of the best-characterized disease-specific instruments that captures health-related problems and symptom-patterns in patients with gastroesophageal reflux disease (GERD). This paper reports the psychometric validation of a Dutch translation of the QOLRAD questionnaire in gastroenterology outpatients with GERD. Patients completed the QOLRAD questionnaire at visit 1 (baseline), visit 2 (after 2, 4 or 8 weeks of acute treatment with esomeprazole 40 mg once daily), and visit 4 (after 6 months with on-demand esomeprazole 40 mg once daily or continuous esomeprazole 20 mg once daily). Symptoms were assessed at each visit, and patient satisfaction was assessed at visits 2 and 4. Of the 1166 patients entered in the study, 97.3% had moderate or severe heartburn and 55.5% had moderate or severe regurgitation at baseline. At visit 2, symptoms of heartburn and regurgitation were mild or absent in 96.7% and 97.7%, respectively, and 95.3% of patients reported being satisfied with the treatment. The internal consistency and reliability of the QOLRAD questionnaire (range: 0.83-0.92) supported construct validity. Convergent validity was moderate to low. Known-groups validity was confirmed by a negative correlation between the QOLRAD score and clinician-assessed severity of GERD symptoms. Effect sizes (1.15-1.93) and standardized response means (1.17-1.86) showed good responsiveness to change. GERD symptoms had a negative impact on patients' lives. The psychometric characteristics of the Dutch translation of the QOLRAD questionnaire were found to be satisfactory, with good reliability and responsiveness to change, although convergent validity was at best moderate.
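The responsiveness indices named in the abstract above can be illustrated with a minimal sketch. It assumes the conventional definitions (effect size = mean change divided by the baseline standard deviation; standardized response mean = mean change divided by the standard deviation of the change scores). The abstract does not state which formulas the authors used, and the function name and sample scores are hypothetical.

```python
from statistics import mean, stdev

def responsiveness(baseline, followup):
    """Return (effect size, standardized response mean) for paired scores.

    Assumed conventional definitions, not taken from the abstract:
      ES  = mean(change) / SD(baseline)
      SRM = mean(change) / SD(change)
    """
    change = [f - b for b, f in zip(baseline, followup)]
    es = mean(change) / stdev(baseline)
    srm = mean(change) / stdev(change)
    return es, srm

# Hypothetical scores for four patients at baseline and follow-up.
es, srm = responsiveness([2, 3, 4, 5], [4, 4, 6, 8])
print(round(es, 3), round(srm, 3))
```

Both indices express change in standard deviation units, which is why values well above 1, such as those reported, are read as large responsiveness.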
van Veelen, G A; Schweitzer, K J; van der Vaart, C H
2013-11-01
To evaluate the reliability of measurements of the levator hiatus and levator-urethra gap (LUG) using three/four-dimensional (3D/4D) transperineal ultrasound in women during their first pregnancy and 6 months postpartum, and to assess the learning process for these measurements. An inexperienced observer was taught to perform measurements of the levator hiatus and LUG by an experienced observer. After training, 3D/4D ultrasound volume datasets of 40 women in the first trimester were analyzed by these two observers. Another training session then took place and both observers repeated the analyses of the same volume datasets. Finally, analyses of 40 volume datasets of the women 6 months postpartum were performed by both observers. Intra- and interobserver reliability were determined by intraclass correlation coefficients (ICC) with 95% CIs. For levator hiatal measurements, in the women during their first pregnancy the interobserver reliability was substantial to almost perfect after both the first and second training session (ICC, 0.62-0.83 and 0.71-0.89, respectively, for anteroposterior diameter, transverse diameter and area at rest, on contraction and on Valsalva) and the intraobserver reliability was substantial to almost perfect for both observers. For these measurements performed once the women had delivered, interobserver reliability was moderate to almost perfect. For LUG measurements performed during pregnancy, interobserver reliability was slight to moderate after the first training session (ICC, 0.14-0.54), but improved after the second training session (ICC, 0.38-0.71), and intraobserver reliability was moderate to substantial for the experienced observer and slight to moderate for the inexperienced observer. For these measurements performed when the women had delivered, interobserver reliability was fair to moderate. The levator hiatus and LUG can be measured reliably using 3D/4D ultrasound in primigravid and primiparous women. 
The technique to measure dimensions of the levator hiatus requires limited teaching, but LUG measurements are more difficult and require more extensive training. Copyright © 2013 ISUOG. Published by John Wiley & Sons Ltd.
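As an illustration of how an interobserver ICC of this kind can be computed, the sketch below implements the two-way random-effects, absolute-agreement, single-measurement form, often labelled ICC(2,1). This choice is an assumption: the abstract does not say which ICC model was used, and the function name and ratings are hypothetical.

```python
def icc_2_1(ratings):
    """ICC(2,1) from a table with one row per subject, one column per observer.

    Assumed model: two-way random effects, absolute agreement, single rating.
    """
    n = len(ratings)        # number of subjects
    k = len(ratings[0])     # number of observers
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(col) / n for col in zip(*ratings)]

    # Two-way ANOVA sums of squares (no replication).
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_err = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_err = ss_err / ((n - 1) * (k - 1))

    return (ms_rows - ms_err) / (
        ms_rows + (k - 1) * ms_err + k * (ms_cols - ms_err) / n
    )

# Hypothetical example: the second observer reads one unit higher throughout.
print(round(icc_2_1([[1, 2], [2, 3], [3, 4]]), 3))
```

Because this form measures absolute agreement, a constant offset between observers lowers the coefficient even when their rankings of subjects are identical.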
Satisfaction With Appearance Scale-SWAP: Adaptation and validation for Brazilian burn victims.
Caltran, Marina P; Freitas, Noélle O; Dantas, Rosana A S; Farina, Jayme Adriano; Rossi, Lidia A
2016-09-01
This methodological study aimed to adapt the Satisfaction with Appearance Scale (SWAP) into Brazilian Portuguese and to assess the validity, reliability and dimensionality of the adapted version in a sample of Brazilian burn victims. We carried out the adaptation process according to the international literature. Construct validity was assessed by correlating the adapted version of SWAP scores with depression (Beck Depression Index), self-esteem (Rosenberg Self-Esteem Scale), health-related quality of life (Short Form Health Survey-36) and health status of burn victims (Burn Specific Health Scale-Revised), and with gender, total body surface area burned, and visibility of the scars. We tested dimensionality using Exploratory Factor Analysis (EFA) and reliability by means of Cronbach's alpha. Participants were 106 adult burn patients. The correlations between the Brazilian version of the SWAP scores and the correlated construct measures varied from moderate to strong (r=.30-.77). The participants who perceived that their burn sequelae were visible reported being more dissatisfied with their body image than the participants who answered that their scars would not be visible (p<.001). Cronbach's alpha for the adapted version was 0.88 and the item-total correlation varied from moderate to strong (r=.35-.73). The EFA resulted in three factors with a total explained variance percentage of 63.2%. The Brazilian version of the SWAP was valid and reliable for use with Brazilian burn victims. Copyright © 2016 Elsevier Ltd and ISBI. All rights reserved.
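Cronbach's alpha, the internal consistency statistic reported in this and several neighbouring abstracts, can be sketched in a few lines. The version below uses the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores); the function name and the response data are hypothetical and are not drawn from any of the studies.

```python
from statistics import variance

def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    k = len(scores[0])                        # number of items
    items = list(zip(*scores))                # transpose: one tuple per item
    item_var = sum(variance(col) for col in items)
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - item_var / total_var)

# Hypothetical responses from four people on a two-item scale.
print(round(cronbach_alpha([[1, 2], [2, 1], [3, 3], [4, 5]]), 3))
```

Alpha rises as the items covary more strongly relative to their individual variances, which is why thresholds such as 0.70 or 0.80 are commonly quoted as evidence of adequate internal consistency.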
Sani, Gabriele; Vöhringer, Paul A; Barroilhet, Sergio A; Koukopoulos, Alexia E; Ghaemi, S Nassir
2018-05-01
It has been proposed that the broad major depressive disorder (MDD) construct is heterogenous. Koukopoulos has provided diagnostic criteria for an important subtype within that construct, "mixed depression" (MxD), which encompasses clinical pictures characterized by marked psychomotor or inner excitation and rage/anger, along with severe depression. This study provides psychometric validation for the first rating scale specifically designed to assess MxD symptoms cross-sectionally, the Koukopoulos Mixed Depression Rating Scale (KMDRS). 350 patients from the international mood network (IMN) completed three rating scales: the KMDRS, Montgomery-Asberg Depression Rating Scale (MADRS) and Young Mania Rating Scale (YMRS). KMDRS' psychometric properties assessed included Cronbach's alpha, inter-rater reliability, factor analysis, predictive validity, and Receiver Operator Curve analysis. Internal consistency (Cronbach's alpha = 0.76; 95% CI 0.57, 0.94) and interrater reliability (kappa = 0.73) were adequate. Confirmatory factor analysis identified 2 components: anger and psychomotor excitation (80% of total variance). Good predictive validity was seen (C-statistic = 0.82 95% CI 0.68, 0.93). Severity cut-off scores identified were as follows: none (0-4), possible (5-9), mild (10-15), moderate (16-20) and severe (> 21) MxD. Non DSM-based diagnosis of MxD may pose some difficulties in the initial use and interpretation of the scoring of the scale. Moreover, the cross-sectional nature of the evaluation does not verify the long-term stability of the scale. KMDRS was a reliable and valid instrument to assess MxD symptoms. Copyright © 2018 Elsevier B.V. All rights reserved.
A systematic review of the measurement properties of the Body Image Scale (BIS) in cancer patients.
Melissant, Heleen C; Neijenhuijs, Koen I; Jansen, Femke; Aaronson, Neil K; Groenvold, Mogens; Holzner, Bernhard; Terwee, Caroline B; van Uden-Kraan, Cornelia F; Cuijpers, Pim; Verdonck-de Leeuw, Irma M
2018-06-01
Body image is acknowledged as an important aspect of health-related quality of life in cancer patients. The Body Image Scale (BIS) is a patient-reported outcome measure (PROM) to evaluate body image in cancer patients. The aim of this study was to systematically review measurement properties of the BIS among cancer patients. A search in Embase, MEDLINE, PsycINFO, and Web of Science was performed to identify studies that investigated measurement properties of the BIS (Prospero ID 42017057237). Study quality was assessed (excellent, good, fair, poor), and data were extracted and analyzed according to the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) methodology on structural validity, internal consistency, reliability, measurement error, hypothesis testing for construct validity, and responsiveness. Evidence was categorized into sufficient, insufficient, inconsistent, or indeterminate. Nine studies were included. Evidence was sufficient for structural validity (one factor solution), internal consistency (α = 0.86-0.96), and reliability (r > 0.70); indeterminate for measurement error (information on minimal important change lacked) and responsiveness (increasing body image disturbance in only one study); and inconsistent for hypothesis testing (conflicting results). Quality of the evidence was moderate to low. No studies reported on cross-cultural validity. The BIS is a PROM with good structural validity, internal consistency, and test-retest reliability, but good quality studies on the other measurement properties are needed to optimize evidence. It is recommended to include a wider variety of cancer diagnoses and treatment modalities in these future studies.
Sakai, Naomi; Chu, Shin Ying; Mori, Koichi; Yaruss, J Scott
2017-03-01
This study evaluates the psychometric performance of the Japanese version of the Overall Assessment of the Speaker's Experience of Stuttering for Adults (OASES-A), a comprehensive assessment tool of individuals who stutter. The OASES-A-J was administered to 200 adults who stutter in Japan. All respondents also evaluated their own speech (SA scale), satisfaction of their own speech (SS scale) and the Japanese translation version of the Modified Erickson Communication Attitude scale (S-24). The test-retest reliability and internal consistency of the OASES-A-J were assessed. To examine the concurrent validity of the questionnaire, Pearson correlation was conducted between the OASES-A-J Impact score and the S-24 scale, SA scale and SS scale. In addition, Pearson correlation among the impact scores of each section and total were calculated to examine the construct validity. The OASES-A-J showed a good test-retest reliability (r=0.81-0.95) and high internal consistency (α>0.80). Concurrent validity was moderate to high (0.55-0.75). Construct validity was confirmed by the relation between internal consistency in each section and correlation among sections' impact scores. Japanese adults showed higher negative impact for 'General Information', 'Reactions to Stuttering' and 'Quality of Life' sections. These results suggest that the OASES-A-J is a reliable and valid instrument to measure the impact of stuttering on Japanese adults who stutter. The OASES-A-J could be used as a clinical tool in Japanese stuttering field. Copyright © 2016 Elsevier Inc. All rights reserved.
Psychometric Properties of the Persian Version of the Tinnitus Handicap Inventory (THI-P)
Jalali, Mir Mohammad; Soleimani, Robabeh; Fallahi, Mahnaz; Aghajanpour, Mohammad; Elahi, Masoumeh
2015-01-01
Introduction: Tinnitus can have a significant effect on an individual’s quality of life, and is very difficult to quantify. One of the most popular questionnaires used in this area is the Tinnitus Handicap Inventory (THI). The aim of this study was to determine the reliability and validity of a Persian translation of the Tinnitus Handicap Inventory (THI-P). Materials and Methods: This prospective clinical study was performed in the Otolaryngology Department of Guilan University of Medical Sciences, Iran. A total of 102 patients aged 23–80 years with tinnitus completed the THI-P. The patients were instructed to complete the Beck Depression Inventory (BDI) and the State-Trait Anxiety Inventory (STAI). Audiometry was performed. Eighty-five patients were asked to complete the THI-P for a second time 7–10 days after the initial interview. We assessed test–retest reliability and internal reliability of the THI-P. Validity was assessed by analyzing the THI-P of patients according to their age, tinnitus duration and psychological distress (BDI and STAI). A factor analysis was computed to verify whether three subscales (functional, emotional, and catastrophic) represented three distinct variables. Results: Test–retest correlation coefficient scores were highly significant. The THI-P and its subscales showed good internal consistency reliability (α = 0.80 to 0.96). High-to-moderate correlations were observed between THI-P and psychological distress and tinnitus symptom ratings. A confirmatory factor analysis failed to validate the three subscales of THI, and high inter-correlations found between the subscales question whether they represent three distinct factors. Conclusion: The results suggest that the THI-P is a reliable and valid tool which can be used in a clinical setting to quantify the impact of tinnitus on the quality of life of Iranian patients. PMID:25938079
2012-01-01
Background: This study aimed to investigate the reliability and validity of the Iranian version of the Pediatric Quality of Life Inventory™ 4.0 (PedsQL™ 4.0) Generic Core Scales in children. Methods: A standard forward and backward translation procedure was used to translate the US English version of the PedsQL™ 4.0 Generic Core Scales for children into the Iranian language (Persian). The Iranian version of the PedsQL™ 4.0 Generic Core Scales was completed by 503 healthy and 22 chronically ill children aged 8-12 years and their parents. The reliability was evaluated using internal consistency. Known-groups discriminant comparisons were made, and exploratory factor analysis (EFA) and confirmatory factor analysis (CFA) were conducted. Results: The internal consistency, as measured by Cronbach's alpha coefficients, exceeded the minimum reliability standard of 0.70. All monotrait-multimethod correlations were higher than multitrait-multimethod correlations. The intraclass correlation coefficients (ICC) between the children self-report and parent proxy-reports showed moderate to high agreement. Exploratory factor analysis extracted six factors from the PedsQL™ 4.0 for both self and proxy reports, accounting for 47.9% and 54.8% of total variance, respectively. The results of the confirmatory factor analysis for 6-factor models for both self-report and proxy-report indicated acceptable fit for the proposed models. Regarding health status, as hypothesized from previous studies, healthy children reported significantly higher health-related quality of life than those with chronic illnesses. Conclusions: The findings support the initial reliability and validity of the Iranian version of the PedsQL™ 4.0 as a generic instrument to measure health-related quality of life of children in Iran. PMID:22221765
Amiri, Parisa; Eslamian, Ghazaleh; Mirmiran, Parvin; Shiva, Niloofar; Jafarabadi, Mohammad Asghari; Azizi, Fereidoun
2012-01-05
This study aimed to investigate the reliability and validity of the Iranian version of the Pediatric Quality of Life Inventory™ 4.0 (PedsQL™ 4.0) Generic Core Scales in children. A standard forward and backward translation procedure was used to translate the US English version of the PedsQL™ 4.0 Generic Core Scales for children into the Iranian language (Persian). The Iranian version of the PedsQL™ 4.0 Generic Core Scales was completed by 503 healthy and 22 chronically ill children aged 8-12 years and their parents. The reliability was evaluated using internal consistency. Known-groups discriminant comparisons were made, and exploratory factor analysis (EFA) and confirmatory factor analysis (CFA) were conducted. The internal consistency, as measured by Cronbach's alpha coefficients, exceeded the minimum reliability standard of 0.70. All monotrait-multimethod correlations were higher than multitrait-multimethod correlations. The intraclass correlation coefficients (ICC) between the children self-report and parent proxy-reports showed moderate to high agreement. Exploratory factor analysis extracted six factors from the PedsQL™ 4.0 for both self and proxy reports, accounting for 47.9% and 54.8% of total variance, respectively. The results of the confirmatory factor analysis for 6-factor models for both self-report and proxy-report indicated acceptable fit for the proposed models. Regarding health status, as hypothesized from previous studies, healthy children reported significantly higher health-related quality of life than those with chronic illnesses. The findings support the initial reliability and validity of the Iranian version of the PedsQL™ 4.0 as a generic instrument to measure health-related quality of life of children in Iran.
Cheong, Sau Kuan; Lang, Cathryne P; Hemphill, Sheryl A; Johnston, Leanne M
2017-06-01
To evaluate the preliminary validity and reliability of the myTREEHOUSE Self-Concept Assessment for children with cerebral palsy (CP) aged 8 to 12 years. The myTREEHOUSE Self-Concept Assessment includes 26 items divided into eight domains, assessed across three Performance Perspectives (Personal, Social, and Perceived) and an additional Importance Rating. Face and content validity was assessed by semi-structured interviews with seven expert professionals regarding the assessment construct, content, and clinical utility. Reliability was assessed with 50 children aged 8 to 12 years with CP (29 males, 21 females; mean age 10y 2mo; Gross Motor Function Classification System [GMFCS] level I=35, II=8, III=5, IV=1; mean Wechsler Intelligence Scale for Children - Fourth Edition [WISC-IV]=104), whose data was used to calculate internal consistency of the scale, and a subset of 35 children (20 males, 15 females; mean age 10y 5mo; GMFCS level I=26, II=4, III=4, IV=1; mean WISC-IV=103) who participated in test-retest reliability within 14 to 28 days. Face and content validity was supported by positive expert feedback, with only minor adjustments suggested to clarify the wording of some items. After these amendments, strong internal consistency (Cronbach's α 0.84-0.91) and moderate to good test-retest reliability (intraclass correlation coefficient 0.64-0.75) was found for each component. The myTREEHOUSE Self-Concept Assessment is a valid and reliable assessment of self-concept for children with CP aged 8 to 12 years. © 2017 Mac Keith Press.
Cross-Cultural Adaptation and Validation of the Italian Version of SWAL-QOL.
Ginocchio, Daniela; Alfonsi, Enrico; Mozzanica, Francesco; Accornero, Anna Rosa; Bergonzoni, Antonella; Chiarello, Giulia; De Luca, Nicoletta; Farneti, Daniele; Marilia, Simonelli; Calcagno, Paola; Turroni, Valentina; Schindler, Antonio
2016-10-01
The aim of the study was to evaluate the reliability and validity of the Italian SWAL-QOL (I-SWAL-QOL). The study consisted of five phases: item generation, reliability analysis, normative data generation, validity analysis, and responsiveness analysis. The item generation phase followed the five-step, cross-cultural, adaptation process of translation and back-translation. A group of 92 dysphagic patients was enrolled for the internal consistency analysis. Seventy-eight patients completed the I-SWAL-QOL twice, 2 weeks apart, for test-retest reliability analysis. A group of 200 asymptomatic subjects completed the I-SWAL-QOL for normative data generation. I-SWAL-QOL scores obtained by both the group of dysphagic subjects and asymptomatic ones were compared for validity analysis. I-SWAL-QOL scores were correlated with SF-36 scores in 67 patients with dysphagia for concurrent validity analysis. Finally, I-SWAL-QOL scores obtained in a group of 30 dysphagic patients before and after successful rehabilitation treatment were compared for responsiveness analysis. All the enrolled patients managed to complete the I-SWAL-QOL without needing any assistance, within 20 min. Internal consistency was acceptable for all I-SWAL-QOL subscales (α > 0.70). Test-retest reliability was also satisfactory for all subscales (ICC > 0.7). A significant difference between the dysphagic group and the control group was found in all I-SWAL-QOL subscales (p < 0.05). Mild to moderate correlations between I-SWAL-QOL and SF-36 subscales were observed. I-SWAL-QOL scores obtained in the pre-treatment condition were significantly lower than those obtained after swallowing rehabilitation. I-SWAL-QOL is reliable, valid, responsive to changes in QOL, and recommended for clinical practice and outcome research.
Vermeulen, Margit I; Tromp, Fred; Zuithoff, Nicolaas P A; Pieters, Ron H M; Damoiseaux, Roger A M J; Kuyvenhoven, Marijke M
2014-12-01
Background: Historically, semi-structured interviews (SSI) have been the core of the Dutch selection for postgraduate general practice (GP) training. This paper describes a pilot study on a newly designed competency-based selection procedure that assesses whether candidates have the competencies that are required to complete GP training. The objective was to explore reliability and validity aspects of the instruments developed. The new selection procedure, comprising the National GP Knowledge Test (LHK), a situational judgement test (SJT), a patterned behaviour descriptive interview (PBDI) and a simulated encounter (SIM), was piloted alongside the current procedure. Forty-seven candidates volunteered in both procedures. Admission decision was based on the results of the current procedure. Study participants hardly differed from the other candidates. The mean scores of the candidates on the LHK and SJT were 21.9% (SD 8.7) and 83.8% (SD 3.1), respectively. The mean self-reported competency scores (PBDI) were higher than the observed competencies (SIM): 3.7 (SD 0.5) and 2.9 (SD 0.6), respectively. Content-related competencies showed low correlations with one another when measured with different instruments, whereas more diverse competencies measured by a single instrument showed strong to moderate correlations. Moreover, a moderate correlation between LHK and SJT was found. The internal consistencies (intraclass correlation, ICC) of LHK and SJT were poor while the ICC of PBDI and SIM showed acceptable levels of reliability. Findings on content validity and reliability of these new instruments are promising for realizing a competency-based procedure. Further development of the instruments and research on predictive validity should be pursued.
Cancela Carral, José María; Lago Ballesteros, Joaquín; Ayán Pérez, Carlos; Mosquera Morono, María Belén
2016-01-01
To analyse the reliability and validity of the Weekly Activity Checklist (WAC), the One Week Recall (OWR), and the Godin-Shephard Leisure Time Exercise Questionnaire (GLTEQ) in Spanish adolescents. A total of 78 adolescents wore a pedometer for one week, filled out the questionnaires at the end of this period and underwent a test to estimate their maximal oxygen consumption (VO2max). The reliability of the questionnaires was determined by means of a factor analysis. Convergent validity was obtained by comparing the questionnaires' scores against the amount of physical activity quantified by the pedometer and the VO2max reported. The questionnaires showed a weak internal consistency (WAC: α=0.59-0.78; OWR: α=0.53-0.73; GLTEQ: α=0.60). Moderate statistically significant correlations were found between the pedometer and the WAC (r=0.69; p <0.01) and the OWR (r=0.42; p <0.01), while a low statistically significant correlation was found for the GLTEQ (r=0.36; p=0.01). The estimated VO2max showed a low level of association with the WAC results (r=0.30; p <0.05), and the OWR results (r=0.29; p <0.05). When classifying the participants as active or inactive, the level of agreement with the pedometer was moderate for the WAC (k=0.46) and the OWR (r=0.44), and slight for the GLTEQ (r=0.20). Of the three questionnaires analysed, the WAC showed the best psychometric performance as it was the only one with respectable convergent validity, while sharing low reliability with the OWR and the GLTEQ. Copyright © 2016 SESPAS. Publicado por Elsevier España, S.L.U. All rights reserved.
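The active/inactive agreement figures in the abstract above read like Cohen's kappa values, although the abstract labels them inconsistently (k and r). Assuming kappa is what was meant, a minimal sketch of the statistic follows; the function name and the classifications are hypothetical.

```python
def cohens_kappa(a, b):
    """Cohen's kappa for two equal-length lists of categorical labels."""
    n = len(a)
    labels = set(a) | set(b)
    # Observed proportion of cases on which the two methods agree.
    p_obs = sum(x == y for x, y in zip(a, b)) / n
    # Agreement expected by chance, from each method's marginal proportions.
    p_exp = sum((a.count(lab) / n) * (b.count(lab) / n) for lab in labels)
    return (p_obs - p_exp) / (1 - p_exp)

# Hypothetical classification (1 = active, 0 = inactive) for four adolescents.
questionnaire = [1, 1, 0, 0]
pedometer = [1, 0, 0, 0]
print(cohens_kappa(questionnaire, pedometer))
```

Kappa corrects raw agreement for the agreement expected by chance, so two methods that agree on 75% of cases, as in this toy example, can still show only moderate kappa.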
Saloheimo, T; González, S A; Erkkola, M; Milauskas, D M; Meisel, J D; Champagne, C M; Tudor-Locke, C; Sarmiento, O; Katzmarzyk, P T; Fogelholm, M
2015-01-01
Objective: The main aim of this study was to assess the reliability and validity of a food frequency questionnaire with 23 food groups (I-FFQ) among a sample of 9–11-year-old children from three different countries that differ in economic development and income distribution, and to assess differences between country sites. Furthermore, we assessed factors associated with I-FFQ's performance. Methods: This was an ancillary study of the International Study of Childhood Obesity, Lifestyle and the Environment. Reliability (n=321) and validity (n=282) components of this study had the same participants. Participation rates were 95% and 70%, respectively. Participants completed two I-FFQs with a mean interval of 4.9 weeks to assess reliability. A 3-day pre-coded food diary (PFD) was used as the reference method in the validity analyses. Wilcoxon signed-rank tests, intraclass correlation coefficients and cross-classifications were used to assess the reliability of I-FFQ. Spearman correlation coefficients, percentage difference and cross-classifications were used to assess the validity of I-FFQ. A logistic regression model was used to assess the relation of selected variables with the estimate of validity. Analyses based on information in the PFDs were performed to assess how participants interpreted food groups. Results: Reliability correlation coefficients ranged from 0.37 to 0.78 and gross misclassification for all food groups was <5%. Validity correlation coefficients were below 0.5 for 22/23 food groups, and they differed among country sites. For validity, gross misclassification was <5% for 22/23 food groups. Over- or underestimation did not appear for 19/23 food groups. Logistic regression showed that country of participation and parental education were associated (P⩽0.05) with the validity of I-FFQ. Analyses of children's interpretation of food groups suggested that the meaning of most food groups was understood by the children.
Conclusion: I-FFQ is a moderately reliable method and its validity ranged from low to moderate, depending on food group and country site. PMID:27152180
A Psychometric Evaluation of the Threadgold Communication Tool for Persons with Dementia
Strøm, Benedicte Sørensen; Engedal, Knut; Grov, Ellen-Karine
2016-01-01
Background: The objective of this study was to investigate the psychometric properties of the Threadgold Communication Tool (TCT). Method: Internal consistency reliability was measured using Cronbach's α coefficient and inter-item correlation. Test-retest was performed to examine the instrument's stability. Exploratory principal component analysis (PCA) with oblimin rotation was carried out to evaluate construct validity. Finally, the score on each item of the TCT was correlated with the person's Mini Mental State Examination (MMSE) and Barthel Index of activities of daily living scores. Results: A total of 51 persons participated, with a mean age of 86.7 (SD 6.6) years, of whom 46 were women with moderate-to-severe dementia [mean MMSE score 7.5 (SD 6.7)]. There were two measurement points 2 weeks apart. The results showed a satisfactory level for internal consistency and a high test-retest reliability (r = 0.76). The corrected item-total correlation ranged between 0.50 and 0.87, and a two-factor structure was revealed at the PCA. ‘Vocalizing’ seemed to measure another aspect of communication and was the only item which was negatively loaded. Conclusion: Despite the low sample size in this study, the results revealed the TCT as a reliable and valid instrument, suitable for measuring communication among people with dementia. We suggest clarifying the understanding of ‘vocalizing’ before considering removing it from the scale. PMID:27239188
Chinese adaptation and validation of the patellofemoral pain severity scale.
Cheung, Roy T H; Ngai, Shirley P C; Lam, Priscillia L; Chiu, Joseph K W; Fung, Eric Y H
2013-05-01
This study validated the Patellofemoral Pain Severity Scale translated into Chinese. The Chinese Patellofemoral Pain Severity Scale was translated from the original English version following standard forward and backward translation procedures recommended by the International Society for Pharmacoeconomics and Outcomes Research. The survey was then conducted in clinical settings by a questionnaire comprising the Chinese Patellofemoral Pain Severity Scale, Kujala Scale and Western Ontario and McMaster Universities (WOMAC) Osteoarthritis Index. Eighty-four Chinese reading patients with patellofemoral pain were recruited from physical therapy clinics. Internal consistency of the translated instrument was measured by Cronbach alpha. Convergent validity was examined by Spearman rank correlation coefficient (rho) tests by comparing its score with the validated Chinese version of the Kujala Scale and the WOMAC Osteoarthritis Index while the test-retest reliability was evaluated by administering the questionnaires twice. Cronbach alpha values of individual questions and their overall value were above 0.85. Strong association was found between the Chinese Patellofemoral Pain Severity Scale and the Kujala Scale (rho = -0.72, p < 0.001). Moderate correlation was also found between Chinese Patellofemoral Pain Severity Scale with the WOMAC Osteoarthritis Index (rho = 0.63, p < 0.001). Excellent test-retest reliability (Intraclass correlation coefficient = 0.98) was demonstrated. The Chinese translated version of the Patellofemoral Pain Severity Scale is a reliable and valid instrument for patients with patellofemoral pain.
2011-01-01
Background Current methodological guidelines provide advice about the assessment of sub-group analysis within RCTs, but do not specify explicit criteria for assessment. Our objective was to provide researchers with a set of criteria that will facilitate the grading of evidence for moderators, in systematic reviews. Method We developed a set of criteria from methodological manuscripts (n = 18) using snowballing technique, and electronic database searches. Criteria were reviewed by an international Delphi panel (n = 21), comprising authors who have published methodological papers in this area, and researchers who have been active in the study of sub-group analysis in RCTs. We used the Research ANd Development/University of California Los Angeles appropriateness method to assess consensus on the quantitative data. Free responses were coded for consensus and disagreement. In a subsequent round additional criteria were extracted from the Cochrane Reviewers' Handbook, and the process was repeated. Results The recommendations are that meta-analysts report both confirmatory and exploratory findings for sub-groups analysis. Confirmatory findings must only come from studies in which a specific theory/evidence based a-priori statement is made. Exploratory findings may be used to inform future/subsequent trials. However, for inclusion in the meta-analysis of moderators, the following additional criteria should be applied to each study: Baseline factors should be measured prior to randomisation, measurement of baseline factors should be of adequate reliability and validity, and a specific test of the interaction between baseline factors and interventions must be presented. Conclusions There is consensus from a group of 21 international experts that methodological criteria to assess moderators within systematic reviews of RCTs is both timely and necessary. 
The consensus from the experts resulted in five criteria divided into two groups when synthesising evidence: confirmatory findings to support hypotheses about moderators and exploratory findings to inform future research. These recommendations are discussed in reference to previous recommendations for evaluating and reporting moderator studies. PMID:21281501
Hamre, Charlotta; Botolfsen, Pernille; Tangen, Gro Gujord; Helbostad, Jorunn L
2017-04-20
The Balance Evaluation Systems Test (BESTest) was developed to assess underlying systems for balance control in order to be able to individually tailor rehabilitation interventions to people with balance disorders. A short form, the Mini-BESTest, was developed as a screening test. The study aimed to assess interrater and test-retest reliability of the Norwegian version of the BESTest and the Mini-BESTest in community-dwelling people with increased risk of falling and to assess concurrent validity with the Falls Efficacy Scale-International (FES-I). It was an observational study with a cross-sectional design. Forty-two persons with increased risk of falling (elderly over 65 years of age, persons with a history of stroke or Multiple Sclerosis) were assessed twice by two raters. Relative reliability was analysed with Intraclass Correlation Coefficient (ICC), and absolute reliability with standard error of measurement (SEM) and smallest detectable change (SDC). Concurrent validity was assessed against the FES-I using Spearman's rho. The BESTest showed very good interrater reliability (ICC = 0.98, SEM = 1.79, SDC95 = 5.0) and test-retest reliability (rater A/rater B: ICC = 0.89/0.89, SEM = 3.9/4.3, SDC95 = 10.8/11.8). The Mini-BESTest also showed very good interrater reliability (ICC = 0.95, SEM = 1.19, SDC95 = 3.3) and test-retest reliability (rater A/rater B: ICC = 0.85/0.84, SEM = 1.8/1.9, SDC95 = 4.9/5.2). The correlations were moderate between the FES-I and both the BESTest and the Mini-BESTest (Spearman's rho = -0.51 and -0.50, p < 0.01). The BESTest and its short form, the Mini-BESTest, showed very good interrater and test-retest reliability when assessed in a heterogeneous sample of people with increased risk of falling. The concurrent validity measured against the FES-I showed moderate correlation. The results are comparable with earlier studies and indicate that the Norwegian versions can be used in daily clinic and in research.
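The abstract above does not state how the smallest detectable change was derived from the SEM. A minimal sketch, assuming the conventional formula SDC at the 95% level = 1.96 × √2 × SEM, reproduces the reported interrater values when applied to the rounded SEMs (the test-retest values agree to within rounding). The function name is an illustrative assumption.

```python
import math


def sdc95(sem):
    """Smallest detectable change at the 95% level.

    Assumes the conventional formula 1.96 * sqrt(2) * SEM; the abstract
    itself does not say which formula the authors used.
    """
    return 1.96 * math.sqrt(2) * sem


# SEM values as reported (rounded) in the abstract.
bestest_interrater = round(sdc95(1.79), 1)
bestest_retest_rater_a = round(sdc95(3.9), 1)
mini_bestest_interrater = round(sdc95(1.19), 1)
```

The factor √2 reflects that a change score is the difference of two measurements, each carrying its own measurement error.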
The German Version of the Herth Hope Index (HHI-D): Development and Psychometric Properties.
Geiser, Franziska; Zajackowski, Katharina; Conrad, Rupert; Imbierowicz, Katrin; Wegener, Ingo; Herth, Kaye A; Urbach, Anne Sarah
2015-01-01
The importance of hope is evident in clinical oncological care. Hope is associated with psychological and also physical functioning. However, there is still a dearth of empirical research on hope as a multidimensional concept. The Herth Hope Index is a reliable and valid instrument for the measurement of hope and is available in many languages. Until now no authorized German translation has been published and validated. After translation, the questionnaire was completed by 192 patients with different tumor entities in radiation therapy. Reliability, concurrent validity, and factor structure of the questionnaire were determined. Correlations were high with depression and anxiety as well as optimism and pessimism. As expected, correlations with coping styles were moderate. Internal consistency and test-retest reliability were satisfactory. We could not replicate the original 3-factor model. Application of the scree plot criterion in an exploratory factor analysis resulted in a single-factor structure. The Herth Hope Index - German Version (HHI-D) is a short, reliable, and valid instrument for the assessment of hope in patient populations. We recommend using only the HHI-D total score until further research gives more insights into possible factorial solutions and subscales. © 2015 S. Karger GmbH, Freiburg.
2011-01-01
Background Although measures of knowledge translation and exchange (KTE) effectiveness based on the theory of planned behavior (TPB) have been used among patients and providers, no measure has been developed for use among health system policymakers and stakeholders. A tool that measures the intention to use research evidence in policymaking could assist researchers in evaluating the effectiveness of KTE strategies that aim to support evidence-informed health system decision-making. Therefore, we developed a 15-item tool to measure four TPB constructs (intention, attitude, subjective norm and perceived control) and assessed its face validity through key informant interviews. Methods We carried out a reliability study to assess the tool's internal consistency and test-retest reliability. Our study sample consisted of 62 policymakers and stakeholders that participated in deliberative dialogues. We assessed internal consistency using Cronbach's alpha and generalizability (G) coefficients, and we assessed test-retest reliability by calculating Pearson correlation coefficients (r) and G coefficients for each construct and the tool overall. Results The internal consistency of items within each construct was good with alpha ranging from 0.68 to alpha = 0.89. G-coefficients were lower for a single administration (G = 0.34 to G = 0.73) than for the average of two administrations (G = 0.79 to G = 0.89). Test-retest reliability coefficients for the constructs ranged from r = 0.26 to r = 0.77 and from G = 0.31 to G = 0.62 for a single administration, and from G = 0.47 to G = 0.86 for the average of two administrations. Test-retest reliability of the tool using G theory was moderate (G = 0.5) when we generalized across a single observation, but became strong (G = 0.9) when we averaged across both administrations. 
Conclusion This study provides preliminary evidence for the reliability of a tool that can be used to measure TPB constructs in relation to research use in policymaking. Our findings suggest that the tool should be administered on more than one occasion when the intervention promotes an initial 'spike' in enthusiasm for using research evidence (as it seemed to do in this case with deliberative dialogues). The findings from this study will be used to modify the tool and inform further psychometric testing following different KTE interventions. PMID:21702956
Cuchna, Jennifer W; Hoch, Matthew C; Hoch, Johanna M
2016-05-01
To synthesize the literature and perform a meta-analysis for both the interrater and intrarater reliability of the FMS™. Academic Search Complete, CINAHL, Medline and SportsDiscus databases were systematically searched from inception to March 2015. Studies were included if the primary purpose was to determine the interrater or intrarater reliability of the FMS™, assessed and scored all 7-items using the standard scoring criteria, provided a composite score and employed intraclass correlation coefficients (ICCs). Studies were excluded if reliability was not the primary aim, participants were injured at data collection, or a modified FMS™ or scoring system was utilized. Seven papers were included; 6 assessing interrater and 6 assessing intrarater reliability. There was moderate evidence in good interrater reliability with a summary ICC of 0.843 (95% CI = 0.640, 0.936; Q7 = 84.915, p < 0.0001). There was moderate evidence in good intrarater reliability with a summary ICC of 0.869 (95% CI = 0.785, 0.921; Q12 = 60.763, p < 0.0001). There was moderate evidence for both forms of reliability. The sensitivity assessments revealed this interpretation is stable and not influenced by any one study. Overall, the FMS™ is a reliable tool for clinical practice. Copyright © 2015 Elsevier Ltd. All rights reserved.
de Witte, Annemarie M H; Hoozemans, Marco J M; Berger, Monique A M; van der Slikke, Rienk M A; van der Woude, Lucas H V; Veeger, Dirkjan H E J
2018-01-01
The aim of this study was to develop and describe a wheelchair mobility performance test in wheelchair basketball and to assess its construct validity and reliability. To mimic mobility performance of wheelchair basketball matches in a standardised manner, a test was designed based on observation of wheelchair basketball matches and expert judgement. Forty-six players performed the test to determine its validity and 23 players performed the test twice for reliability. Independent-samples t-tests were used to assess whether the times needed to complete the test were different for classifications, playing standards and sex. Intraclass correlation coefficients (ICC) were calculated to quantify reliability of performance times. Males performed better than females (P < 0.001, effect size [ES] = -1.26) and international men performed better than national men (P < 0.001, ES = -1.62). Performance time of low (≤2.5) and high (≥3.0) classification players was borderline not significant with a moderate ES (P = 0.06, ES = 0.58). The reliability was excellent for overall performance time (ICC = 0.95). These results show that the test can be used as a standardised mobility performance test to validly and reliably assess the capacity in mobility performance of elite wheelchair basketball athletes. Furthermore, the described methodology of development is recommended for use in other sports to develop sport-specific tests.
Reliability, validity, and significance of assessment of sense of contribution in the workplace.
Takaki, Jiro; Taniguchi, Toshiyo; Fujii, Yasuhito
2014-01-29
The purpose of this study was to assess the validity and reliability of the Sense of Contribution Scale (SCS), a newly developed, 7-item questionnaire used to measure sense of contribution in the workplace. Workers at 272 organizations answered questionnaires that included the SCS. Because of non-participation or missing data, the number of subjects included in the analyses for internal consistency and validity varied from 1,675 to 2,462 (response rates 54.6%-80.2%). Fifty-four workers were included in the analysis of test-retest reliability (response rate, 77.1%). The SCS showed high internal consistency (Cronbach's α coefficients in men and women were 0.85 and 0.86, respectively) and test-retest reliability (intraclass correlation coefficient = 0.91). Significant (p < 0.001), positive, moderate correlations were found between the SCS score and scores for organization-based self-esteem and work engagement in both genders, which support the SCS's convergent and discriminant validity. The criterion validity of the SCS was supported by the finding that in both genders, the SCS scores were significantly (p < 0.05) and inversely associated with psychological distress and sleep disturbance in crude and in multivariable analyses that adjusted for demographics, organization-based self-esteem, work engagement, effort-reward ratio, workplace bullying, and procedural and interactional justice. The SCS is a psychometrically satisfactory measure of sense of contribution in the workplace. The SCS provides a new and useful instrument to measure sense of contribution, which is independently associated with mental health in workers, for studies in organizational science, occupational health psychology and occupational medicine.
Shoemaker, Sarah J.; Wolf, Michael S.; Brach, Cindy
2016-01-01
Objective To develop a reliable and valid instrument to assess the understandability and actionability of print and audiovisual materials. Methods We compiled items from existing instruments/guides that the expert panel assessed for face/content validity. We completed four rounds of reliability testing, and produced evidence of construct validity with consumers and readability assessments. Results The experts deemed the PEMAT items face/content valid. Four rounds of reliability testing and refinement were conducted using raters untrained on the PEMAT. Agreement improved across rounds. The final PEMAT showed moderate agreement per Kappa (Average K = 0.57) and strong agreement per Gwet’s AC1 (Average = 0.74). Internal consistency was strong (α = 0.71; Average Item-Total Correlation = 0.62). For construct validation with consumers (n = 47), we found significant differences between actionable and poorly-actionable materials in comprehension scores (76% vs. 63%, p < 0.05) and ratings (8.9 vs. 7.7, p < 0.05). For understandability, there was a significant difference for only one of two topics on consumer numeric scores. For actionability, there were significant positive correlations between PEMAT scores and consumer-testing results, but no relationship for understandability. There were, however, strong, negative correlations between grade-level and both consumer-testing results and PEMAT scores. Conclusions The PEMAT demonstrated strong internal consistency, reliability, and evidence of construct validity. Practice implications The PEMAT can help professionals judge the quality of materials (available at: http://www.ahrq.gov/pemat). PMID:24973195
Sousa, Pedro; Gaspar, Pedro; Fonseca, Helena; Hendricks, Constance; Murdaugh, Carolyn
2015-01-01
Reliable and valid instruments are essential for understanding health-promoting behaviors in adolescents. This study analyzed the psychometric properties of the Portuguese version of the Adolescent Lifestyle Profile (ALP). A linguistic and cultural translation of the ALP was conducted with 236 adolescents from two different settings: a community (n=141) and a clinical setting (n=95). Internal consistency reliability and confirmatory factor analysis were performed. Results showed an adequate fit to data, yielding a 36-item, seven-factor structure (CMIN/DF=1.667, CFI=0.807, GFI=0.822, RMR=0.051, RMSEA=0.053, PNFI=0.575, PCFI=0.731). The ALP presented a high internal consistency (α=0.866), with the subscales presenting moderate reliability values (from 0.492 to 0.747). The highest values were in Interpersonal Relations (3.059±0.523) and Positive Life Perspective (2.985±0.588). Some gender differences were found. Findings showed that adolescents from the clinic reported an overall healthier lifestyle than those from the community setting (2.598±0.379 vs. 2.504±0.346; t=1.976, p=0.049). The ALP Portuguese version is a psychometrically reliable, valid, and useful measurement instrument for assessing health-promoting lifestyles in adolescence. The ALP is cross-culturally validated and can decisively contribute to a better understanding of adolescent health promotion needs. Additional research is needed to evaluate the instrument's predictive validity, as well as its clinical relevance for practice and research. Copyright © 2015 Sociedade Brasileira de Pediatria. Published by Elsevier Editora Ltda. All rights reserved.
Jean-Pierre, Pascal; Fundakowski, Christopher; Perez, Enrique; Jean-Pierre, Shadae E; Jean-Pierre, Ashley R; Melillo, Angelica B; Libby, Rachel; Sargi, Zoukaa
2013-02-01
Cancer and its treatments are associated with psychological distress that can negatively impact self-perception, psychosocial functioning, and quality of life. Patients with head and neck cancers (HNC) are particularly susceptible to psychological distress. This study involved a cross-validation of the Measure of Body Apperception (MBA) for HNC patients. One hundred and twenty-two English-fluent HNC patients between 20 and 88 years of age completed the MBA on a Likert scale ranging from "1 = disagree" to "4 = agree." We assessed the latent structure and internal consistency reliability of the MBA using Principal Components Analysis (PCA) and Cronbach's coefficient alpha (α), respectively. We determined convergent and divergent validities of the MBA using correlations with the Hospital Anxiety and Depression Scale (HADS), observer disfigurement rating, and patients' clinical and demographic variables. The PCA revealed a coherent set of items that explained 38% of the variance. The Kaiser-Meyer-Olkin measure of sampling adequacy was 0.73 and the Bartlett's test of sphericity was statistically significant (χ²(28) = 253.64; p < 0.001), confirming the suitability of the data for dimension reduction analysis. The MBA had good internal consistency reliability (α = 0.77) and demonstrated adequate convergent and divergent validities based on statistically significant moderate correlations with the HADS (p < 0.01) and observer rating of disfigurement (p < 0.026) and nonstatistically significant correlations with patients' clinical and demographic variables: tumor location, age at diagnosis, and birth place (all ps > 0.05). The MBA is a valid and reliable screening measure of body apperception for HNC patients.
Chen, Shu-Ching; Chen, Hsiu-Fang; Peng, Hsi-Ling; Lee, Li-Yun; Chiang, Ting-Yu; Chiu, Hui-Chuan
2017-04-01
The purposes of this study were to evaluate the psychometric properties, reliability, and validity of the Chinese-version Glover-Nilsson Smoking Behavioral Questionnaire (GN-SBQ-C) and assess the behavioral nicotine dependence among community-dwelling adult smokers in Taiwan. The methods used were survey design, administration, and validation. A total of 202 adult smokers completed a survey to assess behavioral dependence, nicotine dependence, depression, social support, and demographic and smoking characteristics. Data analysis included descriptive statistics, internal consistency reliability, t test, exploratory factor analysis, independent t test, and Pearson product moment correlation. The results showed that (1) the GN-SBQ-C has good internal consistency reliability and stability (2-week test-retest reliability); (2) the extracted one factor explained 41.80 % of the variance, indicating construct validity; (3) the scale has acceptable concurrent validity, with significant positive correlation between the GN-SBQ-C and nicotine dependence, depression, and time smoking and negative correlation between the GN-SBQ-C and age and exercise habit; and (4) the instrument has discriminant validity, supported by significant differences between those with high and low-to-moderate nicotine dependence, smokers greater than 43 years old and those 43 years old and younger, and those who smoked 10 years or less and those smoking more than 10 years. The 11-item GN-SBQ-C has satisfactory psychometric properties when applied in a sample of Taiwanese adult smokers. The scale is feasible and valid to use to assess smoking behavioral dependence.
Dufour, Simon; Latour, Sylvie; Chicoine, Yvan; Fecteau, Gilles; Forget, Sylvain; Moreau, Jean; Trépanier, André
2012-01-01
A script concordance test (SCT) was developed measuring clinical reasoning of food-ruminant practitioners for whom potential clinical competence difficulties were identified by their provincial professional organization. The SCT was designed to be used as part of a broader evaluation procedure. A scoring key was developed based on answers from a reference panel of 12 experts and using the modified aggregate method commonly used for SCTs. A convenience sample of 29 food-ruminant practitioners was constituted to assess the reliability and precision of the SCT and to determine a fair threshold value for success. Cronbach's α coefficients were computed to evaluate internal reliability. To evaluate SCT precision, a test-retest methodology was used and measures of agreement beyond chance were computed at question and test levels. After optimization, the 36-question SCT yielded acceptable internal reliability (Cronbach's α=0.70). Precision of the SCT at question level was excellent with 33 questions (92%) yielding moderate to almost perfect agreement between administrations. At test level, fair agreement (concordance correlation coefficient=0.32) was observed between administrations. A slight SCT score improvement (M=+2.8 points) on the second administration was in part responsible for some of the disagreement and was potentially a result of an adaptation to the SCT format. The score distribution was used to determine a fair threshold value for success, while considering the underlying objectives of the examination. The data suggest that the developed SCT can be used as a reliable and precise measurement of clinical reasoning of food-ruminant practitioners.
Shoemaker, Sarah J; Wolf, Michael S; Brach, Cindy
2014-09-01
To develop a reliable and valid instrument to assess the understandability and actionability of print and audiovisual materials. We compiled items from existing instruments/guides that the expert panel assessed for face/content validity. We completed four rounds of reliability testing, and produced evidence of construct validity with consumers and readability assessments. The experts deemed the PEMAT items face/content valid. Four rounds of reliability testing and refinement were conducted using raters untrained on the PEMAT. Agreement improved across rounds. The final PEMAT showed moderate agreement per Kappa (Average K=0.57) and strong agreement per Gwet's AC1 (Average=0.74). Internal consistency was strong (α=0.71; Average Item-Total Correlation=0.62). For construct validation with consumers (n=47), we found significant differences between actionable and poorly-actionable materials in comprehension scores (76% vs. 63%, p<0.05) and ratings (8.9 vs. 7.7, p<0.05). For understandability, there was a significant difference for only one of two topics on consumer numeric scores. For actionability, there were significant positive correlations between PEMAT scores and consumer-testing results, but no relationship for understandability. There were, however, strong, negative correlations between grade-level and both consumer-testing results and PEMAT scores. The PEMAT demonstrated strong internal consistency, reliability, and evidence of construct validity. The PEMAT can help professionals judge the quality of materials (available at: http://www.ahrq.gov/pemat). Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Reliability, Validity, and Significance of Assessment of Sense of Contribution in the Workplace
Takaki, Jiro; Taniguchi, Toshiyo; Fujii, Yasuhito
2014-01-01
The purpose of this study was to assess the validity and reliability of the Sense of Contribution Scale (SCS), a newly developed, 7-item questionnaire used to measure sense of contribution in the workplace. Workers at 272 organizations answered questionnaires that included the SCS. Because of non-participation or missing data, the number of subjects included in the analyses for internal consistency and validity varied from 1,675 to 2,462 (response rates 54.6%–80.2%). Fifty-four workers were included in the analysis of test–retest reliability (response rate, 77.1%). The SCS showed high internal consistency (Cronbach’s α coefficients in men and women were 0.85 and 0.86, respectively) and test–retest reliability (intraclass correlation coefficient = 0.91). Significant (p < 0.001), positive, moderate correlations were found between the SCS score and scores for organization-based self-esteem and work engagement in both genders, which support the SCS’s convergent and discriminant validity. The criterion validity of the SCS was supported by the finding that in both genders, the SCS scores were significantly (p < 0.05) and inversely associated with psychological distress and sleep disturbance in crude and in multivariable analyses that adjusted for demographics, organization-based self-esteem, work engagement, effort–reward ratio, workplace bullying, and procedural and interactional justice. The SCS is a psychometrically satisfactory measure of sense of contribution in the workplace. The SCS provides a new and useful instrument to measure sense of contribution, which is independently associated with mental health in workers, for studies in organizational science, occupational health psychology and occupational medicine. PMID:24481035
Wan, Chonghua; Li, Hezhan; Fan, Xuejin; Yang, Ruixue; Pan, Jiahua; Chen, Wenru; Zhao, Rong
2014-06-04
Quality of life (QOL) in patients with coronary heart disease (CHD) is now a worldwide concern, yet specific instruments are scarce and none has been developed by the modular approach. This paper aimed to develop the CHD scale of the system of Quality of Life Instruments for Chronic Diseases (QLICD-CHD) by the modular approach and validate it by both classical test theory and Generalizability Theory. The QLICD-CHD was developed based on programmed decision procedures with multiple nominal and focus group discussions, in-depth interviews, pre-testing and quantitative statistical procedures. Data from 146 inpatients with CHD, with QOL measured three times before and after treatments, were used. The psychometric properties of the scale were evaluated with respect to validity, reliability and responsiveness employing correlation analysis, factor analyses, multi-trait scaling analysis, t-tests and also G studies and D studies of Generalizability Theory analysis. Multi-trait scaling analysis, correlation and factor analyses confirmed good construct validity and criterion-related validity when using SF-36 as a criterion. The internal consistency α and test-retest reliability coefficients (Pearson r and Intra-class correlations ICC) for the overall instrument and all domains were higher than 0.70 and 0.80 respectively; the overall score and all domains except for the social domain had statistically significant changes after treatments with moderate effect size SRM (standardized response mean) ranging from 0.32 to 0.67. G-coefficients and index of dependability (Ф coefficients) confirmed the reliability of the scale further with more exact variance components. The QLICD-CHD has good validity, reliability, and moderate responsiveness and some highlights, and can be used as the quality of life instrument for patients with CHD. 
However, in order to obtain better reliability, the numbers of items for social domain should be increased or the items' quality, not quantity, should be improved.
What to Do With "Moderate" Reliability and Validity Coefficients?
Post, Marcel W
2016-07-01
Clinimetric studies may use criteria for test-retest reliability and convergent validity such that correlation coefficients as low as .40 are supportive of reliability and validity. It can be argued that moderate (.40-.60) correlations should not be interpreted in this way and that reliability coefficients <.70 should be considered as indicative of unreliability. Convergent validity coefficients in the .40 to .60 or .40 to .70 range should be considered as indications of validity problems, or as inconclusive at best. Studies on reliability and convergent validity should be designed in such a way that it is realistic to expect high reliability and validity coefficients. Multitrait multimethod approaches are preferred to study construct (convergent-divergent) validity. Copyright © 2016 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Ringdal, Kjetil G; Skaga, Nils Oddvar; Steen, Petter Andreas; Hestnes, Morten; Laake, Petter; Jones, J Mary; Lossius, Hans Morten
2013-01-01
Pre-injury comorbidities can influence the outcomes of severely injured patients. Pre-injury comorbidity status, graded according to the American Society of Anesthesiologists Physical Status (ASA-PS) classification system, is an independent predictor of survival in trauma patients and is recommended as a comorbidity score in the Utstein Trauma Template for Uniform Reporting of Data. Little is known about the reliability of pre-injury ASA-PS scores. The objective of this study was to examine whether the pre-injury ASA-PS system was a reliable scale for grading comorbidity in trauma patients. Nineteen Norwegian trauma registry coders were invited to participate in a reliability study in which 50 real but anonymised patient medical records were distributed. Reliability was analysed using quadratic weighted kappa (κ(w)) analysis with 95% CI as the primary outcome measure and unweighted kappa (κ) analysis, which included unknown values, as a secondary outcome measure. Fifteen of the invitees responded to the invitation, and ten participated. We found moderate (κ(w)=0.77 [95% CI: 0.64-0.87]) to substantial (κ(w)=0.95 [95% CI: 0.89-0.99]) rater-against-reference standard reliability using κ(w) and fair (κ=0.46 [95% CI: 0.29-0.64]) to substantial (κ=0.83 [95% CI: 0.68-0.94]) reliability using κ. The inter-rater reliability ranged from moderate (κ(w)=0.66 [95% CI: 0.45-0.81]) to substantial (κ(w)=0.96 [95% CI: 0.88-1.00]) for κ(w) and from slight (κ=0.36 [95% CI: 0.21-0.54]) to moderate (κ=0.75 [95% CI: 0.62-0.89]) for κ. The rater-against-reference standard reliability varied from moderate to substantial for the primary outcome measure and from fair to substantial for the secondary outcome measure. The study findings indicate that the pre-injury ASA-PS scale is a reliable score for classifying comorbidity in trauma patients. Copyright © 2012 Elsevier Ltd. All rights reserved.
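The primary outcome above is a quadratic weighted kappa, which penalises disagreements by the squared distance between ordinal categories. The following is a minimal sketch of the standard two-rater computation, not the authors' analysis code: the function name, the 0-based category coding, and the example ratings are all illustrative assumptions.

```python
def quadratic_weighted_kappa(rater_a, rater_b, n_categories):
    """Quadratic weighted kappa for two raters on an ordinal scale.

    Ratings must be integers coded 0 .. n_categories - 1 (an assumption of
    this sketch; ASA-PS classes would need recoding to that range).
    """
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("ratings must be non-empty and of equal length")
    if n_categories < 2:
        raise ValueError("need at least two categories")
    k = n_categories
    n = len(rater_a)

    # Contingency table of observed rating pairs.
    observed = [[0] * k for _ in range(k)]
    for a, b in zip(rater_a, rater_b):
        observed[a][b] += 1
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(observed[i][j] for i in range(k)) for j in range(k)]

    weighted_observed = 0.0
    weighted_expected = 0.0
    for i in range(k):
        for j in range(k):
            # Disagreement weight grows with the squared category distance.
            weight = (i - j) ** 2 / (k - 1) ** 2
            weighted_observed += weight * observed[i][j]
            # Expected count under independence of the two raters.
            weighted_expected += weight * row_totals[i] * col_totals[j] / n
    if weighted_expected == 0:
        raise ValueError("kappa is undefined when both raters use one category")
    return 1.0 - weighted_observed / weighted_expected


# Hypothetical ratings of six records by two coders on a 4-level scale.
coder_1 = [0, 1, 2, 3, 1, 2]
coder_2 = [0, 1, 2, 2, 1, 3]
kappa_w = quadratic_weighted_kappa(coder_1, coder_2, n_categories=4)
```

Because near-misses on adjacent classes receive small weights, the weighted coefficient is typically higher than the unweighted kappa on the same data, which is consistent with the pattern of values reported in the abstract.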
Mulero-Portela, Ana L.; Colón-Santaella, Carmen L.; Cruz-Gomez, Cynthia
2010-01-01
The purpose of this study was to perform a cross-cultural adaptation of the Disability of Arm, Shoulder, and Hand (DASH) questionnaire to Spanish for Puerto Rico. Five steps were followed for the cross-cultural adaptation: forward translations into Spanish for Puerto Rico, synthesis of the translations, back translations into English, revision by an expert committee, and field test of the prefinal version. Psychometric characteristics of reliability and construct validity were evaluated for the final version. Internal consistency of the final version was high (Cronbach's α = 0.97) and item-to-total correlations were moderate (range from 0.44 to 0.85). Construct validity was evaluated by correlating the DASH with the scales of the Functional Assessment of Cancer Therapy - Breast. Fair to moderate correlations found in this study between the DASH and most scales of the Functional Assessment of Cancer Therapy - Breast support the construct validity of the Puerto Rico-Spanish DASH. The final version of the questionnaire was revised and approved by the Institute for Work and Health of Canada. Revisions to the original DASH English version are recommended. This version of the DASH is valid and reliable, and it can be used to evaluate outcomes in both clinical and research settings. PMID:19901616
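Several abstracts in this listing report internal consistency as Cronbach's α. As an illustration only, the usual formula, α = k/(k−1) · (1 − Σ item variances / variance of total scores), can be sketched as follows; the helper name `cronbach_alpha` and the scores are hypothetical, and sample variances (n−1 denominator) are assumed.

```python
from statistics import variance


def cronbach_alpha(items):
    """Cronbach's alpha.

    `items` is a list of k lists; each inner list holds one item's scores,
    with respondents in the same order across items.
    """
    k = len(items)
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Total score per respondent across all items.
    totals = [sum(scores) for scores in zip(*items)]
    item_variance_sum = sum(variance(scores) for scores in items)
    return (k / (k - 1)) * (1 - item_variance_sum / variance(totals))


# Hypothetical two-item scale answered by four respondents.
alpha = cronbach_alpha([[1, 2, 3, 4], [2, 1, 4, 3]])
```

With real questionnaires each inner list would hold one item's responses across the whole sample, for example 30 lists for a 30-item instrument.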
Moessner, Anne; Malec, James F; Beveridge, Scott; Reddy, Cara Camiolo; Huffman, Tracy; Marton, Julia; Schmerzler, Audrey J
2016-01-01
To develop and provide initial validation of a measure for accurately determining the need for Constant Visual Observation (CVO) in patients with traumatic brain injury (TBI) admitted to inpatient rehabilitation. Rating scale development and evaluation through Rasch analysis and assessment of concurrent validity. One hundred and thirty-four individuals with moderate-severe TBI were studied in seven inpatient brain rehabilitation units associated with the National Institute for Disability, Independent Living and Rehabilitation Research (NIDILRR) TBI Model System. Participants were rated on the preliminary version of the CVO Needs Assessment scale (CVONA) and, by independent raters, on the Levels of Risk (LoR) and Supervision Rating Scale (SRS) at four time points during inpatient rehabilitation: admission, Days 2-3, Days 5-6 and Days 8-9. After pruning misfitting items, the CVONA showed satisfactory internal consistency (Person Reliability = 0.85-0.88) across time points. With reference to the LoR and SRS, low false negative rates (sensitivity > 90%) were associated with moderate-to-high false positive rates (29-56%). The CVONA may be a useful objective metric to complement clinical judgement regarding the need for CVO; however, further prospective study is desirable to further assess its utility in identifying at-risk patients, reducing adverse events and decreasing CVO costs.
Assessment of technical and nontechnical skills in surgical residents.
Ponton-Carss, Alicia; Kortbeek, John B; Ma, Irene W Y
2016-11-01
Surgical competence encompasses both technical and nontechnical skills. This study seeks to evaluate the validity evidence for a comprehensive surgical skills examination and to examine the relationship between technical and nontechnical skills. Six examination stations assessing both technical and nontechnical skills, conducted yearly for surgical trainees (n = 120) between 2010 and 2014 are included. The assessment tools demonstrated acceptable internal consistency. Interstation reliability for technical skills was low (alpha = .39). Interstation reliability for the nontechnical skills was lower (alpha range -.05 to .31). Nontechnical skills domains were strongly correlated, ranging from r = .65, P < .001 to .86, P < .001. The associations between nontechnical and technical skills were inconsistent, ranging from poor (r = -.06; P = .54) to moderate (r = .45; P < .001). Multiple samplings of integrated technical and nontechnical skills are necessary to assess overall surgical competency. Copyright © 2016 Elsevier Inc. All rights reserved.
Measuring hope among families impacted by cognitive impairment
Hunsaker, Amanda E.; Terhorst, Lauren; Gentry, Amanda; Lingler, Jennifer H.
2014-01-01
The current exploratory investigation aims to establish the reliability and validity of a hope measure, the Herth Hope Index (HHI), among families impacted by early cognitive impairment (N=96). Exploratory factor analysis was used to examine the dimensionality of the measure. Bivariate analyses were used to examine construct validity. The sample had moderately high hope scores. A two-factor structure emerged from the factor analysis, explaining 51.44% of the variance. Both factors exhibited strong internal consistency (Cronbach’s alphas ranged from .83 to .86). Satisfaction with social support was positively associated with hope, supporting convergent validity. Neurocognitive status, illness insight and depression were not associated with hope, indicating discriminant validity. Families impacted by cognitive impairment may maintain hope in the face of a potentially progressive illness, regardless of cognitive status. The HHI can be utilized as a reliable and valid measure of hope by practitioners providing support to families impacted by cognitive impairment. PMID:24784938
TWO MEASURES FOR CROSS-CULTURAL RESEARCH ON MORALITY: COMPARISON AND REVISION.
Zhang, Yanyan; Li, Sisi
2015-08-01
The current research assessed the reliability and validity of two Western measures of morality in a Chinese sample, namely the Community, Autonomy, and Divinity Scale (CADS) and the Moral Foundations Questionnaire (MFQ). Questionnaires were administered to 274 Chinese participants in Northern China (M age = 25.4 yr., SD = 8.50; 86% women). Confirmatory factor analysis using a structural equation model was conducted to evaluate the construct validity of the two scales. The results indicated a reasonable model fit of both the CADS and the MFQ after certain modifications. The revised versions of both measures had good internal consistency reliabilities. Correlation analysis indicated moderate correlations between the dimensions of the two scales. Regarding the content of morality, Chinese people endorsed more of the traditional ethics and foundations than people from Western cultures in other studies. In addition, participants who reported a religious affiliation scored higher on the Divinity subscale compared to those who claimed to be atheists.
2014-01-01
Background Health impairments can result in disability and changed work productivity imposing considerable costs for the employee, employer and society as a whole. A large number of instruments exist to measure health-related productivity changes; however their methodological quality remains unclear. This systematic review critically appraised the measurement properties in generic self-reported instruments that measure health-related productivity changes to recommend appropriate instruments for use in occupational and economic health practice. Methods PubMed, PsycINFO, Econlit and Embase were systematically searched for studies whereof: (i) instruments measured health-related productivity changes; (ii) the aim was to evaluate instrument measurement properties; (iii) instruments were generic; (iv) ratings were self-reported; (v) full-texts were available. Next, methodological quality appraisal was based on COSMIN elements: (i) internal consistency; (ii) reliability; (iii) measurement error; (iv) content validity; (v) structural validity; (vi) hypotheses testing; (vii) cross-cultural validity; (viii) criterion validity; and (ix) responsiveness. Recommendations are based on evidence syntheses. Results This review included 25 articles assessing the reliability, validity and responsiveness of 15 different generic self-reported instruments measuring health-related productivity changes. Most studies evaluated criterion validity, none evaluated cross-cultural validity and information on measurement error is lacking. The Work Limitation Questionnaire (WLQ) was most frequently evaluated with moderate respectively strong positive evidence for content and structural validity and negative evidence for reliability, hypothesis testing and responsiveness. Less frequently evaluated, the Stanford Presenteeism Scale (SPS) showed strong positive evidence for internal consistency and structural validity, and moderate positive evidence for hypotheses testing and criterion validity. 
The Productivity and Disease Questionnaire (PRODISQ) yielded strong positive evidence for content validity; evidence for other properties is lacking. The other instruments resulted in mostly fair-to-poor quality ratings with limited evidence. Conclusions Decisions based on the content of the instrument, usage purpose, target country and population, and available evidence are recommended. Until high-quality studies are in place to accurately assess the measurement properties of the currently available instruments, the WLQ and, in a Dutch context, the PRODISQ are cautiously preferred based on their strong positive evidence for content validity. Based on its strong positive evidence for internal consistency and structural validity, the SPS is cautiously recommended. PMID:24495301
Reliability and validity of a Chinese version of the Diagnostic Interview for Borderlines-Revised.
Wang, Lanlan; Yuan, Chenmei; Qiu, Jianying; Gunderson, John; Zhang, Min; Jiang, Kaida; Leung, Freedom; Zhong, Jie; Xiao, Zeping
2014-09-01
Borderline personality disorder (BPD) is the most studied of the axis II disorders. One of the most widely used diagnostic instruments is the Diagnostic Interview for Borderline Patients-Revised (DIB-R). The aim of this study was to test the reliability and validity of DIB-R for use in the Chinese culture. The reliability and validity of the DIB-R Chinese version were assessed in a sample of 236 outpatients with a probable BPD diagnosis. The Structured Clinical Interview for DSM-IV Personality Disorders (SCID-II) was used as a standard. Test-retest reliability was tested six months later with 20 patients, and inter-rater reliability was tested on 32 patients. The Chinese version of the DIB-R showed good internal global consistency (Cronbach's α of 0.916), good test-retest reliability (Pearson correlation of 0.704), good inter-rater reliability (intra-class correlation coefficient of 0.892 and kappa of 0.861). When compared with the DSM-IV diagnosis as measured by the SCID-II, the DIB-R showed relatively good sensitivity (0.768) and specificity (0.891) at the cutoff of 7, moderate diagnostic convergence (kappa of 0.631), as well as good discriminating validity. The Chinese version of the DIB-R has good psychometric properties, which renders it a valuable method for examining the presence, the severity, and component phenotypes of BPD in Chinese samples. © 2013 Wiley Publishing Asia Pty Ltd.
The development of a structured rating schedule (the BAS) to assess skills in breaking bad news
Miller, S J; Hope, T; Talbot, D C
1999-01-01
There has been considerable interest in how doctors break bad news, with calls from within the profession and from patients for doctors to improve their communication skills. In order to aid clinical training and assessment of the skills used in breaking bad news there is a need for a reliable, practical and valid, structured rating schedule. Such a rating schedule was compiled from agreed criteria in the literature. Video-taped recordings of simulated consultations breaking bad news were independently assessed by three raters using the schedule and compared to three experts who gave global ratings. The primary outcome measures were internal consistency of the schedule and level of agreement between raters. The internal consistency was high with a Cronbach's alpha of 0.93. Agreement between raters using the schedule was moderate to good. The majority of the variation in scores was due to the differences in skills demonstrated in the interviews. The agreement between raters not using the schedule was poor. The BAS provides a simple to use, reliable, and consistent rating schedule for assessing skills used in breaking bad news. It could be a valuable aid to teaching this difficult task. © 1999 Cancer Research Campaign PMID:10360657
Dittmann, Ralf W; Wehmeier, Peter M; Schacht, Alexander; Lehmann, Martin; Lehmkuhl, Gerd
2009-12-01
To report on (1) psychometric properties of the Rosenberg Self-Esteem Scale (SES) studied in adolescents with ADHD, (2) correlations of SES with ADHD scale scores, and (3) change in patient-reported self-esteem with atomoxetine treatment. ADHD patients (12-17 years), treated in an open-label study for 24 weeks. Secondary analyses on ADHD symptoms (assessed with ADHD-RS, CGI, GIPD scales) and self-esteem (SES) were performed. One hundred and fifty-nine patients were treated. A dichotomous structure of the SES could be confirmed. Reliability and internal consistency were moderate to excellent. Highest coefficients were found for the correlation between SES and GIPD scores. Self-esteem significantly increased over time, accompanied by an improvement of ADHD symptoms and related perceived difficulties. The Rosenberg SES was shown to be internally consistent, reliable, and sensitive to treatment-related changes of self-esteem. According to these findings, self-esteem may be an important individual patient outcome beyond the core symptoms of ADHD. © The Author(s) 2009. This article is published with open access at Springerlink.com
Athletic Engagement and Athletic Identity in Top Croatian Sprint Runners.
Babić, Vesna; Sarac, Jelena; Missoni, Sasa; Sindik, Josko
2015-09-01
The aim of the research was to determine the construct validity and reliability of two questionnaires (Athlete Engagement Questionnaire, AEQ, and Athletic Identity Measurement Scale, AIMS) applied to elite Croatian sprinters, as well as the correlations among the dimensions of these measuring instruments. We then determined the differences in the dimensions of sport engagement and sport identity according to gender, education level and winning medals at international competitions. A total of 71 elite sprinters (former and still active) were examined, of whom 27 (38%) were female and 44 (62%) were male. The factor analyses revealed dimensions very similar to those of the original instruments, which showed moderate-to-high reliabilities. A small number of statistically significant correlations were found between the dimensions of sport engagement and sport identity, mainly in male sprinters. A small number of statistically significant differences in the dimensions of sport engagement and sport identity were found according to gender, education level and winning medals at international competitions. The small number of differences could most reasonably be explained by the very similar characteristics of elite athletes at the same level of sport excellence.
Walvoort, Serge JW; van der Heijden, Paul T; Kessels, Roy PC; Egger, Jos IM
2016-01-01
Aim Impaired illness insight may hamper treatment outcome in patients with alcohol-related cognitive deficits. In this study, a short questionnaire for the assessment of illness insight (eg, the Q8) was investigated in patients with Korsakoff’s syndrome (KS) and in alcohol use disorder (AUD) patients with mild neurocognitive deficits. Methods First, reliability coefficients were computed and internal structure was investigated. Then, comparisons were made between patients with KS and patients with AUD. Furthermore, correlations with the Dysexecutive Questionnaire (DEX) were investigated. Finally, Q8 total scores were correlated with neuropsychological tests for processing speed, memory, and executive function. Results Internal consistency of the Q8 was acceptable (ie, Cronbach’s α =0.73). The Q8 items represent one factor, and scores differ significantly between AUD and KS patients. The Q8 total score, related to the DEX discrepancy score and scores on neuropsychological tests as was hypothesized, indicates that a higher degree of illness insight is associated with a higher level of cognitive functioning. Conclusion The Q8 is a short, valid, and easy-to-administer questionnaire to reliably assess illness insight in patients with moderate-to-severe alcohol-related cognitive dysfunction. PMID:27445476
Schuh, L A.; London, Z; Neel, R; Brock, C; Kissela, B M.; Schultz, L; Gelb, D J.
2009-01-01
Objective: The American Board of Psychiatry and Neurology (ABPN) has recently replaced the traditional, centralized oral examination with the locally administered Neurology Clinical Skills Examination (NEX). The ABPN postulated the experience with the NEX would be similar to the Mini-Clinical Evaluation Exercise, a reliable and valid assessment tool. The reliability and validity of the NEX has not been established. Methods: NEX encounters were videotaped at 4 neurology programs. Local faculty and ABPN examiners graded the encounters using 2 different evaluation forms: an ABPN form and one with a contracted rating scale. Some NEX encounters were purposely failed by residents. Cohen’s kappa and intraclass correlation coefficients (ICC) were calculated for local vs ABPN examiners. Results: Ninety-eight videotaped NEX encounters of 32 residents were evaluated by 20 local faculty evaluators and 18 ABPN examiners. The interrater reliability for a determination of pass vs fail for each encounter was poor (kappa 0.32; 95% confidence interval [CI] = 0.11, 0.53). ICC between local faculty and ABPN examiners for each performance rating on the ABPN NEX form was poor to moderate (ICC range 0.14-0.44), and did not improve with the contracted rating form (ICC range 0.09-0.36). ABPN examiners were more likely than local examiners to fail residents. Conclusions: There is poor interrater reliability between local faculty and American Board of Psychiatry and Neurology examiners. A bias was detected for favorable assessment locally, which is concerning for the validity of the examination. Further study is needed to assess whether training can improve interrater reliability and offset bias. 
GLOSSARY ABIM = American Board of Internal Medicine; ABPN = American Board of Psychiatry and Neurology; CI = confidence interval; HFH = Henry Ford Hospital; ICC = intraclass correlation coefficients; IM = internal medicine; mini-CEX = Mini-Clinical Evaluation Exercise; NEX = Neurology Clinical Skills Examination; RITE = residency inservice training examination; UC = University of Cincinnati; UM = University of Michigan; USF = University of South Florida. PMID:19605769
Multanen, Juhani; Honkanen, Mikko; Häkkinen, Arja; Kiviranta, Ilkka
2018-05-22
The Knee Injury and Osteoarthritis Outcome Score (KOOS) is a commonly used knee assessment and outcome tool in both clinical work and research. However, it has not been formally translated and validated in Finnish. The purpose of this study was to translate and culturally adapt the KOOS questionnaire into Finnish and to determine its validity and reliability among Finnish middle-aged patients with knee injuries. KOOS was translated and culturally adapted from English into Finnish. Subsequently, 59 patients with knee injuries completed the Finnish version of KOOS, Western Ontario and McMaster Osteoarthritis Index (WOMAC), Short-Form 36 Health Survey (SF-36) and Numeric Pain Rating Scale (Pain-NRS). The same KOOS questionnaire was re-administered 2 weeks later. Psychometric assessment of the Finnish KOOS was performed by testing its construct validity and reliability by using internal consistency, test-retest reliability and measurement error. The floor and ceiling effects were also examined. The cross-cultural adaptation revealed only minor cultural differences and was well received by the patients. For construct validity, high to moderate Spearman's Correlation Coefficients were found between the KOOS subscales and the WOMAC, SF-36, and Pain-NRS subscales. The Cronbach's alpha was from 0.79 to 0.96 for all subscales indicating acceptable internal consistency. The test-retest reliability was good to excellent, with Intraclass Correlation Coefficients ranging from 0.73 to 0.86 for all KOOS subscales. The minimal detectable change ranged from 17 to 34 on an individual level and from 2 to 4 on a group level. No floor or ceiling effects were observed. This study yielded an appropriately translated and culturally adapted Finnish version of KOOS which demonstrated good validity and reliability. Our data indicate that the Finnish version of KOOS is suitable for assessment of the knee status of Finnish patients with different knee complaints. 
Further studies are needed to evaluate the predictive ability of KOOS in the Finnish population.
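The abstract above reports minimal detectable change (MDC) at individual and group level without stating the formulas. One commonly used convention, assumed here rather than confirmed by the source, is SEM = SD·√(1 − ICC), MDC95 = 1.96·√2·SEM for an individual, and group-level MDC = individual MDC / √n. The function names below are hypothetical.

```python
import math


def standard_error_of_measurement(sd, icc):
    """SEM from the sample standard deviation and a test-retest ICC."""
    return sd * math.sqrt(1.0 - icc)


def mdc_individual(sem, z=1.96):
    """Minimal detectable change for one person at ~95% confidence."""
    return z * math.sqrt(2.0) * sem


def mdc_group(mdc_ind, n):
    """Group-level minimal detectable change for a sample of size n."""
    return mdc_ind / math.sqrt(n)


# Applying the assumed group-level formula to the reported individual range
# (17 to 34) with the reported sample size of 59 patients.
group_low = mdc_group(17, 59)
group_high = mdc_group(34, 59)
```

Under this convention the individual-level range of 17 to 34 corresponds to roughly 2.2 to 4.4 at group level for n = 59, which is in line with the reported range of 2 to 4; this suggests, but does not prove, that the authors used a similar calculation.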
Brennan, Sue E; McKenzie, Joanne E; Turner, Tari; Redman, Sally; Makkar, Steve; Williamson, Anna; Haynes, Abby; Green, Sally E
2017-01-17
Capacity building strategies are widely used to increase the use of research in policy development. However, a lack of well-validated measures for policy contexts has hampered efforts to identify priorities for capacity building and to evaluate the impact of strategies. We aimed to address this gap by developing SEER (Seeking, Engaging with and Evaluating Research), a self-report measure of individual policymakers' capacity to engage with and use research. We used the SPIRIT Action Framework to identify pertinent domains and guide development of items for measuring each domain. Scales covered (1) individual capacity to use research (confidence in using research, value placed on research, individual perceptions of the value their organisation places on research, supporting tools and systems), (2) actions taken to engage with research and researchers, and (3) use of research to inform policy (extent and type of research use). A sample of policymakers engaged in health policy development provided data to examine scale reliability (internal consistency, test-retest) and validity (relation to measures of similar concepts, relation to a measure of intention to use research, internal structure of the individual capacity scales). Response rates were 55% (150/272 people, 12 agencies) for the validity and internal consistency analyses, and 54% (57/105 people, 9 agencies) for test-retest reliability. The individual capacity scales demonstrated adequate internal consistency reliability (alpha coefficients > 0.7, all four scales) and test-retest reliability (intra-class correlation coefficients > 0.7 for three scales and 0.59 for fourth scale). Scores on individual capacity scales converged as predicted with measures of similar concepts (moderate correlations of > 0.4), and confirmatory factor analysis provided evidence that the scales measured related but distinct concepts. 
Items in each of these four scales related as predicted to concepts in the measurement model derived from the SPIRIT Action Framework. Evidence about the reliability and validity of the research engagement actions and research use scales was equivocal. Initial testing of SEER suggests that the four individual capacity scales may be used in policy settings to examine current capacity and identify areas for capacity building. The relation between capacity, research engagement actions and research use requires further investigation.
Cerin, Ester; Sit, Cindy H P; Huang, Ya-Jun; Barnett, Anthony; Macfarlane, Duncan J; Wong, Stephen S H
2014-06-06
Physical activity and sedentary behaviour are important contributors to adolescents' health. These behaviours may be affected by the school and neighbourhood built environments. However, current evidence on such effects is mainly limited to Western countries. The International Physical Activity and the Environment Network (IPEN)-Adolescent study aims to examine associations of the built environment with adolescent physical activity and sedentary behaviour across five continents. We report on the repeatability of measures of in-school and out-of-school physical activity, plus measures of out-of-school sedentary and travel behaviours adopted by the IPEN-Adolescent study and adapted for Chinese-speaking Hong Kong adolescents participating in the international Healthy environments and active living in teenagers-(Hong Kong) [iHealt(H)] study, which is part of IPEN-Adolescent. Items gauging in-school physical activity and out-of-school physical activity, and out-of-school sedentary and travel behaviours developed for the IPEN-Adolescent study were translated from English into Chinese, adapted, and pilot tested. Sixty-eight Chinese-speaking 12-17-year-old secondary school students (36 boys; 32 girls) residing in areas of Hong Kong differing in transport-related walkability were recruited. They self-completed the survey items twice, 8-16 days apart. Test-retest reliability was assessed for the whole sample and by gender using one-way random effects intra-class correlation coefficients (ICC). Test-retest reliability of items with restricted variability was assessed using percentage agreement. Overall test-retest reliability of items and scales was moderate to excellent (ICC = 0.47-0.92). Items with restricted variability in responses had a high percentage agreement (92%-100%).
Test-retest reliability was similar in girls and boys, with the exception of daily hours of homework (reliability higher in girls) and number of school-based sports teams or after-school physical activity classes (reliability higher in boys). The translated and adapted self-report measures of physical activity, sedentary and travel behaviours used in the iHealt(H) study are sufficiently reliable. Levels of reliability are comparable or slightly higher than those observed for the original measures.
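The test-retest analysis above uses one-way random effects intra-class correlation coefficients. A minimal sketch of the single-measurement form, ICC(1,1) = (MSB − MSW) / (MSB + (k − 1)·MSW), is shown below; the function name `icc_oneway` and the example scores are hypothetical, and the study's exact computation may differ.

```python
def icc_oneway(data):
    """One-way random effects ICC for single measurements, ICC(1,1).

    `data` holds one row per participant; each row has the k repeated
    measurements for that participant (k = 2 for a test-retest design).
    """
    n = len(data)
    k = len(data[0])
    if n < 2 or k < 2 or any(len(row) != k for row in data):
        raise ValueError("need >= 2 participants with the same number (>= 2) of measurements")

    grand_mean = sum(sum(row) for row in data) / (n * k)
    row_means = [sum(row) / k for row in data]

    # Between-participant and within-participant mean squares.
    ms_between = k * sum((m - grand_mean) ** 2 for m in row_means) / (n - 1)
    ms_within = sum(
        (x - m) ** 2 for row, m in zip(data, row_means) for x in row
    ) / (n * (k - 1))

    return (ms_between - ms_within) / (ms_between + (k - 1) * ms_within)


# Hypothetical test and retest scores for three participants.
icc = icc_oneway([[1, 2], [2, 3], [3, 4]])
```

In the one-way model a systematic shift between the first and second administration counts as error, so this hypothetical example scores below 1 even though every participant changed by the same amount.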
Porter, Anna K; Wen, Fang; Herring, Amy H; Rodríguez, Daniel A; Messer, Lynne C; Laraia, Barbara A; Evenson, Kelly R
2018-06-01
Reliable and stable environmental audit instruments are needed to successfully identify the physical and social attributes that may influence physical activity. This study described the reliability and stability of the PIN3 environmental audit instrument in both urban and rural neighborhoods. Four randomly sampled road segments in and around a one-quarter mile buffer of participants' residences from the Pregnancy, Infection, and Nutrition (PIN3) study were rated twice, approximately 2 weeks apart. One year later, 253 of the year 1 sampled roads were re-audited. The instrument included 43 measures that resulted in 73 item scores for calculation of percent overall agreement, kappa statistics, and log-linear models. For same-day reliability, 81% of items had moderate to outstanding kappa statistics (kappas ≥ 0.4). Two-week reliability was slightly lower, with 77% of items having moderate to outstanding agreement using kappa statistics. One-year stability had 68% of items showing moderate to outstanding agreement using kappa statistics. The reliability of the audit measures was largely consistent when comparing urban to rural locations, with only 8% of items exhibiting significant differences (α < 0.05) by urbanicity. The PIN3 instrument is a reliable and stable audit tool for studies assessing neighborhood attributes in urban and rural environments.
The challenge of mapping between two medical coding systems.
Wojcik, Barbara E; Stein, Catherine R; Devore, Raymond B; Hassell, L Harrison
2006-11-01
Deployable medical systems patient conditions (PCs) designate groups of patients with similar medical conditions and, therefore, similar treatment requirements. PCs are used by the U.S. military to estimate field medical resources needed in combat operations. Information associated with each of the 389 PCs is based on subject matter expert opinion, instead of direct derivation from standard medical codes. Currently, no mechanisms exist to tie current or historical medical data to PCs. Our study objective was to determine whether reliable conversion between PC codes and International Classification of Diseases, 9th Revision, Clinical Modification (ICD-9-CM) diagnosis codes is possible. Data were analyzed for three professional coders assigning all applicable ICD-9-CM diagnosis codes to each PC code. Inter-rater reliability was measured by using Cohen's κ statistic and percent agreement. Methods were developed to calculate kappa statistics when multiple responses could be selected from many possible categories. Overall, we found moderate support for the possibility of reliable conversion between PCs and ICD-9-CM diagnoses (mean kappa = 0.61). Current PCs should be modified into a system that is verifiable with real data.
Youssof, Sarah; Romero-Clark, Carol; Warner, Teddy; Plowman, Emily
2017-07-01
The Swallowing Quality of Life instrument (SWAL-QOL) is a patient-reported outcome measure of swallowing-related quality of life (SR-QoL). Its psychometric properties in oculopharyngeal muscular dystrophy (OPMD) are not known. We administered the SWAL-QOL to U.S. OPMD Registry participants. We described SR-QoL profiles and assessed reliability and validity. The mean composite score in 113 individuals with OPMD was 54.4 ± 20.7, indicating moderate impairment. Severe impairments were observed in eating duration, burden, and fatigue scales. Internal consistency reliability of all scales was found to be satisfactory, and 9 of 10 scales demonstrated adequate test-retest reliability. Data confirmed 86% of hypotheses, supporting construct validity. The SWAL-QOL limitations in OPMD include: floor/ceiling effects in 7 of 10 scales and low specificity of sleep, fatigue, and communication scales for dysphagia. SR-QoL is reduced in OPMD. Given several limitations of the SWAL-QOL, development of an improved dysphagia-specific QoL instrument for OPMD is warranted. Muscle Nerve 56: 28-35, 2017. © 2016 Wiley Periodicals, Inc.
Psychometric properties of the Social Phobia and Anxiety Inventory for Children in a Spanish sample.
Olivares, José; Sánchez-García, Raquel; López-Pina, José Antonio; Rosa-Alcázar, Ana Isabel
2010-11-01
The objectives of the present study were to adapt and analyze the factor structure, reliability, and validity of the Social Phobia and Anxiety Inventory for Children (SPAI-C; Beidel, Turner, & Morris, 1995) in a Spanish population. The SPAI-C was applied to a sample of 1588 children and adolescents with ages ranging from 10 to 17 years. The confirmatory factor analysis (CFA) showed a four-factor structure: Public performance, Assertiveness, Fear and avoidance/escape in social encounters, and Cognitive and psychophysiological interferences. Internal consistency was high (.90) and test-retest reliability was moderate (.56). Significant differences were found by sex and age, although the effect size was small for both variables and their interaction. Overall, social anxiety measured with the SPAI-C decreased as age increased; among participants of the same age, values were higher for girls than for boys. Results suggest that the Social Phobia and Anxiety Inventory for Children is a valid and reliable instrument to assess social anxiety in Spanish children and adolescents.
Validity of the Neurology Quality of Life (Neuro-QoL) Measurement System in Adult Epilepsy
Victorson, David; Cavazos, Jose E.; Holmes, Gregory L.; Reder, Anthony T.; Wojna, Valerie; Nowinski, Cindy; Miller, Deborah; Buono, Sarah; Mueller, Allison; Moy, Claudia; Cella, David
2014-01-01
Epilepsy is a chronic neurological disorder that results in recurring seizures and can have a significant adverse effect on health related quality of life (HRQL). Neuro-QoL is an NINDS-funded system of patient reported outcome measures for neurology clinical research, which was designed to provide a precise and standardized way to measure HRQL in epilepsy and other neurological disorders. Using mixed-methods and item response theory-based approaches, we developed generic item banks and targeted scales for adults and children with major neurological disorders. This paper provides empirical results from a clinical validation study with a sample of adults diagnosed with epilepsy. One hundred twenty one people diagnosed with epilepsy participated, of which the majority were male (62%), Caucasian (95%), with a mean age of 47.3 (SD=16.9). Baseline assessments included Neuro-QoL short forms and general and external validity measures. Neuro-QoL short forms that are not typically found in other epilepsy-specific HRQL instruments include Stigma, Sleep Disturbance, Emotional and Behavioral Dyscontrol and Positive Affect & Well-being. Neuro-QoL short forms demonstrated adequate reliability (internal consistency range = .86–.96; test-retest range = .57–.89). Pearson correlations (p<.01) between Neuro-QoL forms of emotional distress (Anxiety, Depression, Stigma) and the QOLIE-31 Emotional Well-being Subscale were in the moderate to strong range (r’s = .66, .71 & .53, respectively), as were relations with the PROMIS Global Mental Health subscale (r’s = .59, .74 & .52, respectively). Moderate correlations were observed between Neuro-QoL Social Role Performance and Satisfaction and the QOLIE-31 Social Function (r’s = .58 & .52, respectively). In measuring aspects of physical function, the Neuro-QoL Mobility and Upper Extremity forms demonstrated moderate associations with the PROMIS Global Physical Function Subscale (r’s = .60 & .61, respectively). 
Neuro-QoL measures of perceived cognitive function (executive function and general concerns) produced moderate to strong correlations with the QOLIE-31 Cognition subscale (r’s = .65 & .75, respectively) and moderate relations with the Liverpool Adverse Events scale (r’s = .51 & .69, respectively). Finally, the Neuro-QoL Fatigue measure demonstrated moderate associations with the QOLIE-31 Energy/Fatigue subscale (r=−.65), Liverpool Adverse Events Scale (r=.69) and the Liverpool Seizure Severity Scale (r=.50). Five Neuro-QoL short forms demonstrated statistically significant responsiveness to change at 5–7 months, including Fatigue, Sleep Disturbance, Depression, Positive Affect & Well-being, and Emotional and Behavioral Dyscontrol. Overall, Neuro-QoL instruments showed good evidence for internal consistency, test-retest reliability, convergent validity and responsiveness to change over several months. These results support the validity of Neuro-QoL to measure HRQL in adults with epilepsy. PMID:24361767
Missbach, Benjamin; Hinterbuchinger, Barbara; Dreiseitl, Verena; Zellhofer, Silvia; Kurz, Carina; König, Jürgen
2015-01-01
The characteristic trait of individuals developing a pathological obsession and preoccupation with healthy foods and a restrictive and avoidant eating behavior is described as orthorexia nervosa (ON). For ON, neither universal diagnostic criteria nor valid tools for large-scale epidemiologic assessment are available in the literature. The aim of the current study is to analyze the psychometric properties of a translated German version of the ORTO-15 questionnaire. The German version of the ORTO-15 and an eating behavior and dieting habits questionnaire were completed by 1029 German-speaking participants (74.6% female) aged between 19 and 70 years (M = 31.21 ± 10.43 years). Our results showed that after confirmatory factor analysis, the best fitting model of the original version is a single-factor structure (9-item shortened version: ORTO-9-GE). The final model showed only moderate internal consistency (Cronbach’s alpha = .67), even after omitting 40% of the original questions. A total of 69.1% of participants showed orthorectic tendencies. Orthorectic tendencies are associated with special eating behavior features (dieting frequency, vegetarian and vegan diet). Education level did not influence ON tendency, and nutrition students did not show higher ON tendency compared to students from other disciplines. This study is the first attempt to translate and to evaluate the psychometric properties of a German version of the ORTO-15 questionnaire. The ORTO-9-GE questionnaire, however, is only a mediocre tool for assessing orthorectic tendencies in individuals and shows moderate reliability and internal consistency. Our research suggests that future studies are needed to provide more reliable and valid assessment tools to investigate orthorexia nervosa. PMID:26280449
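As an illustration of the internal-consistency statistic reported in the abstract above, the following is a minimal sketch of Cronbach's alpha computed from a respondents-by-items score matrix. The function name and the toy data are hypothetical and are not taken from the study; the sketch only shows the standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores).

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    Hypothetical helper for illustration; assumes at least two items,
    at least two respondents, and non-zero variance of the total scores.
    """
    k = len(scores[0])
    # Transpose so that each tuple holds all responses to one item.
    items = list(zip(*scores))
    item_variance_sum = sum(variance(item) for item in items)
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - item_variance_sum / total_variance)


# Toy data (invented): three respondents, two items.
toy_scores = [[2, 1], [1, 2], [3, 3]]
alpha = cronbach_alpha(toy_scores)
print(round(alpha, 4))
```

Sample variances are used for both the items and the totals; using population variances throughout would give the same ratio and therefore the same alpha.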
Yusoff, Muhamad Saiful Bahri; Yaacob, Mohd Jamil; Naing, Nyi Nyi; Esa, Ab Rahman
2013-02-01
This study evaluated the convergent, discriminant, construct, concurrent and discriminative validity of the Medical Student Wellbeing Index (MSWBI), as well as its internal consistency and optimal cut-off total scores to detect at least moderate levels of general psychological distress, stress, anxiety and depression symptoms. A cross-sectional study was done on 171 medical students. The MSWBI and DASS-21 were administered and returned immediately upon completion. Confirmatory factor analysis, reliability analysis, ROC analysis and Pearson correlation test were applied to assess psychometric properties of the MSWBI. A total of 168 (98.2%) medical students responded. The goodness of fit indices showed the MSWBI had a good construct (χ²=6.14, p=0.803, RMSEA<0.001, RMR=0.004, GFI=0.99, AGFI=0.97, CFI=1.00, IFI=1.02, TLI=1.04). The Cronbach's alpha value was 0.69, indicating an acceptable level of internal consistency. Pearson correlation coefficients and ROC analysis suggested each MSWBI item showed adequate convergent and discriminant validity. Its optimal cut-off scores to detect at least moderate levels of general psychological distress, stress, anxiety, and depression were 1.5, 2.5, 1.5 and 2.5 respectively, with sensitivity and specificity ranging from 62 to 80% and the areas under the ROC curve ranging from 0.71 to 0.83. This study showed that the MSWBI had a good level of psychometric properties. An MSWBI score of more than 2 can be considered to indicate significant psychological distress. The MSWBI is a valid and reliable screening instrument to assess psychological distress of medical students. Copyright © 2012 Elsevier B.V. All rights reserved.
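The abstract above reports optimal cut-off scores with their sensitivity and specificity but does not say how the optimum was chosen. The sketch below assumes one common criterion, Youden's J (sensitivity + specificity - 1), and treats a score strictly above the cut-off as a positive screen. The function names and the toy data are hypothetical and do not reproduce the study's analysis.

```python
def sensitivity_specificity(scores, has_condition, cutoff):
    """Sensitivity and specificity when score > cutoff counts as a positive screen."""
    pairs = list(zip(scores, has_condition))
    true_pos = sum(1 for s, y in pairs if y and s > cutoff)
    false_neg = sum(1 for s, y in pairs if y and s <= cutoff)
    true_neg = sum(1 for s, y in pairs if not y and s <= cutoff)
    false_pos = sum(1 for s, y in pairs if not y and s > cutoff)
    return true_pos / (true_pos + false_neg), true_neg / (true_neg + false_pos)


def best_cutoff(scores, has_condition, candidates):
    """Pick the candidate cut-off that maximizes Youden's J (an assumed criterion)."""
    def youden_j(cutoff):
        sens, spec = sensitivity_specificity(scores, has_condition, cutoff)
        return sens + spec - 1
    return max(candidates, key=youden_j)


# Toy data (invented): screening scores and reference-standard labels.
toy_scores = [0, 1, 2, 3, 4, 5]
toy_labels = [False, False, False, True, True, True]
print(best_cutoff(toy_scores, toy_labels, [0.5, 1.5, 2.5, 3.5]))
```

Half-point candidates such as 1.5 or 2.5 arise naturally for integer-valued scores, because any threshold between two adjacent integers classifies the sample identically.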
Hillen, Marij A; Postma, Rosa-May; Verdam, Mathilde G E; Smets, Ellen M A
2017-03-01
The original 18-item, four-dimensional Trust in Oncologist Scale assesses cancer patients' trust in their oncologist. The current aim was to develop and validate a short form version of the scale to enable more efficient assessment of cancer patients' trust. Existing validation data of the full-length Trust in Oncologist Scale were used to create a short form of the Trust in Oncologist Scale. The resulting short form was validated in a new sample of cancer patients (n = 92). Socio-demographics, medical characteristics, trust in the oncologist, satisfaction with communication, trust in healthcare, willingness to recommend the oncologist to others and to contact the oncologist in case of questions were assessed. Internal consistency, reliability, convergent and structural validity were tested. The five-item Trust in Oncologist Scale Short Form was created by selecting the statistically best performing item from each dimension of the original scale, to ensure content validity. Mean trust in the oncologist was high in the validation sample (response rate 86%, M = 4.30, SD = 0.98). Exploratory factor analyses supported one-dimensionality of the short form. Internal consistency was high, and temporal stability was moderate. Initial convergent validity was suggested by moderate correlations between trust scores and associated constructs. The Trust in Oncologist Scale Short Form appears to efficiently, reliably and validly measure cancer patients' trust in their oncologist. It may be used in research and as a quality indicator in clinical practice. More thorough validation of the scale is recommended to confirm this initial evidence of its validity.
Kaufman, Denise R; Puckett, Mallory J; Smith, Mitchell J; Wilson, Kyle S; Cheema, Rebecca; Landers, Merrill R
2014-08-01
The purpose of this study was to establish reliability and responsiveness of the dynamic visual acuity test (DVAT) at head speeds of 150-200 degrees per second (deg/s) and the gaze stabilization test (GST) in high school and college football players. Reliability design. Fifty high school and college football athletes completed the DVAT and GST in both the yaw (horizontal) and pitch (vertical) planes twice within two weeks. Test-retest reliability for the DVAT was good in yaw, Intraclass Correlation Coefficient (ICC) = 0.770, and moderate/good in pitch, ICC = 0.725. Minimal detectable change (MDC) was 0.16 logMAR for yaw and 0.21 logMAR for pitch. GST reliability was moderate in yaw, ICC = 0.634, and poor in pitch, ICC = 0.411. MDCs were 73.4 deg/s (yaw) and 81.2 deg/s (pitch). The DVAT is reliable at high head speeds in high school and college football athletes in both yaw and pitch. GST head speeds were higher than previously reported in the literature, but reliability of this tool for this population was poor to moderate. From a clinical perspective, DVAT may be reliably used in the assessment of high school and college football athletes; however, GST requires further evaluation. Copyright © 2013 Elsevier Ltd. All rights reserved.
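The abstract above reports minimal detectable change (MDC) values alongside ICCs but does not give the formula used. A widely used convention, assumed here, derives the standard error of measurement as SEM = SD * sqrt(1 - ICC) and the 95% MDC as 1.96 * sqrt(2) * SEM. The function names and the input values below are hypothetical; the sketch is not claimed to reproduce the reported MDCs.

```python
import math


def standard_error_of_measurement(sd, icc):
    """SEM from the between-subject SD and a reliability coefficient (assumed formula)."""
    return sd * math.sqrt(1.0 - icc)


def minimal_detectable_change(sd, icc, z=1.96):
    """MDC at the confidence level implied by z (1.96 for 95%).

    The sqrt(2) factor accounts for measurement error in both test sessions.
    """
    return z * math.sqrt(2.0) * standard_error_of_measurement(sd, icc)


# Hypothetical inputs: SD of 1.0 in the score's own units and ICC of 0.75.
print(round(minimal_detectable_change(1.0, 0.75), 3))
```

Under this convention a lower ICC inflates the MDC, which is consistent with the pattern in the abstract, where the plane with the lower ICC has the larger MDC for each test.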
Kim, Jin Goo; Lee, Joong Yub; Seo, Seung Suk; Choi, Choong Hyeok; Lee, Myung Chul
2013-01-01
Purpose To perform a cross-cultural adaptation and to test the measurement properties of the Korean version of International Knee Documentation Committee (K-IKDC) Subjective Knee Form. Materials and Methods According to the guidelines for cross-cultural adaptation, translation and backward translation of the English version of the IKDC Subjective Knee Form were performed. After translation into the Korean version, 150 patients who had knee-related problems were asked to complete the K-IKDC, Lysholm score, and Short Form-36 (SF-36). Of these patients, 126 were retested 2 weeks later to evaluate test-retest reliability, and 104 were recruited 3 months later to evaluate responsiveness. Construct validity was analyzed by investigating the correlation with Lysholm score and SF-36; content validity was also evaluated. Standardized mean response was calculated for evaluating responsiveness. Results The test-retest reliability proved excellent with a high value for the intraclass correlation coefficient (r=0.94). The internal consistency was strong (Cronbach's α=0.91). Good content validity with absence of floor or ceiling effects and good convergent and divergent validity were observed. Moderate responsiveness was shown (standardized mean response=0.689). Conclusions The K-IKDC demonstrated good measurement properties. We suggest that this instrument is an excellent evaluation instrument that can be used for Korean patients with knee-related injuries. PMID:24032098
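The responsiveness index named in the abstract above (standardized mean response, more often called the standardized response mean) is usually defined as the mean change score divided by the standard deviation of the change scores. The sketch below assumes that definition; the function name and the toy scores are hypothetical and are not the study's data.

```python
from statistics import mean, stdev


def standardized_response_mean(baseline, follow_up):
    """Mean of paired change scores divided by their sample standard deviation."""
    changes = [after - before for before, after in zip(baseline, follow_up)]
    return mean(changes) / stdev(changes)


# Toy data (invented): scores for three patients at baseline and at follow-up.
toy_baseline = [10, 20, 30]
toy_follow_up = [12, 24, 33]
print(standardized_response_mean(toy_baseline, toy_follow_up))
```

Because the denominator is the variability of the change itself, the index can be large even for a small mean change when patients change by similar amounts.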
Gordt, Katharina; Mikolaizak, A Stefanie; Nerz, Corinna; Barz, Carolin; Gerhardy, Thomas; Weber, Michaela; Becker, Clemens; Schwenk, Michael
2018-02-12
Tools to detect subtle balance deficits in high-functioning community-dwelling older adults are lacking. The Community Balance and Mobility Scale (CBM) is a valuable tool to measure balance deficits in this group; however, it is not yet available in the German language. The aim was 1) to translate and cross-culturally adapt the CBM into the German language and 2) to investigate the measurement properties of the German CBM (G-CBM). The original CBM was translated into the German language according to established guidelines. A total of 51 older adults (mean age 69.9 ± 7.1 years) were recruited to measure construct validity by comparing the G‑CBM against standardized balance and/or mobility assessments including the Fullerton Advanced Balance Scale (FAB), Berg Balance Scale (BBS), 3 m Tandem Walk (3MTW), 8 Level Balance Scale (8LBS), 30 s Chair Stand Test (30CST), Timed Up and Go (TUG) test, gait speed, and the Falls Efficacy Scale International (FES-I). Intrarater and interrater reliability and internal consistency reliability were estimated using intraclass correlations (ICC) and Cronbach's alpha, respectively. Ceiling effects were calculated as the percentage of the sample scoring the maximum score. The G‑CBM correlated excellently with FAB and BBS (ρ = 0.78-0.85; P < 0.001), well with 3MTW, TUG, and FES-I (ρ = -0.55 to -0.61; P < 0.001), and moderately with 8LBS, 30CST, and habitual gait speed (ρ = 0.32-0.46; P < 0.001). Intrarater (ICC 3,k = 0.998; P < 0.001) and interrater (ICC 2,k = 0.996; P < 0.001) reliability, and internal consistency reliability (α = 0.998) were also high. The G‑CBM did not show ceiling effects. The G‑CBM is a valid and reliable tool for measuring subtle balance deficits in older high-functioning adults. The absence of ceiling effects emphasizes the use of this scale in this cohort. The G‑CBM can now be utilized in clinical practice.
HIDECKER, MARY JO COOLEY; PANETH, NIGEL; ROSENBAUM, PETER L; KENT, RAYMOND D; LILLIE, JANET; EULENBERG, JOHN B; CHESTER, KEN; JOHNSON, BRENDA; MICHALSEN, LAUREN; EVATT, MORGAN; TAYLOR, KARA
2011-01-01
Aim The purpose of this study was to create and validate a Communication Function Classification System (CFCS) for children with cerebral palsy (CP) that can be used by a wide variety of individuals who are interested in CP. This paper reports the content validity, interrater reliability, and test–retest reliability of the CFCS for children with CP. Method An 11-member development team created comprehensive descriptions of the CFCS levels, and four nominal groups comprising 27 participants critiqued these levels. Within a Delphi survey, 112 participants commented on the clarity and usefulness of the CFCS. Interrater reliability was completed by 61 professionals and 68 parents/relatives who classified 69 children with CP aged 2 to 18 years. Test–retest reliability was completed by 48 professionals who allowed at least 2 weeks between classifications. The participants who assessed the CFCS were all relevant stakeholders: adults with CP, parents of children with CP, educators, occupational therapists, physical therapists, physicians, and speech–language pathologists. Results The interrater reliability of the CFCS was 0.66 between two professionals and 0.49 between a parent and a professional. Professional interrater reliability improved to 0.77 for classification of children older than 4 years. The test–retest reliability was 0.82. Interpretation The CFCS demonstrates content validity and shows very good test–retest reliability, good professional interrater reliability, and moderate parent–professional interrater reliability. Combining the CFCS with the Gross Motor Function Classification System and the Manual Ability Classification System contributes to a functional performance view of daily life for individuals with CP, in accordance with the World Health Organization’s International Classification of Functioning, Disability and Health. PMID:21707596
Translation, Adaptation and Cross Language Validation of Tinnitus Handicap Inventory in Urdu.
Aqeel, Muhammad; Ahmed, Ammar
2017-12-01
Tinnitus is characterized as the perception of numerous auditory sounds in the absence of an external stimulus. Tinnitus can have considerable consequences for a person's quality of life and is considered very difficult to quantify. The aim of this study was to investigate the reliability and validity of the Urdu translation of the Tinnitus Handicap Inventory (THI) in Pakistan. It was designed to assess the presence of various auditory sounds without an external stimulus. The scale consisted of 25 items in three subscales: functional, emotional, and catastrophic. The study comprised two stages, a preliminary study and a main study. The results of the preliminary study revealed that the overall scale had high internal consistency [alpha coefficient of the Urdu version of the THI (THI-U) = 0.99, alpha coefficient of the English version of the THI = 0.98]. The overall scale had a test-retest correlation of 0.99 over a fifteen-day interval. The main study was performed on 110 tinnitus patients. The results of the main study showed that the internal consistency reliability of the Urdu version was α = 0.93. The THI-U and its subscales demonstrated good internal consistency reliability (α = 0.81 to 0.86). High to moderate correlations were noted between tinnitus symptom ratings. A confirmatory factor analysis was used to validate the three subscales of the THI-U; high inter-correlations were found between the subscales, and the results revealed that a three-factor model for the THI-U was most tenable. The confirmatory factor analysis thus supported the three subscales of the THI-U. The THI-U might present important information about precise facets of tinnitus distress along with diagnostic interviews in clinical practice.
Camargo, Diana Marina; Santisteban, Stefany; Paredes, Erika; Flórez, Mary Ann; Bueno, Diego
2015-01-01
International recommendations for physical activity and time spent in sedentary behaviors for children in their early years require the availability of measuring instruments with psychometric properties that allow for the assessment of population dynamics and interventions to improve health. To evaluate the reliability of a questionnaire to measure physical activity and sedentary behaviors in children from preschool to fourth grade. One hundred and eight parents answered the questionnaire. The instrument included socio-demographic variables, as well as those associated with physical activity, including time walking to school, organized sports and playtime activities. Sedentary behaviors included motorized transport to school, reading and "screen time", sleeping and extracurricular courses. Internal consistency, reproducibility and agreement were evaluated using Cronbach's alpha coefficient, the Intraclass Correlation Coefficient (ICC) and the Bland and Altman limits of agreement method, respectively. Internal consistency for physical activity ranged from 0.59 to 0.64, and for sedentary behaviors between 0.22 and 0.34. The highest reproducibility was found for walking to school and time spent on this (kappa=0.79, ICC 0.69), and organized sports, and time on this activity (kappa=0.72, ICC 0.76). Among sedentary behaviors, motorized transport to school and computer use showed kappas of 0.82 and 0.71, respectively; additionally, the time spent on these behaviors showed an ICC of 0.8 and 0.59, respectively. We found limits of agreement between moderate and good for reading time, napping, extracurricular courses, computer and console use. The questionnaire provided reliable information on the physical activity and sedentary behaviors in children under 10 years of age and could be used in other Latin American countries.
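The Bland and Altman limits of agreement mentioned in the abstract above are conventionally computed as the mean difference between two administrations plus or minus 1.96 standard deviations of the differences. The sketch below assumes that convention; the function name and the toy values are hypothetical and are not drawn from the questionnaire data.

```python
from statistics import mean, stdev


def limits_of_agreement(first, second, z=1.96):
    """Return (lower limit, bias, upper limit) for paired measurements.

    Bias is the mean of the paired differences; the limits are
    bias -/+ z times the sample standard deviation of the differences.
    """
    differences = [a - b for a, b in zip(first, second)]
    bias = mean(differences)
    half_width = z * stdev(differences)
    return bias - half_width, bias, bias + half_width


# Toy data (invented): minutes reported on two administrations of a questionnaire.
toy_first = [10, 12, 14]
toy_second = [9, 12, 15]
print(limits_of_agreement(toy_first, toy_second))
```

Narrow limits centered near zero indicate that the two administrations can be used interchangeably; whether a given width is acceptable is a judgment specific to the behavior being measured.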
Ferreira, Mariana Cândido; Björklund, Martin; Dach, Fabiola; Chaves, Thais Cristina
The purpose of this study was to adapt and evaluate the psychometric properties of the ProFitMap-neck to Brazilian Portuguese. The cross-cultural adaptation consisted of 5 stages, and 180 female patients with chronic neck pain participated in the study. A subsample (n = 30) answered the pretest, and another subsample (n = 100) answered the questionnaire a second time. Internal consistency, test-retest reliability, and construct validity (hypothesis testing and structural validity) were estimated. For construct validity, the scores of the questionnaire were correlated with the Neck Disability Index (NDI), and the Hospital Anxiety and Depression Scale (HADS), the Tampa Scale of Kinesiophobia (TSK), and the 36-item Short-Form Health Survey (SF-36). Internal consistency was determined by adequate Cronbach's α values (α > 0.70). Strong reliability was identified by high intraclass correlation coefficients (ICC > 0.75). Construct validity was identified by moderate and strong correlations of the Br-ProFitMap-neck with total NDI score (-0.56
Deb, Shoumitro; Bryant, Eleanor; Morris, Paul G; Prior, Lindsay; Lewis, Glyn; Haque, Sayeed
2007-06-01
To develop a measure to assess post-acute outcome following traumatic brain injury (TBI) with particular emphasis on the emotional and the behavioral outcome. The second objective was to assess the test-retest reliability, internal consistency, and factor structure of the newly developed patient version of the Head Injury Participation Scale (P-HIPS) and Patient-Head Injury Neurobehavioral Scale (P-HINAS). Thirty-two TBI individuals and 27 carers took part in in-depth qualitative interviews exploring the consequences of the TBI. Interview transcripts were analyzed and key themes and concepts were used to construct the 49-item P-HIPS. A postal survey was then conducted on a cohort of 113 TBI patients to 'field test' the P-HIPS and the P-HINAS. All 49 individual items of the P-HIPS and their total score showed good test-retest reliability (0.93) and internal consistency (0.95). The P-HIPS showed a very good correlation with the Mayo Portland Adaptability Inventory-3 (MPAI-3) (0.87) and a moderate negative correlation with the Glasgow Outcome Scale-Extended (GOSE) (-0.51). Factor analysis extracted the following domains: 'Emotion/Behavior,' 'Independence/Community Living,' 'Cognition' and 'Physical'. The 'Emotion/Behavior' factor constituted the P-HINAS, which showed good internal consistency (0.93), test-retest reliability (0.91) and concurrent validity with the MPAI subscale (0.82). Both the P-HIPS and the P-HINAS show strong psychometric properties. The qualitative methodology employed in the construction stage of the questionnaires provided good evidence of face and content validity.
Deb, Shoumitro; Bryant, Eleanor; Morris, Paul G; Prior, Lindsay; Lewis, Glyn; Haque, Sayeed
2007-01-01
Objective To develop a measure to assess post-acute outcome following traumatic brain injury (TBI) with particular emphasis on the emotional and the behavioral outcome. The second objective was to assess the test–retest reliability, internal consistency, and factor structure of the newly developed patient version of the Head Injury Participation Scale (P-HIPS) and Patient-Head Injury Neurobehavioral Scale (P-HINAS). Method Thirty-two TBI individuals and 27 carers took part in in-depth qualitative interviews exploring the consequences of the TBI. Interview transcripts were analyzed and key themes and concepts were used to construct the 49-item P-HIPS. A postal survey was then conducted on a cohort of 113 TBI patients to ‘field test’ the P-HIPS and the P-HINAS. Results All 49 individual items of the P-HIPS and their total score showed good test–retest reliability (0.93) and internal consistency (0.95). The P-HIPS showed a very good correlation with the Mayo Portland Adaptability Inventory-3 (MPAI-3) (0.87) and a moderate negative correlation with the Glasgow Outcome Scale-Extended (GOSE) (−0.51). Factor analysis extracted the following domains: ‘Emotion/Behavior,’ ‘Independence/Community Living,’ ‘Cognition’ and ‘Physical’. The ‘Emotion/Behavior’ factor constituted the P-HINAS, which showed good internal consistency (0.93), test–retest reliability (0.91) and concurrent validity with the MPAI subscale (0.82). Conclusions Both the P-HIPS and the P-HINAS show strong psychometric properties. The qualitative methodology employed in the construction stage of the questionnaires provided good evidence of face and content validity. PMID:19300568
Perceived barriers to walking for physical activity.
Dunton, Genevieve F; Schneider, Margaret
2006-10-01
Although the health benefits of walking for physical activity have received increasing research attention, barriers specific to walking are not well understood. In this study, questions to measure barriers to walking for physical activity were developed and tested among college students. The factor structure, test-retest and internal consistency reliability, and discriminant and criterion validity of the perceived barriers were evaluated. A total of 305 undergraduate students participated. Participants had a mean age (+/- SD) of 20.6 (+/- 3.02) years, and 70.3% were female. Participants responded to a questionnaire assessing barriers specific to walking for physical activity. Perceived barriers to vigorous exercise, walking for transportation and recreation, and participation in lifestyle activities (such as taking the stairs instead of the elevator) were also assessed. Subsamples completed the walking barriers instrument a second time after 5 days in order to determine test-retest reliability (n = 104) and wore an accelerometer to measure moderate-intensity physical activity (n = 85). Factor analyses confirmed the existence of three factors underlying the perceived barriers to walking questions: appearance (four items), footwear (three items), and situation (three items). Appearance and situational barriers demonstrated acceptable reliability, discriminant validity, and relations with physical activity criteria. After we controlled for barriers to vigorous exercise, appearance and situational barriers to walking explained additional variation in objectively-measured moderate physical activity. The prediction of walking for physical activity, especially walking that is unstructured and spontaneous, may be improved by considering appearance and situational barriers. Assessing barriers specific to walking may have important implications for interventions targeting walking as means for engaging in physical activity.
The quality of information on the Internet on orthodontic retainer wear: a cross-sectional study.
Doğramacı, Esma J; Rossi-Fedele, Giampiero
2016-03-01
The objectives of this study were to assess the accessibility, usability, reliability and quality of information on the Internet written for the lay public about orthodontic retainers, and to elucidate the different retention protocols encouraged. A cross-sectional, observational study. Online, using a computer connected to the Internet in Australia. Two search terms, 'orthodontic retainer' and 'how long should someone wear a retainer after their braces are removed?', were entered alternately into five search engines. Twenty results for each search term per search engine that fulfilled the inclusion criteria were evaluated in terms of accessibility, usability, reliability and quality of information using the LIDA and DISCERN instruments, ensuring there were no internal or cross-search engine duplicates. Any information about frequency and duration of retainer wear was also collected. Two hundred different websites were identified and assessed. The median overall LIDA score was 72%, corresponding to a moderate quality level. The median total DISCERN score was 47%. Twenty-two websites recommended patients adhere to the specific protocol prescribed to them by their practitioner. There were 45 (22.5%) and 28 (14%) websites advising indefinite use of removable and bonded retainers respectively. Information about retainers on the Internet is easily accessible and usable, though the quality of the content is generally of a moderate level. However, the information is not always accurate and reliable. Both full-time and part-time wear of removable retainers were suggested over greatly varying time periods. Indefinite wear of removable and bonded retainers was also advocated.
Are validated outcome measures used in distal radial fractures truly valid?
Nienhuis, R. W.; Bhandari, M.; Goslings, J. C.; Poolman, R. W.; Scholtes, V. A. B.
2016-01-01
Objectives Patient-reported outcome measures (PROMs) are often used to evaluate the outcome of treatment in patients with distal radial fractures. Which PROM to select is often based on assessment of measurement properties, such as validity and reliability. Measurement properties are assessed in clinimetric studies, and results are often reviewed without considering the methodological quality of these studies. Our aim was to systematically review the methodological quality of clinimetric studies that evaluated measurement properties of PROMs used in patients with distal radial fractures, and to make recommendations for the selection of PROMs based on the level of evidence of each individual measurement property. Methods A systematic literature search was performed in PubMed, EMbase, CINAHL and PsycINFO databases to identify relevant clinimetric studies. Two reviewers independently assessed the methodological quality of the studies on measurement properties, using the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist. Level of evidence (strong / moderate / limited / lacking) for each measurement property per PROM was determined by combining the methodological quality and the results of the different clinimetric studies. Results In all, 19 out of 1508 identified unique studies were included, in which 12 PROMs were rated. The Patient-rated wrist evaluation (PRWE) and the Disabilities of Arm, Shoulder and Hand questionnaire (DASH) were evaluated on most measurement properties. The evidence for the PRWE is moderate that its reliability, validity (content and hypothesis testing), and responsiveness are good. The evidence is limited that its internal consistency and cross-cultural validity are good, and its measurement error is acceptable. There is no evidence for its structural and criterion validity. The evidence for the DASH is moderate that its responsiveness is good. 
The evidence is limited that its reliability and the validity on hypothesis testing are good. There is no evidence for the other measurement properties. Conclusion According to this systematic review, there is, at best, moderate evidence that the responsiveness of the PRWE and DASH are good, as are the reliability and validity of the PRWE. We recommend these PROMs in clinical studies in patients with distal radial fractures; however, more clinimetric studies of higher methodological quality are needed to adequately determine the other measurement properties. Cite this article: Dr Y. V. Kleinlugtenbelt. Are validated outcome measures used in distal radial fractures truly valid?: A critical assessment using the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN) checklist. Bone Joint Res 2016;5:153–161. DOI: 10.1302/2046-3758.54.2000462. PMID:27132246
Jones, Sydney A; Evenson, Kelly R; Johnston, Larry F; Trost, Stewart G; Samuel-Hodge, Carmen; Jewell, David A; Kraschnewski, Jennifer L; Keyserling, Thomas C
2015-01-01
This study explored the criterion-related validity and test-retest reliability of the modified RESIDential Environment physical activity questionnaire and whether the instrument's validity varied by body mass index, education, race/ethnicity, or employment status. Validation study using baseline data collected for randomized trial of a weight loss intervention. Participants recruited from health departments wore an ActiGraph accelerometer and self-reported non-occupational walking, moderate and vigorous physical activity on the modified RESIDential Environment questionnaire. We assessed validity (n=152) using Spearman correlation coefficients, and reliability (n=57) using intraclass correlation coefficients. When compared to steps, moderate physical activity, and bouts of moderate/vigorous physical activity measured by accelerometer, these questionnaire measures showed fair evidence for validity: recreational walking (Spearman correlation coefficients 0.23-0.36), total walking (Spearman correlation coefficients 0.24-0.37), and total moderate physical activity (Spearman correlation coefficients 0.18-0.36). Correlations for self-reported walking and moderate physical activity were higher among unemployed participants and women with lower body mass indices. Generally no other variability in the validity of the instrument was found. Evidence for reliability of RESIDential Environment measures of recreational walking, total walking, and total moderate physical activity was substantial (intraclass correlation coefficients 0.56-0.68). Evidence for questionnaire validity and reliability varied by activity domain and was strongest for walking measures. The questionnaire may capture physical activity less accurately among women with higher body mass indices and employed participants. Capturing occupational activity, specifically walking at work, may improve questionnaire validity. Copyright © 2014 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Lupi, Jaqueline Basilio; Carvalho de Abreu, Daniela Cristina; Ferreira, Mariana Candido; Oliveira, Renê Donizeti Ribeiro de; Chaves, Thais Cristina
2017-08-01
This study aimed to culturally adapt and validate the Revised Fibromyalgia Impact Questionnaire (FIQR) to Brazilian Portuguese, by the use of analysis of internal consistency, reliability, and construct and structural validity. A total of 100 female patients with fibromyalgia participated in the validation process of the Brazilian Portuguese version of the FIQR (FIQR-Br). The intraclass correlation coefficient (ICC) was used for statistical analysis of reliability (test-retest), Cronbach's alpha for internal consistency, Pearson's rank correlation for construct validity, and confirmatory factor analysis (CFA) for structural validity. Excellent levels of reliability were verified, with ICC greater than 0.75 for all questions and domains of the FIQR-Br. For internal consistency, alpha values greater than 0.70 for the items and domains of the questionnaire were observed. Moderate (0.40 < r < 0.70) and strong (r > 0.70) correlations were observed for the scores of domains and total score between the FIQR-Br and FIQ-Br. The structure of the three domains of the FIQR-Br was confirmed by CFA. The results of this study suggest that the FIQR-Br is a reliable and valid instrument for assessing fibromyalgia-related impact, and support its use in clinical settings and research. Implications for Rehabilitation: Fibromyalgia is a chronic musculoskeletal disorder characterized by widespread and diffuse pain, fatigue, sleep disturbances, and depression. The disease significantly impairs patients' quality of life and can be highly disabling. To be used in multicenter research efforts, the Revised Fibromyalgia Impact Questionnaire (FIQR) must be cross-culturally validated and psychometrically tested. This paper will make available a new version of the FIQR-Br since another version already exists, but there are concerns about its measurement properties.
The availability of an instrument adapted to and validated for Brazilian Portuguese may make it possible to reliably verify the effects of rehabilitation programs on disability from fibromyalgia. The FIQR-Br showed results comparable with other versions of the FIQR in other languages, thereby enabling comparison of effects of rehabilitation interventions on disability from fibromyalgia conducted in Brazil with results of studies carried out in other parts of the world.
Wight, Richard G.; LeBlanc, Allen J.; Meyer, Ilan H.; Harig, Frederick A.
2015-01-01
Objective: In this paper we introduce the construct of “internalized gay ageism,” or the sense that one feels denigrated or depreciated because of aging in the context of a gay male identity, which we identify as an unexplored aspect of sexual minority stress specific to midlife and older gay-identified men. Methods: Using a social stress process framework, we examine the association between internalized gay ageism and depressive symptoms, and whether one’s sense of mattering mediates or moderates this association, controlling for three decades of depressive symptom histories. The sample is 312 gay-identified men (average age = 60.7 years, range = 48 – 78, 61% HIV-negative) participating in the Multicenter AIDS Cohort Study (MACS) since 1984/85, one of the largest and longest running studies of the natural history of HIV/AIDS in the U.S., who provided contemporary (2012/13) reports of stress experiences. Results: We find that internalized gay ageism can reliably be measured among these men, is positively associated with depressive symptoms net of an array of other factors that may also influence symptomatology (including depressive symptom histories), and mattering partially mediates but does not moderate its effect on depressive symptoms. Conclusion: Midlife and older gay men have traversed unparalleled historical changes across their adult lives and have paved the way for younger generations of sexual minorities to live in a time of less institutionalized discrimination. Still, they are at distinct risk for feeling socially invisible and devalued in their later years. PMID:26588435
Muñoz, Gerard; Buxó, Maria; de Gracia, Javier; Olveira, Casilda; Martinez-Garcia, Miguel Angel; Giron, Rosa; Polverino, Eva; Alvarez, Antonio; Birring, Surinder S; Vendrell, Montserrat
2016-05-01
The Leicester Cough Questionnaire (LCQ) has been validated in non-cystic fibrosis bronchiectasis (NCFBC). The present study aimed to create and validate a Spanish version of the LCQ (LCQ-Sp) in NCFBC. The LCQ-Sp was developed following a standardized protocol. For reliability, we assessed internal consistency and the change in score over a 15-day period in stable state. For responsiveness, we assessed the change in scores between visit 1 and the first exacerbation. For validity, we evaluated convergent validity through correlation with the Saint George's Respiratory Questionnaire (SGRQ) and discriminant validity. Two hundred fifty-nine patients (118 mild bronchiectasis, 90 moderate bronchiectasis and 47 severe bronchiectasis) were included. Internal consistency was high for the total scoring and good for the different domains (Cronbach's α: 0.86-0.91). The test-retest reliability shows an intraclass correlation coefficient of 0.87 for the total score. The mean LCQ-Sp score at visit 1 decreased at the beginning of an exacerbation (15.13 ± 4.06 vs. 12.24 ± 4.64; p < 0.001). The correlation between LCQ-Sp and SGRQ scores was -0.66 (p < 0.01). The differences in the LCQ-Sp total score between the different groups of severity were significant (p < 0.001). The LCQ-Sp discriminates disease severity, is responsive to change when faced with exacerbations and is reliable for use in bronchiectasis. © The Author(s) 2016.
Ghirardelli, Alyssa; Quinn, Valerie; Sugerman, Sharon
2011-01-01
To develop a retail grocery instrument with weighted scoring to be used as an indicator of the food environment. Twenty six retail food stores in low-income areas in California. Observational. Inter-rater reliability for grocery store survey instrument. Description of store scoring methodology weighted to emphasize availability of healthful food. Type A intra-class correlation coefficients (ICC) with absolute agreement definition or a κ test for measures using ranges as categories. Measures of availability and price of fruits and vegetables performed well in reliability testing (κ = 0.681-0.800). Items for vegetable quality were better than for fruit (ICC 0.708 vs 0.528). Kappa scores indicated low to moderate agreement (0.372-0.674) on external store marketing measures and higher scores for internal store marketing. "Next to" the checkout counter was more reliable than "within 6 feet." Health departments using the store scoring system reported it as the most useful communication of neighborhood findings. There was good reliability of the measures among the research pairs. The local store scores can show the need to bring in resources and to provide access to fruits and vegetables and other healthful food. Copyright © 2011 Society for Nutrition Education. Published by Elsevier Inc. All rights reserved.
Validation of the Physical Activity Questionnaire for Older Children (PAQ-C) among Chinese Children.
Wang, Jing Jing; Baranowski, Tom; Lau, Wc Patrick; Chen, Tzu An; Pitkethly, Amanda Jane
2016-03-01
This study initially validates the Chinese version of the Physical Activity Questionnaire for Older Children (PAQ-C), which has been identified as a potentially valid instrument to assess moderate-to-vigorous physical activity (MVPA) in children among diverse racial groups. The psychometric properties of the PAQ-C with 742 Hong Kong Chinese children were assessed with the scale's internal consistency, reliability, test-retest reliability, confirmatory factor analysis (CFA) in the overall sample, and multistep invariance tests across gender groups as well as convergent validity with body mass index (BMI), and an accelerometry-based MVPA. The Cronbach alpha coefficient (α=0.79), composite reliability value (ρ=0.81), and the intraclass correlation coefficient (α=0.82) indicate the satisfactory reliability of the PAQ-C score. The CFA indicated data fit a single factor model, suggesting that the PAQ-C measures only one construct, on MVPA over the previous 7 days. The multiple-group CFAs suggested that the factor loadings and variances and covariances of the PAQ-C measurement model were invariant across gender groups. The PAQ-C score was related to accelerometry-based MVPA (r=0.33) and inversely related to BMI (r=-0.18). This study demonstrates the reliability and validity of the PAQ-C in Chinese children. Copyright © 2016 The Editorial Board of Biomedical and Environmental Sciences. Published by China CDC. All rights reserved.
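The Cronbach alpha coefficient reported above is a standard internal-consistency statistic. As a minimal sketch of how such a value is commonly computed (this is the textbook formula, not the authors' analysis code; the function name `cronbach_alpha` and the input layout are assumptions made for illustration):

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents.

    rows: list of respondents, each a list of k item scores
    (hypothetical layout chosen for this sketch).
    """
    if len(rows) < 2 or len(rows[0]) < 2:
        raise ValueError("need at least 2 respondents and 2 items")
    k = len(rows[0])
    # Sample variance of each item across respondents.
    item_vars = [variance([row[i] for row in rows]) for i in range(k)]
    # Sample variance of the summed scale score.
    total_var = variance([sum(row) for row in rows])
    # alpha = k/(k-1) * (1 - sum of item variances / variance of total)
    return k / (k - 1) * (1 - sum(item_vars) / total_var)
```

The function assumes complete data with no missing item responses and a non-zero variance of the total score; real analyses typically handle both cases explicitly.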
Goossens, Peter J J; Beentjes, Titus A A; Knol, Suzanne; Salyers, Michelle P; de Vries, Sjoerd J
2017-12-01
The Illness Management and Recovery scales (IMRS) can measure the progress of clients' illness self-management and recovery. Previous studies have examined the psychometric properties of the IMRS. This study examined the reliability and validity of the Dutch version of the IMRS. Clients (n = 111) and clinicians (n = 40) completed the client and clinician versions of the IMRS, respectively. The scales were administered again 2 weeks later to assess stability over time. Validity was assessed with the Utrecht Coping List (UCL), Dutch Empowerment Scale (DES), and Brief Symptom Inventory (BSI). The client and clinician versions of the IMRS had moderate internal reliability, with α = 0.69 and 0.71, respectively. The scales showed strong test-retest reliability, r = 0.79, for the client version and r = 0.86 for the clinician version. Correlations between client and clinician versions ranged from r = 0.37 to 0.69 for the total and subscales. We also found relationships in expected directions between the client IMRS and UCL, DES and BSI, which supports validity of the Dutch version of the IMRS. The Dutch version of the IMRS demonstrated good reliability and validity. The IMRS could be useful for Dutch-speaking programs interested in evaluating client progress on illness self-management and recovery.
Pediatric Amblyopia Risk Investigation Study (PARIS).
Savage, Howard I; Lee, Hester H; Zaetta, Deneen; Olszowy, Ronald; Hamburger, Ellie; Weissman, Mark; Frick, Kevin
2005-12-01
To assess the learning curve, testability, and reliability of vision screening modalities administered by pediatric health extenders. Prospective masked clinical trial. Two hundred subjects aged 3 to 6 underwent timed screening for amblyopia by physician extenders, including LEA visual acuity (LEA), stereopsis (RDE), and noncycloplegic autorefraction (NCAR). Patients returned for a comprehensive diagnostic eye examination performed by an ophthalmologist or optometrist. Average screening time was 5.4 +/- 1.6 minutes (LEA), 1.9 +/- 0.9 minutes (RDE), and 1.7 +/- 1.0 minutes (NCAR). Test time for NCAR and RDE fell by 40% during the study period. Overall testability was 92% (LEA), 96% (RDE), and 94% (NCAR). Testability among 3-year-olds was 73% (LEA), 96% (RDE), and 89% (NCAR). Reliability of LEA was moderate (r = .59). Reliability of NCAR was high for astigmatism (Cyl) (r = .89), moderate for spherical equivalent (SE) (r = .66), and low for anisometropia (ANISO) (r = .38). Correlation of cycloplegic autorefraction (CAR) with gold standard cycloplegic retinoscopic refraction (CRR) was very high for SE (.85), CYL (.77), and moderate for ANISO (.48). With NCAR, physician extenders can quickly and reliably detect astigmatism and spherical refractive error in one-third the time it takes to obtain visual acuity. LEA has a lower initial cost, but is time consuming, moderately reliable, and more difficult for 3-year-olds. Shorter examination time and higher reliability may make NCAR a more efficient screening tool for refractive amblyopia in younger children. Future study is needed to determine the sensitivity and specificity of NCAR and other screening methods in detecting amblyopia and amblyopia risk factors.
ERIC Educational Resources Information Center
Surapiboonchai, Kampol
2010-01-01
There is a lack of valid and reliable low cost observational instruments to measure moderate to vigorous physical activity (MVPA) in school physical education (PE). The participants in this study were third to tenth grade boys and girls from a south Texas school district. The SAM (Simple Activity Measurement) activity levels were compared with…
ERIC Educational Resources Information Center
Wouters, Marieke; van der Zanden, Anna M.; Evenhuis, Heleen M.; Hilgenkamp, Thessa I. M.
2017-01-01
Physical fitness is an important marker for health. In this study we investigated the feasibility and reliability of health-related physical fitness tests in children with moderate to severe levels of intellectual disability. Thirty-nine children (2-18 yrs) performed tests for muscular strength and endurance, the modified 6-minute walk test (6mwt)…
Nessen, Thomas; Demmelmaier, Ingrid; Nordgren, Birgitta; Opava, Christina H
2015-01-01
The aim of the present study was to investigate aspects of reliability and validity of the Exercise Self-Efficacy Scale (ESES-S) in a rheumatoid arthritis (RA) population. A total of 244 people with RA participating in a physical activity study were included. The six-item ESES-S, exploring confidence in performing exercise, was assessed for test-retest reliability over 4-6 months, and for internal consistency. Construct validity investigated correlation with similar and other constructs. An intraclass correlation coefficient (ICC) of 0.59 (95% CI 0.37-0.73) was found for 84 participants with stable health perceptions between measurement occasions. Cronbach's alpha coefficients of 0.87 and 0.89 were found at the first and second measurements. Corrected item-total correlations for single ESES-S items ranged between 0.53 and 0.73. Construct convergent validity for the ESES-S was partly confirmed by correlations with health-enhancing physical activity and outcome expectations respectively (Pearson's r = 0.18, p < 0.01). Construct divergent validity was confirmed by the absence of correlations with age or gender. No floor or ceiling effects were found for ESES-S. The results indicate that the ESES-S has moderate test-retest reliability and respectable internal consistency in people with RA. Construct validity was partially supported in the present sample. Further research on construct validity of the ESES-S is recommended. Physical exercise is crucial for management of symptoms and co-morbidity in rheumatoid arthritis. Self-efficacy for exercise is important to address in rehabilitation as it regulates exercise motivation and behavior. Measurement properties of self-efficacy scales need to be assessed in specific populations and different languages.
Moon, Ki Won; Lee, Shin-Seok; Kim, Jin Hyun; Song, Ran; Lee, Eun Young; Song, Yeong Wook; Bellamy, Nicholas; Lee, Eun Bong
2012-11-01
The Australian/Canadian Osteoarthritis Hand Index (AUSCAN) is a patient self-reported 15-item questionnaire measuring the severity of hand osteoarthritis symptoms with respect to pain, stiffness, and function. In this study, we developed a Korean version of the AUSCAN Index (K-AUSCAN) and confirmed its reliability, validity, and responsiveness. The AUSCAN Index was translated into Korean by 3 translators and translated back into English by 3 different translators. In a group of 53 patients with clinical hand osteoarthritis (mean age 58.3 ± 7.6 years), validity was evaluated against other outcome measures, including the Functional Index for Hand Osteoarthritis (FIHOA) and Multidimensional Health Assessment Questionnaire (MDHAQ). Test-retest reliability was assessed at a 2-week interval in 51 patients. Internal consistency of K-AUSCAN was evaluated by Cronbach's α. Responsiveness was measured by standardized response mean (SRM). The test-retest reliability of K-AUSCAN yielded intraclass correlation coefficients of 0.46 for pain, 0.58 for stiffness, and 0.67 for function. The internal consistency of K-AUSCAN was satisfactory with Cronbach's α of 0.89 for pain and 0.93 for function. The K-AUSCAN index showed good correlation with other measures (r² was 0.67 for K-AUSCAN pain and MDHAQ pain; r² was 0.72 for K-AUSCAN function and FIHOA). The pain and function subscales of K-AUSCAN correlated substantially with each other and moderately with the stiffness subscale. The average SRM for K-AUSCAN pain, stiffness, and function was -0.92, -0.48, and -0.84, respectively. The Korean version of the AUSCAN Index is a valid, reliable, and responsive tool for the assessment of hand osteoarthritis symptoms.
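The standardized response mean (SRM) used above for responsiveness is conventionally defined as the mean change score divided by the standard deviation of the change scores. The following is a minimal sketch of that conventional definition, not the study's own code; the function name and the paired-list inputs are hypothetical:

```python
from statistics import mean, stdev


def standardized_response_mean(baseline, follow_up):
    """SRM = mean(change) / sample SD(change) for paired scores.

    baseline, follow_up: equal-length lists of scores for the same
    patients (hypothetical inputs for illustration).
    """
    if len(baseline) != len(follow_up):
        raise ValueError("baseline and follow_up must be paired")
    # Change is follow-up minus baseline, so a drop in score is negative.
    changes = [after - before for before, after in zip(baseline, follow_up)]
    return mean(changes) / stdev(changes)
```

Under this sign convention, a negative SRM means scores fell between the two assessments, and larger absolute values indicate greater responsiveness.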
Hadadi, Mohammad; Ebrahimi Takamjani, Ismail; Ebrahim Mosavi, Mohammad; Aminian, Gholamreza; Fardipour, Shima; Abbasi, Faeze
2017-08-01
The purpose of the present study was to translate and to cross-culturally adapt the Cumberland Ankle Instability Tool (CAIT) into the Persian language and to evaluate its psychometric properties. The International Quality of Life Assessment process was pursued to translate CAIT into Persian. Two groups of Persian-speaking individuals, 105 participants with a history of ankle sprain and 30 participants with no history of ankle sprain, were asked to fill out the Persian version of CAIT (CAIT-P), Foot and Ankle Ability Measure (FAAM), and Visual Analog Scale (VAS). Data obtained from the first administration of CAIT were used to evaluate floor and ceiling effects, internal consistency, dimensionality, and criterion validity. To determine the test-retest reliability, 45 individuals completed CAIT again 5-7 days after the first session. Cronbach's alpha was over the cutoff point of 0.70 for both ankles and in both groups. The intra-class correlation coefficient was high for right (0.95) and left (0.91) ankles. There was a strong correlation between each item and the total score of the CAIT-P. Although the CAIT-P had a strong correlation with VAS, its correlation with both subscales of FAAM was moderate. The CAIT-P has good validity and reliability and it can be used by clinicians and researchers for identification and investigation of functional ankle instability. Implications for Rehabilitation: Chronic ankle instability is one of the most common consequences of acute ankle sprain. Cumberland Ankle Instability Tool is an acceptable measure to determine functional ankle instability and its severity. The Persian version of Cumberland Ankle Instability Tool is a valid and reliable tool for clinical and research purposes in Persian-speaking individuals.
Leung, Sau Fong; Lee, Ka Li; Lee, Sze Man; Leung, Sik Chi; Hung, Wing Sze; Lee, Wai Leng; Leung, Yuen Yee; Li, Man Wai; Tse, Tak Kin; Wong, Hoi Kei; Wong, Yuen Ni
2009-02-01
Eating disorders are affecting an increasing number of high school students in Western and Asian countries. The availability of an effective screening tool is crucial for early detection and prompt intervention. The objective of this study was to examine the validity and reliability of the SCOFF questionnaire for screening eating disorders in Hong Kong high school students. This study adopted a cross-sectional design to examine the psychometric properties of the SCOFF questionnaire. A panel of 7 experts and 936 students of a high school participated in the study. The SCOFF questionnaire was translated into Chinese and back-translated into English to ensure linguistic equivalence. A panel of 7 experts was involved in the content validation of the SCOFF questionnaire. The Eating Disorder Examination-Questionnaire (EDE-Q) was used as the "reference standard" to assess its concurrent validity in 936 students of a high school. Its reliability was examined by internal consistency and by the test-retest method at a 2-week interval with 38 students. The SCOFF questionnaire achieved an agreement of 86-100% among the experts for content relevance. Among the 812 students (86.8%) who responded to this study, SCOFF scores correlated significantly with global scores on the EDE-Q (r=0.5, P<0.01). Students identified as potentially having eating disorders had significantly higher scores in the EDE-Q than those not identified as such by SCOFF. The SCOFF questionnaire demonstrated moderate test-retest reliability (ICC=0.66) and an acceptable internal consistency reliability (Cronbach's alpha=0.44-0.57) in comparison with previous studies. The SCOFF questionnaire has acceptable psychometric properties in the Chinese culture. It will be useful for detecting potential eating disorders and assisting health promotion activity.
Baradaran, Aslan; Ebrahimzadeh, Mohammad H; Birjandinejad, Ali; Kachooei, Amir Reza
2016-04-01
Prospective study. We aimed to validate the Persian version of the modified Oswestry disability questionnaire (MODQ) in patients with low back pain. The modified Oswestry low back pain disability questionnaire is a well-known condition-specific outcome measure that helps quantify disability in patients with lumbar syndromes. To test the validity in a pilot study, the Persian MODQ was administered to 25 individuals with low back pain. We then enrolled 200 consecutive patients with low back pain to complete the Persian MODQ as well as the short form 36 (SF-36) questionnaire. Convergent validity of the MODQ was tested using the Spearman's correlation coefficient between the MODQ and SF-36 subscales. Intraclass correlation coefficient (ICC) and Cronbach's α coefficient were measured to test the reliability between test and retest and internal consistency of all items, respectively. ICC for individual items ranged from 0.43 to 0.80 showing good reliability and reproducibility of each individual item. Cronbach's α coefficient was 0.69 showing good internal consistency across all 10 items of the Persian MODQ. Total MODQ score showed moderate to strong correlation with the eight subscales and the two domains of the SF-36. The highest correlation was between the MODQ and the physical functioning subscale of the SF-36 (r=-0.54, p<0.001) and the physical component domain of the SF-36 (r=-0.55, p<0.001) showing that MODQ is measuring what it is supposed to measure in terms of disability and physical function. The Persian version of the MODQ is a valid and reliable tool for the assessment of disability following low back pain.
Moriguchi, Eri; Ito, Mikiko; Nagai, Toshisaburo
2015-11-01
A Japanese version of the Quality of Life in Childhood Epilepsy Questionnaire (QOLCE-J) was developed using international guidelines as a QOL scale for childhood epilepsy; its reliability and validity were examined, focusing on its applicability to Japanese pediatric epilepsy patients. A pilot test questionnaire survey was conducted, involving parents of pediatric epilepsy patients aged 4-15 undergoing outpatient treatment. A total of 278 responses were obtained and analyzed. Internal consistency for the 16 QOLCE-J subscales, except for
Gagné, Myriam; Boulet, Louis-Philippe; Pérez, Norma; Moisan, Jocelyne
2018-04-30
To systematically identify the measurement properties of patient-reported outcome instruments (PROs) that evaluate adherence to inhaled maintenance medication in adults with asthma. We conducted a systematic review of six databases. Two reviewers independently included studies on the measurement properties of PROs that evaluated adherence in asthmatic participants aged ≥18 years. Based on the COnsensus-based Standards for the selection of health Measurement INstruments (COSMIN), the reviewers (1) extracted data on internal consistency, reliability, measurement error, content validity, structural validity, hypotheses testing, cross-cultural validity, criterion validity, and responsiveness; (2) assessed the methodological quality of the included studies; (3) assessed the quality of the measurement properties (positive or negative); and (4) summarised the level of evidence (limited, moderate, or strong). We screened 6,068 records and included 15 studies (14 PROs). No studies evaluated measurement error or responsiveness. Based on methodological and measurement property quality assessments, we found limited positive evidence of: (a) internal consistency of the Adherence Questionnaire, Refined Medication Adherence Reason Scale (MAR-Scale), Medication Adherence Report Scale for Asthma (MARS-A), and Test of the Adherence to Inhalers (TAI); (b) reliability of the TAI; and (c) structural validity of the Adherence Questionnaire, MAR-Scale, MARS-A, and TAI. We also found limited negative evidence of: (d) hypotheses testing of Adherence Questionnaire; (e) reliability of the MARS-A; and (f) criterion validity of the MARS-A and TAI. Our results highlighted the need to conduct further high-quality studies that will positively evaluate the reliability, validity, and responsiveness of the available PROs.
Hoeboer, Joris; Krijger-Hombergen, Michiel; Savelsbergh, Geert; De Vries, Sanne
2018-07-01
The purpose of this study was to examine the test-retest reliability, internal consistency and concurrent validity of the Athletic Skills Track (AST). During a regular PE lesson, 930 4- to 12-year-old children (448 girls, 482 boys) completed two motor skill competence tests: (1) the Körperkoordination-Test für Kinder (KTK) and (2) an age-related version of the AST (age 4-6 years: AST-1, age 6-9 years: AST-2, and age 9-12 years: AST-3). The test-retest reliability of the AST was high (AST-1: ICC = 0.881 (95% CI: 0.780-0.934); AST-2: ICC = 0.802 (95% CI: 0.717-0.858); and AST-3: ICC = 0.800 (95% CI: 0.669-0.871)). The internal consistency of the three age bands of the AST was above the acceptable level of Cronbach's α > 0.70 (AST-1: α = 0.764; AST-2: α = 0.700; and AST-3: α = 0.763). There was a moderate to high correlation between the time to complete the AST and the age- and gender-related motor quotients of the KTK (AST-1: r = -0.747, p = 0.01; AST-2: r = -0.646, p = 0.01; and AST-3: r = -0.602, p = 0.01). The Athletic Skills Track is a reliable and valid assessment tool to assess motor skill competence among 4- to 12-year-old children in the PE setting.
Vélez, Claudia Marcela; Lugo, Luz Helena; García, Héctor Iván
2012-09-01
To validate the KIDSCREEN-27 for parents in the metropolitan area of Medellín, Colombia, including the Social Acceptance (SA) subscale of KIDSCREEN-52, as it evaluates the effect of bullying on the quality of life of children. The study population was made up of parents of children between 8 and 18, from Medellín and its metropolitan area. A sample of 1,150 parents was estimated according to the different psychometric properties to be measured. Construct validation was made by comparing the mean scores between groups of high and low socioeconomic conditions. The content validity and the measurement of reliability were verified by internal consistency and test-retest stability. The parent-child agreement was also measured. The internal consistency was adequate (Cronbach alpha 0.76-0.83). Parents of children with better socio-economic status had higher scores in all dimensions (p<0.05). Scores were higher among healthy children. Women had lower scores than men, while children registered higher scores than adolescents. The intraclass correlation coefficient for the reliability assessment was above 0.7 in all dimensions, except in School Environment (SE) (ICC 0.6-0.92). The parent-child agreement reached moderate and good levels (ICC 0.49-0.69). The exploratory factor analysis, including the social acceptance subscale, registered eight dimensions, four of which were in agreement with the original questionnaire: Physical activity, SE, Social Support, and the SA subscale. KIDSCREEN-27 for parents is a valid and reliable instrument to be used in the Colombian context. Copyright © 2012 Asociación Colombiana de Psiquiatría. Published by Elsevier España. All rights reserved.
Development and Validation of a Measure of Attitudes toward Fluffy Women
Barned, C; Lipps, GE
2014-01-01
ABSTRACT Background: There is an absence of research on the newly evolved term “fluffy” which describes body image and personality features among women. Research on “fluffiness” among Caribbean peoples has been limited by the lack of valid and reliable measures of the concept. Objective: This project addresses this problem by exploring the internal consistency reliability and the concurrent and discriminant validity of the Attitudes toward Fluffy Women Scale (ATFW) using a mixture of past and present students from The University of the West Indies (UWI), Mona, and the University of Technology (UTech), Kingston. Method: Past or present students from The UWI, Mona, and UTech, Kingston, were recruited for the study through the use of convenience sampling. A total of 80 students (38 males, 47.5%; 42 females, 52.5%) participated in the study. Results: Overall, the ATFW was found to have an acceptable degree of internal consistency reliability (α = 0.90). The scale also had reasonably good concurrent validity as evidenced by moderate correlations with scores on the Attitudes Toward Obese Persons Scale (r = −0.42) and acceptable discriminant validity as demonstrated through low correlations with a Bogardus Social Distance Scale designed to assess prejudice toward people living with the human immunodeficiency virus [HIV] (r = 0.29). This pattern of scores suggests that the majority of the stable variance underlying the ATFW assesses the “fluffy” concept (17.6%) while a smaller degree of the variability (8%) measures a conceptually similar but distinct concept. Conclusion: The Attitudes toward Fluffy Women scale was found to be a reliable and valid scale for assessing the attitudes of young adults toward fluffy women. PMID:25803379
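The shared-variance percentages quoted above are consistent with squaring the reported correlation coefficients (the coefficient of determination, r²). A minimal sketch of that arithmetic, assuming this is how the figures were obtained (the abstract does not state the computation, and the function name is hypothetical):

```python
def shared_variance_percent(r):
    """Percentage of variance shared by two measures with correlation r."""
    if not -1.0 <= r <= 1.0:
        raise ValueError("a correlation must lie in [-1, 1]")
    # The sign of r does not matter: shared variance is r squared.
    return (r ** 2) * 100


# Correlations reported in the abstract.
atop = shared_variance_percent(-0.42)   # Attitudes Toward Obese Persons Scale
bogardus = shared_variance_percent(0.29)  # Bogardus Social Distance Scale
```

Rounded to one decimal place, r = −0.42 gives 17.6% and r = 0.29 gives 8.4%, in line with the abstract's 17.6% and roughly 8%.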
Chen, Yu-Ming; He, Li-Ping; Mai, Jin-Cheng; Hao, Yuan-Tao; Xiong, Li-Hua; Chen, Wei-Qing; Wu, Jiang-Nan
2008-06-01
To evaluate the reliability and validity of the parent proxy-report scales of the Pediatric Quality of Life Inventory Version 4.0 (PedsQL 4.0) Generic Core Scales, Chinese Version. A total of 3493 school students aged 6-18 years were recruited using a multistage cluster sampling method. Health-related quality of life was assessed using the above-mentioned PedsQL 4.0 scales. The internal consistency was assessed using Cronbach's α coefficient, while validity was tested through correlation analysis, t-test and exploratory factor analysis. The internal consistency reliability for the Total Scale Score (Cronbach's alpha = 0.90), Physical Health Summary Score (alpha = 0.81), and Psychosocial Health Summary Score (alpha = 0.89) was excellent. Six major factors were extracted by factor analysis, which basically matched the designed structure of the original version and accounted for nearly 66% of the variance. The Total Scale Score was significantly lower, by 3.5 to 13.3 (P < 0.05), in children and adolescents who had diseases including cold, skin hypersensitiveness, food allergy, courbature or arthralgia, or breathlessness with a frequency of 6 times or more per year, or who had asthma, as compared to those with a lower frequency (≤ 5 times/y) of the diseases or without asthma. We found moderate to high correlations between items and the subscales. Correlation coefficients ranged from 0.45 to 0.84 (P < 0.01). The reliability and validity of the parent proxy-report scales of the PedsQL 4.0 Generic Core Scales, Chinese Version, were as good as those of the original version. Our findings suggested that the scales could be applied to evaluate the health-related quality of life of children in Chinese regions similar to Guangzhou.
Ogden, C A; Akobeng, A K; Abbott, J; Aggett, P; Sood, M R; Thomas, A G
2011-09-01
To validate IMPACT-III (UK), a health-related quality of life (HRQoL) instrument, in British children with inflammatory bowel disease (IBD). One hundred six children and parents were invited to participate. IMPACT-III (UK) was validated by inspection by health professionals and children to assess face and content validity, factor analysis to determine optimum domain structure, use of Cronbach alpha coefficients to test internal reliability, ANOVA to assess discriminant validity, correlation with the Child Health Questionnaire to assess concurrent validity, and use of intraclass correlation coefficients to assess test-retest reliability. The independent samples t test was used to measure differences between sexes and age groups, and between paper and computerised versions of IMPACT-III (UK). IMPACT-III (UK) had good face and content validity. The most robust factor solution was a 5-domain structure: body image, embarrassment, energy, IBD symptoms, and worries/concerns about IBD, all of which demonstrated good internal reliability (α = 0.74-0.88). Discriminant validity was demonstrated by significant (P < 0.05, P < 0.01) differences in HRQoL scores between the severe, moderate, and inactive/mild symptom severity groups for the embarrassment scale (63.7 vs 81.0 vs 81.2), IBD symptom scale (45.0 vs 64.2 vs 80.6), and the energy scale (46.4 vs 62.1 vs 77.7). Concurrent validity of IMPACT-III (UK) with comparable domains of the Child Health Questionnaire was confirmed. Test-retest reliability was confirmed with good intraclass correlation coefficients of 0.66 to 0.84. Paper and computer versions of IMPACT-III (UK) collected comparable scores, and there were no differences between the sexes and age groups. IMPACT-III (UK) appears to be a useful tool to measure HRQoL in British children with IBD.
Gergov, Vera; Lahti, Jari; Marttunen, Mauri; Lipsanen, Jari; Evans, Chris; Ranta, Klaus; Laitila, Aarno; Lindberg, Nina
2017-05-01
An increasing need exists for suitable measures to evaluate treatment outcome in adolescents. YP-CORE is a pan-theoretical brief questionnaire developed for this purpose, but it lacks studies in different cultures or languages. To explore the acceptability, factor structure, reliability, validity, and sensitivity to change of the Finnish translation of YP-CORE. The study was conducted at the Department of Adolescent Psychiatry, Helsinki University Central Hospital. A Finnish translation was prepared by a team of professionals and adolescents. A clinical sample of 104 patients was asked to complete the form together with BDI-21 and BAI, and 92 of them filled the forms again after a 3-month treatment. Analysis included acceptability, confirmatory factor analysis, internal and test-re-test reliability, concurrent validity, influence of gender and age, and criteria for reliable change. YP-CORE was well accepted, and the rate of missing values was low. Internal consistency (α = 0.83-.92) and test-re-test reliability were good (r = 0.69), and the results of CFA supported a one-factor model. YP-CORE showed good concurrent validity against two widely used symptom-specific measures (r = 0.62-0.87). Gender had a moderately strong effect on the scores (d = 0.67), but the effect of age was not as evident. The measure was sensitive to change, showing a larger effect size (d = 0.55) than in the BDI-21 and BAI (d = 0.31-0.50). The results show that the translation of YP-CORE into Finnish has been successful, the YP-CORE has good psychometric properties, and the measure could be taken into wider use in clinical settings for outcome measurement in adolescents.
Translation, validity and reliability of the British Sign Language (BSL) version of the EQ-5D-5L.
Rogers, Katherine D; Pilling, Mark; Davies, Linda; Belk, Rachel; Nassimi-Green, Catherine; Young, Alys
2016-07-01
To translate the health questionnaire EuroQol EQ-5D-5L into British Sign Language (BSL), to test its reliability with the signing Deaf population of BSL users in the UK and to validate its psychometric properties. The EQ-5D-5L BSL was developed following the international standard for translation required by EuroQol, with additional agreed features appropriate to a visual language. Data collection used an online platform to view the signed (BSL) version of the tests. The psychometric testing included content validity, assessed by interviewing a small sample of Deaf people. Reliability was tested by internal consistency of the items and test-retest, and convergent validity was assessed by determining how well EQ-5D-5L BSL correlates with CORE-10 BSL and CORE-6D BSL. The psychometric properties of the EQ-5D-5L BSL are good, indicating that it can be used to measure health status in the Deaf signing population in the UK. Convergent validity between EQ-5D-5L BSL and CORE-10 BSL and CORE-6D BSL is consistent, demonstrating that the BSL version of EQ-5D-5L is a good measure of the health status of an individual. The test-retest reliability of EQ-5D-5L BSL, for each dimension of health, was shown to have Cohen's kappa values of 0.47-0.61; these were in the range of moderate to good and were therefore acceptable. This is the first time EQ-5D-5L has been translated into a signed language for use with Deaf people and is a significant step forward towards conducting studies of health status and cost-effectiveness in this population.
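The test-retest agreement above is reported as Cohen's kappa. A minimal sketch of the unweighted statistic follows. The abstract does not say whether a weighted variant was used, so the unweighted form and the name cohens_kappa are assumptions.

```python
from collections import Counter


def cohens_kappa(first: list, second: list) -> float:
    """Unweighted Cohen's kappa for two sets of categorical ratings.

    kappa = (observed agreement - chance agreement) / (1 - chance agreement)
    Undefined (division by zero) when chance agreement equals 1.
    """
    if len(first) != len(second) or not first:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(first)
    observed = sum(a == b for a, b in zip(first, second)) / n
    counts_first, counts_second = Counter(first), Counter(second)
    # Chance agreement: product of the marginal proportions per category.
    expected = sum(counts_first[c] * counts_second[c] for c in counts_first) / n ** 2
    return (observed - expected) / (1 - expected)


# Hypothetical test and retest responses on a 5-level EQ-5D-5L dimension.
test_round = [1, 2, 2, 3, 1, 4, 2, 1]
retest_round = [1, 2, 3, 3, 1, 4, 2, 2]
print(round(cohens_kappa(test_round, retest_round), 2))
```

For ordinal response levels such as these, a weighted kappa that gives partial credit for near misses is a common alternative.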
Elison, Sarah; Davies, Glyn; Ward, Jonathan
2016-07-28
There is a growing literature around substance use disorder treatment outcome measures. Various constructs have been suggested as being appropriate for measuring recovery outcomes, including "recovery capital" and "treatment progression." However, these previously proposed constructs do not measure changes in psychosocial functioning during the recovery process. Therefore, a new psychometric assessment, the "Recovery Progression Measure" (RPM), has been developed to measure this recovery-oriented psychosocial change. The aims of this study were to evaluate the reliability and factor structure of the RPM via data collected from 2218 service users being treated for their substance dependence. Data were collected from service users accessing the Breaking Free Online (BFO) substance use disorder treatment and recovery program, which has within its baseline assessment a 36-item psychometric measure previously developed by the authors to assess the six areas of functioning described in the RPM. Reliability analyses and exploratory factor analyses (EFA) were conducted to examine the underlying factor structure of the RPM. Internal reliability of the RPM was found to be excellent (α > .70), with an overall reliability of α = .89, and item-total correlations revealed moderate-to-excellent reliability of individual items. EFA revealed the RPM to contain an underlying factor structure of eight components. This study provides initial data to support the reliability of the RPM as a recovery measure. Further work is now underway to extend these findings, including convergent and predictive validity analyses.
Negahban, Hossein; Mohtasebi, Elham; Goharpey, Shahin
2015-01-01
The aim of this methodological study was to cross-culturally translate the Shoulder Activity Scale (SAS) into Persian and to determine its clinimetric properties, including reliability, validity, and responsiveness, in patients with shoulder disorders. The Persian version of the SAS was obtained after standard forward-backward translation. Three questionnaires were completed by the respondents: the SAS, the shoulder pain and disability index (SPADI), and the Short-Form 36 Health Survey (SF-36). The patients completed the SAS again 1 week after the first visit to evaluate test-retest reliability. Construct validity was evaluated by examining the associations between the scores on the SAS and the scores obtained from the SPADI, SF-36, and age of the patients. To assess responsiveness, data were collected at the first visit and then again after a 4-week physiotherapy intervention. Test-retest reliability and internal consistency were assessed using the Intra-class Correlation Coefficient (ICC) and Cronbach's alpha, respectively. To evaluate construct validity, Spearman's rank correlation was used. The ability of the SAS to detect changes was evaluated by the receiver-operating characteristics method. No problems or language difficulties were reported during the translation process. Test-retest reliability of the SAS was excellent, with an ICC of 0.98. A marginal Cronbach's alpha of 0.64 was obtained. The correlation between the SAS and the SPADI was low, proving divergent validity, whereas the correlations between the SAS and the SF-36/age were moderate, proving convergent validity. A marginally acceptable responsiveness was achieved for the Persian SAS. The study provides some evidence to support the test-retest reliability, internal consistency, construct validity, and responsiveness of the Persian version of the SAS in patients with shoulder disorders.
Therefore, it seems that this instrument is a useful measure of shoulder activity level in research setting and clinical practice. The shoulder activity scale (SAS) is a reliable, valid, and responsive measure of shoulder activity level in Persian-speaking patients with different shoulder disorders. The results on clinimetric properties of the Persian SAS are comparable with its original, English version. Persian version of the SAS can be used in "clinical" and "research" settings of patients with shoulder disorders.
Lersilp, Suchitporn; Suchart, Sumana
2017-01-01
The purpose of this study was to improve upon the first version of the basic work skills assessment tool for adolescents with autism spectrum disorder (ASD) and examine interrater and intrarater reliability using Intraclass Correlation Coefficient (ICC). The modified tool includes 2 components: (1) three tasks measuring work abilities and work attitudes and (2) a form to record the number of verbal and nonverbal prompts. 26 participants were selected by purposive sampling and divided into 3 groups—group 1 (10 subjects, aged 11–13 years), group 2 (10, aged 14–16 years), and group 3 (6, aged 17–19 years). The results show that interrater reliabilities of work abilities and work attitudes were high in all groups except that the work attitude in group 1 was moderate. Intrarater reliabilities of work abilities in group 1 and group 2 were high. Group 3 was moderate. Intrarater reliabilities of work attitudes in group 1 and group 3 were high but not in group 2 in which they were moderate. Nevertheless, interrater and intrarater reliabilities in the total scores of all groups were high, which implies that this tool is applicable for adolescents aged 11–19 years with consideration of relevance for each group. PMID:28280769
Environmental education curriculum evaluation questionnaire: A reliability and validity study
NASA Astrophysics Data System (ADS)
Minner, Daphne Diane
The intention of this research project was to bridge the gap between social science research and application to the environmental domain through the development of a theoretically derived instrument designed to give educators a template by which to evaluate environmental education curricula. The theoretical base for instrument development was provided by several developmental theories such as Piaget's theory of cognitive development, Developmental Systems Theory, Life-span Perspective, as well as curriculum research within the area of environmental education. This theoretical base fueled the generation of a list of components which were then translated into a questionnaire with specific questions relevant to the environmental education domain. The specific research question for this project is: Can a valid assessment instrument based largely on human development and education theory be developed that reliably discriminates high, moderate, and low quality in environmental education curricula? The types of analyses conducted to answer this question were interrater reliability (percent agreement, Cohen's Kappa coefficient, Pearson's Product-Moment correlation coefficient), test-retest reliability (percent agreement, correlation), and criterion-related validity (correlation). Face validity and content validity were also assessed through thorough reviews. Overall results indicate that 29% of the questions on the questionnaire demonstrated a high level of interrater reliability and 43% of the questions demonstrated a moderate level of interrater reliability. Seventy-one percent of the questions demonstrated a high test-retest reliability and 5% a moderate level. Fifty-five percent of the questions on the questionnaire were reliable (high or moderate) both across time and raters. Only eight questions (8%) did not show either interrater or test-retest reliability. 
The global overall rating of high, medium, or low quality was reliable across both coders and time, indicating that the questionnaire can discriminate differences in quality of environmental education curricula. Of the 35 curricula evaluated, 6 were high quality, 14 were medium quality and 15 were low quality. The criterion-related validity of the instrument is at current time unable to be established due to the lack of comparable measures or a concretely usable set of multidisciplinary standards. Face and content validity were sufficiently demonstrated.
The Yale-Brown Obsessive Compulsive Scale: A Reliability Generalization Meta-Analysis.
López-Pina, José Antonio; Sánchez-Meca, Julio; López-López, José Antonio; Marín-Martínez, Fulgencio; Núñez-Núñez, Rosa Maria; Rosa-Alcázar, Ana I; Gómez-Conesa, Antonia; Ferrer-Requena, Josefa
2015-10-01
The Yale-Brown Obsessive Compulsive Scale (Y-BOCS) is the most frequently applied test to assess obsessive compulsive symptoms. We conducted a reliability generalization meta-analysis on the Y-BOCS to estimate the average reliability, examine the variability among the reliability estimates, search for moderators, and propose a predictive model that researchers and clinicians can use to estimate the expected reliability of the Y-BOCS. We included studies where the Y-BOCS was applied to a sample of adults and reliability estimate was reported. Out of the 11,490 references located, 144 studies met the selection criteria. For the total scale, the mean reliability was 0.866 for coefficients alpha, 0.848 for test-retest correlations, and 0.922 for intraclass correlations. The moderator analyses led to a predictive model where the standard deviation of the total test and the target population (clinical vs. nonclinical) explained 38.6% of the total variability among coefficients alpha. Finally, clinical implications of the results are discussed.
Reliability and validity of the Microsoft Kinect for evaluating static foot posture
2013-01-01
Background The evaluation of foot posture in a clinical setting is useful to screen for potential injury, however disagreement remains as to which method has the greatest clinical utility. An inexpensive and widely available imaging system, the Microsoft Kinect™, may possess the characteristics to objectively evaluate static foot posture in a clinical setting with high accuracy. The aim of this study was to assess the intra-rater reliability and validity of this system for assessing static foot posture. Methods Three measures were used to assess static foot posture; traditional visual observation using the Foot Posture Index (FPI), a 3D motion analysis (3DMA) system and software designed to collect and analyse image and depth data from the Kinect. Spearman’s rho was used to assess intra-rater reliability and concurrent validity of the Kinect to evaluate foot posture, and a linear regression was used to examine the ability of the Kinect to predict total visual FPI score. Results The Kinect demonstrated moderate to good intra-rater reliability for four FPI items of foot posture (ρ = 0.62 to 0.78) and moderate to good correlations with the 3DMA system for four items of foot posture (ρ = 0.51 to 0.85). In contrast, intra-rater reliability of visual FPI items was poor to moderate (ρ = 0.17 to 0.63), and correlations with the Kinect and 3DMA systems were poor (absolute ρ = 0.01 to 0.44). Kinect FPI items with moderate to good reliability predicted 61% of the variance in total visual FPI score. Conclusions The majority of the foot posture items derived using the Kinect were more reliable than the traditional visual assessment of FPI, and were valid when compared to a 3DMA system. Individual foot posture items recorded using the Kinect were also shown to predict a moderate degree of variance in the total visual FPI score. Combined, these results support the future potential of the Kinect to accurately evaluate static foot posture in a clinical setting. PMID:23566934
ERIC Educational Resources Information Center
Maiano, Christophe; Morin, Alexandre J. S.; Begarie, Jerome
2011-01-01
The purpose of this study was to test the factor validity and reliability of the Center for Epidemiologic Studies Depression Scale (CES-D) within a sample of adolescents with mild to moderate Intellectual Disability (ID). A total sample of 189 adolescents (121 boys and 68 girls), aged between 12 and 18 years old, with mild to moderate ID were…
Oosterhuis, Ingrid; Rolfes, Leàn; Ekhart, Corine; Muller-Hansma, Annemarie; Härmark, Linda
2018-02-01
To make a proper causality assessment of an adverse drug reaction (ADR) report, a certain level of clinical information is necessary. A tool was developed to measure the level of clinical information present in ADR reports. The aim of this study was to test the validity and reliability of the clinical documentation tool (ClinDoc) in an international setting. The tool was developed by a panel of pharmacovigilance experts. It includes four domains: ADR, chronology of the ADR, suspected drug and patient characteristics. The final score categorizes reports as excellent, well, moderately or poorly documented. In two rounds, eight pharmacovigilance assessors from different countries made a total of 224 assessments using the tool, with the expert panel's judgement as the standard. Sensitivity and specificity were calculated. The tool with four outcome categories demonstrated low sensitivity. A lack of distinctiveness was demonstrated between the categories moderate and well. Results for the second round were re-analysed using three categories. This demonstrated better validity. This is the first tool to give insight into the level of relevant clinical information present in ADR reports. It can be used internationally to compare reports coming from different reporting methods and different types of reporters in pharmacovigilance.
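Sensitivity and specificity against a reference standard, as used above, can be sketched for a single category treated as a yes/no outcome. The abstract does not describe how the multi-category scores were dichotomised, so the one-category-versus-rest setup and the name sensitivity_specificity are assumptions.

```python
def sensitivity_specificity(assessor: list[bool], reference: list[bool]) -> tuple[float, float]:
    """Sensitivity and specificity of assessor calls against a reference standard.

    Each element says whether a report was placed in the category of interest
    (hypothetical one-versus-rest coding for this sketch).
    """
    pairs = list(zip(assessor, reference))
    true_pos = sum(a and r for a, r in pairs)
    false_neg = sum((not a) and r for a, r in pairs)
    true_neg = sum((not a) and (not r) for a, r in pairs)
    false_pos = sum(a and (not r) for a, r in pairs)
    sensitivity = true_pos / (true_pos + false_neg)
    specificity = true_neg / (true_neg + false_pos)
    return sensitivity, specificity


# Hypothetical example: was each of six reports rated "well documented"?
assessor_calls = [True, True, False, False, True, False]
panel_standard = [True, False, False, False, True, True]
print(sensitivity_specificity(assessor_calls, panel_standard))
```

The function raises ZeroDivisionError if the reference standard contains no positive or no negative cases, which a real analysis would need to handle.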
Reliability and validity of the international physical activity questionnaire for assessing walking.
van der Ploeg, Hidde P; Tudor-Locke, Catrine; Marshall, Alison L; Craig, Cora; Hagströmer, Maria; Sjöström, Michael; Bauman, Adrian
2010-03-01
Physical inactivity and its accompanying adverse sequelae (e.g., obesity and diabetes) are global health concerns. The single most commonly reported physical activity in public health surveys is walking (Centers for Disease Control and Prevention, 2000; Rafferty, Reeves, McGee, & Pivarnik, 2002). As evidence accumulates that walking is important for preventing weight gain (Levine et al., 2008) and reducing the risk of diabetes (Jeon, Lokken, Hu, & van Dam, 2007), there is increased need to capture this behavior in a valid and reliable manner. Although the disadvantages of a self-report methodology are well known (Sallis & Saelens, 2000), it still represents the most feasible approach for conducting population-level surveillance across developed and developing countries. The International Physical Activity Questionnaire (IPAQ) was created and evaluated as a standardized instrument for this purpose. Although two versions of the IPAQ were designed and evaluated (short: nine items; and long: 31 items), the short form was recommended for population monitoring (Craig et al., 2003). However, it has not been recommended for intervention or research studies that require precise physical activity quantification to examine changes in physical activity at the individual level. IPAQ was also not intended to replace instruments that are more responsive to individual changes in activity level, such as objective measures. In addition to walking behaviors, IPAQ also assesses time spent in moderate- and vigorous-intensity activity as well as sitting behaviors, although the latter is not the focus of this analysis. Aggregated IPAQ data have been previously validated compared to accelerometers, and overall reliability was confirmed across 12 countries (Craig et al., 2003). Previous research showed criterion validity Spearman correlations with a median of 0.30 and test-retest reliability Spearman correlations clustered around 0.8 (Craig et al., 2003).
The purpose of this study, however, was to reanalyze these data with respect to validity (again compared to an accelerometer) and test-retest reliability specifically for population monitoring of walking.
Elboim-Gabyzon, Michal; Agmon, Maayan; Azaiza, Faisal; Laufer, Yocheved
2015-04-24
The Late-Life Function and Disability Instrument (LLFDI) provides a comprehensive, reliable, and valid assessment of physical function and disability in community-dwelling adults. There does not appear to be a validated, comprehensive instrument for assessing function and disability in Arabic. The objective of the present study was to translate and culturally adapt the LLFDI to Arabic, and to determine its test-retest reliability and validity. The LLFDI was translated to Arabic through a forward and backward translation process, and approved by a bilingual committee of experts. Sixty-one (26 male and 35 female) Arabic-speaking, healthy, older adults, aged 65-88, living in northern Israel participated in the study. To determine test-retest reliability, the questionnaire was administered twice to 41 subjects with a 6- to 8-day interval. Construct validity was examined by correlating the LLFDI responses with the 10-item physical function (PF-10) subscales of the General Health Survey (SF-36), with the physical component of SF-36 (SF-36 PCS), and with two performance measures, the Berg Balance Scale (BBS) and Timed Up and Go (TUG) test. Additionally, gender and fall-related differences in the LLFDI were examined. Internal consistency (Cronbach's alpha) was good to excellent (0.77 to 0.97). Test-retest agreement was good to very good (function component: 0.86-0.93, disability component: 0.77-0.93). Correlation with the SF-36 PCS and PF-10 was moderate to strong for both LLFDI components (function, r = 0.53-0.65 and r = 0.57-0.63, and LLFDI disability, r = 0.57-0.76 and 0.53-0.73, respectively). Significant, moderate-to-strong correlations between the LLFDI and BBS (r = 0.73-0.87) and a significant, moderate, negative correlation between LLFDI and TUG test (r = -0.59 to -0.68) were noted. The standard error of measure was 6-12%, and the smallest real difference was 18-33%. Discriminative validity for both gender and fall status was also demonstrated.
The Arabic version of the LLFDI is a highly reliable and valid instrument for assessing function and disability in community dwelling, Arab older adults. The translated instrument has a discriminative ability between genders and between fallers and non-fallers. The translated instrument may be used in clinical settings and for research purposes.
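The standard error of measurement and smallest real difference reported above are commonly derived from the score standard deviation and a reliability coefficient. The abstract does not state which formulas were used, so the textbook forms below, the function names, and the example inputs are assumptions.

```python
import math


def standard_error_of_measurement(sd: float, reliability: float) -> float:
    """SEM = SD * sqrt(1 - reliability), with reliability such as a test-retest ICC."""
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must lie between 0 and 1")
    return sd * math.sqrt(1.0 - reliability)


def smallest_real_difference(sem: float, z: float = 1.96) -> float:
    """SRD at 95% confidence = z * sqrt(2) * SEM.

    The sqrt(2) term reflects measurement error in both the test and the retest.
    """
    return z * math.sqrt(2.0) * sem


# Hypothetical inputs: score SD of 10 points and a reliability of 0.75.
sem = standard_error_of_measurement(10.0, 0.75)
print(sem, smallest_real_difference(sem))
```

Dividing either value by the mean score gives a percentage, which is one plausible way figures such as 6-12% and 18-33% could have been expressed.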
Dunleavy, Kim; Neil, Joseph; Tallon, Allison; Adamo, Diane E
2015-09-01
The cervical range of motion device (CROM) has been shown to provide reliable forward head position (FHP) measurement when the upper cervical angle (UCA) is controlled. However, measurement without UCA standardization is reflective of habitual patterns. Criterion validity has not been reported. The purposes of this study were to establish: (1) criterion validity of CROM FHP and UCA compared to Optotrak data, (2) relative reliability and minimal detectable change (MDC95) in patients with and without cervical pain, and (3) to compare UCA and FHP in patients with and without pain in habitual postures. (1) Within-subjects single session concurrent criterion validity design. Simultaneous CROM and OP measurement was conducted in habitual sitting posture in 16 healthy young adults. (2) Reliability and MDC95 of UCA and FHP were calculated from three trials. (3) Values for adults over 35 years with cervical pain and age-matched healthy controls were compared. (1) Forward head position distances were moderately correlated and UCA angles were highly correlated. The mean (standard deviation) differences can be expected to vary between 1·48 cm (1·74) for FHP and -1·7 (2·46)° for UCA. (2) Reliability for CROM FHP measurements were good to excellent (no pain) and moderate (pain). Cervical range of motion FHP MDC95 was moderately low (no pain), and moderate (pain). Reliability for CROM UCA measurements was excellent and MDC95 low for both groups. There was no difference in FHP distances between the pain and no pain groups, UCA was significantly more extended in the pain group (P<0·05). Cervical range of motion FHP measurements were only moderately correlated with Optotrak data, and limits of agreement (LOA) and MDC95 were relatively large. There was also no difference in CROM FHP distance between older symptomatic and asymptomatic individuals. Cervical range of motion FHP measurement is therefore not recommended as a clinical outcome measure. 
Cervical range of motion UCA measurements showed good criterion validity, excellent test-retest reliability, and achievable MDC95 in asymptomatic and symptomatic participants. Differences of more than 6° are required to exceed error. Cervical range of motion UCA shows promise as a useful reliable and valid measurement, particularly as patients with cervical pain exhibited significantly more extended angles.
Neil, Joseph; Tallon, Allison; Adamo, Diane E.
2015-01-01
Objectives The cervical range of motion device (CROM) has been shown to provide reliable forward head position (FHP) measurement when the upper cervical angle (UCA) is controlled. However, measurement without UCA standardization is reflective of habitual patterns. Criterion validity has not been reported. The purposes of this study were to establish: (1) criterion validity of CROM FHP and UCA compared to Optotrak data, (2) relative reliability and minimal detectable change (MDC95) in patients with and without cervical pain, and (3) to compare UCA and FHP in patients with and without pain in habitual postures. Methods (1) Within-subjects single session concurrent criterion validity design. Simultaneous CROM and OP measurement was conducted in habitual sitting posture in 16 healthy young adults. (2) Reliability and MDC95 of UCA and FHP were calculated from three trials. (3) Values for adults over 35 years with cervical pain and age-matched healthy controls were compared. Results (1) Forward head position distances were moderately correlated and UCA angles were highly correlated. The mean (standard deviation) differences can be expected to vary between 1·48 cm (1·74) for FHP and −1·7 (2·46)° for UCA. (2) Reliability for CROM FHP measurements were good to excellent (no pain) and moderate (pain). Cervical range of motion FHP MDC95 was moderately low (no pain), and moderate (pain). Reliability for CROM UCA measurements was excellent and MDC95 low for both groups. There was no difference in FHP distances between the pain and no pain groups, UCA was significantly more extended in the pain group (P<0·05). Discussion Cervical range of motion FHP measurements were only moderately correlated with Optotrak data, and limits of agreement (LOA) and MDC95 were relatively large. There was also no difference in CROM FHP distance between older symptomatic and asymptomatic individuals. Cervical range of motion FHP measurement is therefore not recommended as a clinical outcome measure. 
Cervical range of motion UCA measurements showed good criterion validity, excellent test–retest reliability, and achievable MDC95 in asymptomatic and symptomatic participants. Differences of more than 6° are required to exceed error. Cervical range of motion UCA shows promise as a useful reliable and valid measurement, particularly as patients with cervical pain exhibited significantly more extended angles. PMID:26917936
Borloz, S; Trippolini, M A; Ballabeni, P; Luthi, F; Deriaz, O
2012-09-01
Functional subjective evaluation through questionnaire is fundamental, but not often realized in patients with back complaints, lacking validated tools. The Spinal Function Sort (SFS) was only validated in English. We aimed to translate, adapt and validate the French (SFS-F) and German (SFS-G) versions of the SFS. Three hundred and forty-four patients, experiencing various back complaints, were recruited in a French (n = 87) and a German-speaking (n = 257) center. Construct validity was estimated via correlations with SF-36 physical and mental scales, pain intensity and hospital anxiety and depression scales (HADS). Scale homogeneities were assessed by Cronbach's α. Test-retest reliability was assessed on 65 additional patients using intraclass correlation (IC). For the French and German translations, respectively, α were 0.98 and 0.98; IC 0.98 (95% CI: [0.97; 1.00]) and 0.94 (0.90; 0.98). Correlations with physical functioning were 0.63 (0.48; 0.74) and 0.67 (0.59; 0.73); with physical summary 0.60 (0.44; 0.72) and 0.52 (0.43; 0.61); with pain -0.33 (-0.51; -0.13) and -0.51 (-0.60; -0.42); with mental health -0.08 (-0.29; 0.14) and 0.25 (0.13; 0.36); with mental summary 0.01 (-0.21; 0.23) and 0.28 (0.16; 0.39); with depression -0.26 (-0.45; -0.05) and -0.42 (-0.52; -0.32); with anxiety -0.17 (-0.37; -0.04) and -0.45 (-0.54; -0.35). Reliability was excellent for both languages. Convergent validity was good with SF-36 physical scales, moderate with VAS pain. Divergent validity was low with SF-36 mental scales in both translated versions and with HADS for the SFS-F (moderate in SFS-G). Both versions seem to be valid and reliable for evaluating perceived functional capacity in patients with back complaints.
Bacorro, Warren R; Sy Ortin, Teresa T; Suarez, Consuelo G; Mendoza, Tito R; Que, Jocelyn C
2017-06-01
Symptom burden and quality of life (QOL) are of particular importance in head-and-neck cancer treatment. The MD Anderson Symptom Inventory-Head-and-Neck (MDASI-HN) is a simple symptom assessment tool practicable for patient follow-up, but a validated Filipino translation was previously unavailable. The objectives of this study were to develop a valid Filipino translation of the MDASI-HN, to test the sensitivity of the validated MDASI core-F, and to report the prevalence and pattern of head-and-neck symptoms in our cohort. An MDASI-HN-Filipino (MDASI-HN-F) version was developed and examined for convergent validity, internal consistency, test-retest reliability, known-group validity and sensitivity to change. Eligible participants were aged 18-80 years, with histopathologically-proven head-and-neck (except thyroid) cancer, able to understand and read English and Filipino, and without cognitive impairment or other conditions precluding self-administration of the questionnaire. Participants (n=100) were aged 18-76 years; the majority were aged <60, male, married, had college schooling, or were from a Tagalog-speaking region. The validity of the MDASI HN-F was demonstrated in all parameters. Age or educational attainment did not affect convergent validity or test-retest reliability. At baseline, 48% had multiple moderate/severe symptoms and 38% had at least one severe symptom. The MDASI-HN-F is valid, reliable and sensitive. The sensitivity of the MDASI core-F is demonstrated, and its validity and reliability reaffirmed. Moderate and severe head-and-neck symptoms are prevalent in early-stage and advanced-stage head-and-neck cancers, reflecting the utility of symptom screening for improvement of symptom management, QOL and compliance to treatment.
Beardsley, Chris; Egerton, Tim; Skinner, Brendon
2016-01-01
Objective. The purpose of this study was to investigate the reliability of a digital pelvic inclinometer (DPI) for measuring sagittal plane pelvic tilt in 18 young, healthy males and females. Method. The inter-rater reliability and test-re-test reliabilities of the DPI for measuring pelvic tilt in standing on both the right and left sides of the pelvis were measured by two raters carrying out two rating sessions of the same subjects, three weeks apart. Results. For measuring pelvic tilt, inter-rater reliability was designated as good on both sides (ICC = 0.81-0.88), test-re-test reliability within a single rating session was designated as good on both sides (ICC = 0.88-0.95), and test-re-test reliability between two rating sessions was designated as moderate on the left side (ICC = 0.65) and good on the right side (ICC = 0.85). Conclusion. Inter-rater reliability and test-re-test reliability within a single rating session of the DPI in measuring pelvic tilt were both good, while test-re-test reliability between rating sessions was moderate-to-good. Caution is required regarding the interpretation of the test-re-test reliability within a single rating session, as the raters were not blinded. Further research is required to establish validity.
Erkes, Jérôme; Camp, Cameron J; Raffard, Stéphane; Gély-Nargeot, Marie-Christine; Bayard, Sophie
2017-01-01
This study evaluated the validity and reliability of the Montessori Assessment System. The Montessori Assessment System assesses preserved abilities in persons with moderate to severe dementia. In this respect, this instrument provides crucial information for the development of effective person-centered care plans. A total of 196 persons with a diagnosis of dementia in the moderate to severe stages of dementia were recruited in 10 long-term care facilities in France. All participants completed the Montessori Assessment System, the Clinical Dementia Rating Scale and/or the Mini Mental State Examination and the Severe Impairment Battery-short form. The internal consistency and temporal stability of the Montessori Assessment System were high. Additionally, good construct and divergent validity were demonstrated. Factor analysis showed a one-factor structure. The Montessori Assessment System demonstrated satisfactory psychometric properties while being a useful instrument to assess capabilities in persons with advanced stages of dementia and hence to develop person-centered plans of care.
Garaigordobil, Maite
2015-08-19
The purpose of the study was to analyze the psychometric properties of the Cyberbullying Test. The sample included 3,026 participants from the Basque Country (northern Spain), aged 12 to 18 years. Results confirmed high internal consistency and moderate temporal stability. Exploratory factor analysis yielded three moderately correlated factors (cyberobserver, cyberaggressor, and cybervictim). Confirmatory factor analysis ratified adequate model fit of the three factors. Convergent and discriminant validity were confirmed: (a) cybervictims use a variety of conflict resolution strategies, scoring high in neuroticism, openness, antisocial behavior, emotional attention, school-academic problems, shyness-withdrawal, psychopathological disorders, anxiety, and psychosomatic complaints, and low in agreeableness, responsibility, self-esteem, and social adjustment and (b) cyberaggressors use many aggressive conflict resolution strategies, scoring high in neuroticism, antisocial behavior, school-academic problems, psychopathological and psychosomatic disorders, and low in empathy, agreeableness, responsibility, emotion regulation, and social adjustment. The study confirms the test's reliability and validity. © The Author(s) 2015.
Le, Minh Thi Hong; Tran, Thach Duc; Holton, Sara; Nguyen, Huong Thanh; Wolfe, Rory; Fisher, Jane
2017-01-01
To assess the internal consistency, latent structure and convergent validity of the Depression, Anxiety and Stress Scale-21 (DASS-21) among adolescents in Vietnam. An anonymous, self-completed questionnaire was conducted among 1,745 high school students in Hanoi, Vietnam between October, 2013 and January, 2014. Confirmatory factor analyses were performed to assess the latent structure of the DASS-21. Factorial invariance between girls and boys was examined. Cronbach alphas and correlation coefficients between DASS-21 factor scores and the domain scores of the Duke Health Profile Adolescent Vietnamese validated version (ADHP-V) were calculated to assess DASS-21 internal consistency and convergent validity. A total of 1,606/1,745 (92.6%) students returned the questionnaire. Of those, 1,387 students provided complete DASS-21 data. The scale demonstrated adequate internal consistency (Cronbach α: 0.761 to 0.906). A four-factor model showed the best fit to the data. Items loaded significantly on a common general distress factor, the depression, and the anxiety factors, but few on the stress factor (p<0.05). DASS-21 convergent validity was confirmed with moderate correlation coefficients (-0.47 to -0.66) between its factor scores and the ADHP-V mental health related domains. The DASS-21 is reliable and suitable for use to assess symptoms of common mental health problems, especially depression and anxiety among Vietnamese adolescents. However, its ability to detect stress among these adolescents may be limited. Further research is warranted to explore these results.
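The internal consistency figures quoted in abstracts like the one above are Cronbach's α values. As a minimal sketch of how such a coefficient is computed from an item-by-respondent score table, the following uses the standard variance formula; the function name `cronbach_alpha` and the response data are hypothetical and are not taken from the study.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(rows[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Sample variance of each item across respondents.
    item_vars = [variance([r[j] for r in rows]) for j in range(k)]
    # Sample variance of each respondent's total score.
    total_var = variance([sum(r) for r in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical responses: 5 respondents x 3 items on a 0-3 scale.
scores = [
    [0, 1, 0],
    [1, 1, 2],
    [2, 2, 2],
    [3, 2, 3],
    [1, 0, 1],
]
alpha = cronbach_alpha(scores)
print(round(alpha, 3))
```

The formula is α = k/(k−1) · (1 − Σ item variances / variance of the total score), so α rises as the items covary more strongly relative to their individual variances.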
Validation of a Russian Language Oswestry Disability Index Questionnaire.
Yu, Elizabeth M; Nosova, Emily V; Falkenstein, Yuri; Prasad, Priya; Leasure, Jeremi M; Kondrashov, Dimitriy G
2016-11-01
Study Design Retrospective reliability and validity study. Objective To validate a recently translated Russian language version of the Oswestry Disability Index (R-ODI) using standardized methods detailed from previous validations in other languages. Methods We included all subjects who were seen in our spine surgery clinic, over the age of 18, and fluent in the Russian language. R-ODI was translated by six bilingual people and combined into a consensus version. R-ODI and visual analog scale (VAS) questionnaires for leg and back pain were distributed to subjects during both their initial and follow-up visits. Test validity, stability, and internal consistency were measured using standardized psychometric methods. Results Ninety-seven subjects participated in the study. No change in the meaning of the questions on R-ODI was noted with translation from English to Russian. There was a significant positive correlation between R-ODI and VAS scores for both the leg and back during both the initial and follow-up visits (p < 0.01 for all). The instrument was shown to have high internal consistency (Cronbach α = 0.82) and moderate test-retest stability (intraclass correlation coefficient = 0.70). Conclusions The R-ODI is both valid and reliable for use among the Russian-speaking population in the United States.
Mirsoleymani, Seyed Reza; Matbouei, Mahsa; Nasiri, Malihe; Vasli, Parvaneh
2016-01-01
Objective. The aim of this study was to investigate the psychometric properties of the Family Inventory of Resources for Management (FIRM) in a sample of family caregivers of cancer patients. Methods. In this methodological study, construct validity of the FIRM was evaluated by known groups and convergent validity in a convenience sample of family caregivers of cancer patients (n = 104) referred to the outpatient oncology wards of five educational hospitals in Tehran from January to April 2016. Reliability was determined by assessing the internal consistency and stability of the instrument. Results. The known-groups findings showed that there is a significant difference between the scores of the FIRM in family caregivers with different levels of caregiver burden (p < 0.001). Also, the results of convergent validity showed that there is a moderate negative correlation (r = −0.50; p < 0.001) between the total scores of the FIRM and the scores of the caregiver burden inventory (CBI). The FIRM showed a good internal consistency (α = 0.85) and a good stability of the test-retest reliability result. Conclusions. There is a sound psychometric basis for the use of the Persian translation of the FIRM for family studies in the Iranian population. PMID:28127470
Patel, Amit S; Siegert, Richard J; Bajwah, Sabrina; Brignall, Kate; Gosker, Harry R; Moxham, John; Maher, Toby M; Renzoni, Elisabetta A; Wells, Athol U; Higginson, Irene J; Birring, Surinder S
2015-09-01
Rasch analysis has largely replaced impact factor methodology for developing health status measures. The aim of this study was to develop a health status questionnaire for patients with interstitial lung disease (ILD) using impact factor methodology and to compare its validity with that of another version developed using Rasch analysis. A preliminary 71-item questionnaire was developed and evaluated in 173 patients with ILD. Items were reduced by the impact factor method (King's Brief ILD questionnaire, KBILD-I) and Rasch analysis (KBILD-R). Both questionnaires were validated by assessing their relationship with forced vital capacity (FVC) and St George's Respiratory Questionnaire (SGRQ) and by evaluating internal reliability, repeatability, and longitudinal responsiveness. The KBILD-R and KBILD-I comprised 15 items each. The content of eight items differed between the KBILD-R and KBILD-I. Internal and test-retest reliability was good for total scores of both questionnaires. There was a good relationship with SGRQ and moderate relationship with FVC for both questionnaires. Effect sizes were comparable. Both questionnaires discriminated patients with differing disease severity. Despite considerable differences in the content of retained items, both KBILD-R and KBILD-I questionnaires demonstrated acceptable measurement properties and performed comparably in a clinical setting. Copyright © 2015 Elsevier Inc. All rights reserved.
Mollayeva, Tatyana; Thurairajah, Pravheen; Burton, Kirsteen; Mollayeva, Shirin; Shapiro, Colin M; Colantonio, Angela
2016-02-01
This review appraises the process of development and the measurement properties of the Pittsburgh sleep quality index (PSQI), gauging its potential as a screening tool for sleep dysfunction in non-clinical and clinical samples; it also compares non-clinical and clinical populations in terms of PSQI scores. MEDLINE, Embase, PsycINFO, and HAPI databases were searched. Critical appraisal of studies of measurement properties was performed using COSMIN. Of 37 reviewed studies, 22 examined construct validity, 19 - known-group validity, 15 - internal consistency, and three - test-retest reliability. Study quality ranged from poor to excellent, with the majority designated fair. Internal consistency, based on Cronbach's alpha, was good. Discrepancies were observed in factor analytic studies. In non-clinical and clinical samples with known differences in sleep quality, the PSQI global scores and all subscale scores, with the exception of sleep disturbance, differed significantly. The best evidence synthesis for the PSQI showed strong reliability and validity, and moderate structural validity in a variety of samples, suggesting the tool fulfills its intended utility. A taxonometric analysis can contribute to better understanding of sleep dysfunction as either a dichotomous or continuous construct. Copyright © 2015 Elsevier Ltd. All rights reserved.
Validation and properties of the verbal numeric scale in children with acute pain.
Bailey, Benoit; Daoust, Raoul; Doyon-Trottier, Evelyne; Dauphin-Pierre, Sabine; Gravel, Jocelyn
2010-05-01
Although the verbal numeric scale (VNS) is used frequently at patients' bedsides, it has never been formally validated in children with acute pain. In order to validate this scale, a prospective cohort study was performed in children between 8 and 17 years presenting to a pediatric emergency department (ED) with acute pain. Pain was graded using the VNS, the visual analogue scale (VAS), and the verbal rating scale (VRS). A second assessment was done before discharge. We determined a priori that in order to be valid, the VNS would need to: correlate with the VAS (concurrent validity); decrease after intervention to reduce pain (construct validity); and be associated with the VRS categories (content validity). The VNS interchangeability with the VAS, its minimal clinically significant difference, and test-retest reliability were also determined. A total of 202 patients (mean age: 12.2 ± 2.6 years) were enrolled. The VNS correlated with the VAS: r(ic)=0.93, p<0.001. There were differences in the VNS before versus after interventions (p<0.001), and between VRS categories (mild versus moderate, p<0.001; moderate versus severe, p<0.001). The 95% limits of agreement (interchangeability) between VNS/VAS were outside the a priori set limit of ±2.0: -1.8, 2.5. The VNS minimal clinically significant difference was 1. The VNS had good test-retest reliability with 95% limits of agreement of -0.9 and 1.2. In conclusion, the VNS provides a valid and reliable scale to evaluate acute pain in children aged 8-17 years but is not interchangeable with the VAS. Copyright 2009 International Association for the Study of Pain. Published by Elsevier B.V. All rights reserved.
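The interchangeability criterion in the abstract above rests on 95% limits of agreement. A minimal sketch of that calculation, assuming the usual Bland-Altman form (mean difference ± 1.96 standard deviations of the paired differences), is given below. The paired scores and the helper names `limits_of_agreement` and `within_limit` are hypothetical; only the ±2.0 threshold and the reported limits of -1.8 and 2.5 come from the abstract.

```python
from statistics import mean, stdev


def limits_of_agreement(a, b):
    """95% limits of agreement for paired measurements a and b."""
    diffs = [x - y for x, y in zip(a, b)]
    bias = mean(diffs)   # mean difference between the two scales
    sd = stdev(diffs)    # sample SD of the paired differences
    return bias - 1.96 * sd, bias + 1.96 * sd


def within_limit(lower, upper, limit=2.0):
    """True when both limits fall inside the a priori band of +/- limit."""
    return -limit <= lower and upper <= limit


# Hypothetical paired pain scores (0-10) from six children.
vns = [3, 5, 7, 8, 4, 6]
vas = [2.5, 5.5, 6.0, 8.5, 3.0, 6.5]
lower, upper = limits_of_agreement(vns, vas)
print(round(lower, 2), round(upper, 2), within_limit(lower, upper))

# The limits reported in the abstract fail the +/-2.0 criterion.
print(within_limit(-1.8, 2.5))
```

Because the upper reported limit (2.5) exceeds 2.0, the check returns False for the study's values, which matches the abstract's conclusion that the two scales are not interchangeable.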
Reliability and validity of the Haitian Creole PHQ-9.
Marc, Linda G; Henderson, Whitney R; Desrosiers, Astrid; Testa, Marcia A; Jean, Samuel E; Akom, Eniko Edit
2014-12-01
There is limited information on depression in Haitians and this is partly attributable to the absence of culturally and linguistically adapted measures for depression. To perform a psychometric evaluation of the Haitian-Creole version of the PHQ-9 administered to men who have sex with men (MSM) in the Republic of Haiti. This study uses a cross-sectional design and data are from the Integrated Behavioral and Biological HIV Survey (IBBS) for MSM in Haiti. Inclusion criteria required that participants be male, ≥ 18 years, report sexual relations with a male partner in the last 12 months, and lived in Haiti during the past 3 months. Respondent Driven Sampling was used for participant recruitment. A structured questionnaire was verbally administered in Haitian-Creole capturing information on sociodemographics, sexual behaviors, human immunodeficiency virus (HIV) status and depressive symptomatology using the PHQ-9. Psychometric analyses of the translated PHQ-9 assessed unidimensionality, factor structure, reliability, construct validity, and differential item functioning (DIF) across subgroups (age, educational level, sexual orientation and HIV status). In a study population of 1,028 MSM, the Haitian-Creole version of the PHQ-9 is unidimensional, has moderately high internal consistency reliability (α = 0.78), and shows evidence of construct validity where HIV-positive subjects have greater depression (p = 0.002). There is no evidence of DIF across age, education, sexual orientation or HIV status. HIV-positive MSM are twice as likely to screen positive for moderately severe and severe depressive symptoms compared to their HIV-negative counterparts. There is strong evidence for the psychometric adequacy of the translated PHQ-9 screening tool as a measure of depression with MSM in Haiti. Future research is necessary to examine the predictive validity of depression for subsequent health behaviors or clinical outcomes among Haitian MSM.
Systematic review of the multidimensional fatigue symptom inventory-short form.
Donovan, Kristine A; Stein, Kevin D; Lee, Morgan; Leach, Corinne R; Ilozumba, Onaedo; Jacobsen, Paul B
2015-01-01
Fatigue is a subjective complaint that is believed to be multifactorial in its etiology and multidimensional in its expression. Fatigue may be experienced by individuals in different dimensions as physical, mental, and emotional tiredness. The purposes of this study were to review and characterize the use of the 30-item Multidimensional Fatigue Symptom Inventory-Short Form (MFSI-SF) in published studies and to evaluate the available evidence for its psychometric properties. A systematic review was conducted to identify published articles reporting results for the MFSI-SF. Data were analyzed to characterize internal consistency reliability of multi-item MFSI-SF scales and test-retest reliability. Correlation coefficients were summarized to characterize concurrent, convergent, and divergent validity. Standardized effect sizes were calculated to characterize the discriminative validity of the MFSI-SF and its sensitivity to change. Seventy articles were identified. Sample sizes reported ranged from 10 to 529 and nearly half consisted exclusively of females. More than half the samples were composed of cancer patients; of those, 59% were breast cancer patients. Mean alpha coefficients for MFSI-SF fatigue subscales ranged from 0.84 for physical fatigue to 0.93 for general fatigue. The MFSI-SF demonstrated moderate test-retest reliability in a small number of studies. Correlations with other fatigue and vitality measures were moderate to large in size and in the expected direction. The MFSI-SF fatigue subscales were positively correlated with measures of distress, depressive, and anxious symptoms. Effect sizes for discriminative validity ranged from medium to large, while effect sizes for sensitivity to change ranged from small to large. Findings demonstrate the positive psychometric properties of the MFSI-SF, provide evidence for its usefulness in medically ill and nonmedically ill individuals, and support its use in future studies.
Osypuk, Theresa L; Kehm, Rebecca; Misra, Dawn P
2015-01-01
Early life exposures influence numerous social determinants of health, as distal causes or confounders of later health outcomes. Although a growing literature is documenting how early life socioeconomic position affects later life health, few epidemiologic studies have tested measures for operationalizing early life neighborhood context, or examined their effects on later life health. In the Life-course Influences on Fetal Environments (LIFE) Study, a retrospective cohort study among Black women in Southfield, Michigan (71% response rate), we tested the validity and reliability of retrospectively-reported survey-based subjective measures of early life neighborhood context (N = 693). We compared 3 subjective childhood neighborhood measures (disorder, informal social control, victimization), with 3 objective childhood neighborhood measures derived from 4 decades of historical census tract data 1970-2000, linked through geocoded residential histories (tract % poverty, tract % black, tract deprivation score derived from principal components analysis), as well as with 2 subjective neighborhood measures in adulthood. Our results documented that internal consistency reliability was high for the subjective childhood neighborhood scales (Cronbach's α = 0.89, 0.93). Comparison of subjective with objective childhood neighborhood measures found moderate associations in hypothesized directions. Associations with objective variables were strongest for neighborhood disorder (rhos=.40), as opposed to with social control or victimization. Associations between subjective neighborhood context in childhood versus adulthood were moderate and stronger for residentially-stable populations. We lastly formally tested for, but found little evidence of, recall bias of the retrospective subjective reports of childhood context. These results provide evidence that retrospective reports of subjective neighborhood context may be a cost-effective, valid, and reliable method to operationalize early life context for health studies.
Hornsveld, Ruud H J; Nijman, Henk L I; Hollin, Clive R; Kraaimaat, Floor W
2007-01-01
The Observation Scale for Aggressive Behavior (OSAB) has been developed to evaluate inpatient treatment programs designed to reduce aggressive behavior in Dutch forensic psychiatric patients with an antisocial personality disorder, who are "placed at the disposal of the government". The scale should have the sensitivity to measure changes in the possible determinants of aggressive behavior, such as limited control of displayed negative emotions (irritation, anger or rage) and a general deficiency of social skills. In developing the OSAB 40 items were selected from a pool of 82 and distributed among the following a priori scales: Irritation/anger, Anxiety/gloominess, Aggressive behavior, Antecedent (to aggressive behavior), Sanction (for aggressive behavior) and Social behavior. The internal consistency of these subscales was good, the inter-rater reliability was moderate to good, and the test-retest reliability over a two to three week period was moderate to good. The correlation between the subscales Irritation/anger, Anxiety/gloominess, Aggressive behavior, Antecedent, Sanction was substantial and significant, but the anticipated negative correlation between these subscales and the Social behavior subscale could not be shown. Relationships between the corresponding subscales of the OSAB and the FIOS, used to calculate concurrent validity, yielded relatively high correlations. The validity of the various OSAB subscales could be further supported by significant correlations with the PCL-R and by significant but weak correlations with corresponding subscales of the self-report questionnaires. The Observation Scale for Aggressive Behavior (OSAB) seems to measure aggressive behavior in Dutch forensic psychiatric inpatients with an antisocial personality disorder reliably and validly. Contrary to expectations, a negative relationship was not found between aggressive and social behavior in either the OSAB or FIOS, which were used for calculating concurrent validity.
Kasitanon, N; Wangkaew, S; Puntana, S; Sukitawut, W; Leong, K P; Louthrenoo, W
2013-03-01
The English version of the Systemic Lupus Erythematosus Quality of Life Questionnaire (SLEQOL) is a validated disease-specific quality of life instrument. The aim of this study was to evaluate the psychometric properties of the Thai version of the SLEQOL (SLEQOL-TH). Two independent translators translated the SLEQOL into Thai. The back translation of this version was performed by two other independent translators. The final version, SLEQOL-TH, was completed after resolving the discrepancies revealed by the back translation. One hundred and nine patients with SLE were enrolled to test the reliability, construct validity, floor and ceiling effects, and sensitivity to the changes of the SLEQOL-TH at six months. The differential item functioning (DIF) between the Thai and English versions was analyzed using the partial gamma. The internal consistency of the SLEQOL-TH was satisfactory with the overall Cronbach's alpha of 0.86. The test-retest reliability of the SLEQOL-TH was acceptable with the intra-class correlation coefficient of 0.86. Low correlations between the SLEQOL-TH and SLEDAI were observed. The total score of the SLEQOL-TH was moderately responsive to changes in quality of life, with a standardized response mean of 0.50. When comparing the SLEQOL-TH from Thai SLE patients with the original SLEQOL version obtained from Singapore SLE patients, 11 out of 40 items showed a moderate to large DIF. The SLEQOL-TH has acceptable psychometric properties and shows construct validity. In comparison with the English version of SLEQOL, there are some items that showed DIF. The applicability of the SLEQOL-TH in real-life clinical practice and clinical trials needs to be determined.
Wilde, Elisabeth A.; Kelly, Tara M.; Weyand, Annie M.; Yallampalli, Ragini; Waldron, Eric J.; Pedroza, Claudia; Schnelle, Kathleen P.; Boake, Corwin; Levin, Harvey S.; Moretti, Paolo
2010-01-01
A standardized measure of neurological dysfunction specifically designed for TBI currently does not exist and the lack of assessment of this domain represents a substantial gap. To address this, the Neurological Outcome Scale for Traumatic Brain Injury (NOS-TBI) was developed for TBI outcomes research through the addition to and modification of items specifically relevant to patients with TBI, based on the National Institutes of Health Stroke Scale. In a sample of 50 participants (mean age = 33.3 years, SD = 12.9) ≤18 months (mean = 3.1, SD = 3.2) following moderate (n = 8) to severe (n = 42) TBI, internal consistency of the NOS-TBI was high (Cronbach's alpha = 0.942). Test-retest reliability also was high (ρ = 0.97, p < 0.0001), and individual item kappas between independent raters were excellent, ranging from 0.83 to 1.0. Overall inter-rater agreement between independent raters (Kendall's coefficient of concordance) for the NOS-TBI total score was excellent (W = 0.995). Convergent validity was demonstrated through significant Spearman rank-order correlations between the NOS-TBI and the concurrently administered Disability Rating Scale (ρ = 0.75, p < 0.0001), Rancho Los Amigos Scale (ρ = −0.60, p < 0.0001), Supervision Rating Scale (ρ = 0.59, p < 0.0001), and the FIM™ (ρ = −0.68, p < 0.0001). These results suggest that the NOS-TBI is a reliable and valid measure of neurological functioning in patients with moderate to severe TBI. PMID:20210595
Traynor, Marian; Galanouli, Despina; Roberts, Martin; Leonard, Lawrence; Gale, Thomas
2017-06-01
The aim of this study was to complement existing evidence on the suitability of Multiple Mini Interviews as a potential tool for the selection of nursing candidates on to a BSc (Hons) nursing programme. This study aimed to trial the Multiple Mini Interview approach to recruitment with a group of first year nursing students (already selected using traditional interviews). Cross-sectional validation study. This paper reports on the evaluation of the participants' detailed scores from the Multiple Mini Interview stations; their original interview scores and their end of year results. This study took place in March 2015. Scores from the seven Multiple Mini Interview stations were analysed to show the internal structure, reliability and generalizability of the stations. Original selection scores from interviews and in-course assessment were correlated with the MMI scores and variation by students' age, gender and disability status was explored. Reliability of the Multiple Mini Interview score was moderate (G = 0.52). The Multiple Mini Interview score provided better differentiation between more able students than did the original interview score but neither score was correlated with the module results. Multiple Mini Interview scores were positively associated with students' age but not their gender or disability status. The Multiple Mini Interview reported in this study offers a selection process that is based on the values and personal attributes regarded as desirable for a career in nursing and does not necessarily predict academic success. Its moderate reliability indicates the need for further improvement but it is capable of discriminating between candidates and shows little evidence of bias. © 2016 John Wiley & Sons Ltd.
2013-01-01
Summary of background data Recent smartphones, such as the iPhone, are often equipped with an accelerometer and magnetometer, which, through software applications, can perform various inclinometric functions. Although these applications are intended for recreational use, they have the potential to measure and quantify range of motion. The purpose of this study was to estimate the intra- and inter-rater reliability as well as the criterion validity of the clinometer and compass applications of the iPhone in the assessment of cervical range of motion in healthy participants. Methods The sample consisted of 28 healthy participants. Two examiners measured cervical range of motion of each participant twice using the iPhone (for the estimation of intra- and inter-rater reliability) and once with the CROM (for the estimation of criterion validity). Estimates of reliability and validity were then established using the intraclass correlation coefficient (ICC). Results We observed a moderate intra-rater reliability for each movement (ICC = 0.65-0.85) but a poor inter-rater reliability (ICC < 0.60). For the criterion validity, the ICCs are moderate (>0.50) to good (>0.65) for movements of flexion, extension, lateral flexions and right rotation, but poor (<0.50) for the movement left rotation. Conclusion We found good intra-rater reliability and lower inter-rater reliability. When compared to the gold standard, these applications showed moderate to good validity. However, before using the iPhone as an outcome measure in clinical settings, studies should be done on patients presenting with cervical problems. PMID:23829201
Critically re-evaluating a common technique: Accuracy, reliability, and confirmation bias of EMG.
Narayanaswami, Pushpa; Geisbush, Thomas; Jones, Lyell; Weiss, Michael; Mozaffar, Tahseen; Gronseth, Gary; Rutkove, Seward B
2016-01-19
(1) To assess the diagnostic accuracy of EMG in radiculopathy. (2) To evaluate the intrarater reliability and interrater reliability of EMG in radiculopathy. (3) To assess the presence of confirmation bias in EMG. Three experienced academic electromyographers interpreted 3 compact discs with 20 EMG videos (10 normal, 10 radiculopathy) in a blinded, standardized fashion without information regarding the nature of the study. The EMGs were interpreted 3 times (discs A, B, C) 1 month apart. Clinical information was provided only with disc C. Intrarater reliability was calculated by comparing interpretations in discs A and B, interrater reliability by comparing interpretation between reviewers. Confirmation bias was estimated by the difference in correct interpretations when clinical information was provided. Sensitivity was similar to previous reports (77%, confidence interval [CI] 63%-90%); specificity was 71%, CI 56%-85%. Intrarater reliability was good (κ 0.61, 95% CI 0.41-0.81); interrater reliability was lower (κ 0.53, CI 0.35-0.71). There was no substantial confirmation bias when clinical information was provided (absolute difference in correct responses 2.2%, CI -13.3% to 17.7%); the study lacked precision to exclude moderate confirmation bias. This study supports that (1) serial EMG studies should be performed by the same electromyographer since intrarater reliability is better than interrater reliability; (2) knowledge of clinical information does not bias EMG interpretation substantially; (3) EMG has moderate diagnostic accuracy for radiculopathy with modest specificity and electromyographers should exercise caution interpreting mild abnormalities. This study provides Class III evidence that EMG has moderate diagnostic accuracy and specificity for radiculopathy. © 2015 American Academy of Neurology.
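The accuracy and agreement statistics in the abstract above (sensitivity, specificity, κ) can be illustrated with a short sketch. It assumes binary calls (1 = radiculopathy, 0 = normal) and uses Cohen's κ for two sets of ratings; the abstract does not state which κ variant was used, and the reader calls and function names below are hypothetical, chosen only to mirror the 10 normal / 10 abnormal design.

```python
def cohen_kappa(rater_1, rater_2):
    """Cohen's kappa for two equal-length lists of categorical ratings."""
    n = len(rater_1)
    labels = set(rater_1) | set(rater_2)
    # Observed proportion of cases on which the two ratings agree.
    observed = sum(a == b for a, b in zip(rater_1, rater_2)) / n
    # Agreement expected by chance from each rater's marginal proportions.
    expected = sum(
        (rater_1.count(lab) / n) * (rater_2.count(lab) / n) for lab in labels
    )
    return (observed - expected) / (1 - expected)


def sensitivity_specificity(truth, calls):
    """Return (sensitivity, specificity) for binary truth and calls."""
    tp = sum(t == 1 and c == 1 for t, c in zip(truth, calls))
    tn = sum(t == 0 and c == 0 for t, c in zip(truth, calls))
    return tp / truth.count(1), tn / truth.count(0)


# Hypothetical data: 10 radiculopathy studies followed by 10 normal studies.
truth = [1] * 10 + [0] * 10
reader_a = [1] * 8 + [0] * 2 + [0] * 7 + [1] * 3
reader_b = [1] * 7 + [0] * 3 + [0] * 8 + [1] * 2

sens, spec = sensitivity_specificity(truth, reader_a)
kappa = cohen_kappa(reader_a, reader_b)
print(sens, spec, round(kappa, 3))
```

The same `cohen_kappa` function applies to intrarater agreement if the two lists hold one reader's calls on two occasions, as with discs A and B in the study.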
Charlton, Paula C; Mentiplay, Benjamin F; Pua, Yong-Hao; Clark, Ross A
2015-05-01
Traditional methods of assessing joint range of motion (ROM) involve specialized tools that may not be widely available to clinicians. This study assesses the reliability and validity of a custom Smartphone application for assessing hip joint range of motion. Intra-tester reliability with concurrent validity. Passive hip joint range of motion was recorded for seven different movements in 20 males on two separate occasions. Data from a Smartphone, bubble inclinometer and a three dimensional motion analysis (3DMA) system were collected simultaneously. Intraclass correlation coefficients (ICCs), coefficients of variation (CV) and standard error of measurement (SEM) were used to assess reliability. To assess validity of the Smartphone application and the bubble inclinometer against the three dimensional motion analysis system, intraclass correlation coefficients and fixed and proportional biases were used. The Smartphone demonstrated good to excellent reliability (ICCs>0.75) for four out of the seven movements, and moderate to good reliability for the remaining three movements (ICC=0.63-0.68). Additionally, the Smartphone application displayed comparable reliability to the bubble inclinometer. The Smartphone application displayed excellent validity when compared to the three dimensional motion analysis system for all movements (ICCs>0.88) except one, which displayed moderate to good validity (ICC=0.71). Smartphones are portable and widely available tools that are mostly reliable and valid for assessing passive hip range of motion, with potential for large-scale use when a bubble inclinometer is not available. However, caution must be taken in its implementation as some movement axes demonstrated only moderate reliability. Copyright © 2014 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Prowse, Ashleigh; Aslaksen, Berit; Kierkegaard, Marie; Furness, James; Gerdhem, Paul; Abbott, Allan
2017-01-18
To investigate the reliability and concurrent validity of the Baseline ® Body Level/Scoliosis meter for adolescent idiopathic scoliosis postural assessment in three anatomical planes. This is an observational reliability and concurrent validity study of adolescent referrals to the Orthopaedic department for scoliosis screening at Karolinska University Hospital, Stockholm, Sweden between March and May 2012. A total of 31 adolescents with idiopathic scoliosis (13.6 ± 0.6 years old) of mild-moderate curvatures (25° ± 12°) were consecutively recruited. Measurements of cervical, thoracic and lumbar curvatures, pelvic and shoulder tilt, and axial thoracic rotation (ATR) were performed by two trained physiotherapists in one day. The intraclass correlation coefficient (ICC) was used to determine the inter-examiner reliability (ICC2,1) and the intra-rater reliability (ICC3,3) of the Baseline ® Body Level/Scoliosis meter. Spearman's correlation analyses were used to estimate concurrent validity between the Baseline ® Body Level/Scoliosis meter and Gold Standard Cobb angles from radiographs and the Orthopaedic Systems Inc. Scoliometer. There was excellent reliability between examiners for thoracic kyphosis (ICC2,1 = 0.94), ATR (ICC2,1 = 0.92) and lumbar lordosis (ICC2,1 = 0.79). There was adequate reliability between examiners for cervical lordosis (ICC2,1 = 0.51), but poor reliability for pelvic and shoulder tilt. Both devices were reproducible in the measurement of ATR when repeated by one examiner (ICC3,3 0.98-1.00). The device had a good correlation with the Scoliometer (rho = 0.78). When compared with Cobb angle from radiographs, there was a moderate correlation for ATR (rho = 0.627). The Baseline ® Body Level/Scoliosis meter provides reliable transverse and sagittal cervical, thoracic and lumbar measurements and valid transverse plane measurements of mild-moderate scoliosis deformity.
Validation of general job satisfaction in the Korean Labor and Income Panel Study.
Park, Shin Goo; Hwang, Sang Hee
2017-01-01
The purpose of this study is to assess the validity and reliability of general job satisfaction (JS) in the Korean Labor and Income Panel Study (KLIPS). We used the data from the 17th wave (2014) of the nationwide KLIPS, which selected a representative panel sample of Korean households and individuals aged 15 or older residing in urban areas. We included in this study 7679 employed subjects (4529 males and 3150 females). The general JS instrument consisted of five items rated on a scale from 1 (strongly disagree) to 5 (strongly agree). The general JS reliability was assessed using the corrected item-total correlation and Cronbach's alpha coefficient. The validity of general JS was assessed using confirmatory factor analysis (CFA) and Pearson's correlation. The corrected item-total correlations ranged from 0.736 to 0.837. Therefore, no items were removed. Cronbach's alpha for general JS was 0.925, indicating excellent internal consistency. The CFA of the general JS model showed a good fit. Pearson's correlation coefficients for convergent validity showed moderate or strong correlations. The results obtained in our study confirm the validity and reliability of general JS.
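The two reliability statistics named in this abstract, Cronbach's alpha and the corrected item-total correlation, can be sketched in a few lines. This is an illustrative sketch only: the helper names and the six hypothetical respondents are assumptions, not KLIPS data. Each item is held as a list of scores, one per respondent, and the corrected item-total correlation is taken as the Pearson correlation between an item and the sum of the remaining items.

```python
from math import sqrt
from statistics import mean, variance


def cronbach_alpha(items):
    """items: list of item columns, each a list of respondent scores."""
    k = len(items)
    totals = [sum(scores) for scores in zip(*items)]  # per-respondent sum score
    item_variance_sum = sum(variance(column) for column in items)
    return k / (k - 1) * (1 - item_variance_sum / variance(totals))


def pearson_r(x, y):
    """Pearson correlation; assumes neither input is constant."""
    mx, my = mean(x), mean(y)
    covariance = sum((a - mx) * (b - my) for a, b in zip(x, y))
    spread_x = sqrt(sum((a - mx) ** 2 for a in x))
    spread_y = sqrt(sum((b - my) ** 2 for b in y))
    return covariance / (spread_x * spread_y)


def corrected_item_total(items):
    """Correlation of each item with the sum of all *other* items."""
    results = []
    for index, column in enumerate(items):
        others = [c for i, c in enumerate(items) if i != index]
        rest_total = [sum(scores) for scores in zip(*others)]
        results.append(pearson_r(column, rest_total))
    return results


# Hypothetical 5-item scale scored 1-5 by six respondents (not study data).
items = [
    [4, 5, 3, 2, 4, 1],
    [4, 4, 3, 2, 5, 1],
    [5, 5, 2, 2, 4, 2],
    [3, 5, 3, 1, 4, 1],
    [4, 4, 3, 2, 5, 2],
]
alpha = cronbach_alpha(items)
item_total = corrected_item_total(items)
```

Removing the item from its own total is what makes the correlation "corrected": otherwise each item would be correlated partly with itself, which inflates the value.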
Validity of the Neurology Quality-of-Life (Neuro-QoL) measurement system in adult epilepsy.
Victorson, David; Cavazos, Jose E; Holmes, Gregory L; Reder, Anthony T; Wojna, Valerie; Nowinski, Cindy; Miller, Deborah; Buono, Sarah; Mueller, Allison; Moy, Claudia; Cella, David
2014-02-01
Epilepsy is a chronic neurological disorder that results in recurring seizures and can have a significant adverse effect on health-related quality of life (HRQL). The Neuro-QoL measurement initiative is an NINDS-funded system of patient-reported outcome measures for neurology clinical research, which was designed to provide a precise and standardized way to measure HRQL in epilepsy and other neurological disorders. Using mixed-method and item response theory-based approaches, we developed generic item banks and targeted scales for adults and children with major neurological disorders. This paper provides empirical results from a clinical validation study with a sample of adults diagnosed with epilepsy. One hundred twenty-one people diagnosed with epilepsy participated, the majority of which were male (62%) and Caucasian (95%), with a mean age of 47.3 (SD=16.9). Baseline assessments included Neuro-QoL short forms and general and external validity measures. The Neuro-QoL short forms that are not typically found in other epilepsy-specific HRQL instruments include Stigma, Sleep Disturbance, Emotional and Behavioral Dyscontrol, and Positive Affect and Well-Being. Neurology Quality-of-Life short forms demonstrated adequate reliability (internal consistency range=.86-.96; test-retest range=.57-.89). Pearson correlations (p<.01) between Neuro-QoL forms of emotional distress (anxiety, depression, stigma) and the QOLIE-31 Emotional Well-Being subscale were in the moderate-to-strong range (r's=.66, .71 and .53, respectively), as were relations with the PROMIS Global Mental Health subscale (r's=.59, .74 and .52, respectively). Moderate correlations were observed between Neuro-QoL Social Role Performance and Satisfaction and the QOLIE-31 Social Function (r's=.58 and .52, respectively). 
In measuring aspects of physical function, the Neuro-QoL Mobility and Upper Extremity forms demonstrated moderate associations with the PROMIS Global Physical Function subscale (r's=.60 and .61, respectively). Neuro-QoL measures of perceived cognitive function (executive function and general concerns) produced moderate-to-strong correlations with the QOLIE-31 Cognition subscale (r's=.65 and .75, respectively) and moderate relations with the Liverpool Adverse Events Profile (r's=.51 and .69, respectively). Finally, the Neuro-QoL Fatigue measure demonstrated moderate associations with the QOLIE-31 Energy/Fatigue subscale (r=-.65), Liverpool Adverse Events Profile (r=.69), and the Liverpool Seizure Severity Scale (r=.50). Five Neuro-QoL short forms demonstrated statistically significant responsiveness to change at 5-7 months, including Fatigue, Sleep Disturbance, Depression, Positive Affect and Well-Being, and Emotional and Behavioral Dyscontrol. Overall, Neuro-QoL instruments showed good evidence for internal consistency, test-retest reliability, convergent validity, and responsiveness to change over several months. These results support the validity of Neuro-QoL to measure HRQL in adults with epilepsy. Copyright © 2013 Elsevier Inc. All rights reserved.
Rauseo Vera, Mayra; Gutiérrez-González, Luis Arturo; Maldonado, Irama; Al Snih, Soham
2017-09-21
Spondyloarthropathies (SpA) are disabling diseases with a prevalence of 1.9% in the general population. The indices designed for monitoring the disease should be valid, reliable and cross-culturally adapted for decision-making concerning the appropriate treatment. Changing an adjective or pronoun in a self-administered questionnaire could be the big difference in condensing an idea in a few words and transmitting that concept to all those who share the same language. To develop a Venezuelan version of the original English version of the BASDAI/BASFI and to evaluate its reliability and validity in Venezuelan patients with SpA. Certified linguists were needed for the translation of a Venezuelan version of the BASDAI/BASFI. The evaluation of reliability and validity was performed by calculating correlation coefficients in addition to Cronbach's alpha correlation between the BASDAI score and the clinical parameters (for example: erythrocyte sedimentation rate, C-reactive protein, modified Schöber test, occiput-to-wall distance and enthesis count). We studied 40 patients including 31 men (77.5%) and 9 women (22.5%). The mean age was 35.9 years ± standard deviation (SD) 12.01 and the disease duration was 11.5 years (± SD 9.5). The most common diagnoses were undifferentiated spondyloarthritis (45%), ankylosing spondylitis (27.5%) and psoriatic arthritis (20%). The incidences of reactive arthritis, ankylosing spondylitis and juvenile Reiter's syndrome were 2.5% each. The test-retest reliability of the BASDAI and BASFI was high (R = 0.99 and 0.99, respectively; P<.0001). The internal consistency for the BASDAI was high (Cronbach's alpha = 0.88; P=.002) and the intraclass correlation coefficient for internal consistency: 0.9867 (P=.001). Internal consistency for the BASFI: Cronbach's alpha = 0.7985 (P=.002), intraclass correlation coefficient for internal consistency: 0.9055 (P=.001). 
Construct validity of the BASDAI was high for general well-being of the patient (R = 0.84) and for enthesis count (R = 0.84). Low back pain showed moderate correlation with BASDAI (R = 0.69; P<.0001) and the erythrocyte sedimentation rate showed a low correlation (R = 0.39683; P=.0112). The Venezuelan version of the BASDAI/BASFI could be used in clinical research to assess and evaluate the course of disease activity in Venezuelan SpA patients. Copyright © 2017 Elsevier España, S.L.U. and Sociedad Española de Reumatología y Colegio Mexicano de Reumatología. All rights reserved.
Test-retest reliability of an fMRI paradigm for studies of cardiovascular reactivity.
Sheu, Lei K; Jennings, J Richard; Gianaros, Peter J
2012-07-01
We examined the reliability of measures of fMRI, subjective, and cardiovascular reactions to standardized versions of a Stroop color-word task and a multisource interference task. A sample of 14 men and 12 women (30-49 years old) completed the tasks on two occasions, separated by a median of 88 days. The reliability of fMRI BOLD signal changes in brain areas engaged by the tasks was moderate, and aggregating fMRI BOLD signal changes across the tasks improved test-retest reliability metrics. These metrics included voxel-wise intraclass correlation coefficients (ICCs) and overlap ratio statistics. Task-aggregated ratings of subjective arousal, valence, and control, as well as cardiovascular reactions evoked by the tasks showed ICCs of 0.57 to 0.87 (ps < .001), indicating moderate-to-strong reliability. These findings support using these tasks as a battery for fMRI studies of cardiovascular reactivity. Copyright © 2012 Society for Psychophysiological Research.
A survey of “mental hardiness” and “mental toughness” in professional male football players
2014-01-01
Background It is not uncommon for chiropractors to be associated with sports teams for injury prevention, treatment, or performance enhancement. There is increasing acceptance of the importance of sports psychology in the overall management of athletes. Recent findings indicate mental hardiness can be determined reliably using specific self-assessment questionnaires. This study set out to investigate the hardiness scores of professional footballers and examine the correlation between two questionnaires. It also included a mental hardiness rating of players by two coaches, and examined differences in hardiness and mental toughness between national and international players. Methods Two self-assessment questionnaires (modified Sports Mental Toughness Questionnaire [SMTQ-M] and Psychological Performance Inventory [PPI-A]) were completed by 20 male professional footballers. Two coaches independently rated each player. A percentage score from each questionnaire was awarded to each player and an average score was calculated ({SMTQ-M % + PPI-A %} ÷ 2). The PPI-A and SMTQ-M scores obtained for each player were analysed for correlation with Pearson’s correlation coefficient. Cohen’s kappa inter-reliability coefficient was used to determine agreement between coaches, and between the players’ hardiness scores and coaches’ ratings. The independent t-test was used to examine differences between national and international players. Results The players’ scores obtained from PPI-A and SMTQ-M correlated well (r = 0.709, p < 0.001). The coaches’ ratings showed significant, weak to moderate agreement (Cohen's kappa = 0.33). No significant agreement was found between player self-assessments and coaches’ ratings. The average ({SMTQ-M % + PPI-A %} ÷ 2) mean score was 77% (SD = 7.98) with international players scoring 7.4% (p = 0.04) higher than non-international players. Conclusions The questionnaires (SMTQ-M and PPI-A) correlated well in their outcome scores. 
These findings suggest that coaches moderately agree when assessing the level of mental hardiness of football players. There was no agreement between player self-assessment and ratings by coaches. Footballers who play or had played for national teams achieved slightly higher mental hardiness scores. Either questionnaire can offer the clinician a cost-effective, valuable measure of an individual’s psychological attributes, which could be relevant within the wider context of bio-psycho-social model of care. PMID:24735867
Psychometrics of the Personal Questionnaire: A client-generated outcome measure.
Elliott, Robert; Wagner, John; Sales, Célia M D; Rodgers, Brian; Alves, Paula; Café, Maria J
2016-03-01
We present a range of evidence for the reliability and validity of data generated by the Personal Questionnaire (PQ), a client-generated individualized outcome measure, using 5 data sets from 3 countries. Overall pretherapy mean internal consistency (alpha) across clients was .80, and within-client alphas averaged .77; clients typically had 1 or 2 items that did not vary with the other items. Analyses of temporal structure indicated high levels of between-clients variance (58%), moderate pretherapy test-retest correlation (r = .57), and high session-to-session Lag-1 autocorrelation (.82). Scores on the PQ provided clear evidence of convergence with a range of outcome measures (within-client r = .41). Mean pre-post effects were large (d = 1.25). The results support a revised caseness cutoff of 3.25 and a reliable change index interval of 1.67. We conclude that PQ data meet criteria for evidence-based, norm-referenced measurement of client psychological distress for supporting psychotherapy practice and research. (c) 2016 APA, all rights reserved.
Storch, Eric A; Wood, Jeffrey J; Ehrenreich-May, Jill; Jones, Anna M; Park, Jennifer M; Lewin, Adam B; Murphy, Tanya K
2012-11-01
The psychometric properties of the Pediatric Anxiety Rating Scale (PARS), a clinician-administered measure for assessing severity of anxiety symptoms, were examined in 72 children and adolescents diagnosed with an autism spectrum disorder (ASD). The internal consistency of the PARS was 0.59, suggesting that the items were related but not repetitive. The PARS showed high 26-day test-retest (ICC = 0.83) and inter-rater reliability (ICC = 0.86). The PARS was strongly correlated with clinician-ratings of overall anxiety severity and parent-report anxiety measures, supporting convergent validity. Results for divergent validity were mixed. Although the PARS was not associated with the sum of the Social and Communication items on the Autism Diagnostic Observation System, it was moderately correlated with parent-reported inattention, aggression and externalizing behavior. Overall, these results suggest that the psychometric properties of the PARS are adequate for assessing anxiety symptoms in youth with ASD, although additional clarification of divergent validity is needed.
Nakagami, Katsuyuki; Yamauchi, Toyoaki; Noguchi, Hiroyuki; Maeda, Tohru; Nakagami, Tomoko
2014-06-01
This study aimed to develop a reliable and valid measure of functional health literacy in a Japanese clinical setting. Test development consisted of three phases: generation of an item pool, consultation with experts to assess content validity, and comparison with external criteria (the Japanese Health Knowledge Test) to assess criterion validity. A trial version of the test was administered to 535 Japanese outpatients. Internal consistency reliability, calculated by Cronbach's alpha, was 0.81, and concurrent validity was moderate. Receiver Operating Characteristics and Item Response Theory were used to classify patients as having adequate, marginal, or inadequate functional health literacy. Both inadequate and marginal functional health literacy were associated with older age, lower income, lower educational attainment, and poor health knowledge. The time required to complete the test was 10-15 min. This test should enable health workers to better identify patients with inadequate health literacy. © 2013 Wiley Publishing Asia Pty Ltd.
Measuring hope among families impacted by cognitive impairment.
Hunsaker, Amanda E; Terhorst, Lauren; Gentry, Amanda; Lingler, Jennifer H
2016-07-01
The current exploratory investigation aims to establish the reliability and validity of a hope measure, the Herth Hope Index, among families impacted by early cognitive impairment (N = 96). Exploratory factor analysis was used to examine the dimensionality of the measure. Bivariate analyses were used to examine construct validity. The sample had moderately high hope scores. A two-factor structure emerged from the factor analysis, explaining 51.44% of the variance. Both factors exhibited strong internal consistency (Cronbach's alphas ranged from .83 to .86). Satisfaction with social support was positively associated with hope, supporting convergent validity. Neurocognitive status, illness insight, and depression were not associated with hope, indicating discriminant validity. Families impacted by cognitive impairment may maintain hope in the face of a potentially progressive illness, regardless of cognitive status. The Herth Hope Index can be utilized as a reliable and valid measure of hope by practitioners providing support to families impacted by cognitive impairment. © The Author(s) 2014.
Vachon, David D; Lynam, Donald R
2016-04-01
Low empathy is a criterion for most externalizing disorders, and empathy training is a regular component of treatment for aggressive people, from school bullies to sex offenders. However, recent meta-analytic evidence suggests that current measures of empathy explain only 1% of the variance in aggressive behavior. A new assessment of empathy was developed to more fully represent the empathy construct and better predict important outcomes--particularly aggressive behavior and externalizing psychopathology. Across three independent samples (N = 210-708), the 36-item Affective and Cognitive measure of Empathy (ACME) was internally consistent, structurally reliable, and invariant across sex. The ACME bore significant associations to important outcomes, which were incremental relative to other measures of empathy and generalizable across sex. Importantly, the affective scales of the ACME-particularly a new "Affective Dissonance" scale--yielded moderate to strong associations with aggressive behavior and externalizing disorders. The ACME is a short, reliable, and useful measure of empathy. © The Author(s) 2015.
Moderate and late preterm birth: effect on brain size and maturation at term-equivalent age.
Walsh, Jennifer M; Doyle, Lex W; Anderson, Peter J; Lee, Katherine J; Cheong, Jeanie L Y
2014-10-01
To compare the size of multiple brain structures, maturation in terms of both brain myelination and gyral development, and evidence of brain injury between moderate and late preterm (MLPT) and term-born infants at term-equivalent age. The study was approved by the human research ethics committees of the participating hospitals, and informed parental consent was obtained for all infants. One hundred ninety-nine MLPT and 50 term-born infants underwent 3-T magnetic resonance (MR) imaging brain examinations at 38-44 weeks of corrected gestational age. T1- and T2-weighted MR images were compared between groups for size of multiple cerebral structures, degree of myelination in the posterior limb of the internal capsule, gyral maturation, signal intensity abnormalities, and presence of cysts by a single assessor who was blinded to the gestational group and perinatal course of the infants. Group differences were compared by using linear regression for continuous variables and logistic regression for categorical variables, and interrater and intrarater reliability was assessed by using intraclass correlation coefficients. Compared with those in the term-born control group, measurements of brain biparietal diameter, corpus callosum, basal ganglia and thalami, and cerebellum were smaller in infants in the MLPT group (all P ≤ .01), while extracerebral space was larger (P < .0001). Myelination of the posterior limb of the internal capsule was less developed, and gyral maturation was delayed in the MLPT group (both P < .001). Signal intensity abnormalities and cysts were uncommon in both groups, with 13 (6.5%) MLPT infants and one (2%) term infant having abnormalities. Inter- and intrarater reliability was good for most measures, with intraclass correlation coefficients generally greater than 0.68. 
MLPT birth is associated with smaller brain size, less-developed myelination of the posterior limb of the internal capsule, and more immature gyral folding than those associated with full-term birth. These brain changes may form the basis of some of the long-term neurodevelopmental deficits observed in MLPT children. Online supplemental material is available for this article. © RSNA, 2014.
Moderation for Professional Learning
ERIC Educational Resources Information Center
Earle, Sarah
2017-01-01
Moderation is put forward as the key strategy for improving the reliability of teacher assessment. However, for many teachers the word "moderation" conjures up ideas of uncomfortable situations in which marking is being checked by others and there are prolonged arguments about tiny features of individual work. In this article, the…
Igwesi-Chidobe, Chinonso N; Obiekwe, Chinwe; Sorinola, Isaac O; Godfrey, Emma L
2017-12-14
Cross-culturally adapt and validate the Igbo Roland Morris Disability Questionnaire. Cross-cultural adaptation, test-retest, and cross-sectional psychometric testing. Roland Morris Disability Questionnaire was forward and back translated by clinical/non-clinical translators. An expert committee appraised the translations. Twelve participants with chronic low back pain pre-tested the measure in a rural Nigerian community. Internal consistency using Cronbach's alpha; test-retest reliability using intra-class correlation coefficient and Bland-Altman plot; and minimal detectable change were investigated in a convenience sample of 50 people with chronic low back pain in rural and urban Nigeria. Pearson's correlation analyses using the eleven-point box scale and back performance scale, and exploratory factor analysis were used to examine construct validity in a random sample of 200 adults with chronic low back pain in rural Nigeria. Ceiling and floor effects were investigated in the two samples. Modifications gave the option of interviewer-administration and reflected Nigerian social context. The measure had excellent internal consistency (α = 0.91) and intraclass correlation coefficient (ICC = 0.84), moderately high correlations (r > 0.6) with performance-based disability and pain intensity, and a predominant uni-dimensional structure, with no ceiling or floor effects. Igbo Roland Morris Disability Questionnaire is a valid and reliable measure of pain-related disability. Implications for rehabilitation Low back pain is the leading cause of years lived with disability worldwide, and is particularly prevalent in rural Nigeria, but there are no self-report measures to assess its impact due to low literacy rates. This study describes the cross-cultural adaptation and validation of a core self-report back pain specific disability measure in a low-literate Nigerian population. 
The Igbo Roland Morris Disability Questionnaire is a reliable and valid measure of self-reported disability in Igbo populations as indicated by excellent internal consistency (α = 0.91) and intra-class correlation coefficient (ICC =0.84), moderately high correlations (r > 0.6) with performance-based disability and pain intensity that supports a pain-related disability construct, a predominant one factor structure with no ceiling or floor effects. The measure will be useful for researchers and clinicians examining the factors associated with low back pain disability or the effects of interventions on low back pain disability in this culture. This measure will support global health initiatives concurrently involving people from several cultures or countries, and may inform cross-cultural disability research in other populations.
Dijkhuizen, Annemarie; Douma, Rob K; Krijnen, Wim P; van der Schans, Cees P; Waninge, Aly
2018-05-30
A feasible and reliable instrument to measure strength in persons with severe intellectual and visual disabilities (SIVD) is lacking. The aim of our study was to determine feasibility, learning period and reliability of three strength tests. Twenty-nine participants with SIVD performed the Minimum Sit-to-Stand Height test (MSST), the Leg Extension test (LE) and the 30 seconds Chair-Stand test (30sCS), once per week for 5 weeks. Feasibility was determined by the percentage of successful measurements; learning effect by using paired t test between two consecutive measurements; test-retest reliability by intraclass correlation coefficient and Limits of Agreement and, correlations by Pearson correlations. A sufficient feasibility and learning period of the tests was shown. The methods had sufficient test-retest reliability and moderate-to-sufficient correlations. The MSST, the LE, and the 30sCS are feasible tests for measuring muscle strength in persons with SIVD, having sufficient test re-test reliability. © 2018 John Wiley & Sons Ltd.
2009-02-17
Identification of Classified Information in Unclassified DoD Systems During the Audit of Internal Controls and Data Reliability in the Deployable Disbursing System (Report No. D-2009-054). We are providing this...
Yi, Honglei; Wei, Xianzhao; Zhang, Wei; Chen, Ziqiang; Wang, Xinhui; Ji, Xinran; Zhu, Xiaodong; Wang, Fei; Xu, Ximing; Li, Zhikun; Fan, Jianping; Wang, Chuanfeng; Chen, Kai; Zhang, Guoyou; Zhao, Yinchuan; Li, Ming
2014-05-01
This was a prospective clinical validation study. To evaluate the reliability and validity of the adapted simplified Chinese version of Swiss Spinal Stenosis (SC-SSS) Questionnaire. The SSS Questionnaire is a reliable and valid instrument to assess the perception of function and pain for patients with degenerative lumbar spinal stenosis. However, there is no culturally adapted SSS Questionnaire for use in mainland China. The adaption was conducted according to International Quality of Life Assessment Project guidelines. To examine the psychometric properties of the adapted SC-SSS Questionnaire, a sample of 105 patients with lumbar spinal stenosis was included. Thirty-two patients were randomly selected to evaluate the test-retest reliability. Reliability assessment of the SC-SSS Questionnaire was determined by calculating Cronbach α and intraclass coefficient values. Concurrent validity was assessed by correlating SC-SSS Questionnaire scores with relevant domains of the 36-Item Short Form Health Survey. Cronbach α values for the symptom severity scale, physical function scale, and patient's satisfaction scale of the SC-SSS Questionnaire were 0.89, 0.86, and 0.91, respectively, which revealed very good internal consistency. The test-retest reproducibility was found to be excellent with intraclass correlation coefficients of 0.93, 0.91, and 0.95. In terms of concurrent validity, SC-SSS Questionnaire had good correlation with physical functioning and bodily pain of 36-Item Short Form Health Survey (r = 0.663, 0.653) and low correlation with mental health (r = 0.289). The physical function scale had good correlation with physical functioning of 36-Item Short Form Health Survey (r = 0.637), whereas the rest had moderate correlation. The satisfaction scale score was highly correlated with the change in the symptom severity (r = 0.71) and physical function (r = 0.68) scale score. 
The SC-SSS Questionnaire showed satisfactory reliability and validity in the evaluation of functionality in patients with lumbar spinal stenosis who are experiencing neurogenic claudication. It is simple and easy to use and can be recommended in clinical and research practice in mainland China. 3.
Rolland, Yan; Vérin, Marc; Payan, Christine A; Duchesne, Simon; Kraft, Eduard; Hauser, Till K; Jarosz, Josef; Deasy, Neil; Defevbre, Luc; Delmaire, Christine; Dormont, Didier; Ludolph, Albert C; Bensimon, Gilbert
2011-01-01
Aim To evaluate a standardised MRI acquisition protocol and a new image rating scale for disease severity in patients with progressive supranuclear palsy (PSP) and multiple systems atrophy (MSA) in a large multicentre study. Methods The MRI protocol consisted of two-dimensional sagittal and axial T1, axial PD, and axial and coronal T2 weighted acquisitions. The 32 item ordinal scale evaluated abnormalities within the basal ganglia and posterior fossa, blind to diagnosis. Among 760 patients in the study population (PSP=362, MSA=398), 627 had per protocol images (PSP=297, MSA=330). Intra-rater (n=60) and inter-rater (n=555) reliability were assessed through Cohen's κ statistic, and scale structure through principal component analysis (PCA) (n=441). Internal consistency and reliability were checked. Discriminant and predictive validity of extracted factors and total scores were tested for disease severity as per clinical diagnosis. Results Intra-rater and inter-rater reliability were acceptable for 25 (78%) of the items scored (≥0.41). PCA revealed four meaningful clusters of covarying parameters (factor (F) F1: brainstem and cerebellum; F2: midbrain; F3: putamen; F4: other basal ganglia) with good to excellent internal consistency (Cronbach α 0.75–0.93) and moderate to excellent reliability (intraclass coefficient: F1: 0.92; F2: 0.79; F3: 0.71; F4: 0.49). The total score significantly discriminated for disease severity or diagnosis; factorial scores differentially discriminated for disease severity according to diagnosis (PSP: F1–F2; MSA: F2–F3). The total score was significantly related to survival in PSP (p<0.0007) or MSA (p<0.0005), indicating good predictive validity. Conclusions The scale is suitable for use in the context of multicentre studies and can reliably and consistently measure MRI abnormalities in PSP and MSA. 
Clinical Trial Registration Number The study protocol was filed in the open clinical trial registry (http://www.clinicaltrials.gov) with ID No NCT00211224. PMID:21386111
Karazsia, Bryan T; van Dulmen, Manfred H M; Wong, Kendal; Crowther, Janis H
2013-09-01
Internalization of societal standards of physical attractiveness (i.e., internalization of the thin ideal for women and internalization of the mesomorphic ideal for men) is a widely studied and robust risk factor for body dissatisfaction and maladaptive body change behaviors. Substantial empirical research supports internalization as both a mediator and a moderator of the relation between societal influences and body dissatisfaction. In this paper, a primer on mediation and moderation is followed by a review of literature and discussion of the extent to which internalization can theoretically fulfill the roles of both mediation and moderation. The literature review revealed a stark contrast in research design (experimental versus non-experimental design) when alternate conceptualizations of internalization are adopted. A meta-theoretical, moderated mediation model is presented. This model integrates previous research and can inform future empirical and clinical endeavors. Copyright © 2013 Elsevier Ltd. All rights reserved.
Spaan, Suzanne; Pronk, Anjoeka; Koch, Holger M; Jusko, Todd A; Jaddoe, Vincent W V; Shaw, Pamela A; Tiemeier, Henning M; Hofman, Albert; Pierik, Frank H; Longnecker, Matthew P
2015-05-01
The widespread use of organophosphate (OP) pesticides has resulted in ubiquitous exposure in humans, primarily through their diet. Exposure to OP pesticides may have adverse health effects, including neurobehavioral deficits in children. The optimal design of new studies requires data on the reliability of urinary measures of exposure. In the present study, urinary concentrations of six dialkyl phosphate (DAP) metabolites, the main urinary metabolites of OP pesticides, were determined in 120 pregnant women participating in the Generation R Study in Rotterdam. Intra-class correlation coefficients (ICCs) across serial urine specimens taken at <18, 18-25, and >25 weeks of pregnancy were determined to assess reliability. Geometric mean total DAP metabolite concentrations were 229 (GSD 2.2), 240 (GSD 2.1), and 224 (GSD 2.2) nmol/g creatinine across the three periods of gestation. Metabolite concentrations from the serial urine specimens in general correlated moderately. The ICCs for the six DAP metabolites ranged from 0.14 to 0.38 (0.30 for total DAPs), indicating weak to moderate reliability. Although the DAP metabolite levels observed in this study are slightly higher and slightly more correlated than in previous studies, the low to moderate reliability indicates a high degree of within-person variability, which presents challenges for designing well-powered epidemiological studies.
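As an illustration of the reliability statistic reported in the abstract above, the following minimal Python sketch computes a one-way random-effects intraclass correlation coefficient for repeated measurements per subject. The abstract does not say which ICC form or software was used, so the one-way form, the function name `icc_oneway`, and the sample values are assumptions chosen only for illustration.

```python
from statistics import mean

def icc_oneway(rows):
    """One-way random-effects ICC(1,1).

    rows: one list per subject, each holding the same number k of
    repeated measurements (for example, three serial urine specimens).
    """
    n = len(rows)          # number of subjects
    k = len(rows[0])       # measurements per subject
    grand = mean(v for r in rows for v in r)
    subject_means = [mean(r) for r in rows]
    # Between-subject and within-subject mean squares from one-way ANOVA.
    msb = k * sum((m - grand) ** 2 for m in subject_means) / (n - 1)
    msw = sum((v - m) ** 2
              for r, m in zip(rows, subject_means)
              for v in r) / (n * (k - 1))
    return (msb - msw) / (msb + (k - 1) * msw)

# Hypothetical log-scale concentrations for four subjects, three visits each.
example = [[5.1, 5.6, 4.9], [5.8, 5.2, 5.5], [4.7, 5.3, 5.0], [5.4, 5.9, 5.1]]
print(round(icc_oneway(example), 3))
```

A value near 0 means most of the variance lies within a person across visits, which is the situation the abstract describes as a challenge for study design.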
Alyusuf, Raja H; Prasad, Kameshwar; Abdel Satir, Ali M; Abalkhail, Ali A; Arora, Roopa K
2013-01-01
The exponential growth in the use of the internet as a learning resource, coupled with the varied quality of many websites, has led to a need to identify suitable websites for teaching purposes. The aim of this study is to develop and validate a tool that evaluates the quality of undergraduate medical educational websites, and to apply it to the field of pathology. A tool was devised through several steps of item generation, reduction, weighting, pilot testing, post-pilot modification, and validation. Tool validation included measurement of inter-observer reliability and generation of criterion-related, construct-related and content-related validity. The validated tool was subsequently tested by applying it to a population of pathology websites. Reliability testing showed high internal consistency reliability (Cronbach's alpha = 0.92), high inter-observer reliability (Pearson's correlation r = 0.88), intraclass correlation coefficient = 0.85 and κ = 0.75. It showed high criterion-related, construct-related and content-related validity. The tool showed moderately high concordance with the gold standard (κ = 0.61); 92.2% sensitivity, 67.8% specificity, 75.6% positive predictive value and 88.9% negative predictive value. The validated tool was applied to 278 websites; 29.9% were rated as recommended, 41.0% as recommended with caution and 29.1% as not recommended. A systematic tool was devised to evaluate the quality of websites for medical educational purposes. The tool was shown to yield reliable and valid inferences through its application to pathology websites.
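The concordance figures in the abstract above (sensitivity, specificity, predictive values and κ) all derive from a 2×2 table of tool rating against gold standard. The sketch below shows the standard formulas in Python. The abstract does not report the underlying counts, so the counts used here are hypothetical and the function name `two_by_two_metrics` is an assumption for illustration.

```python
def two_by_two_metrics(tp, fp, fn, tn):
    """Diagnostic accuracy and Cohen's kappa from a 2x2 table.

    tp/fp/fn/tn: true positives, false positives, false negatives,
    true negatives of the tool measured against the gold standard.
    """
    n = tp + fp + fn + tn
    observed = (tp + tn) / n
    # Agreement expected by chance, from the marginal totals.
    expected = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / n ** 2
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "kappa": (observed - expected) / (1 - expected),
    }

# Hypothetical counts, not taken from the study.
for name, value in two_by_two_metrics(tp=40, fp=10, fn=5, tn=45).items():
    print(f"{name}: {value:.3f}")
```

Unlike sensitivity and specificity, the predictive values and κ depend on how many websites in the sample are truly suitable, so they do not transfer directly to samples with a different mix.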
Reliability and Validity of the PAQ-C Questionnaire to Assess Physical Activity in Children.
Benítez-Porres, Javier; López-Fernández, Iván; Raya, Juan Francisco; Álvarez Carnero, Sabrina; Alvero-Cruz, José Ramón; Álvarez Carnero, Elvis
2016-09-01
Physical activity (PA) assessment by questionnaire is a cornerstone in the field of sport epidemiology studies. The Physical Activity Questionnaire for Children (PAQ-C) has been used widely to assess PA in healthy school populations. The aim of this study was to evaluate the reliability and validity of the PAQ-C questionnaire in Spanish children using triaxial accelerometry as criterion. Eighty-three (N = 46 boys, N = 37 girls) healthy children (age 10.98 ± 1.17 years, body mass index 19.48 ± 3.51 kg/m²) were volunteers and completed the PAQ-C twice and wore an accelerometer for 8 consecutive days. Reliability was analyzed by the intraclass correlation coefficient (ICC) and the internal consistency by the Cronbach's α coefficient. The PAQ-C was compared against total PA and moderate to vigorous PA (MVPA) obtained by accelerometry. Test-retest reliability showed an ICC = 0.96 for the final score of PAQ-C. Small differences between first and second questionnaire administration were detected. Few and low correlations (rho = 0.228-0.278, all ps < .05) were observed between PAQ-C and accelerometry. The highest correlation was observed for item 9 (rho = 0.311, p < .01). PAQ-C had a high reliability but a questionable validity for assessing total PA and MVPA in Spanish children. Therefore, PA measurement in children should not be limited only to self-report measurements. © 2016, American School Health Association.
A Turkish Version of the Critical-Care Pain Observation Tool: Reliability and Validity Assessment.
Aktaş, Yeşim Yaman; Karabulut, Neziha
2017-08-01
The study aim was to evaluate the validity and reliability of the Critical-Care Pain Observation Tool in critically ill patients. A repeated measures design was used for the study. A convenience sample of 66 patients who had undergone open-heart surgery in the cardiovascular surgery intensive care unit in Ordu, Turkey, was recruited for the study. The patients were evaluated by using the Critical-Care Pain Observation Tool at rest, during a nociceptive procedure (suctioning), and 20 minutes after the procedure while they were conscious and intubated after surgery. The Turkish version of the Critical-Care Pain Observation Tool has shown statistically acceptable levels of validity and reliability. Inter-rater reliability was supported by moderate-to-high-weighted κ coefficients (weighted κ coefficient = 0.55 to 1.00). For concurrent validity, significant associations were found between the scores on the Critical-Care Pain Observation Tool and the Behavioral Pain Scale scores. Discriminant validity was also supported by higher scores during suctioning (a nociceptive procedure) versus non-nociceptive procedures. The internal consistency of the Critical-Care Pain Observation Tool was 0.72 during a nociceptive procedure and 0.71 during a non-nociceptive procedure. The validity and reliability of the Turkish version of the Critical-Care Pain Observation Tool was determined to be acceptable for pain assessment in critical care, especially for patients who cannot communicate verbally. Copyright © 2016 American Society of PeriAnesthesia Nurses. Published by Elsevier Inc. All rights reserved.
Lee, Jennifer; Koh, Jung Hee; Kwok, Seung-Ki; Park, Sung-Hwan
2016-05-01
This study was conducted to generate and validate a cross-culturally adapted Korean version of the xerostomia inventory (XI), an 11-item questionnaire designed to measure the severity of xerostomia. The original English version of the XI was translated into Korean according to the guidelines for cross-cultural adaptation of health-related quality-of-life measures. Among a prospective cohort of primary Sjögren's syndrome (pSS) in Korea, 194 patients were analyzed. Internal consistency was evaluated by using Cronbach's alpha, and test-retest reliability was obtained by using an intraclass correlation coefficient (ICC) analysis. Construct validity was investigated by performing a correlation analysis between XI total score and salivary flow rate (SFR). Cronbach's alpha for internal consistency was 0.868, and the ICC for test-retest reliability ranged from 0.48 to 0.827, with a median value of 0.72. Moderate negative correlations between XI score and stimulated SFR, unstimulated SFR, and differential (stimulated minus unstimulated) SFR were observed (Spearman's rho, ρ = -0.515, -0.447, and -0.482, respectively; P < 0.001). The correlation analysis between the visual analogue scale (VAS) score of overall dryness and SFR indicated a smaller ρ value (-0.235 [P = 0.006], -0.243 [P = 0.002], and -0.252 [P = 0.003], respectively), which supports that XI more accurately reflects the degree of xerostomia in the pSS patients. In conclusion, the Korean version of the XI is a reliable tool to estimate the severity of xerostomia in patients with pSS.
Haustein, Thomas; Hollmeyer, Helge; Hardiman, Max; Harbarth, Stephan; Pittet, Didier
2011-04-01
To investigate the reliability of the public health event notification assessment process under the International Health Regulations (2005) (IHR). In 2009, 193 National IHR Focal Points (NFPs) were invited to use the decision instrument in Annex 2 of the IHR to determine whether 10 fictitious public health events should be notified to WHO. Each event's notifiability was assessed independently by an expert panel. The degree of consensus among NFPs and of concordance between NFPs and the expert panel was considered high when more than 70% agreed on a response. Overall, 74% of NFPs responded. The median degree of consensus among NFPs on notification decisions was 78%. It was high for the six events considered notifiable by the majority (median: 80%; range: 76-91) but low for the remaining four (median: 55%; range: 54-60). The degree of concordance between NFPs and the expert panel was high for the five events deemed notifiable by the panel (median: 82%; range: 76-91) but low (median: 51%; range: 42-60) for those not considered notifiable. The NFPs identified notifiable events with greater sensitivity than specificity (P < 0.001). When used by NFPs, the notification assessment process in Annex 2 of the IHR was sensitive in identifying public health events that were considered notifiable by an expert panel, but only moderately specific. The reliability of the assessments could be increased by expanding guidance on the use of the decision instrument and by including more specific criteria for assessing events and clearer definitions of terms.
Development and Psychometric Properties of the OCD Family Functioning (OFF) Scale
Stewart, S. Evelyn; Hu, Yu-Pei; Hezel, Dianne M.; Proujansky, Rachel; Lamstein, Abby; Walsh, Casey; Ben-Joseph, Elana Pearl; Gironda, Christina; Jenike, Michael; Geller, Daniel A.; Pauls, David L.
2013-01-01
Obsessive–compulsive disorder (OCD) influences not only patients but also family members. Although the construct of family accommodation has received attention in OCD literature, no measures of overall family functioning are currently available. The OCD Family Functioning (OFF) Scale was developed to explore the context, extent, and perspectives of functional impairment in families affected by OCD. It is a three-part, self-report measure capturing independent perspectives of patients and relatives. A total of 400 subjects were enrolled between 2008 and 2010 from specialized OCD clinics and OCD research studies. Psychometric properties of this scale were examined including internal consistency, test–retest reliability, convergent and divergent validity, and exploratory factor analyses. Both patient and relative versions of the OFF Scale demonstrated excellent internal consistency (Cronbach’s alpha coefficient = 0.96). The test–retest reliability was also adequate (ICC = 0.80). Factor analyses determined that the OFF Scale comprises a family functioning impairment factor and four OCD symptom factors that were consistent with previously reported OCD symptom dimension studies. The OFF Scale demonstrated excellent convergent validity with the Family Accommodation Scale and the Work and Social Adjustment Scale. Information gathered regarding emotional impact and family role-specific impairment was novel and not captured by other examined scales. The OFF Scale is a reliable and valid instrument for the clinical and research assessment of family functioning in pediatric and adult OCD. This will facilitate the exploration of family functioning impairment as a potential risk factor, as a moderator and as a treatment outcome measure in OCD. PMID:21553962
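Internal consistency in the abstract above is summarized with Cronbach's alpha. A minimal Python sketch of the standard formula follows, assuming complete responses and sample variances. The function name `cronbach_alpha` and the response matrix are hypothetical and are not drawn from the OFF Scale data.

```python
from statistics import variance

def cronbach_alpha(responses):
    """Cronbach's alpha for a respondents-by-items score matrix.

    responses: one list per respondent, each with the same number of
    item scores. Uses sample variances (n - 1 denominator).
    """
    k = len(responses[0])                       # number of items
    items = list(zip(*responses))               # transpose to items-by-respondents
    item_variance_sum = sum(variance(item) for item in items)
    total_variance = variance([sum(r) for r in responses])
    return (k / (k - 1)) * (1 - item_variance_sum / total_variance)

# Hypothetical 5-respondent, 4-item data on a 0-4 rating scale.
data = [
    [3, 4, 3, 4],
    [1, 1, 2, 1],
    [2, 2, 2, 3],
    [4, 4, 3, 4],
    [0, 1, 1, 0],
]
print(round(cronbach_alpha(data), 3))
```

Alpha rises with both the average inter-item correlation and the number of items, so a very high value on a long scale can partly reflect its length.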
Reliability and validity of the Cancer Therapy Satisfaction Questionnaire in lung cancer.
Cheung, K; de Mol, M; Visser, S; Den Oudsten, B L; Stricker, B H; Aerts, J G J V
2016-01-01
To test the reliability and validity of the Cancer Treatment Satisfaction Questionnaire (CTSQ), to assess its relation with quality of life (QoL), and to assess the interpretability of the domain scores in lung cancer patients receiving intravenous chemotherapy. Patients with stage IIIB and IV non-squamous non-small cell lung carcinoma treated with pemetrexed were enrolled in our study. They completed the 16-item CTSQ and two other (health-related) QoL questionnaires. Information about sociodemographic characteristics, cancer stage, and the experience of adverse events was collected. Internal consistency, construct validity, and clinical interpretability were calculated. Fifty-five patients completed the CTSQ. Correlations of the CTSQ items with its domain were all above 0.40. A high correlation between item 8 and the expectations of therapy and satisfaction with therapy domain was observed (0.50 and 0.48, respectively). The CTSQ domains demonstrated good internal consistency and low to moderate correlations of the CTSQ with the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire-C30 and World Health Organization Quality of Life-BREF. No significant differences in mean domain scores were observed in relation to the number and severity of different adverse events and chemotherapy-related adverse events. The Dutch version of the CTSQ was found to be a reliable and valid instrument to assess satisfaction and expectations of treatment in lung cancer patients receiving intravenous chemotherapy. Furthermore, the CTSQ proved to be of additional informative value as not all of its domains correlated with the various domains of the existing HRQoL instruments.
Bamm, Elena L; Rosenbaum, Peter; Wilkins, Seanne; Stratford, Paul
2015-01-01
In recent years, client-centered care has been embraced as a new philosophy of care by many organizations around the world. Clinicians and researchers have identified the need for valid and reliable outcome measures that are easy to use to evaluate success of implementation of new concepts. The current study was developed to complete adaptation and field testing of the companion patient-reported measures of processes of care for adults (MPOC-A) and the service provider self-reflection measure of processes of care for service providers working with adult clients (MPOC-SP(A)). A validation study. In-patient rehabilitation facilities. MPOC-A and measure of processes of care for service providers working with adult clients (MPOC-SP(A)). Three hundred and eighty-four health care providers, 61 patients, and 16 family members completed the questionnaires. Good to excellent internal consistency (0.71-0.88 for health care professionals, 0.82-0.90 for patients, and 0.87-0.94 for family members), as well as moderate to good correlations between domains (0.40-0.78 for health care professionals and 0.52-0.84 for clients) supported internal reliability of the tools. Exploratory factor analysis of the MPOC-SP(A) responses supported the multidimensionality of the questionnaire. MPOC-A and MPOC-SP(A) are valid and reliable tools to assess patient and service-provider accounts, respectively, of the extent to which they experience, or are able to provide, client-centered service. Research should now be undertaken to explore in more detail the relationships between client experience and provider reports of their own behavior.
[Validation of the Montgomery-Åsberg Depression Rating Scale (MADRS) in Colombia].
Cano, Juan Fernando; Gomez Restrepo, Carlos; Rondón, Martín
2016-01-01
To adapt and to validate the Montgomery-Åsberg Depression Rating Scale (MADRS) in Colombia. Observational study for scale validation. Validity criteria were used to determine the severity cut-off points of the tool. Taking into account sensitivity and specificity values, those cut-off points were contrasted with ICD-10 criteria for depression severity. A factor analysis was performed. The internal consistency was determined with the same sample of patients used for the validity criteria. Inter-rater reliability was assessed by evaluating the 22 records of the patients that consented to a video interview. Sensitivity to change was established through a second application of the scale in 28 subjects after a lapse of 14 to 28 days. The study was performed in Bogotá; the tool was applied to 150 patients suffering from major depressive disorder. The cut-off point for moderate depression was 20 (sensitivity, 98%; specificity, 96%), and the cut-off point for severe depression was 34 (sensitivity, 98%; specificity, 92%). The tool appears to be a unidimensional scale with good internal consistency (α=.9168). The findings of inter-rater reliability evaluation showed the scale as highly reliable (intraclass correlation coefficient=.9833). The instrument has a good sensitivity to change. The Colombian version of the Montgomery-Åsberg Depression Rating Scale has good psychometric properties and can be used in clinical practice and in clinical research in the field of depressive disorder. Copyright © 2015 Asociación Colombiana de Psiquiatría. Publicado por Elsevier España. All rights reserved.
Wong, P K S; Wong, D F K; Zhuang, X Y; Liu, Y
2017-03-01
The construct of self-determination has received considerable attention in the international field of intellectual disabilities (ID). Recently, there has been a rapid development of this construct in Chinese societies including Hong Kong. However, there is no locally validated instrument to measure self-determination in people with ID. This article explains the validation process of the AIR Self-Determination Scale - Chinese version (AIR SDS-C) adapted from the 24-item AIR Self-Determination Scale, developed by Wolman and his colleagues, which is used in school setting. People with mild/moderate ID aged 15 years or above were recruited from special schools and social services units in different regions of Hong Kong. Factor analysis and reliability tests were conducted. Data for a total of 356 participants were used for the analysis. A confirmatory factor analysis was performed to test the factorial construct, and Mplus 7.0 was used for the analysis. The factor structure proposed in the original English version was supported by the data, and all factor loadings were between 0.42 and 0.76. The whole scale achieved good reliability (Cronbach's α = 0.88 and ω = 0.90). The AIR SDS-C appears to be a valid and reliable scale. This study examined adult groups as well as student groups. The application of the scale can thus be extended to a wider population. The implications for theory building and practice are discussed. © 2016 MENCAP and International Association of the Scientific Study of Intellectual and Developmental Disabilities and John Wiley & Sons Ltd.
Olaya, Beatriz; Marsà, Ferran; Ochoa, Susana; Balanzá-Martínez, Vicent; Barbeito, Sara; García-Portilla, Mari Paz; González-Pinto, Ana; Lobo, Antonio; López-Antón, Raúl; Usall, Judith; Arranz, Belén; Haro, Josep Maria
2012-12-15
Research on insight in patients with mood disorders has grown in recent years. Several instruments to assess insight have been used, but most of them have been specifically designed for psychosis and may not appear relevant to mood disorders. The aim of the present study is to develop a short, multidimensional, reliable and valid scale to measure insight in patients with mood disorders, based on the Amador's Scale to Assess Unawareness of Mental Disorders (SUMD). A Delphi method was used to facilitate expert participation and ensure face and content validity. The SUMD structure and items were used as a reference in the scale development. A new scale with 17 items was obtained. Internal consistency, test-retest and inter-rater reliability and validity were studied in a sample of 76 outpatients with a DSM-IV diagnosis of major depression or bipolar disorder (type I or II). Internal consistency of the general items was moderate, and high for the symptoms awareness subscale. Scores on ISAD correlated with other measures of insight and with some clinical measures, thus supporting its validity. The majority of the sample came from community services. Future studies should use inpatients or patients with severe symptoms to broaden the range of responses. Moreover, the rating of insight and other measures by the same clinician might introduce a methodological bias. The ISAD, with a multidimensional approach, appears as a short, reliable and valid measure of insight in mood disorders. Expert consensus ensures its face and content validity. Copyright © 2012 Elsevier B.V. All rights reserved.
Back to the future: estimating pre-injury brain volume in patients with traumatic brain injury.
Ross, David E; Ochs, Alfred L; D Zannoni, Megan; Seabaugh, Jan M
2014-11-15
A recent meta-analysis by Hedman et al. allows for accurate estimation of brain volume changes throughout the life span. Additionally, Tate et al. showed that intracranial volume at a later point in life can be used to estimate reliably brain volume at an earlier point in life. These advancements were combined to create a model which allowed the estimation of brain volume just prior to injury in a group of patients with mild or moderate traumatic brain injury (TBI). This volume estimation model was used in combination with actual measurements of brain volume to test hypotheses about progressive brain volume changes in the patients. Twenty six patients with mild or moderate TBI were compared to 20 normal control subjects. NeuroQuant® was used to measure brain MRI volume. Brain volume after the injury (from MRI scans performed at t1 and t2) was compared to brain volume just before the injury (volume estimation at t0) using longitudinal designs. Groups were compared with respect to volume changes in whole brain parenchyma (WBP) and its 3 major subdivisions: cortical gray matter (GM), cerebral white matter (CWM) and subcortical nuclei+infratentorial regions (SCN+IFT). Using the normal control data, the volume estimation model was tested by comparing measured brain volume to estimated brain volume; reliability ranged from good to excellent. During the initial phase after injury (t0-t1), the TBI patients had abnormally rapid atrophy of WBP and CWM, and abnormally rapid enlargement of SCN+IFT. Rates of volume change during t0-t1 correlated with cross-sectional measures of volume change at t1, supporting the internal reliability of the volume estimation model. A logistic regression analysis using the volume change data produced a function which perfectly predicted group membership (TBI patients vs. normal control subjects). During the first few months after injury, patients with mild or moderate TBI have rapid atrophy of WBP and CWM, and rapid enlargement of SCN+IFT. 
The magnitude and pattern of the changes in volume may allow for the eventual development of diagnostic tools based on the volume estimation approach. Copyright © 2014 Elsevier Inc. All rights reserved.
Appleyard, Karen; Yang, Chongming; Runyan, Desmond K
2010-05-01
The current study investigated concurrent and longitudinal mediated and mediated moderation pathways among maltreatment, self-perception (i.e., loneliness and self-esteem), social support, and internalizing and externalizing behavior problems. For both genders, early childhood maltreatment (i.e., ages 0-6) was related directly to internalizing and externalizing behavior problems at age 6, and later maltreatment (i.e., ages 6-8) was directly related to internalizing and externalizing behavior problems at age 8. Results of concurrent mediation and mediated moderation indicated that early maltreatment was significantly related to internalizing and externalizing behavior problems at age 6 indirectly both through age 6 loneliness and self-esteem for boys and through age 6 loneliness for girls. Significant moderation of the pathway from early maltreatment to self-esteem, and for boys, significant mediated moderation to emotional and behavioral problems were found, such that the mediated effect through self-esteem varied across levels of social support, though in an unexpected direction. No significant longitudinal mediation or mediated moderation was found, however, between the age 6 mediators and moderator and internalizing or externalizing problems at age 8. The roles of the hypothesized mediating and moderating mechanisms are discussed, with implications for designing intervention and prevention programs.
Appleyard, Karen; Yang, Chongming; Runyan, Desmond K.
2014-01-01
The current study investigated concurrent and longitudinal mediated and mediated moderation pathways among maltreatment, self perception (i.e., loneliness and self esteem), social support, and internalizing and externalizing behavior problems. For both genders, early childhood maltreatment (i.e., ages 0–6) was related directly to internalizing and externalizing behavior problems at age 6, and later maltreatment (i.e., ages 6–8) was directly related to internalizing and externalizing behavior problems at age 8. Results of concurrent mediation and mediated moderation indicated that early maltreatment was significantly related to internalizing and externalizing behavior problems at age 6 indirectly both through age 6 loneliness and self esteem for boys and through age 6 loneliness for girls. Significant moderation of the pathway from early maltreatment to self esteem, and, for boys, significant mediated moderation to emotional and behavioral problems were found, such that the mediated effect through self esteem varied across levels of social support, though in an unexpected direction. No significant longitudinal mediation or mediated moderation was found, however, between the age 6 mediators and moderator and internalizing or externalizing problems at age 8. The roles of the hypothesized mediating and moderating mechanisms are discussed, with implications for designing intervention and prevention programs. PMID:20423545
Cobb, Stephen C; James, C Roger; Hjertstedt, Matthew; Kruk, James
2011-01-01
Although abnormal foot posture long has been associated with lower extremity injury risk, the evidence is equivocal. Poor intertester reliability of traditional foot measures might contribute to the inconsistency. To investigate the validity and reliability of a digital photographic measurement method (DPMM) technology, the reliability of DPMM-quantified foot measures, and the concurrent validity of the DPMM with clinical-measurement methods (CMMs) and to report descriptive data for DPMM measures with moderate to high intratester and intertester reliability. Descriptive laboratory study. Biomechanics research laboratory. A total of 159 people participated in 3 groups. Twenty-eight people (11 men, 17 women; age = 25 ± 5 years, height = 1.71 ± 0.10 m, mass = 77.6 ± 17.3 kg) were recruited for investigation of intratester and intertester reliability of the DPMM technology; 20 (10 men, 10 women; age = 24 ± 2 years, height = 1.71 ± 0.09 m, mass = 76 ± 16 kg) for investigation of DPMM and CMM reliability and concurrent validity; and 111 (42 men, 69 women; age = 22.8 ± 4.7 years, height = 168.5 ± 10.4 cm, mass = 69.8 ± 13.3 kg) for development of a descriptive data set of the DPMM foot measurements with moderate to high intratester and intertester reliabilities. The dimensions of 10 model rectangles and the 28 participants' feet were measured, and DPMM foot posture was measured in the 111 participants. Two clinicians assessed the DPMM and CMM foot measures of the 20 participants. Validity and reliability were evaluated using mean absolute and percentage errors and intraclass correlation coefficients. Descriptive data were computed from the DPMM foot posture measures. The DPMM technology intratester and intertester reliability intraclass correlation coefficients were 1.0 for each tester and variable. Mean absolute errors were equal to or less than 0.2 mm for the bottom and right-side variables and 0.1° for the calculated angle variable. 
Mean percentage errors between the DPMM and criterion reference values were equal to or less than 0.4%. Intratester and intertester reliabilities of DPMM-computed structural measures of arch and navicular indices were moderate to high (>0.78), and concurrent validity was moderate to strong. The DPMM is a valid and reliable clinical and research tool for quantifying foot structure. The DPMM and the descriptive data might be used to define groups in future studies in which the relationship between foot posture and function or injury risk is investigated.
Ford-Gilboe, Marilyn; Wathen, C Nadine; Varcoe, Colleen; MacMillan, Harriet L; Scott-Storey, Kelly; Mantler, Tara; Hegarty, Kelsey; Perrin, Nancy
2016-01-01
Objectives Approaches to measuring intimate partner violence (IPV) in populations often privilege physical violence, with poor assessment of other experiences. This has led to underestimating the scope and impact of IPV. The aim of this study was to develop a brief, reliable and valid self-report measure of IPV that adequately captures its complexity. Design Mixed-methods instrument development and psychometric testing to evolve a brief version of the Composite Abuse Scale (CAS) using secondary data analysis and expert feedback. Setting Data from 5 Canadian IPV studies; feedback from international IPV experts. Participants 31 international IPV experts including academic researchers, service providers and policy actors rated CAS items via an online survey. Pooled data from 6278 adult Canadian women were used for scale development. Primary/secondary outcome measures Scale reliability and validity; robustness of subscales assessing different IPV experiences. Results A 15-item version of the CAS has been developed (Composite Abuse Scale (Revised)—Short Form, CASR-SF), including 12 items developed from the original CAS and 3 items suggested through expert consultation and the evolving literature. Items cover 3 abuse domains: physical, sexual and psychological, with questions asked to assess lifetime, recent and current exposure, and abuse frequency. Factor loadings for the final 3-factor solution ranged from 0.81 to 0.91 for the 6 psychological abuse items, 0.63 to 0.92 for the 4 physical abuse items, and 0.85 and 0.93 for the 2 sexual abuse items. Moderate correlations were observed between the CASR-SF and measures of depression, post-traumatic stress disorder and coercive control. Internal consistency of the CASR-SF was 0.942. These reliability and validity estimates were comparable to those obtained for the original 30-item CAS. 
Conclusions The CASR-SF is a brief self-report measure of IPV experiences among women that has demonstrated initial reliability and validity and is suitable for use in population studies or other studies. Additional validation of the 15-item scale with diverse samples is required. PMID:27927659
Smith, L F
1999-01-01
BACKGROUND: Antenatal services continue to change, stimulated by the Changing Childbirth report. Women's views should be an important component of assessing the quality of such services. To date, no published quantitative multidimensional assessment instrument has been available to measure their satisfaction with care. AIM: To develop a valid, reliable, multidimensional questionnaire to assess quality of antenatal care. METHOD: A multidimensional satisfaction questionnaire was developed using psychometric methods. Following fieldwork to pilot a questionnaire, three successive versions of it were given by midwives to pregnant women in their final trimester in nine trusts in the old South Western region of England. Their replies were analysed by principal components analysis (PCA) with varimax rotation; internal reliability was assessed by Cronbach's alpha. Face, content, and construct validity were all assessed during development. RESULTS: Out of 196 women, 134 (68.4%) returned the pilot questionnaires. One hundred and seventy-two (57.3%) out of 300 women returned version 1 of the WOMB (WOMen's views of Birth) antenatal satisfaction questionnaire proper, 283 (56.6%) out of 500 returned version 2, and 328 (65.6%) out of 500 returned the final development version. This final version consisted of 11 dimensions in addition to a general satisfaction one. These were [Cronbach's alpha]: five related to antenatal clinic characteristics (travelling to clinic [0.75], waiting at clinic [0.90], clinic environment [0.69], timing of appointment [0.78], car parking [0.85]), three 'professional' characteristics (professional competence [0.80], knowing carers [0.79], information provided [0.81]), antenatal classes [0.76], social support from other pregnant women [0.83], checking for the baby's heart beat [0.63]. There were significant moderate correlations (range = 0.24 to 0.77) between individual dimensions and the general satisfaction dimension. 
Women's dimension scores were significantly related to age, parity, social class, and best educational achievement. CONCLUSION: This multidimensional satisfaction instrument has good face, content, and construct validity, and excellent internal reliability. It could be used to generally assess antenatal services or to screen them to detect areas where further in-depth qualitative enquiry is merited. Its sensitivity to change over time, external reliability, and transferability to non-Caucasian groups need to be assessed. PMID:10824341
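The bracketed figures in this abstract are Cronbach's alpha values, one per dimension. As a minimal sketch of how such a coefficient is computed (this is not the authors' analysis code; the function name and the response data are hypothetical), alpha can be obtained from the item variances and the variance of the summed score:

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    k = len(responses[0])                      # number of items in the dimension
    items = list(zip(*responses))              # transpose: one tuple per item
    item_variances = [variance(item) for item in items]
    total_variance = variance([sum(row) for row in responses])
    # alpha = k/(k-1) * (1 - sum of item variances / variance of total score)
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical 5-point ratings from four respondents on a three-item dimension.
example = [[4, 5, 4], [2, 3, 3], [5, 5, 4], [3, 3, 2]]
print(round(cronbach_alpha(example), 2))
```

The sketch assumes complete responses (no missing items) and at least two items and two respondents per dimension.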
Komro, Kelli A; Livingston, Melvin D; Kominsky, Terrence K; Livingston, Bethany J; Garrett, Brady A; Molina, Mildred Maldonado; Boyd, Misty L
2015-01-01
Objective: American Indians (AIs) suffer from significant alcohol-related health disparities, and increased risk begins early. This study examined the reliability and validity of measures to be used in a preventive intervention trial. Reliability and validity across racial/ethnic subgroups are crucial to evaluate intervention effectiveness and promote culturally appropriate evidence-based practice. Method: To assess reliability and validity, we used three baseline surveys of high school students participating in a preventive intervention trial within the jurisdictional service area of the Cherokee Nation in northeastern Oklahoma. The 15-minute alcohol risk survey included 16 multi-item scales and one composite score measuring key proximal, primary, and moderating variables. Forty-four percent of the students indicated that they were AI (of whom 82% were Cherokee), including 23% who reported being AI only (n = 435) and 18% both AI and White (n = 352). Forty-seven percent reported being White only (n = 901). Results: Scales were adequately reliable for the full sample and across race/ethnicity defined by AI, AI/White, and White subgroups. Among the full sample, all scales had acceptable internal consistency, with minor variation across race/ethnicity. All scales had extensive to exemplary test–retest reliability and showed minimal variation across race/ethnicity. The eight proximal and two primary outcome scales were each significantly associated with the frequency of alcohol use during the past month in both the cross-sectional and the longitudinal models, providing support for both criterion validity and predictive validity. For most scales, interpretation of the strength of association and statistical significance did not differ between the racial/ethnic subgroups. 
Conclusions: The results support the reliability and validity of scales of a brief questionnaire measuring risk and protective factors for alcohol use among AI adolescents, primarily members of the Cherokee Nation. PMID:25486402
Paesani, Daniel A; Guarda-Nardini, Luca; Gelos, Carlota; Salmaso, Luigi; Manfredini, Daniele
2014-03-01
The aim was to answer the clinical research question: is incisal/occlusal tooth wear assessment on dental casts performed by five professionals with expertise in different fields of dentistry reliable? Five examiners with different fields of expertise in the dental profession assessed tooth wear on dental casts of 45 subjects, based on a six-degree rating of incisal/occlusal wear. After a calibration meeting, the examiners evaluated the casts individually and various issues concerning interexaminer agreement and reliability were assessed. A total of 872 teeth were evaluated. The five examiners agreed on the rating of only 6.6% of the teeth. The teeth with the highest percentage of agreement were the premolars. Pairwise comparison of the assessments of examiners #1 (bruxism expert), #2 (orthodontist), #3 (temporomandibular disorders [TMD] and occlusion expert), and #4 (dental nurse) showed fair to moderate agreement, with κ-values ranging from 0.306 to 0.577, whilst examiner #5 (lab technician) achieved low interexaminer reliability values with all the other four examiners. The interexaminer reliability of tooth wear assessment on dental casts performed by five professionals with expertise in different fields of dentistry is highly variable. General practitioners should keep in mind that consensus decisions by the examiners and assessment by raters belonging to the same dental discipline are recommended strategies to increase the reliability of tooth wear evaluation in the clinical setting. This investigation adds to the literature suggesting that, in a clinical setting, a single examiner's assessment of tooth wear on dental casts does not have optimal reliability and that it may be a source of internal validity problems in the research setting.
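The pairwise comparison described here can be illustrated with a short sketch. It assumes unweighted Cohen's kappa (the abstract does not say whether a weighted variant was used for the six-degree ordinal scale), and the examiner labels and ratings below are hypothetical:

```python
from collections import Counter
from itertools import combinations


def cohen_kappa(ratings_a, ratings_b):
    """Unweighted Cohen's kappa for two raters scoring the same teeth."""
    n = len(ratings_a)
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    counts_a, counts_b = Counter(ratings_a), Counter(ratings_b)
    categories = set(counts_a) | set(counts_b)
    # Chance agreement from each rater's marginal category frequencies.
    expected = sum(counts_a[c] * counts_b[c] for c in categories) / n ** 2
    # Undefined (division by zero) if both raters use one identical category only.
    return (observed - expected) / (1 - expected)


def pairwise_kappas(ratings_by_examiner):
    """Kappa for every pair of examiners, keyed by the pair of names."""
    return {
        (a, b): cohen_kappa(ratings_by_examiner[a], ratings_by_examiner[b])
        for a, b in combinations(sorted(ratings_by_examiner), 2)
    }


# Hypothetical wear grades (0-5) given by three examiners to six teeth.
ratings = {
    "examiner_1": [0, 1, 2, 2, 3, 5],
    "examiner_2": [0, 1, 2, 3, 3, 4],
    "examiner_3": [1, 1, 2, 2, 4, 5],
}
for pair, kappa in pairwise_kappas(ratings).items():
    print(pair, round(kappa, 3))
```

With five examiners this yields ten pairs, which is why a single examiner who disagrees with everyone else shows up as four low values.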
Music therapy career aptitude and generalized self-efficacy in music therapy students.
Lim, Hayoung A; Befi, Cathy M
2014-01-01
While the Music Therapy Career Aptitude Test (MTCAT) provides a measure of student aptitude, measures of perceived self-efficacy may provide additional information about a student's suitability for a music therapy career. As a first step in determining whether future studies examining combined scores from the MTCAT and the Generalized Self-Efficacy (GSE) scale would be useful to help predict academic success in music therapy, we explored the internal reliability of these two measures in a sample of undergraduate students, and the relationship (concurrent validity) of the measures to one another. Eighty undergraduate music therapy students (14 male; 66 female) completed the MTCAT and GSE. To determine internal reliability we conducted tests of normality and calculated Cronbach's Coefficient Alpha for each measure. Pearson correlation coefficients were calculated to ascertain the strength of the relationship between the MTCAT and GSE. MTCAT scores were normally distributed and had high internal consistency (Cronbach's α = 0.706). GSE scores were not normally distributed, but had high internal consistency (Cronbach's α = 0.748). The correlation coefficient analysis revealed that MTCAT and GSE scores were moderately correlated (r = 0.426, p < 0.0001). MTCAT scores can be used to partially determine perceived self-efficacy in undergraduate music therapy students; however, a more complete picture of student suitability for music therapy may be determined by administering the GSE alongside the MTCAT. Future studies are needed to determine whether combined MTCAT and GSE scores can be used to predict student success in an undergraduate music therapy program.
Bermúdez-de-Alvear, Rosa M; Gálvez-Ruiz, Pablo; Martínez-Arquero, A Ginés; Rando-Márquez, Sara; Fernández-Contreras, Elena
2018-06-11
This study aimed to analyze the psychometric properties of the Spanish version of the Voice Activity and Participation Profile (SVAPP) questionnaire. A randomized, cross-sectional sampling strategy with controls was used. Two samples with a total of 169 participants were analyzed, specifically 61 men (mean age 37.02) and 108 women (mean age 37.78). Of these participants, 112 were patients and 57 were controls. The instrument was submitted to reliability (internal consistency and corrected item-total correlations) and reproducibility analyses. Validation assessment was based on the construct validity, convergent validity, discriminant validity, and concurrent validity. The global internal consistency was excellent (Cronbach's α = 0.976), corrected item-total correlations were satisfactory and ranged 0.63-0.89, and factor loadings were above 0.50. The different subscales showed good internal consistency (alpha coefficients ranged 0.830-0.956) and test-retest values were consistently associated. The exploratory factor analysis evidenced a strongly defined five-factor internal structure, with factor loadings ranging 0.51-0.86. Convergent validity demonstrated that all subscales and scores were very strongly correlated (Pearson r above 0.735) and significantly associated. The discriminant validity analysis showed that SVAPP had good specificity to distinguish dysphonic from healthy voice subjects. Concurrent validity with Voice Handicap Index Spanish version (SVHI) showed very strong correlations between total scores, and between SVHI total score and SVAPP Daily and Social Communication subscales; correlations between the subscales of the two tests were strong; only between SVAPP Work and SVHI Physical sections were correlations moderate. The findings of the present study demonstrated evidence for the SVAPP questionnaire reliability and validity, and provided insightful implications of voice disorders on Spanish patients' quality of life. 
However, further investigations are required.
A multi-source feedback tool for measuring a subset of Pediatrics Milestones.
Schwartz, Alan; Margolis, Melissa J; Multerer, Sara; Haftel, Hilary M; Schumacher, Daniel J
2016-10-01
The Pediatrics Milestones Assessment Pilot employed a new multisource feedback (MSF) instrument to assess nine Pediatrics Milestones among interns and subinterns in the inpatient context. To report validity evidence for the MSF tool for informing milestone classification decisions. We obtained MSF instruments by different raters per learner per rotation. We present evidence for validity based on the unified validity framework. One hundred and ninety two interns and 41 subinterns at 18 Pediatrics residency programs received a total of 1084 MSF forms from faculty (40%), senior residents (34%), nurses (22%), and other staff (4%). Variance in ratings was associated primarily with rater (32%) and learner (22%). The milestone factor structure fit data better than simpler structures. In domains except professionalism, ratings by nurses were significantly lower than those by faculty and ratings by other staff were significantly higher. Ratings were higher when the rater observed the learner for longer periods and had a positive global opinion of the learner. Ratings of interns and subinterns did not differ, except for ratings by senior residents. MSF-based scales correlated with summative milestone scores. We obtain moderately reliable MSF ratings of interns and subinterns in the inpatient context to inform some milestone assignments.
Ahlqvist, Margary; Berglund, Britta; Nordström, Gun; Klang, Birgitta; Johansson, Eva
2014-01-01
Nursing students should be given opportunities to participate in clinical audits during their education. However, audit tools are seldom tested for reliability among nursing students. The aim of this study was to present reliability among nursing students using the instrument PVC assess to assess management of peripheral venous catheters (PVCs) and PVC-related signs of thrombophlebitis. PVC assess was used to assess 67 inserted PVCs in 60 patients at ten wards at a university hospital. One group of nursing students (n=4) assessed PVCs at the bedside (inter-rater reliability) and photographs of these PVCs were taken. Another group of students (n=3) assessed the PVCs in the photographs after 4 weeks (test-retest reliability). To determine reliability, proportion of agreement [P(A)] and Cohen's kappa coefficient (κ) were calculated. For bedside assessment of PVCs, P(A) ranged from good to excellent (0.80-1.0) in 55% of the 26 PVC assess items that were tested. P(A) was poor (<0.70) for two items: "adherence of inner dressing to the skin" and "PVC location." In 81% of the items, κ was between moderate and almost perfect: moderate (n=5), substantial (n=3), almost perfect (n=5). For edema at insertion site and two items on PVC dressing, κ was fair (0.21-0.40). Regarding test-retest reliability, P(A) varied between good and excellent (0.81-1) in 85%-95% of the items, and the κ ranged between moderate and almost perfect (0.41-1) in 90%-95%. PVC assess demonstrated satisfactory reliability among nursing students. However, students need training in how to use the instrument before assessing PVCs.
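The two agreement statistics reported here can be sketched briefly. The abstract states only that κ of 0.21-0.40 is "fair" and 0.41-1 spans "moderate" to "almost perfect"; the remaining cut-offs below are the conventional Landis and Koch bands and are an assumption, as are the function names and data:

```python
def proportion_of_agreement(ratings_a, ratings_b):
    """P(A): share of assessments on which two raters give the same answer."""
    matches = sum(a == b for a, b in zip(ratings_a, ratings_b))
    return matches / len(ratings_a)


def kappa_label(kappa):
    """Verbal label for a kappa value (Landis and Koch bands, assumed here)."""
    if kappa < 0:
        return "poor"
    bands = [(0.20, "slight"), (0.40, "fair"), (0.60, "moderate"), (0.80, "substantial")]
    for upper_bound, label in bands:
        if kappa <= upper_bound:
            return label
    return "almost perfect"


# Hypothetical yes/no item ("dressing is clean") scored by two students on 8 PVCs.
student_1 = [1, 1, 0, 1, 1, 0, 1, 1]
student_2 = [1, 1, 0, 1, 0, 0, 1, 1]
print(proportion_of_agreement(student_1, student_2))
print(kappa_label(0.35), kappa_label(0.72))
```

P(A) and κ can diverge for items where nearly all PVCs fall in one category, because κ corrects for chance agreement and P(A) does not.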
Singh, Amika S; Vik, Froydis N; Chinapaw, Mai J M; Uijtdewilligen, Léonie; Verloigne, Maïté; Fernández-Alvira, Juan M; Stomfai, Sarolta; Manios, Yannis; Martens, Marloes; Brug, Johannes
2011-12-09
Insight in children's energy balance-related behaviours (EBRBs) and their determinants is important to inform obesity prevention research. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. To examine the test-retest reliability and construct validity of the child questionnaire used in the ENERGY-project, measuring EBRBs and their potential determinants among 10-12 year old children. We collected data among 10-12 year old children (n = 730 in the test-retest reliability study; n = 96 in the construct validity study) in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent face-to-face interview was assessed using ICC and percentage agreement. Of the 150 questionnaire items, 115 (77%) showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Test-retest reliability was moderate for 34 items (23%) and poor for one item. Construct validity appeared to be good to excellent for 70 (47%) of the 150 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 80 items, construct validity was moderate for 39 (26%) and poor for 41 items (27%). Our results demonstrate that the ENERGY-child questionnaire, assessing EBRBs of the child as well as personal, family, and school-environmental determinants related to these EBRBs, has good test-retest reliability and moderate to good construct validity for the large majority of items.
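The abstract does not state which form of the intra-class correlation coefficient was used. As an illustration only, the sketch below assumes the two-way random-effects, absolute-agreement, single-measure form, often written ICC(2,1), computed from the usual ANOVA mean squares; the function name and scores are hypothetical:

```python
def icc_2_1(scores):
    """ICC(2,1) for a table of subjects (rows) by measurement occasions (columns)."""
    n, k = len(scores), len(scores[0])
    grand_mean = sum(sum(row) for row in scores) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(row[j] for row in scores) / n for j in range(k)]

    ss_rows = k * sum((m - grand_mean) ** 2 for m in row_means)    # between subjects
    ss_cols = n * sum((m - grand_mean) ** 2 for m in col_means)    # between occasions
    ss_total = sum((x - grand_mean) ** 2 for row in scores for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


# Hypothetical answers of five children to one item, one week apart.
test_retest = [[3, 3], [1, 2], [4, 4], [2, 2], [5, 4]]
print(round(icc_2_1(test_retest), 2))
```

Under the threshold quoted in the abstract, an item would count as having good to excellent test-retest reliability when this value exceeds .60 or when percentage agreement is at least 75%.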
Critically re-evaluating a common technique
Geisbush, Thomas; Jones, Lyell; Weiss, Michael; Mozaffar, Tahseen; Gronseth, Gary; Rutkove, Seward B.
2016-01-01
Objectives: (1) To assess the diagnostic accuracy of EMG in radiculopathy. (2) To evaluate the intrarater reliability and interrater reliability of EMG in radiculopathy. (3) To assess the presence of confirmation bias in EMG. Methods: Three experienced academic electromyographers interpreted 3 compact discs with 20 EMG videos (10 normal, 10 radiculopathy) in a blinded, standardized fashion without information regarding the nature of the study. The EMGs were interpreted 3 times (discs A, B, C) 1 month apart. Clinical information was provided only with disc C. Intrarater reliability was calculated by comparing interpretations in discs A and B, interrater reliability by comparing interpretation between reviewers. Confirmation bias was estimated by the difference in correct interpretations when clinical information was provided. Results: Sensitivity was similar to previous reports (77%, confidence interval [CI] 63%–90%); specificity was 71%, CI 56%–85%. Intrarater reliability was good (κ 0.61, 95% CI 0.41–0.81); interrater reliability was lower (κ 0.53, CI 0.35–0.71). There was no substantial confirmation bias when clinical information was provided (absolute difference in correct responses 2.2%, CI −13.3% to 17.7%); the study lacked precision to exclude moderate confirmation bias. Conclusions: This study supports that (1) serial EMG studies should be performed by the same electromyographer since intrarater reliability is better than interrater reliability; (2) knowledge of clinical information does not bias EMG interpretation substantially; (3) EMG has moderate diagnostic accuracy for radiculopathy with modest specificity and electromyographers should exercise caution interpreting mild abnormalities. Classification of evidence: This study provides Class III evidence that EMG has moderate diagnostic accuracy and specificity for radiculopathy. PMID:26701380
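Sensitivity and specificity with confidence intervals, as reported in the Results, can be sketched as follows. The counts are hypothetical and the normal-approximation (Wald) interval is an assumption; the abstract does not say how its intervals were derived, and a method that accounts for repeated readings by the same rater would give different bounds:

```python
from math import sqrt


def proportion_with_ci(successes, total, z=1.96):
    """Proportion with a Wald confidence interval (z=1.96 for roughly 95%)."""
    p = successes / total
    half_width = z * sqrt(p * (1 - p) / total)
    return p, max(0.0, p - half_width), min(1.0, p + half_width)


def diagnostic_accuracy(tp, fn, tn, fp):
    """Sensitivity and specificity from a 2x2 table of reader calls vs. truth."""
    return {
        "sensitivity": proportion_with_ci(tp, tp + fn),   # true positives / all diseased
        "specificity": proportion_with_ci(tn, tn + fp),   # true negatives / all normal
    }


# Hypothetical counts: 30 radiculopathy readings and 30 normal readings.
for name, (estimate, low, high) in diagnostic_accuracy(tp=23, fn=7, tn=21, fp=9).items():
    print(f"{name}: {estimate:.0%} (CI {low:.0%} to {high:.0%})")
```

The same helper applies to the confirmation-bias comparison only in spirit: a difference between two proportions needs its own interval rather than two separate ones.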
2011-01-01
Background Insight in children's energy balance-related behaviours (EBRBs) and their determinants is important to inform obesity prevention research. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. Objective To examine the test-retest reliability and construct validity of the child questionnaire used in the ENERGY-project, measuring EBRBs and their potential determinants among 10-12 year old children. Methods We collected data among 10-12 year old children (n = 730 in the test-retest reliability study; n = 96 in the construct validity study) in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent face-to-face interview was assessed using ICC and percentage agreement. Results Of the 150 questionnaire items, 115 (77%) showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Test-retest reliability was moderate for 34 items (23%) and poor for one item. Construct validity appeared to be good to excellent for 70 (47%) of the 150 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 80 items, construct validity was moderate for 39 (26%) and poor for 41 items (27%). Conclusions Our results demonstrate that the ENERGY-child questionnaire, assessing EBRBs of the child as well as personal, family, and school-environmental determinants related to these EBRBs, has good test-retest reliability and moderate to good construct validity for the large majority of items. PMID:22152048
VFS interjudge reliability using a free and directed search.
Bryant, Karen N; Finnegan, Eileen; Berbaum, Kevin
2012-03-01
Reports in the literature suggest that clinicians demonstrate poor reliability in rating videofluoroscopic swallow (VFS) variables. Contemporary perception theories suggest that the methods used in VFS reliability studies constrain subjects to make judgments in an abnormal way. The purpose of this study was to determine whether a directed search or a free search approach to rating swallow studies results in better interjudge reliability. Ten speech pathologists served as judges. Five clinical judges were assigned to the directed search group (use checklist) and five to the free search group (unguided observations). Clinical judges interpreted 20 VFS examinations of swallowing. Interjudge reliability of ratings of dysphagia severity, affected stage of swallow, dysphagia symptoms, and attributes identified by clinical judges using a directed search was compared with that using a free search approach. Interjudge reliability for rating the presence of aspiration and penetration was significantly better using a free search ("substantial" to "almost perfect" agreement) compared to a directed search ("moderate" agreement). Reliability of dysphagia severity ratings ranged from "moderate" to "almost perfect" agreement for both methods of search. Reliability for reporting all other symptoms and attributes of dysphagia was variable and was not significantly different between the groups.
Nooijen, Carla F J; Post, Marcel W M; Spijkerman, Dorien C M; Bergen, Michael P; Stam, Henk J; van den Berg-Emons, Rita J G
2013-04-01
To assess the reliability and validity of the Dutch version of the exercise self-efficacy scale (ESES) in persons with spinal cord injury. This is the first independent study of ESES psychometric properties, and the first report on ESES test-retest reliability. A total of 53 Dutch persons with spinal cord injury. Subjects completed the Dutch ESES twice, with 2 weeks between (ESES_1 and ESES_2). Subjects also completed the General self-efficacy scale (GSE), and a questionnaire regarding demographic characteristics and lesion characteristics. Psychometric properties of the Dutch translation of the ESES were assessed and compared with those of the original English-language version. The Dutch ESES was found to have good internal consistency (Cronbach's α for ESES_1 = 0.90, ESES_2 = 0.88). Test-retest reliability was adequate (intra-class correlation coefficient = 0.81, 95% confidence interval 0.70-0.89). For validity, a moderate, statistically significant correlation was found between ESES and the GSE (Spearman's ρ ESES_1 = 0.52, ESES_2 = 0.66, p < 0.01). Furthermore, the psychometric properties of the Dutch ESES were found to be similar to those of the original English version. The results of this study support the use of the ESES as a reliable and valid measure of exercise self-efficacy.
The Environmental Reward Observation Scale (EROS): development, validity, and reliability.
Armento, Maria E A; Hopko, Derek R
2007-06-01
Researchers acknowledge a strong association between the frequency and duration of environmental reward and affective mood states, particularly in relation to the etiology, assessment, and treatment of depression. Given behavioral theories that outline environmental reward as a strong mediator of affect and the unavailability of an efficient, reliable, and valid self-report measure of environmental reward, we developed the Environmental Reward Observation Scale (EROS) and examined its psychometric properties. In Experiment 1, exploratory factor analysis supported a unidimensional 10-item measure with strong internal consistency and test-retest reliability. When administered to a replication sample, confirmatory factor analysis suggested an excellent fit to the 1-factor model and convergent/discriminant validity data supported the construct validity of the EROS. In Experiment 2, further support for the convergent validity of the EROS was obtained via moderate correlations with the Pleasant Events Schedule (PES; MacPhillamy & Lewinsohn, 1976). In Experiment 3, hierarchical regression supported the ecological validity of the EROS toward predicting daily diary reports of time spent in highly rewarding behaviors and activities. Above and beyond variance accounted for by depressive symptoms (BDI), the EROS was associated with significant incremental variance in accounting for time spent in both low and high reward behaviors. The EROS may represent a brief, reliable and valid measure of environmental reward that may improve the psychological assessment of negative mood states such as clinical depression.
Sense of competence in dementia care staff (SCIDS) scale: development, reliability, and validity.
Schepers, Astrid Kristine; Orrell, Martin; Shanahan, Niamh; Spector, Aimee
2012-07-01
Sense of competence in dementia care staff (SCIDS) may be associated with more positive attitudes to dementia among care staff and better outcomes for those being cared for. There is a need for a reliable and valid measure of sense of competence specific to dementia care staff. This study describes the development and evaluation of a measure to assess "sense of competence" in dementia care staff and reports on its psychometric properties. The systematic measure development process involved care staff and experts. For item selection and assessment of psychometric properties, a pilot study (N = 37) and a large-scale study (N = 211) with a test-retest reliability (N = 58) sub-study were undertaken. The final measure consists of 17 items across four subscales with acceptable to good internal consistency and moderate to substantial test-retest reliability. As predicted, the measure was positively associated with work experience, job satisfaction, and person-centered approaches to dementia care, giving a first indication for its validity. The SCIDS scale provides a useful and user-friendly means of measuring sense of competence in care staff. It has been developed using a robust process and has adequate psychometric properties. Further exploration of the construct and the scale's validity is warranted. It may be useful to assess the impact of training and perceived abilities and skills in dementia care.
Wilson, Annabelle; Magarey, Anthea; Mastersson, Nadia
2013-01-01
Childhood overweight and obesity are a growing concern globally, and environments, including the home and school, can contribute to this epidemic. This paper assesses the reliability of two questionnaires (parent and teacher) used in the evaluation of a community-based childhood obesity prevention intervention, the eat well be active (ewba) Community Programs. Parents and teachers were recruited from two primary schools and they completed the same questionnaire twice in 2008 and 2009. Data from both questionnaires were classified into outcomes relevant to healthy eating and activity, and target outcomes, based on the goals of the ewba Community Programs, were identified. Fourteen and 12 outcomes were developed from the parent and teacher questionnaires, respectively. Sixty parents and 28 teachers participated in the reliability study. Intraclass correlation coefficients for outcomes ranged from 0.37 to 0.92 (parent) (P < 0.05) and from 0.42 to 0.86 (teacher) (P < 0.05). Internal consistency, measured by Cronbach's alpha, ranged from 0.11 to 0.91 for scores from the teacher questionnaire and from 0.13 to 0.78 for scores from the parent questionnaire. The parent and teacher questionnaires are moderately reliable tools for simultaneously assessing child intakes, environments, attitudes, and knowledge associated with healthy eating and physical activity in the home and school and may be useful for evaluation of similar programs.
Petersen, Solveig; Hägglöf, Bruno; Stenlund, Hans; Bergström, Erik
2009-09-01
To study the psychometric performance of the Swedish version of the Pediatric Quality of Life Inventory (PedsQL) 4.0 generic core scales in a general child population in Sweden. PedsQL forms were distributed to 2403 schoolchildren and 888 parents in two different school settings. Reliability and validity were studied for self-reports and proxy reports, full forms and short forms. Confirmatory factor analysis tested the factor structure and multigroup confirmatory factor analysis tested measurement invariance between boys and girls. Test-retest reliability was demonstrated for all scales and internal consistency reliability was shown with alpha values exceeding 0.70 for all scales but one (self-report short form: social functioning). Child-parent agreement was low to moderate. The four-factor structure of the PedsQL and factorial invariance across sex subgroups were confirmed for the self-report forms and for the proxy short form, while model fit indices suggested improvement of several proxy full-form scales. The Swedish PedsQL 4.0 generic core scales are a reliable and valid tool for health-related quality of life (HRQoL) assessment in Swedish child populations. The proxy full form, however, should be used with caution. The study also supports continued use of the PedsQL as a four-factor model, capable of revealing meaningful HRQoL differences between boys and girls.
How reliable are Functional Movement Screening scores? A systematic review of rater reliability.
Moran, Robert W; Schneiders, Anthony G; Major, Katherine M; Sullivan, S John
2016-05-01
Several physical assessment protocols to identify intrinsic risk factors for injury aetiology related to movement quality have been described. The Functional Movement Screen (FMS) is a standardised, field-expedient test battery intended to assess movement quality and has been used clinically in preparticipation screening and in sports injury research. To critically appraise and summarise research investigating the reliability of scores obtained using the FMS battery. Systematic literature review. Systematic search of Google Scholar, Scopus (including ScienceDirect and PubMed), EBSCO (including Academic Search Complete, AMED, CINAHL, Health Source: Nursing/Academic Edition), MEDLINE and SPORTDiscus. Studies meeting eligibility criteria were assessed by 2 reviewers for risk of bias using the Quality Appraisal of Reliability Studies checklist. Overall quality of evidence was determined using van Tulder's levels of evidence approach. 12 studies were appraised. Overall, there was a 'moderate' level of evidence in favour of 'acceptable' (intraclass correlation coefficient ≥0.6) inter-rater and intra-rater reliability for composite scores derived from live scoring. For inter-rater reliability of composite scores derived from video recordings there was 'conflicting' evidence, and 'limited' evidence for intra-rater reliability. For inter-rater reliability based on live scoring of individual subtests there was 'moderate' evidence of 'acceptable' reliability (κ≥0.4) for 4 subtests (Deep Squat, Shoulder Mobility, Active Straight-leg Raise, Trunk Stability Push-up) and 'conflicting' evidence for the remaining 3 (Hurdle Step, In-line Lunge, Rotary Stability). This review found 'moderate' evidence that raters can achieve acceptable levels of inter-rater and intra-rater reliability of composite FMS scores when using live ratings. 
Overall, there were few high-quality studies, and the quality of several studies was impacted by poor study reporting particularly in relation to rater blinding.
Reliability of Computerized Neurocognitive Tests for Concussion Assessment: A Meta-Analysis.
Farnsworth, James L; Dargo, Lucas; Ragan, Brian G; Kang, Minsoo
2017-09-01
Although widely used, computerized neurocognitive tests (CNTs) have been criticized because of low reliability and poor sensitivity. A systematic review was published summarizing the reliability of Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) scores; however, this was limited to a single CNT. Expansion of the previous review to include additional CNTs and a meta-analysis is needed. Therefore, our purpose was to analyze reliability data for CNTs using meta-analysis and examine moderating factors that may influence reliability. A systematic literature search (key terms: reliability, computerized neurocognitive test, concussion) of electronic databases (MEDLINE, PubMed, Google Scholar, and SPORTDiscus) was conducted to identify relevant studies. Studies were included if they met all of the following criteria: used a test-retest design, involved at least 1 CNT, provided sufficient statistical data to allow for effect-size calculation, and were published in English. Two independent reviewers investigated each article to assess inclusion criteria. Eighteen studies involving 2674 participants were retained. Intraclass correlation coefficients were extracted to calculate effect sizes and determine overall reliability. The Fisher Z transformation adjusted for sampling error associated with averaging correlations. Moderator analyses were conducted to evaluate the effects of the length of the test-retest interval, intraclass correlation coefficient model selection, participant demographics, and study design on reliability. Heterogeneity was evaluated using the Cochran Q statistic. The proportion of acceptable outcomes was greatest for the Axon Sports CogState Test (75%) and lowest for the ImPACT (25%). Moderator analyses indicated that the type of intraclass correlation coefficient model used significantly influenced effect-size estimates, accounting for 17% of the variation in reliability. 
The Axon Sports CogState Test, which has a higher proportion of acceptable outcomes and shorter test duration relative to other CNTs, may be a reliable option; however, future studies are needed to compare the diagnostic accuracy of these instruments.
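The Fisher Z step mentioned in this abstract can be illustrated with a short sketch: each coefficient is transformed, the transformed values are averaged, and the mean is transformed back. The weights of n − 3 are the usual inverse-variance weights for correlations and are an assumption here, as are the function name and the example values; the abstract does not describe the authors' weighting:

```python
from math import atanh, tanh


def pooled_correlation(coefficients, sample_sizes):
    """Average correlation-type coefficients on the Fisher z scale."""
    weights = [n - 3 for n in sample_sizes]          # assumed inverse-variance weights
    z_values = [atanh(r) for r in coefficients]      # Fisher z = arctanh(r)
    mean_z = sum(w * z for w, z in zip(weights, z_values)) / sum(weights)
    return tanh(mean_z)                              # back-transform to the r scale


# Hypothetical test-retest coefficients from three studies of one test.
print(round(pooled_correlation([0.55, 0.70, 0.62], [40, 85, 120]), 3))
```

Averaging on the z scale rather than averaging the raw coefficients avoids the bias that arises because the sampling distribution of a correlation is skewed when its value is far from zero.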
2013-01-01
Background This study investigates the reliability of muscle performance tests using cost- and time-effective methods similar to those used in clinical practice. When conducting reliability studies, great effort goes into standardising test procedures to facilitate a stable outcome. Therefore, several test trials are often performed. However, when muscle performance tests are applied in the clinical setting, clinicians often only conduct a muscle performance test once as repeated testing may produce fatigue and pain, thus variation in test results. We aimed to investigate whether cervical muscle performance tests, which have shown promising psychometric properties, would remain reliable when examined under conditions similar to those of daily clinical practice. Methods The intra-rater (between-day) and inter-rater (within-day) reliability was assessed for five cervical muscle performance tests in patients with (n = 33) and without neck pain (n = 30). The five tests were joint position error, the cranio-cervical flexion test, the neck flexor muscle endurance test performed in supine and in a 45°-upright position and a new neck extensor test. Results Intra-rater reliability ranged from moderate to almost perfect agreement for joint position error (ICC ≥ 0.48-0.82), the cranio-cervical flexion test (ICC ≥ 0.69), the neck flexor muscle endurance test performed in supine (ICC ≥ 0.68) and in a 45°-upright position (ICC ≥ 0.41) with the exception of a new test (neck extensor test), which ranged from slight to moderate agreement (ICC = 0.14-0.41). Likewise, inter-rater reliability ranged from moderate to almost perfect agreement for joint position error (ICC ≥ 0.51-0.75), the cranio-cervical flexion test (ICC ≥ 0.85), the neck flexor muscle endurance test performed in supine (ICC ≥ 0.70) and in a 45°-upright position (ICC ≥ 0.56). However, only slight to fair agreement was found for the neck extensor test (ICC = 0.19-0.25). 
Conclusions Intra- and inter-rater reliability ranged from moderate to almost perfect agreement with the exception of a new test (neck extensor test), which ranged from slight to moderate agreement. The significant variability observed suggests that tests like the neck extensor test and the neck flexor muscle endurance test performed in a 45°-upright position are too unstable to be used when evaluating neck muscle performance. PMID:24299621
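The verbal labels used in the abstract above (slight, fair, moderate, almost perfect) match the widely used Landis and Koch benchmark bands for agreement coefficients. The following is a minimal Python sketch of that mapping, assuming those bands are the ones intended; the abstract does not name its benchmark, and the function name agreement_label is hypothetical.

```python
def agreement_label(coef):
    """Map an agreement coefficient (kappa or ICC) to a verbal label.

    Assumption: the Landis and Koch bands are used. The abstract does not
    state which benchmark its authors applied.
    """
    if coef < 0:
        return "poor"
    # Each pair is (upper bound of the band, label for that band).
    bands = [
        (0.20, "slight"),
        (0.40, "fair"),
        (0.60, "moderate"),
        (0.80, "substantial"),
        (1.00, "almost perfect"),
    ]
    for upper, label in bands:
        if coef <= upper:
            return label
    raise ValueError("agreement coefficients cannot exceed 1.0")


if __name__ == "__main__":
    # ICC values reported for the neck extensor test in the abstract.
    for icc in (0.14, 0.41, 0.19, 0.25):
        print(icc, agreement_label(icc))
```

Under these bands, the reported neck extensor ICCs of 0.14-0.41 span "slight" to "moderate", and 0.19-0.25 span "slight" to "fair", which is consistent with the wording of the abstract.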
Moore, T R; Longo, J; Leopold, G R; Casola, G; Gosink, B B
1989-05-01
Sixty-two cases of oligohydramnios diagnosed by ultrasound between 13-28 weeks' gestation were reviewed. Three experienced ultrasonographers used a subjective scale to rate the oligohydramnios as mild, moderate, severe, or anhydramniotic. Interobserver reliability was excellent (intraclass correlation coefficient 0.81). The overall perinatal mortality rate was 43%, and the incidence of pulmonary hypoplasia was 33%. One-third had lethal congenital anomalies. The frequency of adverse outcome correlated strongly with the most severe degrees of oligohydramnios; 88% of the fetuses with severe oligohydramnios or anhydramnios had lethal outcomes, compared with 11% in the mild/moderate group. The presence of an anuric urinary tract anomaly was associated with the most severe grades of oligohydramnios and was uniformly fatal. Pulmonary hypoplasia was diagnosed in 60% of the severe group versus 6% in the moderate group. We conclude that subjective grading of oligohydramnios by experienced observers is both reliable and predictive of outcome. The finding of severe oligohydramnios in the second trimester is highly predictive of poor fetal outcome and should stimulate a thorough search for etiology and consideration of intervention. Moderate grades of reduced amniotic fluid may be managed with relative optimism.
Grzybowska, Magdalena Emilia; Piaskowska-Cala, Justyna; Wydra, Dariusz Grzegorz
2017-12-29
The aim of the study was to translate into Polish the Pelvic Organ Prolapse/Incontinence Sexual Questionnaire, IUGA-Revised (PISQ-IR), which evaluates sexual function in sexually active (SA) and not SA (NSA) women with pelvic floor disorders (PFD), and to validate the Polish version. After translation, back-translation and cognitive interviews, the final version of PISQ-IR was established. The study group included 252 women with PFD (124 NSA and 128 SA). All women underwent clinical evaluation and completed the PISQ-IR. For test-retest reliability, the questionnaire was administered to 99 patients twice at an interval of 2 weeks. The analysis of criterion validity required the subjects to complete self-reported measures. Internal consistency and criterion validity were assessed separately for NSA and SA women for the PISQ-IR subscales. The mean age of the women was 60.9 ± 10.6 years and their mean BMI was 27.9 ± 4.9 kg/m². Postmenopausal women constituted 82.5% of the study group. Urinary incontinence (UI) was diagnosed in 60 women (23.8%), pelvic organ prolapse (POP) in 90 (35.7%), and UI and POP in 102 (40.5%). Fecal incontinence was reported by 45 women (17.9%). The PISQ-IR Polish version proved to have good internal consistency in NSA women (α 0.651 to 0.857) and SA women (α 0.605 to 0.887), and strong reliability in all subscales (Pearson's coefficient 0.759-0.899; p < 0.001). Criterion validity confirmed moderate to strong correlations between PISQ-IR scores and self-reported measures in SA subscales, as well as the SA summary score, and weak to moderate correlations in NSA women. The PISQ-IR Polish version is a valid tool for evaluating sexual function in women with PFD.
[Test for assessing levels of alcohol consumption in Bucaramanga, Colombia: design and validation].
Herrán, Oscar F; Ardila, María F; Barba, Diana M
2008-03-01
Excessive alcohol intake can pose a serious public health problem. Developing instruments that classify consumers correctly is the first stage of epidemiologic investigation. The internal validity and reliability of a test of problematic alcohol consumption (CP-alcohol) were evaluated in Bucaramanga, Colombia, in 2005-2006. This work provides a measure that is internally consistent and improves the reliability of diagnostic technology. Six hundred one subjects aged 18 to 60 years took the CP-alcohol test on two occasions. At the same time, a survey of biological (VB), socioeconomic (VSE) and dietary (D) variables was administered. The internal consistency of CP-alcohol was evaluated by calculating Cronbach's alpha coefficient, and reliability with Spearman's coefficient and Cohen's kappa. To evaluate the associations among problematic consumption, VB, VSE, D and the risk of alcoholism, prevalence ratios were calculated using binomial regression. The frequency of problematic alcohol consumption was 46.9 (CI 42.9-50.9). The frequency of problematic alcohol use in men was 1.6 times that in women (p<0.001). Cronbach's alpha was moderate for all the questions of the test (minimum 0.41, maximum 0.61). In the first application of CP-alcohol, Cronbach's alpha was 0.63, and in the second, 0.49. Spearman's correlation coefficient was 0.87 (CI 0.84-0.90) for the population, 0.86 (CI 0.82-0.90) for men and 0.86 (CI 0.82-0.90) for women. The kappas obtained were very good, 0.70 to 0.89. Sex, pleasure provided by alcoholic drinks, risk of alcoholism according to Cut Down on Drinking, Annoyed by Criticism, Guilty Feeling, and Eye Opener (CAGE), and the quantity of alcohol consumed were all correlated with problematic consumption. CP-alcohol is a useful test for investigating the epidemiology of health problems associated with alcohol use.
A new real-time visual assessment method for faulty movement patterns during a jump-landing task.
Rabin, Alon; Levi, Ran; Abramowitz, Shai; Kozol, Zvi
2016-07-01
Determine the interrater reliability of a new real-time assessment of faulty movement patterns during a jump-landing task. Interrater reliability study. Human movement laboratory. 50 healthy females. Assessment included 6 items which were evaluated from a front and a side view. Two Physical Therapy students used a 9-point scale (0-8) to independently rate the quality of movement as good (0-2), moderate (3-5), or poor (6-8). Interrater reliability was expressed by percent agreement and weighted kappa. One examiner rated the quality of movement of 6 subjects as good, 34 subjects as moderate, and 10 subjects as poor. The second examiner rated the quality of movement of 12 subjects as good, 23 subjects as moderate, and 15 subjects as poor. Percent agreement and weighted kappa (95% confidence interval) were 78% and 0.68 (0.51, 0.85), respectively. A new real-time assessment of faulty movement patterns during jump-landing demonstrated adequate interrater reliability. Further study is warranted to validate this method against a motion analysis system, as well as to establish its predictive validity for injury. Copyright © 2015 Elsevier Ltd. All rights reserved.
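The two statistics reported in the abstract above, percent agreement and weighted kappa, can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the authors' analysis: the abstract gives each examiner's totals but not the joint ratings, so the ratings below are invented, and linear disagreement weights are assumed because the weighting scheme is not stated. The function names are hypothetical.

```python
def percent_agreement(rater1, rater2):
    """Share of subjects given the identical category by both raters."""
    return sum(a == b for a, b in zip(rater1, rater2)) / len(rater1)


def weighted_kappa(rater1, rater2, categories, weights="linear"):
    """Weighted kappa for two raters on an ordered category scale.

    `categories` lists the labels from best to worst. Assumption: linear
    disagreement weights by default; pass weights="quadratic" for squared
    weights.
    """
    k = len(categories)
    index = {c: i for i, c in enumerate(categories)}
    n = len(rater1)

    # Observed joint proportions.
    obs = [[0.0] * k for _ in range(k)]
    for a, b in zip(rater1, rater2):
        obs[index[a]][index[b]] += 1 / n

    row = [sum(obs[i]) for i in range(k)]
    col = [sum(obs[i][j] for i in range(k)) for j in range(k)]

    def disagreement(i, j):
        d = abs(i - j) / (k - 1)
        return d if weights == "linear" else d * d

    observed = sum(disagreement(i, j) * obs[i][j]
                   for i in range(k) for j in range(k))
    expected = sum(disagreement(i, j) * row[i] * col[j]
                   for i in range(k) for j in range(k))
    return 1 - observed / expected


if __name__ == "__main__":
    scale = ["good", "moderate", "poor"]
    # Hypothetical ratings for four subjects (not data from the study).
    examiner1 = ["good", "good", "moderate", "poor"]
    examiner2 = ["good", "moderate", "moderate", "poor"]
    print(percent_agreement(examiner1, examiner2))
    print(weighted_kappa(examiner1, examiner2, scale))
```

Weighted kappa penalizes a good/poor disagreement more heavily than a good/moderate one, which is why it suits an ordered three-level rating such as the one described here.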
ERIC Educational Resources Information Center
Davenport, Ernest C.; Davison, Mark L.; Liou, Pey-Yan; Love, Quintin U.
2015-01-01
This article uses definitions provided by Cronbach in his seminal paper for coefficient α to show the concepts of reliability, dimensionality, and internal consistency are distinct but interrelated. The article begins with a critique of the definition of reliability and then explores mathematical properties of Cronbach's α. Internal consistency…
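For readers unfamiliar with the coefficient discussed above, the standard formula is α = k/(k − 1) × (1 − Σ item variances / variance of total scores), where k is the number of items. A minimal Python sketch follows; the function name cronbach_alpha and the example scores are hypothetical and are not taken from the article.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items score table.

    `scores` is a list of rows, one row per respondent, each row holding
    that respondent's score on every item.
    """
    k = len(scores[0])
    items = list(zip(*scores))                 # one tuple per item
    item_variances = [variance(col) for col in items]
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


if __name__ == "__main__":
    # Hypothetical data: four respondents, two items.
    example = [[1, 2], [2, 1], [3, 4], [4, 3]]
    print(cronbach_alpha(example))
```

The sketch uses the sample variance for both the items and the totals; because the formula depends only on their ratio, using the population variance throughout would give the same value.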
Reliability and validity of the instrument used in BRFSS to assess physical activity.
Yore, Michelle M; Ham, Sandra A; Ainsworth, Barbara E; Kruger, Judy; Reis, Jared P; Kohl, Harold W; Macera, Caroline A
2007-08-01
State-level statistics of adherence to the physical activity objectives in Healthy People 2010 are derived from the Behavioral Risk Factor Surveillance System (BRFSS) data. BRFSS physical activity questions were updated in 2001 to include domains of leisure time, household, and transportation-related activity of moderate- and vigorous intensity, and walking questions. This article reports the reliability and validity of these questions. The BRFSS Physical Activity Study (BPAS) was conducted from September 2000 to May 2001 in Columbia, SC. Sixty participants were followed for 22 d; they answered the physical activity questions three times via telephone, wore a pedometer and accelerometer, and completed a daily physical activity log for 1 wk. Measures for moderate, vigorous, recommended (i.e., met the criteria for moderate or vigorous), and strengthening activities were created according to Healthy People 2010 operational definitions. Reliability and validity were assessed using Cohen's kappa (kappa) and Pearson correlation coefficients. Seventy-three percent of participants met the recommended activity criteria compared with 45% in the total U.S. population. Test-retest reliability (kappa) was 0.35-0.53 for moderate activity, 0.80-0.86 for vigorous activity, 0.67-0.84 for recommended activity, and 0.85-0.92 for strengthening. Validity (kappa) of the survey (using the accelerometer as the standard) was 0.17-0.22 for recommended activity. Validity (kappa) of the survey (using the physical activity log as the standard) was 0.40-0.52 for recommended activity. The validity and reliability of the BRFSS physical activity questions suggest that this instrument can classify groups of adults into the levels of recommended and vigorous activity as defined by Healthy People 2010. Repeated administration of these questions over time will help to identify trends in physical activity.
Ocular dominance stability and reading skill: a controversial relationship.
Zeri, Fabrizio; De Luca, Maria; Spinelli, Donatella; Zoccolotti, Pierluigi
2011-11-01
Evidence is mixed concerning the relationship between stability of ocular dominance and reading deficits. Contrasting results may be due to the use of different tests of dominance, different samples of readers, and different scoring methods. The aim of this study was to investigate the relationship among ocular dominance, general visual abilities, and reading performance, and to evaluate the consistency and reliability of different tests of ocular dominance and the effects of different types of eye dominance scoring. In a group of young adults, we measured: (a) main optometric parameters; (b) reading time and accuracy; and (c) ocular dominance in two sighting and four motor tests. Dominance was determined using different scoring methods (relative, absolute, and binary scores). All dominance tests showed good levels of internal reliability. Sighting tests were consistent regardless of the scoring method, and all participants had stable dominance. Three of four motor tests were moderately consistent when dominance was measured with relative scores but not when it was measured with absolute or binary scores. No relationship was found between stability of dominance and reading performance, regardless of the type of test or scoring method. No systematic pattern of correlation was found between binocular vision variables and dominance measures. Choosing the type of motor test to measure ocular dominance is crucial, because the level of consistency among tests is low to moderate. Furthermore, motor tests were not correlated with reading performances. Present results suggest caution when trying to link reading difficulties with specific profiles of ocular dominance.
ERIC Educational Resources Information Center
Worrell, Frank C.; Mello, Zena R.
2007-01-01
In this study, the authors examined the reliability, structural validity, and concurrent validity of Zimbardo Time Perspective Inventory (ZTPI) scores in a group of 815 academically talented adolescents. Reliability estimates of the purported factors' scores were in the low to moderate range. Exploratory factor analysis supported a five-factor…
Huang, Meiju; Chen, Mei-Yen
2013-08-01
Associations among internal marketing, customer orientation, and organizational commitment were examined, particularly with regard to the moderating effects of work status on the relationships between internal marketing and customer orientation or organizational commitment, in a cross-sectional design with structural equation modeling. Two studies (Ns = 119 and 251) were conducted among full- and part-time service employees at Taipei Sports Centers. Internal marketing was associated with organizational commitment and customer orientation. Customer orientation was associated with organizational commitment and partially mediated the relation between internal marketing and organizational commitment. Furthermore, work status significantly moderated the relationships between internal marketing and customer orientation but not between internal marketing and organizational commitment. Implications and directions for future research were discussed.
Prathanee, Benjamas; Angsupakorn, Nipa; Pumnum, Tawitree; Seepuaham, Cholada; Jaiyong, Pechcharat
2012-11-01
To determine the reliability of the parental or caregiver report form and the test form of the Thai Speech and Language Test for Children Aged 0-4 Years Old. Five investigators assessed speech and language abilities from video in both contexts: the parental or caregiver report form and the test form of the Thai Speech and Language Test for Children Aged 0-4 Years Old. Twenty-five normal children and 30 children with delayed development or at risk for delayed speech and language skills were assessed at ages 3, 6, 9, 12, 15, 18, 24, 30, 36 and 48 months. Reliability of parental or caregivers' testing and reporting was at a moderate level (0.41-0.60). Inter-rater reliability among investigators was excellent (0.86-1.00). The parental or caregivers' report form of the Thai Speech and Language Test for Children Aged 0-4 Years Old was an indicator for success at a moderate level. Trained professionals could use both forms of this test as reliable tools at an excellent level.
Effect of stimulus type and temperature on EEG reactivity in cardiac arrest.
Fantaneanu, Tadeu A; Tolchin, Benjamin; Alvarez, Vincent; Friolet, Raymond; Avery, Kathleen; Scirica, Benjamin M; O'Brien, Molly; Henderson, Galen V; Lee, Jong Woo
2016-11-01
Electroencephalogram (EEG) background reactivity is a reliable outcome predictor in cardiac arrest patients post therapeutic hypothermia. However, there is no consensus on modality testing and prior studies reveal only fair to moderate agreement rates. The aim of this study was to explore different stimulus modalities and report interrater agreements. We studied a multicenter, prospectively collected cohort of cardiac arrest patients who underwent therapeutic hypothermia between September 2014 and December 2015. We identified patients with reactivity data and evaluated interrater agreements of different stimulus modalities tested in hypothermia and normothermia. Of the 60 patients studied, agreement rates were moderate to substantial during hypothermia and fair to moderate during normothermia. Bilateral nipple pressure is more sensitive (80%) when compared to other modalities in eliciting a reactive background in hypothermia. Auditory, nasal tickle, nailbed pressure and nipple pressure reactivity were associated with good outcomes in both hypothermia and normothermia. EEG reactivity varies depending on the stimulus testing modality as well as the temperature during which stimulation is performed, with nipple pressure emerging as the most sensitive during hypothermia for reactivity and outcome determination. This highlights the importance of multiple stimulus testing modalities in EEG reactivity determination to reduce false negatives and optimize prognostication. Copyright © 2016 International Federation of Clinical Neurophysiology. Published by Elsevier Ireland Ltd. All rights reserved.
Lewis, Frank D; Horn, Gordon J
2017-01-01
A need exists to better understand the impact of depression on functional outcomes following TBI. To evaluate the prevalence and severity of depression among a large group of chronic TBI adults; to determine the impact of depression on outcomes of post-hospital residential rehabilitation programs; and to assess effectiveness of post-hospital residential rehabilitation programs in treating depression. 820 adults with moderate to severe traumatic brain injury (TBI) were assigned to one of four groups based on MPAI-4 depression ratings: (1) Not Depressed, (2) Mildly Depressed, (3) Moderately Depressed, and (4) Severely Depressed. Functional status was assessed at admission and discharge with the MPAI-4 Participation Index. Differences among groups were evaluated using conventional parametric tests. Rasch analysis established reliability and validity of MPAI-4 data. Rasch analysis demonstrated satisfactory construct validity and internal consistency (Person reliability = 0.89-0.92, Item reliability = 0.99). Of the 820 subjects, 39% presented with moderate to severe depressive symptoms at admission. These subjects demonstrated significantly higher MPAI-4 Participation scores than the mild and not depressed groups. Depressed groups realized significant improvement in symptoms, but those remaining depressed at discharge had significantly greater disability than those who improved. Depressive symptoms had a deleterious impact on outcome. Remediation of symptoms during rehabilitation significantly improved outcomes.
Clinical audit project in undergraduate medical education curriculum: an assessment validation study
Steketee, Carole; Mak, Donna
2016-01-01
Objectives To evaluate the merit of the Clinical Audit Project (CAP) in an assessment program for undergraduate medical education using a systematic assessment validation framework. Methods A cross-sectional assessment validation study at one medical school in Western Australia, with retrospective qualitative analysis of the design, development, implementation and outcomes of the CAP, and quantitative analysis of assessment data from four cohorts of medical students (2011-2014). Results The CAP is fit for purpose with clear external and internal alignment to expected medical graduate outcomes. Substantive validity in students’ and examiners’ response processes is ensured through relevant methodological and cognitive processes. Multiple validity features are built-in to the design, planning and implementation process of the CAP. There is evidence of high internal consistency reliability of CAP scores (Cronbach’s alpha > 0.8) and inter-examiner consistency reliability (intra-class correlation > 0.7). Aggregation of CAP scores is psychometrically sound, with high internal consistency indicating one common underlying construct. Significant but moderate correlations between CAP scores and scores from other assessment modalities indicate validity of extrapolation and alignment between the CAP and the overall target outcomes of medical graduates. Standard setting, score equating and fair decision rules justify consequential validity of CAP scores interpretation and use. Conclusions This study provides evidence demonstrating that the CAP is a meaningful and valid component in the assessment program. This systematic framework of validation can be adopted for all levels of assessment in medical education, from individual assessment modality, to the validation of an assessment program as a whole. PMID:27716612
Tran, Thach Duc; Holton, Sara; Nguyen, Huong Thanh; Wolfe, Rory; Fisher, Jane
2017-01-01
Objectives To assess the internal consistency, latent structure and convergent validity of the Depression, Anxiety and Stress Scale-21 (DASS-21) among adolescents in Vietnam. Method An anonymous, self-completed questionnaire was conducted among 1,745 high school students in Hanoi, Vietnam between October, 2013 and January, 2014. Confirmatory factor analyses were performed to assess the latent structure of the DASS-21. Factorial invariance between girls and boys was examined. Cronbach alphas and correlation coefficients between DASS-21 factor scores and the domain scores of the Duke Health Profile Adolescent Vietnamese validated version (ADHP-V) were calculated to assess DASS-21 internal consistency and convergent validity. Results A total of 1,606/1,745 (92.6%) students returned the questionnaire. Of those, 1,387 students provided complete DASS-21 data. The scale demonstrated adequate internal consistency (Cronbach α: 0.761 to 0.906). A four-factor model showed the best fit to the data. Items loaded significantly on a common general distress factor, the depression, and the anxiety factors, but few on the stress factor (p<0.05). DASS-21 convergent validity was confirmed with moderate correlation coefficients (-0.47 to -0.66) between its factor scores and the ADHP-V mental health related domains. Conclusions The DASS-21 is reliable and suitable for use to assess symptoms of common mental health problems, especially depression and anxiety among Vietnamese adolescents. However, its ability in detecting stress among these adolescents may be limited. Further research is warranted to explore these results. PMID:28723909
Tor, Elina; Steketee, Carole; Mak, Donna
2016-09-24
To evaluate the merit of the Clinical Audit Project (CAP) in an assessment program for undergraduate medical education using a systematic assessment validation framework. A cross-sectional assessment validation study at one medical school in Western Australia, with retrospective qualitative analysis of the design, development, implementation and outcomes of the CAP, and quantitative analysis of assessment data from four cohorts of medical students (2011-2014). The CAP is fit for purpose with clear external and internal alignment to expected medical graduate outcomes. Substantive validity in students' and examiners' response processes is ensured through relevant methodological and cognitive processes. Multiple validity features are built-in to the design, planning and implementation process of the CAP. There is evidence of high internal consistency reliability of CAP scores (Cronbach's alpha > 0.8) and inter-examiner consistency reliability (intra-class correlation > 0.7). Aggregation of CAP scores is psychometrically sound, with high internal consistency indicating one common underlying construct. Significant but moderate correlations between CAP scores and scores from other assessment modalities indicate validity of extrapolation and alignment between the CAP and the overall target outcomes of medical graduates. Standard setting, score equating and fair decision rules justify consequential validity of CAP scores interpretation and use. This study provides evidence demonstrating that the CAP is a meaningful and valid component in the assessment program. This systematic framework of validation can be adopted for all levels of assessment in medical education, from individual assessment modality, to the validation of an assessment program as a whole.
Spaan, Suzanne; Pronk, Anjoeka; Koch, Holger M.; Jusko, Todd A.; Jaddoe, Vincent W.V.; Shaw, Pamela A.; Tiemeier, Henning M.; Hofman, Albert; Pierik, Frank H.; Longnecker, Matthew P.
2014-01-01
The widespread use of organophosphate (OP) pesticides has resulted in ubiquitous exposure in humans, primarily through their diet. Exposure to OP pesticides may have adverse health effects, including neurobehavioral deficits in children. The optimal design of new studies requires data on the reliability of urinary measures of exposure. In the present study, urinary concentrations of six dialkyl phosphate (DAP) metabolites, the main urinary metabolites of OP pesticides, were determined in 120 pregnant women participating in the Generation R Study in Rotterdam. Intra-class correlation coefficients (ICCs) across serial urine specimens taken at <18, 18–25, and >25 weeks of pregnancy were determined to assess reliability. Geometric mean total DAP metabolite concentrations were 229 (GSD 2.2), 240 (GSD 2.1), and 224 (GSD 2.2) nmol/g creatinine across the three periods of gestation. Metabolite concentrations from the serial urine specimens in general correlated moderately. The ICCs for the six DAP metabolites ranged from 0.14 to 0.38 (0.30 for total DAPs), indicating weak to moderate reliability. Although the DAP metabolite levels observed in this study are slightly higher and slightly more correlated than in previous studies, the low to moderate reliability indicates a high degree of within-person variability, which presents challenges for designing well-powered epidemiologic studies. PMID:25515376
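The link the abstract above draws between a low ICC and high within-person variability can be made concrete with a one-way random-effects ICC, often written ICC(1,1): (MSB − MSW) / (MSB + (k − 1) × MSW), where MSB and MSW are the between-person and within-person mean squares and k is the number of measurements per person. The Python sketch below assumes that form, since the abstract does not state which ICC model was used; the function name and the example values are hypothetical.

```python
def icc_oneway(data):
    """One-way random-effects ICC for a subjects-by-measurements table.

    `data` is a list of rows, one per subject, each holding k repeated
    measurements. Assumption: a balanced design with no missing values.
    """
    n = len(data)
    k = len(data[0])
    grand_mean = sum(sum(row) for row in data) / (n * k)
    subject_means = [sum(row) / k for row in data]

    # Between-subject and within-subject mean squares.
    ms_between = k * sum((m - grand_mean) ** 2 for m in subject_means) / (n - 1)
    ms_within = sum((x - m) ** 2
                    for row, m in zip(data, subject_means)
                    for x in row) / (n * (k - 1))

    return (ms_between - ms_within) / (ms_between + (k - 1) * ms_within)


if __name__ == "__main__":
    # Hypothetical repeated measurements: three subjects, two specimens each.
    print(icc_oneway([[1, 2], [3, 4], [5, 6]]))
```

When repeated specimens from the same person differ almost as much as specimens from different people, MSW approaches MSB and the coefficient falls toward zero, which is the pattern the reported range of 0.14 to 0.38 describes.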
Prowse, Ashleigh; Aslaksen, Berit; Kierkegaard, Marie; Furness, James; Gerdhem, Paul; Abbott, Allan
2017-01-01
AIM To investigate the reliability and concurrent validity of the Baseline® Body Level/Scoliosis meter for adolescent idiopathic scoliosis postural assessment in three anatomical planes. METHODS This is an observational reliability and concurrent validity study of adolescent referrals to the Orthopaedic department for scoliosis screening at Karolinska University Hospital, Stockholm, Sweden between March-May 2012. A total of 31 adolescents with idiopathic scoliosis (13.6 ± 0.6 years old) of mild-moderate curvatures (25° ± 12°) were consecutively recruited. Measurement of cervical, thoracic and lumbar curvatures, pelvic and shoulder tilt, and axial thoracic rotation (ATR) were performed by two trained physiotherapists in one day. The intraclass correlation coefficient (ICC) was used to determine the inter-examiner reliability (ICC2,1) and the intra-rater reliability (ICC3,3) of the Baseline® Body Level/Scoliosis meter. Spearman’s correlation analyses were used to estimate concurrent validity between the Baseline® Body Level/Scoliosis meter and Gold Standard Cobb angles from radiographs and the Orthopaedic Systems Inc. Scoliometer. RESULTS There was excellent reliability between examiners for thoracic kyphosis (ICC2,1 = 0.94), ATR (ICC2,1 = 0.92) and lumbar lordosis (ICC2,1 = 0.79). There was adequate reliability between examiners for cervical lordosis (ICC2,1 = 0.51), however poor reliability for pelvic and shoulder tilt. Both devices were reproducible in the measurement of ATR when repeated by one examiner (ICC3,3 0.98-1.00). The device had a good correlation with the Scoliometer (rho = 0.78). When compared with Cobb angle from radiographs, there was a moderate correlation for ATR (rho = 0.627). CONCLUSION The Baseline® Body Level/Scoliosis meter provides reliable transverse and sagittal cervical, thoracic and lumbar measurements and valid transverse plane measurements of mild-moderate scoliosis deformity. PMID:28144582
Cheng, Hui Lin; Molassiotis, Alex
2018-06-05
To validate and compare the Chinese version of the European Organization for Research and Treatment of Cancer Quality of Life-Chemotherapy-Induced Peripheral Neuropathy Questionnaire (EORTC QLQ-CIPN20) and the Functional Assessment of Cancer-Gynecologic Oncology Group-Neurotoxicity subscale (FACT/GOG-Ntx) for measuring chemotherapy-induced peripheral neuropathy (CIPN) in cancer patients. Patients were assessed with the EORTC QLQ-CIPN20, FACT/GOG-Ntx, National Cancer Institute-Common Terminology Criteria for Adverse Events (NCI-CTCAE) and World Health Organization criterion of CIPN (WHO-CIPN) from baseline up to 10 assessment points. Internal consistency reliability, convergent validity, discriminant validity and responsiveness of the EORTC QLQ-CIPN20 and FACT/GOG-Ntx were evaluated, respectively. Correlation and regression analysis were used to examine the relationships between these two scales. Internal reliability coefficients for both scales were above 0.80 across all assessment points. Moderate correlations of the two scales were found with WHO-CIPN (rs = 0.40-0.44; rs = -0.42 to -0.46, all P < 0.05) and NCI-CTCAE (rs = 0.46-0.57; rs = -0.44 to -0.55, all P < 0.01) at most assessment points. Older patients reported significantly more CIPN symptoms than younger counterparts did (P < 0.05). The hypothesized factor structures of both scales were not confirmed (χ²/df = 3.70-7.01; χ²/df = 2.14-10.43, all P < 0.001). Both scales demonstrated responsiveness with small-to-moderate effect size (r = 0.09-0.46, r = 0.11-0.35). The two scales were highly correlated and were predicted by all domains of each other at specific assessment points (R² = 0.62-0.87; R² = 0.76-0.85; respectively, all P < 0.001). The Chinese version of the EORTC QLQ-CIPN20 and FACT/GOG-Ntx demonstrated acceptable reliability, validity and responsiveness and were found comparable in measuring CIPN among Chinese cancer patients at specific assessment points.
© 2018 John Wiley & Sons Australia, Ltd.
Dueñas, Héctor; Lara, Carmen; Walton, Richard J; Granger, Renee E; Dossenbach, Martin; Raskin, Joel
2011-09-01
To assess the reliability and validity of the Integral Inventory for Depression (IID) scale using post hoc analyses of data from a multi-country study (ClinicalTrials.gov: NCT00561509) of patients with major depressive disorder (MDD). Patients (N = 1629) completed the IID (comprising two separate dimensions for emotional and physically painful symptoms; maximum score of 65) and a reference scale (16-item Quick Inventory of Depressive Symptomatology Self-Report) at baseline and at follow-up (8 and 24 weeks). Physicians rated MDD symptoms using the Clinical Global Impressions of Severity scale at each visit. Inter-item correlation, internal consistency, external validity, factor structure, and exploratory analysis of an optimal severity cut-off point were assessed. The IID displayed two distinct dimensions (i.e. painful and emotional) with little item redundancy and good internal consistency (Cronbach's α > 0.83 at each visit). The IID displayed good external validity (Pearson's correlation coefficients >0.60 at each visit) and statistically significant agreement (McNemar's test; P < 0.001 at follow-up) with the reference scale. Results suggest that a cut-off score of ≤24 had adequate precision (>80%) to identify patients with and without moderate MDD. Results suggest that the IID may be a reliable and valid tool for assessing emotional and painful symptoms of MDD.
Grauvogl, Andrea; Peters, Madelon L; Evers, Silvia M A A; van Lankveld, Jacques J D M
2015-01-01
The Sexual Competence and Interaction Competence in Youth is a self-report questionnaire that aims to measure sexual competence and interaction competence in adolescents. The study sample consisted of 276 female undergraduate students (M = 20.95 years, SD = 2.00 years). The factor structure of the questionnaire was calculated on full sample data. A subsample was used to calculate the validity and internal consistency (N = 236; M = 20.88 years, SD = 1.96). The test-retest reliability was also calculated in a subsample (N = 82; M = 21.45 years, SD = 1.74 years). On the basis of an exploratory factor analysis, 8 factors were extracted: (a) communication about sex, (b) refusing sex, (c) positive sexual attitudes, (d) male role in sexual interaction, (e) contraceptive use, (f) not suppressing problems and desires regarding sex, (g) sexual assertiveness, and (h) sexual hedonism. The subscales possess adequate internal consistency and moderate to excellent test-retest reliability. A higher order principal component analysis revealed a 2-factor structure that appears to adequately represent the sexual competence and interaction competence constructs. Furthermore, convergent and discriminant validity were considered to be good. The results indicate that the Sexual Competence and Interaction Competence in Youth may be a useful instrument to measure sexual and interaction competence among adolescents.
Medina, Maria Del Mar; Carrillo, Alvaro; Polo, Ruben; Fernandez, Borja; Alonso, Daniel; Vaca, Miguel; Cordero, Adela; Perez, Cecilia; Muriel, Alfonso; Cobeta, Ignacio
2017-04-01
Objective To perform translation, cross-cultural adaptation, and validation of the Penn Acoustic Neuroma Quality-of-Life Scale (PANQOL) to the Spanish language. Study Design Prospective study. Setting Tertiary neurotologic referral center. Subjects and Methods PANQOL was translated and translated back, and a pretest trial was performed. The study included 27 individuals diagnosed with vestibular schwannoma. Inclusion criteria were adults with untreated vestibular schwannoma, diagnosed in the past 12 months. Feasibility, internal consistency, test-retest reliability, construct validity, and ceiling and floor effects were assessed for the present study. Results The mean overall score of the PANQOL was 69.21 (0-100 scale, lowest to highest quality of life). Cronbach's α was 0.87. Intraclass correlation coefficient was performed for each item, with an overall score of 0.92. The κ coefficient scores were between moderate and almost perfect in more than 92% of patients. Anxiety and energy domains of the PANQOL were correlated with both physical and mental components of the SF-12. Hearing, balance, and pain domains were correlated with the SF-12 physical component. Facial and general domains were not significantly correlated with any component of the SF-12. Furthermore, the overall score of the PANQOL was correlated with the physical component of the SF-12. Conclusion Feasibility, internal consistency, reliability, and construct validity outcomes in the current study support the validity of the Spanish version of the PANQOL.
Cobb, Stephen C.; James, C. Roger; Hjertstedt, Matthew; Kruk, James
2011-01-01
Context: Although abnormal foot posture long has been associated with lower extremity injury risk, the evidence is equivocal. Poor intertester reliability of traditional foot measures might contribute to the inconsistency. Objectives: To investigate the validity and reliability of a digital photographic measurement method (DPMM) technology, the reliability of DPMM-quantified foot measures, and the concurrent validity of the DPMM with clinical-measurement methods (CMMs) and to report descriptive data for DPMM measures with moderate to high intratester and intertester reliability. Design: Descriptive laboratory study. Setting: Biomechanics research laboratory. Patients or Other Participants: A total of 159 people participated in 3 groups. Twenty-eight people (11 men, 17 women; age = 25 ± 5 years, height = 1.71 ± 0.10 m, mass = 77.6 ± 17.3 kg) were recruited for investigation of intratester and intertester reliability of the DPMM technology; 20 (10 men, 10 women; age = 24 ± 2 years, height = 1.71 ± 0.09 m, mass = 76 ± 16 kg) for investigation of DPMM and CMM reliability and concurrent validity; and 111 (42 men, 69 women; age = 22.8 ± 4.7 years, height = 168.5 ± 10.4 cm, mass = 69.8 ± 13.3 kg) for development of a descriptive data set of the DPMM foot measurements with moderate to high intratester and intertester reliabilities. Intervention(s): The dimensions of 10 model rectangles and the 28 participants' feet were measured, and DPMM foot posture was measured in the 111 participants. Two clinicians assessed the DPMM and CMM foot measures of the 20 participants. Main Outcome Measure(s): Validity and reliability were evaluated using mean absolute and percentage errors and intraclass correlation coefficients. Descriptive data were computed from the DPMM foot posture measures. Results: The DPMM technology intratester and intertester reliability intraclass correlation coefficients were 1.0 for each tester and variable. Mean absolute errors were equal to or less than 0.2 mm for the bottom and right-side variables and 0.1° for the calculated angle variable. Mean percentage errors between the DPMM and criterion reference values were equal to or less than 0.4%. Intratester and intertester reliabilities of DPMM-computed structural measures of arch and navicular indices were moderate to high (>0.78), and concurrent validity was moderate to strong. Conclusions: The DPMM is a valid and reliable clinical and research tool for quantifying foot structure. The DPMM and the descriptive data might be used to define groups in future studies in which the relationship between foot posture and function or injury risk is investigated. PMID:21214347
Oliveira, Ana; Lage, Susan; Rodrigues, João; Marques, Alda
2017-11-17
Computerized respiratory sounds (CRS) are closely related to the movement of air within the tracheobronchial tree and are promising outcome measures in patients with chronic obstructive pulmonary disease (COPD). However, CRS measurement properties have been poorly tested. The aim of this study was to assess the reliability, validity and the minimal detectable changes (MDC) of CRS in patients with stable COPD. Fifty patients (36♂, 67.26 ± 9.31 y, FEV1 49.52 ± 19.67% predicted) were enrolled. CRS were recorded simultaneously at seven anatomic locations (trachea; right and left anterior, lateral and posterior chest). The number of crackles, wheeze occupation rate, median frequency (F50) and maximum intensity (Imax) were processed using validated algorithms. Within-day and between-days reliability, criterion and construct validity, validity to predict exacerbations and MDC were established. CRS presented moderate-to-excellent within-day reliability (ICC[1,3] ≥ 0.51; P < .05) and moderate-to-good between-days reliability (ICC[1,2] ≥ 0.47; P < .05) for most locations. Negligible-to-moderate correlations with FEV1 % predicted were found (-0.53 < rs < -0.28; P < .05), and the inspiratory number of crackles was the best discriminator between mild-to-moderate and severe-to-very severe airflow limitations (area under the curve >0.78). CRS correlated poorly with patient-reported outcomes (rs < 0.48; P < .05) and did not predict exacerbations. Inspiratory number of crackles at posterior right chest, inspiratory F50 at trachea and anterior left chest and expiratory Imax at anterior right chest were simultaneously reliable and valid, and their MDC were 2.41, 55.27, 29.55 and 3.98, respectively. CRS are reliable and valid. Their use, integrated with other clinical and patient-reported measures, may fill the gap of assessing small airways and contribute toward a patient's comprehensive evaluation. © 2017 John Wiley & Sons Ltd.
McKinney, Christy M; Harris, T Robert; Caetano, Raul
2009-01-01
Little is known about the reliability of self-reported child physical abuse (CPA) or CPA reporting practices. We estimated reliability and prevalence of self-reported CPA and identified factors predictive of inconsistent CPA reporting among 2,256 participants using surveys administered in 1995 and 2000. Reliability of CPA was fair to moderate (kappa = 0.41). Using a positive report from either survey, the prevalence of moderate (61.8%) and severe (12.0%) CPA was higher than at either survey alone. Compared to consistent reporters of having experienced CPA, inconsistent reporters were less likely to be ≥ 30 years old (vs. 18-29) or Black (vs. White) and more likely to have < 12 years of education (vs. 12), have no alcohol-related problems (vs. having problems), or report one type (vs. ≥ 2) of CPA. These findings may assist researchers conducting and interpreting studies of CPA.
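As an illustration of the agreement statistic reported above, the following minimal sketch computes a chance-corrected kappa for two waves of yes/no reports. It assumes the statistic is Cohen's kappa (the abstract only says "kappa"), and the two lists of responses are hypothetical values invented for the example, not study data.

```python
def cohen_kappa(first, second):
    """Cohen's kappa for two equal-length lists of categorical reports."""
    n = len(first)
    labels = set(first) | set(second)
    # Observed agreement: share of participants giving the same answer twice.
    p_observed = sum(a == b for a, b in zip(first, second)) / n
    # Chance agreement: product of the marginal proportions, summed over labels.
    p_expected = sum(
        (first.count(label) / n) * (second.count(label) / n) for label in labels
    )
    return (p_observed - p_expected) / (1 - p_expected)


# Hypothetical reports (1 = CPA reported, 0 = not reported); not study data.
survey_1995 = [1, 1, 1, 0, 0, 0, 0, 1, 0, 0]
survey_2000 = [1, 1, 0, 0, 0, 0, 1, 1, 0, 0]
kappa = cohen_kappa(survey_1995, survey_2000)
print(f"kappa = {kappa:.2f}")
```

Kappa discounts the agreement expected from the marginal response rates alone, which is why it can be only moderate even when most participants answer consistently.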
Internalizing Problems among Cyberbullying Victims and Moderator Effects of Friendship Quality
ERIC Educational Resources Information Center
Aoyama, Ikuko; Saxon, Terrill F.; Fearon, Danielle D.
2011-01-01
Purpose: The purpose of this paper is to examine the relationship between cyberbullying victimization and internalizing problems among youth. Moderator effects of friendship quality were also investigated to examine if higher friendship quality moderated the negative effects of cyberbullying on psychological states of students.…
Okamoto, Nozomi; Hisashige, Akinori; Tanaka, Yuu; Kurumatani, Norio
2013-01-01
The 15D is a self-administered questionnaire for assessment of health-related quality of life, which contains 15 questions with 5 response options each. This study was conducted to evaluate the reliability and validity of the Japanese 15D. The subjects were 430 community-dwelling elderly people. Each item of the 15D was scored on a 5-point Likert scale, with level 1 (score 1) being the best. Reliability was assessed by determination of the internal consistency and test-retest reliability. Criterion-based validity was assessed using the Japanese version of the Nottingham Health Profile (NHP) and Tokyo Metropolitan Institute of Gerontology Index of Competence (TMIG index). Acceptability was assessed by inquiring about the time required to complete the questionnaire and the burden felt in responding to it. The answers of 423 individuals who responded to all items were analyzed. The median time required to complete the questionnaire was 5.0 minutes, and the proportion of subjects who indicated that the questionnaire was easy to complete was 98.3%. The Cronbach's alpha coefficients for all 15 items in the 2 surveys were 0.793 and 0.792, respectively. The intraclass correlation coefficients for the 15 items ranged from 0.44 to 0.72. In the relationship between the 15D and the NHP, the correlation coefficients between the corresponding domains were higher than those between non-corresponding domains. The prevalence of disability in higher-level functional capacity was higher in the "level 2 to 5" group than in the "level 1" group. The Japanese version of the 15D showed sufficient internal consistency and moderate repeatability. Because of the short time required to complete the Japanese 15D and the significant relationships between the scores on the 15D and the NHP, and between the 15D and higher-level functional capacity, the acceptability and validity of the Japanese 15D were considered to be sufficient.
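Internal consistency of the kind reported above is usually summarised with Cronbach's alpha, alpha = k/(k-1) × (1 − sum of item variances / variance of total scores). The sketch below shows that calculation on a small hypothetical response matrix; the three-item data are invented for illustration and are unrelated to the 15D results.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha; rows holds one list of k item scores per respondent."""
    k = len(rows[0])
    items = list(zip(*rows))  # transpose to one column per item
    item_variance_sum = sum(variance(column) for column in items)
    total_variance = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - item_variance_sum / total_variance)


# Hypothetical answers from 5 respondents to 3 items (levels 1-5); not study data.
responses = [
    [1, 2, 1],
    [2, 2, 3],
    [3, 3, 3],
    [4, 5, 4],
    [5, 4, 5],
]
alpha = cronbach_alpha(responses)
print(f"alpha = {alpha:.3f}")
```

Alpha rises when items vary together, so the total-score variance is large relative to the summed item variances.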
Validity, reliability and Norwegian adaptation of the Stroke-Specific Quality of Life (SS-QOL) scale
Pedersen, Synne Garder; Heiberg, Guri Anita; Nielsen, Jørgen Feldbæk; Friborg, Oddgeir; Stabel, Henriette Holm; Anke, Audny; Arntzen, Cathrine
2018-01-01
Background: There is a paucity of stroke-specific instruments to assess health-related quality of life in the Norwegian language. The objective was to examine the validity and reliability of a Norwegian version of the 12-domain Stroke-Specific Quality of Life scale. Methods: A total of 125 stroke survivors were prospectively recruited. Questionnaires were administered at 3 months; 36 test–retests were performed at 12 months post stroke. The translation was conducted according to guidelines. The internal consistency was assessed with Cronbach’s alpha; convergent validity, with item-to-subscale correlations; and test–retest, with Spearman’s correlations. Scaling validity was explored by calculating both floor and ceiling effects. A priori hypotheses regarding the associations between the Stroke-Specific Quality of Life domain scores and scores of established measures were tested. Standard error of measurement was assessed. Results: The Norwegian version revealed no major changes in back translations. The internal consistency values of the domains were Cronbach’s alpha = 0.79–0.93. Rates of missing items were small, and the item-to-subscale correlation coefficients supported convergent validity (0.48–0.87). The observed floor effects were generally small, whereas the ceiling effects had moderate or high values (16%–63%). Test–retest reliability indicated stability in most domains, with Spearman’s rho = 0.67–0.94 (all p < 0.001), whereas the rho was 0.35 (p < 0.05) for the ‘Vision’ domain. Hypothesis testing supported the construct validity of the scale. Standard error of measurement values for each domain were generated to indicate the required magnitudes of detectable change. Conclusions: The Norwegian version of the Stroke-Specific Quality of Life scale is a reliable and valid instrument with good psychometric properties. It is suited for use in health research as well as in individual assessments of persons with stroke. PMID:29344360
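The abstract does not state how the standard error of measurement (SEM) was converted into a detectable change. A commonly used convention, assumed here, is SEM = SD × sqrt(1 − r) and MDC95 = 1.96 × sqrt(2) × SEM, where r is a reliability coefficient. The sketch below applies that convention to hypothetical inputs (SD = 10, r = 0.84) that are not taken from the study.

```python
import math


def standard_error_of_measurement(sd, reliability):
    """SEM from a score standard deviation and a reliability coefficient."""
    return sd * math.sqrt(1 - reliability)


def minimal_detectable_change_95(sd, reliability):
    """MDC at the 95% level: 1.96 * sqrt(2) * SEM (assumed convention)."""
    return 1.96 * math.sqrt(2) * standard_error_of_measurement(sd, reliability)


# Hypothetical domain statistics; not values reported for the SS-QOL.
domain_sd = 10.0
domain_reliability = 0.84
sem_value = standard_error_of_measurement(domain_sd, domain_reliability)
mdc_value = minimal_detectable_change_95(domain_sd, domain_reliability)
print(f"SEM = {sem_value:.2f}, MDC95 = {mdc_value:.2f}")
```

Under this convention, a change in an individual's domain score smaller than the MDC cannot be distinguished from measurement error.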
Caçola, Priscila M; Gabbard, Carl; Montebelo, Maria I L; Santos, Denise C C
2015-06-01
Affordances in the home environment may play a significant role in infant motor development. The purpose of this study was to further develop and validate the Affordances in the Home Environment for Motor Development-Infant Scale (AHEMD-IS), an inventory that measures the quantity and quality of motor affordances in the home. A cross-sectional study was conducted to evaluate criteria for content validity, reliability, internal consistency, floor and ceiling effects, and interpretability of the instrument. A pilot version of the inventory with 5 dimensions was used for expert panel analysis and administered to parents of infants (N=419). Data were analyzed with Cronbach alpha, intraclass correlation coefficients (ICCs), ceiling and floor effects, and item and dimension interpretability analyses for creation of a scoring system with descriptive categories for each dimension and total score. Average agreement among the expert panel was 95% across all evaluation criteria. Cronbach alpha values with the 41-item scale ranged between .639 and .824 for the separate dimensions, with a total value of .824 (95% confidence interval [95% CI]=.781, .862). The ICC values were .990 for interrater reliability and .949 for intrarater reliability. There was a ceiling effect on 3 questions for the Inside Space dimension and on 3 questions for the Variety of Stimulation dimension. These results demonstrated the need for reduction in total items (from 41 to 35) and the combination of space dimensions. After removal of questions, internal consistency was .766 (95% CI=.729, .800) for total score. Overall assessment categories were created as: less than adequate, moderately adequate, adequate, and excellent. The inventory does not determine specific use (time, frequency) of affordances in the home, and it does not account for infants' out-of-home activities. The AHEMD-IS is a reliable and valid instrument to assess affordances in the home environment that promote infant motor development. 
© 2015 American Physical Therapy Association.
Zanetti, Ana C G; Wiedemann, Georg; Dantas, Rosana A S; Hayashida, Miyeko; de Azevedo-Marques, João M; Galera, Sueli A F
2013-06-01
To evaluate the internal reliability and validity of the Brazilian Portuguese version of the Family Questionnaire among families of schizophrenia outpatients. The main studies about the family environment of schizophrenia patients are related to the concept of Expressed Emotion. There is currently no instrument to evaluate this concept in Brazil that is easily applicable and comparable with studies from other countries. Methodological and cross-sectional research design. A convenience sample of 130 relatives of schizophrenia outpatients was selected. The translation and cultural adaptation of the instrument involved experts in mental health and experts in the German language and included back translation, semantic evaluation of items and pretesting of the instrument with 30 relatives of schizophrenia outpatients. The psychometric properties of the instrument were studied with another 100 relatives, which fulfilled the requirements for the Brazilian Portuguese version of the instrument. The psychometric properties of the instrument were assessed by construct validity (using an analysis of its key components, comparisons between distinct groups, and convergent validity with Antonovsky's Sense of Coherence Scale) and reliability (checking the internal consistency of its items and its test-retest reproducibility). The analysis of main components confirmed dimensionality patterns that were comparable between the original and adapted versions. In two domains of the instrument, critical comments and emotional over-involvement had moderate and significant correlations, respectively, with Antonovsky's Sense of Coherence Scale, appropriate values of Cronbach's alpha and strong and significant correlations, respectively, in test-retest reproducibility. We observed significant differences between distinct groups of parents in the category of emotional over-involvement. 
We conclude that the Portuguese-adapted version of the Family Questionnaire is valid and reliable for the study sample. This study provided evidence that the Family Questionnaire is a reliable and valid instrument for assessing expressed emotion. It is easy and practical to use and is acceptable for use in a Brazilian cultural population. © 2012 Blackwell Publishing Ltd.
Psychometric properties of the medical outcomes study sleep scale in Spanish postmenopausal women.
Zagalaz-Anula, Noelia; Hita-Contreras, Fidel; Martínez-Amat, Antonio; Cruz-Díaz, David; Lomas-Vega, Rafael
2017-07-01
This study aimed to analyze the reliability and validity of the Spanish version of the Medical Outcomes Study Sleep Scale (MOS-SS), and its ability to discriminate between poor and good sleepers among a Spanish population with vestibular disorders. In all, 121 women (50-76 years old) completed the Spanish version of the MOS-SS. Internal consistency, test-retest reliability, and construct validity (exploratory factor analysis) were analyzed. Concurrent validity was evaluated using the Pittsburgh Sleep Quality Index and the 36-item Short Form Health Survey. To analyze the ability of the MOS-SS scores to discriminate between poor and good sleepers, a receiver-operating characteristic curve analysis was performed. The Spanish version of the MOS-SS showed excellent and substantial reliability in Sleep Problems Index I (two sleep disturbance items, one somnolence item, two sleep adequacy items, and awaken short of breath or with headache) and Sleep Problems Index II (four sleep disturbance items, two somnolence items, two sleep adequacy items, and awaken short of breath or with headache), respectively, and good internal consistency with optimal Cronbach's alpha values in all domains and indexes (0.70-0.90). Factor analysis suggested a coherent four-factor structure (explained variance 70%). In concurrent validity analysis, MOS-SS indexes showed significant and strong correlation with the Pittsburgh Sleep Quality Index total score, and moderate with the 36-item Short Form Health Survey component summaries. Several domains and the two indexes were significantly able to discriminate between poor and good sleepers (P < 0.05). Optimal cut-off points were above 20 for "sleep disturbance" domain, with above 22.22 and above 33.33 for Sleep Problems Index I and II. The Spanish version of the MOS-SS is a valid and reliable instrument, suitable to assess sleep quality in Spanish postmenopausal women, with satisfactory general psychometric properties. 
It discriminates well between good and poor sleepers.
Almeida, Gustavo J; Irrgang, James J; Fitzgerald, G Kelley; Jakicic, John M; Piva, Sara R
2016-06-01
Few instruments that measure physical activity (PA) can accurately quantify PA performed at light and moderate intensities, which is particularly relevant in older adults. The evidence of their reliability in free-living conditions is limited. The study objectives were: (1) to determine the test-retest reliability of the Actigraph (ACT), SenseWear Armband (SWA), and Community Healthy Activities Model Program for Seniors (CHAMPS) questionnaire in assessing free-living PA at light and moderate intensities in people after total knee arthroplasty; (2) to compare the reliability of the 3 instruments relative to each other; and (3) to determine the reliability of commonly used monitoring time frames (24 hours, waking hours, and 10 hours from awakening). A one-group, repeated-measures design was used. Participants wore the activity monitors for 2 weeks, and the CHAMPS questionnaire was completed at the end of each week. Test-retest reliability was determined by using the intraclass correlation coefficient (ICC [2,k]) to compare PA measures from one week with those from the other week. Data from 28 participants who reported similar PA during the 2 weeks were included in the analysis. The mean age of these participants was 69 years (SD=8), and 75% of them were women. Reliability ranged from moderate to excellent for the ACT (ICC=.75-.86) and was excellent for the SWA (ICC=.93-.95) and the CHAMPS questionnaire (ICC=.86-.92). The 95% confidence intervals (95% CI) of the ICCs from the SWA were the only ones within the excellent reliability range (.85-.98). The CHAMPS questionnaire showed systematic bias, with less PA being reported in week 2. The reliability of PA measures in the waking-hour time frame was comparable to that in the 24-hour time frame and reflected most PA performed during this period. Reliability may be lower for time intervals longer than 1 week. All PA measures showed good reliability. 
The reliability of the ACT was lower than those of the SWA and the CHAMPS questionnaire. The SWA provided more precise reliability estimates. Wearing PA monitors during waking hours provided sufficiently reliable measures and can reduce the burden on people wearing them. © 2016 American Physical Therapy Association.
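The test-retest statistic named above, ICC(2,k), can be obtained from a two-way ANOVA decomposition as (MSR − MSE) / (MSR + (MSC − MSE)/n), where MSR, MSC and MSE are the mean squares for subjects, occasions and error. The following is a minimal sketch of that formula; the weekly activity minutes are hypothetical and do not come from the study.

```python
def icc_2k(scores):
    """ICC(2,k): two-way random effects, absolute agreement, average measures.

    scores holds one row per subject and one column per occasion.
    """
    n, k = len(scores), len(scores[0])
    grand = sum(map(sum, scores)) / (n * k)
    row_means = [sum(row) / k for row in scores]
    col_means = [sum(col) / n for col in zip(*scores)]
    # Sums of squares for subjects (rows), occasions (columns) and total.
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in scores for x in row)
    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = (ss_total - ss_rows - ss_cols) / ((n - 1) * (k - 1))
    return (ms_rows - ms_error) / (ms_rows + (ms_cols - ms_error) / n)


# Hypothetical minutes of moderate PA per day in week 1 and week 2; not study data.
weekly_minutes = [[30, 35], [45, 40], [60, 65], [20, 25], [50, 45]]
icc = icc_2k(weekly_minutes)
print(f"ICC(2,k) = {icc:.2f}")
```

Because the occasion mean square enters the denominator, a systematic shift between weeks (such as the lower week-2 reporting noted for the CHAMPS questionnaire) lowers this form of the ICC.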
Chen, Yu; Hicks, Allan; While, Alison E
2014-12-01
This study aimed to test the validity and reliability of a modified Chinese version of the OPQOL among older people living alone in China. China has an ageing population with an increasing number of older people living alone who may have a poorer quality of life (QoL) in the light of the traditional culture of collectivism and filial piety. An appropriate instrument is important to assess their QoL. The Older People's Quality of Life Questionnaire (OPQOL) was developed directly from the views of older people and has been validated in England. There has been no psychometric evaluation of the scale in China. The OPQOL was translated and modified prior to being administered to a stratified random cluster sample of 521 older people living alone. Validity was assessed through convergent validity, discriminant validity and construct validity. Reliability was assessed through internal consistency and test-retest reliability. Exploratory factor analysis indicated eight factors accounting for 63.77% of the variance. The convergent validity was supported by moderate correlations with functional ability, social support and loneliness with Spearman's rho of -0.50, 0.49 and -0.53, respectively. The discriminant validity was confirmed by differentiating QoL scores between the depressed and non-depressed groups. The Cronbach's α coefficient was 0.90 for the total scale and over 0.70 for most of its dimensions. The 2-week test-retest reliability ranged from 0.53 to 0.87. The modified Chinese version of the Older People's Quality of Life has acceptable validity and reliability as a useful instrument to measure the QoL of older people living alone in China. © 2013 John Wiley & Sons Ltd.
Tan, Christine L; Hassali, Mohamed A; Saleem, Fahad; Shafie, Asrul A; Aljadhey, Hisham; Gan, Vincent B
2015-01-01
(i) To develop the Pharmacy Value-Added Services Questionnaire (PVASQ) using emerging themes generated from interviews. (ii) To establish reliability and validity of the questionnaire instrument. Using an extended Theory of Planned Behavior as the theoretical model, face-to-face interviews generated salient beliefs of pharmacy value-added services. The PVASQ was constructed initially in English incorporating important themes and later translated into the Malay language with forward and backward translation. Intention (INT) to adopt pharmacy value-added services is predicted by attitudes (ATT), subjective norms (SN), perceived behavioral control (PBC), knowledge and expectations. Using a 7-point Likert-type scale and a dichotomous scale, test-retest reliability (N=25) was assessed by administering the questionnaire instrument twice, one week apart. Internal consistency was measured by Cronbach's alpha and construct validity between two administrations was assessed using the kappa statistic and the intraclass correlation coefficient (ICC). Confirmatory factor analysis (CFA; N=410) was conducted to assess construct validity of the PVASQ. The kappa coefficients indicate a moderate to almost perfect strength of agreement between test and retest. The ICC for all scales tested for intra-rater (test-retest) reliability was good. The overall Cronbach's alpha (N=25) was 0.912 and 0.908 for the two time points. The result of CFA (N=410) showed most items loaded strongly and correctly into corresponding factors. Only one item was eliminated. This study is the first to develop and establish the reliability and validity of the Pharmacy Value-Added Services Questionnaire instrument using the Theory of Planned Behavior as the theoretical model. The translated Malay language version of PVASQ is reliable and valid to predict Malaysian patients' intention to adopt pharmacy value-added services to collect partial medicine supply.
Alkhamees, Hadeel A; Selai, Caroline E; Shorvon, Simon D; Kanner, Andres M
2014-03-01
The aims of the current study were to translate and to validate the NDDI-E to the Arabic language to be used as a screening instrument to identify moderately severe symptoms of depression in people with epilepsy. The English version of the NDDI-E was translated to Arabic and back translated to English by two independent translators. A total of 51 patients, aged 18-56 years old, with a diagnosis of epilepsy, completed the Arabic versions of the Beck Depression Inventory (BDI-II) and the NDDI-E. Patients with BDI scores >20 were considered to be suffering from moderately severe depressive symptoms. Cutoff scores, sensitivity, specificity, and positive and negative predictive values of the NDDI-E to identify symptomatic patients on the BDI were calculated. A sensitivity of 93.33% and a specificity of 94.44% were found with NDDI-E total scores >15. The positive predictive value was 87.5%, and the negative predictive value was 97.14%. Spearman's rank correlation between the BDI and the NDDI-E was high (r=.78, p=0.000, N=51). Internal consistency was at 0.926 (Cronbach's alpha). The Arabic version of the NDDI-E appears to be a reliable and sensitive instrument in the identification of moderately severe or severe depressive symptoms in people with epilepsy, and it can be used with all Arabic-speaking patients. Copyright © 2014 Elsevier Inc. All rights reserved.
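The four accuracy figures above can be checked against a single 2×2 screening table. The abstract does not report the cell counts; the sketch below assumes 14 true positives, 1 false negative, 34 true negatives and 2 false positives, an inferred split chosen only because it sums to N = 51 and reproduces the reported percentages.

```python
def screening_metrics(tp, fn, tn, fp):
    """Return sensitivity, specificity, PPV and NPV as percentages."""
    return {
        "sensitivity": 100 * tp / (tp + fn),  # symptomatic patients flagged
        "specificity": 100 * tn / (tn + fp),  # non-symptomatic patients cleared
        "ppv": 100 * tp / (tp + fp),          # flagged patients who are symptomatic
        "npv": 100 * tn / (tn + fn),          # cleared patients who are non-symptomatic
    }


# Assumed cell counts (inferred, not reported in the abstract); total N = 51.
metrics = screening_metrics(tp=14, fn=1, tn=34, fp=2)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}%")
```

If these counts are right, 15 of the 51 patients scored above the BDI threshold, and the predictive values would shift in samples where depressive symptoms are more or less common.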
Coluci, Marina Zambon Orpinelli; Alexandre, Neusa Maria Costa
2014-11-01
The objectives of this study were to develop a questionnaire that evaluates the perception of nursing workers to job factors that may contribute to musculoskeletal symptoms, and to evaluate its psychometric properties. Internationally recommended methodology was followed: construction of domains, items and the instrument as a whole, content validity, and pre-test. Psychometric properties were evaluated among 370 nursing workers. Construct validity was analyzed by the factorial analysis, known-groups technique, and convergent validity. Reliability was assessed through internal consistency and stability. Results indicated satisfactory fit indices during confirmatory factor analysis, significant difference (p < 0.01) between the responses of nursing and office workers, and moderate correlations between the new questionnaire and Numeric Pain Scale, SF-36 and WRFQ. Cronbach's alpha was close to 0.90 and ICC values ranged from 0.64 to 0.76. Therefore, results indicated that the new questionnaire had good psychometric properties for use in studies involving nursing workers. Copyright © 2014 Elsevier Ltd and The Ergonomics Society. All rights reserved.
Development and Validation of the Faceted Inventory of the Five-Factor Model (FI-FFM).
Watson, David; Nus, Ericka; Wu, Kevin D
2017-06-01
The Faceted Inventory of the Five-Factor Model (FI-FFM) is a comprehensive hierarchical measure of personality. The FI-FFM was created across five phases of scale development. It includes five facets apiece for neuroticism, extraversion, and conscientiousness; four facets within agreeableness; and three facets for openness. We present reliability and validity data obtained from three samples. The FI-FFM scales are internally consistent and highly stable over 2 weeks (retest rs ranged from .64 to .82, median r = .77). They show strong convergent and discriminant validity vis-à-vis the NEO, the Big Five Inventory, and the Personality Inventory for DSM-5. Moreover, self-ratings on the scales show moderate to strong agreement with corresponding ratings made by informants (rs ranged from .26 to .66, median r = .42). Finally, in joint analyses with the NEO Personality Inventory-3, the FI-FFM neuroticism facet scales display significant incremental validity in predicting indicators of internalizing psychopathology.
Phakthongsuk, Pitchaya
2009-04-01
To test the construct validity of the Thai version of the job content questionnaire (TJCQ). The present descriptive study recruited 10415 participants from all occupations according to the International Standard Classification of Occupations. The instrument consisted of the 48-item job content questionnaire. Eight items newly developed by the authors from in-depth interviews were added. Exploratory factor analysis showed a six-factor model of work hazards, decision latitude, psychological demand, social support, physical demand, and job security. However, supervisor and co-worker support were not distinguished into two factors and some items distributed differently along the factors extracted. Confirmatory factor analysis supported the construct of six latent factors, although the overall fit was moderately acceptable. Cronbach's alpha coefficients higher than 0.7 supported the internal consistency of TJCQ scales except for job security (0.55). These findings suggest that TJCQ is valid and reliable for assessing job stress among Thai populations.
Psychometric properties of a Chinese asthma quality of life questionnaire.
Wang, Ningqun; Huang, Xiaobo; Chen, Wenqiang; Zhang, Xiaomei; Zhang, Yongsheng; Chen, Yujing
2017-12-01
To assess the acceptability, reliability, validity, and responsiveness of the Chinese Asthma Quality of Life Questionnaire (C-AQLQ) in a sample of Chinese asthma patients. The C-AQLQ and Short Form 36 Health Survey (SF-36) scales were administered to patients at baseline and 3 months later. Asthma severity condition and lung function were evaluated. Necessary data were gathered to assess the psychometric properties such as the feasibility, internal consistency, test-retest reliability, structural validity, discriminant validity, convergent validity, and responsiveness of the C-AQLQ. One hundred and thirty-seven patients completed the investigation. The Cronbach's alpha coefficient for the total scale was 0.96. Factor analysis yielded five factors that generally corresponded to the five proposed subscales. Patients with mild asthma reported higher scores than patients with moderate/severe asthma on all subscales other than environmental stimuli. Lung function measurement and the asthma severity score correlated significantly with domains of the C-AQLQ but with fewer domains of the SF-36. The questionnaire detected within-subject changes in patients' asthma status during follow-up. Results indicated preliminary support that the C-AQLQ is a reliable, valid, discriminating, and responsive measure of quality of life in Chinese asthma patients. It is more sensitive than the generic SF-36 in detecting differences in asthma severity.
Health related quality of life in disorders of defecation: the Defecation Disorder List
Voskuijl, W; van der Zaag-Loon..., H J; Ketel, I; Grootenhuis, M; Derkx, B; Benninga, M
2004-01-01
Background: Constipation and encopresis frequently cause problems with respect to emotional wellbeing, and social and family life. Instruments to measure Health Related Quality of Life (HRQoL) in these disorders are not available. Methods: A disease specific HRQoL instrument, the "Defecation Disorder List" (DDL) for children with constipation or functional non-retentive faecal soiling (FNRFS) was developed using accepted guidelines. For each phase of the process, different samples of patients were used. The final phase of development included 27 children. Reliability was assessed in two ways: internal consistency of domains with Cronbach's alpha, and test-retest reliability with intra-class correlation coefficients (ICC). To assess validity, comparable items and domains were correlated with Tacqol, a generic HRQoL instrument for children (TNO-AZL). Results: In the final phase of the development, 27 children completed the instrument. It consisted of 37 items in four domains. The response rate was 96%. Reliability was good for all domains, with Cronbach's alpha values ranging from 0.61 to 0.76. Measures of test-retest stability were good for all four domains with ICCs ranging from 0.82 to 0.92. Validity based on comparison with the Tacqol instrument was moderate. Conclusion: The DDL is promising as a measure of HRQoL in childhood defecation disorders. PMID:15557046
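Several of the abstracts in this listing report internal consistency as Cronbach's alpha. As a rough illustration of how that coefficient is obtained from item-level scores, the following is a minimal sketch using the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The function name and the example scores are hypothetical and are not taken from any of the studies listed here.

```python
from statistics import variance


def cronbach_alpha(items):
    """Cronbach's alpha for a list of items.

    `items` is a list of score lists, one list per item, each holding
    one score per respondent (hypothetical layout chosen for this sketch).
    """
    k = len(items)
    if k < 2:
        raise ValueError("Cronbach's alpha needs at least two items")
    n = len(items[0])
    if any(len(scores) != n for scores in items):
        raise ValueError("every item needs one score per respondent")

    # Total score per respondent (sum across items).
    totals = [sum(respondent) for respondent in zip(*items)]

    # Sample variances (n - 1 denominator) of each item and of the totals.
    item_variance_sum = sum(variance(scores) for scores in items)
    total_variance = variance(totals)

    return (k / (k - 1)) * (1 - item_variance_sum / total_variance)


if __name__ == "__main__":
    # Made-up scores: 2 items answered by 3 respondents.
    example = [[1, 2, 3], [2, 1, 3]]
    print(round(cronbach_alpha(example), 3))
```

With very small samples, such as the 27 children in the final development phase above, the estimate is likely to be imprecise, so a sketch like this is best read alongside a confidence interval computed by a dedicated statistics package.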
Brazilian version of the body dysmorphic disorder examination.
Jorge, Renata Trajano Borges; Sabino Neto, Miguel; Natour, Jamil; Veiga, Daniela Francescato; Jones, Anamaria; Ferreira, Lydia Masako
2008-03-06
Body image improvement is considered to be the main reason for undergoing plastic surgery. The objective was to translate the Body Dysmorphic Disorder Examination (BDDE) into Brazilian Portuguese and to adapt and validate this questionnaire for use in Brazil. Cross-sectional survey, at the Department of Plastic Surgery of Universidade Federal de São Paulo. The BDDE was first translated into Portuguese and then back-translated into English. These translations were then discussed by healthcare professionals in order to establish the final Brazilian version. In a second stage, the validity and reliability of the BDDE were assessed. For this, patients were initially interviewed by two interviewers and subsequently, by only one of these interviewers. On the first occasion, in addition to the BDDE, the body shape questionnaire (BSQ) and the Rosenberg self-esteem scale were also applied. These questionnaires were applied to 90 patients. Six questions were modified during the assessment of cultural equivalence. Cronbach's alpha was 0.89 and the intraclass correlation coefficients for interobserver and test-retest reliability were 0.91 and 0.87, respectively. Pearson's coefficient showed no correlation between the BDDE and the Rosenberg self-esteem scale (0.22), whereas there was a moderate correlation between the BDDE and the BSQ (0.64). The BDDE was successfully translated and adapted, with good internal consistency, reliability and construct validity.
Rahavi-Ezabadi, Sara; Amali, Amin; Sadeghniiat-Haghighi, Khosro; Montazeri, Ali; Nedjat, Saharnaz
2016-05-01
The aim of this study was the translation, cross-cultural adaptation, and validation of the Sleep Apnea Quality of Life Index (SAQLI) in Persian-speaking patients with obstructive sleep apnea (OSA). Ninety-six patients with OSA completed a series of questionnaires including SAQLI, Epworth Sleepiness Scale (ESS), 10-item Functional Outcomes of Sleep Questionnaire (FOSQ-10), and Medical Outcome Survey Short form 12 (SF-12) for assessment of reliability, validity, and responsiveness of the Persian version of SAQLI. The Persian version of SAQLI had a very good internal consistency and also demonstrated good test-retest reliability. Concurrent validity was confirmed by significant correlations with ESS, FOSQ-10 and SF-12 subscale scores. Comparison of SAQLI scores in groups of patients categorized by ESS showed the high discriminative power of this instrument. However, there was no significant difference in the SAQLI scores of patients with mild, moderate, and severe sleep apnea. The results of sensitivity to change verified that the SAQLI was able to detect changes after continuous positive airway pressure (CPAP) treatment. The findings of this study indicate that the Persian version of SAQLI is a reliable, valid, and responsive measure for evaluation of quality of life in patients with OSA.
Kawata, Ariane K; Wilson, Hilary; Ong, Siew Hwa; Kulich, Karoly; Coyne, Karin
2016-10-01
The aim of this study was to evaluate the factor structure and psychometric characteristics of the Hypoglycemia Perspectives Questionnaire (HPQ) assessing experience and perceptions of hypoglycemia in patients with type 2 diabetes mellitus (T2DM). HPQ was administered to adults with T2DM in a clinical sample from Cyprus (HYPO-Cyprus, n = 500) and a community sample in the United States (US, n = 1257) from the 2011 US National Health and Wellness Survey. Demographic and clinical data were collected. Analysis of HPQ data from two convenience samples examined item performance, factor structure, and HPQ measurement properties (reliability, convergent validity, known-groups validity). Analyses supported three HPQ domains: symptom concern (six items), compensatory behavior (five items), and worry (five items). Internal consistency was high for all three domains (all ≥0.75), supporting reliability. Convergent validity was supported by moderate Spearman correlations between HPQ domain scores and the Audit of Diabetes-Dependent Quality of Life (ADDQoL-19) total score. Patients with recent hypoglycemia events had significantly higher HPQ scores, supporting known-group validity. HPQ may be a valid and reliable measure capturing the experience and impact of hypoglycemia and useful in clinical trials and community-based settings.
Development and Validation of the Numeracy Understanding in Medicine Instrument Short Form
Schapira, Marilyn M.; Walker, Cindy M.; Miller, Tamara; Fletcher, Kathlyn A; Ganschow, Pamela G.; Jacobs, Elizabeth A; Imbert, Diana; O'Connell, Maria; Neuner, Joan M.
2014-01-01
Background: Health numeracy can be defined as the ability to understand and use numeric information and quantitative concepts in the context of health. We previously reported the development of the Numeracy Understanding in Medicine Instrument (NUMi), a 20-item test developed using item response theory. We now report the development and validation of a short form of the NUMi. Methods: Item statistics were used to identify a subset of 8 items representing a range of difficulty and content areas. Internal reliability was evaluated with Cronbach's alpha. Divergent and convergent validity was assessed by comparing scores of the S-NUMi with existing measures of education, print and numeric health literacy, mathematic achievement, cognitive reasoning, and the original NUMi. Results: The 8-item scale had adequate reliability (Cronbach's alpha: 0.72) and was strongly correlated to the 20-item NUMi (0.92). The S-NUMi scores were strongly correlated with the Lipkus numeracy test (0.62), Wide Range of Achievement Test-Mathematics (WRAT-M) (0.72), and Wonderlic cognitive reasoning test (0.76). Moderate correlation was found with education level (0.58) and print literacy as measured by the TOFHLA (0.49). Conclusion: The short Numeracy Understanding in Medicine Instrument is a reliable and valid measure of health numeracy feasible for use in clinical and research settings. PMID:25315596
Elfering, Achim; Cronenberg, Sonja; Grebner, Simone; Tamcan, Oezguer; Müller, Urs
2017-12-01
A newly developed questionnaire assessing limitations in activity of daily living (LADL-Q) that should improve assessment of LADL is tested in a large population-based validation study. This survey was paper-based. Overall, 16,634 individuals who were representative of the working population in the German-speaking part of Switzerland participated in the study. Item analysis was used to reduce the final version of the LADL-Q to four items per subscale that correspond to potential problems in three body regions (back and neck, upper extremities, lower extremities). Analysis included tests for reliability, internal consistency, dimensionality and convergent validity. Test-retest reliability coefficients after 2 weeks ranged from 0.82 to 0.99 (Mdn = 0.87), with no item having a coefficient below 0.60. The median item-total coefficients ranged between moderate and good. Correlation coefficients between LADL-Q subscales and three validated clinical instruments (Western Ontario and McMaster Universities osteoarthritis index, shoulder pain disability index, Oswestry) ranged from 0.63 to 0.81. In structural equation modeling the three subscales were significantly related with two important outcomes in occupational rehabilitation: self-reported general health and daily task performance. The new LADL-Q is a brief, reliable and valid tool for assessment of LADL in studies on musculoskeletal health.
Alyusuf, Raja H.; Prasad, Kameshwar; Abdel Satir, Ali M.; Abalkhail, Ali A.; Arora, Roopa K.
2013-01-01
Background: The exponential use of the internet as a learning resource, coupled with the varied quality of many websites, has led to a need to identify suitable websites for teaching purposes. Aim: The aim of this study is to develop and to validate a tool, which evaluates the quality of undergraduate medical educational websites; and apply it to the field of pathology. Methods: A tool was devised through several steps of item generation, reduction, weightage, pilot testing, post-pilot modification of the tool and validating the tool. Tool validation included measurement of inter-observer reliability; and generation of criterion related, construct related and content related validity. The validated tool was subsequently tested by applying it to a population of pathology websites. Results and Discussion: Reliability testing showed a high internal consistency reliability (Cronbach's alpha = 0.92), high inter-observer reliability (Pearson's correlation r = 0.88), intraclass correlation coefficient = 0.85 and κ = 0.75. It showed high criterion related, construct related and content related validity. The tool showed moderately high concordance with the gold standard (κ = 0.61); 92.2% sensitivity, 67.8% specificity, 75.6% positive predictive value and 88.9% negative predictive value. The validated tool was applied to 278 websites; 29.9% were rated as recommended, 41.0% as recommended with caution and 29.1% as not recommended. Conclusion: A systematic tool was devised to evaluate the quality of websites for medical educational purposes. The tool was shown to yield reliable and valid inferences through its application to pathology websites. PMID:24392243
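The sensitivity, specificity and predictive values quoted in the abstract above are all derived from a 2x2 table comparing a tool against a gold standard. The sketch below shows the usual definitions; the function name and the counts in the usage example are hypothetical and do not reproduce the study's underlying data, which the abstract does not report.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Standard 2x2 diagnostic accuracy measures.

    tp: tool positive, gold standard positive
    fp: tool positive, gold standard negative
    fn: tool negative, gold standard positive
    tn: tool negative, gold standard negative
    """
    return {
        # Share of gold-standard positives that the tool flags.
        "sensitivity": tp / (tp + fn),
        # Share of gold-standard negatives that the tool clears.
        "specificity": tn / (tn + fp),
        # Share of tool positives that are truly positive.
        "ppv": tp / (tp + fp),
        # Share of tool negatives that are truly negative.
        "npv": tn / (tn + fn),
    }


if __name__ == "__main__":
    # Made-up counts for illustration only.
    for name, value in diagnostic_metrics(tp=90, fp=30, fn=10, tn=70).items():
        print(f"{name}: {value:.3f}")
```

Unlike sensitivity and specificity, the two predictive values depend on how common the positive class is in the sample, so they may not carry over to a population with a different mix of good and poor websites.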
Validity and reliability of acoustic analysis of respiratory sounds in infants
Elphick, H; Lancaster, G; Solis, A; Majumdar, A; Gupta, R; Smyth, R
2004-01-01
Objective: To investigate the validity and reliability of computerised acoustic analysis in the detection of abnormal respiratory noises in infants. Methods: Blinded, prospective comparison of acoustic analysis with stethoscope examination. Validity and reliability of acoustic analysis were assessed by calculating the degree of observer agreement using the κ statistic with 95% confidence intervals (CI). Results: 102 infants under 18 months were recruited. Convergent validity for agreement between stethoscope examination and acoustic analysis was poor for wheeze (κ = 0.07 (95% CI, –0.13 to 0.26)) and rattles (κ = 0.11 (–0.05 to 0.27)) and fair for crackles (κ = 0.36 (0.18 to 0.54)). Both the stethoscope and acoustic analysis distinguished well between sounds (discriminant validity). Agreement between observers for the presence of wheeze was poor for both stethoscope examination and acoustic analysis. Agreement for rattles was moderate for the stethoscope but poor for acoustic analysis. Agreement for crackles was moderate using both techniques. Within-observer reliability for all sounds using acoustic analysis was moderate to good. Conclusions: The stethoscope is unreliable for assessing respiratory sounds in infants. This has important implications for its use as a diagnostic tool for lung disorders in infants, and confirms that it cannot be used as a gold standard. Because of the unreliability of the stethoscope, the validity of acoustic analysis could not be demonstrated, although it could discriminate between sounds well and showed good within-observer reliability. For acoustic analysis, targeted training and the development of computerised pattern recognition systems may improve reliability so that it can be used in clinical practice. PMID:15499065
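The κ statistic used in the abstract above corrects raw agreement between two raters (or two methods) for the agreement expected by chance. A minimal sketch of unweighted Cohen's kappa follows; the function name and the example ratings are hypothetical, and the confidence intervals reported in the study are not computed here.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa for two equal-length lists of category labels."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(rater_a)

    # Observed proportion of cases on which the two raters agree.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n

    # Chance agreement from each rater's marginal category frequencies.
    count_a = Counter(rater_a)
    count_b = Counter(rater_b)
    categories = set(count_a) | set(count_b)
    expected = sum(count_a[c] * count_b[c] for c in categories) / (n * n)

    if expected == 1:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1 - expected)


if __name__ == "__main__":
    # Made-up presence/absence ratings (1 = sound heard, 0 = not heard).
    stethoscope = [1, 1, 0, 0, 1, 0]
    acoustic = [1, 0, 0, 0, 1, 1]
    print(round(cohen_kappa(stethoscope, acoustic), 3))
```

A value near 0 indicates agreement no better than chance and a value of 1 indicates perfect agreement, which is the scale on which the abstract's "poor", "fair" and "moderate" labels are read.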
The assessment of fatigue: Psychometric qualities and norms for the Checklist individual strength.
Worm-Smeitink, M; Gielissen, M; Bloot, L; van Laarhoven, H W M; van Engelen, B G M; van Riel, P; Bleijenberg, G; Nikolaus, S; Knoop, H
2017-07-01
The Checklist Individual Strength (CIS) measures four dimensions of fatigue: Fatigue severity, concentration problems, reduced motivation and activity. On the fatigue severity subscale, a cut-off score of 35 is used. This study 1) investigated the psychometric qualities of the CIS; 2) validated the cut-off score for severe fatigue and 3) provided norms. Representatives of the Dutch general population (n=2288) completed the CIS. The factor structure was investigated using an exploratory factor analysis. Internal consistency and test-retest reliability were determined. Concurrent validity was assessed in two additional samples by correlating the CIS with other fatigue scales (Chalder Fatigue Questionnaire, MOS Short form-36 Vitality subscale, EORTC QLQ-C30 fatigue subscale). To validate the fatigue severity cut-off score, a Receiver Operating Characteristics analysis was performed with patients referred to a chronic fatigue treatment centre (n=5243) and a healthy group (n=1906). Norm scores for CIS subscales were calculated for the general population, patients with chronic fatigue syndrome (CFS; n=1407) and eight groups with other medical conditions (n=1411). The original four-factor structure of the CIS was replicated. Internal consistency (α=0.84-0.95) and test-retest reliability (r=0.74-0.86) of the subscales were high. Correlations with other fatigue scales were moderate to high. The 35 points cut-off score for severe fatigue is appropriate, but, given the 17% false positive rate, should be adjusted to 40 for research in CFS. The CIS is a valid and reliable tool for the assessment of fatigue, with a validated cut-off score for severe fatigue that can be used in clinical practice. Copyright © 2017. Published by Elsevier Inc.
Medina, Catalina; Barquera, Simón; Janssen, Ian
2013-07-01
To determine the test-retest reliability and validity of the Spanish version of the short-form International Physical Activity Questionnaire (IPAQ) among adults in Mexico. This was a cross-sectional study of a convenience sample of 267 adult factory workers in Mexico City. Participants were 19 - 68 years of age; 48% were female. Participants wore an accelerometer for 9 consecutive days and were administered the Spanish version of the short form IPAQ on two occasions (IPAQ1 and IPAQ2, separated by 9 days). The relation and differences between moderate-to-vigorous physical activity (MVPA) measures obtained from IPAQ1, IPAQ2, and the accelerometer were determined using correlations, linear regression, and paired t-tests. IPAQ1 and IPAQ2 measures of MVPA were significantly correlated to each other (r = 0.55, P < 0.01). However, MVPA was 44 ± 408 minutes/week lower in IPAQ1 than in IPAQ2, although this difference did not reach statistical significance (P = 0.08). The (min/week) measures from IPAQ1 and IPAQ2 were only modestly correlated with the accelerometer measures (r = 0.26 and r = 0.31, P < 0.01), and by comparison to accelerometer measures, MVPA values were higher when based on IPAQ1 (174 ± 357 min/week, P < 0.01) than for IPAQ2 (135 ± 360 min/week, P < 0.01). The percentage of participants who were classified as physically inactive according to the World Health Organization guidelines was 18.0% in IPAQ1, 25.1% in IPAQ2, and 28.2% based on the accelerometer. Similar to what has been observed in other populations, the short form IPAQ has a modest reliability and poor validity for assessing MVPA among Mexican adults.
Hollmeyer, Helge; Hardiman, Max; Harbarth, Stephan; Pittet, Didier
2011-01-01
Objective: To investigate the reliability of the public health event notification assessment process under the International Health Regulations (2005) (IHR). Methods: In 2009, 193 National IHR Focal Points (NFPs) were invited to use the decision instrument in Annex 2 of the IHR to determine whether 10 fictitious public health events should be notified to WHO. Each event’s notifiability was assessed independently by an expert panel. The degree of consensus among NFPs and of concordance between NFPs and the expert panel was considered high when more than 70% agreed on a response. Findings: Overall, 74% of NFPs responded. The median degree of consensus among NFPs on notification decisions was 78%. It was high for the six events considered notifiable by the majority (median: 80%; range: 76–91) but low for the remaining four (median: 55%; range: 54–60). The degree of concordance between NFPs and the expert panel was high for the five events deemed notifiable by the panel (median: 82%; range: 76–91) but low (median: 51%; range: 42–60) for those not considered notifiable. The NFPs identified notifiable events with greater sensitivity than specificity (P < 0.001). Conclusion: When used by NFPs, the notification assessment process in Annex 2 of the IHR was sensitive in identifying public health events that were considered notifiable by an expert panel, but only moderately specific. The reliability of the assessments could be increased by expanding guidance on the use of the decision instrument and by including more specific criteria for assessing events and clearer definitions of terms. PMID:21479094
Validation of a Spanish version of the Spine Functional Index.
Cuesta-Vargas, Antonio I; Gabel, Charles P
2014-06-27
The Spine Functional Index (SFI) is a recently published, robust and clinimetrically valid patient reported outcome measure. The purpose of this study was the adaptation and validation of a Spanish-version (SFI-Sp) with cultural and linguistic equivalence. A two stage observational study was conducted. The SFI was cross-culturally adapted to Spanish through double forward and backward translation then validated for its psychometric characteristics. Participants (n = 226) with various spine conditions of >12 weeks duration completed the SFI-Sp and a region specific measure: for the back, the Roland Morris Questionnaire (RMQ) and Backache Index (BADIX); for the neck, the Neck Disability Index (NDI); for general health the EQ-5D and SF-12. The full sample was employed to determine internal consistency, concurrent criterion validity by region and health, construct validity and factor structure. A subgroup (n = 51) was used to determine reliability at seven days. The SFI-Sp demonstrated high internal consistency (α = 0.85) and reliability (r = 0.96). The factor structure was one-dimensional and supported construct validity. Criterion specific validity for function was high with the RMQ (r = 0.79), moderate with the BADIX (r = 0.59) and low with the NDI (r = 0.46). For general health it was low with the EQ-5D and inversely correlated (r = -0.42) and fair with the Physical and Mental Components of the SF-12 and inversely correlated (r = -0.56 and r = -0.48), respectively. The study limitations included the lack of longitudinal data regarding other psychometric properties, specifically responsiveness. The SFI-Sp was demonstrated as a valid and reliable spine-regional outcome measure. The psychometric properties were comparable to and supported those of the English-version, however further longitudinal investigations are required.
Jean-Pierre, Pascal; Fiscella, Kevin; Winters, Paul C; Paskett, Electra; Wells, Kristen; Battaglia, Tracy
2012-09-01
Patient satisfaction (PS), a key measure of quality of cancer care, is a core study outcome of the multi-site National Cancer Institute-funded Patient Navigation Research Program. Despite large numbers of underserved monolingual Spanish speakers (MSS) residing in the USA, there is no validated Spanish measure of PS that spans the whole spectrum of cancer-related care. The present study reports on the validation of the Patient Satisfaction with Cancer Care (PSCC) measure for Spanish (PSCC-Sp) speakers receiving diagnostic and therapeutic cancer-related care. Original PSCC items were professionally translated and back translated to ensure cultural appropriateness, meaningfulness, and equivalence. Then, the resulting 18-item PSCC-Sp measure was administered to 285 MSS. We evaluated latent structure and internal consistency of the PSCC-Sp using principal components analysis (PCA) and Cronbach coefficient alpha (α). We used correlation analyses to demonstrate divergence and convergence of the PSCC-Sp with a Spanish version of the Patient Satisfaction with Interpersonal Relationship with Navigator (PSN-I-Sp) measure and patients' demographics. The PCA revealed a coherent set of items that explained 47% of the variance in PS. Reliability assessment demonstrated that the PSCC-Sp had high internal consistency (α = 0.92). The PSCC-Sp demonstrated good face validity and convergent and divergent validities as indicated by moderate correlations with the PSN-I-Sp (p = 0.003) and nonsignificant correlations with marital status and household income (all p(s) > 0.05). The PSCC-Sp is a valid and reliable measure of PS and should be tested in other MSS populations.
Bonanad, S; De la Rubia, J; Gironella, M; Pérez Persona, E; González, B; Fernández Lago, C; Arnan, M; Zudaire, M; Hernández Rivas, J A; Soler, A; Marrero, C; Olivier, C; Altés, A; Valcárcel, D; Hernández, M T; Oiartzabal, I; Fernández Ordoño, R; Arnao, M; Esquerra, A; Sarrá, J; González-Barca, E; González, J; Calvo, X; Nomdedeu, M; García Guiñón, A; Ramírez Payer, A; Casado, A; López, S; Durán, M; Marcos, M; Cruz-Jentoft, A J
2015-09-01
The purpose of this study was to develop a new brief, comprehensive geriatric assessment scale for older patients diagnosed with different hematological malignancies, the Geriatric Assessment in Hematology (GAH scale), and to determine its psychometric properties. The 30-item GAH scale was designed through a multi-step process to cover 8 relevant dimensions. This is an observational study conducted in 363 patients aged ≥65 years, newly diagnosed with different hematological malignancies (myelodysplastic syndrome/acute myeloblastic leukemia, multiple myeloma, or chronic lymphocytic leukemia), and treatment-naïve. The scale psychometric validation process included the analyses of feasibility, floor and ceiling effect, validity and reliability criteria. Mean time taken to complete the GAH scale was 11.9 ± 4.7 min, which improved through a learning-curve effect. Almost 90% of patients completed all items, and no floor or ceiling effects were identified. Criterion validity was supported by reasonable correlations between the GAH scale dimensions and three contrast variables (global health visual analogue scale, ECOG and Karnofsky), except for comorbidities. Factor analysis (supported by the scree plot) revealed nine factors that explained almost 60% of the total variance. Moderate internal consistency reliability was found (Cronbach's α: 0.610), and test-retest was excellent (ICC coefficients, 0.695-0.928). Our study suggests that the GAH scale is a valid, internally reliable and consistent tool to assess health status in older patients with different hematological malignancies. Future large studies should confirm whether the GAH scale may be a tool to improve clinical decision-making in older patients with hematological malignancies. Copyright © 2015 Elsevier Inc. All rights reserved.
Psychometric analyses to improve the Dutch ICF Activity Inventory.
Bruijning, Janna E; van Rens, Ger; Knol, Dirk; van Nispen, Ruth
2013-08-01
In the past, rehabilitation centers for the visually impaired used unstructured or semistructured methods to assess rehabilitation needs of their patients. Recently, an extensive instrument, the Dutch ICF Activity Inventory (D-AI), was developed to systematically investigate rehabilitation needs of visually impaired adults and to evaluate rehabilitation outcomes. The purpose of this study was to investigate the underlying factor structure and other psychometric properties to shorten and improve the D-AI. The D-AI was administered to 241 visually impaired persons who recently enrolled in a multidisciplinary rehabilitation center. The D-AI uses graded scores to assess the importance and difficulty of 65 rehabilitation goals. For high-priority goals (e.g., daily meal preparation), the difficulty of underlying tasks (e.g., read recipes, cut vegetables) was assessed. To reduce underlying task items (>950), descriptive statistics were investigated and factor analyses were performed for several goals. The internal consistency reliability and test-retest reliability of the D-AI were investigated by calculating Cronbach α and Cohen (weighted) κ. Finally, consensus-based discussions were used to shorten and improve the D-AI. Except for one goal, factor analysis model parameters were at least reasonable. Internal consistency reliability was satisfactory (range, 0.74 to 0.93). In total, 60% of the 65 goal importance items and 84.4% of the goal difficulty items showed moderate to almost perfect κ values (≥0.40). After consensus-based discussions, a new D-AI was produced, containing 48 goals and fewer than 500 tasks. The analyses were an important step in the validation process of the D-AI and in developing a more feasible assessment tool to investigate rehabilitation needs of visually impaired persons in a systematic way. The D-AI is currently implemented in all Dutch rehabilitation centers serving all visually impaired adults with various rehabilitation needs.
de Oliveira, Flávia Augusta; Luna, Stelio Pacca Loureiro; do Amaral, Jackson Barros; Rodrigues, Karoline Alves; Sant'Anna, Aline Cristina; Daolio, Milena; Brondani, Juliana Tabarelli
2014-09-06
The recognition and measurement of pain in cattle are important in determining the necessity for and efficacy of analgesic intervention. The aim of this study was to record behaviour and determine the validity and reliability of an instrument to assess acute pain in 40 cattle subjected to orchiectomy after sedation with xylazine and local anaesthesia. The animals were filmed before and after orchiectomy to record behaviour. The pain scale was based on previous studies, on a pilot study and on analysis of the camera footage. Three blinded observers and a local observer assessed the edited films obtained during the preoperative and postoperative periods, before and after rescue analgesia and 24 hours after surgery. Re-evaluation was performed one month after the first analysis. Criterion validity (agreement) and item-total correlation using Spearman's coefficient were employed to refine the scale. Based on factor analysis, a unidimensional scale was adopted. The internal consistency of the data was excellent after refinement (Cronbach's α coefficient = 0.866). There was a high correlation (p < 0.001) between the proposed scale and the visual analogue, simple descriptive and numerical rating scales. The construct validity and responsiveness were confirmed by the increase and decrease in pain scores after surgery and rescue analgesia, respectively (p < 0.001). Inter- and intra-observer reliability ranged from moderate to very good. The optimal cut-off point for rescue analgesia was > 4, and analysis of the area under the curve (AUC = 0.963) showed excellent discriminatory ability. The UNESP-Botucatu unidimensional pain scale for assessing acute postoperative pain in cattle is a valid, reliable and responsive instrument with excellent internal consistency and discriminatory ability. The cut-off point for rescue analgesia provides an additional tool for guiding analgesic therapy.
Guo, Yiting Emily; Togher, Leanne; Power, Emma; Hutomo, Edwin; Yang, Yi-Fei; Tay, Arthur; Yen, Shih-Cheng; Koh, Gerald Choon-Huat
2017-04-01
Access2Aphasia™ is an iPad™-based aphasia assessment application that enables real-time audiovisual communication between people with aphasia (PWA) and speech-language pathologists (SLPs), and the use of supported conversation techniques. This study aimed to establish the reliability of aphasia assessment across the International Classification of Functioning, Disability and Health (ICF) using Access2Aphasia, and compare it with face-to-face (FTF) assessment. Consumer perspectives of Access2Aphasia were also examined. Thirty PWA were randomized into two conditions: online-led and FTF assessment. Participants in the online-led group were assessed remotely using Access2Aphasia™ in their own homes, while an FTF SLP scored silently simultaneously. Participants in the FTF group were assessed FTF using standard administration materials. Assessment included two subtests of the Psycholinguistic Assessment of Language Processing Activities (PALPA) and the Assessment of Living with Aphasia (ALA) to allow for outcomes to be captured across the ICF domains. Consumer perspectives on Access2Aphasia were obtained from both PWA and research SLPs in the online-led group. Kappa statistics indicated moderate to almost perfect agreement between online and FTF SLPs (k = 0.71-1.00). Intrarater and interrater reliability was excellent (ICC = 0.99-1.00) and equivalent for the online-led and FTF conditions. Both PWA and research SLPs in the online-led group reported being satisfied with the experience overall, with suggestions provided by research SLPs to improve Access2Aphasia. This study supports the provision of iPad-based aphasia assessments across the ICF in the online environment, with comparable reliability to FTF assessments. Future research is warranted to support the development of iPad-based aphasia assessment and treatment as an alternative mode of service delivery to PWA.
Yang, Peirong; Chen, Gang; Wang, Peng; Zhang, Kejian; Deng, Feng; Yang, Haifeng; Zhuang, Guihua
2018-05-05
The Child Health Utility 9D (CHU9D), a new generic preference-based health-related quality of life (HRQoL) instrument, was developed specifically for application in cost-effectiveness analyses of treatments and interventions for children and adolescents. The main objective of this study was to examine the psychometric properties of the Chinese version of CHU9D (CHU9D-CHN) in a large school-based sample in China. Data were collected using a multi-stage sampling method from third-to-ninth-grade students in Shaanxi Province, China. Participants self-completed a hard-copy questionnaire including the CHU9D-CHN instrument, the Pediatric Quality of Life Inventory™ 4.0 Generic Core Scales (PedsQL), information on socio-demographic characteristics and self-reported health status. The psychometric properties of the CHU9D-CHN, including the internal consistency, 2-week test-retest reliability, convergent and known-groups validity were studied. A total of 1912 students participated in the survey. The CHU9D-CHN internal consistency and test-retest reliability were good to excellent with a Cronbach's alpha of 0.77 and an intra-class correlation coefficient of 0.65, respectively. The CHU9D utility scores moderately correlated with the PedsQL total scores (r = .57, P < .001), demonstrating good convergent validity. Differences in CHU9D utility scores among participants with different levels of self-reported general health, health services utilisation and left-behind status demonstrated good construct validity. The findings demonstrated adequate psychometric performance for the CHU9D-CHN. The CHU9D-CHN was a satisfactory, reliable and valid instrument to measure and value HRQoL for children and adolescents in China.
Improving quality in healthcare: What makes a satisfied patient?
Más, A; Parra, P; Bermejo, R M; Hidalgo, M D; Calle, J E
2016-01-01
To update the metric properties of a perceived quality questionnaire for patients admitted to hospital medical departments, to determine the level of patient satisfaction achieved, and to identify the variables which predict satisfaction. Self-administered questionnaire completed at home following patient discharge, using a questionnaire prepared by the authors on a sample of 7207 users of medical departments in 9 public hospitals during the years 2006-2009. A principal component analysis with varimax rotation was performed. Reliability was assessed using internal consistency coefficient. An analysis was made of the compliance with each indicator reported by respondents. A logistic regression analysis was performed to determine the perceived quality dimensions which predicted overall patient satisfaction. The results of the reliability analysis indicated good coefficients for interpersonal manner (0.94) and professional competence (0.85) dimensions, and moderate values for the other dimensions (comfort 0.55, information 0.38, and organisation 0.37). Factor analyses showed single factors in each of the perceived quality dimensions, with a percentage of explained variance greater than 35% for information, interpersonal manner, professional competence, and comfort, and less than 30% for organisation. The dimensions which predicted satisfaction were interpersonal manner of healthcare staff, professional competence, and information. The metric properties of the questionnaire used have been updated, yielding a valid and reliable questionnaire for assessing patient satisfaction in quality management programmes, both for internal purposes and for conducting external comparisons. A positive relationship was obtained between the level of patient satisfaction and level of professional competence, interpersonal manner of healthcare staff, and information received. Copyright © 2016 SECA. Publicado por Elsevier España, S.L.U. All rights reserved.
Brown, Heidi Wendell; Wise, Meg E.; Westenberg, Danielle; Schmuhl, Nicholas B.; Brezoczky, Kelly Lewis; Rogers, Rebecca G.; Constantine, Melissa L.
2017-01-01
Introduction and hypothesis Fewer than 30% of women with accidental bowel leakage (ABL) seek care, despite the existence of effective, minimally invasive therapies. We developed and validated a condition-specific instrument to assess barriers to care-seeking for ABL in women. Methods Adult women with ABL completed an electronic survey about condition severity, patient activation, previous care-seeking, and demographics. The Barriers to Care-seeking for Accidental Bowel Leakage (BCABL) instrument contained 42 potential items completed at baseline and again 2 weeks later. Paired t tests evaluated test–retest reliability. Factor analysis evaluated factor structure and guided item retention. Cronbach’s alpha evaluated internal consistency. Within and across factor item means generated a summary BCABL score used to evaluate scale validity with six external criterion measures. Results Among 1,677 click-throughs, 736 (44%) entered the survey; 95% of eligible female respondents (427 out of 458) provided complete data. Fifty-three percent of respondents had previously sought care for their ABL; median age was 62 years (range 27–89); mean Vaizey score was 12.8 (SD = 5.0), indicating moderate to severe ABL. Test–retest reliability was excellent for all items. Factor extraction via oblique rotation resulted in the final structure of 16 items in six domains, within which internal consistency was high. All six external criterion measures correlated significantly with BCABL score. Conclusions The BCABL questionnaire, with 16 items mapping to six domains, has excellent criterion validity and test–retest reliability when administered electronically in women with ABL. The BCABL can be used to identify care-seeking barriers for ABL in different populations, inform targeted interventions, and measure their effectiveness. PMID:28236039
Vuurberg, Gwendolyn; Kluit, Lana; van Dijk, C Niek
2018-03-01
To develop a translated Dutch version of the Cumberland Ankle Instability Tool (CAIT) and test its psychometric properties in a Dutch population with foot and ankle complaints. The CAIT was translated into the Dutch language using a forward-backward translation design. Of the 130 subsequent patients visiting the outpatient clinic for foot and ankle complaints who were asked to fill out a questionnaire containing the CAIT, the Foot and Ankle Outcome Score (FAOS), and the numeric rating scale (NRS) pain, 98 completed the questionnaire. After a 1-week period, patients were asked to fill out a second questionnaire online containing the CAIT and NRS pain. This second questionnaire was completed by 70 patients. With these data, the construct validity, test-retest reliability, internal consistency, measurement error, and ceiling and floor effects were assessed. Additionally, a cut-off value to discriminate between stable and unstable ankles, in patients with ankle complaints, was calculated. Construct validity showed moderate correlations between the CAIT and FAOS subscales (Spearman's correlation coefficient (SCC) = 0.36-0.43), and the NRS pain (SCC = -0.55). The cut-off value was found at 11.5 points of the total CAIT score (range 0-30). Test-retest reliability was excellent, with an intraclass correlation coefficient of 0.94. Internal consistency was high (Cronbach's α = 0.86). No ceiling or floor effects were detected. Based on the results, the Dutch version of the CAIT is a valid and reliable questionnaire to assess ankle instability in the Dutch population and is able to differentiate between a functionally unstable and stable ankle. The tool is the first suitable tool to objectify the severity of ankle instability specific complaints and assess change in the Dutch population. Level of evidence II.
Evidence for universality in phenomenological emotion response system coherence.
Matsumoto, David; Nezlek, John B; Koopmann, Birgit
2007-02-01
The authors reanalyzed data from Scherer and Wallbott's (Scherer, 1997b; Scherer & Wallbott, 1994) International Study of Emotion Antecedents and Reactions to examine how phenomenological reports of emotional experience, expression, and physiological sensations were related to each other within cultures and to determine if these relationships were moderated by cultural differences, which were operationally defined using Hofstede's (2001) typology. Multilevel random coefficient modeling analyses produced several findings of note. First, the vast majority of the variance in ratings was within countries (i.e., at the individual level); a much smaller proportion of the total variance was between countries. Second, there were negative relationships between country-level means and long- versus short-term orientation for numerous measures. Greater long-term orientation was associated with lowered emotional expressivity and fewer physiological sensations. Third, at the individual (within-culture) level, across the 7 emotions, there were consistent and reliable positive relationships among the response systems, indicating coherence among them. Fourth, such relationships were not moderated by cultural differences, as measured by the Hofstede dimensions. (c) 2007 APA, all rights reserved.
Myers, Taryn A; Crowther, Janis H
2007-09-01
Theory and research suggest that sociocultural pressures, thin-ideal internalization, and self-objectification are associated with body dissatisfaction, while feminist beliefs may serve a protective function. This research examined thin-ideal internalization and self-objectification as mediators and feminist beliefs as a moderator in the relationship between sociocultural pressures to meet the thin-ideal and body dissatisfaction. Female undergraduate volunteers (N=195) completed self-report measures assessing sociocultural influences, feminist beliefs, thin-ideal internalization, self-objectification, and body dissatisfaction. Multisample structural equation modeling showed that feminist beliefs moderate the relationship between media awareness and thin-ideal internalization, but not the relationship between social influence and thin-ideal internalization. Research and clinical implications of these findings are discussed.
Hummel, Alexandra C.; Kiel, Elizabeth J.
2014-01-01
Maternal depression relates to child internalizing outcomes, but one missing aspect of this association is how variation in depressive symptoms, including mild and moderate symptoms, relates to young children’s outcomes. The current study examined a moderated mediation model to investigate how maternal behaviors may mediate this association in the context of child temperament and gender. Mothers and toddlers completed a free-play/clean-up task in the laboratory. Mothers rated their depressive symptoms and their toddlers’ temperament and internalizing behaviors. Results indicated a significant indirect effect of maternal warmth on the relation between maternal depressive symptoms and toddler internalizing outcomes for boys with low negative emotionality. Toddler gender and temperament moderated the relation between maternal intrusiveness and toddler internalizing outcomes, but mediation was not supported. Results highlight the important interaction between child and maternal variables in predicting child outcomes, and suggest mechanisms by and conditions under which mild maternal depressive symptomatology can be a risk factor for toddler internalizing outcomes. PMID:24553739
Chan, Kin Sun
2018-01-01
Objectives This study aimed to evaluate the internal consistency, reliability, convergent validity, known-group comparisons, and structural validity of the Chinese version of the Fear of Intimacy with Helping Professionals (C–FIS–HP) scale in Macau. Methods A cross-sectional design was used on a sample of 593 older people in 6 health centers. We used the Chinese version of the Exercise of Self-Care Agency Scale (C-ESCAS) and the Morisky 4-item medication adherence scale to evaluate self-care actions and medication adherence. The internal consistency and reliability of the C–FIS–HP were analyzed using the Spearman-Brown split-half reliability, Cronbach’s alpha, and test–retest reliability. Convergent validity was tested between the construct of the C–FIS–HP and self-care actions. Known-group comparisons differentiated predefined groups in an expected direction. Two separate samples were used to test the structural validity. An exploratory factor analysis (EFA) tested the factor structure of the C–FIS–HP using principal axis factoring. A confirmatory factor analysis (CFA) was further conducted to confirm the factor structure constructed in the prior EFA. Results The C–FIS–HP had a Spearman-Brown split-half coefficient, Cronbach’s alpha, and intraclass correlation coefficient of 0.96, 0.93, and 0.96, respectively. Convergent validity was satisfactory, with significant correlations between the C–FIS–HP and C-ESCAS. The C–FIS–HP was able to differentiate between high-, moderate-, and low-medication adherence groups. EFA demonstrated a two-factor structure among 297 older people. A first-order CFA was performed to confirm the construct dimensionality of the C–FIS–HP with satisfactory fit indices (NFI = 0.92; IFI = 0.95; TLI = 0.94; CFI = 0.95 and RMSEA = 0.07) among 296 older people. Conclusions The C–FIS–HP is a reliable and valid test for assessing helping relationships in older Chinese people.
Health professionals can use C–FIS–HP as a clinical tool to assess the comfort level of patients in a helping relationship, and use this information to develop culturally sensitive therapeutic interventions and treatment plans. Further studies need to be conducted concerning the different psychometric properties, as well as the application of C–FIS–HP in various regions. PMID:29795563
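The split-half coefficient reported above is a Spearman-Brown estimate. A minimal sketch of the correction follows, assuming the usual two-half form; the function names are hypothetical, and the half-test correlation is back-calculated from the reported 0.96 purely for illustration, since the abstract does not state it.

```python
def spearman_brown(r_half, length_factor=2.0):
    """Predicted reliability when a test is lengthened by `length_factor`.
    With the default of 2 this is the split-half correction."""
    if not -1.0 < r_half <= 1.0:
        raise ValueError("correlation must lie in (-1, 1]")
    return length_factor * r_half / (1.0 + (length_factor - 1.0) * r_half)


def half_correlation_from_full(r_full):
    """Invert the two-half correction to recover the half-test correlation."""
    return r_full / (2.0 - r_full)


# Illustration only: the half-test correlation implied by a coefficient of 0.96.
r_half = half_correlation_from_full(0.96)
r_full = spearman_brown(r_half)
print(f"half-test r = {r_half:.3f}, corrected = {r_full:.2f}")
```

The correction is needed because the correlation between two half-tests understates the reliability of the full-length scale.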
Gebremariam, Mekdes K; Lien, Nanna; Torheim, Liv Elin; Andersen, Lene F; Melbye, Elisabeth L; Glavin, Kari; Hausken, Solveig E S; Sleddens, Ester F C; Bjelland, Mona
2016-08-17
The existence of socioeconomic differences in dietary behaviors is well documented. However, studies exploring the mechanisms behind these differences among adolescents using comprehensive and reliable measures of mediators are lacking. The aims of this study were (a) to assess the psychometric properties of new scales assessing the perceived rules and accessibility related to the consumption of vegetables and soft drinks and (b) to explore their mediating role in the association between parental education and the corresponding dietary behaviors. A cross-sectional survey including 440 adolescents from three counties in Norway (mean age 14.3 years (SD = 0.6)) was conducted using a web-based questionnaire. Principal component analysis, test-retest and internal reliability analysis were conducted. The mediating role of perceived accessibility and perceived rules in the association between parental education and the dietary behaviors was explored using linear regression analyses. Factor analyses confirmed two separate subscales, named "accessibility" and "rules", both for vegetables and soft drinks (factor loadings >0.60). The scales had good internal consistency reliability (0.70-0.87). The test-retest reliability of the scales was moderate to good (0.44-0.62). Parental education was inversely related to the consumption of soft drinks and positively related to the consumption of vegetables. Perceived accessibility and perceived rules related to soft drink consumption were found to mediate the association between parental education and soft drink consumption (47.5 and 8.5 % of total effect mediated). Accessibility of vegetables was found to mediate the association between parental education and the consumption of vegetables (51 % of total effect mediated). The new scales developed in this study are comprehensive and have adequate validity and reliability; they are therefore considered appropriate for use among 13-15 year-olds. 
Parents, in particular those with a low educational background, should be encouraged to increase the accessibility of vegetables and to decrease the accessibility of soft drinks, in particular during dinner. Enforcing parental rules limiting soft drink intake in families with low parental education also appears relevant.
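The percentages of the total effect mediated quoted above are commonly derived by comparing regression coefficients with and without the mediator in the model. The sketch below assumes that difference-in-coefficients approach; the coefficients are hypothetical values chosen to reproduce a 47.5% share and are not taken from the study.

```python
def proportion_mediated(total_effect, direct_effect):
    """Share of the total effect carried by the mediator.

    total_effect:  coefficient of the exposure without the mediator (c)
    direct_effect: coefficient of the exposure with the mediator added (c')
    """
    if total_effect == 0:
        raise ValueError("total effect must be non-zero")
    return (total_effect - direct_effect) / total_effect


# Hypothetical coefficients for parental education predicting soft drink intake.
c_total = -0.40   # model without perceived accessibility
c_direct = -0.21  # model with perceived accessibility added
share = proportion_mediated(c_total, c_direct)
print(f"{share:.1%} of the total effect mediated")
```

The measure is only interpretable when the total and direct effects have the same sign, as they do in this example.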
Warren, Cortney S; Castillo, Linda G; Gleaves, David H
2010-01-01
White American cultural values of appearance are implicated in the development of body dissatisfaction. This study examined whether the relationships between awareness of White American appearance ideals, internalization of such ideals, and body dissatisfaction are moderated by behavioral acculturation and attitudinal marginalization in a sample of 94 Mexican American women. Results indicated that behavioral acculturation moderated the relationship between awareness and internalization and cognitive marginalization moderated the relationship between internalization and body dissatisfaction. Body size was positively correlated with body dissatisfaction and negatively correlated with behavioral acculturation. These findings have important implications for clinical practice and research with Mexican American women.
Huang, Bin; Chi, Guangyu; Chen, Xin; Shi, Yi
2011-11-01
The performance of acetic acid-supported pH-heterogenized heterotrophic denitrification (HD) facilitated with ferrous sulfide-based autotrophic denitrification (AD) was investigated in upflow activated carbon-packed column reactors for reliable removal of highly elevated nitrate (42 mg NO₃-N L⁻¹) in drinking water. The use of acetic acid as substrate provided sufficient internal carbon dioxide to completely eliminate the need of external pH adjustment for HD, but simultaneously created vertically heterogenized pH varying from 4.8 to 7.8 in the HD reactor. After 5-week acclimation, the HD reactor developed a moderate nitrate removal capacity with about one third of nitrate removal occurring in the acidic zone (pH 4.8-6.2). To increase the treatment reliability, acetic acid-supported HD was operated under 10% carbon limitation to remove >85% of nitrate, and ferrous sulfide-based AD was supplementally operated to remove residual nitrate and formed nitrite without excess of soluble organic carbon, nitrite or sulfate in the final effluent. Copyright © 2011 Elsevier Ltd. All rights reserved.
Demeyer, Ineke; Romero, Nuria; De Raedt, Rudi
2018-04-01
The interplay between actual and ideal self-esteem may be a key component in emotional disorders. Since automatic self-evaluations are not always consciously accessible, assessment through implicit measures is necessary. Given the lack of implicit self-esteem measures in late life, we aimed to identify a reliable measure and to clarify the role of actual and ideal self-esteem in mood and depressive symptoms in older adults. Forty-nine older adults completed two adapted Go/No go Association tasks measuring implicit actual and ideal self-esteem and measures of mood and depressive symptoms. The two Go/No go Association tasks showed satisfactory internal consistency. Moderation analyses revealed that lower actual self-esteem in older adults is related to higher levels of sad mood when ideal self-esteem is high. Moreover, lower actual self-esteem is related to more anxious mood. Given the role of self-esteem in emotional well-being, a reliable measure for older adults is crucial to improve age-appropriate diagnostics and treatment.
Effectiveness of health management departments of universities that train health managers in Turkey.
Karagoz, Sevgul; Balci, Ali
2007-01-01
This research has aimed to examine the effectiveness of the health management departments of universities which train health managers in Turkey. The study compares - for lecturers and students - nine variables of organisational effectiveness. These nine dimensions are derived from Cameron (1978; 1981; 1986). Factor analysis was used to validate the scale developed by the researcher. For internal consistency and reliability, the Cronbach Alpha reliability coefficient and item total correlation were applied. A questionnaire was administered to a total of 207 people in health management departments in Turkey. In analysis of the data, descriptive statistics and the t-test were used. According to our research findings, at individual university level, lecturers found their departments more effective than did their students. The highest effectiveness was perceived at Baskent University, a private university. The best outcome was achieved for 'organisational health', and 'the ability to acquire resources' achieved the lowest outcome. Effectiveness overall was found to be moderate. Copyright (c) 2006 John Wiley & Sons, Ltd.
Psychometric evaluation of a daily gastro-oesophageal reflux disease symptom measure.
Bytzer, Peter; Reimer, Christina; Smith, Gary; Anatchkova, Milena D; Hsieh, Ray; Wilkinson, Joanne; Thomas, S Jane; Lenderking, William R
2017-03-01
The objective of this study was to evaluate the validity of the Heartburn Reflux Dyspepsia Questionnaire (HRDQ), a newly developed measure of gastro-oesophageal reflux disease (GORD) symptoms. Specifically, the HRDQ was developed for patients, who still experience symptoms with proton pump inhibitor (PPI) treatment. The psychometric properties of HRDQ were evaluated based on data from two clinical trials of patients with GORD with a partial response to PPIs, one from the UK and one from Denmark and Germany. The HRDQ had good internal consistency (Cronbach's alpha range .83-.88) and test-retest reliability (intraclass correlation coefficient range .71-.90). Convergent and discriminant validity were supported by high correlations with ReQuest™ and ability to differentiate between groups based on ReQuest™ cut-off values. Responsiveness of HRDQ was demonstrated by moderate to high correlations with ReQuest™ change scores and time with symptoms. An HRDQ cut-off value of 0.70 for definition of 'bad day' was also evaluated. Based on existing evidence, the HRDQ is a valid and reliable measure of GORD symptoms that can be used as a study outcome in clinical trials.
Hassett, Leanne; Moseley, Anne; Harmer, Alison; van der Ploeg, Hidde P
2015-01-01
To determine the reliability and validity of the Physical Activity Scale for Individuals with a Physical Disability (PASIPD) in adults with severe traumatic brain injury (TBI) and estimate the proportion of the sample participants who fail to meet the World Health Organization guidelines for physical activity. A single-center observational study recruited a convenience sample of 30 community-based ambulant adults with severe TBI. Participants completed the PASIPD on 2 occasions, 1 week apart, and wore an accelerometer (ActiGraph GT3X; ActiGraph LLC, Pensacola, Florida) for the 7 days between these 2 assessments. The PASIPD test-retest reliability was substantial (intraclass correlation coefficient = 0.85; 95% confidence interval, 0.70-0.92), and the correlation with the accelerometer ranged from too low to be meaningful (R = 0.09) to moderate (R = 0.57). From device-based measurement of physical activity, 56% of participants failed to meet the World Health Organization physical activity guidelines. The PASIPD is a reliable measure of the type of physical activity people with severe TBI participate in, but it is not a valid measure of the amount of moderate to vigorous physical activity in which they engage. Accelerometers should be used to quantify moderate to vigorous physical activity in people with TBI.
Barreira, Paulo; Robinson, Mark A; Drust, Barry; Nedergaard, Niels; Raja Azidin, Raja Mohammed Firhad; Vanrenterghem, Jos
2017-09-01
The aim of the present study was to examine reliability and construct convergent validity of Player Load™ (PL) from trunk-mounted accelerometry, expressed as a cumulative measure and an intensity measure (PL·min⁻¹). Fifteen male participants twice performed an overground football match simulation that included four different multidirectional football actions (jog, side cut, stride and sprint) whilst wearing a trunk-mounted accelerometer inbuilt in a global positioning system unit. Results showed a moderate-to-high reliability as indicated by the intra-class correlation coefficient (0.806-0.949) and limits of agreement. Convergent validity analysis showed considerable between-participant variation (coefficient of variation range 14.5-24.5%), which was not explained from participant demographics despite a negative association with body height for the stride task. Between-task variations generally showed a moderate correlation between ranking of participants for PL (0.593-0.764) and PL·min⁻¹ (0.282-0.736). It was concluded that monitoring PL® in football multidirectional actions presents moderate-to-high reliability, that between-participant variability most likely relies on the individual's locomotive skills and not their anthropometrics, and that the intensity of a task expressed by PL·min⁻¹ is largely related to the running velocity of the task.
Baschung Pfister, Pierrette; Sterkele, Iris; Maurer, Britta; de Bie, Rob A.; Knols, Ruud H.
2018-01-01
Manual muscle testing (MMT) and hand-held dynamometry (HHD) are commonly used in people with inflammatory myopathy (IM), but their clinimetric properties have not yet been sufficiently studied. To evaluate the reliability and validity of MMT and HHD, maximum isometric strength was measured in eight muscle groups across three measurement events. To evaluate reliability of HHD, intra-class correlation coefficients (ICC), the standard error of measurements (SEM) and smallest detectable changes (SDC) were calculated. To measure reliability of MMT, linear Cohen's kappa was computed for single muscle groups and the ICC for the total score. Additionally, correlations between MMT8 and HHD were evaluated with Spearman correlation coefficients. Fifty people with myositis (56±14 years, 76% female) were included in the study. Intra- and interrater reliability of HHD yielded excellent ICCs (0.75–0.97) for all muscle groups, except for interrater reliability of ankle extension (0.61). The corresponding SEMs% ranged from 8 to 28% and the SDCs% from 23 to 65%. MMT8 total score revealed excellent intra- and interrater reliability (ICC>0.9). Intrarater reliability of single muscle groups was substantial for shoulder and hip abduction, elbow and neck flexion, and hip extension (0.64–0.69); moderate for wrist (0.53) and knee extension (0.49) and fair for ankle extension (0.35). Interrater reliability was moderate for neck flexion (0.54) and hip abduction (0.44); fair for shoulder abduction, elbow flexion, wrist and ankle extension (0.20–0.33); and slight for knee extension (0.08). Correlations between the two tests were low for wrist, knee, ankle, and hip extension; moderate for elbow flexion, neck flexion and hip abduction; and good for shoulder abduction. In conclusion, the MMT8 total score is a reliable assessment of general muscle weakness in people with myositis, but MMT is not reliable for single muscle groups.
In contrast, our results confirm that HHD can be recommended to evaluate strength of single muscle groups. PMID:29596450
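The SEM and SDC values mentioned above are usually derived from the ICC and the between-subject standard deviation. The sketch below uses the common textbook formulas, SEM = SD·sqrt(1 − ICC) and SDC = 1.96·sqrt(2)·SEM; whether the study used exactly these forms is an assumption, and the strength values are hypothetical.

```python
import math


def standard_error_of_measurement(sd, icc):
    """SEM from the between-subject SD and a reliability coefficient."""
    if not 0.0 <= icc <= 1.0:
        raise ValueError("ICC must lie in [0, 1]")
    return sd * math.sqrt(1.0 - icc)


def smallest_detectable_change(sem, z=1.96):
    """Smallest change exceeding measurement error at the 95% level."""
    return z * math.sqrt(2.0) * sem


# Hypothetical dynamometry data: mean 80 N, SD 20 N, ICC 0.91.
mean_strength = 80.0
sem = standard_error_of_measurement(sd=20.0, icc=0.91)
sdc = smallest_detectable_change(sem)
print(f"SEM = {sem:.1f} N ({100 * sem / mean_strength:.0f}% of mean)")
print(f"SDC = {sdc:.1f} N ({100 * sdc / mean_strength:.0f}% of mean)")
```

Expressing both quantities as a percentage of the mean, as the abstract does, allows muscle groups of very different strength to be compared.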
Test-Retest Reliability of Pediatric Heart Rate Variability
Weiner, Oren M.; McGrath, Jennifer J.
2017-01-01
Heart rate variability (HRV), an established index of autonomic cardiovascular modulation, is associated with health outcomes (e.g., obesity, diabetes) and mortality risk. Time- and frequency-domain HRV measures are commonly reported in longitudinal adult and pediatric studies of health. While test-retest reliability has been established among adults, less is known about the psychometric properties of HRV among infants, children, and adolescents. The objective was to conduct a meta-analysis of the test-retest reliability of time- and frequency-domain HRV measures from infancy to adolescence. Electronic searches (PubMed, PsycINFO; January 1970–December 2014) identified studies with nonclinical samples aged ≤ 18 years; ≥ 2 baseline HRV recordings separated by ≥ 1 day; and sufficient data for effect size computation. Forty-nine studies (N = 5,170) met inclusion criteria. Methodological variables coded included factors relevant to study protocol, sample characteristics, electrocardiogram (ECG) signal acquisition and preprocessing, and HRV analytical decisions. Fisher’s Z was derived as the common effect size. Analyses were age-stratified (infant/toddler < 5 years, n = 3,329; child/adolescent 5–18 years, n = 1,841) due to marked methodological differences across the pediatric literature. Meta-analytic results revealed HRV demonstrated moderate reliability; child/adolescent studies (Z = 0.62, r = 0.55) had significantly higher reliability than infant/toddler studies (Z = 0.42, r = 0.40). Relative to other reported measures, HF exhibited the highest reliability among infant/toddler studies (Z = 0.42, r = 0.40), while rMSSD exhibited the highest reliability among child/adolescent studies (Z = 1.00, r = 0.76). 
Moderator analyses indicated greater reliability with shorter test-retest interval length, reported exclusion criteria based on medical illness/condition, lower proportion of males, prerecording acclimatization period, and longer recording duration; differences were noted across age groups. HRV is reliable among pediatric samples. Reliability is sensitive to pertinent methodological decisions that require careful consideration by the researcher. Limited methodological reporting precluded several a priori moderator analyses. Suggestions for future research, including standards specified by Task Force Guidelines, are discussed. PMID:29307951
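The paired Z and r values above are related through the Fisher transformation, Z = atanh(r) and r = tanh(Z). The short sketch below converts the reported pooled Z values back to correlations; only the conversion is shown, not the meta-analytic pooling, and the dictionary labels are shorthand introduced here.

```python
import math


def fisher_z(r):
    """Fisher's Z transform of a correlation coefficient."""
    return math.atanh(r)


def z_to_r(z):
    """Back-transform a Fisher Z value to a correlation coefficient."""
    return math.tanh(z)


# Pooled Z values reported in the abstract, converted back to r.
reported = {"child/adolescent": 0.62, "infant/toddler": 0.42, "rMSSD": 1.00}
converted = {label: round(z_to_r(z), 2) for label, z in reported.items()}
for label, r in converted.items():
    print(f"{label}: Z = {reported[label]:.2f} -> r = {r:.2f}")
```

Effect sizes are pooled on the Z scale because its sampling distribution is approximately normal with a variance that depends only on sample size, and the result is then back-transformed to r for reporting.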
Urdu version of the neck disability index: a reliability and validity study.
Farooq, Muhammad Nazim; Mohseni-Bandpei, Mohammad A; Gilani, Syed Amir; Hafeez, Ambreen
2017-04-08
Despite the wide use of the neck disability index (NDI) for assessing disability in patients with neck pain, the NDI has not yet been translated and validated in Urdu. The first purpose of the present study was to translate and cross-culturally adapt the NDI into the Urdu language (NDI-U). The second purpose was to investigate the reliability, validity and responsiveness of the NDI-U in Urdu-speaking patients experiencing chronic mechanical neck pain (CMNP). Translation and cross-cultural adaptation of the original version of the NDI were carried out using previously described procedures. Seventy-six patients with CMNP and thirty healthy participants were recruited for the study. NDI-U and visual analogue scales for pain intensity (VAS pain ) and disability (VAS disability ) were administered to all the participants at baseline and to the patients 3 weeks after receiving physiotherapy intervention. The global rating of change scale (GROC) was also administered at this time. Test-retest reliability and internal consistency were carried out on forty-six randomly selected patients two days after they completed the NDI-U. The NDI-U was evaluated for factor analysis, content validity, construct validity (discriminative and convergent validity) and responsiveness. An intra-class correlation coefficient (ICC 2,1 ) revealed excellent test-retest reliability for all items (ICC 2,1 = 0.86-0.98) and total scores (ICC 2,1 = 0.99) of the NDI-U. The NDI-U was found internally consistent with a Cronbach's alpha of 0.90 and a fair to good correlation between single items and the NDI-U total scores (r = 0.34 to 0.89). Factor analysis of the NDI-U produced two factors explaining 66.71% of the variance. Content validity was good, as no floor or ceiling effects were detected for the NDI-U total score. To determine discriminative validity, an independent t-test revealed a significant difference in the NDI-U total scores between the patients and healthy controls (P < 0.001). 
For convergent validity, Pearson's correlation coefficient showed a strong correlation between NDI-U and VAS disability (r = 0.83, P < 0.001) and a moderate correlation between NDI-U and VAS pain (r = 0.62, P < 0.001). To measure responsiveness, an independent t-test showed a significant difference in the NDI-U change scores between the stable and the improved groups (P < 0.001). Furthermore, moderate correlations were found between the NDI-U change scores and the GROC (r = 0.50, P < 0.001), VAS disability change scores (r = 0.58, P < 0.001) and VAS pain change scores (r = 0.55, P < 0.001). The results showed that the NDI-U is a reliable, valid and responsive questionnaire to measure disability in Urdu-speaking patients with CMNP.
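Internal consistency in the abstract above is reported as Cronbach's alpha. As a rough illustration only, the textbook formula can be sketched as follows; the function name and the toy data are hypothetical, and nothing here reproduces the study's own analysis or data.

```python
from statistics import variance


def cronbach_alpha(item_scores):
    """Textbook Cronbach's alpha.

    item_scores: list of respondents, each a list of k item scores
    (hypothetical input layout, assumed for this sketch).
    """
    k = len(item_scores[0])
    # Sample variance of each item across respondents
    item_vars = [variance([row[i] for row in item_scores]) for i in range(k)]
    # Sample variance of the respondents' total scores
    total_var = variance([sum(row) for row in item_scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Toy data (not from the study): three items that move together perfectly
toy = [[1, 1, 1], [2, 2, 2], [3, 3, 3]]
print(cronbach_alpha(toy))
```

Items that rise and fall together give an alpha close to 1; weakly related items pull it towards 0.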
Fresson, Megan; Dardenne, Benoit; Geurten, Marie; Meulemans, Thierry
2017-11-01
Diagnosis threat has been shown to produce detrimental effects on neuropsychological performance in individuals with mild traumatic brain injury (mTBI). Focusing on contact-sport players who are at great risk of mTBI, our study was designed to examine the moderating role of internal locus of control. Specifically, we predicted that following diagnosis threat (reminder of their risk of sustaining mTBI and of its consequences), low-internal contact-sport players would underperform (assimilation to the stereotype), while their high-internal counterparts would outperform (contrast effect). We predicted that effort and anxiety would mediate these effects. Contact-sport players and non-contact-sport players ("control" group) were randomly assigned to one condition (diagnosis threat or neutral) and then completed attention, executive, episodic memory, and working memory tasks. Regarding mediating and moderating variables, participants rated their effort and anxiety (self-report measures) and completed the Levenson (1974) locus of control scale. Regression-based path analyses were carried out to examine the direct and indirect effects. As expected, there was no effect of condition on the control group's performance. Contact-sport players with moderate and high levels of internal control outperformed (contrast effect) on executive and episodic memory tasks following diagnosis threat compared to the neutral condition. Additionally, the less anxiety moderate- and high-internal contact-sport participants felt, the better they performed on episodic memory and executive tasks. However, contact-sport players low in internal control did not underperform (assimilation effect) under diagnosis threat. Our results suggest that diagnosis threat instructions may have challenged moderate- and high-internal contact-sport participants, leading them to outperform compared to the neutral condition. 
Individuals who have moderate and high levels of internal locus of control may have higher performance under diagnosis threat compared to the neutral condition because of their feeling of control over their cognitive performance.
Carlozzi, Noelle E; Ianni, Phillip A; Tulsky, David S; Brickell, Tracey A; Lange, Rael T; French, Louis M; Cella, David; Kallen, Michael A; Miner, Jennifer A; Kratz, Anna L
2018-06-19
Objective: To examine the reliability and validity of Patient Reported Outcomes Measurement Information System (PROMIS) measures of sleep disturbance and fatigue in TBI caregivers and to determine the severity of fatigue and sleep disturbance in these caregivers. Design: Cross-sectional survey data collected through an online data capture platform. Setting: Four rehabilitation hospitals and Walter Reed National Military Medical Center. Participants: Caregivers (N=560) of civilians (n=344) and service member/veterans (n=216) with TBI. Interventions: Not applicable. Main Outcome Measures: PROMIS sleep and fatigue measures administered as both computerized adaptive tests (CATs) and 4-item short forms (SFs). Results: For both samples, floor and ceiling effects for the PROMIS measures were low (<11%), internal consistency was very good (all alphas ≥0.80), and test-retest reliability was acceptable (all r≥0.70 except for the fatigue CAT in the service member/veteran sample, r=0.63). Convergent validity was supported by moderate correlations between the PROMIS and related measures. Discriminant validity was supported by low correlations between PROMIS measures and measures of dissimilar constructs. PROMIS scores indicated significantly worse sleep and fatigue for those caring for someone with high levels versus low levels of impairment. Conclusions: Findings support the reliability and validity of the PROMIS CAT and SF measures of sleep disturbance and fatigue in caregivers of civilians and service members/veterans with TBI. Copyright © 2018. Published by Elsevier Inc.
Junkes, Monica C; Fraiz, Fabian C; Sardenberg, Fernanda; Lee, Jessica Y; Paiva, Saul M; Ferreira, Fernanda M
2015-01-01
The aim of the present study was to translate, perform the cross-cultural adaptation of the Rapid Estimate of Adult Literacy in Dentistry to Brazilian-Portuguese language and test the reliability and validity of this version. After translation and cross-cultural adaptation, interviews were conducted with 258 parents/caregivers of children in treatment at the pediatric dentistry clinics and health units in Curitiba, Brazil. To test the instrument's validity, the scores of Brazilian Rapid Estimate of Adult Literacy in Dentistry (BREALD-30) were compared based on occupation, monthly household income, educational attainment, general literacy, use of dental services and three dental outcomes. The BREALD-30 demonstrated good internal reliability. Cronbach's alpha ranged from 0.88 to 0.89 when words were deleted individually. The analysis of test-retest reliability revealed excellent reproducibility (intraclass correlation coefficient = 0.983 and Kappa coefficient ranging from moderate to nearly perfect). In the bivariate analysis, BREALD-30 scores were significantly correlated with the level of general literacy (rs = 0.593) and income (rs = 0.327) and significantly associated with occupation, educational attainment, use of dental services, self-rated oral health and the respondent's perception regarding his/her child's oral health. However, only the association between the BREALD-30 score and the respondent's perception regarding his/her child's oral health remained significant in the multivariate analysis. The BREALD-30 demonstrated satisfactory psychometric properties and is therefore applicable to adults in Brazil.
Junkes, Monica C.; Fraiz, Fabian C.; Sardenberg, Fernanda; Lee, Jessica Y.; Paiva, Saul M.; Ferreira, Fernanda M.
2015-01-01
Objective: The aim of the present study was to translate, perform the cross-cultural adaptation of the Rapid Estimate of Adult Literacy in Dentistry to Brazilian-Portuguese language and test the reliability and validity of this version. Methods: After translation and cross-cultural adaptation, interviews were conducted with 258 parents/caregivers of children in treatment at the pediatric dentistry clinics and health units in Curitiba, Brazil. To test the instrument's validity, the scores of Brazilian Rapid Estimate of Adult Literacy in Dentistry (BREALD-30) were compared based on occupation, monthly household income, educational attainment, general literacy, use of dental services and three dental outcomes. Results: The BREALD-30 demonstrated good internal reliability. Cronbach’s alpha ranged from 0.88 to 0.89 when words were deleted individually. The analysis of test-retest reliability revealed excellent reproducibility (intraclass correlation coefficient = 0.983 and Kappa coefficient ranging from moderate to nearly perfect). In the bivariate analysis, BREALD-30 scores were significantly correlated with the level of general literacy (rs = 0.593) and income (rs = 0.327) and significantly associated with occupation, educational attainment, use of dental services, self-rated oral health and the respondent’s perception regarding his/her child's oral health. However, only the association between the BREALD-30 score and the respondent’s perception regarding his/her child's oral health remained significant in the multivariate analysis. Conclusion: The BREALD-30 demonstrated satisfactory psychometric properties and is therefore applicable to adults in Brazil. PMID:26158724
Translated Versions of Voice Handicap Index (VHI)-30 across Languages: A Systematic Review
SEIFPANAHI, Sadegh; JALAIE, Shohreh; NIKOO, Mohammad Reza; SOBHANI-RAD, Davood
2015-01-01
Background: This systematic review aimed to compare VHI-30 versions across languages with regard to their validity, reliability, and translation process. Methods: Articles were retrieved systematically from major databases, including Cochrane, Google Scholar, MEDLINE (via PubMed), ScienceDirect, and Web of Science, and from their reference lists, using the keyword Voice Handicap Index, restricted to the title and to publications from 1997 to 2014. Other restrictions (e.g., excluding non-English papers and other versions of the VHI) were applied manually after reading the papers. Three authors appraised the methodology of the papers using the 12-item diagnostic test checklist from the Critical Appraisal Skills Programme (CASP) site. After all screening steps, the papers that met the eligibility criteria (reporting translation, validity, and reliability processes) were included in this review. Results: Twelve non-duplicate articles from different languages remained. All of them reported validity, reliability, and the translation method, which are presented in detail in this review. Conclusion: The preferred translation method in the included papers was Brislin’s classic back-translation model (1970); although the procedure was not performed completely, it was more prominent than other translation procedures. High test-retest reliability and internal consistency, and moderate construct validity, across languages for all 3 VHI-30 domains confirm the applicability of translated VHI-30 versions across languages. PMID:26056664
Validity and Reliability of a New Instrument to Measure Cancer-Related Fatigue in Adolescents
Hinds, Pamela S.; Hockenberry, Marilyn; Tong, Xin; Rai, Shesh N.; Gattuso, Jamie S.; McCarthy, Kathleen; Pui, Ching-Hon; Srivastava, Deo Kumar
2008-01-01
Adolescents undergoing treatment for cancer rate fatigue as their most prevalent and intense cancer- and treatment-related effect. Parents and staff rate it similarly. Despite its reported prevalence, intensity, and distressing effects, cancer-related fatigue in adolescents is not routinely assessed during or after cancer treatment. We contend that the insufficient clinical attention is primarily due to the lack of a reliable and valid self-report instrument with which adolescent cancer-related fatigue can be measured. Our aim was to determine the reliability and construct validity of a new instrument and its ability to measure change in fatigue over time. Initial testing involved 64 adolescents undergoing curative treatment of cancer who completed the Fatigue Scale-Adolescent (FS-A) at two to four key points in treatment in one of four studies. Internal consistency estimates ranged from 0.67 to 0.95. Validity estimates involving the FS-A with the parent version ranged from 0.13 to 0.76; estimates involving the staff version and the Reynolds Depression Scale were 0.27 and 0.87 respectively. Additional validity findings included significant fatigue differences between anemic and non-anemic patients (P = 0.042) and the emergence of four factors in an exploratory factor analysis. Findings further indicate that the FS-A can be used to measure change over time (t = 2.55, P <0.01). In summary, the FS-A has moderate to strong reliability and impressive validity coefficients for a new research instrument. PMID:17629669
Muir-Hunter, Susan W; Graham, Laura; Montero Odasso, Manuel
2015-08-01
To measure test-retest and interrater reliability of the Berg Balance Scale (BBS) in community-dwelling adults with mild to moderate Alzheimer disease (AD). Method: A sample of 15 adults (mean age 80.20 [SD 5.03] years) with AD performed three balance tests: the BBS, timed up-and-go test (TUG), and Functional Reach Test (FRT). Both relative reliability, using the intra-class correlation coefficient (ICC), and absolute reliability, using standard error of measurement (SEM) and minimal detectable change (MDC95) values, were calculated; Bland-Altman plots were constructed to evaluate inter-tester agreement. The test-retest interval was 1 week. Results: For the BBS, relative reliability values were 0.95 (95% CI, 0.85-0.98) for test-retest reliability and 0.72 (95% CI, 0.31-0.91) for interrater reliability; SEM was 6.01 points and MDC95 was 16.66 points; and interrater agreement was 16.62 points. The BBS performed better in test-retest reliability than the TUG and FRT, tests with established reliability in AD. Between 33% and 50% of participants required cueing beyond standardized instructions because they were unable to remember test instructions. Conclusions: The BBS achieved relative reliability values that support its clinical utility, but MDC95 and agreement values indicate the scale has performance limitations in AD. Further research to optimize balance assessment for people with AD is required.
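The abstract above reports SEM and MDC95 without stating the formulas used. A minimal sketch, assuming the conventional definitions SEM = SD × √(1 − ICC) and MDC95 = 1.96 × √2 × SEM, is given below; the function names are hypothetical. Under that assumption, the reported SEM of 6.01 points leads to the reported MDC95 of 16.66 points, which is consistent with (but does not confirm) the authors having used this formula.

```python
import math


def sem_from_icc(sd, icc):
    """Standard error of measurement from a score SD and a reliability ICC
    (conventional formula, assumed here)."""
    return sd * math.sqrt(1.0 - icc)


def mdc95(sem):
    """Minimal detectable change at 95% confidence (conventional formula)."""
    return 1.96 * math.sqrt(2.0) * sem


# SEM value reported in the abstract for the BBS
print(round(mdc95(6.01), 2))  # 16.66
```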
The Reliability and Validity of Big Five Inventory Scores with African American College Students
ERIC Educational Resources Information Center
Worrell, Frank C.; Cross, William E., Jr.
2004-01-01
This article describes a study that examined the reliability and validity of scores on the Big Five Inventory (BFI; O. P. John, E. M. Donahue, & R. L. Kentle, 1991) in a sample of 336 African American college students. Results from the study indicated moderate reliability and structural validity for BFI scores. Additionally, BFI subscales had few…
A comprehensive review of the psychometric properties of the Drug Abuse Screening Test.
Yudko, Errol; Lozhkina, Olga; Fouts, Adriana
2007-03-01
This article reviews the reliability and the validity of the (10-, 20-, and 28-item) Drug Abuse Screening Test (DAST). The reliability and the validity of the adolescent version of the DAST are also reviewed. An extensive literature review was conducted using the Medline and PsycINFO databases from the years 1982 to 2005. All articles that addressed the reliability and the validity of the DAST were examined. Publications in which the DAST was used as a screening tool but had no data on its psychometric properties were not included. Descriptive information about each version of the test, as well as discussion of the empirical literature that has explored measures of the reliability and the validity of the DAST, has been included. The DAST tended to have moderate to high levels of test-retest, interitem, and item-total reliabilities. The DAST also tended to have moderate to high levels of validity, sensitivity, and specificity. In general, all versions of the DAST yield satisfactory measures of reliability and validity for use as clinical or research tools. Furthermore, these tests are easy to administer and have been used in a variety of populations.
ERIC Educational Resources Information Center
Reitz, E.; Dekovic, M.; Meijer, A. M.
2006-01-01
In this longitudinal study we investigated relations between parenting and externalizing and internalizing problem behaviour during early adolescence. First, we examined parenting effects on problem behaviour, including child behaviour as a moderator. Second, we examined child behaviour as predictor of parenting, also including moderator effects.…
Inter-rater reliability of an observation-based ergonomics assessment checklist for office workers.
Pereira, Michelle Jessica; Straker, Leon Melville; Comans, Tracy Anne; Johnston, Venerina
2016-12-01
To establish the inter-rater reliability of an observation-based ergonomics assessment checklist for computer workers. A 37-item (38-item if a laptop was part of the workstation) comprehensive observational ergonomics assessment checklist comparable to government guidelines and up to date with empirical evidence was developed. Two trained practitioners assessed full-time office workers performing their usual computer-based work and evaluated the suitability of workstations used. Practitioners assessed each participant consecutively. The order of assessors was randomised, and the second assessor was blinded to the findings of the first. Unadjusted kappa coefficients between the raters were obtained for the overall checklist and subsections that were formed from question-items relevant to specific workstation equipment. Twenty-seven office workers were recruited. The inter-rater reliability between two trained practitioners achieved moderate to good reliability for all except one checklist component. This checklist has mostly moderate to good reliability between two trained practitioners. Practitioner Summary: This reliable ergonomics assessment checklist for computer workers was designed using accessible government guidelines and supplemented with up-to-date evidence. Employers in Queensland (Australia) can fulfil legislative requirements by using this reliable checklist to identify and subsequently address potential risk factors for work-related injury to provide a safe working environment.
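The abstract above relies on unadjusted kappa coefficients between two raters. As an illustration only, Cohen's kappa for two raters can be sketched as below; the function name and the example ratings are hypothetical and are not taken from the study.

```python
def cohen_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa for two raters rating the same items."""
    n = len(rater_a)
    # Observed proportion of agreement
    p_observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Agreement expected by chance, from each rater's marginal proportions
    categories = set(rater_a) | set(rater_b)
    p_expected = sum(
        (rater_a.count(c) / n) * (rater_b.count(c) / n) for c in categories
    )
    if p_expected == 1.0:
        # Both raters used a single identical category: kappa is undefined
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (p_observed - p_expected) / (1.0 - p_expected)


# Hypothetical checklist item scored 1 (suitable) or 0 (unsuitable)
print(cohen_kappa([1, 1, 0, 0], [1, 0, 0, 0]))  # 0.5
```

In the toy example the raters agree on 3 of 4 workstations (0.75) while chance agreement is 0.5, giving a kappa of 0.5.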
Reliability and validity of current physical examination techniques of the foot and ankle.
Wrobel, James S; Armstrong, David G
2008-01-01
This literature review was undertaken to evaluate the reliability and validity of the orthopedic, neurologic, and vascular examination of the foot and ankle. We searched PubMed, the US National Library of Medicine's database of biomedical citations and abstracts, for relevant publications from 1966 to 2006. We also searched the bibliographies of the retrieved articles. We identified 35 articles to review. For discussion purposes, we used reliability interpretation guidelines proposed by others. For the kappa statistic that calculates reliability for dichotomous (eg, yes or no) measures, reliability was defined as moderate (0.4-0.6), substantial (0.6-0.8), and outstanding (> 0.8). For the intraclass correlation coefficient that calculates reliability for continuous (eg, degrees of motion) measures, reliability was defined as good (> 0.75), moderate (0.5-0.75), and poor (< 0.5). Intraclass correlations, based on the various examinations performed, varied widely. The range was from 0.08 to 0.98, depending on the examination performed. Concurrent and predictive validity ranged from poor to good. Although hundreds of articles exist describing various methods of lower-extremity assessment, few rigorously assess the measurement properties. This information can be used both by the discerning clinician in the art of clinical examination and by the scientist in the measurement properties of reproducibility and validity.
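The interpretation bands quoted in the abstract above can be written as a small lookup. This is a sketch under stated assumptions: the abstract gives no label for kappa values below 0.4 and does not say how shared boundary values (such as 0.6) are assigned, so the handling of those cases below is a choice made for this example, and the function names are hypothetical.

```python
def interpret_kappa(kappa):
    """Label a kappa value using the bands quoted in the abstract."""
    if kappa > 0.8:
        return "outstanding"
    if kappa >= 0.6:  # assumption: the shared boundary 0.6 goes to the upper band
        return "substantial"
    if kappa >= 0.4:
        return "moderate"
    return "not labelled in the abstract"


def interpret_icc(icc):
    """Label an intraclass correlation using the bands quoted in the abstract."""
    if icc > 0.75:
        return "good"
    if icc >= 0.5:
        return "moderate"
    return "poor"


# The extremes of the ICC range reported in the review
print(interpret_icc(0.08), interpret_icc(0.98))  # poor good
```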
Mortsiefer, Achim; Immecke, Janine; Rotthoff, Thomas; Karger, André; Schmelzer, Regine; Raski, Bianca; Schmitten, Jürgen In der; Altiner, Attila; Pentzek, Michael
2014-06-01
To evaluate the summative assessment (OSCE) of a communication training programme for dealing with challenging doctor-patient encounters in the 4th study year. Our OSCE consists of 4 stations (breaking bad news, guilt and shame, aggressive patients, shared decision making), using a 4-item global rating (GR) instrument. We calculated reliability coefficients for different levels, discriminability of single items and interrater reliability. Validity was estimated by gender differences and accordance between GR and a checklist. In a pooled sample of 456 students in 3 OSCEs over 3 terms, total reliability was α=0.64, reliability coefficients for single stations were >0.80, and discriminability in 3 of 4 stations was within the range of 0.4-0.7. Except for one station, interrater reliability was moderate to strong. Reliability on item level was poor and pointed to some problems with the use of the GR. The application of the GR on regular undergraduate medical education shows moderate reliability in need of improvement and some traits of validity. Ongoing development and evaluation is needed with particular regard to the training of the examiners. Our CoMeD-OSCE proved suitable for the summative assessment of communication skills in challenging doctor-patient encounters. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
2014-01-01
Background: Foot disease complications, such as foot ulcers and infection, contribute to considerable morbidity and mortality. These complications are typically precipitated by “high-risk factors”, such as peripheral neuropathy and peripheral arterial disease. High-risk factors are more prevalent in specific “at risk” populations such as diabetes, kidney disease and cardiovascular disease. To the best of the authors’ knowledge a tool capturing multiple high-risk factors and foot disease complications in multiple at risk populations has yet to be tested. This study aimed to develop and test the validity and reliability of a Queensland High Risk Foot Form (QHRFF) tool. Methods: The study was conducted in two phases. Phase one developed a QHRFF using an existing diabetes foot disease tool, literature searches, stakeholder groups and expert panel. Phase two tested the QHRFF for validity and reliability. Four clinicians, representing different levels of expertise, were recruited to test validity and reliability. Three cohorts of patients were recruited; one tested criterion measure reliability (n = 32), another tested criterion validity and inter-rater reliability (n = 43), and another tested intra-rater reliability (n = 19). Validity was determined using sensitivity, specificity and positive predictive values (PPV). Reliability was determined using Kappa, weighted Kappa and intra-class correlation (ICC) statistics. Results: A QHRFF tool containing 46 items across seven domains was developed. Criterion measure reliability of at least moderate categories of agreement (Kappa > 0.4; ICC > 0.75) was seen in 91% (29 of 32) tested items. Criterion validity of at least moderate categories (PPV > 0.7) was seen in 83% (60 of 72) tested items. Inter- and intra-rater reliability of at least moderate categories (Kappa > 0.4; ICC > 0.75) was seen in 88% (84 of 96) and 87% (20 of 23) tested items respectively.
Conclusions: The QHRFF had acceptable validity and reliability across the majority of items; particularly items identifying relevant co-morbidities, high-risk factors and foot disease complications. Recommendations have been made to improve or remove identified weaker items for future QHRFF versions. Overall, the QHRFF possesses suitable practicality, validity and reliability to assess and capture relevant foot disease items across multiple at risk populations. PMID:24468080
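Validity in the abstract above was determined using sensitivity, specificity and positive predictive values. A minimal sketch of those three quantities from a 2×2 table follows; the function name and the counts are hypothetical and do not come from the study.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Sensitivity, specificity and PPV from 2x2 counts.

    tp/fp/fn/tn: true positives, false positives, false negatives and
    true negatives of a tool item judged against a criterion measure.
    """
    return {
        "sensitivity": tp / (tp + fn),  # criterion-positive cases detected
        "specificity": tn / (tn + fp),  # criterion-negative cases cleared
        "ppv": tp / (tp + fp),          # positive results that are correct
    }


# Hypothetical counts for a single form item
print(diagnostic_metrics(tp=8, fp=2, fn=2, tn=8))
```

With these toy counts all three values equal 0.8, which would pass the PPV > 0.7 threshold quoted in the abstract.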
French translation and validation of the "Anterior Knee Pain Scale" (AKPS).
Buckinx, F; Bornheim, S; Remy, G; Van Beveren, J; Reginster, Jy; Bruyère, O; Dardenne, N; Kaux, J F
2017-12-21
To linguistically and cross-culturally translate the Anterior Knee Pain Scale into French and to evaluate the reliability and validity of this translated version of the questionnaire. The translation was performed in six stages, according to international guidelines: (i) two initial translations from English to French; (ii) synthesis of the two translations; (iii) backward translations into the original language; (iv) expert committee to compare the backward translations with the original questionnaire; (v) pre-final version testing; and (vi) expert committee appraisal. To validate the French version of the Anterior Knee Pain Scale, we assessed its validity, reliability and floor/ceiling effects. To do this, volunteer patients with patellofemoral pain from the French-speaking part of Belgium and from France were asked to answer the French version of the Anterior Knee Pain Scale at baseline and after 7 days, as well as the generic SF-36 questionnaire. The Anterior Knee Pain Scale was translated without any major difficulties. A total of 101 subjects aged 34.5 ± 11.4 years (58.4% women) were included in this study. Results indicated excellent test-retest reliability (intra-class correlation coefficient (ICC) = 0.97, 95%CI: 0.96-0.98), high internal consistency (Cronbach's alpha = 0.87), and consistent construct validity: correlations with the SF-36 questionnaire were high for domains related to physical function (r = 0.80), physical role (r = 0.70) and pain (r = 0.64), and low or moderate for domains related to mental health (r = 0.26), vitality (r = 0.32) and social function (r = 0.41). Moreover, no floor/ceiling effects were found. A valid French version of the Anterior Knee Pain Scale is now available and can be used with confidence to better assess the disease burden associated with patellofemoral pain. It was successfully cross-culturally adapted into French.
Implications for rehabilitation: The results on the psychometric properties of the French Anterior Knee Pain Scale are comparable with those of six validated versions, for the Finnish, Turkish, Chinese, Dutch, Thai and Persian populations. The French translated version of the Anterior Knee Pain Scale is a reliable and valid instrument for assessing the functional limitations associated with patellofemoral pain. The test-retest reliability of the French Anterior Knee Pain Scale was excellent, the internal consistency was high and the construct validity was consistent. There were no floor/ceiling effects.
Pourmomeny, Abbas Ali; Mazdak, Hamid
2017-06-01
The purpose of this study was to translate the male lower urinary tract symptoms long form (MLUTS-LF) questionnaire and determine its psychometric properties in Persian-speaking subjects. Assessment instruments are essential for research, for making a diagnosis, and for evaluating treatment outcomes in subjects of either gender with lower urinary tract disorders. The long form of the MLUTS questionnaire is a robust self-report questionnaire that investigates the major aspects of lower urinary tract symptoms and their impact on quality of life. After permission was obtained from the International Consultation on Incontinence Questionnaire website, the research team carried out the forward and backward translation of the MLUTS and assessed its content, face, and construct validity and its reliability in a sample of Iranian patients with MLUTS, along with quality rating and pilot testing. Irritative and obstructive lower urinary tract disorders were categorized as mild, moderate, or severe in the study sample. Twenty-two subjects had urinary incontinence, and most of the participants had benign prostatic hyperplasia (BPH). Cronbach's alpha coefficient was 0.819. The correlation between the MLUTS and the International Prostate Symptom Score (IPSS) was 0.753. The MLUTS questionnaire showed good internal consistency, content validity, and construct validity, as measured by correlation with scores on the IPSS. The Iranian version of the MLUTS questionnaire is a valid and robust instrument that can be used in clinical settings and in research. © 2016 Wiley Periodicals, Inc.
de la Cámara, Miguel Ángel; Higueras-Fresnillo, Sara; Martinez-Gomez, David; Veiga, Oscar L
2018-05-29
The inter-day reliability of the Intelligent Device for Energy Expenditure and Activity (IDEEA) has not been studied to date. The purpose of this study was to examine the inter-day variability and reliability of data collected with the IDEEA on two consecutive days, and to predict the number of days needed to provide a reliable estimate of several movement behaviors (walking and climbing stairs), non-movement behaviors (lying, reclining, sitting), and standing in older adults. The sample included 126 older adults (74 women) who wore the IDEEA for 48 h. Results showed low variability between the two days, and reliability ranged from moderate (ICC = 0.34) to high (ICC = 0.80) for most of the movement and non-movement behaviors analyzed. The Bland-Altman plots showed moderate-to-high agreement between days, and the Spearman-Brown formula estimated that between 1.2 and 9.1 days of monitoring with the IDEEA are needed to achieve ICCs ≥ 0.70 in older adults, for sitting and climbing stairs, respectively.
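The abstract above uses the Spearman-Brown formula to estimate how many monitoring days are needed to reach a target reliability. A minimal sketch of the standard prophecy formula is shown below, assuming the input is a single-day ICC. The abstract does not say whether its reported ICCs are single-day or averaged values, so the example uses an illustrative value and does not attempt to reproduce the reported 1.2 and 9.1 days; the function names are hypothetical.

```python
def projected_reliability(icc_single, n_days):
    """Spearman-Brown: reliability of the mean of n_days measurements."""
    return n_days * icc_single / (1.0 + (n_days - 1.0) * icc_single)


def days_needed(icc_single, icc_target=0.70):
    """Spearman-Brown solved for the number of days that reaches icc_target."""
    return (icc_target / (1.0 - icc_target)) * ((1.0 - icc_single) / icc_single)


# Illustrative single-day ICC (not a value from the study)
k = days_needed(0.5, icc_target=0.70)
print(round(k, 2))                              # 2.33
print(round(projected_reliability(0.5, k), 2))  # 0.7
```

In practice the result would be rounded up to a whole number of monitoring days.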
Camera-tracking gaming control device for evaluation of active wrist flexion and extension.
Shefer Eini, Dalit; Ratzon, Navah Z; Rizzo, Albert A; Yeh, Shih-Ching; Lange, Belinda; Yaffe, Batia; Daich, Alexander; Weiss, Patrice L; Kizony, Rachel
Cross sectional. Measuring wrist range of motion (ROM) is an essential procedure in hand therapy clinics. To test the reliability and validity of a dynamic ROM assessment, the Camera Wrist Tracker (CWT). Wrist flexion and extension ROM of 15 patients with distal radius fractures and 15 matched controls were assessed with the CWT and with a universal goniometer. One-way model intraclass correlation coefficient analysis indicated high test-retest reliability for extension (ICC = 0.92) and moderate reliability for flexion (ICC = 0.49). Standard error for extension was 2.45° and for flexion was 4.07°. Repeated-measures analysis revealed a significant main effect for group; ROM was greater in the control group (F[1, 28] = 47.35; P < .001). The concurrent validity of the CWT was partially supported. The results indicate that the CWT may provide highly reliable scores for dynamic wrist extension ROM, and moderately reliable scores for flexion, in people recovering from a distal radius fracture. N/A. Copyright © 2016 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
Laurent, Heidemarie K.; Leve, Leslie D.; Neiderhiser, Jenae M.; Natsuaki, Misaki N.; Shaw, Daniel S.; Fisher, Philip A.; Marceau, Kristine; Harold, Gordon T.; Reiss, David
2012-01-01
Child hypothalamic-pituitary-adrenal (HPA) activity was investigated as a moderator of parental depressive symptom effects on child behavior in an adoption sample (n = 210 families). Adoptive parents’ depressive symptoms and child internalizing and externalizing were assessed at 18, 27, and 54 months, and child morning and evening HPA activity measured through salivary cortisol at 54 months. Children’s daily cortisol levels and day-to-day variability were tested as moderators of longitudinal associations between parent and child symptoms at within- and between-family levels. Mothers’ symptoms related directly to child internalizing, but child evening cortisol moderated effects of fathers’ symptoms on internalizing, and of both parents’ symptoms on externalizing. Different paths of within-family risk dynamics vs. between-family risk synergy were found for internalizing vs. externalizing outcomes. PMID:23013523
Lohrer, Heinz; Nauck, Tanja
2009-10-30
Achilles tendinopathy is the predominant overuse injury in runners. To further investigate this overload injury in transverse and longitudinal studies, a valid, responsive and reliable outcome measure is needed. Most questionnaires have been developed for English-speaking populations. This is also true for the VISA-A score, so far the only valid, reliable, and disease-specific questionnaire for Achilles tendinopathy. To compare research results internationally, to perform multinational studies, or to exclude bias originating from subpopulations speaking different languages within one country, an equivalent instrument is needed in different languages. The aim of this study was therefore to cross-culturally adapt and validate the VISA-A questionnaire for German-speaking Achilles tendinopathy patients. According to the "guidelines for the process of cross-cultural adaptation of self-report measures", the VISA-A score was cross-culturally adapted into German (VISA-A-G) using six steps: translation, synthesis, back translation, expert committee review, pretesting (n = 77), and appraisal of the adaptation process by an advisory committee determining the adequacy of the cross-cultural adaptation. The resulting VISA-A-G was then subjected to an analysis of reliability, validity, and internal consistency in 30 Achilles tendinopathy patients and 79 asymptomatic people. Concurrent validity was tested against a generic tendon grading system (Percy and Conochie) and against a classification system for the effect of pain on athletic performance (Curwin and Stanish). The advisory committee judged the translation of the VISA-A-G questionnaire to be "acceptable". The VISA-A-G questionnaire showed moderate to excellent test-retest reliability (ICC = 0.60 to 0.97). Concurrent validity showed good coherence when correlated with the grading system of Curwin and Stanish (rho = -0.95) and with the Percy and Conochie grade of severity (rho = 0.95).
Internal consistency (Cronbach's alpha) for the total VISA-A-G scores of the patients was calculated to be 0.737. The VISA-A questionnaire was successfully cross-culturally adapted and validated for use in German-speaking populations. The psychometric properties of the VISA-A-G questionnaire are similar to those of the original English version. It can therefore be recommended as a sufficiently robust tool for future measurement of the clinical severity of Achilles tendinopathy in German-speaking patients.
Lohrer, Heinz; Nauck, Tanja
2009-01-01
Background: Achilles tendinopathy is the predominant overuse injury in runners. To further investigate this overload injury in transverse and longitudinal studies, a valid, responsive, and reliable outcome measure is needed. Most questionnaires have been developed for English-speaking populations. This is also true for the VISA-A score, so far the only valid, reliable, and disease-specific questionnaire for Achilles tendinopathy. To compare research results internationally, to perform multinational studies, or to exclude bias originating from subpopulations speaking different languages within one country, an equivalent instrument is needed in different languages. The aim of this study was therefore to cross-culturally adapt and validate the VISA-A questionnaire for German-speaking Achilles tendinopathy patients. Methods: According to the "guidelines for the process of cross-cultural adaptation of self-report measures", the VISA-A score was cross-culturally adapted into German (VISA-A-G) using six steps: translation, synthesis, back translation, expert committee review, pretesting (n = 77), and appraisal of the adaptation process by an advisory committee determining the adequacy of the cross-cultural adaptation. The resulting VISA-A-G was then subjected to an analysis of reliability, validity, and internal consistency in 30 Achilles tendinopathy patients and 79 asymptomatic people. Concurrent validity was tested against a generic tendon grading system (Percy and Conochie) and against a classification system for the effect of pain on athletic performance (Curwin and Stanish). Results: The advisory committee judged the translation of the VISA-A-G questionnaire to be "acceptable". The VISA-A-G questionnaire showed moderate to excellent test-retest reliability (ICC = 0.60 to 0.97). Concurrent validity showed good coherence when correlated with the grading system of Curwin and Stanish (rho = -0.95) and with the Percy and Conochie grade of severity (rho = 0.95). Internal consistency (Cronbach's alpha) for the total VISA-A-G scores of the patients was calculated to be 0.737. Conclusion: The VISA-A questionnaire was successfully cross-culturally adapted and validated for use in German-speaking populations. The psychometric properties of the VISA-A-G questionnaire are similar to those of the original English version. It can therefore be recommended as a sufficiently robust tool for future measurement of the clinical severity of Achilles tendinopathy in German-speaking patients. PMID:19878572
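Several records in this listing report internal consistency as Cronbach's alpha. As an illustration only, the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores) can be sketched as below. The function name and the sample responses are hypothetical and are not taken from any of the studies; sample variances are assumed throughout.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondent rows (one item score per column).

    Illustrative sketch: uses sample variances (n - 1 denominator) for both
    the individual items and the summed total score.
    """
    if len(responses) < 2:
        raise ValueError("need at least two respondents")
    n_items = len(responses[0])
    if n_items < 2:
        raise ValueError("need at least two items")
    if any(len(row) != n_items for row in responses):
        raise ValueError("all rows must have the same number of items")

    # Variance of each item across respondents.
    item_vars = [variance([row[i] for row in responses]) for i in range(n_items)]
    # Variance of each respondent's total score.
    total_var = variance([sum(row) for row in responses])
    if total_var == 0:
        raise ValueError("total scores have zero variance; alpha is undefined")

    return n_items / (n_items - 1) * (1 - sum(item_vars) / total_var)


# Made-up responses: three respondents, two items.
example = [[1, 2], [2, 1], [3, 3]]
alpha = cronbach_alpha(example)
```

Two perfectly correlated items give an alpha of 1; the made-up example above gives a lower value because the two items disagree for the first two respondents.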
López-Pina, José Antonio; Sánchez-Meca, Julio; López-López, José Antonio; Marín-Martínez, Fulgencio; Núñez-Núñez, Rosa Ma; Rosa-Alcázar, Ana I; Gómez-Conesa, Antonia; Ferrer-Requena, Josefa
2015-01-01
The Yale-Brown Obsessive-Compulsive Scale for children and adolescents (CY-BOCS) is a frequently applied test to assess obsessive-compulsive symptoms. We conducted a reliability generalization meta-analysis on the CY-BOCS to estimate the average reliability, search for reliability moderators, and propose a predictive model that researchers and clinicians can use to estimate the expected reliability of the CY-BOCS scores. A total of 47 studies reporting a reliability coefficient with the data at hand were included in the meta-analysis. The results showed good reliability and a large variability associated with the standard deviation of total scores and sample size.
Randall Simpson, Janis; Gumbley, Jillian; Whyte, Kylie; Lac, Jane; Morra, Crystal; Rysdale, Lee; Turfryer, Mary; McGibbon, Kim; Beyers, Joanne; Keller, Heather
2015-09-01
Nutrition is vital for optimal growth and development of young children. Nutrition risk screening can facilitate early intervention when followed by nutritional assessment and treatment. NutriSTEP (Nutrition Screening Tool for Every Preschooler) is a valid and reliable nutrition risk screening questionnaire for preschoolers (aged 3-5 years). A need was identified for a similar questionnaire for toddlers (aged 18-35 months). The purpose was to develop a reliable and valid Toddler NutriSTEP. Toddler NutriSTEP was developed in 4 phases. Content and face validity were determined with a literature review, parent focus groups (n = 6; 48 participants), and experts (n = 13) (phase A). A draft questionnaire was refined with key intercept interviews of 107 parents/caregivers (phase B). Test-retest reliability (phase C), based on intra-class correlations (ICC), Kappa (κ) statistics, and Wilcoxon tests was assessed with 133 parents/caregivers. Criterion validity (phase D) was assessed using Receiver Operating Characteristic (ROC) curves by comparing scores on the Toddler NutriSTEP to a comprehensive nutritional assessment of 200 toddlers with a registered dietitian (RD). The Toddler NutriSTEP was reliable between 2 administrations (ICC = 0.951, F = 20.53, p < 0.001); most questions had moderate (κ ≥ 0.6) or excellent (κ ≥ 0.8) agreement. Scores on the RD nutrition risk rating and the Toddler NutriSTEP were correlated (r = 0.67, p < 0.000). The area under the ROC curve for moderate and high RD risk ratings were 84.6% and 82.7%, respectively. Cut-points of ≥21 (sensitivity 86%; specificity 61%) (moderate risk) and ≥26 (sensitivity 95%; specificity 63%) (high risk) were determined. The Toddler NutriSTEP questionnaire is both reliable and valid for screening for nutritional risk in toddlers.
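The cut-points above are reported with a sensitivity and specificity against a reference rating. A minimal sketch of how such a pair is computed for one cut-point follows. The function name, the scores, and the reference labels are invented for illustration and are not data from the study; a score at or above the cut-point is assumed to count as a positive screen, matching the "≥" notation in the abstract.

```python
def sensitivity_specificity(scores, at_risk, cut_point):
    """Return (sensitivity, specificity) of the rule `score >= cut_point`.

    `at_risk` holds the reference-standard classification (True = at risk).
    """
    tp = fp = tn = fn = 0
    for score, truth in zip(scores, at_risk):
        flagged = score >= cut_point
        if flagged and truth:
            tp += 1
        elif flagged and not truth:
            fp += 1
        elif not flagged and truth:
            fn += 1
        else:
            tn += 1
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one at-risk and one not-at-risk case")
    return tp / (tp + fn), tn / (tn + fp)


# Invented screening scores and reference ratings, for illustration only.
scores = [10, 22, 25, 30, 18, 21]
at_risk = [False, True, False, True, False, True]
sens, spec = sensitivity_specificity(scores, at_risk, cut_point=21)
```

Repeating the calculation over every candidate cut-point and plotting sensitivity against 1 - specificity gives the ROC curve from which cut-points like those above are usually chosen.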
Reliability of Examination Findings in Suspected Community-Acquired Pneumonia.
Florin, Todd A; Ambroggio, Lilliam; Brokamp, Cole; Rattan, Mantosh S; Crotty, Eric J; Kachelmeyer, Andrea; Ruddy, Richard M; Shah, Samir S
2017-09-01
The authors of national guidelines emphasize the use of history and examination findings to diagnose community-acquired pneumonia (CAP) in outpatient children. Little is known about the interrater reliability of the physical examination in children with suspected CAP. This was a prospective cohort study of children with suspected CAP presenting to a pediatric emergency department from July 2013 to May 2016. Children aged 3 months to 18 years with lower respiratory signs or symptoms who received a chest radiograph were included. We excluded children hospitalized ≤14 days before the study visit and those with a chronic medical condition or aspiration. Two clinicians performed independent examinations and completed identical forms reporting examination findings. Interrater reliability for each finding was reported by using Fleiss' kappa (κ) for categorical variables and intraclass correlation coefficient (ICC) for continuous variables. No examination finding had substantial agreement (κ/ICC > 0.8). Two findings (retractions, wheezing) had moderate to substantial agreement (κ/ICC = 0.6-0.8). Nine findings (abdominal pain, pleuritic pain, nasal flaring, skin color, overall impression, cool extremities, tachypnea, respiratory rate, and crackles/rales) had fair to moderate agreement (κ/ICC = 0.4-0.6). Eight findings (capillary refill time, cough, rhonchi, head bobbing, behavior, grunting, general appearance, and decreased breath sounds) had poor to fair reliability (κ/ICC = 0-0.4). Only 3 examination findings had acceptable agreement, with the lower 95% confidence limit >0.4: wheezing, retractions, and respiratory rate. In this study, we found fair to moderate reliability of many findings used to diagnose CAP. Only 3 findings had acceptable levels of reliability. These findings must be considered in the clinical management and research of pediatric CAP. Copyright © 2017 by the American Academy of Pediatrics.
Béliard, Sophie; Coudert, Mathieu; Valéro, René; Charbonnier, Laurie; Duchêne, Emilie; Allaert, François André; Bruckert, Éric
2012-12-01
The purpose of our study was to develop and validate a short food frequency questionnaire (FFQ) to assess the nutritional lifestyles of hypercholesterolemic patients consulting in daily practice. The questionnaire explores 11 nutrient categories. One hundred and thirty-one patients were recruited for the construct validity and 58 patients for the external validity at La Pitié Hospital, Paris. The reference method used was the diet history. To measure internal consistency and to test sensitivity to change on a large scale, the questionnaire was used in an observational study conducted in Spain in 1048 patients with moderate hypercholesterolemia. Psychometric analyses included construct validity, internal consistency, test-retest reliability, external validity, and sensitivity to change. Validation of the questionnaire indicated good internal consistency (Cronbach's alpha = 0.69) and test-retest reliability (intraclass correlation coefficient = 0.89). The correlation between the scores of the FFQ and those of the diet history was significant, with a Pearson correlation coefficient of 0.3 (P=0.029). The comparison between the rankings of the patients showed an agreement of 72% with a kappa of 0.48 [0.10; 0.69]. Sensitivity to change was good, with scores improving one and four months after nutritional advice: 28.2% of patients ranked in group 1 at inclusion versus 61.3% (P<0.0001) at one month and 75.2% (P<0.0001) at four months. In conclusion, we developed and validated a food questionnaire for hypercholesterolemic patients, which can be used as a therapeutic education tool in daily practice or in clinical research. Copyright © 2012. Published by Elsevier Masson SAS.
Kamamoto, Cristhine de Souza Leão; Hassun, Karime Marques; Bagatin, Ediléia; Tomimori, Jane
2014-01-01
BACKGROUND: Many studies about the psychosocial impact of acne have been reported in the international medical literature, describing quality of life as a relevant clinical outcome. It is well known that the patient's perception of the disease may differ from the physician's evaluation. Therefore, it is important to use validated instruments that turn the patient's subjective opinion into objective information. OBJECTIVES: To translate into Brazilian Portuguese and culturally adapt a quality of life questionnaire, the Acne-Specific Quality of Life Questionnaire (Acne-QoL), and to evaluate its reliability and validity. METHODS: The following measurement properties were assessed: 1) validity: comparison between severity and Acne-QoL domain scores, correlations between acne duration and Acne-QoL domain scores, and correlation between Acne-QoL domain scores and SF-36 components; 2) internal consistency: Cronbach's α coefficient; 3) test-retest reproducibility: intraclass correlation coefficient and Wilcoxon test. RESULTS: Eighty subjects with a mean age of 20.5 ± 4.8 years presenting mild (33.8%), moderate (36.2%) and severe (30%) facial acne were enrolled. Acne-QoL domain scores were similar among the different acne severity groups except for the role-social domain. Subjects with shorter acne duration presented significantly higher scores. Acne-QoL domains showed significant correlations, both among themselves and with the SF-36 role-social and mental health components. Internal consistency (0.925-0.952) and test-retest reproducibility (0.768-0.836) were considered acceptable. CONCLUSIONS: The Brazilian-Portuguese version of the Acne-QoL is a reliable, valid, and satisfactory outcome measure for use in facial acne studies. PMID:24626652
Dupeyron, Arnaud; Lanhers, Charlotte; Bastide, Sophie; Alonso, Sandrine; Toulotte, Matthias; Jourdan, Claire; Coudeyre, Emmanuel
2017-01-01
According to the fear avoidance model, beliefs and thoughts can modify the outcome of patients with low back pain (LBP). The Back Belief Questionnaire (BBQ), a 14-item scale, assesses these consequences of low back pain. The aim of this study was to test the psychometric properties of the French version of the BBQ. The BBQ was translated using the forward-backward translation process. Across three repeated evaluation time points (D1, D7 and D30), various aspects of validity were analysed: acceptability, quality of items, unidimensionality, internal consistency, temporal stability (between D1 and D7), responsiveness (between D7 and D30), and construct validity by comparison with other validated scales. One hundred and thirty-one patients were enrolled and 128 were analyzed. The acceptability and the quality of the items were excellent. The scale was unidimensional and reliable (internal consistency: Cronbach's α = 0.8). The responsiveness was moderate but in line with other scores. The BBQ was, as expected, convergent with day-to-day activities and fear avoidance (FABQ and Tampa), disability (Quebec and Dallas scores), or anxiety and depression (HAD), and was not correlated with pain. The best correlations were found with Tampa and FABQ. The temporal stability (test-retest reliability) was poor. However, similar changes were observed in a conceptually close score (FABQ), which indicates that clinical status may not have been stable and suggests that the BBQ is sensitive to early changes. The BBQ showed good psychometric properties to assess false beliefs and related fear in French or English LBP populations and can be used either for evaluation in international trials or as part of self-care training.
Cacchio, Angelo; Necozione, Stefano; MacDermid, Joy C; Rompe, Jan Dirk; Maffulli, Nicola; di Orio, Ferdinando; Santilli, Valter; Paoloni, Marco
2012-08-01
The Patient-Rated Tennis Elbow Evaluation (PRTEE) questionnaire is a tool designed for self-assessment of forearm pain and disability in patients with lateral elbow tendinopathy (LET). However, an Italian version of this questionnaire has not been available. The aims of this study were: (1) to translate and cross-culturally adapt the PRTEE questionnaire into Italian and (2) to evaluate its measurement properties. This was a longitudinal, observational measurement study. The PRTEE questionnaire was cross-culturally adapted to Italian according to established guidelines. Ninety-five individuals (41 women, 54 men) with unilateral, imaging-confirmed, chronic LET were selected consecutively to assess the measurement properties of the PRTEE questionnaire. Internal consistency, test-retest reliability, construct validity, and responsiveness were estimated. The Italian version of the PRTEE displayed a high degree of internal consistency, with a Cronbach alpha of .95. The test-retest reliability was high for both the short term and the medium term, with intraclass correlation coefficients (2,1) of .95 and .93, respectively. The PRTEE exhibited a strong correlation (r=.77-.91, P<.0001) with the Disabilities of the Arm, Shoulder and Hand (DASH) at baseline and a moderate correlation (r=.58-.74, P<.0001) at discharge. The responsiveness was higher for the PRTEE than for the DASH. A methodological limitation of the study is that, because of the small sample size, a factor analysis was not performed to assess convergent validity. The Italian version of the PRTEE questionnaire is internally consistent, demonstrates expected correlations with other measures, and is more responsive than the DASH in Italian patients with chronic LET.
Jurk, Sarah; Kuitunen-Paul, Sören; Kroemer, Nils B; Artiges, Eric; Banaschewski, Tobias; Bokde, Arun L W; Büchel, Christian; Conrod, Patricia; Fauth-Bühler, Mira; Flor, Herta; Frouin, Vincent; Gallinat, Jürgen; Garavan, Hugh; Heinz, Andreas; Mann, Karl F; Nees, Frauke; Paus, Tomáš; Pausova, Zdenka; Poustka, Luise; Rietschel, Marcella; Schumann, Gunter; Struve, Maren; Smolka, Michael N
2015-11-01
The aim of the present longitudinal study was the psychometric evaluation of the Substance Use Risk Profile Scale (SURPS). We analyzed data from N = 2,022 adolescents aged 13 to 15 at baseline assessment and 2 years later (mean interval 2.11 years). Missing data at follow-up were imputed (N = 522). Psychometric properties of the SURPS were analyzed using confirmatory factor analysis. We examined structural as well as convergent validity with other personality measurements and drinking motives, and predictive validity for substance use at follow-up. The hypothesized 4-factor structure (i.e., anxiety sensitivity, hopelessness, impulsivity [IMP], and sensation seeking [SS]) based on all 23 items resulted in acceptable fit to empirical data, acceptable internal consistencies, low to moderate test-retest reliability coefficients, as well as evidence for factorial and convergent validity. The proposed factor structure was stable for both males and females and, to a lesser degree, across languages. However, only the SS and the IMP subscales of the SURPS predicted substance use outcomes at 16 years of age. The SURPS is unique in its specific assessment of traits related to substance use disorders as well as the resulting shortened administration time. Test-retest reliability was low to moderate and comparable to other personality scales. However, its relation to future substance use was limited to the SS and IMP subscales, which may be due to the relatively low-risk substance use pattern in the present sample. Copyright © 2015 by the Research Society on Alcoholism.
Shukla, K; Shahane, S; D’Souza, W
2017-01-01
Background: The large working population in the health sector faces a stressful work life, limited autonomy at work, and declining work contentment, which calls for an emphasis on evaluating and monitoring their satisfaction with work-related quality of life (WRQoL). This study evaluates the WRQoL of hospital employees and validates the bilingual (English and Marathi) version of the WRQoL scale. Methods: The study was conducted during March–April 2014 on employees of a corporate hospital in Pune, India, after ethical approval and informed consent from employees. The bilingual WRQoL scale was tested for reliability and validity, and WRQoL scores are reported. Results: A total of 132 hospital employees (mean age 31 [±8] years, 55% males) who participated in the study reported overall moderate WRQoL scores. The scale showed high internal consistency (Cronbach's alpha = 0.82, P < 0.0001) and moderate to high validity. WRQoL did not vary significantly across marital status, family size, and gender. The “Stress at work” score of the WRQoL increased with the age of employees. Employees with more work experience, those employed in higher positions, and those working in clinical and diagnostic departments reported a higher WRQoL. Conclusion: The WRQoL scale is a reliable and valid instrument. Better WRQoL in employees placed in higher organizational positions indicates a need for focused measures to enhance the WRQoL of employees at lower hierarchical levels, especially in the control at work and home life interface domains. WRQoL needs regular monitoring for employees in lower positions and aging employees. PMID:27779152
Moen, Vegard Pihl; Drageset, Jorunn; Eide, Geir Egil; Klokkerud, Mari; Gjesdal, Sturla
2017-02-01
The World Health Organization Disability Assessment Schedule (WHODAS) 2.0 is a generic instrument to assess disability covering six domains. The purpose of this study was to investigate the potential of the instrument for monitoring disability in specialized somatic rehabilitation by testing reliability, construct validity and responsiveness of WHODAS 2.0, Norwegian version, among patients with various health conditions. For taxonomy, terminology and definitions, the Consensus-based Standards for the Selection of Health Measurement Instruments were followed. Reproducibility was investigated by the intra-class correlation coefficient (ICC) in a randomly selected sample. Internal consistency was assessed by Cronbach's alpha. Construct validity was evaluated by correlations between WHODAS 2.0 and the Medical Outcomes Study 36-item Short Form, and fit of the hypothesized structure using confirmatory factor analysis (CFA). Responsiveness was evaluated in another randomly selected sample by testing a priori formulated hypotheses. Nine hundred seventy patients were included in the study. Reproducibility and responsiveness were evaluated in 53 and 104 patients, respectively. The ICC for the WHODAS 2.0 domains ranged from 0.63 to 0.84 and was 0.87 for total score. Cronbach's alpha for domains ranged from 0.75 to 0.94 and was 0.93 for total score. For construct validity, 6 of 12 expected correlations were confirmed and CFA did not achieve satisfactory fit indices. For responsiveness, 3 of 8 hypotheses were confirmed. The Norwegian version of WHODAS 2.0 showed moderate to satisfactory reliability and moderate validity in rehabilitation patients. However, the present study indicated possible limitations in terms of responsiveness.
McLaren, Suzanne
2015-01-01
Internalized homophobia is a risk factor for depression among gay men and lesbians. The aim of the study was to test whether the internalized homophobia-depression relation was moderated by gender (stronger among gay men compared with lesbians), age (stronger among younger compared with older gay men and lesbians), and place of residence (stronger among gay men and lesbians who live in rural areas compared with those who live in urban areas). An Australian sample of 311 self-identified gay men and 570 self-identified lesbians, aged 18 to 70 years, completed the Internalized Homophobia Scale and the Centre for Epidemiological Studies Depression Scale. Results indicated that age and gender did not moderate the internalized homophobia-depressive symptoms relation. Place of residence was a significant moderator for gay men but not lesbians. In contrast to the hypothesis, the internalized homophobia-depression relation was significant only among gay men who resided in urban areas. Those who work with gay men should be particularly aware of the significant relationship between internalized homophobia and depressive symptoms among gay men who reside in urban areas.
Segura-Jiménez, Víctor; Alvarez-Gallardo, Inmaculada C; Romero-Zurita, Alejandro; Camiletti-Moirón, Daniel; Munguía-Izquierdo, Diego; Carbonell-Baeza, Ana; Ruiz, Jonatan R
2014-10-01
To compare the levels of physical activity (PA) assessed with questionnaires (Leisure Time Physical Activity Instrument [LTPAI], Physical Activity at Home and Work Instrument [PAHWI]) and accelerometry in patients with fibromyalgia; and to analyze the test-retest reliability of these questionnaires. Cross-sectional study. Local fibromyalgia association. Participants (N=99; 5 men) with fibromyalgia with a mean age of 50.2±9.5 years. Not applicable. Participants carried an accelerometer for 1 week and completed the LTPAI and PAHWI twice (separated by a 1-wk interval). The LTPAI and PAHWI were summed to obtain overall values of PA. Time spent in total, moderate, and moderate-vigorous PA was higher (P<.01) when assessed by the LTPAI and PAHWI compared with accelerometry. The Bland-Altman method showed an absence of agreement between the LTPAI and PAHWI and the accelerometer for moderate, moderate-vigorous, and total PA. The test-retest reliability for the workplace subscale and total score of the PAHWI showed high and moderate intraclass correlation coefficients (ICCs), respectively, but also manifested high SE of measurements (up to 179min/d). The LTPAI showed low to moderate ICCs and high SE of measurements (up to 79min/d). For the LTPAI and PAHWI, the ICCs for total activity across the population were low to moderate, and the Bland-Altman method confirmed this lack of agreement. The LTPAI and PAHWI and the accelerometer differ greatly when assessing PA. Furthermore, the LTPAI and PAHWI did not show good levels of test-retest reliability. Therefore, the self-administered LTPAI and PAHWI show questionable usefulness to assess PA in populations with fibromyalgia. Copyright © 2014 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
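The Bland-Altman method mentioned above compares two measurement methods through the mean of their paired differences (the bias) and the 95% limits of agreement, conventionally bias ± 1.96 standard deviations of the differences. A minimal sketch follows, assuming paired measurements in the same units. The function name and the numbers are hypothetical and do not come from the study.

```python
from statistics import mean, stdev


def bland_altman_limits(method_a, method_b):
    """Return (bias, lower_limit, upper_limit) for paired measurements.

    Limits of agreement are bias +/- 1.96 * SD of the paired differences,
    using the sample standard deviation.
    """
    if len(method_a) != len(method_b):
        raise ValueError("both methods need the same number of measurements")
    if len(method_a) < 2:
        raise ValueError("need at least two paired measurements")
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(diffs)
    sd = stdev(diffs)
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


# Hypothetical minutes/day from a questionnaire and from a device.
questionnaire = [10, 20, 30]
device = [8, 18, 34]
bias, lower, upper = bland_altman_limits(questionnaire, device)
```

Wide limits relative to the quantity being measured indicate that the two methods cannot be used interchangeably, even when the bias itself is close to zero.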
Severity of anxiety and work-related outcomes of patients with anxiety disorders.
Erickson, Steven R; Guthrie, Sally; Vanetten-Lee, Michelle; Himle, Joseph; Hoffman, Jody; Santos, Susana F; Janeck, Amy S; Zivin, Kara; Abelson, James L
2009-01-01
This study examined associations between anxiety and work-related outcomes in an anxiety disorders clinic population, examining both pretreatment links and the impact of anxiety change over 12 weeks of treatment on work outcomes. Four validated instruments were used to also allow examination of their psychometric properties, with the goal of improving measurement of work-related quality of life in this population. Newly enrolled adult patients seeking treatment in a university-based anxiety clinic were administered four work performance measures: Work Limitations Questionnaire (WLQ), Work Productivity and Activity Impairment Questionnaire (WPAI), Endicott Work Productivity Scale (EWPS), and Functional Status Questionnaire Work Performance Scale (WPS). Anxiety severity was determined using the Beck Anxiety Inventory (BAI). The Clinical Global Impressions, Global Improvement Scale (CGI-I) was completed by patients to evaluate symptom change at a 12-week follow-up. Two severity groups (minimal/mild vs. moderate/severe, based on baseline BAI score) were compared to each other on work measures. Eighty-one patients provided complete baseline data. Anxiety severity groups did not differ in job type, time on job, job satisfaction, or job choice. Patients with greater anxiety generally showed lower work performance on all instruments. Job advancement was impaired for the moderate/severe group. The multi-item performance scales demonstrated better validity and internal consistency. The WLQ and the WPAI detected change with symptom improvement. Level of work performance was generally associated with severity of anxiety. Of the instruments tested, the WLQ and the WPAI questionnaire demonstrated acceptable validity and internal reliability.
Axon, Robert N; Penney, Fletcher T; Kyle, Thomas R; Zapka, Jane; Marsden, Justin; Zhao, Yumin; Mauldin, Patrick D; Moran, William P
2014-06-01
Discharge summaries are an important component of hospital care transitions typically completed by interns in teaching hospitals. However, these documents are often not completed in a timely fashion or do not include pertinent details of hospitalization. This report outlines the development and impact of a curriculum intervention to improve the quality of discharge summaries by interns and residents in Internal Medicine. A previous study demonstrated that a discharge summary curriculum featuring individualized feedback was associated with improved summary quality, but few subsequent studies have described implementation of similar curricula. No information exists on the utility of other strategies such as team-based feedback or academic detailing. Study participants were 96 Internal Medicine intern and resident physicians at an academic medical center-based training program. A comprehensive evidence-based discharge summary quality improvement program was developed and implemented that featured a discharge summary template to facilitate summary preparation, individual feedback, team-based feedback, academic detailing and an objective discharge summary evaluation instrument. The discharge summary evaluation instrument had moderate interrater reliability (κ = 0.72). Discharge summary scores improved from mean score of 70% to 82% (P = 0.05). Interns and residents participating in this program also reported increased confidence in producing and critiquing summaries. A comprehensive discharge summary curriculum can be feasibly implemented within the context of a residency program. Team-based feedback and academic detailing may serve to reinforce individual feedback and extend program reach.
The reliability of a quality appraisal tool for studies of diagnostic reliability (QAREL).
Lucas, Nicholas; Macaskill, Petra; Irwig, Les; Moran, Robert; Rickards, Luke; Turner, Robin; Bogduk, Nikolai
2013-09-09
The aim of this project was to investigate the reliability of a new 11-item quality appraisal tool for studies of diagnostic reliability (QAREL). The tool was tested on studies reporting the reliability of any physical examination procedure. The reliability of physical examination is a challenging area to study given the complex testing procedures, the range of tests, and lack of procedural standardisation. Three reviewers used QAREL to independently rate 29 articles, comprising 30 studies, published during 2007. The articles were identified from a search of relevant databases using the following string: "Reproducibility of results (MeSH) OR reliability (t.w.) AND Physical examination (MeSH) OR physical examination (t.w.)." A total of 415 articles were retrieved and screened for inclusion. The reviewers undertook an independent trial assessment prior to data collection, followed by a general discussion about how to score each item. At no time did the reviewers discuss individual papers. Reliability was assessed for each item using multi-rater kappa (κ). Multi-rater reliability estimates ranged from κ = 0.27 to 0.92 across all items. Six items were recorded with good reliability (κ > 0.60), three with moderate reliability (κ = 0.41 - 0.60), and two with fair reliability (κ = 0.21 - 0.40). Raters found it difficult to agree about the spectrum of patients included in a study (Item 1) and the correct application and interpretation of the test (Item 10). In this study, we found that QAREL was a reliable assessment tool for studies of diagnostic reliability when raters agreed upon criteria for the interpretation of each item. Nine out of 11 items had good or moderate reliability, and two items achieved fair reliability. The heterogeneity in the tests included in this study may have resulted in an underestimation of the reliability of these two items. We discuss these and other factors that could affect our results and make recommendations for the use of QAREL.
The neutron texture diffractometer at the China Advanced Research Reactor
NASA Astrophysics Data System (ADS)
Li, Mei-Juan; Liu, Xiao-Long; Liu, Yun-Tao; Tian, Geng-Fang; Gao, Jian-Bo; Yu, Zhou-Xiang; Li, Yu-Qing; Wu, Li-Qi; Yang, Lin-Feng; Sun, Kai; Wang, Hong-Li; Santisteban, J. R.; Chen, Dong-Feng
2016-03-01
The first neutron texture diffractometer in China has been built at the China Advanced Research Reactor, due to strong demand for texture measurement with neutrons from the domestic user community. This neutron texture diffractometer has high neutron intensity, moderate resolution and is mainly applied to study texture in commonly used industrial materials and engineering components. In this paper, the design and characteristics of this instrument are described. The results for calibration with neutrons and quantitative texture analysis of zirconium alloy plate are presented. The comparison of texture measurements with the results obtained in HIPPO at LANSCE and Kowari at ANSTO illustrates the reliability of the texture diffractometer. Supported by National Nature Science Foundation of China (11105231, 11205248, 51327902) and International Atomic Energy Agency-TC program (CPR0012)
Terslev, Lene; Gutierrez, Marwin; Schmidt, Wolfgang A; Keen, Helen I; Filippucci, Emilio; Kane, David; Thiele, Ralf; Kaeley, Gurjit; Balint, Peter; Mandl, Peter; Delle Sedie, Andrea; Hammer, Hilde Berner; Christensen, Robin; Möller, Ingrid; Pineda, Carlos; Kissin, Eugene; Bruyn, George A; Iagnocco, Annamaria; Naredo, Esperanza; D'Agostino, Maria Antonietta
2015-11-01
To summarize the work performed by the Outcome Measures in Rheumatology (OMERACT) Ultrasound (US) Working Group on the validation of US as a potential outcome measure in gout. Based on the lack of definitions, highlighted in a recent literature review on US as an outcome tool in gout, a series of iterative exercises were carried out to obtain consensus-based definitions on US elementary components in gout using a Delphi exercise and subsequently testing these definitions in static images and in patients with proven gout. Cohen's κ was used to test agreement, and values of 0-0.20 were considered poor, 0.20-0.40 fair, 0.40-0.60 moderate, 0.60-0.80 good, and 0.80-1 excellent. With an agreement of > 80%, consensus-based definitions were obtained for the 4 elementary lesions highlighted in the literature review: tophi, aggregates, erosions, and double contour (DC). In static images interobserver reliability ranged from moderate to almost perfect, and similar results were found for the intrareader reliability. In patients the intraobserver agreement was good for all lesions except DC (moderate). The interobserver agreement was poor for aggregates and DC but moderate for the other components. These first steps in evaluating the validity of US as an outcome measure for gout show that the reliability of the definitions ranged from moderate to excellent in static images and somewhat lower in patients, indicating that a standardized scanning technique may be needed, before testing the responsiveness of those definitions in a composite US score.
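As an illustration of the agreement statistic and the interpretation bands quoted above, the sketch below computes Cohen's κ for two raters as (observed agreement - chance agreement) / (1 - chance agreement) and maps the result to the bands in the abstract. The function names and ratings are hypothetical. Because the quoted bands share their boundaries, the sketch assumes each upper bound is inclusive, and it assumes negative values are labelled "poor".

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters scoring the same items categorically."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("raters must score the same non-empty set of items")
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    # Chance agreement from each rater's marginal category frequencies.
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1 - expected)


def interpret_kappa(kappa):
    """Label a kappa value using the bands quoted in the abstract.

    Assumptions: upper bounds are inclusive; negative values count as 'poor'.
    """
    if not -1.0 <= kappa <= 1.0:
        raise ValueError("kappa must lie between -1 and 1")
    bands = [(0.20, "poor"), (0.40, "fair"), (0.60, "moderate"),
             (0.80, "good"), (1.00, "excellent")]
    for upper, label in bands:
        if kappa <= upper:
            return label


# Hypothetical presence/absence ratings of one lesion by two readers.
reader_1 = [1, 1, 0, 0]
reader_2 = [1, 0, 0, 0]
kappa = cohen_kappa(reader_1, reader_2)
label = interpret_kappa(kappa)
```

Other records in this listing use different cut-offs for the same labels, so the band table should be swapped for whichever convention a given study states.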
Reliable, Low-Cost, Low-Weight, Non-Hermetic Coating for MCM Applications
NASA Technical Reports Server (NTRS)
Jones, Eric W.; Licari, James J.
2000-01-01
Through an Air Force Research Laboratory sponsored STM program, reliable, low-cost, low-weight, non-hermetic coatings for multi-chip-module (MCM) applications were developed. Using the combination of Sandia Laboratory ATC-01 test chips, AvanTeco's moisture sensor chips (MSCs), and silicon slices, we have shown that organic and organic/inorganic overcoatings are reliable and practical non-hermetic moisture and oxidation barriers. The use of the MSC and unpassivated ATC-01 test chips provided rapid test results and comparison of moisture barrier quality of the overcoatings. The organic coatings studied were Parylene and Cyclotene. The inorganic coatings were Al2O3 and SiO2. The choice of coating(s) depends on the environment to which the device(s) will be exposed. We have defined four classes of environments: Class I (moderate temperature/moderate humidity), Class II (high temperature/moderate humidity), Class III (moderate temperature/high humidity), and Class IV (high temperature/high humidity). By subjecting the components to adhesion, FTIR, temperature-humidity (TH), pressure cooker (PCT), and electrical tests, we have determined that it is possible to reduce failures by 50-70% for organic/inorganic coated components compared to organic coated components. All materials and equipment used are readily available commercially or are standard in most semiconductor fabrication lines. It is estimated that production cost for the developed technology would range from $1-10/module, compared to $20-200 for hermetically sealed packages.
Li, Tianzhu; Ma, Lian; Mao, Chi
2016-03-01
The purpose of this study was to investigate the validity and reliability of the translated Chinese version of the Speech Handicap Index (SHI) questionnaire for Chinese-speaking patients with oral and oropharyngeal cancer. The original English version of the SHI was translated into Chinese. Forty-two consecutive patients with oral and oropharyngeal cancer were included in the study. All subjects were asked to complete the Chinese version of the SHI and the University of Washington Quality of Life Questionnaire (UWQOL V.04). Fifteen randomly selected patients were retested on both questionnaires 2 weeks later. The internal consistency, test-retest reliability, construct validity, and group validity of the Chinese version of the SHI were tested using Cronbach α, Spearman correlation coefficient (r), and Mann-Whitney U tests. Descriptive and bivariate statistics were computed, and the significance level for the P value was set at 0.05. The Cronbach α values for the total SHI, the speech domain, and the psychosocial domain were 0.96, 0.90, and 0.92, respectively. The test-retest reliability scores for the total SHI, the speech domain, the psychosocial domain, and the overall question were 0.94, 0.97, 0.90, and 0.83, respectively. To measure construct validity, Spearman correlation coefficients between different items of the SHI and the UWQOL were all >0.4, which signified a moderate to significant correlation. There were significant differences between patient groups when divided by age, clinical stage, educational level, radiotherapy, and reconstruction, on all or on parts of the various SHI domains. The Chinese version of the SHI is a valid and reliable tool for the speech assessment of patients with oral and oropharyngeal cancer. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
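The internal-consistency figure reported in the abstract above, Cronbach's α, can be sketched in a few lines of Python. This is a minimal illustration of the standard formula, α = k/(k−1) · (1 − Σ item variances / variance of total scores), and not the authors' analysis code. The function name and the sample responses are hypothetical, and the use of sample variance (n − 1 denominator) is an assumption.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    Assumes every respondent answered all k items and uses the sample variance.
    """
    if len(scores) < 2:
        raise ValueError("need at least two respondents")
    k = len(scores[0])
    if k < 2 or any(len(row) != k for row in scores):
        raise ValueError("need at least two items, answered by every respondent")
    # Variance of each item across respondents (columns of the score matrix).
    item_variance_sum = sum(variance(column) for column in zip(*scores))
    # Variance of each respondent's total score.
    total_variance = variance([sum(row) for row in scores])
    if total_variance == 0:
        raise ValueError("total scores do not vary; alpha is undefined")
    return (k / (k - 1)) * (1 - item_variance_sum / total_variance)


# Hypothetical responses: three respondents, two perfectly consistent items.
responses = [[1, 1], [2, 2], [3, 3]]
print(cronbach_alpha(responses))
```

Values close to 1, such as the 0.90 to 0.96 reported above, indicate that the items vary together strongly across respondents.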