coefficient test-retest reliability: Topics by Science.gov

Sample records for coefficient test-retest reliability

A reliability generalization meta-analysis of coefficient alpha and test-retest coefficient for the aging males' symptoms (AMS) scale.

PubMed

Lee, Chin-Pang; Chiu, Yu-Wen; Chu, Chun-Lin; Chen, Yu; Jiang, Kun-Hao; Chen, Jiun-Liang; Chen, Ching-Yen

2016-12-01

The aging males' symptoms (AMS) scale is an instrument used to determine the health-related quality of life in adult and elderly men. The purpose of this study was to synthesize internal consistency (Cronbach's alpha) and test-retest reliability for the AMS scale and its three subscales. Of the 123 studies reviewed, 12 provided alpha coefficients which were then used in the meta-analyses of internal consistency. Seven of the 12 included studies provided test-retest coefficients, and these were used in the meta-analyses of test-retest reliability. The AMS scale had excellent internal consistency [α = 0.89 (95% CI 0.88-0.90)]; the mean alpha estimates across the AMS subscales ranged from 0.79 to 0.82. The AMS scale also had good test-retest reliability [r = 0.85 (95% CI 0.82-0.88]; the test-retest reliability coefficients of the AMS subscales ranged from 0.76 to 0.83. There was significant heterogeneity among the included studies. The AMS scale and the three subscales had fairly good internal consistency and test-retest reliability. Future psychometric studies of the AMS scale should report important characteristics of the participants, details of item scores, and test-retest reliability.
Test-retest reliability of the Capute scales for neurodevelopmental screening of a high risk sample: Impact of test-retest interval and degree of neonatal risk.

PubMed

McCurdy, M; Bellows, A; Deng, D; Leppert, M; Mahone, E; Pritchard, A

2015-01-01

Reliable and valid screening and assessment tools are necessary to identify children at risk for neurodevelopmental disabilities who may require additional services. This study evaluated the test-retest reliability of the Capute Scales in a high-risk sample, hypothesizing adequate reliability across 6- and 12-month intervals. Capute Scales scores (N = 66) were collected via retrospective chart review from a NICU follow-up clinic within a large urban medical center spanning three age-ranges: 12-18, 19-24, and 25-36 months. On average, participants were classified as very low birth weight and premature. Reliability of the Capute Scales was evaluated with intraclass correlation coefficients across length of test-retest interval, age at testing, and degree of neonatal complications. The Capute Scales demonstrated high reliability, regardless of length of test-retest interval (ranging from 6 to 14 months) or age of participant, for all index scores, including overall Developmental Quotient (DQ), language-based skill index (CLAMS) and nonverbal reasoning index (CAT). Linear regressions revealed that greater neonatal risk was related to poorer test-retest reliability; however, reliability coefficients remained strong. The Capute Scales afford clinicians a reliable and valid means of screening and assessing for neurodevelopmental delay within high-risk infant populations.
MEASURING SPORT-SPECIFIC PHYSICAL ABILITIES IN MALE GYMNASTS: THE MEN'S GYMNASTICS FUNCTIONAL MEASUREMENT TOOL.

PubMed

Sleeper, Mark D; Kenyon, Lisa K; Elliott, James M; Cheng, M Samuel

2016-12-01

Despite the availability of various field-tests for many competitive sports, a reliable and valid test specifically developed for use in men's gymnastics has not yet been developed. The Men's Gymnastics Functional Measurement Tool (MGFMT) was designed to assess sport-specific physical abilities in male competitive gymnasts. The purpose of this study was to develop the MGFMT by establishing a scoring system for individual test items and to initiate the process of establishing test-retest reliability and construct validity. A total of 83 competitive male gymnasts ages 7-18 underwent testing using the MGFMT. Thirty of these subjects underwent re-testing one week later in order to assess test-retest reliability. Construct validity was assessed using a simple regression analysis between total MGFMT scores and the gymnasts' USA-Gymnastics competitive level to calculate the coefficient of determination (r 2 ). Test-retest reliability was analyzed using Model 1 Intraclass correlation coefficients (ICC). Statistical significance was set at the p<0.05 level. The relationship between total MGFMT scores and subjects' current USA-Gymnastics competitive level was found to be good (r 2 = 0.63). Reliability testing of the MGFMT composite test score showed excellent test-retest reliability over a one-week period (ICC = 0.97). Test-retest reliability of the individual component tests ranged from good to excellent (ICC = 0.75-0.97). The results of this study provide initial support for the construct validity and test-retest reliability of the MGFMT. Level 3.
MEASURING SPORT-SPECIFIC PHYSICAL ABILITIES IN MALE GYMNASTS: THE MEN'S GYMNASTICS FUNCTIONAL MEASUREMENT TOOL

PubMed Central

Kenyon, Lisa K.; Elliott, James M; Cheng, M. Samuel

2016-01-01

Purpose/Background Despite the availability of various field-tests for many competitive sports, a reliable and valid test specifically developed for use in men's gymnastics has not yet been developed. The Men's Gymnastics Functional Measurement Tool (MGFMT) was designed to assess sport-specific physical abilities in male competitive gymnasts. The purpose of this study was to develop the MGFMT by establishing a scoring system for individual test items and to initiate the process of establishing test-retest reliability and construct validity. Methods A total of 83 competitive male gymnasts ages 7-18 underwent testing using the MGFMT. Thirty of these subjects underwent re-testing one week later in order to assess test-retest reliability. Construct validity was assessed using a simple regression analysis between total MGFMT scores and the gymnasts’ USA-Gymnastics competitive level to calculate the coefficient of determination (r2). Test-retest reliability was analyzed using Model 1 Intraclass correlation coefficients (ICC). Statistical significance was set at the p<0.05 level. Results The relationship between total MGFMT scores and subjects’ current USA-Gymnastics competitive level was found to be good (r2 = 0.63). Reliability testing of the MGFMT composite test score showed excellent test-retest reliability over a one-week period (ICC = 0.97). Test-retest reliability of the individual component tests ranged from good to excellent (ICC = 0.75-0.97). Conclusions The results of this study provide initial support for the construct validity and test-retest reliability of the MGFMT. Level of Evidence Level 3 PMID:27999723
Test-retest and inter- and intrareliability of the quality of the upper-extremity skills test in preschool-age children with cerebral palsy.

PubMed

Haga, Nienke; van der Heijden-Maessen, Hélène C; van Hoorn, Jessika F; Boonstra, Anne M; Hadders-Algra, Mijna

2007-12-01

To investigate the test-retest, inter-, and intraobserver reliability of the Quality of Upper Extremity Skills Test (QUEST) in young children with cerebral palsy (CP). For test-retest reliability, a test-retest design was used; for the intra- and interobserver reliability, the videotaped test was scored on 2 occasions by 1 observer and by various observers. Groups of preschool-age children in 2 general rehabilitation centers. Twenty-one children with CP (12 boys, 9 girls) aged 2 to 4.5 years (mean, 39 mo). Not applicable. Spearman correlation coefficient. The data indicated that test-retest reliability was strong (rho range, .85-.94). Intraobserver agreement (rho range, .63-.95) and agreement between various observers (rho range, .72-.90) were moderate to strong. Test-retest and inter- and intraobserver reliability of the QUEST in preschool-age children with CP is good.
[The appraisal of reliability and validity of subjective workload assessment technique and NASA-task load index].

PubMed

Xiao, Yuan-mei; Wang, Zhi-ming; Wang, Mian-zhen; Lan, Ya-jia

2005-06-01

To test the reliability and validity of two mental workload assessment scales, i.e. subjective workload assessment technique (SWAT) and NASA task load index (NASA-TLX). One thousand two hundred and sixty-eight mental workers were sampled from various kinds of occupations, such as scientific research, education, administration and medicine, etc, with randomized cluster sampling. The re-test reliability, split-half reliability, Cronbach's alpha coefficient and correlation coefficients between item score and total score were adopted to test the reliability. The test of validity included structure validity. The re-test reliability coefficients of these two scales and their items were ranged from 0.516 to 0.753 (P < 0.01), indicating the two scales had good re-test reliability; the split-half reliability of SWAT was 0.645, and its Cronbach's alpha coefficient was more than 0.80, all the correlation coefficients between its items score and total score were more than 0.70; as for NASA-TLX, both the split-half reliability and Cronbach's alpha coefficient were more than 0.80, the correlation coefficients between its items score and total score were all more than 0.60 (P < 0.01) except the item of performance. Both scales had good inner consistency. The Pearson correlation coefficient between the two scales was 0.492 (P < 0.01), implying the results of the two scales had good consistency. Factor analysis showed that the two scales had good structure validity. Both SWAT and NASA-TLX have good reliability and validity and may be used as a valid tool to assess mental workload in China after being revised properly.
Development and reliability testing of the Worksite and Energy Balance Survey.

PubMed

Hoehner, Christine M; Budd, Elizabeth L; Marx, Christine M; Dodson, Elizabeth A; Brownson, Ross C

2013-01-01

Worksites represent important venues for health promotion. Development of psychometrically sound measures of worksite environments and policy supports for physical activity and healthy eating are needed for use in public health research and practice. Assess the test-retest reliability of the Worksite and Energy Balance Survey (WEBS), a self-report instrument for assessing perceptions of worksite supports for physical activity and healthy eating. The WEBS included items adapted from existing surveys or new items on the basis of a review of the literature and expert review. Cognitive interviews among 12 individuals were used to test the clarity of items and further refine the instrument. A targeted random-digit-dial telephone survey was administered on 2 occasions to assess test-retest reliability (mean days between time periods = 8; minimum = 5; maximum = 14). Five Missouri census tracts that varied by racial-ethnic composition and walkability. Respondents included 104 employed adults (67% white, 64% women, mean age = 48.6 years). Sixty-three percent were employed at worksites with less than 100 employees, approximately one-third supervised other people, and the majority worked a regular daytime shift (75%). Test-retest reliability was assessed using Spearman correlations for continuous variables, Cohen's κ statistics for nonordinal categorical variables, and 1-way random intraclass correlation coefficients for ordinal categorical variables. Test-retest coefficients ranged from 0.41 to 0.97, with 80% of items having reliability coefficients of more than 0.6. Items that assessed participation in or use of worksite programs/facilities tended to have lower reliability. Reliability of some items varied by gender, obesity status, and worksite size. Test-retest reliability and internal consistency for the 5 scales ranged from 0.84 to 0.94 and 0.63 to 0.84, respectively. The WEBS items and scales exhibited sound test-retest reliability and may be useful for research and surveillance. Further evaluation is needed to document the validity of the WEBS and associations with energy balance outcomes.
Test-Retest Reliability of the Salutogenic Wellness Promotion Scale (SWPS)

ERIC Educational Resources Information Center

Anderson, L. M.; Moore, J. B.; Hayden, B. M.; Becker, C. M.

2014-01-01

Objective: This study examined the temporal stability (i.e. test-retest reliability) of the Salutogenic Wellness Promotion Scale (SWPS) using intraclass correlation coefficients (ICC). Current intraclass results were also compared to previously published interclass correlations to support the use of the intraclass method for test-retest…
[The reliability of a questionnaire regarding Colombian children's physical activity].

PubMed

Herazo-Beltrán, Aliz Y; Domínguez-Anaya, Regina

2012-10-01

Reporting the Physical Activity Questionnaire for school children's (PAQ-C) test-retest reliability and internal consistency. This was a descriptive study of 100 school-aged children aged 9 to 11 years old attending a school in Cartagena, Colombia. The sample was randomly selected. The PAQ-C was given twice, one week apart, after the informed consent forms had been signing by the children's parents and school officials. Cronbach's alpha coefficient of reliability was used for assessing internal consistency and an intra-class correlation coefficient for test-retest reliability SPSS (version 17.0) was used for statistical analysis. The questionnaire scored 0.73 internal consistencies during the first measurement and 0.78 on the second; intra-class correlation coefficient was 0.60. There were differences between boys and girls regarding both measurements. The PAQ-C had acceptable internal consistency and test-retest reliability, thereby making it useful for measuring children's self-reported physical activity and a valuable tool for population studies in Colombia.
The Comprehensive Snack Parenting Questionnaire (CSPQ): Development and Test-Retest Reliability.

PubMed

Gevers, Dorus W M; Kremers, Stef P J; de Vries, Nanne K; van Assema, Patricia

2018-04-26

The narrow focus of existing food parenting instruments led us to develop a food parenting practices instrument measuring the full range of food practices constructs with a focus on snacking behavior. We present the development of the questionnaire and our research on the test-retest reliability. The developed Comprehensive Snack Parenting Questionnaire (CSPQ) covers 21 constructs. Test-retest reliability was assessed by calculating intra class correlation coefficients and percentage agreement after two administrations of the CSPQ among a sample of 66 Dutch parents. Test-retest reliability analysis revealed acceptable intra class correlation coefficients (≥0.41) or agreement scores (≥0.60) for all items. These results, together with earlier work, suggest sufficient psychometric characteristics. The comprehensive, but brief CSPQ opens up chances for highly essential but unstudied research questions to understand and predict children’s snack intake. Example applications include studying the interactional nature of food parenting practices or interactions of food parenting with general parenting or child characteristics.
Reliability of the Swedish version of the Exercise Self-Efficacy Scale (S-ESES): a test-retest study in adults with neurological disease.

PubMed

Ahlström, Isabell; Hellström, Karin; Emtner, Margareta; Anens, Elisabeth

2015-03-01

To examine the test-retest reliability of the Swedish translated version of the Exercise Self-Efficacy Scale (S-ESES) in people with neurological disease and to examine internal consistency. Test-retest study. A total of 30 adults with neurological diseases including: Parkinson's disease; Multiple Sclerosis; Cervical Dystonia; and Charcot-Marie-Tooth disease. The S-ESES was sent twice by surface mail. Completion interval mean was 16 days apart. Weighted kappa, intraclass correlation coefficient 2,1 [ICC (2,1)], standard error of measurement (SEM), also expressed as a percentage value (SEM%), and Cronbach's alpha were calculated. The relative reliability of the test-retest results showed substantial agreement measured using weighted kappa (MD = 0.62) and a very high-reliability ICC (2,1) (0.92). Absolute reliability measured using SEM was 5.3 and SEM% was 20.7. Excellent internal consistency was shown, with an alpha coefficient of 0.91 (test 1) and 0.93 (test 2). The S-ESES is recommended for use in research and in clinical work for people with neurological diseases. The low-absolute reliability, however, indicates a limited ability to measure changes on an individual level.
Developing an oropharyngeal cancer (OPC) knowledge and behaviors survey.

PubMed

Dodd, Virginia J; Riley Iii, Joseph L; Logan, Henrietta L

2012-09-01

To use the community participation research model to (1) develop a survey assessing knowledge about mouth and throat cancer and (2) field test and establish test-retest reliability with newly developed instrument. Cognitive interviews with primarily rural African American adults to assess their perception and interpretation of survey items. Test-retest reliability was established with a racially diverse rural population. Test-retest reliabilities ranged from .79 to .40 for screening awareness and .74 to .19 for knowledge. Coefficients increased for composite scores. Community participation methodology provided a culturally appropriate survey instrument that demonstrated acceptable levels of reliability.
Test-retest reliability of the safe driving behavior measure for community-dwelling elderly drivers.

PubMed

Song, Chiang-Soon; Lee, Joo-Hyun; Han, Sang-Woo

2016-06-01

[Purpose] The Safe Driving Behavior Measure (SDBM) is a self-report measurement tools that assesses the safe-driving behaviors of the elderly. The purpose of this study was to evaluate the test-retest reliability of the SDBM among community-dwelling elderly drivers. [Subjects and Methods] A total of sixty-one community-dwelling elderly were enrolled to investigate the reliability of the SDBM. The SDBM was assessed in two sessions that were conducted three days apart in a quiet and well-organized assessment room. That test-retest reliability of overall scores and three domain scores of the SDBM were statistically evaluated using intraclass correlation coefficients [ICC (2.1)]. Pearson correlation coefficients were used to quantify bivariate associations among the three domains of the SDBM. [Results] The SDBM demonstrated excellent rest-retest reliability for community-dwelling elderly drivers. The Cronbach alpha coefficients of the three domains of person-vehicle (0.979), person-environment (0.944), and person-vehicle-environment (0.971) of the SDBM indicate high internal consistency. [Conclusion] The results of this study suggest that the SDBM is a reliable measure for evaluating the safe- driving of automobiles by community-dwelling elderly, and is adequate for detecting changes in scores in clinical settings.
Test-retest reliability of jump execution variables using mechanography: a comparison of jump protocols.

PubMed

Fitzgerald, John S; Johnson, LuAnn; Tomkinson, Grant; Stein, Jesse; Roemmich, James N

2018-05-01

Mechanography during the vertical jump may enhance screening and determining mechanistic causes underlying physical performance changes. Utility of jump mechanography for evaluation is limited by scant test-retest reliability data on force-time variables. This study examined the test-retest reliability of eight jump execution variables assessed from mechanography. Thirty-two women (mean±SD: age 20.8 ± 1.3 yr) and 16 men (age 22.1 ± 1.9 yr) attended a familiarization session and two testing sessions, all one week apart. Participants performed two variations of the squat jump with squat depth self-selected and controlled using a goniometer to 80º knee flexion. Test-retest reliability was quantified as the systematic error (using effect size between jumps), random error (using coefficients of variation), and test-retest correlations (using intra-class correlation coefficients). Overall, jump execution variables demonstrated acceptable reliability, evidenced by small systematic errors (mean±95%CI: 0.2 ± 0.07), moderate random errors (mean±95%CI: 17.8 ± 3.7%), and very strong test-retest correlations (range: 0.73-0.97). Differences in random errors between controlled and self-selected protocols were negligible (mean±95%CI: 1.3 ± 2.3%). Jump execution variables demonstrated acceptable reliability, with no meaningful differences between the controlled and self-selected jump protocols. To simplify testing, a self-selected jump protocol can be used to assess force-time variables with negligible impact on measurement error.
Test-retest and interrater reliability of the functional lower extremity evaluation.

PubMed

Haitz, Karyn; Shultz, Rebecca; Hodgins, Melissa; Matheson, Gordon O

2014-12-01

Repeated-measures clinical measurement reliability study. To establish the reliability and face validity of the Functional Lower Extremity Evaluation (FLEE). The FLEE is a 45-minute battery of 8 standardized functional performance tests that measures 3 components of lower extremity function: control, power, and endurance. The reliability and normative values for the FLEE in healthy athletes are unknown. A face validity survey for the FLEE was sent to sports medicine personnel to evaluate the level of importance and frequency of clinical usage of each test included in the FLEE. The FLEE was then administered and rated for 40 uninjured athletes. To assess test-retest reliability, each athlete was tested twice, 1 week apart, by the same rater. To assess interrater reliability, 3 raters scored each athlete during 1 of the testing sessions. Intraclass correlation coefficients were used to assess the test-retest and interrater reliability of each of the FLEE tests. In the face validity survey, the FLEE tests were rated as highly important by 58% to 71% of respondents but frequently used by only 26% to 45% of respondents. Interrater reliability intraclass correlation coefficients ranged from 0.83 to 1.00, and test-retest reliability ranged from 0.71 to 0.95. The FLEE tests are considered clinically important for assessing lower extremity function by sports medicine personnel but are underused. The FLEE also is a reliable assessment tool. Future studies are required to determine if use of the FLEE to make return-to-play decisions may reduce reinjury rates.
Reliability and validity of the revised Gibson Test of Cognitive Skills, a computer-based test battery for assessing cognition across the lifespan.

PubMed

Moore, Amy Lawson; Miller, Terissa M

2018-01-01

The purpose of the current study is to evaluate the validity and reliability of the revised Gibson Test of Cognitive Skills, a computer-based battery of tests measuring short-term memory, long-term memory, processing speed, logic and reasoning, visual processing, as well as auditory processing and word attack skills. This study included 2,737 participants aged 5-85 years. A series of studies was conducted to examine the validity and reliability using the test performance of the entire norming group and several subgroups. The evaluation of the technical properties of the test battery included content validation by subject matter experts, item analysis and coefficient alpha, test-retest reliability, split-half reliability, and analysis of concurrent validity with the Woodcock Johnson III Tests of Cognitive Abilities and Tests of Achievement. Results indicated strong sources of evidence of validity and reliability for the test, including internal consistency reliability coefficients ranging from 0.87 to 0.98, test-retest reliability coefficients ranging from 0.69 to 0.91, split-half reliability coefficients ranging from 0.87 to 0.91, and concurrent validity coefficients ranging from 0.53 to 0.93. The Gibson Test of Cognitive Skills-2 is a reliable and valid tool for assessing cognition in the general population across the lifespan.
Test-retest reliability of a standardized psychiatric interview (DIS/CIDI).

PubMed

Semler, G; Wittchen, H U; Joschke, K; Zaudig, M; von Geiso, T; Kaiser, S; von Cranach, M; Pfister, H

1987-01-01

The reliability of DSM-III diagnoses using an expanded version of the Diagnostic Interview Schedule (DIS), called the Composite International Diagnostic Interview (CIDI), was evaluated by examining 60 psychiatric inpatients on a test-retest basis. Acceptable agreement coefficients of (kappa) 0.5 or above were found for all but two disorders: dysthymic disorder and generalized anxiety disorder. The subclassification of DSM-III affective disorders also revealed some discrepancies between the test and the retest interviews. When compared with results from earlier versions of the DIS, diagnostic reliability was found to have improved for the DSM-III anxiety disorders in particular. These improvements can possibly be attributed to some changes in the wording of the respective items of this section. Several reasons for lowered test-retest reliability are discussed.
Reliability of the ecSatter Inventory as a tool to measure eating competence.

PubMed

Stotts, Jodi L; Lohse, Barbara

2007-01-01

To examine the reliability of the ecSatter Inventory (ecSI), a measure of eating competence. Self-report questionnaires were administered in person or by mail. Retesting occurred 2 to 6 weeks after completion of the first questionnaire. Both administrations of the questionnaire were completed by 259 participants who were mostly food secure, white females with some college education; mean age was 26.9 +/- 10.4 years. Test-retest reliability and internal consistency. Spearman's rank correlation coefficients to estimate test-retest reliability and Cronbach alpha coefficients to estimate internal consistency. Spearman's rank correlation coefficient for ecSI total score was 0.68; subscale coefficients were 0.70 for eating attitudes, 0.70 for contextual skills, 0.65 for food acceptance, and 0.52 for internal regulation. Cronbach alpha coefficient for ecSI total score was 0.77. Subscale alphas coefficients were 0.80 for eating attitudes, 0.69 for contextual skills, 0.68 for food acceptance, and 0.66 for internal regulation. This study provides psychometric evidence about the reliability of ecSI as a measure of eating competence in this sample. Although some ecSI items may require revision, results suggest that the instrument may be used to evaluate nutrition education designed to improve eating competence.
The Reliability of Pharyngeal High Resolution Manometry with Impedance for Derivation of Measures of Swallowing Function in Healthy Volunteers

PubMed Central

Omari, Taher I.; Savilampi, Johanna; Kokkinn, Karmen; Schar, Mistyka; Lamvik, Kristin; Doeltgen, Sebastian; Cock, Charles

2016-01-01

Purpose. We evaluated the intra- and interrater agreement and test-retest reliability of analyst derivation of swallow function variables based on repeated high resolution manometry with impedance measurements. Methods. Five subjects swallowed 10 × 10 mL saline on two occasions one week apart producing a database of 100 swallows. Swallows were repeat-analysed by six observers using software. Swallow variables were indicative of contractility, intrabolus pressure, and flow timing. Results. The average intraclass correlation coefficients (ICC) for intra- and interrater comparisons of all variable means showed substantial to excellent agreement (intrarater ICC 0.85–1.00; mean interrater ICC 0.77–1.00). Test-retest results were less reliable. ICC for test-retest comparisons ranged from slight to excellent depending on the class of variable. Contractility variables differed most in terms of test-retest reliability. Amongst contractility variables, UES basal pressure showed excellent test-retest agreement (mean ICC 0.94), measures of UES postrelaxation contractile pressure showed moderate to substantial test-retest agreement (mean Interrater ICC 0.47–0.67), and test-retest agreement of pharyngeal contractile pressure ranged from slight to substantial (mean Interrater ICC 0.15–0.61). Conclusions. Test-retest reliability of HRIM measures depends on the class of variable. Measures of bolus distension pressure and flow timing appear to be more test-retest reliable than measures of contractility. PMID:27190520
Test-retest reliability of the scale of participation in organized activities among adolescents in the Czech Republic and Slovakia.

PubMed

Bosakova, Lucia; Kolarcik, Peter; Bobakova, Daniela; Sulcova, Martina; Van Dijk, Jitse P; Reijneveld, Sijmen A; Geckova, Andrea Madarasova

2016-04-01

Participation in organized activities is related with a range of positive outcomes, but the way such participation is measured has not been scrutinized. Test-retest reliability as an important indicator of a scale's reliability has been assessed rarely and for "The scale of participation in organized activities" lacks completely. This test-retest study is based on the Health Behaviour in School-aged Children study and is consistent with its methodology. We obtained data from 353 Czech (51.9 % boys) and 227 Slovak (52.9 % boys) primary school pupils, grades five and nine, who participated in this study in 2013. We used Cohen's kappa statistic and single measures of the intraclass correlation coefficient to estimate the test-retest reliability of all selected items in the sample, stratified by gender, age and country. We mostly observed a large correlation between the test and retest in all of the examined variables (κ ranged from 0.46 to 0.68). Test-retest reliability of the sum score of individual items showed substantial agreement (ICC = 0.64). The scale of participation in organized activities has an acceptable level of agreement, indicating good reliability.

Determining the Appropriateness of the "What If" Situations Test (WIST) with Turkish Pre-Schoolers.

PubMed

Citak Tunc, Gulseren; Gorak, Gulay; Ozyazicioglu, Nurcan; Ak, Bedriye; Isil, Ozlem; Vural, Pinar

2018-04-01

Measurement instruments are needed to assess the child's sexual abuse prevention program. The purpose of the study was to determine the reliability and validity of the WIST (What If Situations Test) for Turkish culture. Participants were children of the 3-6 age group attending pre-school education institutions and the sample size was identified by means of a power analysis. Seventy children were identified as the sample with 0.85 power and 0.05 type I error according to the power analysis. Language validity, content validity, internal validity coefficient (Cronbach alpha coefficient), and test-retest analyses were conducted in terms of validity and reliability in the scope of efforts for adaptation to Turkish culture. Firstly, Kendall W = 0.83 was the score for the expert opinions concerning the content validity of the language validity scale. It was found that the Cronbach alpha coefficients were between 0.68 and 0.90 for the scale sub-dimensions of appropriate and inappropriate recognition, saying, doing, telling, and reporting. The test-retest reliability of the scale was found to be r = 0.89 and the test-retest reliabilities for the sub-dimensions (appropriate recognition, inappropriate recognition, say skills, do skills, tell skills, and reporting skills) were between r = 0.48 and r = 0.92. The test-retest reliability for the Personal Safety Questionnaire (PSQ), as having complimentary items to the WIST, was found to be r = 0.82. The reliability and validity analysis of the 'What If' Situations Test (WIST), used to evaluate pre-schoolers' skills regarding self-protection against sexual abuse, showed that the Test's adaptation to Turkish culture was reliable and valid.
Statistical Considerations in Choosing a Test Reliability Coefficient. ACT Research Report Series, 2012 (10)

ERIC Educational Resources Information Center

Woodruff, David; Wu, Yi-Fang

2012-01-01

The purpose of this paper is to illustrate alpha's robustness and usefulness, using actual and simulated educational test data. The sampling properties of alpha are compared with the sampling properties of several other reliability coefficients: Guttman's lambda[subscript 2], lambda[subscript 4], and lambda[subscript 6]; test-retest reliability;…
Reliability of instruments in a cooperative, multisite study: employment intervention demonstration program.

PubMed

Salyers, M P; McHugo, G J; Cook, J A; Razzano, L A; Drake, R E; Mueser, K T

2001-09-01

Reliability of well-known instruments was examined in 202 people with severe mental illness participating in a multisite vocational study. We examined interrater reliability of the Positive and Negative Syndrome Scale (PANSS) and the internal consistency and test-retest reliability of the PANSS, the Rosenberg Self-Esteem Scale, the Medical Outcomes Study Short Form-36 (SF-36), and the Quality of Life Interview. Most scales had good levels of reliability, with intraclass correlation coefficients (ICCs) and coefficient alphas above .70. However, the SF-36 scales were generally less stable over time, particularly Social Functioning (ICC = .55). Test-retest reliability was lower among less educated respondents and among ethnic minorities. We recommend close monitoring of psychometric issues in future multisite studies.
Reliability of laboratory measurement of human food intake.

PubMed

Laessle, R; Geiermann, L

2012-02-01

The universal eating monitor (UEM) of Kissileff for laboratory measurement of food intake was modified and used with a newly developed special software to compute cumulative intake data. To explore the measurement precision of the UEM an investigation of test-retest-reliability of food intake parameters was conducted. The intake characteristics of 125 males and females were measured repeatedly in the laboratory with a measurement interval of 1 week. Pudding of preferred flavour served as test meal. Test-retest-reliability of intake characteristics ranged from .49 (change of eating rate) to .89 (initial eating rate). All test-retest correlations were highly significant. Sex, BMI and eating habits according to TFEQ-factors had no significant effects on reliability of intake characteristics. The test-retest-reliability of the laboratory intake measures is as good as those of personality questionnaires, where it should be better than .80. Reliability coefficients are valid independent of sex, BMI or trait characteristics of eating behaviour. Copyright © 2011 Elsevier Ltd. All rights reserved.
Reliability of Autism-Tics, AD/HD, and other Comorbidities (A-TAC) inventory in a test-retest design.

PubMed

Larson, Tomas; Kerekes, Nóra; Selinus, Eva Norén; Lichtenstein, Paul; Gumpert, Clara Hellner; Anckarsäter, Henrik; Nilsson, Thomas; Lundström, Sebastian

2014-02-01

The Autism-Tics, AD/HD, and other Comorbidities (A-TAC) inventory is used in epidemiological research to assess neurodevelopmental problems and coexisting conditions. Although the A-TAC has been applied in various populations, data on retest reliability are limited. The objective of the present study was to present additional reliability data. The A-TAC was administered by lay assessors and was completed on two occasions by parents of 400 individual twins, with an average interval of 70 days between test sessions. Intra- and inter-rater reliability were analysed with intraclass correlations and Cohen's kappa. A-TAC showed excellent test-retest intraclass correlations for both autism spectrum disorder and attention deficit hyperactivity disorder (each at .84). Most modules in the A-TAC had intra- and inter-rater reliability intraclass correlation coefficients of > or = .60. Cohen's kappa indi- cated acceptable reliability. The current study provides statistical evidence that the A-TAC yields good test-retest reliability in a population-based cohort of children.
Development and evaluation of the McKnight Risk Factor Survey for assessing potential risk and protective factors for disordered eating in preadolescent and adolescent girls.

PubMed

Shisslak, C M; Renger, R; Sharpe, T; Crago, M; McKnight, K M; Gray, N; Bryson, S; Estes, L S; Parnaby, O G; Killen, J; Taylor, C B

1999-03-01

To describe the development, test-retest reliability, internal consistency, and convergent validity of the McKnight Risk Factor Survey-III (MRFS-III). The MRFS-III was designed to assess a number of potential risk and protective factors for the development of disordered eating in preadolescent and adolescent girls. Several versions of the MRFS were pilot tested before the MRFS-III was administered to a sample of 651 4th through 12th- grade girls to establish its psychometric properties. Most of the test-retest reliability coefficients of individual items on the MRFS-III were r > .40. Alpha coefficients for each risk and protective factor domain on the MRFS-III were also computed. The majority of these coefficients were r > .60. High convergent validity coefficients were obtained for specific items on the MRFS-III and measures of self-esteem (Rosenberg Self-Esteem Scale) and weight concerns (Weight Concerns Scale). The test-retest reliability, internal consistency, and convergent validity of the MRFS-III suggest that it is a useful new instrument to assess potential risk and protective factors for the development of disordered eating in preadolescent and adolescent girls.
A Pilot Study of the Snap & Sniff Threshold Test.

PubMed

Jiang, Rong-San; Liang, Kai-Li

2018-05-01

The Snap & Sniff ® Threshold Test (S&S) has been recently developed to determine the olfactory threshold. The aim of this study was to further evaluate the validity and test-retest reliability of the S&S. The olfactory thresholds of 120 participants were determined using both the Smell Threshold Test (STT) and the S&S. The participants included 30 normosmic volunteers and 90 patients (60 hyposmic, 30 anosmic). The normosmic participants were retested using the STT and S&S at an intertest interval of at least 1 day. The mean olfactory threshold determined with the S&S was -6.76 for the normosmic participants, -3.79 for the hyposmic patients, and -2 for the anosmic patients. The olfactory thresholds were significantly different across the 3 groups ( P < .001). Snap & Sniff-based and STT-based olfactory thresholds were correlated weakly in the normosmic group (correlation coefficient = 0.162, P = .391) but more strongly correlated in the patient groups (hyposmic: correlation coefficient = 0.376, P = .003; anosmic: correlation coefficient = 1.0). The test-retest correlation for the S&S-based olfactory thresholds was 0.384 ( P = .036). Based on validity and test-retest reliability, we concluded that the S&S is a proper test for olfactory thresholds.
Test-retest reliability of the multifocal photopic negative response.

PubMed

Van Alstine, Anthony W; Viswanathan, Suresh

2017-02-01

To assess the test-retest reliability of the multifocal photopic negative response (mfPhNR) of normal human subjects. Multifocal electroretinograms were recorded from one eye of 61 healthy adult subjects on two separate days using a Visual Evoked Response Imaging System software version 4.3 (EDI, San Mateo, California). The visual stimulus delivered on a 75-Hz monitor consisted of seven equal-sized hexagons each subtending 12° of visual angle. The m-step exponent was 9, and the m-sequence was slowed to include at least 30 blank frames after each flash. Only the first slice of the first-order kernel was analyzed. The mfPhNR amplitude was measured at a fixed time in the trough from baseline (BT) as well as at the same fixed time in the trough from the preceding b-wave peak (PT). Additionally, we also analyzed BT normalized either to PT (BT/PT) or to the b-wave amplitude (BT/b-wave). The relative reliability of test-retest differences for each test location was estimated by the Wilcoxon matched-pair signed-rank test and intraclass correlation coefficients (ICC). Absolute test-retest reliability was estimated by Bland-Altman analysis. The test-retest amplitude differences for neither of the two measurement techniques were statistically significant as determined by Wilcoxon matched-pair signed-rank test. PT measurements showed greater ICC values than BT amplitude measurements for all test locations. For each measurement technique, the ICC value of the macular response was greater than that of the surrounding locations. The mean test-retest difference was close to zero for both techniques at each of the test locations, and while the coefficient of reliability (COR-1.96 times the standard deviation of the test-retest difference) was comparable for the two techniques at each test location when expressed in nanovolts, the %COR (COR normalized to the mean test and retest amplitudes) was superior for PT than BT measurements. The ICC and COR were comparable for the BT/PT and BT/b-wave ratios and were better than the ICC and COR for BT but worse than PT. mfPhNR amplitude measured at a fixed time in the trough from the preceding b-wave peak (PT) shows greater test-retest reliability when compared to amplitude measurement from baseline (BT) or BT amplitude normalized to either the PT or b-wave amplitudes.
Validity and reliability of the Diagnostic Adaptive Behaviour Scale.

PubMed

Tassé, M J; Schalock, R L; Balboni, G; Spreat, S; Navas, P

2016-01-01

The Diagnostic Adaptive Behaviour Scale (DABS) is a new standardised adaptive behaviour measure that provides information for evaluating limitations in adaptive behaviour for the purpose of determining a diagnosis of intellectual disability. This article presents validity evidence and reliability data for the DABS. Validity evidence was based on comparing DABS scores with scores obtained on the Vineland Adaptive Behaviour Scale, second edition. The stability of the test scores was measured using a test and retest, and inter-rater reliability was assessed by computing the inter-respondent concordance. The DABS convergent validity coefficients ranged from 0.70 to 0.84, while the test-retest reliability coefficients ranged from 0.78 to 0.95, and the inter-rater concordance as measured by intraclass correlation coefficients ranged from 0.61 to 0.87. All obtained validity and reliability indicators were strong and comparable with the validity and reliability coefficients of the most commonly used adaptive behaviour instruments. These results and the advantages of the DABS for clinician and researcher use are discussed. © 2015 MENCAP and International Association of the Scientific Study of Intellectual and Developmental Disabilities and John Wiley & Sons Ltd.
Short-interval test-retest interrater reliability of the Structured Clinical Interview for DSM-III-R personality disorders (SCID-II) in outpatients.

PubMed

Dreessen, L; Arntz, A

1998-01-01

The short-interval test-retest interrater reliability of the Structured Clinical Interview for DSM-III-R personality disorders (SCID-II) was studied in a psychotherapy outpatient group whose main complaint was mostly an Axis I anxiety disorder. Using a test-retest approach to assess interrater reliability, three sources of variance were taken into account (rater variance in the elicitation and interpretation of information and patient variance across interviews). Base rate requirements were established before calculating reliability coefficients. On the whole, interrater agreement on the SCID-II was found to be satisfactory, except for the histrionic personality traits. This is the first study that has estimated short-interval test-retest interrater reliability of the SCID-II in outpatients, and also the first that has studied single SCID-II traits and dimensional diagnoses. The results found support the use of the SCID-II as a diagnostic instrument for clinical and research purposes.
Measuring Quadriceps strength in adults with severe or moderate intellectual and visual disabilities: Feasibility and reliability.

PubMed

Dijkhuizen, Annemarie; Douma, Rob K; Krijnen, Wim P; van der Schans, Cees P; Waninge, Aly

2018-05-30

A feasible and reliable instrument to measure strength in persons with severe intellectual and visual disabilities (SIVD) is lacking. The aim of our study was to determine feasibility, learning period and reliability of three strength tests. Twenty-nine participants with SIVD performed the Minimum Sit-to-Stand Height test (MSST), the Leg Extension test (LE) and the 30 seconds Chair-Stand test (30sCS), once per week for 5 weeks. Feasibility was determined by the percentage of successful measurements; learning effect by using paired t test between two consecutive measurements; test-retest reliability by intraclass correlation coefficient and Limits of Agreement and, correlations by Pearson correlations. A sufficient feasibility and learning period of the tests was shown. The methods had sufficient test-retest reliability and moderate-to-sufficient correlations. The MSST, the LE, and the 30sCS are feasible tests for measuring muscle strength in persons with SIVD, having sufficient test re-test reliability. © 2018 John Wiley & Sons Ltd.
Reliability and construct validity of the Spanish version of the 6-item CTS symptoms scale for outcomes assessment in carpal tunnel syndrome.

PubMed

Rosales, Roberto S; Martin-Hidalgo, Yolanda; Reboso-Morales, Luis; Atroshi, Isam

2016-03-03

The purpose of this study was to assess the reliability and construct validity of the Spanish version of the 6-item carpal tunnel syndrome (CTS) symptoms scale (CTS-6). In this cross-sectional study 40 patients diagnosed with CTS based on clinical and neurophysiologic criteria, completed the standard Spanish versions of the CTS-6 and the disabilities of the arm, shoulder and hand (QuickDASH) scales on two occasions with a 1-week interval. Internal-consistency reliability was assessed with the Cronbach alpha coefficient and test-retest reliability with the intraclass correlation coefficient, two way random effect model and absolute agreement definition (ICC2,1). Cross-sectional precision was analyzed with the Standard Error of the Measurement (SEM). Longitudinal precision for test-retest reliability coefficient was assessed with the Standard Error of the Measurement difference (SEMdiff) and the Minimal Detectable Change at 95 % confidence level (MDC95). For assessing construct validity it was hypothesized that the CTS-6 would have a strong positive correlation with the QuickDASH, analyzed with the Pearson correlation coefficient (r). The standard Spanish version of the CTS-6 presented a Cronbach alpha of 0.81 with a SEM of 0.3. Test-retest reliability showed an ICC of 0.85 with a SRMdiff of 0.36 and a MDC95 of 0.7. The correlation between CTS-6 and the QuickDASH was concordant with the a priori formulated construct hypothesis (r 0.69) CONCLUSIONS: The standard Spanish version of the 6-item CTS symptoms scale showed good internal consistency, test-retest reliability and construct validity for outcomes assessment in CTS. The CTS-6 will be useful to clinicians and researchers in Spanish speaking parts of the world. The use of standardized outcome measures across countries also will facilitate comparison of research results in carpal tunnel syndrome.
Developing a Danish version of the "Impact on Participation and Autonomy Questionnaire".

PubMed

Ghaziani, Emma; Krogh, Anne Grethe; Lund, Hans

2013-05-01

To translate the "Impact on Participation and Autonomy Questionnaire" into Danish (IPAQ-DK), and estimate its internal consistency and test-retest reliability in order to promote participation-based interventions and research. Translation and two successive reliability assessments through test-retest. 137 adults with varying degrees of impairment; of these, 67 participated in the final reliability assessment. The translation followed guidelines set forth by the "European Group for Quality of Life Assessment and Health Measurement". Internal consistency for subscales was estimated by Chronbach's alpha. Weighted kappa coefficients and intraclass correlation coefficients were calculated to assess the test-retest reliability at item and subscale level, respectively. A preliminary reliability assessment revealed residual issues regarding the translation and cultural adaptation of the instrument. The revised version (IPAQ-DK) was subsequently subjected to a similar assessment demonstrating Chronbach's alpha values from 0.698 to 0.817. Weighted kappa ranged from 0.370 to 0.880; 78% of these values were higher than 0.600. The intraclass correlation coefficient covered values from 0.701 to 0.818. IPAQ-DK is a useful instrument for identifying person-perceived participation restrictions and satisfaction with participation. Further studies of IPAQ-DK's floor/ceiling effects and responsiveness to change are recommended, and whether there is a need for further linguistic improvement of certain items.
Influences on and Limitations of Classical Test Theory Reliability Estimates.

ERIC Educational Resources Information Center

Arnold, Margery E.

It is incorrect to say "the test is reliable" because reliability is a function not only of the test itself, but of many factors. The present paper explains how different factors affect classical reliability estimates such as test-retest, interrater, internal consistency, and equivalent forms coefficients. Furthermore, the limits of classical test…
Test-retest reliability of neurophysiological tests of hand-arm vibration syndrome in vibration exposed workers and unexposed referents.

PubMed

Gerhardsson, Lars; Gillström, Lennart; Hagberg, Mats

2014-01-01

Exposure to hand-held vibrating tools may cause the hand-arm vibration syndrome (HAVS). The aim was to study the test-retest reliability of hand and muscle strength tests, and tests for the determination of thermal and vibration perception thresholds, which are used when investigating signs of neuropathy in vibration exposed workers. In this study, 47 vibration exposed workers who had been investigated at the department of Occupational and Environmental Medicine in Gothenburg were compared with a randomized sample of 18 unexposed subjects from the general population of the city of Gothenburg. All participants passed a structured interview, answered several questionnaires and had a physical examination including hand and finger muscle strength tests, determination of vibrotactile (VPT) and thermal perception thresholds (TPT). Two weeks later, 23 workers and referents, selected in a randomized manner, were called back for the same test-procedures for the evaluation of test-retest reliability. The test-retest reliability after a two week interval expressed as limits of agreement (LOA; Bland-Altman), intra-class correlation coefficients (ICC) and Pearson correlation coefficients was excellent for tests with the Baseline hand grip, Pinch-grip and 3-Chuck grip among the exposed workers and referents (N = 23: percentage of differences within LOA 91 - 100%; ICC-values ≥0.93; Pearson r ≥0.93). The test-retest reliability was also excellent (percentage of differences within LOA 96-100 %) for the determination of vibration perception thresholds in digits 2 and 5 bilaterally as well as for temperature perception thresholds in digits 2 and 5, bilaterally (percentage of differences within LOA 91 - 96%). For ICC and Pearson r the results for vibration perception thresholds were good for digit 2, left hand and for digit 5, bilaterally (ICC ≥ 0.84; r ≥0.85), and lower (ICC = 0.59; r = 0.59) for digit 2, right hand. For the latter two indices the test-retest reliability for the determination of temperature thresholds was lower and showed more varying results. The strong test-retest reliability for hand and muscle strength tests as well as for the determination of VPTs makes these procedures useful for diagnostic purposes and follow-up studies in vibration exposed workers.
Test-retest reliability at the item level and total score level of the Norwegian version of the Spinal Cord Injury Falls Concern Scale (SCI-FCS).

PubMed

Roaldsen, Kirsti Skavberg; Måøy, Åsa Blad; Jørgensen, Vivien; Stanghelle, Johan Kvalvik

2016-05-01

Translation of the Spinal Cord Injury Falls Concern Scale (SCI-FCS), and investigation of test-retest reliability on item-level and total-score-level. Translation, adaptation and test-retest study. A specialized rehabilitation setting in Norway. Fifty-four wheelchair users with a spinal cord injury. The median age of the cohort was 49 years, and the median number of years after injury was 13. Interventions/measurements: The SCI-FCS was translated and back-translated according to guidelines. Individuals answered the SCI-FCS twice over the course of one week. We investigated item-level test-retest reliability using Svensson's rank-based statistical method for disagreement analysis of paired ordinal data. For relative reliability, we analyzed the total-score-level test-retest reliability with intraclass correlation coefficients (ICC2.1), the standard error of measurement (SEM), and the smallest detectable change (SDC) for absolute reliability/measurement-error assessment and Cronbach's alpha for internal consistency. All items showed satisfactory percentage agreement (≥69%) between test and retest. There were small but non-negligible systematic disagreements among three items; we recovered an 11-13% higher chance for a lower second score. There was no disagreement due to random variance. The test-retest agreement (ICC2.1) was excellent (0.83). The SEM was 2.6 (12%), and the SDC was 7.1 (32%). The Cronbach's alpha was high (0.88). The Norwegian SCI-FCS is highly reliable for wheelchair users with chronic spinal cord injuries.
Five times sit-to-stand test in subjects with total knee replacement: Reliability and relationship with functional mobility tests.

PubMed

Medina-Mirapeix, Francesc; Vivo-Fernández, Iván; López-Cañizares, Juan; García-Vidal, José A; Benítez-Martínez, Josep Carles; Del Baño-Aledo, María Elena

2018-01-01

The objective was to determine the inter-observer and test/retest reliability of the "Five-repetition sit-to-stand" (5STS) test in patients with total knee replacement (TKR). To explore correlation between 5STS and two mobility tests. A reliability study was conducted among 24 (mean age 72.13, S.D. 10.67; 50% were women) outpatients with TKR. They were recruited from a traumatology unit of a public hospital via convenience sampling. A physiotherapist and trauma physician assessed each patient at the same time. The same physiotherapist realized a 5STS second measurement 45-60min after the first one. Reliability was assessed with intraclass correlation coefficients (ICCs) and Bland-Altman plots. Pearson coefficient was calculated to assess the correlation between 5STS, time up to go test (TUG) and four meters gait speed (4MGS). ICC for inter-observer and test-retest reliability of the 5STS were 0.998 (95% confidence interval [CI], 0.995-0.999) and 0.982 (95% CI, 0.959-0.992). Bland-Altman plot inter-observer showed limits between -0.82 and 1.06 with a mean of 0.11 and no heteroscedasticity within the data. Bland-Altman plot for test-retest showed the limits between 1.76 and 4.16, a mean of 1.20 and heteroscedasticity within the data. Pearson correlation coefficient revealed significant correlation between 5STS and TUG (r=0.7, p<0.001) and 4MGS (r=-0.583, p=0.003). This study demonstrates excellent inter-observer and test-retest reliability when it is used in people with TKR, and also significant correlation with other functional mobility tests. These findings support the use of 5STS as outcome measure in TKR population. Copyright © 2017 Elsevier B.V. All rights reserved.
Y-balance test: a reliability study involving multiple raters.

PubMed

Shaffer, Scott W; Teyhen, Deydre S; Lorenson, Chelsea L; Warren, Rick L; Koreerat, Christina M; Straseske, Crystal A; Childs, John D

2013-11-01

The Y-balance test (YBT) is one of the few field expedient tests that have shown predictive validity for injury risk in an athletic population. However, analysis of the YBT in a heterogeneous population of active adults (e.g., military, specific occupations) involving multiple raters with limited experience in a mass screening setting is lacking. The primary purpose of this study was to determine interrater test-retest reliability of the YBT in a military setting using multiple raters. Sixty-four service members (53 males, 11 females) actively conducting military training volunteered to participate. Interrater test-retest reliability of the maximal reach had intraclass correlation coefficients (2,1) of 0.80 to 0.85 with a standard error of measurement ranging from 3.1 to 4.2 cm for the 3 reach directions (anterior, posteromedial, and posterolateral). Interrater test-retest reliability of the average reach of 3 trails had an intraclass correlation coefficients (2,3) range of 0.85 to 0.93 with an associated standard error of measurement ranging from 2.0 to 3.5cm. The YBT showed good interrater test-retest reliability with an acceptable level of measurement error among multiple raters screening active duty service members. In addition, 31.3% (n = 20 of 64) of participants exhibited an anterior reach asymmetry of >4cm, suggesting impaired balance symmetry and potentially increased risk for injury. Reprint & Copyright © 2013 Association of Military Surgeons of the U.S.
Test-Retest Reliability of Rating of Perceived Exertion and Agreement With 1-Repetition Maximum in Adults.

PubMed

Bove, Allyn M; Lynch, Andrew D; DePaul, Samantha M; Terhorst, Lauren; Irrgang, James J; Fitzgerald, G Kelley

2016-09-01

Study Design Clinical measurement. Background It has been suggested that rating of perceived exertion (RPE) may be a useful alternative to 1-repetition maximum (1RM) to determine proper resistance exercise dosage. However, the test-retest reliability of RPE for resistance exercise has not been determined. Additionally, prior research regarding the relationship between 1RM and RPE is conflicting. Objectives The purpose of this study was to (1) determine test-retest reliability of RPE related to resistance exercise and (2) assess agreement between percentages of 1RM and RPE during quadriceps resistance exercise. Methods A sample of participants with and without knee pathology completed a series of knee extension exercises and rated the perceived difficulty of each exercise on a 0-to-10 RPE scale, then repeated the procedure 1 to 2 weeks later for test-retest reliability. To determine agreement between RPE and 1RM, participants completed knee extension exercises at various percentages of their 1RM (10% to 130% of predicted 1RM) and rated the perceived difficulty of each exercise on a 0-to-10 RPE scale. Percent agreement was calculated between the 1RM and RPE at each resistance interval. Results The intraclass correlation coefficient indicated excellent test-retest reliability of RPE for quadriceps resistance exercises (intraclass correlation coefficient = 0.895; 95% confidence interval: 0.866, 0.918). Overall percent agreement between RPE and 1RM was 60%, but agreement was poor within the ranges that would typically be used for training (50% 1RM for muscle endurance, 70% 1RM and greater for strength). Conclusion Test-retest reliability of perceived exertion during quadriceps resistance exercise was excellent. However, agreement between the RPE and 1RM was poor, especially in common training zones for knee extensor strengthening. J Orthop Sports Phys Ther 2016;46(9):768-774. Epub 5 Aug 2016. doi:10.2519/jospt.2016.6498.
Analysis of Test-Retest Reliability, Construct Validity, and Internal Consistency of the Brazilian Version of the Pelvic Girdle Questionnaire.

PubMed

Simões, Luan; Teixeira-Salmela, Luci Fuscaldi; Magalhães, Lívia; Stuge, Britt; Laurentino, Glória; Wanderley, Elaine; Barros, Raphaela; Lemos, Andrea

2018-04-24

The purpose of this study was to evaluate test-retest reliability, construct validity, and internal consistency of the Brazilian version of the Pelvic Girdle Questionnaire (PGQ-Brazil). Analysis of the measurement properties was carried out in 4 steps. Step 1 was the pilot study, on which basis 4 hypotheses were formulated. These hypotheses were tested during the next step (construct validity, step 2) by completion of the questionnaire by the 2 groups (in pain [n = 105] and not in pain [n = 52]). For implementation of the PGQ-Brazil in the group with pain, we calculated the internal consistency (step 3) and, 7 days later, test-retest reliability (step 4) by re-application of the instrument in this group. First, the PGQ-Brazil was able to discriminate between these groups (construct validity). Second, test-retest reliability (intraclass correlation coefficients for Activities subscale [0.97 with 95% confidence interval of 0.95-0.98] and Symptoms subscale [0.98 with 95% confidence interval of 0.97-0.98] and κ coefficient between 0.50 and 0.89 for the items) was found to be good; the Bland-Altman test indicated satisfactory agreement. The Rasch analysis indicated good internal consistency, and the instrument's ability to divide the participants into at least 3 levels of skills was confirmed. In contrast, a ceiling effect was observed, as 24% of pregnant women exhibited skills superior to what the PGQ-Brazil could evaluate. The PGQ-Brazil had good internal consistency, test-retest reliability, and construct validity in assessment of limitations in activities and symptoms of pregnant women with pelvic girdle pain. Copyright © 2018. Published by Elsevier Inc.

The Spaeth/Richman contrast sensitivity test (SPARCS): design, reproducibility and ability to identify patients with glaucoma.

PubMed

Richman, Jesse; Zangalli, Camila; Lu, Lan; Wizov, Sheryl S; Spaeth, Eric; Spaeth, George L

2015-01-01

(1) To determine the ability of a novel, internet-based contrast sensitivity test titled the Spaeth/Richman Contrast Sensitivity Test (SPARCS) to identify patients with glaucoma. (2) To determine the test-retest reliability of SPARCS. A prospective, cross-sectional study of patients with glaucoma and controls was performed. Subjects were assessed by SPARCS and the Pelli-Robson chart. Reliability of each test was assessed by the intraclass correlation coefficient and the coefficient of repeatability. Sensitivity and specificity for identifying glaucoma was also evaluated. The intraclass correlation coefficient for SPARCS was 0.97 and 0.98 for Pelli-Robson. The coefficient of repeatability for SPARCS was ±6.7% and ±6.4% for Pelli-Robson. SPARCS identified patients with glaucoma with 79% sensitivity and 93% specificity. SPARCS has high test-retest reliability. It is easily accessible via the internet and identifies patients with glaucoma well. NCT01300949. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Reliability of the Client-Centeredness of Goal Setting (C-COGS) Scale in Acquired Brain Injury Rehabilitation.

PubMed

Doig, Emmah; Prescott, Sarah; Fleming, Jennifer; Cornwell, Petrea; Kuipers, Pim

2016-01-01

To examine the internal reliability and test-retest reliability of the Client-Centeredness of Goal Setting (C-COGS) scale. The C-COGS scale was administered to 42 participants with acquired brain injury after completion of multidisciplinary goal planning. Internal reliability of scale items was examined using item-partial total correlations and Cronbach's α coefficient. The scale was readministered within a 1-mo period to a subsample of 12 participants to examine test-retest reliability by calculating exact and close percentage agreement for each item. After examination of item-partial total correlations, test items were revised. The revised items demonstrated stronger internal consistency than the original items. Preliminary evaluation of test-retest reliability was fair, with an average exact percent agreement across all test items of 67%. Findings support the preliminary reliability of the C-COGS scale as a tool to evaluate and promote client-centered goal planning in brain injury rehabilitation. Copyright © 2016 by the American Occupational Therapy Association, Inc.
Development and evaluation of the OHCITIES instrument: assessing alcohol urban environments in the Heart Healthy Hoods project.

PubMed

Sureda, Xisca; Espelt, Albert; Villalbí, Joan R; Cebrecos, Alba; Baranda, Lucía; Pearce, Jamie; Franco, Manuel

2017-10-05

To describe the development and test-retest reliability of OHCITIES, an instrument characterising alcohol urban environment in terms of availability, promotion and signs of consumption. This study involved: (1) developing the conceptual framework for alcohol urban environment by means of literature reviewing and previous alcohol environment research experience; (2) pilot testing and redesigning the instrument; (3) instrument digitalisation; (4) instrument evaluation using test-retest reliability. Data for testing the reliability of the instrument were collected in seven census sections in Madrid in 2016 by two observers. We computed per cent agreement and Cohen's kappa coefficients to estimate inter-rater and test-retest reliability for alcohol outlet environment measures. We calculated interclass coefficients and their 95% CIs to provide a measure of inter-rater reliability for signs of alcohol consumption measures. We collected information on 92 on-premise and 24 off-premise alcohol outlets identified in the studied areas about availability, accessibility and promotion of alcohol. Most per cent-agreement values for alcohol measures in on-premise and off-premise alcohol outlets were greater than 80%, and inter-rater and test-retest reliability values were generally above 0.80. Observers identified 26 streets and 3 public squares with signs of alcohol consumption. Intraclass correlation coefficient between observers for any type of signs of alcohol consumption was 0.50 (95% CI -0.09 to 0.77). Few items promoting alcohol unrelated to alcohol outlets were found on public spaces. The OHCITIES instrument is a reliable instrument to characterise alcohol urban environment. This instrument might be used to understand how alcohol environment associates with alcohol behaviours and its related health outcomes, and can help in the design and evaluation of policies to reduce the harm caused by alcohol. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Test-retest reliability and construct validity of the ENERGY-child questionnaire on energy balance-related behaviours and their potential determinants: the ENERGY-project.

PubMed

Singh, Amika S; Vik, Froydis N; Chinapaw, Mai J M; Uijtdewilligen, Léonie; Verloigne, Maïté; Fernández-Alvira, Juan M; Stomfai, Sarolta; Manios, Yannis; Martens, Marloes; Brug, Johannes

2011-12-09

Insight in children's energy balance-related behaviours (EBRBs) and their determinants is important to inform obesity prevention research. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. To examine the test-retest reliability and construct validity of the child questionnaire used in the ENERGY-project, measuring EBRBs and their potential determinants among 10-12 year old children. We collected data among 10-12 year old children (n = 730 in the test-retest reliability study; n = 96 in the construct validity study) in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent face-to-face interview was assessed using ICC and percentage agreement. Of the 150 questionnaire items, 115 (77%) showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Test-retest reliability was moderate for 34 items (23%) and poor for one item. Construct validity appeared to be good to excellent for 70 (47%) of the 150 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 80 items, construct validity was moderate for 39 (26%) and poor for 41 items (27%). Our results demonstrate that the ENERGY-child questionnaire, assessing EBRBs of the child as well as personal, family, and school-environmental determinants related to these EBRBs, has good test-retest reliability and moderate to good construct validity for the large majority of items.
Test-retest reliability and construct validity of the ENERGY-child questionnaire on energy balance-related behaviours and their potential determinants: the ENERGY-project

PubMed Central

2011-01-01

Background Insight in children's energy balance-related behaviours (EBRBs) and their determinants is important to inform obesity prevention research. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. Objective To examine the test-retest reliability and construct validity of the child questionnaire used in the ENERGY-project, measuring EBRBs and their potential determinants among 10-12 year old children. Methods We collected data among 10-12 year old children (n = 730 in the test-retest reliability study; n = 96 in the construct validity study) in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent face-to-face interview was assessed using ICC and percentage agreement. Results Of the 150 questionnaire items, 115 (77%) showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Test-retest reliability was moderate for 34 items (23%) and poor for one item. Construct validity appeared to be good to excellent for 70 (47%) of the 150 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 80 items, construct validity was moderate for 39 (26%) and poor for 41 items (27%). Conclusions Our results demonstrate that the ENERGY-child questionnaire, assessing EBRBs of the child as well as personal, family, and school-environmental determinants related to these EBRBs, has good test-retest reliability and moderate to good construct validity for the large majority of items. PMID:22152048
Inter- and intra-observer reliability of clinical movement-control tests for marines

PubMed Central

2012-01-01

Background Musculoskeletal disorders particularly in the back and lower extremities are common among marines. Here, movement-control tests are considered clinically useful for screening and follow-up evaluation. However, few studies have addressed the reliability of clinical tests, and no such published data exists for marines. The present aim was therefore to determine the inter- and intra-observer reliability of clinically convenient tests emphasizing movement control of the back and hip among marines. A secondary aim was to investigate the sensitivity and specificity of these clinical tests for discriminating musculoskeletal pain disorders in this group of military personnel. Methods This inter- and intra-observer reliability study used a test-retest approach with six standardized clinical tests focusing on movement control for back and hip. Thirty-three marines (age 28.7 yrs, SD 5.9) on active duty volunteered and were recruited. They followed an in-vivo observation test procedure that covered both low- and high-load (threshold) tasks relevant for marines on operational duty. Two independent observers simultaneously rated performance as “correct” or “incorrect” following a standardized assessment protocol. Re-testing followed 7–10 days thereafter. Reliability was analysed using kappa (κ) coefficients, while discriminative power of the best-fitting tests for back- and lower-extremity pain was assessed using a multiple-variable regression model. Results Inter-observer reliability for the six tests was moderate to almost perfect with κ-coefficients ranging between 0.56-0.95. Three tests reached almost perfect inter-observer reliability with mean κ-coefficients > 0.81. However, intra-observer reliability was fair-to-moderate with mean κ-coefficients between 0.22-0.58. Three tests achieved moderate intra-observer reliability with κ-coefficients > 0.41. Combinations of one low- and one high-threshold test best discriminated prior back pain, but results were inconsistent for lower-extremity pain. Conclusions Our results suggest that clinical tests of movement control of back and hip are reliable for use in screening protocols using several observers with marines. However, test-retest reproducibility was less accurate, which should be considered in follow-up evaluations. The results also indicate that combinations of low- and high-threshold tests have discriminative validity for prior back pain, but were inconclusive for lower-extremity pain. PMID:23273285
Adaptation, test-retest reliability, and construct validity of the Physical Activity Neighborhood Environment Scale in Nigeria (PANES-N).

PubMed

Oyeyemi, Adewale L; Sallis, James F; Oyeyemi, Adetoyeje Y; Amin, Mariam M; De Bourdeaudhuij, Ilse; Deforche, Benedicte

2013-11-01

This study adapted the Physical Activity Neighborhood Environment Scale (PANES) to the Nigerian context and assessed the test-retest reliability and construct validity of the Nigerian version (PANESN). A multidisciplinary panel of experts adapted the original PANES to reflect the built and social environment of Nigeria. The adapted PANES was subjected to cognitive testing and test retest reliability in a diverse sample of Nigerian adults (N = 132) from different neighborhood types. Intraclass Correlation Coefficients (ICC) was used to assess test-retest reliability, and construct validity was investigated with Analysis of Covariance for differences in environmental attributes between neighborhoods. Four of the 17 items on the original PANES were significantly modified, 3 were removed and 2 new items were incorporated into the final version of adapted PANES-N. Test-retest reliability was substantial to almost perfect (ICC = 0.62-1.00) for all items on the PANES-N, and residents of neighborhoods in the inner city reported higher residential density, land use mix and safety, but lower pedestrian facilities and aesthetics than did residents of government reserved area/new layout neighborhoods. The PANES-N appears promising for assessing environmental perceptions related to physical activity in Nigeria, but further testing is required to assess its applicability across Africa.
One year test-retest reliability of neurocognitive baseline scores in 10- to 12-year olds.

PubMed

Moser, Rosemarie Scolaro; Schatz, Philip; Grosner, Emily; Kollias, Kelly

2017-01-01

How often youth athletes 10-12 years of age should undergo neurocognitive baseline testing remains an unanswered question. We sought to examine the test-retest reliability of annual ImPACT data in a sample of middle school athletes. Participants were 30 youth athletes, ages 10-12 years (Mean = 11.6, SD = 0.6) selected from a larger database of 10-18 year old athletes, who completed two consecutive annual baseline evaluations using the online version of ImPACT. Athlete assent and parental consent were obtained for all participants. Assessments were conducted either individually or in small groups of 2 to 3 athletes, under the supervision of a neuropsychologist or post-doctoral fellow. Test-retest coefficients were as follows: Verbal Memory .71, Visual Memory .35, Visual Motor Speed .69, Reaction Time .34. Intra-class Correlation Coefficients (single/average) were as follows: Verbal Memory .70/.83, Visual Memory .35/.52, Visual Motor Speed .69/.82, Reaction Time .34/.50. Regression-based measures to correct for practice effects revealed that only a small percentage of cases fell outside 90 and 95% confidence intervals, reflecting stability across assessments. Findings indicate that test-retest reliability of Verbal Memory and Visual Motor Speed are generally stable in 10-12 year old athletes. Nevertheless, Visual Memory Index, Reaction Time Index, and Symptom Checklist scores appear to be less reliable over time, especially compared to published data on high school athletes, suggesting the utility of re-testing on an annual basis in this younger age group.
Clinical applications of correlational vestibular autorotation test.

PubMed

Hsieh, Li-Chun; Lin, Te-Ming; Chang, Yu-Min; Kuo, Terry B J; Lee, Gho-She

2015-06-01

The correlational vestibular autorotation test (VAT) system has the advantages of good test-retest reliability and calibrations of absolute degrees of eye movement are unnecessary when acquiring a cross correlation coefficient (CCC). The approach is able to efficiently detect peripheral vestibulopathies. A VAT has some drawbacks including poor test-retest reliability and slippage of sensor. This study aimed to develop a correlational VAT system and to evaluate the reliability and applicability of this system. Twenty healthy participants and 10 vertiginous patients were enrolled. Vertical and horizontal autorotations from 0 to 3 Hz with either closed or open eyes were performed. A small sensor and a wireless transmission technique were used to acquire the electro-ocular graph and head velocity signals. The two signals were analyzed using CCCs to assess the functioning of the vestibular ocular reflex (VOR). The results showed a significantly greater CCC for open-eye versus closed-eye of head autorotations. The CCCs also increased significantly with head rotational frequencies. Moreover, the CCCs significantly correlated with the VOR gains at autorotation frequencies ≥1.0 Hz. The test-retest reliability was good (intraclass correlation coefficients ≥0.85). The vertiginous participants had significantly lower individual CCCs and overall average CCC than age- and-gender matched controls.
The reliability of eyetracking to assess attentional bias to threatening words in healthy individuals.

PubMed

Skinner, Ian W; Hübscher, Markus; Moseley, G Lorimer; Lee, Hopin; Wand, Benedict M; Traeger, Adrian C; Gustin, Sylvia M; McAuley, James H

2017-08-15

Eyetracking is commonly used to investigate attentional bias. Although some studies have investigated the internal consistency of eyetracking, data are scarce on the test-retest reliability and agreement of eyetracking to investigate attentional bias. This study reports the test-retest reliability, measurement error, and internal consistency of 12 commonly used outcome measures thought to reflect the different components of attentional bias: overall attention, early attention, and late attention. Healthy participants completed a preferential-looking eyetracking task that involved the presentation of threatening (sensory words, general threat words, and affective words) and nonthreatening words. We used intraclass correlation coefficients (ICCs) to measure test-retest reliability (ICC > .70 indicates adequate reliability). The ICCs(2, 1) ranged from -.31 to .71. Reliability varied according to the outcome measure and threat word category. Sensory words had a lower mean ICC (.08) than either affective words (.32) or general threat words (.29). A longer exposure time was associated with higher test-retest reliability. All of the outcome measures, except second-run dwell time, demonstrated low measurement error (<6%). Most of the outcome measures reported high internal consistency (α > .93). Recommendations are discussed for improving the reliability of eyetracking tasks in future research.
ScoreRel CI: An Excel Program for Computing Confidence Intervals for Commonly Used Score Reliability Coefficients

ERIC Educational Resources Information Center

Barnette, J. Jackson

2005-01-01

An Excel program developed to assist researchers in the determination and presentation of confidence intervals around commonly used score reliability coefficients is described. The software includes programs to determine confidence intervals for Cronbachs alpha, Pearson r-based coefficients such as those used in test-retest and alternate forms…
Reliability and validity of selected measures associated with increased fall risk in females over the age of 45 years with distal radius fracture - A pilot study.

PubMed

Mehta, Saurabh P; MacDermid, Joy C; Richardson, Julie; MacIntyre, Norma J; Grewal, Ruby

2015-01-01

Clinical measurement. This study examined test-retest reliability and convergent/divergent construct validity of selected tests and measures that assess balance impairment, fear of falling (FOF), impaired physical activity (PA), and lower extremity muscle strength (LEMS) in females >45 years of age after the distal radius fracture (DRF) population. Twenty one female participants with DRF were assessed on two occasions. Timed Up and Go, Functional Reach, and One Leg Standing tests assessed balance impairment. Shortened Falls Efficacy Scale, Activity-specific Balance Confidence scale, and Fall Risk Perception Questionnaire assessed FOF. International Physical Activity Questionnaire and Rapid Assessment of Physical Activity were administered to assess PA level. Chair stand test and isometric muscle strength testing for hip and knee assessed LEMS. Intraclass correlation coefficients (ICC) examined the test-retest reliability of the measures. Pearson correlation coefficients (r) examined concurrent relationships between the measures. The results demonstrated fair to excellent test-retest reliability (ICC between 0.50 and 0.96) and low to moderate concordance between the measures (low if r ≤ 0.4; moderate if r = 0.4-0.7). The results provide preliminary estimates of test-retest reliability and convergent/divergent construct validity of selected measures associated with increased risk for falling in the females >45 years of age after DRF. Further research directions to advance knowledge regarding fall risk assessment in DRF population have been identified. Copyright © 2015 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
Reliability, Validity, and Cross-Cultural Adaptation of the Turkish Version of the Bournemouth Questionnaire.

PubMed

Gunaydin, Gurkan; Citaker, Seyit; Meray, Jale; Cobanoglu, Gamze; Gunaydin, Ozge Ece; Hazar Kanik, Zeynep

2016-11-01

Validation of a self-report questionnaire. The purpose of this study was to investigate adaptation, validity, and reliability of the Turkish version of the Bournemouth Questionnaire. Low back pain is one of the most frequent disorders leading to activity limitation. This pain affects most of people in their lives. The most important point to evaluate patient's functional abilities and to decide a successful therapy procedure is to manage the assessment questionnaires precisely. One hundred ten patients with chronic low back pain were included in present study. To assess reliability, test-retest and internal consistency analyses were applied. The results of test-retest analysis were assessed by using Intraclass Correlation Coefficient method (95% confidence interval). For internal consistency, Cronbach alpha value was calculated. Validity of the questionnaire was assessed in terms of construct validity. For construct validity, factor analysis and convergent validity were tested. For convergent validity, total points of the Bournemouth Questionnaire were assessed with the total points of Quebec Back Pain Disability Scale and Roland Morris Disability Questionnaire by using Pearson correlation coefficient analysis. Cronbach alpha value was found 0.914, showing that this questionnaire has high internal consistency. The results of test-retest analysis were varying between 0.851 and 0.927, which shows that test-retest results are highly correlated. Factor analysis test indicated that this questionnaire had one factor. Pearson correlation coefficient of the Bournemouth Questionnaire with Roland Morris Disability Questionnaire was calculated 0.703 and it was found with Quebec Back Pain Disability Scale is 0.659. These results showed that the Bournemouth Questionnaire is very good correlated with Roland Morris Disability Questionnaire and Quebec Back Pain Disability Scale. The Turkish version of the Bournemouth Questionnaire is valid and reliable. 3.
Urdu translation of the Hamilton Rating Scale for Depression: Results of a validation study

PubMed Central

Hashmi, Ali M.; Naz, Shahana; Asif, Aftab; Khawaja, Imran S.

2016-01-01

Objective: To develop a standardized validated version of the Hamilton Rating Scale for Depression (HAM-D) in Urdu. Methods: After translation of the HAM-D into the Urdu language following standard guidelines, the final Urdu version (HAM-D-U) was administered to 160 depressed outpatients. Inter-item correlation was assessed by calculating Cronbach alpha. Correlation between HAM-D-U scores at baseline and after a 2-week interval was evaluated for test-retest reliability. Moreover, scores of two clinicians on HAM-D-U were compared for inter-rater reliability. For establishing concurrent validity, scores of HAM-D-U and BDI-U were compared by using Spearman correlation coefficient. The study was conducted at Mayo Hospital, Lahore, from May to December 2014. Results: The Cronbach alpha for HAM-D-U was 0.71. Composite scores for HAM-D-U at baseline and after a 2-week interval were also highly correlated with each other (Spearman correlation coefficient 0.83, p-value < 0.01) indicating good test-retest reliability. Composite scores for HAM-D-U and BDI-U were positively correlated with each other (Spearman correlation coefficient 0.85, p < 0.01) indicating good concurrent validity. Scores of two clinicians for HAM-D-U were also positively correlated (Spearman correlation coefficient 0.82, p-value < 0.01) indicated good inter-rater reliability. Conclusion: The HAM-D-U is a valid and reliable instrument for the assessment of Depression. It shows good inter-rater and test-retest reliability. The HAM-D-U can be a tool either for clinical management or research. PMID:28083049
Urdu translation of the Hamilton Rating Scale for Depression: Results of a validation study.

PubMed

Hashmi, Ali M; Naz, Shahana; Asif, Aftab; Khawaja, Imran S

2016-01-01

To develop a standardized validated version of the Hamilton Rating Scale for Depression (HAM-D) in Urdu. After translation of the HAM-D into the Urdu language following standard guidelines, the final Urdu version (HAM-D-U) was administered to 160 depressed outpatients. Inter-item correlation was assessed by calculating Cronbach alpha. Correlation between HAM-D-U scores at baseline and after a 2-week interval was evaluated for test-retest reliability. Moreover, scores of two clinicians on HAM-D-U were compared for inter-rater reliability. For establishing concurrent validity, scores of HAM-D-U and BDI-U were compared by using Spearman correlation coefficient. The study was conducted at Mayo Hospital, Lahore, from May to December 2014. The Cronbach alpha for HAM-D-U was 0.71. Composite scores for HAM-D-U at baseline and after a 2-week interval were also highly correlated with each other (Spearman correlation coefficient 0.83, p-value < 0.01) indicating good test-retest reliability. Composite scores for HAM-D-U and BDI-U were positively correlated with each other (Spearman correlation coefficient 0.85, p < 0.01) indicating good concurrent validity. Scores of two clinicians for HAM-D-U were also positively correlated (Spearman correlation coefficient 0.82, p-value < 0.01) indicated good inter-rater reliability. The HAM-D-U is a valid and reliable instrument for the assessment of Depression. It shows good inter-rater and test-retest reliability. The HAM-D-U can be a tool either for clinical management or research.
Concordance of DSM-IV Axis I and II diagnoses by personal and informant's interview.

PubMed

Schneider, Barbara; Maurer, Konrad; Sargk, Dieter; Heiskel, Harald; Weber, Bernhard; Frölich, Lutz; Georgi, Klaus; Fritze, Jürgen; Seidler, Andreas

2004-06-30

The validity and reliability of using psychological autopsies to diagnose a psychiatric disorder is a critical issue. Therefore, interrater and test-retest reliability of the Structured Clinical Interview for DSM-IV Axis I and Personality Disorders and the usefulness of these instruments for the psychological autopsy method were investigated. Diagnoses by informant's interview were compared with diagnoses generated by a personal interview of 35 persons. Interrater reliability and test-retest reliability were assessed in 33 and 29 persons, respectively. Chi-square analysis, kappa and intraclass correlation coefficients, and Kendall's tau were used to determine agreement of diagnoses. Kappa coefficients were above 0.84 for substance-related disorders, mood disorders, and anxiety and adjustment disorders, and above 0.65 for Axis II disorders for interrater and test-retest reliability. Agreement by personal and relative's interview generated kappa coefficients above 0.79 for most Axis I and above 0.65 for most personality disorder diagnoses; Kendall's tau for dimensional individual personality disorder scores ranged from 0.22 to 0.72. Despite of a small number of psychiatric disorders in the selected population, the present results provide support for the validity of most diagnoses obtained through the best-estimate method using the Structured Clinical Interview for DSM-IV Axis I and Personality Disorders. This instrument can be recommended as a tool for the psychological autopsy procedure in post-mortem research. Copyright 2004 Elsevier Ireland Ltd.
Agreement between the spatio-temporal gait parameters from treadmill-based photoelectric cell and the instrumented treadmill system in healthy young adults and stroke patients.

PubMed

Lee, Myungmo; Song, Changho; Lee, Kyoungjin; Shin, Doochul; Shin, Seungho

2014-07-14

Treadmill gait analysis was more advantageous than over-ground walking because it allowed continuous measurements of the gait parameters. The purpose of this study was to investigate the concurrent validity and the test-retest reliability of the OPTOGait photoelectric cell system against the treadmill-based gait analysis system by assessing spatio-temporal gait parameters. Twenty-six stroke patients and 18 healthy adults were asked to walk on the treadmill at their preferred speed. The concurrent validity was assessed by comparing data obtained from the 2 systems, and the test-retest reliability was determined by comparing data obtained from the 1st and the 2nd session of the OPTOGait system. The concurrent validity, identified by the intra-class correlation coefficients (ICC [2, 1]), coefficients of variation (CVME), and 95% limits of agreement (LOA) for the spatial-temporal gait parameters, were excellent but the temporal parameters expressed as a percentage of the gait cycle were poor. The test-retest reliability of the OPTOGait System, identified by ICC (3, 1), CVME, 95% LOA, standard error of measurement (SEM), and minimum detectable change (MDC95%) for the spatio-temporal gait parameters, was high. These findings indicated that the treadmill-based OPTOGait System had strong concurrent validity and test-retest reliability. This portable system could be useful for clinical assessments.
Reliability and Validity of the TIMPSI for Infants With Spinal Muscular Atrophy Type I

PubMed Central

Krosschell, Kristin J.; Maczulski, Jo Anne; Scott, Charles; King, Wendy; Hartman, Jill T.; Case, Laura E.; Viazzo-Trussell, Donata; Wood, Janine; Roman, Carolyn A.; Hecker, Eva; Meffert, Marianne; Léveillé, Maude; Kienitz, Krista; Swoboda, Kathryn J.

2014-01-01

Purpose This study examined the reliability and validity of the Test of Infant Motor Performance Screening Items (TIMPSI) in infants with type I spinal muscular atrophy (SMA). Methods After training, 12 evaluators scored 4 videos of infants with type I SMA to assess interrater reliability. Intrarater and test-retest reliability was further assessed for 9 evaluators during a SMA type I clinical trial, with 9 evaluators testing a total of 38 infants twice. Relatedness of the TIMPSI score to ability to reach and ventilatory support was also examined. Results Excellent interrater video score reliability was noted (intraclass correlation coefficient, 0.97–0.98). Intrarater reliability was excellent (intraclass correlation coefficient, 0.91–0.98) and test-retest reliability ranged from r = 0.82 to r = 0.95. The TIMPSI score was related to the ability to reach (P ≤ .05). Conclusion The TIMPSI can reliably be used to assess motor function in infants with type I SMA. In addition, the TIMPSI scores are related to the ability to reach, an important functional skill in children with type I SMA. PMID:23542189
Cardiopulmonary exercise testing early after stroke using feedback-controlled robotics-assisted treadmill exercise: test-retest reliability and repeatability.

PubMed

Stoller, Oliver; de Bruin, Eling D; Schindelholz, Matthias; Schuster-Amft, Corina; de Bie, Rob A; Hunt, Kenneth J

2014-10-11

Exercise capacity is seriously reduced after stroke. While cardiopulmonary assessment and intervention strategies have been validated for the mildly and moderately impaired populations post-stroke, there is a lack of effective concepts for stroke survivors suffering from severe motor limitations. This study investigated the test-retest reliability and repeatability of cardiopulmonary exercise testing (CPET) using feedback-controlled robotics-assisted treadmill exercise (FC-RATE) in severely motor impaired individuals early after stroke. 20 subjects (age 44-84 years, <6 month post-stroke) with severe motor limitations (Functional Ambulatory Classification 0-2) were selected for consecutive constant load testing (CLT) and incremental exercise testing (IET) within a powered exoskeleton, synchronised with a treadmill and a body weight support system. A manual human-in-the-loop feedback system was used to guide individual work rate levels. Outcome variables focussed on standard cardiopulmonary performance parameters. Relative and absolute test-retest reliability were assessed by intraclass correlation coefficients (ICC), standard error of the measurement (SEM), and minimal detectable change (MDC). Mean difference, limits of agreement, and coefficient of variation (CoV) were estimated to assess repeatability. Peak performance parameters during IET yielded good to excellent relative reliability: absolute peak oxygen uptake (ICC =0.82), relative peak oxygen uptake (ICC =0.72), peak work rate (ICC =0.91), peak heart rate (ICC =0.80), absolute gas exchange threshold (ICC =0.91), relative gas exchange threshold (ICC =0.88), oxygen cost of work (ICC =0.87), oxygen pulse at peak oxygen uptake (ICC =0.92), ventilation rate versus carbon dioxide output slope (ICC =0.78). For these variables, SEM was 4-13%, MDC 12-36%, and CoV 0.10-0.36. CLT revealed high mean differences and insufficient test-retest reliability for all variables studied. This study presents first evidence on reliability and repeatability for CPET in severely motor impaired individuals early after stroke using a feedback-controlled robotics-assisted treadmill. The results demonstrate good to excellent test-retest reliability and appropriate repeatability for the most important peak cardiopulmonary performance parameters. These findings have important implications for the design and implementation of cardiovascular exercise interventions in severely impaired populations. Future research needs to develop advanced control strategies to enable the true limit of functional exercise capacity to be reached and to further assess test-retest reliability and repeatability in larger samples.
The analysis of reliability and validity of the IT-MAIS, MAIS and MUSS.

PubMed

Zhong, Yan; Xu, Tianqiu; Dong, Ruijuan; Lyu, Jing; Liu, Bo; Chen, Xueqing

2017-05-01

The aim of this study was to investigate the reliability and validity of the Infant-toddler Meaningful Auditory Integration Scale (IT-MAIS), Meaningful Auditory Integration Scale (MAIS), and Meaningful Use of Speech Scale (MUSS). IT-MAIS, MAIS and MUSS were divided into 3 sub dimensions. 300 children with cochlear implants (CI) were included in the investigation. To assess test-retest reliability of these questionnaires, 30 children were selected randomly to be evaluated at a two-week interval indicated that there were no significant changes between test and retest. Furthermore random test analysis by different evaluators was also administered to 30 users. Reliability test: Test-retest reliability of the three scales was proved to be satisfactory. All domains had correlation coefficients that exceeded 0.750(P < 0.01). The Cronbach's α of the three scales and their three domains were greater than 0.700. Reliability between evaluators of the three scales were considered to be satisfactory. All domains had correlation coefficients that exceeded 0.750(P < 0.01). Validity test: The evaluation of content validity by expert review showed the questionnaire had good content validity; The correlation coefficients between the overall scores of the three scales and their three domains were 0.699-0.978(P < 0.01). There were correlations among the three sub-domains but the strength of the correlations was relatively low. There was certain construct validity. IT-MAIS, MAIS, MUSS scales have good reliability and validity, and can be used to measure the outcome for children with cochlear implants hearing and speech evaluation. Copyright © 2017 Elsevier B.V. All rights reserved.

Brain GABA Detection in vivo with the J-editing 1H MRS Technique: A Comprehensive Methodological Evaluation of Sensitivity Enhancement, Macromolecule Contamination and Test-Retest Reliability

PubMed Central

Shungu, Dikoma C.; Mao, Xiangling; Gonzales, Robyn; Soones, Tacara N.; Dyke, Jonathan P.; van der Veen, Jan Willem; Kegeles, Lawrence S.

2016-01-01

Abnormalities in brain γ-aminobutyric acid (GABA) have been implicated in various neuropsychiatric and neurological disorders. However, in vivo GABA detection by proton magnetic resonance spectroscopy (1H MRS) presents significant challenges arising from low brain concentration, overlap by much stronger resonances, and contamination by mobile macromolecule (MM) signals. This study addresses these impediments to reliable brain GABA detection with the J-editing difference technique on a 3T MR system in healthy human subjects by (a) assessing the sensitivity gains attainable with an 8-channel phased-array head coil, (b) determining the magnitude and anatomic variation of the contamination of GABA by MM, and (c) estimating the test-retest reliability of measuring GABA with this method. Sensitivity gains and test-retest reliability were examined in the dorsolateral prefrontal cortex (DLPFC), while MM levels were compared across three cortical regions: the DLPFC, the medial prefrontal cortex (MPFC) and the occipital cortex (OCC). A 3-fold higher GABA detection sensitivity was attained with the 8-channel head coil compared to the standard single-channel head coil in DLPFC. Despite significant anatomic variation in GABA+MM and MM across the three brain regions (p < 0.05), the contribution of MM to GABA+MM was relatively stable across the three voxels, ranging from 41% to 49%, a non-significant regional variation (p = 0.58). The test-retest reliability of GABA measurement, expressed either as ratios to voxel tissue water (W) or total creatine, was found to be very high for both the single-channel coil and the 8-channel phased-array coil. For the 8-channel coil, for example, Pearson’s correlation coefficient of test vs. retest for GABA/W was 0.98 (R2 = 0.96, p = 0.0007), the percent coefficient of variation (CV) was 1.25%, and the intraclass correlation coefficient (ICC) was 0.98. Similar reliability was also found for the co-edited resonance of combined glutamate and glutamine (Glx) for both coils. PMID:27173449
An alternative to the balance error scoring system: using a low-cost balance board to improve the validity/reliability of sports-related concussion balance testing.

PubMed

Chang, Jasper O; Levy, Susan S; Seay, Seth W; Goble, Daniel J

2014-05-01

Recent guidelines advocate sports medicine professionals to use balance tests to assess sensorimotor status in the management of concussions. The present study sought to determine whether a low-cost balance board could provide a valid, reliable, and objective means of performing this balance testing. Criterion validity testing relative to a gold standard and 7 day test-retest reliability. University biomechanics laboratory. Thirty healthy young adults. Balance ability was assessed on 2 days separated by 1 week using (1) a gold standard measure (ie, scientific grade force plate), (2) a low-cost Nintendo Wii Balance Board (WBB), and (3) the Balance Error Scoring System (BESS). Validity of the WBB center of pressure path length and BESS scores were determined relative to the force plate data. Test-retest reliability was established based on intraclass correlation coefficients. Composite scores for the WBB had excellent validity (r = 0.99) and test-retest reliability (R = 0.88). Both the validity (r = 0.10-0.52) and test-retest reliability (r = 0.61-0.78) were lower for the BESS. These findings demonstrate that a low-cost balance board can provide improved balance testing accuracy/reliability compared with the BESS. This approach provides a potentially more valid/reliable, yet affordable, means of assessing sports-related concussion compared with current methods.
Evaluating the test-retest reliability of symptom indices associated with the ImPACT post-concussion symptom scale (PCSS).

PubMed

Merritt, Victoria C; Bradson, Megan L; Meyer, Jessica E; Arnett, Peter A

2018-05-01

The Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) is a commonly used tool in sports concussion assessment. While test-retest reliabilities have been established for the ImPACT cognitive composites, few studies have evaluated the psychometric properties of the ImPACT's Post-Concussion Symptom Scale (PCSS). The purpose of this study was to establish the test-retest reliability of symptom indices associated with the PCSS. Participants included 38 undergraduate students (50.0% male) who underwent neuropsychological testing as part of their participation in their psychology department's research subject pool. The majority of the participants were Caucasian (94.7%) and had no history of concussion (73.7%). All participants completed the ImPACT at two time points, approximately 6 weeks apart. The PCSS was the main outcome measure, and eight symptom indices were calculated (a total symptom score, three symptom summary indices, and four symptom clusters). Pearson correlations (r) and intraclass correlation coefficients (ICCs) were computed as measures of test-retest reliability. Overall, reliabilities ranged from low to high (r = .44 to .80; ICC = .44 to .77). The cognitive symptom cluster exhibited the highest test-retest reliability (r = .80, ICC = .77), followed by the positive symptom total (PST) index, an indicator of the total number of symptoms endorsed (r = .71, ICC = .69). In contrast, the commonly used total symptom score showed lower test-retest reliability (r = .67, ICC = .62). Paired-samples t tests revealed no significant differences between test and retest for any of the symptom variables (all p > .01). Finally, reliable change indices (RCI) were computed to determine whether differences observed between test and retest represented clinically significant change. RCI values were provided for each symptom index at the 80%, 90%, and 95% confidence intervals. These results suggest that evaluating additional symptom indices beyond the total symptom score from the PCSS is beneficial. Findings from this study can be applied to athlete samples to assess reliable change in symptoms following concussion.
Reliability of a tool for measuring theory of planned behaviour constructs for use in evaluating research use in policymaking

PubMed Central

2011-01-01

Background Although measures of knowledge translation and exchange (KTE) effectiveness based on the theory of planned behavior (TPB) have been used among patients and providers, no measure has been developed for use among health system policymakers and stakeholders. A tool that measures the intention to use research evidence in policymaking could assist researchers in evaluating the effectiveness of KTE strategies that aim to support evidence-informed health system decision-making. Therefore, we developed a 15-item tool to measure four TPB constructs (intention, attitude, subjective norm and perceived control) and assessed its face validity through key informant interviews. Methods We carried out a reliability study to assess the tool's internal consistency and test-retest reliability. Our study sample consisted of 62 policymakers and stakeholders that participated in deliberative dialogues. We assessed internal consistency using Cronbach's alpha and generalizability (G) coefficients, and we assessed test-retest reliability by calculating Pearson correlation coefficients (r) and G coefficients for each construct and the tool overall. Results The internal consistency of items within each construct was good with alpha ranging from 0.68 to alpha = 0.89. G-coefficients were lower for a single administration (G = 0.34 to G = 0.73) than for the average of two administrations (G = 0.79 to G = 0.89). Test-retest reliability coefficients for the constructs ranged from r = 0.26 to r = 0.77 and from G = 0.31 to G = 0.62 for a single administration, and from G = 0.47 to G = 0.86 for the average of two administrations. Test-retest reliability of the tool using G theory was moderate (G = 0.5) when we generalized across a single observation, but became strong (G = 0.9) when we averaged across both administrations. Conclusion This study provides preliminary evidence for the reliability of a tool that can be used to measure TPB constructs in relation to research use in policymaking. Our findings suggest that the tool should be administered on more than one occasion when the intervention promotes an initial 'spike' in enthusiasm for using research evidence (as it seemed to do in this case with deliberative dialogues). The findings from this study will be used to modify the tool and inform further psychometric testing following different KTE interventions. PMID:21702956
Reading Ability as an Estimator of Premorbid Intelligence: Does It Remain Stable Among Ethnically Diverse HIV+ Adults?

PubMed Central

Olsen, J. Pat; Fellows, Robert P.; Rivera-Mindt, Monica; Morgello, Susan; Byrd, Desiree A.

2015-01-01

The Wide Range Achievement Test, 3rd edition, Reading-Recognition subtest (WRAT-3 RR) is an established measure of premorbid ability. Furthermore, its long-term reliability is not well documented, particularly in diverse populations with CNS-relevant disease. Objective: We examined test-retest reliability of the WRAT-3 RR over time in an HIV+ sample of predominantly racial/ethnic minority adults. Method: Participants (N = 88) completed a comprehensive neuropsychological battery, including the WRAT-3 RR, on at least two separate study visits. Intraclass correlation coefficients (ICCs) were computed using scores from baseline and follow-up assessments to determine the test-retest reliability of the WRAT-3 RR across racial/ethnic groups and changes in medical (immunological) and clinical (neurocognitive) factors. Additionally, Fisher’s Z tests were used to determine the significance of the differences between ICCs. Results: The average test-retest interval was 58.7 months (SD=36.4). The overall WRAT-3 RR test-retest reliability was high (r = .97, p < .001), and remained robust across all demographic, medical, and clinical variables (all r’s > .92). Intraclass correlation coefficients did not differ significantly between the subgroups tested (all Fisher’s Z p’s > .05). Conclusions: Overall, this study supports the appropriateness of word-reading tests, such as the WRAT-3 RR, for use as stable premorbid IQ estimates among ethnically diverse groups. Moreover, this study supports the reliability of this measure in the context of change in health and neurocognitive status, and in lengthy inter-test intervals. These findings offer strong rationale for reading as a “hold” test, even in the presence of a chronic, variable disease such as HIV. PMID:26689235
Reliability of intra-oral quantitative sensory testing (QST) in patients with atypical odontalgia and healthy controls - a multicentre study.

PubMed

Baad-Hansen, L; Pigg, M; Yang, G; List, T; Svensson, P; Drangsholt, M

2015-02-01

The reliability of comprehensive intra-oral quantitative sensory testing (QST) protocol has not been examined systematically in patients with chronic oro-facial pain. The aim of the present multicentre study was to examine test-retest and interexaminer reliability of intra-oral QST measures in terms of absolute values and z-scores as well as within-session coefficients of variation (CV) values in patients with atypical odontalgia (AO) and healthy pain-free controls. Forty-five patients with AO and 68 healthy controls were subjected to bilateral intra-oral gingival QST and unilateral extratrigeminal QST (thenar) on three occasions (twice on 1 day by two different examiners and once approximately 1 week later by one of the examiners). Intra-class correlation coefficients and kappa values for interexaminer and test-retest reliability were computed. Most of the standardised intra-oral QST measures showed fair to excellent interexaminer (9-12 of 13 measures) and test-retest (7-11 of 13 measures) reliability. Furthermore, no robust differences in reliability measures or within-session variability (CV) were detected between patients with AO and the healthy reference group. These reliability results in chronic orofacial pain patients support earlier suggestions based on data from healthy subjects that intra-oral QST is sufficiently reliable for use as a part of a comprehensive evaluation of patients with somatosensory disturbances or neuropathic pain in the trigeminal region. © 2014 John Wiley & Sons Ltd.
Measurement Properties of the NIH-Minimal Dataset Dutch Language Version in Patients With Chronic Low Back Pain.

PubMed

Boer, Annemarie; Dutmer, Alisa L; Schiphorst Preuper, Henrica R; van der Woude, Lucas H V; Stewart, Roy E; Deyo, Richard A; Reneman, Michiel F; Soer, Remko

2017-10-01

Validation study with cross-sectional and longitudinal measurements. To translate the US National Institutes of Health (NIH)-minimal dataset for clinical research on chronic low back pain into the Dutch language and to test its validity and reliability among people with chronic low back pain. The NIH developed a minimal dataset to encourage more complete and consistent reporting of clinical research and to be able to compare studies across countries in patients with low back pain. In the Netherlands, the NIH-minimal dataset has not been translated before and measurement properties are unknown. Cross-cultural validity was tested by a formal forward-backward translation. Structural validity was tested with exploratory factor analyses (comparative fit index, Tucker-Lewis index, and root mean square error of approximation). Hypothesis testing was performed to compare subscales of the NIH dataset with the Pain Disability Index and the EurQol-5D (Pearson correlation coefficients). Internal consistency was tested with Cronbach α and test-retest reliability at 2 weeks was calculated in a subsample of patients with Intraclass Correlation Coefficients and weighted Kappa (κω). In total, 452 patients were included of which 52 were included for the test-retest study. factor analysis for structural validity pointed into the direction of a seven-factor model (Cronbach α = 0.78). Factors and total score of the NIH-minimal dataset showed fair to good correlations with Pain Disability Index (r = 0.43-0.70) and EuroQol-5D (r = -0.41 to -0.64). Reliability: test-retest reliability per item showed substantial agreement (κω=0.65). Test-retest reliability per factor was moderate to good (Intraclass Correlation Coefficient = 0.71). The Dutch language version measurement properties of the NIH-minimal were satisfactory. N/A.
Reliability of the Berg Balance Scale as a Clinical Measure of Balance in Community-Dwelling Older Adults with Mild to Moderate Alzheimer Disease: A Pilot Study.

PubMed

Muir-Hunter, Susan W; Graham, Laura; Montero Odasso, Manuel

2015-08-01

To measure test-retest and interrater reliability of the Berg Balance Scale (BBS) in community-dwelling adults with mild to moderate Alzheimer disease (AD). Method : A sample of 15 adults (mean age 80.20 [SD 5.03] years) with AD performed three balance tests: the BBS, timed up-and-go test (TUG), and Functional Reach Test (FRT). Both relative reliability, using the intra-class correlation coefficient (ICC), and absolute reliability, using standard error of measurement (SEM) and minimal detectable change (MDC95) values, were calculated; Bland-Altman plots were constructed to evaluate inter-tester agreement. The test-retest interval was 1 week. Results : For the BBS, relative reliability values were 0.95 (95% CI, 0.85-0.98) for test-retest reliability and 0.72 (95% CI, 0.31-0.91) for interrater reliability; SEM was 6.01 points and MDC95 was 16.66 points; and interrater agreement was 16.62 points. The BBS performed better in test-retest reliability than the TUG and FRT, tests with established reliability in AD. Between 33% and 50% of participants required cueing beyond standardized instructions because they were unable to remember test instructions. Conclusions : The BBS achieved relative reliability values that support its clinical utility, but MDC95 and agreement values indicate the scale has performance limitations in AD. Further research to optimize balance assessment for people with AD is required.
Effective Dynamic Range and Retest Reliability of Dark-Adapted Two-Color Fundus-Controlled Perimetry in Patients With Macular Diseases.

PubMed

Pfau, Maximilian; Lindner, Moritz; Müller, Philipp L; Birtel, Johannes; Finger, Robert P; Harmening, Wolf M; Fleckenstein, Monika; Holz, Frank G; Schmitz-Valckenberg, Steffen

2017-05-01

To determine the effective dynamic range (EDR), retest reliability, and number of discriminable steps (DS) for mesopic and dark-adapted two-color fundus-controlled perimetry (FCP) using the S-MAIA (Scotopic-Macular Integrity Assessment) "micro-perimeter." In this prospective cross-sectional study, each of the 52 eyes of 52 subjects with various macular diseases (mean age 62.0 ± 16.9 years; range, 19.1-90.1 years) underwent duplicate mesopic (achromatic stimuli, 400-800 nm), dark-adapted cyan (505 nm), and dark-adapted red (627 nm) FCP using a grid of 61 stimuli covering 18° of the central retina. The EDR, the number of DS, and the retest reliability for point-wise sensitivity (PWS) were analyzed. The effects of fixation stability, sensitivity, and age on retest reliability were examined using mixed-effects models. The EDR was 10 to 30 dB with five DS for mesopic and 4 to 17 dB with four DS for dark-adapted cyan and red testing. PWS retest reliability was good among all three types of retinal sensitivity assessments (coefficient of repeatability ±5.79, ±4.72, and ±4.77 dB, respectively) and did not depend on fixation stability or age. PWS had no effect on retest variability in dark-adapted cyan and dark-adapted red testing but had a minor effect in mesopic testing. Combined mesopic and dark-adapted two-color FCP allows for reliable topographic testing of cone and rod function in patients with various macular diseases with and without foveal fixation. Retest reliability is homogeneous across eccentricities and various degrees of scotoma depth, including zones at risk for disease progression. These reliability estimates can serve for the design of future clinical trials.
Test-Retest Reliability of a Novel Isokinetic Squat Device With Strength-Trained Athletes.

PubMed

Bridgeman, Lee A; McGuigan, Michael R; Gill, Nicholas D; Dulson, Deborah K

2016-11-01

Bridgeman, LA, McGuigan, MR, Gill, ND, and Dulson, DK. Test-retest reliability of a novel isokinetic squat device with strength-trained athletes. J Strength Cond Res 30(11): 3261-3265, 2016-The aim of this study was to investigate the test-retest reliability of a novel multijoint isokinetic squat device. The subjects in this study were 10 strength-trained athletes. Each subject completed 3 maximal testing sessions to assess peak concentric and eccentric force (N) over a 3-week period using the Exerbotics squat device. Mean differences between eccentric and concentric force across the trials were calculated. Intraclass correlation coefficients (ICCs) and coefficients of variation (CVs) for the variables of interest were calculated using an excel reliability spreadsheet. Between trials 1 and 2 an 11.0 and 2.3% increase in mean concentric and eccentric forces, respectively, was reported. Between trials 2 and 3 a 1.35% increase in the mean concentric force production and a 1.4% increase in eccentric force production was reported. The mean concentric peak force CV and ICC across the 3 trials was 10% (7.6-15.4) and 0.95 (0.87-0.98) respectively. However, the mean eccentric peak force CV and ICC across the trials was 7.2% (5.5-11.1) and 0.90 (0.76-0.97), respectively. Based on these findings it is suggested that the Exerbotics squat device shows good test-retest reliability. Therefore practitioners and investigators may consider its use to monitor changes in concentric and eccentric peak force.
Clinical usefulness of the pendulum test using a NK table to measure the spasticity of patients with brain lesions.

PubMed

Kim, Yong-Wook

2013-10-01

. [Purpose] The purpose of the present study was to investigate the clinical usefulness (reliability and validity) of the pendulum test using a Noland-Kuckhoff (NK) table with an attached electrogoniometer to measure the spasticity of patients with brain lesions. [Subjects] The subjects were 31 patients with stroke or traumatic brain injury. [Methods] The intraclass correlation coefficient (ICC) was used to verify the test-retest reliability of spasticity measures obtained using the pendulum test. Pearson's product correlation coefficient was used to examine the validity of the pendulum test using the amplitude of the patellar tendon reflex (PTR) test, an objective and quantitative measure of spasticity. [Results] The test-retest reliability was high, reflecting a significant correlation between the test and the retest (ICCs = 0.95-0.97). A significant negative correlation was found between the amplitude of the PTR test and the four variables measured in the pendulum test (r = -0.77- -0.85). [Conclusion] The pendulum test using a NK table is an objective measure of spasticity and can be used in the clinical setting in place of more expensive and complicated equipment. Further studies are needed to investigate the therapeutic effect of this method on spasticity.
Test-retest reliability of computer-based video analysis of general movements in healthy term-born infants.

PubMed

Valle, Susanne Collier; Støen, Ragnhild; Sæther, Rannei; Jensenius, Alexander Refsum; Adde, Lars

2015-10-01

A computer-based video analysis has recently been presented for quantitative assessment of general movements (GMs). This method's test-retest reliability, however, has not yet been evaluated. The aim of the current study was to evaluate the test-retest reliability of computer-based video analysis of GMs, and to explore the association between computer-based video analysis and the temporal organization of fidgety movements (FMs). Test-retest reliability study. 75 healthy, term-born infants were recorded twice the same day during the FMs period using a standardized video set-up. The computer-based movement variables "quantity of motion mean" (Qmean), "quantity of motion standard deviation" (QSD) and "centroid of motion standard deviation" (CSD) were analyzed, reflecting the amount of motion and the variability of the spatial center of motion of the infant, respectively. In addition, the association between the variable CSD and the temporal organization of FMs was explored. Intraclass correlation coefficients (ICC 1.1 and ICC 3.1) were calculated to assess test-retest reliability. The ICC values for the variables CSD, Qmean and QSD were 0.80, 0.80 and 0.86 for ICC (1.1), respectively; and 0.80, 0.86 and 0.90 for ICC (3.1), respectively. There were significantly lower CSD values in the recordings with continual FMs compared to the recordings with intermittent FMs (p<0.05). This study showed high test-retest reliability of computer-based video analysis of GMs, and a significant association between our computer-based video analysis and the temporal organization of FMs. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Interrater and Test-Retest Reliability and Minimal Detectable Change of the Balance Evaluation Systems Test (BESTest) and Subsystems With Community-Dwelling Older Adults.

PubMed

Wang-Hsu, Elizabeth; Smith, Susan S

2017-01-10

Falls are a common cause of injuries and hospital admissions in older adults. Balance limitation is a potentially modifiable factor contributing to falls. The Balance Evaluation Systems Test (BESTest), a clinical balance measure, categorizes balance into 6 underlying subsystems. Each of the subsystems is scored individually and summed to obtain a total score. The reliability of the BESTest and its individual subsystems has been reported in patients with various neurological disorders and cancer survivors. However, the reliability and minimal detectable change (MDC) of the BESTest with community-dwelling older adults have not been reported. The purposes of our study were to (1) determine the interrater and test-retest reliability of the BESTest total and subsystem scores; and (2) estimate the MDC of the BESTest and its individual subsystem scores with community-dwelling older adults. We used a prospective cohort methodological design. Community-dwelling older adults (N = 70; aged 70-94 years; mean = 85.0 [5.5] years) were recruited from a senior independent living community. Trained testers (N = 3) administered the BESTest. All participants were tested with the BESTest by the same tester initially and then retested 7 to 14 days later. With 32 of the participants, a second tester concurrently scored the retest for interrater reliability. Testers were blinded to each other's scores. Intraclass correlation coefficients [ICC(2,1)] were used to determine the interrater and test-retest reliability. Test-retest reliability was also analyzed using method error and the associated coefficients of variation (CVME). MDC was calculated using standard error of measurement. Interrater reliability (N = 32) of the BESTest total score was ICC(2, 1) = 0.97 (95% confidence interval [CI], 0.94-0.99). The ICCs for the individual subsystem scores ranged from 0.85 to 0.94. Test-retest reliability (N = 70) of the BESTest total score was ICC(2,1) = 0.93 (95% CI, 0.89-0.96). ICCs for the individual subsystem scores ranged from 0.72 to 0.89. The CVME (N = 70) of the BESTest total score was 4.1%. The CVME for the subsystem scores ranged from 5.0% to 10.7%. MDC (N = 70) for the BESTest total score at the 95% CI was 7.6%, or 8.2 points. MDC at the 95% CI for subsystem scores ranged from 11.7% to 19.0% (2.1-3.4 points). Results demonstrated generally good to excellent interrater and test-retest reliability in both the BESTest total and subsystem scores with community-dwelling older adults. The BESTest total and individual subsystem scores demonstrate good to excellent interrater and test-retest reliability with community-dwelling older adults. A change of 7.6% (8.2 points) or more in the BESTest total and a percentage change ranged from 11.7% to 19.0% (2.1-3.4 points) in the subsystem scores are suggested for clinicians to be 95% confident of true change when evaluating change in this population.
Reference values for the muscle power sprint test in 6- to 12-year-old children.

PubMed

Douma-van Riet, Danielle; Verschuren, Olaf; Jelsma, Dorothee; Kruitwagen, Cas; Smits-Engelsman, Bouwien; Takken, Tim

2012-01-01

The aims of this study were (1) to develop centile reference values for anaerobic performance of Dutch children tested using the Muscle Power Sprint Test (MPST) and (2) to examine the test-retest reliability of the MPST. Children who were developing typically (178 boys and 201 girls) and aged 6 to 12 years (mean = 8.9 years) were recruited. The MPST was administered to 379 children, and test-retest reliability was examined in 47 children. MPST scores were transformed into centile curves, which were created using generalized additive models for location, scale, and shape. Height-related reference curves were created for both genders. Excellent (intraclass correlation coefficient = 0.98) test-retest reliability was demonstrated. The reference values for the MPST of children who are developing typically and aged 6 to 12 years can serve as a clinical standard in pediatric physical therapy practice. The MPST is a reliable and practical method for determining anaerobic performance in children.
JCQ scale reliability and responsiveness to changes in manufacturing process.

PubMed

d'Errico, Angelo; Punnett, Laura; Gold, Judith E; Gore, Rebecca

2008-02-01

The job content questionnaire (JCQ) was administered to automobile manufacturing workers in two interviews, 5 years apart. Between the two interviews, the company introduced substantial changes in production technology in some production areas. The aims were: (1) to describe the impact of these changes on self-reported psychosocial exposures, and (2) to examine test-retest reliability of the JCQ scales, taking into account changes in job assignment and, for a subset of workers, physical ergonomic exposures as assessed through field observations. The study population included 790 subjects at the first and 519 at the second interview, of whom 387 were present in both. Differences in demand and control scores between interviews were analyzed by Wilcoxon matched-pairs signed-rank test. Test-retest reliability of these scales was evaluated by the intraclass correlation coefficient (ICC) and the Spearman's rho coefficient. The introduction of more automated technology produced an overall increase in job control but did not decrease psychological demand. The reliability of the control scale was low overall but increased to an acceptable level among workers who had not changed job. The demand scale had high reliability only among workers whose physical ergonomic exposures were similar on both survey occasions. These results show that 5-year test-retest reliability of self-reported psychosocial exposures is adequate among workers whose job assignment and ergonomic exposures have remained stable over time.
Test-retest reliability and predictors of unreliable reporting for a sexual behavior questionnaire for U.S. men.

PubMed

Nyitray, Alan G; Harris, Robin B; Abalos, Andrew T; Nielson, Carrie M; Papenfuss, Mary; Giuliano, Anna R

2010-12-01

Accurate knowledge about human sexual behaviors is important for increasing our understanding of human sexuality; however, there have been few studies assessing the reliability of sexual behavior questionnaires designed for community samples of adult men. A test-retest reliability study was conducted on a questionnaire completed by 334 men who had been recruited in Tucson, Arizona. Reliability coefficients and refusal rates were calculated for 39 non-sexual and sexual behavior questionnaire items. Predictors of unreliable reporting for lifetime number of female sexual partners were also assessed. Refusal rates were generally low, with slightly higher refusal rates for questions related to immigration, income, the frequency of sexual intercourse with women, lifetime number of female sexual partners, and the lifetime number of male anal sex partners. Kappa and intraclass correlation coefficients were substantial or almost perfect for all non-sexual and sexual behavior items. Reliability dropped somewhat, but was still substantial, for items that asked about household income and the men's knowledge of their sexual partners' health, including abnormal Pap tests and prior sexually transmitted diseases (STD). Age and lifetime number of female sexual partners were independent predictors of unreliable reporting while years of education was inversely associated with unreliable reporting. These findings among a community sample of adult men are consistent with other test-retest reliability studies with populations of women and adolescents.
Reliability and validity of migraine disability assessment questionnaire-Thai version (Thai-MIDAS).

PubMed

Seethong, Piman; Nimmannit, Akarin; Chaisewikul, Rungsan; Prayoonwiwat, Naraporn; Chotinaiwattarakul, Wattanachai

2013-02-01

To assess the validity and test-retest reliability of a Thai translation of the Migraine Disability Assessment (MIDAS) Questionnaire in Thai patients with migraine. Migraineurs from the Headache Clinic in Siriraj Hospital were recruited and asked to complete a 13-weeks diary and answered the Thai-MIDAS at once. Some participants were asked to provide the 2nd Thai-MIDAS in the next 2 weeks for test-retest reliability. Ninety-three patients had completed the 13-weeks diaries. Age range was 18-58 years with mean 37.69 +/- 9.60 years. All 5 items and the total score of Thai-MIDAS were moderately correlated with data from 13-weeks diary (Spearman's correlation coefficient = 0.32-0.62). The test-retest reliability of the total score of Thai-MIDAS in 30 patients demonstrated a highly reliable degree of intraclass correlation (ICC = 0.76, 95% CI 0.49-0.88). The present study reveals that the Thai-MIDAS has satisfactory validity and reliability in comparison with the original English MIDAS version.
Development, test-retest reliability and validity of the Pharmacy Value-Added Services Questionnaire (PVASQ).

PubMed

Tan, Christine L; Hassali, Mohamed A; Saleem, Fahad; Shafie, Asrul A; Aljadhey, Hisham; Gan, Vincent B

2015-01-01

(i) To develop the Pharmacy Value-Added Services Questionnaire (PVASQ) using emerging themes generated from interviews. (ii) To establish reliability and validity of questionnaire instrument. Using an extended Theory of Planned Behavior as the theoretical model, face-to-face interviews generated salient beliefs of pharmacy value-added services. The PVASQ was constructed initially in English incorporating important themes and later translated into the Malay language with forward and backward translation. Intention (INT) to adopt pharmacy value-added services is predicted by attitudes (ATT), subjective norms (SN), perceived behavioral control (PBC), knowledge and expectations. Using a 7-point Likert-type scale and a dichotomous scale, test-retest reliability (N=25) was assessed by administrating the questionnaire instrument twice at an interval of one week apart. Internal consistency was measured by Cronbach's alpha and construct validity between two administrations was assessed using the kappa statistic and the intraclass correlation coefficient (ICC). Confirmatory Factor Analysis, CFA (N=410) was conducted to assess construct validity of the PVASQ. The kappa coefficients indicate a moderate to almost perfect strength of agreement between test and retest. The ICC for all scales tested for intra-rater (test-retest) reliability was good. The overall Cronbach' s alpha (N=25) is 0.912 and 0.908 for the two time points. The result of CFA (N=410) showed most items loaded strongly and correctly into corresponding factors. Only one item was eliminated. This study is the first to develop and establish the reliability and validity of the Pharmacy Value-Added Services Questionnaire instrument using the Theory of Planned Behavior as the theoretical model. The translated Malay language version of PVASQ is reliable and valid to predict Malaysian patients' intention to adopt pharmacy value-added services to collect partial medicine supply.
Clinical Usefulness of the Pendulum Test Using a NK Table to Measure the Spasticity of Patients with Brain Lesions

PubMed Central

Kim, Yong-Wook

2013-01-01

. [Purpose] The purpose of the present study was to investigate the clinical usefulness (reliability and validity) of the pendulum test using a Noland-Kuckhoff (NK) table with an attached electrogoniometer to measure the spasticity of patients with brain lesions. [Subjects] The subjects were 31 patients with stroke or traumatic brain injury. [Methods] The intraclass correlation coefficient (ICC) was used to verify the test–retest reliability of spasticity measures obtained using the pendulum test. Pearson's product correlation coefficient was used to examine the validity of the pendulum test using the amplitude of the patellar tendon reflex (PTR) test, an objective and quantitative measure of spasticity. [Results] The test–retest reliability was high, reflecting a significant correlation between the test and the retest (ICCs = 0.95–0.97). A significant negative correlation was found between the amplitude of the PTR test and the four variables measured in the pendulum test (r = −0.77– −0.85). [Conclusion] The pendulum test using a NK table is an objective measure of spasticity and can be used in the clinical setting in place of more expensive and complicated equipment. Further studies are needed to investigate the therapeutic effect of this method on spasticity. PMID:24259775
Long-term stability of the Wechsler Intelligence Scale for Children--Fourth Edition.

PubMed

Watkins, Marley W; Smith, Lourdes G

2013-06-01

Long-term stability of the Wechsler Intelligence Scale for Children-Fourth Edition (WISC-IV; Wechsler, 2003) was investigated with a sample of 344 students from 2 school districts twice evaluated for special education eligibility at an average interval of 2.84 years. Test-retest reliability coefficients for the Verbal Comprehension Index (VCI), Perceptual Reasoning Index (PRI), Working Memory Index (WMI), Processing Speed Index (PSI), and the Full Scale IQ (FSIQ) were .72, .76, .66, .65, and .82, respectively. As predicted, the test-retest reliability coefficients for the subtests (Mdn = .56) were generally lower than the index scores (Mdn = .69) and the FSIQ (.82). On average, subtest scores did not differ by more than 1 point, and index scores did not differ by more than 2 points across the test-retest interval. However, 25% of the students earned FSIQ scores that differed by 10 or more points, and 29%, 39%, 37%, and 44% of the students earned VCI, PRI, WMI, and PSI scores, respectively, that varied by 10 or more points. Given this variability, it cannot be assumed that WISC-IV scores will be consistent across long test-retest intervals for individual students. PsycINFO Database Record (c) 2013 APA, all rights reserved.

The Reliability and Validity of Measures of Gait Variability in Community-Dwelling Older Adults

PubMed Central

Brach, Jennifer S.; Perera, Subashan; Studenski, Stephanie; Newman, Anne B.

2009-01-01

Objective To examine the test-retest reliability and concurrent validity of variability of gait characteristics. Design Cross-sectional study. Setting Research laboratory. Participants Older adults (N=558) from the Cardiovascular Health Study. Interventions Not applicable. Main Outcome Measures Gait characteristics were measured using a 4-m computerized walkway. SD determined from the steps recorded were used as the measures of variability. Intraclass correlation coefficients (ICC) were calculated to examine test-retest reliability of a 4-m walk and two 4-m walks. To establish concurrent validity, the measures of gait variability were compared across levels of health, functional status, and physical activity using independent t tests and analysis of variances. Results Gait variability measures from the two 4-m walks demonstrated greater test-retest reliability than those from the single 4-m walk (ICC=.22–.48 and ICC=.40–.63, respectively). Greater step length and stance time variability were associated with poorer health, functional status and physical activity (P<.05). Conclusions Gait variability calculated from a limited number of steps has fair to good test-retest reliability and concurrent validity. Reliability of gait variability calculated from a greater number of steps should be assessed to determine if the consistency can be improved. PMID:19061741
Validity and cross-cultural adaptation of the persian version of the oxford elbow score.

PubMed

Ebrahimzadeh, Mohammad H; Kachooei, Amir Reza; Vahedi, Ehsan; Moradi, Ali; Mashayekhi, Zeinab; Hallaj-Moghaddam, Mohammad; Azami, Mehran; Birjandinejad, Ali

2014-01-01

Oxford Elbow Score (OES) is a patient-reported questionnaire used to assess outcomes after elbow surgery. The aim of this study was to validate and adapt the OES into Persian language. After forward-backward translation of the OES into Persian, a total number of 92 patients after elbow surgeries completed the Persian OES along with the Persian DASH and SF-36. To assess test-retest reliability, 31 randomly selected patients (34%) completed the Persian OES again after three days while abstaining from all forms of therapeutic regimens. Reliability of the Persian OES was assessed by measuring intraclass correlation coefficient (ICC) for test-retest reliability and Cronbach's alpha for internal consistency. Spearman's correlation coefficient was used to test the construct validity. Cronbach's alpha coefficient was 0.92 showing excellent reliability. Cronbach's alpha for function, pain, and social-psychological subscales was 0.95, 0.86, and 0.85, respectively. Intraclass correlation coefficient (ICC) was 0.85 for the overall questionnaire and 0.90, 0.76, and 0.75 for function, pain, and social-psychological subscales, respectively. Construct validity was confirmed as the Spearman correlation between OES and DASH was 0.80. Persian OES is a valid and reliable patient-reported outcome measure to assess postsurgical elbow status in Persian speaking population.
Reliability Generalization of Scores on the Spielberger State-Trait Anxiety Inventory.

ERIC Educational Resources Information Center

Barnes, Laura L. B.; Harp, Diane; Jung, Woo Sik

2002-01-01

Conducted a reliability generalization study for the State-Trait Anxiety Inventory (C. Spielberger, 1983) by reviewing and classifying 816 research articles. Average reliability coefficients were acceptable for both internal consistency and test-retest reliability, but variation was present among the estimates. Other differences are discussed.…
Test-retest reliability and four-week changes in cardiopulmonary fitness in stroke patients: evaluation using a robotics-assisted tilt table.

PubMed

Saengsuwan, Jittima; Berger, Lucia; Schuster-Amft, Corina; Nef, Tobias; Hunt, Kenneth J

2016-09-06

Exercise testing devices for evaluating cardiopulmonary fitness in patients with severe disability after stroke are lacking, but we have adapted a robotics-assisted tilt table (RATT) for cardiopulmonary exercise testing (CPET). Using the RATT in a sample of patients after stroke, this study aimed to investigate test-retest reliability and repeatability of CPET and to prospectively investigate changes in cardiopulmonary outcomes over a period of four weeks. Stroke patients with all degrees of disability underwent 3 separate CPET sessions: 2 tests at baseline (TB1 and TB2) and 1 test at follow up (TF). TB1 and TB2 were at least 24 h apart. TB2 and TF were 4 weeks apart. A RATT equipped with force sensors in the thigh cuffs, a work rate estimation algorithm and a real-time visual feedback system was used to guide the patients' exercise work rate during CPET. Test-retest reliability and repeatability of CPET variables were analysed using paired t-tests, the intraclass correlation coefficient (ICC), the coefficient of variation (CoV), and Bland and Altman limits of agreement. Changes in cardiopulmonary fitness during four weeks were analysed using paired t-tests. Seventeen sub-acute and chronic stroke patients (age 62.7 ± 10.4 years [mean ± SD]; 8 females) completed the test sessions. The median time post stroke was 350 days. There were 4 severely disabled, 1 moderately disabled and 12 mildly disabled patients. For test-retest, there were no statistically significant differences between TB1 and TB2 for most CPET variables. Peak oxygen uptake, peak heart rate, peak work rate and oxygen uptake at the ventilatory anaerobic threshold (VAT) and respiratory compensation point (RCP) showed good to excellent test-retest reliability (ICC 0.65-0.94). For all CPET variables, CoV was 4.1-14.5 %. The mean difference was close to zero in most of the CPET variables. There were no significant changes in most cardiopulmonary performance parameters during the 4-week period (TB2 vs TF). These findings provide the first evidence of test-retest reliability and repeatability of the principal CPET variables using the novel RATT system and testing methodology, and high success rates in identification of VAT and RCP: good to excellent test-retest reliability and repeatability were found for all submaximal and maximal CPET variables. Reliability and repeatability of the main CPET parameters in stroke patients on the RATT were comparable to previous findings in stroke patients using standard exercise testing devices. The RATT has potential to be used as an alternative exercise testing device in patients who have limitations for use of standard exercise testing devices.
Test-retest reliability of a balance testing protocol with external perturbations in young healthy adults.

PubMed

Robbins, Shawn M; Caplan, Ryan M; Aponte, Daniel I; St-Onge, Nancy

2017-10-01

External perturbations are utilized to challenge balance and mimic realistic balance threats in patient populations. The reliability of such protocols has not been established. The purpose was to examine test-retest reliability of balance testing with external perturbations. Healthy adults (n=34; mean age 23 years) underwent balance testing over two visits. Participants completed ten balance conditions in which the following parameters were combined: perturbation or non-perturbation, single or double leg, and eyes open or closed. Three trials were collected for each condition. Data were collected on a force plate and external perturbations were applied by translating the plate. Force plate center of pressure (CoP) data were summarized using 13 different CoP measures. Test-retest reliability was examined using intraclass correlation coefficients (ICC) and Bland-Altman plots. CoP measures of total speed and excursion in both anterior-posterior and medial-lateral directions generally had acceptable ICC values for perturbation conditions (ICC=0.46 to 0.87); however, many other CoP measures (e.g. range, area of ellipse) had unacceptable test-retest reliability (ICC<0.70). Improved CoP measures were present on the second visit indicating a potential learning effect. Non-perturbation conditions generally produced more reliable CoP measures than perturbation conditions during double leg standing, but not single leg standing. Therefore, changes to balance testing protocols that include external perturbations should be made to improve test-retest reliability and diminish learning including more extensive participant training and increasing the number of trials. CoP measures that consider all data points (e.g. total speed) are more reliable than those that only consider a few data points. Copyright © 2017 Elsevier B.V. All rights reserved.
The influence of validity criteria on Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) test-retest reliability among high school athletes.

PubMed

Brett, Benjamin L; Solomon, Gary S

2017-04-01

Research findings to date on the stability of Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) Composite scores have been inconsistent, requiring further investigation. The use of test validity criteria across these studies also has been inconsistent. Using multiple measures of stability, we examined test-retest reliability of repeated ImPACT baseline assessments in high school athletes across various validity criteria reported in previous studies. A total of 1146 high school athletes completed baseline cognitive testing using the online ImPACT test battery at two time periods of approximately two-year intervals. No participant sustained a concussion between assessments. Five forms of validity criteria used in previous test-retest studies were applied to the data, and differences in reliability were compared. Intraclass correlation coefficients (ICCs) ranged in composite scores from .47 (95% confidence interval, CI [.38, .54]) to .83 (95% CI [.81, .85]) and showed little change across a two-year interval for all five sets of validity criteria. Regression based methods (RBMs) examining the test-retest stability demonstrated a lack of significant change in composite scores across the two-year interval for all forms of validity criteria, with no cases falling outside the expected range of 90% confidence intervals. The application of more stringent validity criteria does not alter test-retest reliability, nor does it account for some of the variation observed across previously performed studies. As such, use of the ImPACT manual validity criteria should be utilized in the determination of test validity and in the individualized approach to concussion management. Potential future efforts to improve test-retest reliability are discussed.
Health Service Quality Scale: Brazilian Portuguese translation, reliability and validity.

PubMed

Rocha, Luiz Roberto Martins; Veiga, Daniela Francescato; e Oliveira, Paulo Rocha; Song, Elaine Horibe; Ferreira, Lydia Masako

2013-01-17

The Health Service Quality Scale is a multidimensional hierarchical scale that is based on interdisciplinary approach. This instrument was specifically created for measuring health service quality based on marketing and health care concepts. The aim of this study was to translate and culturally adapt the Health Service Quality Scale into Brazilian Portuguese and to assess the validity and reliability of the Brazilian Portuguese version of the instrument. We conducted a cross-sectional, observational study, with public health system patients in a Brazilian university hospital. Validity was assessed using Pearson's correlation coefficient to measure the strength of the association between the Brazilian Portuguese version of the instrument and the SERVQUAL scale. Internal consistency was evaluated using Cronbach's alpha coefficient; the intraclass (ICC) and Pearson's correlation coefficients were used for test-retest reliability. One hundred and sixteen consecutive postoperative patients completed the questionnaire. Pearson's correlation coefficient for validity was 0.20. Cronbach's alpha for the first and second administrations of the final version of the instrument were 0.982 and 0.986, respectively. For test-retest reliability, Pearson's correlation coefficient was 0.89 and ICC was 0.90. The culturally adapted, Brazilian Portuguese version of the Health Service Quality Scale is a valid and reliable instrument to measure health service quality.
Cross-cultural adaptation, reliability and validity of the Arabic version of the reduced Western Ontario and McMaster Universities Osteoarthritis index in patients with knee osteoarthritis.

PubMed

Alghadir, Ahmad; Anwer, Shahnawaz; Iqbal, Zaheen Ahmed; Alsanawi, Hisham Abdulaziz

2016-01-01

We adapted the reduced Western Ontario and McMaster Universities Osteoarthritis (WOMAC) index for the Arabic language and tested its metric properties in patients with knee osteoarthritis (OA). One hundred and twenty-one consecutive patients who were referred for physiotherapy to the outpatient department were asked to answer the Arabic version of the reduced WOMAC index (ArWOMAC). After the completion of the ArWOMAC, the intensity of knee pain and general health status were assessed using the visual analog scale (VAS) and the 12-item short form health survey (SF-12), respectively. A second assessment was performed at least 48 h after the first session to assess test-retest reliability. The test-retest reliability was quantified using the intra-class correlation coefficient (ICC), and Cronbach's alpha was calculated to assess the internal consistency of the Arabic questionnaire. The construct validity was assessed using Spearman rank correlation coefficients. The total ArWOMAC scale and pain and function subscales were internally consistent with Cronbach's coefficient alpha of 0.91, 0.89 and 0.90, respectively. Test-retest reliability was good to excellent with ICC of 0.91, 0.89 and 0.90, respectively. SF-12 and VAS score significantly correlated with ArWOMAC index (p < 0.01), which support the construct validity. The standard error of measurement (SEM) of the total scale was 2.94, based on repeated measurements for test-retest. The minimum detectable change based on the SEM for test-retest was 8.15. The ArWOMAC index is a reliable and valid instrument for evaluating the severity of knee OA, with metric properties in agreement with the original version. Although, the reduced WOMAC index has been clinically utilized within the Saudi population, the Arabic version of this instrument is not validated for an Arab population to measure lower limb functional disability caused by OA. The Arabic version of reduced WOMAC (ArWOMAC) index is a reliable and valid scale to measure lower limb functional disability in patients with knee OA. The ArWOMAC index could be suitable in Saudi Arabia and other Arab countries where the language, culture and the life style are similar.
Translation, Cultural Adaptation and Validation of the Simple Shoulder Test to Spanish

PubMed Central

Arcuri, Francisco; Barclay, Fernando; Nacul, Ivan

2015-01-01

Background: The validation of widely used scales facilitates the comparison across international patient samples. Objective: The objective was to translate, culturally adapt and validate the Simple Shoulder Test into Argentinian Spanish. Methods: The Simple Shoulder Test was translated from English into Argentinian Spanish by two independent translators, translated back into English and evaluated for accuracy by an expert committee to correct the possible discrepancies. It was then administered to 50 patients with different shoulder conditions.Psycometric properties were analyzed including internal consistency, measured with Cronbach´s Alpha, test-retest reliability at 15 days with the interclass correlation coefficient. Results: The internal consistency, validation, was an Alpha of 0,808, evaluated as good. The test-retest reliability index as measured by intra-class correlation coefficient (ICC) was 0.835, evaluated as excellent. Conclusion: The Simple Shoulder Test translation and it´s cultural adaptation to Argentinian-Spanish demonstrated adequate internal reliability and validity, ultimately allowing for its use in the comparison with international patient samples.
The Trunk Impairment Scale - modified to ordinal scales in the Norwegian version.

PubMed

Gjelsvik, Bente; Breivik, Kyrre; Verheyden, Geert; Smedal, Tori; Hofstad, Håkon; Strand, Liv Inger

2012-01-01

To translate the Trunk Impairment Scale (TIS), a measure of trunk control in patients after stroke, into Norwegian (TIS-NV), and to explore its construct validity, internal consistency, intertester and test-retest reliability. TIS was translated according to international guidelines. The validity study was performed on data from 201 patients with acute stroke. Fifty patients with stroke and acquired brain injury were recruited to examine intertester and test-retest reliability. Construct validity was analyzed with exploratory and confirmatory factor analysis and item response theory, internal consistency with Cronbach's alpha test, and intertester and test-retest reliability with kappa and intraclass correlation coefficient tests. The back-translated version of TIS-NV was validated by the original developer. The subscale Static sitting balance was removed. By combining items from the subscales Dynamic sitting balance and Coordination, six ordinal superitems (testlets) were constructed. The TIS-NV was renamed the modified TIS-NV (TIS-modNV). After modifications the TIS-modNV fitted well to a locally dependent unidimensional item response theory model. It demonstrated good construct validity, excellent internal consistency, and high intertester and test-retest reliability for the total score. This study supports that the TIS-modNV is a valid and reliable scale for use in clinical practice and research.
The 10m incremental shuttle walk test is a highly reliable field exercise test for patients referred to cardiac rehabilitation: a retest reliability study.

PubMed

Hanson, Lisa C; Taylor, Nicholas F; McBurney, Helen

2016-09-01

To determine the retest reliability of the 10m incremental shuttle walk test (ISWT) in a mixed cardiac rehabilitation population. Participants completed two 10m ISWTs in a single session in a repeated measures study. Ten participants completed a third 10m ISWT as part of a pilot study. Hospital physiotherapy department. 62 adults aged a mean of 68 years (SD 10) referred to a cardiac rehabilitation program. Retest reliability of the 10m ISWT expressed as relative reliability and measurement error. Relative reliability was expressed in a ratio in the form of an intraclass correlation coefficient (ICC) and measurement error in the form of the standard error of measurement (SEM) and 95% confidence intervals for the group and individual. There was a high level of relative reliability over the two walks with an ICC of .99. The SEMagreement was 17m, and a change of at least 23m for the group and 54m for the individual would be required to be 95% confident of exceeding measurement error. The 10m ISWT demonstrated good retest reliability and is sufficiently reliable to be applied in practice in this population without the use of a practice test. Copyright © 2015 Chartered Society of Physiotherapy. Published by Elsevier Ltd. All rights reserved.
Reliability and validity of a questionnaire for self-assessment of complete dentures.

PubMed

Komagamine, Yuriko; Kanazawa, Manabu; Kaiba, Yoshinori; Sato, Yusuke; Minakuchi, Shunsuke

2014-05-02

Demand for complete denture treatment is expected to rise over several decades. However, to date, no questionnaire on complete dentures, as evaluated by edentulous patients, has been shown to be reliable and valid. This study sought to assess the reliability and validity of Patient's Denture Assessment (PDA), which provides a multidimensional evaluation of dentures among edentulous patients. Patients, who had new complete dentures fabricated at the University Hospital of Dentistry, Tokyo Medical and Dental University through 2009 to 2010, were enrolled. The reliability of the PDA was determined by examining internal consistency and test-retest reliability. Internal consistency for all of the question items and the six subscales was measured using Cronbach's α and average inter-item correlation coefficients among 93 participants. For 33 of these participants, test-retest reliability was determined at a 2 month-interval using the interclass correlation coefficients (ICCs) and 95% confidence interval for the summary scores and the six subscale scores. The PDA was validated in 93 participants by examining the difference in the summary score and the six subscale scores of the PDA before and after replacement with new dentures by the paired t-test. Ability to detect change was also tested in 93 patients using effect size. The Cronbach's α for the PDA ranged from 0.56 to 0.93. The average inter-item correlation coefficients ranged from 0.28 to 0.83. ICCs for the PDA ranged from 0.37 to 0.83. The paired t-test showed a significant difference between the summary score and the six subscale scores before and after replacement with new dentures (p < 0.05) and the effect size was 0.97. The PDA demonstrated good reliability by assessing internal consistency and test-retest reliability. In addition, the PDA demonstrated good validity by assessing discriminant validity. Thus, the PDA could help dentists obtain a detailed understanding of the patients' perceptions in using their dentures.
Assessing fear-avoidance beliefs in patients with cervical radiculopathy.

PubMed

Dedering, Asa; Börjesson, Tina

2013-12-01

The study sought to evaluate validity and reliability of the Fear Avoidance Beliefs Questionnaire and the Tampa Scale for Kinesiophobia in patients with cervical radiculopathy. A test-retest design was used to test stability over time in 46 patients with cervical radiculopathy. Differences between patients and healthy subjects were also evaluated comparing the patients with 41 physically active and healthy subjects. The patients answered the Fear Avoidance Beliefs Questionnaire and the Tampa Scale for Kinesiophobia twice. To test for differences between the patients and the healthy subjects, the latter answered the same questionnaires once. Questionnaires about activity, personal factors and health were also used. The test-retest reliability assessed with weighted kappa was 0.68 for the Fear Avoidance Beliefs Questionnaire and 0.45 for the Tampa Scale for Kinesiophobia. Only six of the 11 single items of the Fear Avoidance Beliefs Questionnaire and none of the single items of the Tampa Scale of Kinesiophobia showed kappa coefficients exceeding 0.60 (good reliability). Patients with cervical radiculopathy rated significantly worse on the Fear Avoidance Beliefs Questionnaire and the Tampa Scale for Kinesiophobia than the healthy subjects did. The Fear Avoidance Beliefs Questionnaire may be recommended for test-retest evaluations because 'good' reliability was found. The Tampa Scale for Kinesiophobia had only 'moderate' test-retest reliability, and this should be considered when using this scale in test-retest evaluations. Both questionnaires can discriminate between patients with cervical radiculopathy and healthy subjects. Copyright © 2012 John Wiley & Sons, Ltd.
Validity and Reliability of a New Device (WIMU®) for Measuring Hamstring Muscle Extensibility.

PubMed

Muyor, José M

2017-09-01

The aims of the current study were 1) to evaluate the validity of the WIMU ® system for measuring hamstring muscle extensibility in the passive straight leg raise (PSLR) test using an inclinometer for the criterion and 2) to determine the test-retest reliability of the WIMU ® system to measure hamstring muscle extensibility during the PSLR test. 55 subjects were evaluated on 2 separate occasions. Data from a Unilever inclinometer and WIMU ® system were collected simultaneously. Intraclass correlation coefficients (ICCs) for the validity were very high (0.983-1); a very low systematic bias (-0.21°--0.42°), random error (0.05°-0.04°) and standard error of the estimate (0.43°-0.34°) were observed (left-right leg, respectively) between the 2 devices (inclinometer and the WIMU ® system). The R 2 between the devices was 0.999 (p<0.001) in both the left and right legs. The test-retest reliability of the WIMU ® system was excellent, with ICCs ranging from 0.972-0.995, low coefficients of variation (0.01%), and a low standard error of the estimate (0.19-0.31°). The WIMU ® system showed strong concurrent validity and excellent test-retest reliability for the evaluation of hamstring muscle extensibility in the PSLR test. © Georg Thieme Verlag KG Stuttgart · New York.
Test-retest reliability of the Clinical Learning Environment, Supervision and Nurse Teacher (CLES + T) scale.

PubMed

Gustafsson, Margareta; Blomberg, Karin; Holmefur, Marie

2015-07-01

The Clinical Learning Environment, Supervision and Nurse Teacher (CLES + T) scale evaluates the student nurses' perception of the learning environment and supervision within the clinical placement. It has never been tested in a replication study. The aim of the present study was to evaluate the test-retest reliability of the CLES + T scale. The CLES + T scale was administered twice to a group of 42 student nurses, with a one-week interval. Test-retest reliability was determined by calculations of Intraclass Correlation Coefficients (ICCs) and weighted Kappa coefficients. Standard Error of Measurements (SEM) and Smallest Detectable Difference (SDD) determined the precision of individual scores. Bland-Altman plots were created for analyses of systematic differences between the test occasions. The results of the study showed that the stability over time was good to excellent (ICC 0.88-0.96) in the sub-dimensions "Supervisory relationship", "Pedagogical atmosphere on the ward" and "Role of the nurse teacher". Measurements of "Premises of nursing on the ward" and "Leadership style of the manager" had lower but still acceptable stability (ICC 0.70-0.75). No systematic differences occurred between the test occasions. This study supports the usefulness of the CLES + T scale as a reliable measure of the student nurses' perception of the learning environment within the clinical placement at a hospital. Copyright © 2015 Elsevier Ltd. All rights reserved.
Validity and reliability of the Spanish version of the DN4 (Douleur Neuropathique 4 questions) questionnaire for differential diagnosis of pain syndromes associated to a neuropathic or somatic component

PubMed Central

Perez, Concepcion; Galvez, Rafael; Huelbes, Silvia; Insausti, Joaquin; Bouhassira, Didier; Diaz, Silvia; Rejas, Javier

2007-01-01

Background This study assesses the validity and reliability of the Spanish version of DN4 questionnaire as a tool for differential diagnosis of pain syndromes associated to a neuropathic (NP) or somatic component (non-neuropathic pain, NNP). Methods A study was conducted consisting of two phases: cultural adaptation into the Spanish language by means of conceptual equivalence, including forward and backward translations in duplicate and cognitive debriefing, and testing of psychometric properties in patients with NP (peripheral, central and mixed) and NNP. The analysis of psychometric properties included reliability (internal consistency, inter-rater agreement and test-retest reliability) and validity (ROC curve analysis, agreement with the reference diagnosis and determination of sensitivity, specificity, and positive and negative predictive values in different subsamples according to type of NP). Results A sample of 164 subjects (99 women, 60.4%; age: 60.4 ± 16.0 years), 94 (57.3%) with NP (36 with peripheral, 32 with central, and 26 with mixed pain) and 70 with NNP was enrolled. The questionnaire was reliable [Cronbach's alpha coefficient: 0.71, inter-rater agreement coefficient: 0.80 (0.71–0.89), and test-retest intra-class correlation coefficient: 0.95 (0.92–0.97)] and valid for a cut-off value ≥ 4 points, which was the best value to discriminate between NP and NNP subjects. Discussion This study, representing the first validation of the DN4 questionnaire into another language different than the original, not only supported its high discriminatory value for identification of neuropathic pain, but also provided supplemental psychometric validation (i.e. test-retest reliability, influence of educational level and pain intensity) and showed its validity in mixed pain syndromes. PMID:18053212
Test-Retest Reliability and Minimal Detectable Change of the D2 Test of Attention in Patients with Schizophrenia.

PubMed

Lee, Posen; Lu, Wen-Shian; Liu, Chin-Hsuan; Lin, Hung-Yu; Hsieh, Ching-Lin

2017-12-08

The d2 Test of Attention (D2) is a commonly used measure of selective attention for patients with schizophrenia. However, its test-retest reliability and minimal detectable change (MDC) are unknown in patients with schizophrenia, limiting its utility in both clinical and research settings. The aim of the present study was to examine the test-retest reliability and MDC of the D2 in patients with schizophrenia. A rater administered the D2 on 108 patients with schizophrenia twice at a 1-month interval. Test-retest reliability was determined through the calculation of the intra-class correlation coefficient (ICC). We also carried out Bland-Altman analysis, which included a scatter plot of the differences between test and retest against their mean. Systematic biases were evaluated by use of a paired t-test. The ICCs for the D2 ranged from 0.78 to 0.94. The MDCs (MDC%) of the seven subscores were 102.3 (29.7), 19.4 (85.0), 7.2 (94.6), 21.0 (69.0), 104.0 (33.1), 105.0 (35.8), and 7.8 (47.8), which represented limited-to-acceptable random measurement error. Trends in the Bland-Altman plots of the omissions (E1), commissions (E2), and errors (E) were noted, presenting that the data had heteroscedasticity. According to the results, the D2 had good test-retest reliability, especially in the scores of TN, TN-E, and CP. For the further research, finding a way to improve the administration procedure to reduce random measurement error would be important for the E1, E2, E, and FR subscores. © The Author(s) 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Stability of scores for the Slosson Full-Range Intelligence Test.

PubMed

Williams, Thomas O; Eaves, Ronald C; Woods-Groves, Suzanne; Mariano, Gina

2007-08-01

The test-retest stability of the Slosson Full-Range Intelligence Test by Algozzine, Eaves, Mann, and Vance was investigated with test scores from a sample of 103 students. With a mean interval of 13.7 mo. and different examiners for each of the two test administrations, the test-retest reliability coefficients for the Full-Range IQ, Verbal Reasoning, Abstract Reasoning, Quantitative Reasoning, and Memory were .93, .85, .80, .80, and .83, respectively. Mean differences from the test-retest scores were not statistically significantly different for any of the scales. Results suggest that Slosson scores are stable over time even when different examiners administer the test.
Reliability and validity of the Incontinence Quiz-Turkish version.

PubMed

Kara, Kerime C; Çıtak Karakaya, İlkim; Tunalı, Nur; Karakaya, Mehmet G

2018-01-01

The aim of this study was to investigate the reliability and validity of the Turkish version of the Incontinence Quiz, which was developed by Branch et al. (1994), to assess women's knowledge of and attitudes toward urinary incontinence. Comprehensibility of the Turkish version of the 14-item Incontinence Quiz, which was prepared following translation-back translation procedures, was tested on a pilot group of eight women, and its internal reliability, test-retest reliability and construct validity were assessed in 150 women who attended the gynecology clinics of three hospitals in İçel, Turkey. Physical and sociodemographic characteristics and presence of incontinence complaints were also recorded. Data were analyzed at the 0.05 alpha level, using SPSS version 22. The scale had good reliability and validity. The internal reliability coefficient (Cronbach α) was 0.80, test-retest correlation coefficients were 0.83-0.94; and with regard to construct validity, Kaiser-Meyer-Olkin coefficient was 0.76 and Barlett sphericity test was 562.777 (P = 0.000). Turkish version of the Incontinence Quiz had a four-factor structure, with Eigenvalues ranging from 1.17 to 4.08. The Incontinence Quiz-Turkish version is a highly comprehensible, reliable and valid scale, which may be used to assess Turkish-speaking women's knowledge of and attitudes toward urinary incontinence. © 2017 Japan Society of Obstetrics and Gynecology.
Validity and Reliability of the Turkish Version of the DSM-5 Posttraumatic Stress Symptom Severity Scale-Child Form.

PubMed

Yalin Sapmaz, Şermin; Ergin, Dilek; Özek Erkuran, Handan; Şen Celasin, Nesrin; Öztürk, Masum; Karaarslan, Duygu; Köroğlu, Ertuğrul; Aydemir, Ömer

2017-09-01

This study assessed the validity and reliability of the Turkish version of the DSM-5 Posttraumatic Stress Symptom Severity Scale-Child Form for use among the Turkish population. The study group consisted of 30 patients that had been treated in a child psychiatry unit and diagnosed with posttraumatic stress disorder and 83 healthy volunteers that were attending middle or high school during the study period. For reliability analyses, the internal consistency coefficient and the test-retest correlation coefficient were measured. For validity analyses, the exploratory factor analysis and correlation analysis with the Child Posttraumatic Stress Reaction Index for concurrent validity were measured. The Cronbach's alpha (the internal consistency coefficient) of the scale was 0.909, and the test-retest correlation coefficient was 0.663. One factor that could explain 58.5% of the variance was obtained and was congruent with the original construct of the scale. As for concurrent validity, the scale showed high correlation with the Child Posttraumatic Stress Reaction Index. It was concluded that the Turkish version of the DSM-5 Posttraumatic Stress Symptom Severity Scale-Child Form can be used as a valid and reliable tool.

Inter-vender and test-retest reliabilities of resting-state functional magnetic resonance imaging: Implications for multi-center imaging studies.

PubMed

An, Hyeong Su; Moon, Won-Jin; Ryu, Jae-Kyun; Park, Ju Yeon; Yun, Won Sung; Choi, Jin Woo; Jahng, Geon-Ho; Park, Jang-Yeon

2017-12-01

This prospective multi-center study aimed to evaluate the inter-vendor and test-retest reliabilities of resting-state functional magnetic resonance imaging (RS-fMRI) by assessing the temporal signal-to-noise ratio (tSNR) and functional connectivity. Study included 10 healthy subjects and each subject was scanned using three 3T MR scanners (GE Signa HDxt, Siemens Skyra, and Philips Achieva) in two sessions. The tSNR was calculated from the time course data. Inter-vendor and test-retest reliabilities were assessed with intra-class correlation coefficients (ICCs) derived from variant component analysis. Independent component analysis was performed to identify the connectivity of the default-mode network (DMN). In result, the tSNR for the DMN was not significantly different among the GE, Philips, and Siemens scanners (P=0.638). In terms of vendor differences, the inter-vendor reliability was good (ICC=0.774). Regarding the test-retest reliability, the GE scanner showed excellent correlation (ICC=0.961), while the Philips (ICC=0.671) and Siemens (ICC=0.726) scanners showed relatively good correlation. The DMN pattern of the subjects between the two sessions for each scanner and between three scanners showed the identical patterns of functional connectivity. The inter-vendor and test-retest reliabilities of RS-fMRI using different 3T MR scanners are good. Thus, we suggest that RS-fMRI could be used in multicenter imaging studies as a reliable imaging marker. Copyright © 2017 Elsevier Inc. All rights reserved.
Test-retest reliability of the irrational performance beliefs inventory.

PubMed

Turner, M J; Slater, M J; Dixon, J; Miller, A

2018-02-01

The irrational performance beliefs inventory (iPBI) was developed to measure irrational beliefs within performance domains such as sport, academia, business, and the military. Past research indicates that the iPBI has good construct, concurrent, and predictive validity, but the test-retest reliability of the iPBI has not yet been examined. Therefore, in the present study the iPBI was administered to university sport and exercise students (n = 160) and academy soccer athletes (n = 75) at three-time points. Time point two occurred 7 days after time point one, and time point three occurred 21 days after time point two. In addition, social desirability was also measured. Repeated-measures MANCOVAs, intra-class coefficients, and Pearson's (r) correlations demonstrate that the iPBI has good test-retest reliability, with iPBI scores remaining stable across the three-time points. Pearson's correlation coefficients revealed no relationships between the iPBI and social desirability, indicating that the iPBI is not highly susceptible to response bias. The results are discussed with reference to the continued usage and development of the iPBI, and future research recommendations relating to the investigation of irrational performance beliefs are proposed.
Reliability of the Test of Integrated Language and Literacy Skills (TILLS).

PubMed

Mailend, Marja-Liisa; Plante, Elena; Anderson, Michele A; Applegate, E Brooks; Nelson, Nickola W

2016-07-01

As new standardized tests become commercially available, it is critical that clinicians have access to the information about a test's psychometric properties, including aspects of reliability. The purpose of the three studies reported in this article was to investigate the reliability of a new test, the Test of Integrated Language and Literacy Skills (TILLS), with consideration of both internal and external sources of measurement error. The TILLS was administered to children aged 6;0-18;11 years. The participants varied in terms of their language and literacy skills and included children with typical language development as well as those diagnosed with language or learning disability. The sample of children also varied in terms of their racial and socioeconomic backgrounds. Study 1 (N = 1056) assessed the internal consistency of TILLS calculating the coefficient omega for each subtest. Study 2 (N = 103) and Study 3 (N = 39) used the intra-class correlation coefficients to report on test-retest and inter-rater reliability respectively. The results indicate strong internal consistency and inter-rater reliability for all subtests of TILLS. The test-retest reliability was strong for all but one subtest, for which the intra-class correlation coefficient was in the acceptable range. This article provides clinicians with essential scientific information that supports the internal and external reliability of a new test of oral and written language skills, the TILLS. Information about reliability is critical for guiding the selection of an appropriate diagnostic tool amongst a number of options. © 2016 Royal College of Speech and Language Therapists.
The Assessment of Reliability Under Range Restriction: A Comparison of [Alpha], [Omega], and Test-Retest Reliability for Dichotomous Data

ERIC Educational Resources Information Center

Fife, Dustin A.; Mendoza, Jorge L.; Terry, Robert

2012-01-01

Though much research and attention has been directed at assessing the correlation coefficient under range restriction, the assessment of reliability under range restriction has been largely ignored. This article uses item response theory to simulate dichotomous item-level data to assess the robustness of KR-20 ([alpha]), [omega], and test-retest…
Reliability of the Serbian version of the International Physical Activity Questionnaire for older adults.

PubMed

Milanović, Zoran; Pantelić, Saša; Trajković, Nebojša; Jorgić, Bojan; Sporiš, Goran; Bratić, Milovan

2014-01-01

The purpose of this study was to determine the test-retest reliability of the International Physical Activity Questionnaire (IPAQ) for older adults in Serbia. Six hundred and sixty older adults (352 men, 53%; 308 women, 47%; mean age 67.65±5.76 years) participated in the study. To examine test-retest reliability, the participants were asked to complete the IPAQ on two occasions 2 weeks apart. Moderate reliability was observed between the repeated IPAQ, with intraclass correlation coefficients ranging from 0.53 to 0.91. The least reliability was established in leisure time activity (0.53) and the most reliability in the transport domain (0.91). Men and women had similar intraclass correlation coefficients for total physical activity (0.71 versus 0.74, respectively), while the biggest difference was obtained for housework in men (0.68) and in women (0.90). Our study shows that the long version of the IPAQ is a reliable instrument for assessing physical activity levels in older adults and that it may be useful for generating internationally comparable data.
Development of a short version of the new brief job stress questionnaire.

PubMed

Inoue, Akiomi; Kawakami, Norito; Shimomitsu, Teruichi; Tsutsumi, Akizumi; Haratani, Takashi; Yoshikawa, Toru; Shimazu, Akihito; Odagiri, Yuko

2014-01-01

This study was aimed to investigate the test-retest reliability and validity of a short version of the New Brief Job Stress Questionnaire (New BJSQ) whose scales have one item selected from a standard version. Based on the results from an anonymous web-based questionnaire of occupational health staffs and personnel/labor staffs, we selected higher-priority scales from the standard version. After selecting one item with highest item-total correlation coefficient from each scale, a 23-item questionnaire was developed. A nationally representative survey was administered to Japanese employees (n=1,633) to examine test-retest reliability and validity. Most scales (or items) showed modest but adequate levels of test-retest reliability (r>0.50). Furthermore, job demands and job resources scales (or items) were associated with mental and physical stress reactions while job resources scales (or items) were also associated with positive outcomes. These findings provided a piece of evidence that the short version of the New BJSQ is reliable and valid.
Development of a Short Version of the New Brief Job Stress Questionnaire

PubMed Central

INOUE, Akiomi; KAWAKAMI, Norito; SHIMOMITSU, Teruichi; TSUTSUMI, Akizumi; HARATANI, Takashi; YOSHIKAWA, Toru; SHIMAZU, Akihito; ODAGIRI, Yuko

2014-01-01

This study was aimed to investigate the test-retest reliability and validity of a short version of the New Brief Job Stress Questionnaire (New BJSQ) whose scales have one item selected from a standard version. Based on the results from an anonymous web-based questionnaire of occupational health staffs and personnel/labor staffs, we selected higher-priority scales from the standard version. After selecting one item with highest item-total correlation coefficient from each scale, a 23-item questionnaire was developed. A nationally representative survey was administered to Japanese employees (n=1,633) to examine test-retest reliability and validity. Most scales (or items) showed modest but adequate levels of test-retest reliability (r>0.50). Furthermore, job demands and job resources scales (or items) were associated with mental and physical stress reactions while job resources scales (or items) were also associated with positive outcomes. These findings provided a piece of evidence that the short version of the New BJSQ is reliable and valid. PMID:24975108
A validation study of the Keyboard Personal Computer Style instrument (K-PeCS) for use with children.

PubMed

Green, Dido; Meroz, Anat; Margalit, Adi Edit; Ratzon, Navah Z

2012-11-01

This study examines a potential instrument for measurement of typing postures of children. This paper describes inter-rater, test-retest reliability and concurrent validity of the Keyboard Personal Computer Style instrument (K-PeCS), an observational measurement of postures and movements during keyboarding, for use with children. Two trained raters independently rated videos of 24 children (aged 7-10 years). Six children returned one week later for identifying test-retest reliability. Concurrent validity was assessed by comparing ratings obtained using the K-PECS to scores from a 3D motion analysis system. Inter-rater reliability was moderate to high for 12 out of 16 items (Kappa: 0.46 to 1.00; correlation coefficients: 0.77-0.95) and test-retest reliability varied across items (Kappa: 0.25 to 0.67; correlation coefficients: r = 0.20 to r = 0.95). Concurrent validity compared favourably across arm pathlength, wrist extension and ulnar deviation. In light of the limitations of other tools the K-PeCS offers a fairly affordable, reliable and valid instrument to address the gap for measurement of typing styles of children, despite the shortcomings of some items. However further research is required to refine the instrument for use in evaluating typing among children. Copyright © 2012 Elsevier Ltd and The Ergonomics Society. All rights reserved.
One-year test-retest reliability of intrinsic connectivity network fMRI in older adults

PubMed Central

Guo, Cong C.; Kurth, Florian; Zhou, Juan; Mayer, Emeran A.; Eickhoff, Simon B; Kramer, Joel H.; Seeley, William W.

2014-01-01

“Resting-state” or task-free fMRI can assess intrinsic connectivity network (ICN) integrity in health and disease, suggesting a potential for use of these methods as disease-monitoring biomarkers. Numerous analytical options are available, including model-driven ROI-based correlation analysis and model-free, independent component analysis (ICA). High test-retest reliability will be a necessary feature of a successful ICN biomarker, yet available reliability data remains limited. Here, we examined ICN fMRI test-retest reliability in 24 healthy older subjects scanned roughly one year apart. We focused on the salience network, a disease-relevant ICN not previously subjected to reliability analysis. Most ICN analytical methods proved reliable (intraclass coefficients > 0.4) and could be further improved by wavelet analysis. Seed-based ROI correlation analysis showed high map-wise reliability, whereas graph theoretical measures and temporal concatenation group ICA produced the most reliable individual unit-wise outcomes. Including global signal regression in ROI-based correlation analyses reduced reliability. Our study provides a direct comparison between the most commonly used ICN fMRI methods and potential guidelines for measuring intrinsic connectivity in aging control and patient populations over time. PMID:22446491
Development and evaluation of the OHCITIES instrument: assessing alcohol urban environments in the Heart Healthy Hoods project

PubMed Central

Sureda, Xisca; Espelt, Albert; Villalbí, Joan R; Cebrecos, Alba; Baranda, Lucía; Pearce, Jamie; Franco, Manuel

2017-01-01

Objectives To describe the development and test–retest reliability of OHCITIES, an instrument characterising alcohol urban environment in terms of availability, promotion and signs of consumption. Design This study involved: (1) developing the conceptual framework for alcohol urban environment by means of literature reviewing and previous alcohol environment research experience; (2) pilot testing and redesigning the instrument; (3) instrument digitalisation; (4) instrument evaluation using test–retest reliability. Setting Data for testing the reliability of the instrument were collected in seven census sections in Madrid in 2016 by two observers. Primary and secondary outcome measures We computed per cent agreement and Cohen’s kappa coefficients to estimate inter-rater and test–retest reliability for alcohol outlet environment measures. We calculated interclass coefficients and their 95% CIs to provide a measure of inter-rater reliability for signs of alcohol consumption measures. Results We collected information on 92 on-premise and 24 off-premise alcohol outlets identified in the studied areas about availability, accessibility and promotion of alcohol. Most per cent-agreement values for alcohol measures in on-premise and off-premise alcohol outlets were greater than 80%, and inter-rater and test–retest reliability values were generally above 0.80. Observers identified 26 streets and 3 public squares with signs of alcohol consumption. Intraclass correlation coefficient between observers for any type of signs of alcohol consumption was 0.50 (95% CI −0.09 to 0.77). Few items promoting alcohol unrelated to alcohol outlets were found on public spaces. Conclusions The OHCITIES instrument is a reliable instrument to characterise alcohol urban environment. This instrument might be used to understand how alcohol environment associates with alcohol behaviours and its related health outcomes, and can help in the design and evaluation of policies to reduce the harm caused by alcohol. PMID:28982829
A Review and Comparison of the Reliabilities of the MMPI-2, MCMI-III, and PAI Presented in Their Respective Test Manuals

ERIC Educational Resources Information Center

Wise, Edward A.; Streiner, David L.; Walfish, Steven

2010-01-01

This article provides a review of the literature to determine the most frequently used personality tests. Based on this review, internal consistency and test-retest reliability coefficients from the test manuals for the Minnesota Multiphasic Personality Inventory-2 (MMPI-2), Millon Clinical Multiaxial Inventory-III (MCMI-III), and Personality…
Test-retest Reliability in Reporting the Pain Induced by a Pain Provocation Test: Further Validation of a Novel Approach for Pain Drawing Acquisition and Analysis.

PubMed

Leoni, Diego; Falla, Deborah; Heitz, Carolin; Capra, Gianpiero; Clijsen, Ron; Egloff, Michele; Cescon, Corrado; Baeyens, Jean-Pierre; Barbero, Marco

2017-02-01

Pain drawings (PD) are frequently used in research to illustrate the pain response to pain provocation tests. However, there is a lack of data on the reliability in defining the extent and location of pain. We investigated the test-retest reliability in reporting an acute painful sensation induced by a pain provocation test using a novel approach for PD acquisition and analysis in healthy volunteers. Forty healthy volunteers participated. Each participant underwent 2 upper limb neurodynamic tests 1 (ULNT1), once to the point of pain onset (PO) and once until the point of submaximal pain (SP). After each ULNT1, participants completed 2 consecutive PD with an interval of 1 minute. Custom software was used to quantify the pain extent and analyze the pain overlap. The test-retest reliability of pain extent was examined using Intraclass Correlation Coefficient (ICC 2,1 ) and Bland-Altman plots. Pain location reliability was examined using the Jaccard similarity coefficient (JSC). The ICC values for PO and SP were 0.98 (95% CI: 0.96-0.99) and 0.97 (95% CI: 0.95-0.98), respectively. The mean difference and 95% limits of agreement (± 1.96 SD) in the Bland-Altman plots were 14 pixels (-1080;1110) for PO, and 145 (-1610;1900) for SP. The median JSCs (Q1;Q3) were 0.73 (0.64;0.80) for PO and 0.76 (0.65;0.79) for SP. Pain drawings is a reliable instrument to investigate pain extent and pain location in healthy individuals experiencing an acute painful sensation induced by a pain provocation test. © 2016 World Institute of Pain.
The efficiency of simultaneous binaural ocular vestibular evoked myogenic potentials: a comparative study with monaural acoustic stimulation in healthy subjects.

PubMed

Kim, Min-Beom; Ban, Jae Ho

2012-12-01

To evaluate the test-retest reliability and convenience of simultaneous binaural acoustic-evoked ocular vestibular evoked myogenic potentials (oVEMP). Thirteen healthy subjects with no history of ear diseases participated in this study. All subjects underwent oVEMP test with both separated monaural acoustic stimulation and simultaneous binaural acoustic stimulation. For evaluating test-retest reliability, three repetitive sessions were performed in each ear for calculating the intraclass correlation coefficient (ICC) for both monaural and binaural tests. We analyzed data from the biphasic n1-p1 complex, such as latency of peak, inter-peak amplitude, and asymmetric ratio of amplitude in both ears. Finally, we checked the total time required to complete each test for evaluating test convenience. No significant difference was observed in amplitude and asymmetric ratio in comparison between monaural and binaural oVEMP. However, latency was slightly delayed in binaural oVEMP. In test-retest reliability analysis, binaural oVEMP showed excellent ICC values ranging from 0.68 to 0.98 in latency, asymmetric ratio, and inter-peak amplitude. Additionally, the test time was shorter in binaural than monaural oVEMP. oVEMP elicited from binaural acoustic stimulation yields similar satisfactory results as monaural stimulation. Further, excellent test-retest reliability and shorter test time were achieved in binaural than in monaural oVEMP.
The intra-individual reproducibility of flash-evoked potentials in a sample of children.

PubMed

Schellberg, D; Gasser, T; Köhler, W

1987-07-01

Visual evoked potentials (VEPs) to flash stimuli were recorded twice from 26 children aged 10-13 years, with an intersession interval of about 10 months. Test-retest reliability was poor for recordings taken from scalp locations overlying non-specific cortex and somewhat better for specific cortex. The size of consistency coefficients (i.e. correlations within session) showed that noise and artefacts were not the decisive factors which lower reliability. A comparison with retest correlations of broad band parameters of the EEG at rest for the same sample showed, to our surprise, smaller retest reliability for VEP parameters. Variability of the VEP in children over time seems to be a substantial as its well-known inter-individual variability.
Test-retest reliability of the Military Pre-training Questionnaire.

PubMed

Robinson, M; Stokes, K; Bilzon, J; Standage, M; Brown, P; Thompson, D

2010-09-01

Musculoskeletal injuries are a significant cause of morbidity during military training. A brief, inexpensive and user-friendly tool that demonstrates reliability and validity is warranted to effectively monitor the relationship between multiple predictor variables and injury incidence in military populations. To examine the test-retest reliability of the Military Pre-training Questionnaire (MPQ), designed specifically to assess risk factors for injury among military trainees across five domains (physical activity, injury history, diet, alcohol and smoking). Analyses were based on a convenience sample of 58 male British Army trainees. Kappa (kappa), weighted kappa (kappa(w)) and intraclass correlation coefficients (ICC) were used to evaluate the 2-week test-retest reliability of the MPQ. For index measures constituting the assessment of a given construct, internal consistency was assessed by Cronbach's alpha (alpha) coefficients. Reliability of individual items ranged from poor to almost perfect (kappa range = 0.45-0.86; kappa(w) range = 0.11-0.91; ICC range = 0.34-0.86) with most items demonstrating moderate reliability. Overall scores related to physical activity, diet, alcohol and smoking constructs were reliable between both administrations (ICC = 0.63-0.85). Support for the internal consistency of the incorporated alcohol (alpha = 0.78) and cigarette (alpha = 0.75) scales was also provided. The MPQ is a reliable self-report instrument for assessing multiple injury-related risk factors during initial military training. Further assessment of the psychometric properties of the MPQ (e.g. different types of validity) with military populations/samples will support its interpretation and use in future surveillance and epidemiological studies.
Using a Web-Based Approach to Assess Test-Retest Reliability of the "Hypertension Self-Care Profile" Tool in an Asian Population: A Validation Study.

PubMed

Koh, Yi Ling Eileen; Lua, Yi Hui Adela; Hong, Liyue; Bong, Huey Shin Shirley; Yeo, Ling Sui Jocelyn; Tsang, Li Ping Marianne; Ong, Kai Zhi; Wong, Sook Wai Samantha; Tan, Ngiap Chuan

2016-03-01

Essential hypertension often requires affected patients to self-manage their condition most of the time. Besides seeking regular medical review of their life-long condition to detect vascular complications, patients have to maintain healthy lifestyles in between physician consultations via diet and physical activity, and to take their medications according to their prescriptions. Their self-management ability is influenced by their self-efficacy capacity, which can be assessed using questionnaire-based tools. The "Hypertension Self-Care Profile" (HTN-SCP) is 1 such questionnaire assessing self-efficacy in the domains of "behavior," "motivation," and "self-efficacy." This study aims to determine the test-retest reliability of HTN-SCP in an English-literate Asian population using a web-based approach. Multiethnic Asian patients, aged 40 years and older, with essential hypertension were recruited from a typical public primary care clinic in Singapore. The investigators guided the patients to fill up the web-based 60-item HTN-SCP in English using a tablet or smartphone on the first visit and refilled the instrument 2 weeks later in the retest. Internal consistency and test-retest reliability were evaluated using Cronbach's Alpha and intraclass correlation coefficients (ICC), respectively. The t test was used to determine the relationship between the overall HTN-SCP scores of the patients and their self-reported self-management activities. A total of 160 patients completed the HTN-SCP during the initial test, from which 71 test-retest responses were completed. No floor or ceiling effect was found for the scores for the 3 subscales. Cronbach's Alpha coefficients were 0.857, 0.948, and 0.931 for "behavior," "motivation," and "self-efficacy" domains respectively, indicating high internal consistency. The item-total correlation ranges for the 3 scales were from 0.105 to 0.656 for Behavior, 0.401 to 0.808 for Motivation, 0.349 to 0.789 for Self-efficacy. The corresponding ICC scores of 0.671, 0.762, and 0.720 for these respective domains showed good test-retest reliability. The correlation of the HTN-SCP scores and patients' reported self-management measures were significant, except for keeping their food diary. HTN-SCP showed satisfactory internal consistency and test-retest reliability in an English literate Asian population. A web-based approach is feasible if similar studies are needed to validate its translated versions of the tool for wider application in the local multilingual population.
Development, test-retest reliability and validity of the Pharmacy Value-Added Services Questionnaire (PVASQ)

PubMed Central

Tan, Christine L.; Hassali, Mohamed A.; Saleem, Fahad; Shafie, Asrul A.; Aljadhey, Hisham; Gan, Vincent B.

2015-01-01

Objective: (i) To develop the Pharmacy Value-Added Services Questionnaire (PVASQ) using emerging themes generated from interviews. (ii) To establish reliability and validity of questionnaire instrument. Methods: Using an extended Theory of Planned Behavior as the theoretical model, face-to-face interviews generated salient beliefs of pharmacy value-added services. The PVASQ was constructed initially in English incorporating important themes and later translated into the Malay language with forward and backward translation. Intention (INT) to adopt pharmacy value-added services is predicted by attitudes (ATT), subjective norms (SN), perceived behavioral control (PBC), knowledge and expectations. Using a 7-point Likert-type scale and a dichotomous scale, test-retest reliability (N=25) was assessed by administrating the questionnaire instrument twice at an interval of one week apart. Internal consistency was measured by Cronbach’s alpha and construct validity between two administrations was assessed using the kappa statistic and the intraclass correlation coefficient (ICC). Confirmatory Factor Analysis, CFA (N=410) was conducted to assess construct validity of the PVASQ. Results: The kappa coefficients indicate a moderate to almost perfect strength of agreement between test and retest. The ICC for all scales tested for intra-rater (test-retest) reliability was good. The overall Cronbach’ s alpha (N=25) is 0.912 and 0.908 for the two time points. The result of CFA (N=410) showed most items loaded strongly and correctly into corresponding factors. Only one item was eliminated. Conclusions: This study is the first to develop and establish the reliability and validity of the Pharmacy Value-Added Services Questionnaire instrument using the Theory of Planned Behavior as the theoretical model. The translated Malay language version of PVASQ is reliable and valid to predict Malaysian patients’ intention to adopt pharmacy value-added services to collect partial medicine supply. PMID:26445622
Reliability Measure of a Clinical Test: Appreciation of Music in Cochlear Implantees (AMICI)

PubMed Central

Cheng, Min-Yu; Spitzer, Jaclyn B.; Shafiro, Valeriy; Sheft, Stanley; Mancuso, Dean

2014-01-01

Purpose The goals of this study were (1) to investigate the reliability of a clinical music perception test, Appreciation of Music in Cochlear Implantees (AMICI), and (2) examine associations between the perception of music and speech. AMICI was developed as a clinical instrument for assessing music perception in persons with cochlear implants (CIs). The test consists of four subtests: (1) music versus environmental noise discrimination, (2) musical instrument identification (closed-set), (3) musical style identification (closed-set), and (4) identification of musical pieces (open-set). To be clinically useful, it is crucial for AMICI to demonstrate high test-retest reliability, so that CI users can be assessed and retested after changes in maps or programming strategies. Research Design Thirteen CI subjects were tested with AMICI for the initial visit and retested again 10–14 days later. Two speech perception tests (consonant-nucleus-consonant [CNC] and Bamford-Kowal-Bench Speech-in-Noise [BKB-SIN]) were also administered. Data Analysis Test-retest reliability and equivalence of the test’s three forms were analyzed using paired t-tests and correlation coefficients, respectively. Correlation analysis was also conducted between results from the music and speech perception tests. Results Results showed no significant difference between test and retest (p > 0.05) with adequate power (0.9) as well as high correlations between the three forms (Forms A and B, r = 0.91; Forms A and C, r = 0.91; Forms B and C, r = 0.95). Correlation analysis showed high correlation between AMICI and BKB-SIN (r = −0.71), and moderate correlation between AMICI and CNC (r = 0.4). Conclusions The study showed AMICI is highly reliable for assessing musical perception in CI users. PMID:24384082
Test-retest reliability and construct validity of the ENERGY-parent questionnaire on parenting practices, energy balance-related behaviours and their potential behavioural determinants: the ENERGY-project.

PubMed

Singh, Amika S; Chinapaw, Mai J M; Uijtdewilligen, Léonie; Vik, Froydis N; van Lippevelde, Wendy; Fernández-Alvira, Juan M; Stomfai, Sarolta; Manios, Yannis; van der Sluijs, Maria; Terwee, Caroline; Brug, Johannes

2012-08-13

Insight in parental energy balance-related behaviours, their determinants and parenting practices are important to inform childhood obesity prevention. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. The objective of the current study was to examine the test-retest reliability and construct validity of the parent questionnaire used in the ENERGY-project, assessing parental energy balance-related behaviours, their determinants, and parenting practices among parents of 10-12 year old children. We collected data among parents (n = 316 in the test-retest reliability study; n = 109 in the construct validity study) of 10-12 year-old children in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent interview was assessed using ICC and percentage agreement.All but one item showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Construct validity appeared to be good to excellent for 92 out of 121 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 29 items, construct validity was moderate for 24 and poor for 5 items. The reliability and construct validity of the items of the ENERGY-parent questionnaire on multiple energy balance-related behaviours, their potential determinants, and parenting practices appears to be good. Based on the results of the validity study, we strongly recommend adapting parts of the ENERGY-parent questionnaire if used in future research.
The Screening Test for Emotional Problems--Teacher-Report Version (Step-T): Studies of Reliability and Validity

ERIC Educational Resources Information Center

Erford, Bradley T.; Butler, Caitlin; Peacock, Elizabeth

2015-01-01

The Screening Test for Emotional Problems-Teacher Version (STEP-T) was designed to identify students aged 7-17 years with wide-ranging emotional disturbances. Coefficients alpha and test-retest reliability were adequate for all subscales except Anxiety. The hypothesized five-factor model fit the data very well and external aspects of validity were…

Evaluating test-retest reliability in patient-reported outcome measures for older people: A systematic review.

PubMed

Park, Myung Sook; Kang, Kyung Ja; Jang, Sun Joo; Lee, Joo Yun; Chang, Sun Ju

2018-03-01

This study aimed to evaluate the components of test-retest reliability including time interval, sample size, and statistical methods used in patient-reported outcome measures in older people and to provide suggestions on the methodology for calculating test-retest reliability for patient-reported outcomes in older people. This was a systematic literature review. MEDLINE, Embase, CINAHL, and PsycINFO were searched from January 1, 2000 to August 10, 2017 by an information specialist. This systematic review was guided by both the Preferred Reporting Items for Systematic Reviews and Meta-Analyses checklist and the guideline for systematic review published by the National Evidence-based Healthcare Collaborating Agency in Korea. The methodological quality was assessed by the Consensus-based Standards for the selection of health Measurement Instruments checklist box B. Ninety-five out of 12,641 studies were selected for the analysis. The median time interval for test-retest reliability was 14days, and the ratio of sample size for test-retest reliability to the number of items in each measure ranged from 1:1 to 1:4. The most frequently used statistical methods for continuous scores was intraclass correlation coefficients (ICCs). Among the 63 studies that used ICCs, 21 studies presented models for ICC calculations and 30 studies reported 95% confidence intervals of the ICCs. Additional analyses using 17 studies that reported a strong ICC (>0.09) showed that the mean time interval was 12.88days and the mean ratio of the number of items to sample size was 1:5.37. When researchers plan to assess the test-retest reliability of patient-reported outcome measures for older people, they need to consider an adequate time interval of approximately 13days and the sample size of about 5 times the number of items. Particularly, statistical methods should not only be selected based on the types of scores of the patient-reported outcome measures, but should also be described clearly in the studies that report the results of test-retest reliability. Copyright © 2017 Elsevier Ltd. All rights reserved.
Test-retest reliability and responsiveness of the Barthel Index-based Supplementary Scales in patients with stroke.

PubMed

Lee, Ya-Chen; Yu, Wan-Hui; Hsueh, I-Ping; Chen, Sheng-Shiung; Hsieh, Ching-Lin

2017-10-01

A lack of evidence on the test-retest reliability and responsiveness limits the utility of the BI-based Supplementary Scales (BI-SS) in both clinical and research settings. To examine the test-retest reliability and responsiveness of the BI-based Supplementary Scales (BI-SS) in patients with stroke. A repeated-assessments design (1 week apart) was used to examine the test-retest reliability of the BI-SS. For the responsiveness study, the participants were assessed with the BI-SS and BI (treated as an external criterion) at admission to and discharge from rehabilitation wards. Seven outpatient rehabilitation units and one inpatient rehabilitation unit. Outpatients with chronic stroke. Eighty-four outpatients with chronic stroke participated in the test-retest reliability study. Fifty-seven inpatients completed baseline and follow-up assessments in the responsiveness study. For the test-retest reliability study, the values of the intra-class correlation coefficient and the overall percentage of minimal detectable change for the Ability Scale and Self-perceived Difficulty Scale were 0.97, 12.8%, and 0.78, 35.8%, respectively. For the responsiveness study, the standardized effect size and standardized response mean (representing internal responsiveness) of the Ability Scale and Self-perceived Difficulty Scale were 1.17 and 1.56, and 0.78 and 0.89, respectively. Regarding external responsiveness, the change in score of the Ability Scale had significant and moderate association with that of the BI (r=0.61, P<0.001). The change in score of the Self-perceived Difficulty Scale had non-significant and weak association with that of the BI (r=0.23, P=0.080). The Ability Scale of the BI-SS has satisfactory test-retest reliability and sufficient responsiveness for patients with stroke. However, the Self-perceived Difficulty Scale of the BI-SS has substantial random measurement error and insufficient external responsiveness, which may affect its utility in clinical settings. The findings of this study provide empirical evidence of psychometric properties of the BI-SS for assessing ability and self-perceived difficulty of ADL in patients with stroke.
Test-Retest Reliability of fMRI Brain Activity during Memory Encoding

PubMed Central

Brandt, David J.; Sommer, Jens; Krach, Sören; Bedenbender, Johannes; Kircher, Tilo; Paulus, Frieder M.; Jansen, Andreas

2013-01-01

The mechanisms underlying hemispheric specialization of memory are not completely understood. Functional magnetic resonance imaging (fMRI) can be used to develop and test models of hemispheric specialization. In particular for memory tasks however, the interpretation of fMRI results is often hampered by the low reliability of the data. In the present study we therefore analyzed the test-retest reliability of fMRI brain activation related to an implicit memory encoding task, with a particular focus on brain activity of the medial temporal lobe (MTL). Fifteen healthy subjects were scanned with fMRI on two sessions (average retest interval 35 days) using a commonly applied novelty encoding paradigm contrasting known and unknown stimuli. To assess brain lateralization, we used three different stimuli classes that differed in their verbalizability (words, scenes, fractals). Test-retest reliability of fMRI brain activation was assessed by an intraclass-correlation coefficient (ICC), describing the stability of inter-individual differences in the brain activation magnitude over time. We found as expected a left-lateralized brain activation network for the words paradigm, a bilateral network for the scenes paradigm, and predominantly right-hemispheric brain activation for the fractals paradigm. Although these networks were consistently activated in both sessions on the group level, across-subject reliabilities were only poor to fair (ICCs ≤ 0.45). Overall, the highest ICC values were obtained for the scenes paradigm, but only in strongly activated brain regions. In particular the reliability of brain activity of the MTL was poor for all paradigms. In conclusion, for novelty encoding paradigms the interpretation of fMRI results on a single subject level is hampered by its low reliability. More studies are needed to optimize the retest reliability of fMRI activation for memory tasks. PMID:24367338
A mechano-acoustic indentor system for in vivo measurement of nonlinear elastic properties of soft tissue.

PubMed

Koo, Terry K; Cohen, Jeffrey H; Zheng, Yongping

2011-11-01

Soft tissue exhibits nonlinear stress-strain behavior under compression. Characterizing its nonlinear elasticity may aid detection, diagnosis, and treatment of soft tissue abnormality. The purposes of this study were to develop a rate-controlled Mechano-Acoustic Indentor System and a corresponding finite element optimization method to extract nonlinear elastic parameters of soft tissue and evaluate its test-retest reliability. An indentor system using a linear actuator to drive a force-sensitive probe with a tip-mounted ultrasound transducer was developed. Twenty independent sites at the upper lateral quadrant of the buttock from 11 asymptomatic subjects (7 men and 4 women from a chiropractic college) were indented at 6% per second for 3 sessions, each consisting of 5 trials. Tissue thickness, force at 25% deformation, and area under the load-deformation curve from 0% to 25% deformation were calculated. Optimized hyperelastic parameters of the soft tissue were calculated with a finite element model using a first-order Ogden material model. Load-deformation response on a standardized block was then simulated, and the corresponding area and force parameters were calculated. Between-trials repeatability and test-retest reliability of each parameter were evaluated using coefficients of variation and intraclass correlation coefficients, respectively. Load-deformation responses were highly reproducible under repeated measurements. Coefficients of variation of tissue thickness, area under the load-deformation curve from 0% to 25% deformation, and force at 25% deformation averaged 0.51%, 2.31%, and 2.23%, respectively. Intraclass correlation coefficients ranged between 0.959 and 0.999, indicating excellent test-retest reliability. The automated Mechano-Acoustic Indentor System and its corresponding optimization technique offers a viable technology to make in vivo measurement of the nonlinear elastic properties of soft tissue. This technology showed excellent between-trials repeatability and test-retest reliability with potential to quantify the effects of a wide variety of manual therapy techniques on the soft tissue elastic properties. Copyright © 2011 National University of Health Sciences. Published by Mosby, Inc. All rights reserved.
Validity, Reliability, and Sensitivity of a Volleyball Intermittent Endurance Test.

PubMed

Rodríguez-Marroyo, Jose A; Medina-Carrillo, Javier; García-López, Juan; Morante, Juan C; Villa, José G; Foster, Carl

2017-03-01

To analyze the concurrent and construct validity of a volleyball intermittent endurance test (VIET). The VIET's test-retest reliability and sensitivity to assess seasonal changes was also studied. During the preseason, 71 volleyball players of different competitive levels took part in this study. All performed the VIET and a graded treadmill test with gas-exchange measurement (GXT). Thirty-one of the players performed an additional VIET to analyze the test-retest reliability. To test the VIET's sensitivity, 28 players repeated the VIET and GXT at the end of their season. Significant (P < .001) relationships between VIET distance and maximal oxygen uptake (r = .74) and GXT maximal speed (r = .78) were observed. There were no significant differences between the VIET performance test and retest (1542.1 ± 338.1 vs 1567.1 ± 358.2 m). Significant (P < .001) relationships and intraclass correlation coefficient (ICC) were found (r = .95, ICC = .96) for VIET performance. VIET performance increased significantly (P < .001) with player performance level and was sensitive to fitness changes across the season (1458.8 ± 343.5 vs 1581.1 ± 334.0 m, P < .01). The VIET may be considered a valid, reliable, and sensitive test to assess the aerobic endurance in volleyball players.
Further Examination of the Reliability of the Modified Rathus Assertiveness Schedule.

ERIC Educational Resources Information Center

Del Greco, Linda; And Others

1986-01-01

Examined the reliability of the 30-item Modified Rathus Assertiveness Schedule (MRAS) using the test-retest method over a three-week period. The MRAS yielded correlations of .74 using the Pearson product and Spearman Brown correlation coefficient. Correlations for males yielded .77 and .72. For females correlations for both tests were .72.…
What to Do With "Moderate" Reliability and Validity Coefficients?

PubMed

Post, Marcel W

2016-07-01

Clinimetric studies may use criteria for test-retest reliability and convergent validity such that correlation coefficients as low as .40 are supportive of reliability and validity. It can be argued that moderate (.40-.60) correlations should not be interpreted in this way and that reliability coefficients <.70 should be considered as indicative of unreliability. Convergent validity coefficients in the .40 to .60 or .40 to .70 range should be considered as indications of validity problems, or as inconclusive at best. Studies on reliability and convergent should be designed in such a way that it is realistic to expect high reliability and validity coefficients. Multitrait multimethod approaches are preferred to study construct (convergent-divergent) validity. Copyright © 2016 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Reliability of perceived neighbourhood conditions and the effects of measurement error on self-rated health across urban and rural neighbourhoods.

PubMed

Pruitt, Sandi L; Jeffe, Donna B; Yan, Yan; Schootman, Mario

2012-04-01

Limited psychometric research has examined the reliability of self-reported measures of neighbourhood conditions, the effect of measurement error on associations between neighbourhood conditions and health, and potential differences in the reliabilities between neighbourhood strata (urban vs rural and low vs high poverty). We assessed overall and stratified reliability of self-reported perceived neighbourhood conditions using five scales (social and physical disorder, social control, social cohesion, fear) and four single items (multidimensional neighbouring). We also assessed measurement error-corrected associations of these conditions with self-rated health. Using random-digit dialling, 367 women without breast cancer (matched controls from a larger study) were interviewed twice, 2-3 weeks apart. Test-retest (intraclass correlation coefficients (ICC)/weighted κ) and internal consistency reliability (Cronbach's α) were assessed. Differences in reliability across neighbourhood strata were tested using bootstrap methods. Regression calibration corrected estimates for measurement error. All measures demonstrated satisfactory internal consistency (α ≥ 0.70) and either moderate (ICC/κ=0.41-0.60) or substantial (ICC/κ=0.61-0.80) test-retest reliability in the full sample. Internal consistency did not differ by neighbourhood strata. Test-retest reliability was significantly lower among rural (vs urban) residents for two scales (social control, physical disorder) and two multidimensional neighbouring items; test-retest reliability was higher for physical disorder and lower for one multidimensional neighbouring item among the high (vs low) poverty strata. After measurement error correction, the magnitude of associations between neighbourhood conditions and self-rated health were larger, particularly in the rural population. Research is needed to develop and test reliable measures of perceived neighbourhood conditions relevant to the health of rural populations.
Intrarater test-retest reliability of static and dynamic stability indexes measurement using the Biodex Stability System during unilateral stance.

PubMed

Arifin, Nooranida; Abu Osman, Noor Azuan; Wan Abas, Wan Abu Bakar

2014-04-01

The measurements of postural balance often involve measurement error, which affects the analysis and interpretation of the outcomes. In most of the existing clinical rehabilitation research, the ability to produce reliable measures is a prerequisite for an accurate assessment of an intervention after a period of time. Although clinical balance assessment has been performed in previous study, none has determined the intrarater test-retest reliability of static and dynamic stability indexes during dominant single stance. In this study, one rater examined 20 healthy university students (female=12, male=8) in two sessions separated by 7 day intervals. Three stability indexes--the overall stability index (OSI), anterior/posterior stability index (APSI), and medial/ lateral stability index (MLSI) in static and dynamic conditions--were measured during single dominant stance. Intraclass correlation coefficient (ICC), standard error measurement (SEM) and 95% confidence interval (95% CI) were calculated. Test-retest ICCs for OSI, APSI, and MLSI were 0.85, 0.78, and 0.84 during static condition and were 0.77, 0.77, and 0.65 during dynamic condition, respectively. We concluded that the postural stability assessment using Biodex stability system demonstrates good-to-excellent test-retest reliability over a 1 week time interval.
Impact of Alzheimer's Disease on Caregiver Questionnaire: internal consistency, convergent validity, and test-retest reliability of a new measure for assessing caregiver burden.

PubMed

Cole, Jason C; Ito, Diane; Chen, Yaozhu J; Cheng, Rebecca; Bolognese, Jennifer; Li-McLeod, Josephine

2014-09-04

There is a lack of validated instruments to measure the level of burden of Alzheimer's disease (AD) on caregivers. The Impact of Alzheimer's Disease on Caregiver Questionnaire (IADCQ) is a 12-item instrument with a seven-day recall period that measures AD caregiver's burden across emotional, physical, social, financial, sleep, and time aspects. Primary objectives of this study were to evaluate psychometric properties of IADCQ administered on the Web and to determine most appropriate scoring algorithm. A national sample of 200 unpaid AD caregivers participated in this study by completing the Web-based version of IADCQ and Short Form-12 Health Survey Version 2 (SF-12v2™). The SF-12v2 was used to measure convergent validity of IADCQ scores and to provide an understanding of the overall health-related quality of life of sampled AD caregivers. The IADCQ survey was also completed four weeks later by a randomly selected subgroup of 50 participants to assess test-retest reliability. Confirmatory factor analysis (CFA) was implemented to test the dimensionality of the IADCQ items. Classical item-level and scale-level psychometric analyses were conducted to estimate psychometric characteristics of the instrument. Test-retest reliability was performed to evaluate the instrument's stability and consistency over time. Virtually none (2%) of the respondents had either floor or ceiling effects, indicating the IADCQ covers an ideal range of burden. A single-factor model obtained appropriate goodness of fit and provided evidence that a simple sum score of the 12 items of IADCQ can be used to measure AD caregiver's burden. Scales-level reliability was supported with a coefficient alpha of 0.93 and an intra-class correlation coefficient (for test-retest reliability) of 0.68 (95% CI: 0.50-0.80). Low-moderate negative correlations were observed between the IADCQ and scales of the SF-12v2. The study findings suggest the IADCQ has appropriate psychometric characteristics as a unidimensional, Web-based measure of AD caregiver burden and is supported by strong model fit statistics from CFA, high degree of item-level reliability, good internal consistency, moderate test-retest reliability, and moderate convergent validity. Additional validation of the IADCQ is warranted to ensure invariance between the paper-based and Web-based administration and to determine an appropriate responder definition.
Test-retest reliability and validity of a web-based food-frequency questionnaire for adolescents aged 13-14 to be used in the Norwegian Mother and Child Cohort Study (MoBa).

PubMed

Overby, Nina Cecilie; Johannesen, Elisabeth; Jensen, Grete; Skjaevesland, Anne-Kirsti; Haugen, Margaretha

2014-01-01

The assessment of food intake is challenging and prone to errors; it is therefore important to consider the reliability and validity of the assessment methods. The aim of this study was to analyze the reproducibility and validity of a developed food-frequency questionnaire (FFQ) for use among adolescents. In total, 58 students (aged 13-14) from four different schools in the southern part of Norway participated in the reproducibility study of filling out the FFQ 4 weeks apart. In addition, 93 students participated in the relative validity study where the FFQ was compared to 2×24-hour dietary recalls, while 92 students participated in the absolute validity study where the intakes of fatty acids and vitamin D from the FFQ were compared to fatty acids and 25-hydroxy-vitamin D3 in whole blood. The median Spearman correlation coefficient for all nutrients in the test-retest reliability study was 0.57. The median Spearman correlation for all nutrients in the relative validity study was 0.26, while the correlations coefficients were low in the absolute validity study with n-3 fatty acid coefficients ranging from 0.05 to 0.25, and absent for vitamin D (r=0.000). The test-retest reproducibility was considered good, the relative validity was considered poor to good, and the absolute validity was considered poor. However, the results are comparable to other studies among adolescents.
Demonstration of the test-retest reliability and sensitivity of the Lower Limb Functional Index-10 as a measure of functional recovery post burn injury: a cross-sectional repeated measures study design.

PubMed

Ryland, Margaret E; Grisbrook, Tiffany L; Wood, Fiona M; Phillips, Michael; Edgar, Dale W

2016-01-01

Lower limb burns can significantly delay recovery of function. Measuring lower limb functional outcomes is challenging in the unique burn patient population and necessitates the use of reliable and valid tools. The aims of this study were to examine the test-retest reliability, sensitivity, and internal consistency of Sections 1 and 3 of the Lower Limb Functional Index-10 (LLFI-10) questionnaire for measuring functional ability in patients with lower limb burns over time. Twenty-nine adult patients who had sustained a lower limb burn injury in the previous 12 months completed the test-retest procedure of the study. In addition, the minimal detectable change (MDC) was calculated for Section 1 and 3 of the LLFI-10. Section 1 is focused on the activity limitations experienced by patients with a lower limb disorder whereas Section 3 involves patients indicating their current percentage of pre-injury duties. Section 1 of the LLFI-10 demonstrated excellent test-retest reliability (intra-class correlation coefficient (ICC) 0.98, 95 % CI 0.96-0.99) whilst Section 3 demonstrated high test-retest reliability (ICC 0.88, 95 % CI 0.79-0.94). MDC scores for Sections 1 and 3 were 1.27 points and 30.22 %, respectively. Internal consistency was demonstrated with a significant negative association (r s = -0.83) between Sections 1 and 3 of the LLFI-10 (p < 0.001). This study demonstrates that Section 1 and 3 of the LLFI-10 are reliable for measuring functional ability in patients who have sustained lower limb burns in the previous 12 months, and furthermore, Section 1 is sensitive to changes in patient function over time.
Test-Retest Reliability, Agreement and Responsiveness of Productivity Loss (iPCQ-VR) and Healthcare Utilization (TiCP-VR) Questionnaires for Sick Workers with Chronic Musculoskeletal Pain.

PubMed

Beemster, Timo T; van Velzen, Judith M; van Bennekom, Coen A M; Reneman, Michiel F; Frings-Dresen, Monique H W

2018-03-16

The purpose of this study was to assess test-retest reliability, agreement, and responsiveness of questionnaires on productivity loss (iPCQ-VR) and healthcare utilization (TiCP-VR) for sick-listed workers with chronic musculoskeletal pain who were referred to vocational rehabilitation. Methods Test-retest reliability and agreement was assessed with a 2-week interval. Responsiveness was assessed at discharge after a 15-week vocational rehabilitation (VR) program. Data was obtained from six Dutch VR centers. Test-retest reliability was determined with intraclass correlation coefficient (ICC) and Cohen's kappa. Agreement was determined by Standard Error of Measurement (SEM), smallest detectable changes (on group and individual level), and percentage observed, positive and negative agreement. Responsiveness was determined with area under the curve (AUC) obtained from receiver operation characteristic (ROC). Results A sample of 52 participants on test-retest reliability and agreement, and a sample of 223 on responsiveness were included in the analysis. Productivity loss (iPCQ-VR): ICCs ranged from 0.52 to 0.90, kappa ranged from 0.42 to 0.96, and AUC ranged from 0.55 to 0.86. Healthcare utilization (TiCP-VR): ICC was 0.81, and kappa values of the single healthcare utilization items ranged from 0.11 to 1.00. Conclusions The iPCQ-VR showed good measurement properties on working status, number of hours working per week and long-term sick leave, and low measurement properties on short-term sick leave and presenteeism. The TiCP-VR showed adequate reliability on all healthcare utilization items together and medication use, but showed low measurement properties on the single healthcare utilization items.
[Reliability and validity of the PAQ-A questionnaire to assess physical activity in Spanish adolescents].

PubMed

Martínez-Gómez, David; Martínez-de-Haro, Vicente; Pozo, Tamara; Welk, Gregory J; Villagra, Ariel; Calle, Marisa E; Marcos, Ascensión; Veiga, Oscar L

2009-01-01

Questionnaires are feasible instruments to assess physical activity (PA) in large samples. The aim of the current study was to evaluate the reliability and validity of the PAQ-A questionnaire in Spanish adolescents using the measurement of PA by accelerometer as criterion. In a sample of 82 adolescents, aged 12 to 17 years, 1-week PAQ-A test-retest was administered. Reliability was analyzed by the Intraclass Correlation Coefficient (ICC) and the internal consistency by the Cronbach's alpha Coefficient. Two hundred thirty-two adolescents, aged 13-17 years, completed the PAQ-A and wore the ActiGraph GT1M accelerometer during 7-days. The PAQ-A was compared against total PA and moderate to vigorous PA (MVPA) obtained by the accelerometer. Test-retest reliability showed ICC = 0.71 for the final score of PAQ-A. Internal consistency was alpha = 0.65 in the first self-report, alpha = 0.67 in the retest in 82 adolescents sample, and alpha = 0.74 in the 232 adolescents sample. The PAQ-A was moderately correlated with total PA (rho = 0.39) and MVPA (rho= 0.34) assessed by the accelerometer. The PAQ-A obtained significantly moderate correlations in boys but not in girls against the accelerometer. The PAQ-A questionnaire shows an adequate reliability and a reasonable validity for assessing PA in Spanish adolescents.
Test-retest reliability of the Mandarin versions of the Hypertension Self-Care Profile instrument.

PubMed

Ngoh, Soh Heng Agnes; Lim, Hazel Wai Ling; Koh, Yi Ling Eileen; Tan, Ngiap Chuan

2017-11-01

Self-efficacy in essential hypertension can be measured using scales, such as the "Hypertension Self-Care Profile" (HTN-SCP) questionnaire. It assesses "Behavior", "Motivation", and "Self-efficacy" in 3 domains, respectively. This study aimed to validate the Mandarin version of HTN-SCP instrument (HTN-SCP-Mn) targeted at patients of Chinese ethnicity with hypertension.Our study recruited Chinese patients, aged 40 years and older, with essential hypertension from a public primary healthcare clinic in Singapore. The 60-item HTN-SCP-Mn questionnaire was completed online using a tablet or smartphone on enrolment. A retest was conducted 2 weeks after the initial test. Reliability was assessed by internal consistency and test-retest reliability using Cronbach alpha and intraclass correlation coefficients (ICC). Differences between the overall HTN-SCP-Mn scores of the patients and their self-reported self-management activities were also determined using independent t test.Of the 153 patients who completed the HTN-SCP-Mn during the initial test, 79 responded to the test-retest evaluation. Reliability of the 3 domains "Behavior", "Motivation", and "Self-efficacy" obtained high internal consistency (Cronbach alpha = 0.838, 0.929, and 0.927, respectively). The item total correlation ranged from 0.058 to 0.677 for Behavior, 0.374 to 0.798 for Motivation, and 0.326 to 0.767 for self-efficacy. The ICC indicated fair to good test-retest reliability with scores of 0.643, 0.579, and 0.710 for the respective domains.The results showed face validity of the HTN-SCP-Mn instrument, indicating its potential application in mandarin-proficient patients. Further study is needed to correlate its scores with objective demonstration of self-efficacy.
Examination of the Test-Retest Reliability of a Computerized Neurocognitive Test Battery.

PubMed

Nakayama, Yusuke; Covassin, Tracey; Schatz, Philip; Nogle, Sally; Kovan, Jeff

2014-08-01

Test-retest reliability is a critical issue in the utility of computer-based neurocognitive assessment paradigms employing baseline and postconcussion tests. Researchers have reported low test-retest reliability for the Immediate Post Concussion Assessment and Cognitive Testing (ImPACT) across an interval of 45 and 50 days. To re-examine the test-retest reliability of the ImPACT between baseline, 45 days, and 50 days. Descriptive laboratory study. Eighty-five physically active college students (51 male, 34 female) volunteered for this study. Participants completed the ImPACT as well as a 15-item memory test at baseline, 45 days, and 50 days. Intraclass correlation coefficients (ICCs) were calculated for ImPACT composite scores, and change scores were calculated using reliable change indices (RCIs) and regression-based methods (RBMs) at 80% and 95% confidence intervals (CIs). The respective ICCs for baseline to day 45, day 45 to day 50, baseline to day 50, and overall were as follows: verbal memory (0.76, 0.69, 0.65, and 0.78), visual memory (0.72, 0.66, 0.60, and 0.74), visual motor (processing) speed (0.87, 0.88, 0.85, and 0.91), and reaction time (0.67, 0.81, 0.71, and 0.80). All ICCs exceeded the threshold value of 0.60 for acceptable test-retest reliability. All cases fell well within the 80% CI for both the RCI and RBM, while 1% to 5% of cases fell outside the 95% CI for the RCI and 1% for the RBM. Results suggest that the ImPACT is a reliable neurocognitive test battery at 45 and 50 days after the baseline assessment. The current findings agree with those of other reliability studies that have reported acceptable ICCs across 30-day to 1-year testing intervals, and they support the utility of the ImPACT for the multidisciplinary approach to concussion management. This study suggests that the computerized neurocognitive test battery, ImPACT, is a reliable test for postconcussion serial assessments. However, when managing concussed athletes, the ImPACT should not be used as a stand-alone measure. © 2014 The Author(s).
The development and psychometric testing of a Disaster Response Self-Efficacy Scale among undergraduate nursing students.

PubMed

Li, Hong-Yan; Bi, Rui-Xue; Zhong, Qing-Ling

2017-12-01

Disaster nurse education has received increasing importance in China. Knowing the abilities of disaster response in undergraduate nursing students is beneficial to promote teaching and learning. However, there are few valid and reliable tools that measure the abilities of disaster response in undergraduate nursing students. To develop a self-report scale of self-efficacy in disaster response for Chinese undergraduate nursing students and test its psychometric properties. Nursing students (N=318) from two medical colleges were chosen by purposive sampling. The Disaster Response Self-Efficacy Scale (DRSES) was developed and psychometrically tested. Reliability and content validity were studied. Construct validity was tested by exploratory and confirmatory factor analysis. Reliability was tested by internal consistency and test-retest reliability. The DRSES consisted of 3 factors and 19 items with a 5-point rating. The content validity was 0.91, Cronbach's alpha coefficient was 0.912, and the intraclass correlation coefficient for test-retest reliability was 0.953. The construct validity was good (χ 2 /df=2.440, RMSEA=0.068, NFI=0.907, CFI=0.942, IFI=0.430, p<0.001). The newly developed DRSES has proven good reliability and validity. It could therefore be used as an assessment tool to evaluate self-efficacy in disaster response for Chinese undergraduate nursing students. Copyright © 2017. Published by Elsevier Ltd.
Measuring family-centred practices of professionals in early intervention services in Taiwan.

PubMed

Kang, L-J; Palisano, R J; Simeonsson, R J; Hwang, A-W

2017-09-01

Family-centred practices emphasize professional supports for forming partnerships with families in early intervention. The Measure of Processes of Care for Service Providers (MPOC-SP) measures the perceptions of paediatric service providers in supporting children and families. This study aimed to establish reliability of the Chinese version of the MPOC-SP (C-MPOC-SP) and to examine professional perceptions of family-centred practices in relation to professional discipline and years of experience. A convenience sample of 94 physical therapists, occupational therapists, speech-language pathologists, social workers and early childhood educators completed the C-MPOC-SP. Thirty-seven professionals completed the measure a second time within 2-4 weeks for test-retest reliability. Internal consistency and test-retest reliability were examined by Cronbach's α and intra-class correlation coefficient. Comparisons were made across professional disciplines by multivariate analyses of variance followed by analyses of variance. Relationships between years of experience and ratings of family-centred practices were examined by Pearson's correlation coefficients (r). Cronbach's α for items on each of the four scales of the C-MPOC-SP ranged from 0.80 to 0.92, indicating adequate internal consistency. Intra-class correlation coefficient between the initial and repeat completion of the C-MPOC-SP for each scale ranged from 0.56 to 0.77, indicating adequate to excellent test-retest reliability. Mean ratings for the Communicating Specific Information were significantly higher for physical therapists, occupational therapists and speech-language pathologists than for social workers (P = 0.001). The C-MPOC-SP scores were positively correlated with years of experience for all four scales (r = 0.23-0.38; P < 0.05). This study established adequate internal consistency and adequate to excellent test-retest reliability of the C-MPOC-SP in measuring perceptions of family centeredness of early intervention service providers. Cross-discipline differences were found in communicating specific information about the child. Higher perceptions of family centeredness were associated with more years of experience. The results support the utility of the C-MPOC-SP in professional education and programme evaluation of early intervention services in Taiwan. © 2017 John Wiley & Sons Ltd.
[Evaluation of a German version of WOMAC (Western Ontario and McMaster Universities) Arthrosis Index].

PubMed

Stucki, G; Meier, D; Stucki, S; Michel, B A; Tyndall, A G; Dick, W; Theiler, R

1996-01-01

The WOMAC (Western Ontario and McMaster Universities) Osteoarthritis Index is a tested questionnaire to assess symptoms and physical functional disability. We adapted the WOMAC for the German language and tested its metric properties, test-retest reliability and validity in 51 patients with knee and hip OA. All WOMAC scales (pain, stiffness, function) were internally consistent with Cronbach's coefficient alpha ranging from 0.80 to 0.96. Test-retest reliability was satisfactory with intraclass correlation coefficients ranging from 0.55 to 0.74. All scales and the global index calculated as the mean of scale scores had a bimodal distribution and a slight ceiling effect. As hypothesized the WOMAC scales were associated with radiological OA-severity and limitations of range-of-motion. Patients with more severe symptoms and functional disability perceived more limitations in their roles at home and at work. The presented German version of the WOMAC is a reliable and valid instrument for the assessment of symptoms and physical functional disability in patients with knee and hip OA.
Psychometric properties of the Activities-specific Balance Confidence Scale among individuals with a lower-limb amputation.

PubMed

Miller, William C; Deathe, A Barry; Speechley, Mark

2003-05-01

To evaluate the internal consistency, test-retest reliability, and construct validity of the Activities-specific Balance Confidence (ABC) Scale among people who have a lower-limb amputation. Retest design. A university-affiliated outpatient amputee clinic in Ontario. Two samples of individuals who have unilateral transtibial and transfemoral amputation. Sample 1 (n=54) was a consecutive and sample 2 (n=329) a convenience sample of all members of the clinic population. Not applicable. Repeated application of the ABC Scale, a 16-item questionnaire that assesses confidence in performing various mobility-related tasks. Correlation to test hypothesized relationships between the ABC Scale and the 2-minute walk (2MWT) and the timed up-and-go (TUG) tests; and assessment of the ability of the ABC Scale to discriminate among groups based on amputation cause, amputation level, mobility device use, automatic stepping ability, wearing time, stair climbing ability, and walking distance. Test-retest reliability (intraclass correlation coefficient) of the ABC Scale was .91 (95% confidence interval [CI], .84-.95) with individual item test-retest coefficients ranging from .53 to .87. Internal consistency, measured by Cronbach alpha, was .95. Hypothesized associations with the 2MWT and TUG test were observed with correlations of .72 (95% CI, .56-.84) and -.70 (95% CI, -.82 to -.53), respectively. The ABC Scale discriminated between all groups except those based on amputation level. Balance confidence, as measured by the ABC Scale, is a construct that provides unique information potentially useful to clinicians who provide amputee rehabilitation. The ABC Scale is reliable, with strong support for validity. Study of the scale's responsiveness is recommended.

RELIABILITY OF ANKLE-FOOT MORPHOLOGY, MOBILITY, STRENGTH, AND MOTOR PERFORMANCE MEASURES.

PubMed

Fraser, John J; Koldenhoven, Rachel M; Saliba, Susan A; Hertel, Jay

2017-12-01

Assessment of foot posture, morphology, intersegmental mobility, strength and motor control of the ankle-foot complex are commonly used clinically, but measurement properties of many assessments are unclear. To determine test-retest and inter-rater reliability, standard error of measurement, and minimal detectable change of morphology, joint excursion and play, strength, and motor control of the ankle-foot complex. Reliability study. 24 healthy, recreationally-active young adults without history of ankle-foot injury were assessed by two clinicians on two occasions, three to ten days apart. Measurement properties were assessed for foot morphology (foot posture index, total and truncated length, width, arch height), joint excursion (weight-bearing dorsiflexion, rearfoot and hallux goniometry, forefoot inclinometry, 1 st metatarsal displacement) and joint play, strength (handheld dynamometry), and motor control rating during intrinsic foot muscle (IFM) exercises. Clinician order was randomized using a Latin Square. The clinicians performed independent examinations and did not confer on the findings for the duration of the study. Test-retest and inter-tester reliability and agreement was assessed using intraclass correlation coefficients (ICC 2,k ) and weighted kappa ( K w ). Test-retest reliability ICC were as follows: morphology: .80-1.00, joint excursion: .58-.97, joint play: -.67-.84, strength: .67-.92, IFM motor rating: K W -.01-.71. Inter-rater reliability ICC were as follows: morphology: .81-1.00, joint excursion: .32-.97, joint play: -1.06-1.00, strength: .53-.90, and IFM motor rating: K w .02-.56. Measures of ankle-foot posture, morphology, joint excursion, and strength demonstrated fair to excellent test-retest and inter-rater reliability. Test-retest reliability for rating of perceived difficulty and motor performance was good to excellent for short-foot, toe-spread-out, and hallux exercises and poor to fair for lesser toe extension. Joint play measures had poor to fair reliability overall. The findings of this study should be considered when choosing methods of clinical assessment and outcome measures in practice and research. 3.
Adaptation and Assessment of Reliability and Validity of the Greek Version of the Ohkuma Questionnaire for Dysphagia Screening

PubMed Central

Papadopoulou, Soultana L.; Exarchakos, Georgios; Christodoulou, Dimitrios; Theodorou, Stavroula; Beris, Alexandre; Ploumis, Avraam

2016-01-01

Introduction The Ohkuma questionnaire is a validated screening tool originally used to detect dysphagia among patients hospitalized in Japanese nursing facilities. Objective The purpose of this study is to evaluate the reliability and validity of the adapted Greek version of the Ohkuma questionnaire. Methods Following the steps for cross-cultural adaptation, we delivered the validated Ohkuma questionnaire to 70 patients (53 men, 17 women) who were either suffering from dysphagia or not. All of them completed the questionnaire a second time within a month. For all of them, we performed a bedside and VFSS study of dysphagia and asked participants to undergo a second VFSS screening, with the exception of nine individuals. Statistical analysis included measurement of internal consistency with Cronbach's α coefficient, reliability with Cohen's Kappa, Pearson's correlation coefficient and construct validity with categorical components, and One-Way Anova test. Results According to Cronbach's α coefficient (0.976) for total score, there was high internal consistency for the Ohkuma Dysphagia questionnaire. Test-retest reliability (Cohen's Kappa) ranged from 0.586 to 1.00, exhibiting acceptable stability. We also estimated the Pearson's correlation coefficient for the test-retest total score, which reached high levels (0.952; p = 0.000). The One-Way Anova test in the two measurement times showed statistically significant correlation in both measurements (p = 0.02 and p = 0.016). Conclusion The adapted Greek version of the questionnaire is valid and reliable and can be used for the screening of dysphagia in the Greek-speaking patients. PMID:28050209
Adaptation and Assessment of Reliability and Validity of the Greek Version of the Ohkuma Questionnaire for Dysphagia Screening.

PubMed

Papadopoulou, Soultana L; Exarchakos, Georgios; Christodoulou, Dimitrios; Theodorou, Stavroula; Beris, Alexandre; Ploumis, Avraam

2017-01-01

Introduction The Ohkuma questionnaire is a validated screening tool originally used to detect dysphagia among patients hospitalized in Japanese nursing facilities. Objective The purpose of this study is to evaluate the reliability and validity of the adapted Greek version of the Ohkuma questionnaire. Methods Following the steps for cross-cultural adaptation, we delivered the validated Ohkuma questionnaire to 70 patients (53 men, 17 women) who were either suffering from dysphagia or not. All of them completed the questionnaire a second time within a month. For all of them, we performed a bedside and VFSS study of dysphagia and asked participants to undergo a second VFSS screening, with the exception of nine individuals. Statistical analysis included measurement of internal consistency with Cronbach's α coefficient, reliability with Cohen's Kappa, Pearson's correlation coefficient and construct validity with categorical components, and One-Way Anova test. Results According to Cronbach's α coefficient (0.976) for total score, there was high internal consistency for the Ohkuma Dysphagia questionnaire. Test-retest reliability (Cohen's Kappa) ranged from 0.586 to 1.00, exhibiting acceptable stability. We also estimated the Pearson's correlation coefficient for the test-retest total score, which reached high levels (0.952; p = 0.000). The One-Way Anova test in the two measurement times showed statistically significant correlation in both measurements ( p = 0.02 and p = 0.016). Conclusion The adapted Greek version of the questionnaire is valid and reliable and can be used for the screening of dysphagia in the Greek-speaking patients.
Reliability of measuring hip abductor strength following total knee arthroplasty using a hand-held dynamometer.

PubMed

Schache, Margaret B; McClelland, Jodie A; Webster, Kate E

2016-01-01

To investigate the test-retest reliability of measuring hip abductor strength in patients with total knee arthroplasty (TKA) using a hand-held dynamometer (HHD) with two different types of resistance: belt and manual resistance. Test-retest reliability of 30 subjects (17 female, 13 male, 71.9 ± 7.4 years old), 9.2 ± 2.7 days post TKA was measured using belt and therapist resistance. Retest reliability was calculated with intra-class coefficients (ICC3,1) and 95% confidence intervals (CI) for both the group average and the individual scores. A paired t-test assessed whether a difference existed between the belt and therapist methods of resistance. ICCs were 0.82 and 0.80 for the belt and therapist resisted methods, respectively. Hip abductor strength increases of 8 N (14%) for belt resisted and 14 N (17%) for therapist resisted measurements of the group average exceeded the 95% CI and may represent real change. For individuals, hip abductor strength increases of 33 N (72%) (belt resisted) and 57 N (79%) (therapist resisted) could be interpreted as real change. Hip abductor strength can be reliably measured using HHD in the clinical setting with the described protocol. Belt resistance demonstrated slightly higher test-retest reliability. Reliable measurement of hip abductor muscle strength in patients with TKA is important to ensure deficiencies are addressed in rehabilitation programs and function is maximized. Hip abductor strength can be reliably measured with a hand-held dynamometer in the clinical setting using manual or belt resistance.
Validity and test-retest reliability of the six-spot step test in persons after stroke.

PubMed

Arvidsson Lindvall, Mialinn; Anderzén-Carlsson, Agneta; Appelros, Peter; Forsberg, Anette

2018-06-06

After stroke, asymmetric weight distribution is common with decreased balance control in standing and walking. The six-spot step test (SSST) includes a 5-m walk during which one leg shoves wooden blocks out of circles marked on the floor, thus assessing the ability to take load on each leg. The aim of the present study was to investigate the convergent and discriminant validity and test-retest reliability of the SSST in persons with stroke. Eighty-one participants were included. A cross-sectional study was performed, in which the SSST was conducted twice, 3-7 days apart. Validity was investigated using measures of dynamic balance and walking. Reliability was assessed using intraclass correlation coefficient, standard error of the measurement (SEM), and smallest real difference (SRD). The convergent validity was strong to moderate, and the test-retest reliability was good. The SEM% was 14.7%, and the SRD% was 40.8% based on the mean of four walks shoving twice with the paretic and twice with the non-paretic leg. Values on random measurement error were high affecting the use of the SSST for follow-up evaluations but the SSST can be a complementary measure of gait and balance.
Test-Retest Reliability of Self-Reported Sexual Behavior, Sexual Orientation, and Psychosexual Milestones Among Gay, Lesbian, and Bisexual Youths

PubMed Central

Schrimshaw, Eric W.; Rosario, Margaret; Meyer-Bahlburg, Heino F. L.; Scharf-Matlick, Alice A.

2011-01-01

Despite the importance of reliable self-reported sexual information for research on sexuality and sexual health, research has not examined reliability of information provided by gay, lesbian, and bisexual (GLB) youths. Test-retest reliability of self-reported sexual behaviors, sexual orientation, sexual identity, and psychosexual developmental milestones was examined among an ethnically diverse sample of 64 self-identified GLB youths. Two face-to-face interviews were conducted approximately two weeks apart using the Sexual Risk Behavior Assessment Schedule for Homosexual Youths (SERBAS-Y-HM). Overall, the mean of the test-retest reliability coefficients was substantial for 6 of the 7 domains: lifetime sexual behaviors (M = .89), sexual behavior in the past 3 months (M = .96), unprotected sexual behavior in the past 3 months (M = .93), sexual identity (κ = .89), sexual orientation (M = .82), and ages of various psychosexual developmental milestones (M = .77). Inconsistent reliability was found for reports of sexual behaviors while using substances. A small number of gender differences emerged, with lower reliability among female youths in the lifetime number of same-sex partners. The overall findings suggest that a wide range of self-reported sexual information can be reliably assessed among GLB youths by means of interviewer-administered questionnaires, such as the SERBAS-Y-HM. PMID:16752124
Cross-Cultural Translation, Adaptation and Reliability of the Danish M. D. Andeson Dysphagia Inventory (MDADI) in Patients with Head and Neck Cancer.

PubMed

Hajdú, Sara Fredslund; Plaschke, Christina Caroline; Johansen, Christoffer; Dalton, Susanne Oksbjerg; Wessel, Irene

2017-08-01

The objectives were to translate and culturally adapt the M.D. Anderson Dysphagia Inventory (MDADI) into Danish and subsequently test the reliability of the Danish version. The MDADI was translated into Danish and cross culturally adapted through cognitive interviews. The final version was test-retest evaluated in a group of head and neck cancer (HNC) patients who responded to the questionnaire twice with a mean of eight days apart. Interclass correlation coefficient, Cronbach's alpha, floor and ceiling effects, standard error of measurement and minimal detectable change were investigated. Fourteen patients were interviewed on the comprehensibility of the Danish MDADI, and all found the questionnaire meaningful, easy to understand, non-offensive and to include relevant aspects of dysphagia related to HNC. Sixty-four patients were included in the test-retest study. Especially, one item in the emotional scale (E7) appeared to be often misinterpreted, and ceiling effects were found in all four subdomains (global, emotional, functional and physical). The four subdomains and the composite score showed acceptable test-retest reliability and internal consistency in a Danish population of HNC patients. The Danish MDADI is reliable in terms of internal consistency and test-retest reproducibility and can be used in assessing the health-related quality of life in head and neck cancer patients with dysphagia.
The Trojan Lifetime Champions Health Survey: development, validity, and reliability.

PubMed

Sorenson, Shawn C; Romano, Russell; Scholefield, Robin M; Schroeder, E Todd; Azen, Stanley P; Salem, George J

2015-04-01

Self-report questionnaires are an important method of evaluating lifespan health, exercise, and health-related quality of life (HRQL) outcomes among elite, competitive athletes. Few instruments, however, have undergone formal characterization of their psychometric properties within this population. To evaluate the validity and reliability of a novel health and exercise questionnaire, the Trojan Lifetime Champions (TLC) Health Survey. Descriptive laboratory study. A large National Collegiate Athletic Association Division I university. A total of 63 university alumni (age range, 24 to 84 years), including former varsity collegiate athletes and a control group of nonathletes. Participants completed the TLC Health Survey twice at a mean interval of 23 days with randomization to the paper or electronic version of the instrument. Content validity, feasibility of administration, test-retest reliability, parallel-form reliability between paper and electronic forms, and estimates of systematic and typical error versus differences of clinical interest were assessed across a broad range of health, exercise, and HRQL measures. Correlation coefficients, including intraclass correlation coefficients (ICCs) for continuous variables and κ agreement statistics for ordinal variables, for test-retest reliability averaged 0.86, 0.90, 0.80, and 0.74 for HRQL, lifetime health, recent health, and exercise variables, respectively. Correlation coefficients, again ICCs and κ, for parallel-form reliability (ie, equivalence) between paper and electronic versions averaged 0.90, 0.85, 0.85, and 0.81 for HRQL, lifetime health, recent health, and exercise variables, respectively. Typical measurement error was less than the a priori thresholds of clinical interest, and we found minimal evidence of systematic test-retest error. We found strong evidence of content validity, convergent construct validity with the Short-Form 12 Version 2 HRQL instrument, and feasibility of administration in an elite, competitive athletic population. These data suggest that the TLC Health Survey is a valid and reliable instrument for assessing lifetime and recent health, exercise, and HRQL, among elite competitive athletes. Generalizability of the instrument may be enhanced by additional, larger-scale studies in diverse populations.
Validity and reliability of the Turkish Migraine Disability Assessment (MIDAS) questionnaire.

PubMed

Ertaş, Mustafa; Siva, Aksel; Dalkara, Turgay; Uzuner, Nevzat; Dora, Babür; Inan, Levent; Idiman, Fethi; Sarica, Yakup; Selçuki, Deniz; Sirin, Hadiye; Oğuzhanoğlu, Atilla; Irkeç, Ceyla; Ozmenoğlu, Mehmet; Ozbenli, Taner; Oztürk, Musa; Saip, Sabahattin; Neyal, Münife; Zarifoğlu, Mehmet

2004-09-01

The aim of this study is to assess the comprehensibility, internal consistency, patient-physician reliability, test-retest reliability, and validity of Turkish version of Migraine Disability Assessment (MIDAS) questionnaire in patients with headache. MIDAS questionnaire has been developed by Stewart et al and shown to be reliable and valid to determine the degree of disability caused by migraine. This study was designed as a national multicenter study to demonstrate the reliability and validity of Turkish version of MIDAS questionnaire. Patients applying to 17 Neurology Clinics in Turkey were evaluated at the baseline (visit 1), week 4 (visit 2), and week 12 (visit 3) visits in terms of disease severity and comprehensibility, internal consistency, test-retest reliability, and validity of MIDAS. Since the severity of the disease has been found to change significantly at visit 2 compared to visit 1, test-retest reliability was assessed using the MIDAS scores of a subgroup of patients whose disease severity remained unchanged (up to +/-3 days difference in the number of days with headache between visits 1 and 2). A total of 306 patients (86.2% female, mean age: 35.0 +/- 9.8 years) were enrolled into the study. A total of 65.7%, 77.5%, 82.0% of patients reported that "they had fully understood the MIDAS questionnaire" in visits 1, 2, and 3, respectively. A highly positive correlation was found between physician and patient and the applied total MIDAS scores in all three visits (Spearman correlation coefficients were R= 0.87, 0.83, and 0.90, respectively, P <.001). Internal consistency of MIDAS was assessed using Cronbach's alpha and was found at acceptable (>0.7) or excellent (>0.8) levels in both patient and physician applied MIDAS scores, respectively. Total MIDAS score showed good test-retest reliability (R= 0.68). Both the number of days with headache and the total MIDAS scores were positively correlated at all visits with correlation coefficients between 0.47 and 0.63. There was also a moderate degree of correlation (R= 0.54) between the total MIDAS score at week 12 and the number of days with headache at visit 2 + visit 3, which quantify headache-related disability over a 3-month period similar to MIDAS questionnaire. These findings demonstrated that the Turkish translation is equivalent to the English version of MIDAS in terms of internal consistency, test-retest reliability, and validity. Physicians can reliably use the Turkish translation of the MIDAS questionnaire in defining the severity of illness and its treatment strategy when applied as a self-administered report by migraine patients themselves.
ASSOCIATIONS BETWEEN THREE CLINICAL ASSESSMENT TOOLS FOR POSTURAL STABILITY

PubMed Central

Saxion, Casie E.; Cameron, Kenneth L.; Gerber, J. Parry

2010-01-01

Study Design: Clinical Measurement, Correlation, Reliability Objectives: To assess the relationship between the Single Leg Balance (SLB), modified Balance Error Scoring System (mBESS), and modified Star Excursion Balance (mSEBT) tests and secondarily to assess inter-rater and test-retest reliability of these tests. Background: Ankle sprains often result in chronic instability and dysfunction. Several clinical tests assess postural deficits as a potential cause of this dysfunction; however, limited information exists pertaining to the relationship that these tests have with one another. Methods: Two independent examiners measured the performance of 34 healthy participants completing the SLB Test, mBESS test, and mSEBT at two different time periods. The relationship between tests was assessed using the Pearson Correlation and Fisher's Exact Tests. Inter-rater and test-retest reliability were assessed using the intraclass correlation coefficient (ICC) and Kappa statistics. Results: A significant correlation (r = -0.35) was observed between the mSEBT and the mBESS. Fisher's Exact Test showed a significant association between the SLB Test and mBESS (P = .048), but no association between the SLB and mSEBT (P = 1.000). Inter-rater reliability was excellent for the mSEBT and fair for the mBESS (ICCs of .91 and .61 respectively). Excellent agreement was observed between raters for the SLB test (k = 1.00). Test-retest reliability was excellent for the mSEBT (ICC = 0.98) and fair for the mBESS (ICC = 0.74). There was poor test-retest agreement for the SLB test (k = .211). Conclusion: There was a significant relationship observed between the SLB Test, mBESS test, and mSEBT: however; strength of association measures showed limited overlap between these tests. This suggests that these tests are interrelated but may not assess equal components of postural stability. PMID:21589668
Measurement of fatigue: Comparison of the reliability and validity of single-item and short measures to a comprehensive measure.

PubMed

Kim, Hee-Ju; Abraham, Ivo

2017-01-01

Evidence is needed on the clinicometric properties of single-item or short measures as alternatives to comprehensive measures. We examined whether two single-item fatigue measures (i.e., Likert scale, numeric rating scale) or a short fatigue measure were comparable to a comprehensive measure in reliability (i.e., internal consistency and test-retest reliability) and validity (i.e., convergent, concurrent, and predictive validity) in Korean young adults. For this quantitative study, we selected the Functional Assessment of Chronic Illness Therapy-Fatigue for the comprehensive measure and the Profile of Mood States-Brief, Fatigue subscale for the short measure; and constructed two single-item measures. A total of 368 students from four nursing colleges in South Korea participated. We used Cronbach's alpha and item-total correlation for internal consistency reliability and intraclass correlation coefficient for test-retest reliability. We assessed Pearson's correlation with a comprehensive measure for convergent validity, with perceived stress level and sleep quality for concurrent validity and the receiver operating characteristic curve for predictive validity. The short measure was comparable to the comprehensive measure in internal consistency reliability (Cronbach's alpha=0.81 vs. 0.88); test-retest reliability (intraclass correlation coefficient=0.66 vs. 0.61); convergent validity (r with comprehensive measure=0.79); concurrent validity (r with perceived stress=0.55, r with sleep quality=0.39) and predictive validity (area under curve=0.88). Single-item measures were not comparable to the comprehensive measure. A short fatigue measure exhibited similar levels of reliability and validity to the comprehensive measure in Korean young adults. Copyright Â© 2016 Elsevier Ltd. All rights reserved.
Reliability of the Timed Up and Go test and Ten-Metre Timed Walk Test in Pregnant Women with Pelvic Girdle Pain.

PubMed

Evensen, Natalie M; Kvåle, Alice; Braekken, Ingeborg H

2015-09-01

There is a lack of functional objective tests available to measure functional status in women with pelvic girdle pain (PGP). The purpose of this study was to establish test-retest and intertester reliability of the Timed Up and Go (TUG) test and Ten-metre Timed Walk Test (10mTWT) in pregnant women with PGP. A convenience sample of women was recruited over a 4-month period and tested on two occasions, 1 week apart to determine test-retest reliability. Intertester reliability was established between two assessors at the first testing session. Subjects were instructed to undertake the TUG and 10mTWT at maximum speed. One practise trial and two timed trials for each walking test was undertaken on Day 1 and one practise trial and one timed trial on Day 2. Seventeen women with PGP aged 31.1 years (SD [standard deviation] = 2.3) and 28.7 weeks pregnant (SD = 7.4) completed gait testing. Test-retest reliability using the intraclass correlation coefficient (ICC) was excellent for the TUG (0.88) and good for the 10mTWT (0.74). Intertester reliability was determined in the first 13 participants with excellent ICC values being found for both walking tests (TUG: 0.95; 10mTWT: 0.94). This study demonstrated that the TUG and 10mTWT undertaken at fast pace are reliable, objective functional tests in pregnant women with PGP. While both tests are suitable for use in the clinical and research settings, we would recommend the TUG given the findings of higher test-retest reliability and as this test requires less space and time to set up and score. Future studies in a larger sample size are warranted to confirm the results of this study. Copyright © 2015 John Wiley & Sons, Ltd.
Test-Retest Reliability and Minimal Detectable Change of Randomized Dichotic Digits in Learning-Disabled Children: Implications for Dichotic Listening Training.

PubMed

Mahdavi, Mohammad Ebrahim; Pourbakht, Akram; Parand, Akram; Jalaie, Shohreh

2018-03-01

Evaluation of dichotic listening to digits is a common part of many studies for diagnosis and managing auditory processing disorders in children. Previous researchers have verified test-retest relative reliability of dichotic digits results in normal children and adults. However, detecting intervention-related changes in the ear scores after dichotic listening training requires information regarding trial-to-trial typical variation of individual ear scores that is estimated using indices of absolute reliability. Previous studies have not addressed absolute reliability of dichotic listening results. To compare the results of the Persian randomized dichotic digits test (PRDDT) and its relative and absolute indices of reliability between typical achieving (TA) and learning-disabled (LD) children. A repeated measures observational study. Fifteen LD children were recruited from a previously performed study with age range of 7-12 yr. The control group consisted of 15 TA schoolchildren with age range of 8-11 yr. The Persian randomized dichotic digits test was administered on the children under free recall condition in two test sessions 7-12 days apart. We compared the average of the ear scores and ear advantage between TA and LD children. Relative indices of reliability included Pearson's correlation and intraclass correlation (ICC 2,1 ) coefficients and absolute reliability was evaluated by calculation of standard error of measurement (SEM) and minimal detectable change (MDC) using the raw ear scores. The Pearson correlation coefficient indicated that in both groups of children the ear scores of test and retest sessions were strongly and positively (greater than +0.8) correlated. The ear scores showed excellent ICC coefficient of consistency (0.78-0.82) and fair to excellent ICC coefficient of absolute agreement (0.62-0.74) in TA children and excellent ICC coefficients of consistency and absolute agreement in LD children (0.76-0.87). SEM and SEM% of the ear scores in TA children were 1.46 and 1.44% for the right ear and 4.68 and 5.47% for the left ear. SEM and SEM% of the ear scores in LD children were 4.55 and 5.88% for the right ear to 7.56 and 12.81% for the left ear. MDC and MDC% of the ear scores in TA children varied from 4.03 and 3.99% for the right ear to 12.93 and 15.13% for the left ear. MDC and MDC% of the ear scores in LD children varied from 12.57 and 16.25% for the right ear to 20.89 and 35.39% for the left ear. The LD children indicated test-retest relative reliability as high as TA children in the ear scores measured by PRDDT. However, within-subject variations of the ear scores calculated by indices of absolute reliability were considerably higher in LD children versus TA children. The results of the current study could have implications for detecting real training-related changes in the ear scores. American Academy of Audiology
[Developing Perceived Competence Scale (PCS) for Adolescents].

PubMed

Özer, Arif; Gençtanirim Kurt, Dilek; Kizildağ, Seval; Demırtaş Zorbaz, Selen; Arici Şahın, Fatma; Acar, Tülin; Ergene, Tuncay

2016-01-01

In this study, Perceived Competence Scale was developed to measure high school students' perceived competence. Scale development process was verified on three different samples. Participants of the research are some high school students in 2011-2012 academic terms from Ankara. Participants' numbers are incorporated in exploratory factor analysis, confirmatory factor analysis and test-retest reliability respectively, as follows: 372, 668 and 75. Internal consistency coefficients (Cronbach's and stratified α) are calculated separately for each group. For data analysis Factor 8.02 and LISREL 8.70 package programs were used. According to results of the analyses, internal consistency coefficients (α) are .90 - .93 for academic competence, .82 - .86 for social competence in the samples that exploratory and confirmatory factor analysis performed. For the whole scale internal consistency coefficient (stratified α) is calculated as .91. As a result of test-retest reliability, adjusted correlation coefficients (r) are .94 for social competence and .90 for academic competence. In addition, to fit indexes and regression weights obtained from factor analysis, findings related convergent and discriminant validity, indicating that competence can be addressed in two dimensions which are academic (16 items) and social (14 items).
Repeatability of chemical-shift-encoded water-fat MRI and diffusion-tensor imaging in lower extremity muscles in children.

PubMed

Ponrartana, Skorn; Andrade, Kristine E; Wren, Tishya A L; Ramos-Platt, Leigh; Hu, Houchun H; Bluml, Stefan; Gilsanz, Vicente

2014-06-01

The purpose of this study was to assess the repeatability of water-fat MRI and diffusion-tensor imaging (DTI) as quantitative biomarkers of pediatric lower extremity skeletal muscle. MRI at 3 T of a randomly selected thigh and lower leg of seven healthy children was studied using water-fat separation and DTI techniques. Muscle-fat fraction, apparent diffusion coefficient (ADC), and fractional anisotropy (FA) values were calculated. Test-retest and interrater repeatability were assessed by calculating the Pearson correlation coefficient, intraclass correlation coefficient, and Bland-Altman analysis. Bland-Altman plots show that the mean difference between test-retest and interrater measurements of muscle-fat fraction, ADC, and FA was near 0. The correlation coefficients and intraclass correlation coefficients were all between 0.88 and 0.99 (p < 0.05), suggesting excellent reliability of the measurements. Muscle-fat fraction measurements from water-fat MRI exhibited the highest intraclass correlation coefficient. Interrater agreement was consistently better than test-retest comparisons. Water-fat MRI and DTI measurements in lower extremity skeletal muscles are objective repeatable biomarkers in children. This knowledge should aid in the understanding of the number of participants needed in clinical trials when using these determinations as an outcome measure to noninvasively monitor neuromuscular disease.
Reliability and validity of the Dutch pediatric Voice Handicap Index.

PubMed

Veder, Laura; Pullens, Bas; Timmerman, Marieke; Hoeve, Hans; Joosten, Koen; Hakkesteegt, Marieke

2017-05-01

The pediatric voice handicap index (pVHI) has been developed to provide a better insight into the parents' perception of their child's voice related quality of life. The purpose of the present study was to validate the Dutch pVHI by evaluating its internal consistency and reliability. Furthermore, we determined the optimal cut-off point for a normal pVHI score. All items of the English pVHI were translated into Dutch. Parents of children in our dysphonic and control group were asked to fill out the questionnaire. For the test re-test analysis we used a different study group who filled out the pVHI twice as part of a large follow up study. Internal consistency was analyzed through Cronbach's α coefficient. The test-retest reliability was assessed by determining Pearson's correlation coefficient. Mann-Whitney test was used to compare the scores of the questionnaire of the control group with the dysphonic group. By calculating receiver operating characteristic (ROC) curves, sensitivity and specificity we were able to set a cut-off point. We obtained data from 122 asymptomatic children and from 79 dysphonic children. The scores of the questionnaire significantly differed between both groups. The internal consistency showed an overall Cronbach α coefficient of 0.96 and an excellent test-retest reliability of the total pVHI questionnaire with a Pearson's correlation coefficient of 0.90. A cut-off point for the total pVHI questionnaire was set at 7 points with a specificity of 85% and sensitivity of 100%. A cut-off point for the VAS score was set at 13 with a specificity of 93% and sensitivity of 97%. The Dutch pVHI is a valid and reliable tool for the assessment of children with voice problems. By setting a cut-off point for the score of the total pVHI questionnaire of 7 points and the VAS score of 13, the pVHI might be used as a screening tool to assess dysphonic complaints and the pVHI might be a useful and complementary tool to identify children with dysphonia. Copyright © 2017 Elsevier B.V. All rights reserved.
Reliability and measurement error of sagittal spinal motion parameters in 220 patients with chronic low back pain using a three-dimensional measurement device.

PubMed

Mieritz, Rune M; Bronfort, Gert; Jakobsen, Markus D; Aagaard, Per; Hartvigsen, Jan

2014-09-01

A basic premise for any instrument measuring spinal motion is that reliable outcomes can be obtained on a relevant sample under standardized conditions. The purpose of this study was to assess the overall reliability and measurement error of regional spinal sagittal plane motion in patients with chronic low back pain (LBP), and then to evaluate the influence of body mass index, examiner, gender, stability of pain, and pain distribution on reliability and measurement error. This study comprises a test-retest design separated by 7 to 14 days. The patient cohort consisted of 220 individuals with chronic LBP. Kinematics of the lumbar spine were sampled during standardized spinal extension-flexion testing using a 6-df instrumented spatial linkage system. Test-retest reliability and measurement error were evaluated using interclass correlation coefficients (ICC(1,1)) and Bland-Altman limits of agreement (LOAs). The overall test-retest reliability (ICC(1,1)) for various motion parameters ranged from 0.51 to 0.70, and relatively wide LOAs were observed for all parameters. Reliability measures in patient subgroups (ICC(1,1)) ranged between 0.34 and 0.77. In general, greater (ICC(1,1)) coefficients and smaller LOAs were found in subgroups with patients examined by the same examiner, patients with a stable pain level, patients with a body mass index less than below 30 kg/m(2), patients who were men, and patients in the Quebec Task Force classifications Group 1. This study shows that sagittal plane kinematic data from patients with chronic LBP may be sufficiently reliable in measurements of groups of patients. However, because of the large LOAs, this test procedure appears unusable at the individual patient level. Furthermore, reliability and measurement error varies substantially among subgroups of patients. Copyright © 2014 Elsevier Inc. All rights reserved.
Reliability of a device for the knee and ankle isometric and isokinetic strength testing in older adults.

PubMed

Bergamin, Marco; Gobbo, Stefano; Bullo, Valentina; Vendramin, Barbara; Duregon, Federica; Frizziero, Antonio; Di Blasio, Andrea; Cugusi, Lucia; Zaccaria, Marco; Ermolao, Andrea

2017-01-01

Lower extremity muscle mass, strength, power, and physical performance are critical determinants of independent functioning in later life. Isokinetic dynamometers are becoming very common in assessing different features of muscle strength, in both research and clinical practice; however, reliability studies are still needed to support the extended use of those devices. The purpose of this study is to assess the test-retest reliability of knee and ankle isokinetic and isometric strength testing protocols in a sample of older healthy subjects, using a new and untested isokinetic multi-joint evaluation system. Sixteen male and fourteen female older adults (mean age 65.2 ± 4.6 years) were assessed in two testing sessions. Each participant performed a randomized testing procedure that includes different isometric and isokinetic tests for knee and ankle joints. All participants concluded the trial safety and no subject reported any discomfort throughout the overall assessment. Coefficients of correlation between measures were calculated showing moderate to strong effects among all test-retest assessments and paired-sample t test showed only one significant difference (p<0.05) in the maximal isokinetic bilateral knee flexion torque. The multi-joint evaluation system for the assessment of knee and ankle isokinetic and isometric strength provided reliable test-retest measures in healthy older adults. Ib.
Reliability of Computerized Neurocognitive Tests for Concussion Assessment: A Meta-Analysis.

PubMed

Farnsworth, James L; Dargo, Lucas; Ragan, Brian G; Kang, Minsoo

2017-09-01

Although widely used, computerized neurocognitive tests (CNTs) have been criticized because of low reliability and poor sensitivity. A systematic review was published summarizing the reliability of Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) scores; however, this was limited to a single CNT. Expansion of the previous review to include additional CNTs and a meta-analysis is needed. Therefore, our purpose was to analyze reliability data for CNTs using meta-analysis and examine moderating factors that may influence reliability. A systematic literature search (key terms: reliability, computerized neurocognitive test, concussion) of electronic databases (MEDLINE, PubMed, Google Scholar, and SPORTDiscus) was conducted to identify relevant studies. Studies were included if they met all of the following criteria: used a test-retest design, involved at least 1 CNT, provided sufficient statistical data to allow for effect-size calculation, and were published in English. Two independent reviewers investigated each article to assess inclusion criteria. Eighteen studies involving 2674 participants were retained. Intraclass correlation coefficients were extracted to calculate effect sizes and determine overall reliability. The Fisher Z transformation adjusted for sampling error associated with averaging correlations. Moderator analyses were conducted to evaluate the effects of the length of the test-retest interval, intraclass correlation coefficient model selection, participant demographics, and study design on reliability. Heterogeneity was evaluated using the Cochran Q statistic. The proportion of acceptable outcomes was greatest for the Axon Sports CogState Test (75%) and lowest for the ImPACT (25%). Moderator analyses indicated that the type of intraclass correlation coefficient model used significantly influenced effect-size estimates, accounting for 17% of the variation in reliability. The Axon Sports CogState Test, which has a higher proportion of acceptable outcomes and shorter test duration relative to other CNTs, may be a reliable option; however, future studies are needed to compare the diagnostic accuracy of these instruments.
Reliability and Validity of Gaze-Dependent Functional Vision Space: A Novel Metric Quantifying Visual Function in Infantile Nystagmus Syndrome.

PubMed

Roberts, Tawna L; Kester, Kristi N; Hertle, Richard W

2018-04-01

This study presents test-retest reliability of optotype visual acuity (OVA) across 60° of horizontal gaze position in patients with infantile nystagmus syndrome (INS). Also, the validity of the metric gaze-dependent functional vision space (GDFVS) is shown in patients with INS. In experiment 1, OVA was measured twice in seven horizontal gaze positions from 30° left to right in 10° steps in 20 subjects with INS and 14 without INS. Test-retest reliability was assessed using intraclass correlation coefficient (ICC) in each gaze. OVA area under the curve (AUC) was calculated with horizontal eye position on the x-axis, and logMAR visual acuity on the y-axis and then converted to GDFVS. In experiment 2, validity of GDFVS was determined over 40° horizontal gaze by applying the 95% limits of agreement from experiment 1 to pre- and post-treatment GDFVS values from 85 patients with INS. In experiment 1, test-retest reliability for OVA was high (ICC ≥ 0.88) as the difference in test-retest was on average less than 0.1 logMAR in each gaze position. In experiment 2, as a group, INS subjects had a significant increase (P < 0.001) in the size of their GDFVS that exceeded the 95% limits of agreement found during test-retest. OVA is a reliable measure in INS patients across 60° of horizontal gaze position. GDFVS is a valid clinical method to be used to quantify OVA as a function of eye position in INS patients. This method captures the dynamic nature of OVA in INS patients and may be a valuable measure to quantify visual function patients with INS, particularly in quantifying change as part of clinical studies.

Test-retest reliability and minimal detectable change of the Beck Depression Inventory and the Taiwan Geriatric Depression Scale in patients with Parkinson's disease

PubMed Central

Huang, Sheau-Ling; Hsieh, Ching-Lin; Wu, Ruey-Meei

2017-01-01

Background The Beck Depression Inventory II (BDI-II) and the Taiwan Geriatric Depression Scale (TGDS) are self-report scales used for assessing depression in patients with Parkinson’s disease (PD) and geriatric people. The minimal detectable change (MDC) represents the least amount of change that indicates real difference (i.e., beyond random measurement error) for a single subject. Our aim was to investigate the test-retest reliability and MDC of the BDI-II and the TGDS in people with PD. Methods Seventy patients were recruited from special clinics for movement disorders at a medical center. The patients’ mean age was 67.7 years, and 63.0% of the patients were male. All patients were assessed with the BDI-II and the TGDS twice, 2 weeks apart. We used the intraclass correlation coefficient (ICC) to determine the reliability between test and retest. We calculated the MDC based on standard error of measurement. The MDC% was calculated (i.e., by dividing the MDC by the possible maximal score of the measure). Results The test-retest reliabilities of the BDI-II/TGDS were high (ICC = 0.86/0.89). The MDCs (MDC%s) of the BDI-II and TGDS were 8.7 (13.8%) and 5.4 points (18.0%), respectively. Both measures had acceptable to nearly excellent random measurement errors. Conclusions The test-retest reliabilities of the BDI-II and the TGDS are high. The MDCs of both measures are acceptable to nearly excellent in people with PD. These findings imply that the BDI-II and the TGDS are suitable for use in a research context and in clinical settings to detect real change in a single subject. PMID:28945776
Reliability of cognitive tests of ELSA-Brasil, the brazilian longitudinal study of adult health

PubMed Central

Batista, Juliana Alves; Giatti, Luana; Barreto, Sandhi Maria; Galery, Ana Roscoe Papini; Passos, Valéria Maria de Azeredo

2013-01-01

Cognitive function evaluation entails the use of neuropsychological tests, applied exclusively or in sequence. The results of these tests may be influenced by factors related to the environment, the interviewer or the interviewee. OBJECTIVES We examined the test-retest reliability of some tests of the Brazilian version from the Consortium to Establish a Registry for Alzheimer's disease. METHODS The ELSA-Brasil is a multicentre study of civil servants (35-74 years of age) from public institutions across six Brazilian States. The same tests were applied, in different order of appearance, by the same trained and certified interviewer, with an approximate 20-day interval, to 160 adults (51% men, mean age 52 years). The Intraclass Correlation Coefficient (ICC) was used to assess the reliability of the measures; and a dispersion graph was used to examine the patterns of agreement between them. RESULTS We observed higher retest scores in all tests as well as a shorter test completion time for the Trail Making Test B. ICC values for each test were as following: Word List Learning Test (0.56), Word Recall (0.50), Word Recognition (0.35), Phonemic Verbal Fluency Test (VFT, 0.61), Semantic VFT (0.53) and Trail B (0.91). The Bland-Altman plot showed better correlation of executive function (VFT and Trail B) than of memory tests. CONCLUSIONS Better performance in retest may reflect a learning effect, and suggest that retest should be repeated using alternate forms or after longer periods. In this sample of adults with high schooling level, reliability was only moderate for memory tests whereas the measurement of executive function proved more reliable. PMID:29213860
Reliability and validity of the Japanese version of the Resilience Scale and its short version.

PubMed

Nishi, Daisuke; Uehara, Ritei; Kondo, Maki; Matsuoka, Yutaka

2010-11-17

The clinical relevance of resilience has received considerable attention in recent years. The aim of this study is to demonstrate the reliability and validity of the Japanese version of the Resilience Scale (RS) and short version of the RS (RS-14). The original English version of RS was translated to Japanese and the Japanese version was confirmed by back-translation. Participants were 430 nursing and university psychology students. The RS, Center for Epidemiologic Studies Depression Scale (CES-D), Rosenberg Self-Esteem Scale (RSES), Social Support Questionnaire (SSQ), Perceived Stress Scale (PSS), and Sheehan Disability Scale (SDS) were administered. Internal consistency, convergent validity and factor loadings were assessed at initial assessment. Test-retest reliability was assessed using data collected from 107 students at 3 months after baseline. Mean score on the RS was 111.19. Cronbach's alpha coefficients for the RS and RS-14 were 0.90 and 0.88, respectively. The test-retest correlation coefficients for the RS and RS-14 were 0.83 and 0.84, respectively. Both the RS and RS-14 were negatively correlated with the CES-D and SDS, and positively correlated with the RSES, SSQ and PSS (all p < 0.05), although the correlation between the RS and CES-D was somewhat lower than that in previous studies. Factor analyses indicated a one-factor solution for RS-14, but as for RS, the result was not consistent with previous studies. This study demonstrates that the Japanese version of RS has psychometric properties with high degrees of internal consistency, high test-retest reliability, and relatively low concurrent validity. RS-14 was equivalent to the RS in internal consistency, test-retest reliability, and concurrent validity. Low scores on the RS, a positive correlation between the RS and perceived stress, and a relatively low correlation between the RS and depressive symptoms in this study suggest that validity of the Japanese version of the RS might be relatively low compared with the original English version.
Greek cultural adaption and validation of the Kujala anterior knee pain scale in patients with patellofemoral pain syndrome.

PubMed

Papadopoulos, Costas; Constantinou, Antonis; Cheimonidou, Areti-Zoi; Stasinopoulos, Dimitrios

2017-04-01

To cross-culturally adapt and validate the Greek version of the Kujala anterior knee pain scale (KAKPS). The Greek KAKPS was translated from the original English version following standard forward and backward translation procedures. The survey was then conducted in clinical settings by a questionnaire comprising the Greek KAKPS and patellofemoral pain syndrome (PFPS) severity scale. A total of 130 (62 women and 68 men) Greek-reading patients between 18 and 45 years old with anterior knee pain (AKP) for at least four weeks were recruited from physical therapy clinics. To establish test-retest reliability, the patients were asked to complete the KAKPS at initial visit and 2-3 days after the initial visit. The Greek version of the PFPS severity scale was also administered once at initial visit. Internal consistency of the translated instrument was measured using Cronbach's α. An intraclass correlation coefficient was used to assess the test-retest reliability of the KAKPS. Concurrent validity was measured by correlating the KAKPS with the PFPS severity scale using Pearson's correlation coefficient. The results showed that the Greek KAKPS has good internal consistency (Cronbach's α = 0.942), test-retest reliability (ICC = 0.921) and concurrent validity (r > 0.7). This study has shown that the Greek KAKPS has good internal consistency, test-retest reliability and concurrent validity when correlated with the PFPS severity scale in adult patients with AKP for at least four weeks. Implications for rehabilitation The Greek version of the KAKPS has been found to be reliable and valid when used in adult patients with AKP for at least four weeks. The results of the psychometric characteristics were compatible with those of the original English version. The KAKPS could be applied in a Greek-speaking population to assess functional limitations and symptoms in patients aged 18-45 years old with AKP for at least four weeks.
Youth health risk behavior assessment in Fiji: The reliability of Global School-based Health Survey content adapted for ethnic Fijian girls

PubMed Central

Becker, Anne E.; Roberts, Andrea L.; Perloe, Alexandra; Bainivualiku, Asenaca; Richards, Lauren K.; Gilman, Stephen E.; Striegel-Moore, Ruth H.

2010-01-01

Objective The Global School-based Student Health Survey (GSHS) is an assessment for adolescent health risk behaviors and exposures, supported by the World Health Organization. Although already widely implemented—and intended for youth assessment across diverse ethnic and national contexts—no reliability data have yet been reported for GSHS-based assessment in any ethnicity or country-specific population. This study reports test-retest reliability for GSHS content adapted for a female adolescent ethnic Fijian study sample in Fiji. Design We adapted and translated GSHS content to assess health risk behaviors as part of a larger study investigating the impact of social transition on ethnic Fijian secondary schoolgirls in Fiji. In order to evaluate the performance of this measure for our ethnic Fijian study sample (n=523), we examined its test-retest reliability with kappa coefficients, % agreement, and prevalence estimates in a sub-sample (n=81). Reliability among strata defined by topic, age, and language was also examined. Results Average agreement between test and retest was 77%, and average Cohen's kappa was 0.47. Mean kappas for questions from core modules about alcohol use, tobacco use, and sexual behavior were substantial, and higher than those for modules relating to other risk behaviors. Conclusions Although test-retest reliability of responses within this country-specific version of GSHS content was substantial in several topical domains for this ethnic Fijian sample, only fair reliability for the module assessing dietary behaviors and other individual items suggests that population-specific psychometric evaluation is essential to interpreting language and country-specific GSHS data. PMID:20234961
Test-retest agreement and reliability of quantitative sensory testing 1 year after breast cancer surgery.

PubMed

Andersen, Kenneth Geving; Kehlet, Henrik; Aasvang, Eske Kvanner

2015-05-01

Quantitative sensory testing (QST) is used to assess sensory dysfunction and nerve damage by examining psychophysical responses to controlled, graded stimuli such as mechanical and thermal detection and pain thresholds. In the breast cancer population, 4 studies have used QST to examine persistent pain after breast cancer treatment, suggesting neuropathic pain being a prominent pain mechanism. However, the agreement and reliability of QST has not been described in the postsurgical breast cancer population, hindering exact interpretation of QST studies in this population. The aim of the present study was to assess test-retest properties of QST after breast cancer surgery. A total of 32 patients recruited from a larger ongoing prospective trial were examined with QST 12 months after breast cancer surgery and reexamined a week later. A standardized QST protocol was used, including sensory mapping for mechanical, warmth and cold areas of sensory dysfunction, mechanical thresholds using monofilaments and pin-prick, thermal thresholds including warmth and cold detection thresholds and heat pain threshold, with bilateral examination. Agreement and reliability were assessed by Bland-Altman plots, descriptive statistics, coefficients of variance, and intraclass correlation. Bland-Altman plots showed high variation on the surgical side. Intraclass coefficients ranged from 0.356 to 0.847 (moderate to substantial reliability). Between-patient variation was generally higher (0.9 to 14.5 SD) than within-patient variation (0.23 to 3.55 SD). There were no significant differences between pain and pain-free patients. The individual test-retest variability was higher on the operated side compared with the nonoperated side. The QST protocol reliability allows for group-to-group comparison of sensory function, but less so for individual follow-up after breast cancer surgery.
Translation, cross-cultural adaptation, and validation of the Turkish version of the Harris Hip Score.

PubMed

Çelik, Derya; Can, Canan; Aslan, Yasemin; Ceylan, Hasan Huseyin; Bilsel, Kerem; Ozdincler, Arzu Razak

2014-01-01

The Harris Hip Score (HHS) developed to assess function and pain from the perspective of patients hip pathologies. The purpose of this study was to translate and culturally adapt the HHS into Turkish, and thereby determine the reliability and validity of the translated version. The HHS was translated into Turkish in accordance with the stages recommended by Beaton. The measurement properties of the HHS were tested in 80 patients; 52 males, mean age 51 years (range 21-75 years) suffering from different hip pathologies. The test-retest reliability was tested in 58 patients; 28 males mean age, 52 years (range 30-73 years) after an interval of seven days. The Cronbach's Alpha was used to assess internal consistency and the intra-class correlation coefficient (ICC) was used to estimate the test-retest reliability. Patients were asked to answer the Oxford Hip Score (OHS), the Western Ontario and McMaster Universities Arthritis Index (WOMAC), the VAS and the Short Form-36 (SF-36) for the validity of the estimation. The Turkish version of the HHS showed sufficient internal consistency (Cronbach's alpha,0.70) and test-retest reliability (ICC = 0.91). The correlation coefficients between the HHS, the WOMAC and the OHS were 0.64 and 0.89 respectively. The highest correlations between the HHS and SF-36 were with the physical function scale (r = 0.72), and the lowest correlations were with the mental function scale (r = 0.10). We observed no floor or ceiling effects. The Turkish version of the HHS has sufficient reliability and validity to measure patient-reported outcome for Turkish-speaking individuals with a variety of hip disorders.
A study of the development of the Korean version of PedsQL(TM) 3.0 cerebral palsy module and reliability and validity.

PubMed

Yun, Young-Ju; Shin, Yong-Beom; Kim, Soo-Yeon; Shin, Myung-Jun; Kim, Ra-Jin; Oh, Tae-Young

2016-07-01

[Purpose] The purpose of this study was to develop the Korean version of the PedsQL(TM) 3.0 Cerebral Palsy Module to evaluate the health-related quality of life of children with cerebral palsy and to test the reliability and validity. [Subjects and Methods] The study included 108 caregivers of children with cerebral palsy aged 2 to 4 years and 72 caregivers of children aged 5 to 7 years, who visited multiple sites between February and August 2015. The Translation Commission performed the first translation with the approval of the Mapi Research Trust Company to create a Korean-version of the PedsQL(TM). Afterwards, back-translation was performed by one translator specializing in health and medical treatment who was a native English-speaker fluent in Korean, and one native Korean-speaker fluent in English. The consistency of each question was confirmed and a translation-integrated version was created. Test components were explained to caregivers during a one-on-one interview; caregivers then completed the PedsQL(TM) questionnaire and a Pediatric Evaluation Disability Inventory (PEDI) questionnaire. Subjects contributing to test-retest measures were asked to repeat the PedsQL questionnaire one week later and return it by mail. To assess data quality for the survey question results, non-response rate, ceiling effect, and floor effect were analyzed. Test-retest reliability and internal consistency reliability were assessed. For test-retest reliability, an intraclass correlation coefficient (ICC) was calculated, and for internal consistency reliability, Cronbach's alpha was used. To test criterion-related validity, Pearson's correlation coefficient was used. [Results] The content validity of the PedsQL 3.0 Cerebral Palsy Module was high for both age groups, and demonstrated significant internal consistency (>0.7) in all areas. For test-retest reliability, both groups demonstrated a significant ICC (>0.61). Correlation with the PEDI was statistically significant in all areas except pain and hurt. [Conclusion] The Korean version of the PedsQL(TM) 3.0 Cerebral Palsy Module was found to be reliable and valid, and is expected to contribute greatly to the evaluation of the quality of life of children with cerebral palsy.
The Dutch language anterior cruciate ligament return to sport after injury scale (ACL-RSI) - validity and reliability.

PubMed

Slagers, Anton J; Reininga, Inge H F; van den Akker-Scheek, Inge

2017-02-01

The ACL-Return to Sport after Injury scale (ACL-RSI) measures athletes' emotions, confidence in performance, and risk appraisal in relation to return to sport after ACL reconstruction. Aim of this study was to study the validity and reliability of the Dutch version of the ACL-RSI (ACL-RSI (NL)). Total 150 patients, who were 3-16 months postoperative, completed the ACL-RSI(NL) and 5 other questionnaires regarding psychological readiness to return to sports, knee-specific physical functioning, kinesiophobia, and health-specific locus of control. Construct validity of the ACL-RSI(NL) was determined with factor analysis and by exploring 10 hypotheses regarding correlations between ACL-RSI(NL) and the other questionnaires. For test-retest reliability, 107 patients (5-16 months postoperative) completed the ACL-RSI(NL) again 2 weeks after the first administration. Cronbach's alpha, Intraclass Correlation Coefficient (ICC), SEM, and SDC, were calculated. Bland-Altman analysis was conducted to assess bias between test and retest. Nine hypotheses (90%) were confirmed, indicating good construct validity. The ACL-RSI(NL) showed good internal consistency (Cronbach's alpha 0.94) and test-retest reliability (ICC 0.93). SEM was 5.5 and SDC was 15. A significant bias of 3.2 points between test and retest was found. Therefore, the ACL-RSI(NL) can be used to investigate psychological factors relevant to returning to sport after ACL reconstruction.
Validity and Reliability of the 8-Item Work Limitations Questionnaire.

PubMed

Walker, Timothy J; Tullar, Jessica M; Diamond, Pamela M; Kohl, Harold W; Amick, Benjamin C

2017-12-01

Purpose To evaluate factorial validity, scale reliability, test-retest reliability, convergent validity, and discriminant validity of the 8-item Work Limitations Questionnaire (WLQ) among employees from a public university system. Methods A secondary analysis using de-identified data from employees who completed an annual Health Assessment between the years 2009-2015 tested research aims. Confirmatory factor analysis (CFA) (n = 10,165) tested the latent structure of the 8-item WLQ. Scale reliability was determined using a CFA-based approach while test-retest reliability was determined using the intraclass correlation coefficient. Convergent/discriminant validity was tested by evaluating relations between the 8-item WLQ with health/performance variables for convergent validity (health-related work performance, number of chronic conditions, and general health) and demographic variables for discriminant validity (gender and institution type). Results A 1-factor model with three correlated residuals demonstrated excellent model fit (CFI = 0.99, TLI = 0.99, RMSEA = 0.03, and SRMR = 0.01). The scale reliability was acceptable (0.69, 95% CI 0.68-0.70) and the test-retest reliability was very good (ICC = 0.78). Low-to-moderate associations were observed between the 8-item WLQ and the health/performance variables while weak associations were observed between the demographic variables. Conclusions The 8-item WLQ demonstrated sufficient reliability and validity among employees from a public university system. Results suggest the 8-item WLQ is a usable alternative for studies when the more comprehensive 25-item WLQ is not available.
Test-retest reliability of lower limb isokinetic endurance in COPD: A comparison of angular velocities

PubMed Central

Ribeiro, Fernanda; Lépine, Pierre-Alexis; Garceau-Bolduc, Corine; Coats, Valérie; Allard, Étienne; Maltais, François; Saey, Didier

2015-01-01

Background The purpose of this study was to determine and compare the test-retest reliability of quadriceps isokinetic endurance testing at two knee angular velocities in patients with chronic obstructive pulmonary disease (COPD). Methods After one familiarization session, 14 patients with moderate to severe COPD (mean age 65±4 years; forced expiratory volume in 1 second (FEV1) 55%±18% predicted) performed two quadriceps isokinetic endurance tests on two separate occasions within a 5–7-day interval. Quadriceps isokinetic endurance tests consisted of 30 maximal knee extensions at angular velocities of 90° and 180° per second, performed in random order. Test-retest reliability was assessed for peak torque, muscle endurance, work slope, work fatigue index, and changes in FEV1 for dyspnea and leg fatigue from rest to the end of the test. The intraclass correlation coefficient, minimal detectable change, and limits of agreement were calculated. Results High test-retest reliability was identified for peak torque and muscle total work at both velocities. Work fatigue index was considered reliable at 90° per second but not at 180° per second. A lower reliability was identified for dyspnea and leg fatigue scores at both angular velocities. Conclusion Despite a limited sample size, our findings support the use of a 30-maximal repetition isokinetic muscle testing procedure at angular velocities of 90° and 180° per second in patients with moderate to severe COPD. Endurance measurement (total isokinetic work) at 90° per second was highly reliable, with a minimal detectable change at the 95% confidence level of 10%. Peak torque and fatigue index could also be assessed reliably at 90° per second. Evaluation of dyspnea and leg fatigue using the modified Borg scale of perceived exertion was poorly reliable and its clinical usefulness is questionable. These results should be useful in the design and interpretation of future interventions aimed at improving muscle endurance in COPD. PMID:26124656
The Female Sexual Function Index (FSFI): linguistic validation of the Italian version.

PubMed

Filocamo, Maria Teresa; Serati, Maurizio; Li Marzi, Vincenzo; Costantini, Elisabetta; Milanesi, Martina; Pietropaolo, Amelia; Polledro, Patrizio; Gentile, Barbara; Maruccia, Serena; Fornia, Samanta; Lauri, Irene; Alei, Rosanna; Arcangeli, Paola; Sighinolfi, Maria Chiara; Manassero, Francesca; Andretta, Elena; Palazzetti, Anna; Bertelli, Elena; Del Popolo, Giulio; Villari, Donata

2014-02-01

Although several new measurements for female sexual dysfunction (FSD) have recently been developed, the Female Sexual Function Index (FSFI) remains the gold standard for screening and one of the most widely used questionnaires. The Italian translation of the FSFI has been used in several studies conducted in Italy, but a linguistic validation of the Italian version does not exist. The aim of this study was to perform a linguistic validation of the Italian version of the FSFI. A multicenter cross-sectional study conducted in 14 urological and gynecological clinics, uniformly distributed over Italian territory. We performed all steps necessary to determine the reliability and the test-retest reliability of the Italian version of the FSFI. The study population was a convenience sample of 409 Italian women. The reliability of the questionnaire was calculated using Cronbach's alpha, which was considered weak, moderate, or high if its value was found less than 0.6, between 0.6 and 0.8, or equal to or greater than 0.8, respectively. The test-retest reliability was assessed for all women in the sample by calculating Pearson's concordance correlation coefficient for each domain and for the total score, both at baseline and after 15 days (r range between -1.00 to +1.00, where +1.00 indicates the strongest positive association). Cronbach's alpha coefficients for total and domain score were sufficiently high, ranging from 0.92 to 0.97 for the total sample. The test-retest procedure revealed that the concordance correlation coefficient was very high both for FSFI-I total score (Pearson's P = 0.93) and for each domain (Pearson's P always >0.92). For the first time in the literature, our study has produced a validated and reliable Italian version of the FSFI questionnaire. Consequently, the Italian FSFI can be used as a reliable tool for preliminary screening for female sexual dysfunction for Italian women. © 2013 International Society for Sexual Medicine.
Assessing the test-retest repeatability of the Vietnamese version of the National Eye Institute 25-item Visual Function Questionnaire among bilateral cataract patients for a Vietnamese population.

PubMed

To, Kien Gia; Meuleners, Lynn; Chen, Huei-Yang; Lee, Andy; Do, Dung Van; Duong, Dat Van; Phi, Tien Duy; Tran, Hoang Huy; Nguyen, Nguyen Do

2014-06-01

To determine the test-retest repeatability of the National Eye Institute 25-item Visual Function Questionnaire (NEI VFQ-25) for use with older Vietnamese adults with bilateral cataract. The questionnaire was translated into Vietnamese and back-translated into English by two independent translators. Patients with bilateral cataract aged 50 and older completed the questionnaire on two separate occasions, one to two weeks after first administration of the questionnaire. Test-retest repeatability was assessed using the Cronbach's α and intraclass correlation coefficients. The average age of participants was 67 ± 8 years and most participants were female (73%). Internal consistency was acceptable with the α coefficient above 0.7 for all subscales and intraclass correlation coefficients were 0.6 or greater in all subscales. The Vietnamese NEI VFQ-25 is reliable for use in studies assessing vision-related quality of life in older adults with bilateral cataract in Vietnam. We propose some modifications to the NEI-VFQ questions to reflect activities of older people in Vietnam. © 2013 ACOTA.
Preliminary validation and reliability of the Short Form Chronic Respiratory Disease Questionnaire in a lung cancer population.

PubMed

Charalambous, A; Molassiotis, A

2017-01-01

The Short Form Chronic Respiratory Questionnaire (SF-CRQ) is frequently used in patients with obstructive pulmonary disease and it has demonstrated excellent psychometric properties. Since there is no psychometric information for its use with lung cancer patients, this study explored its validity and reliability in this population. Forty-six patients were assessed at two time points (with a 4-week interval) using the SF-CRQ, the modified Borg Scale, five numerical rating scales related to Perceived Severity of Breathlessness, and the Hospital Anxiety and Depression Scale. Internal consistency reliability was investigated by Cronbach's alpha reliability coefficient, test-retest reliability by Spearman-Brown reliability coefficient (P), content validity as well as convergent validity by Pearson's correlation coefficient between the SF-CRQ, and the conceptual similar scales mentioned above were explored. A principal component factor analysis was performed. The internal consistency was high [α = 0.88 (baseline) and 0.91 (after 1 month)]. The SF-CRQ had good stability with test-retest reliability ranging from r = 0.64 to 0.78, P < 0.001. Factor analysis suggests a single construct in this population. The preliminary data analyses supported the convergent, content, and construct validity of the SF-CRQ providing promising evidence that this can be a valid and reliable instrument for the assessment of quality of life related to breathlessness in lung cancer patients. © 2015 John Wiley & Sons Ltd.
The validity and reliability of the Functional Strength Measurement (FSM) in children with intellectual disabilities.

PubMed

Aertssen, W F M; Steenbergen, B; Smits-Engelsman, B C M

2018-06-07

There is lack of valid and reliable field-based tests for assessing functional strength in young children with mild intellectual disabilities (IDs). The aim of this study was to investigate the test-retest reliability and construct validity of the Functional Strength Measurement in children with ID (FSM-ID). Fifty-two children with mild ID (40 boys and 12 girls, mean age 8.48 years, SD = 1.48) were tested with the FSM. Test-retest reliability (n = 32) was examined by a two-way interclass correlation coefficient for agreement (ICC 2.1A). Standard error of measurement and smallest detectable change were calculated. Construct validity was determined by calculating correlations between the FSM-ID and handheld dynamometry (HHD) (convergent validity), FSM-ID, FSM-ID and subtest strength of the Bruininks-Oseretsky test of motor proficiency - second edition (BOT-2) (convergent validity) and the FSM-ID and balance subtest of the BOT-2 (discriminant validity). Test-retest reliability ICC ranged 0.89-0.98. Correlation between the items of the FSM-ID and HHD ranged 0.39-0.79 and between FSM-ID and BOT-2 (strength items) 0.41-0.80. Correlation between items of the FSM-ID and BOT-2 (balance items) ranged 0.41-0.70. The FSM-ID showed good test-retest reliability and good convergent validity with the HHD and BOT-2 subtest strength. The correlations assessing discriminant validity were higher than expected. Poor levels of postural control and core stability in children with mild IDs may be the underlying factor of those higher correlations. © 2018 MENCAP and International Association of the Scientific Study of Intellectual and Developmental Disabilities and John Wiley & Sons Ltd.
Test-retest reliability of an fMRI paradigm for studies of cardiovascular reactivity.

PubMed

Sheu, Lei K; Jennings, J Richard; Gianaros, Peter J

2012-07-01

We examined the reliability of measures of fMRI, subjective, and cardiovascular reactions to standardized versions of a Stroop color-word task and a multisource interference task. A sample of 14 men and 12 women (30-49 years old) completed the tasks on two occasions, separated by a median of 88 days. The reliability of fMRI BOLD signal changes in brain areas engaged by the tasks was moderate, and aggregating fMRI BOLD signal changes across the tasks improved test-retest reliability metrics. These metrics included voxel-wise intraclass correlation coefficients (ICCs) and overlap ratio statistics. Task-aggregated ratings of subjective arousal, valence, and control, as well as cardiovascular reactions evoked by the tasks showed ICCs of 0.57 to 0.87 (ps < .001), indicating moderate-to-strong reliability. These findings support using these tasks as a battery for fMRI studies of cardiovascular reactivity. Copyright © 2012 Society for Psychophysiological Research.
Reliability and validity of the Chinese pediatric voice handicap index.

PubMed

Liu, Kena; Liu, Shaofeng; Zhou, Zhou; Ren, Qinyi; Zhong, Jie; Luo, Renzhong; Qin, Huabiao; Zhang, Siyi; Ge, Pingjiang

2018-02-01

To evaluate the reliability and validity of the Chinese version of pediatric voice handicap index (pVHI). The original English version-pVHI was translated into Chinese. Parents of 52 children with voice dysphonia and 43 children with no history or symptoms of voice problems were asked to fill the Chinese pVHI questionnaires twice with an interval of 2 weeks. GRB (Grade, Roughness, Breathiness) scale was used for perceptual assessment by two otolaryngologists and one speech pathologist for each child's voice. The internal consistency was assessed using Cronbach's alpha coefficient. Pearson's correlation coefficient was used to evaluate the test-retest reliability. The Kendall's coefficient of concordance W was used to assess the consistency of GRB scores of 3 voice specialists. The nonparametric Mann-Whitney test was used to assess the differences between the dysphonia group and controls. The correlation between pVHI and GRB scores were assessed using Pearson's correlation coefficient. The internal consistency of total score and three subscales scores of Chinese pVHI were 0.788-0.944. The test-retest reliability was 0.631-0.887(P < .001). The pVHI scores of control group significantly were lower than the pathological group (P = .000). The GRB scores of 3 voice specialists have an excellent consistency (W = 0.694-0.807, P = .000). The pVHI scores positively correlated with GRB assessment (P < .01). The Chinese version of pVHI had a good reliability and validity. It can be applicable and useful supplementary tool for evaluating parents' perception of their children's dysphonia. Copyright © 2017. Published by Elsevier B.V.
Extensive validation of the pain disability index in 3 groups of patients with musculoskeletal pain.

PubMed

Soer, Remko; Köke, Albère J A; Vroomen, Patrick C A J; Stegeman, Patrick; Smeets, Rob J E M; Coppes, Maarten H; Reneman, Michiel F

2013-04-20

A cross-sectional study design was performed. To validate the pain disability index (PDI) extensively in 3 groups of patients with musculoskeletal pain. The PDI is a widely used and studied instrument for disability related to various pain syndromes, although there is conflicting evidence concerning factor structure, test-retest reliability, and missing items. Additionally, an official translation of the Dutch language version has never been performed. For reliability, internal consistency, factor structure, test-retest reliability and measurement error were calculated. Validity was tested with hypothesized correlations with pain intensity, kinesiophobia, Rand-36 subscales, Depression, Roland-Morris Disability Questionnaire, Quality of Life, and Work Status. Structural validity was tested with independent backward translation and approval from the original authors. One hundred seventy-eight patients with acute back pain, 425 patients with chronic low back pain and 365 with widespread pain were included. Internal consistency of the PDI was good. One factor was identified with factor analyses. Test-retest reliability was good for the PDI (intraclass correlation coefficient, 0.76). Standard error of measurement was 6.5 points and smallest detectable change was 17.9 points. Little correlations between the PDI were observed with kinesiophobia and depression, fair correlations with pain intensity, work status, and vitality and moderate correlations with the Rand-36 subscales and the Roland-Morris Disability Questionnaire. The PDI-Dutch language version is internally consistent as a 1-factor structure, and test-retest reliable. Missing items seem high in sexual and professional items. Using the PDI as a 2-factor questionnaire has no additional value and is unreliable.
Test-retest reliability and comparability of paper and computer questionnaires for the Finnish version of the Tampa Scale of Kinesiophobia.

PubMed

Koho, P; Aho, S; Kautiainen, H; Pohjolainen, T; Hurri, H

2014-12-01

To estimate the internal consistency, test-retest reliability and comparability of paper and computer versions of the Finnish version of the Tampa Scale of Kinesiophobia (TSK-FIN) among patients with chronic pain. In addition, patients' personal experiences of completing both versions of the TSK-FIN and preferences between these two methods of data collection were studied. Test-retest reliability study. Paper and computer versions of the TSK-FIN were completed twice on two consecutive days. The sample comprised 94 consecutive patients with chronic musculoskeletal pain participating in a pain management or individual rehabilitation programme. The group rehabilitation design consisted of physical and functional exercises, evaluation of the social situation, psychological assessment of pain-related stress factors, and personal pain management training in order to regain overall function and mitigate the inconvenience of pain and fear-avoidance behaviour. The mean TSK-FIN score was 37.1 [standard deviation (SD) 8.1] for the computer version and 35.3 (SD 7.9) for the paper version. The mean difference between the two versions was 1.9 (95% confidence interval 0.8 to 2.9). Test-retest reliability was 0.89 for the paper version and 0.88 for the computer version. Internal consistency was considered to be good for both versions. The intraclass correlation coefficient for comparability was 0.77 (95% confidence interval 0.66 to 0.85), indicating substantial reliability between the two methods. Both versions of the TSK-FIN demonstrated substantial intertest reliability, good test-retest reliability, good internal consistency and acceptable limits of agreement, suggesting their suitability for clinical use. However, subjects tended to score higher when using the computer version. As such, in an ideal situation, data should be collected in a similar manner throughout the course of rehabilitation or clinical research. Copyright © 2014 Chartered Society of Physiotherapy. Published by Elsevier Ltd. All rights reserved.
Test-retest and between-site reliability in a multicenter fMRI study.

PubMed

Friedman, Lee; Stern, Hal; Brown, Gregory G; Mathalon, Daniel H; Turner, Jessica; Glover, Gary H; Gollub, Randy L; Lauriello, John; Lim, Kelvin O; Cannon, Tyrone; Greve, Douglas N; Bockholt, Henry Jeremy; Belger, Aysenil; Mueller, Bryon; Doty, Michael J; He, Jianchun; Wells, William; Smyth, Padhraic; Pieper, Steve; Kim, Seyoung; Kubicki, Marek; Vangel, Mark; Potkin, Steven G

2008-08-01

In the present report, estimates of test-retest and between-site reliability of fMRI assessments were produced in the context of a multicenter fMRI reliability study (FBIRN Phase 1, www.nbirn.net). Five subjects were scanned on 10 MRI scanners on two occasions. The fMRI task was a simple block design sensorimotor task. The impulse response functions to the stimulation block were derived using an FIR-deconvolution analysis with FMRISTAT. Six functionally-derived ROIs covering the visual, auditory and motor cortices, created from a prior analysis, were used. Two dependent variables were compared: percent signal change and contrast-to-noise-ratio. Reliability was assessed with intraclass correlation coefficients derived from a variance components analysis. Test-retest reliability was high, but initially, between-site reliability was low, indicating a strong contribution from site and site-by-subject variance. However, a number of factors that can markedly improve between-site reliability were uncovered, including increasing the size of the ROIs, adjusting for smoothness differences, and inclusion of additional runs. By employing multiple steps, between-site reliability for 3T scanners was increased by 123%. Dropping one site at a time and assessing reliability can be a useful method of assessing the sensitivity of the results to particular sites. These findings should provide guidance toothers on the best practices for future multicenter studies.

Reliability of primary caregivers reports on lifestyle behaviours of European pre-school children: the ToyBox-study.

PubMed

González-Gil, E M; Mouratidou, T; Cardon, G; Androutsos, O; De Bourdeaudhuij, I; Góźdź, M; Usheva, N; Birnbaum, J; Manios, Y; Moreno, L A

2014-08-01

Reliable assessments of health-related behaviours are necessary for accurate evaluation on the efficiency of public health interventions. The aim of the current study was to examine the reliability of a self-administered primary caregivers questionnaire (PCQ) used in the ToyBox-intervention. The questionnaire consisted of six sections addressing sociodemographic and perinatal factors, water and beverages consumption, physical activity, snacking and sedentary behaviours. Parents/caregivers from six countries (Belgium, Bulgaria, Germany, Greece, Poland and Spain) were asked to complete the questionnaire twice within a 2-week interval. A total of 93 questionnaires were collected. Test-retest reliability was assessed using intra-class correlation coefficient (ICC). Reliability of the six questionnaire sections was assessed. A stronger agreement was observed in the questions addressing sociodemographic and perinatal factors as opposed to questions addressing behaviours. Findings showed that 92% of the ToyBox PCQ had a moderate-to-excellent test-retest reliability (defined as ICC values from 0.41 to 1) and less than 8% poor test-retest reliability (ICC < 0.40). Out of the total ICC values, 67% showed good-to-excellent reliability (ICC from 0.61 to 1). We conclude that the PCQ is a reliable tool to assess sociodemographic characteristics, perinatal factors and lifestyle behaviours of pre-school children and their families participating in the ToyBox-intervention. © 2014 World Obesity.
The reliability and validity of a Japanese version of symptom checklist 90 revised

PubMed Central

Tomioka, Mitsunao; Shimura, Midori; Hidaka, Mikio; Kubo, Chiharu

2008-01-01

Objective To examine the validity and reliability of a Japanese version of the Symptom Checklist 90 Revised (SCL-90-R (J)). Methods The English SCL-90-R was translated to Japanese and the Japanese version confirmed by back-translation. To determine the factor validity and internal consistency of the nine primary subscales, 460 people from the community completed SCL-90-R(J). Test-retest reliability was examined for 104 outpatients and 124 healthy undergraduate students. The convergent-discriminant validity was determined for 80 inpatients who replied to both SCL-90-R(J) and the Minnesota Multiphasic Personality Inventory (MMPI). Results The correlation coefficients between the nine primary subscales and items were .26 to .78. Cronbach's alpha coefficients were from .76 (Phobic Anxiety) to .86 (Interpersonal Sensitivity). Pearson's correlation coefficients between test-retest scores were from .81 (Psychoticism) to .90 (Somatization) for the outpatients and were from .64 (Phobic Anxiety) to .78 (Paranoid Ideation) for the students. Each of the nine primary subscales correlated well with their corresponding constructs in the MMPI. Conclusion We confirmed the validity and reliability of SCL-90-R(J) for the measurement of individual distress. The nine primary subscales were consistent with the items of the original English version. PMID:18957078
Translation, cultural adaption, and test-retest reliability of Chinese versions of the Edinburgh Handedness Inventory and Waterloo Footedness Questionnaire.

PubMed

Yang, Nan; Waddington, Gordon; Adams, Roger; Han, Jia

2018-05-01

Quantitative assessments of handedness and footedness are often required in studies of human cognition and behaviour, yet no reliable Chinese versions of commonly used handedness and footedness questionnaires are available. Accordingly, the objective of the present study was to translate the Edinburgh Handedness Inventory (EHI) and the Waterloo Footedness Questionnaire-Revised (WFQ-R) into Mandarin Chinese and to evaluate the reliability and validity of these translated versions in healthy Chinese people. In the first stage of the study, Chinese versions of the EHI and WFQ-R were produced from a process of translation, back translation and examination, with necessary cultural adaptations. The second stage involved determining the reliability and validity of the translated EHI and WFQ-R for the Chinese population. One hundred and ten Chinese participants were tested online, and the results showed that the Cronbach's alpha coefficient of internal consistency was 0.877 for the translated EHI and 0.855 for the translated WFQ-R. Another 170 Chinese participants were tested and re-tested after a 30-day interval. The intra-class correlation coefficients showed high reliability, 0.898 for the translated EHI and 0.869 for the translated WFQ-R. This preliminary validation study found the translated versions to be reliable and valid tools for assessing handedness and footedness in this population.
Validity and test-retest reliability of the self-completion adult social care outcomes toolkit (ASCOT-SCT4) with adults with long-term physical, sensory and mental health conditions in England.

PubMed

Rand, Stacey; Malley, Juliette; Towers, Ann-Marie; Netten, Ann; Forder, Julien

2017-08-18

The Adult Social Care Outcomes Toolkit (ASCOT-SCT4) is a multi-attribute utility index designed for the evaluation of long-term social care services. The measure comprises eight attributes that capture aspects of social care-related quality of life. The instrument has previously been validated with a sample of older adults who used home care services in England. This paper aims to demonstrate the instrument's test-retest reliability and provide evidence for its validity in a diverse sample of adults who use publicly-funded, community-based social care in England. A survey of 770 social care service users was conducted in England. A subsample of 100 services users participated in a follow-up interview between 7 and 21 days after baseline. Spearman rank correlation coefficients between the ASCOT-SCT4 index score and the EQ-5D-3 L, the ICECAP-A or ICECAP-O and overall quality of life were used to assess convergent validity. Data on variables hypothesised to be related to the ASCOT-SCT4 index score, as well as rating of individual attributes, were also collected. Hypothesised relationships were tested using one-way ANOVA or Fisher's exact test. Test-retest reliability was assessed using the intra-class correlation coefficient for the ASCOT-SCT4 index score at baseline and follow-up. There were moderate to strong correlations between the ASCOT-SCT4 index and EQ-5D-3 L, the ICECAP-A or ICECAP-O, and overall quality of life (all correlations ≥ 0.3). The construct validity was further supported by statistically significant hypothesised relationships between the ASCOT-SCT4 index and individual characteristics in univariate and multivariate analysis. There was also further evidence for the construct validity for the revised Food and drink and Dignity items. The test-retest reliability was considered to be good (ICC = 0.783; 95% CI: 0.678-0.857). The ASCOT-SCT4 index has good test-retest reliability for adults with physical or sensory disabilities who use social care services. The index score and the attributes appear to be valid for adults receiving social care for support reasons connected to underlying mental health problems, and physical or sensory disabilities. Further reliability testing with a wider sample of social care users is warranted, as is further exploration of the relationship between the ASCOT-SCT4, ICECAP-A/O and EQ-5D-3 L indices.
Validity of trunk extensor and flexor torque measurements using isokinetic dynamometry.

PubMed

Guilhem, Gaël; Giroux, Caroline; Couturier, Antoine; Maffiuletti, Nicola A

2014-12-01

This study aimed to evaluate the validity and test-retest reliability of trunk muscle strength testing performed with a latest-generation isokinetic dynamometer. Eccentric, isometric, and concentric peak torque of the trunk flexor and extensor muscles was measured in 15 healthy subjects. Muscle cross sectional area (CSA) and surface electromyographic (EMG) activity were respectively correlated to peak torque and submaximal isometric torque for erector spinae and rectus abdominis muscles. Reliability of peak torque measurements was determined during test and retest sessions. Significant correlations were consistently observed between muscle CSA and peak torque for all contraction types (r=0.74-0.85; P<0.001) and between EMG activity and submaximal isometric torque (r ⩾ 0.99; P<0.05), for both extensor and flexor muscles. Intraclass correlation coefficients were comprised between 0.87 and 0.95, and standard errors of measurement were lower than 9% for all contraction modes. The mean difference in peak torque between test and retest ranged from -3.7% to 3.7% with no significant mean directional bias. Overall, our findings establish the validity of torque measurements using the tested trunk module. Also considering the excellent test-retest reliability of peak torque measurements, we conclude that this latest-generation isokinetic dynamometer could be used with confidence to evaluate trunk muscle function for clinical or athletic purposes. Copyright © 2014 Elsevier Ltd. All rights reserved.
Test-retest reliability of quantitative sensory testing for mechanical somatosensory and pain modulation assessment of masticatory structures.

PubMed

Costa, Y M; Morita-Neto, O; de Araújo-Júnior, E N S; Sampaio, F A; Conti, P C R; Bonjardim, L R

2017-03-01

Assessing the reliability of medical measurements is a crucial step towards the elaboration of an applicable clinical instrument. There are few studies that evaluate the reliability of somatosensory assessment and pain modulation of masticatory structures. This study estimated the test-retest reliability, that is over time, of the mechanical somatosensory assessment of anterior temporalis, masseter and temporomandibular joint (TMJ) and the conditioned pain modulation (CPM) using the anterior temporalis as the test site. Twenty healthy women were evaluated in two sessions (1 week apart) by the same examiner. Mechanical detection threshold (MDT), mechanical pain threshold (MPT), wind-up ratio (WUR) and pressure pain threshold (PPT) were assessed on the skin overlying the anterior temporalis, masseter and TMJ of the dominant side. CPM was tested by comparing PPT before and during the hand immersion in a hot water bath. anova and intra-class correlation coefficients (ICCs) were applied to the data (α = 5%). The overall ICCs showed acceptable values for the test-retest reliability of mechanical somatosensory assessment of masticatory structures. The ICC values of 75% of all quantitative sensory measurements were considered fair to excellent (fair = 8·4%, good = 33·3% and excellent = 33·3%). However, the CPM paradigm presented poor reliability (ICC = 0·25). The mechanical somatosensory assessment of the masticatory structures, but not the proposed CPM protocol, can be considered sufficiently reliable over time to evaluate the trigeminal sensory function. © 2016 John Wiley & Sons Ltd.
The Validation and Reliability of the Chinese Version of the Speech Handicap Index for Patients With Oral and Oropharyngeal Cancer.

PubMed

Li, Tianzhu; Ma, Lian; Mao, Chi

2016-03-01

The purpose of this study was to investigate the validity and reliability of the translated Chinese version of the Speech Handicap Index (SHI) questionnaire for Chinese-speaking patients with oral and oropharyngeal cancer. The original English version of the SHI was translated into Chinese. Forty-two consecutive patients with oral and oropharyngeal cancer were included in the study. All subjects were asked to complete the Chinese version of the SHI and the University of Washington Quality of Life Questionnaire (UWQOL V.04). Fifteen patients were randomly retested on both questionnaires 2 weeks later. The internal consistency, test-retest reliability, construct validity, and group validity of the Chinese version of the SHI were tested using Cronbach α, Spearman correlation coefficient (r), and Mann-Whitney U tests. Descriptive and bivariate statistics were computed, and the P value was set to 0.05. The Cronbach α for the total SHI, the speech domain, and the psychosocial domain were 0.96, 0.90, and 0.92, respectively. The test-retest reliability scores for the total SHI, the speech domain, the psychosocial domain, and the overall question were 0.94, 0.97, 0.90, and 0.83, respectively. To measure construct validity, Spearman correlation coefficients between different items of the SHI and the UWQOL were all >0.4, which signified a moderate to significant correlation. There were significant differences between patient groups when divided by age, clinical stage, educational level, radiotherapy, and reconstruction, on all or on parts of the various SHI domains. The Chinese version of the SHI is a valid and reliable tool for the speech assessment of patients with oral and oropharyngeal cancer. Copyright © 2016 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Reliability and validity of the multimedia activity recall in children and adults (MARCA) in people with chronic obstructive pulmonary disease.

PubMed

Hunt, Toby; Williams, Marie T; Olds, Tim S

2013-01-01

To determine the reliability and validity of the Multimedia Activity Recall for Children and Adults (MARCA) in people with chronic obstructive pulmonary disease (COPD). People with COPD and their carers completed the Multimedia Activity Recall for Children and Adults (MARCA) for four, 24-hour periods (including test-retest of 2 days) while wearing a triaxial accelerometer (Actigraph GT3X+®), a multi-sensor armband (Sensewear Pro3®) and a pedometer (New Lifestyles 1000®). Self reported activity recalls (MARCA) and objective activity monitoring (Accelerometry) were recorded under free-living conditions. 24 couples were included in the analysis (COPD; age 74.4 ± 7.9 yrs, FEV1 54 ± 13% Carer; age 69.6 ± 10.9 yrs, FEV1 99 ± 24%). Not applicable. Test-retest reliability was compared for MARCA activity domains and different energy expenditure zones. Validity was assessed between MARCA-derived physical activity level (in metabolic equivalent of task (MET) per minute), duration of moderate to vigorous physical activity (min) and related data from the objective measurement devices. Analysis included intra-class correlation coefficients (ICC), Bland-Altman analyses, paired t-tests (p) and Spearman's rank correlation coefficients (rs). Reliability between occasions of recall for all activity domains was uniformly high, with test-retest correlations consistently >0.9. Validity correlations were moderate to strong (rs = 0.43-0.80) across all comparisons. The MARCA yields comparable PAL estimates and slightly higher moderate to vigorous physical activity (MVPA) estimates. In older adults with chronic illness, the MARCA is a valid and reliable tool for capturing not only the time and energy expenditure associated with physical and sedentary activities but also information on the types of activities.
Validity and reliability of Optojump photoelectric cells for estimating vertical jump height.

PubMed

Glatthorn, Julia F; Gouge, Sylvain; Nussbaumer, Silvio; Stauffacher, Simone; Impellizzeri, Franco M; Maffiuletti, Nicola A

2011-02-01

Vertical jump is one of the most prevalent acts performed in several sport activities. It is therefore important to ensure that the measurements of vertical jump height made as a part of research or athlete support work have adequate validity and reliability. The aim of this study was to evaluate concurrent validity and reliability of the Optojump photocell system (Microgate, Bolzano, Italy) with force plate measurements for estimating vertical jump height. Twenty subjects were asked to perform maximal squat jumps and countermovement jumps, and flight time-derived jump heights obtained by the force plate were compared with those provided by Optojump, to examine its concurrent (criterion-related) validity (study 1). Twenty other subjects completed the same jump series on 2 different occasions (separated by 1 week), and jump heights of session 1 were compared with session 2, to investigate test-retest reliability of the Optojump system (study 2). Intraclass correlation coefficients (ICCs) for validity were very high (0.997-0.998), even if a systematic difference was consistently observed between force plate and Optojump (-1.06 cm; p < 0.001). Test-retest reliability of the Optojump system was excellent, with ICCs ranging from 0.982 to 0.989, low coefficients of variation (2.7%), and low random errors (±2.81 cm). The Optojump photocell system demonstrated strong concurrent validity and excellent test-retest reliability for the estimation of vertical jump height. We propose the following equation that allows force plate and Optojump results to be used interchangeably: force plate jump height (cm) = 1.02 × Optojump jump height + 0.29. In conclusion, the use of Optojump photoelectric cells is legitimate for field-based assessments of vertical jump height.
Validity and reliability of head posture measurement using Microsoft Kinect.

PubMed

Oh, Baek-Lok; Kim, Jongmin; Kim, Jongshin; Hwang, Jeong-Min; Lee, Jehee

2014-11-01

To investigate the validity and reliability of Microsoft Kinect-based head tracker (KHT) for measuring head posture. Considering the cervical range of motion (CROM) as a reference, one-dimensional and three-dimensional (1D and 3D) head postures of 12 normal subjects (28-58 years of age; 6 women and 6 men) were obtained using the KHT. The KHT was validated by Pearson's correlation coefficient and intraclass correlation (ICC) coefficient. Test-retest reliability of the KHT was determined by its 95% limit of agreement (LoA) with the Bland-Altman plot. Face recognition success rate was evaluated for each head posture. Measurements of 1D and 3D head posture performed using the KHT were very close to those of the CROM with correlation coefficients of 0.99 and 0.97 (p<0.05), respectively, as well as with an ICC of >0.99 and 0.98, respectively. The reliability tests of the KHT in terms of 1D and 3D head postures had 95% LoA angles of approximately ±2.5° and ±6.5°, respectively. The KHT showed good agreement with the CROM and relatively favourable test-retest reliability. Considering its high performance, convenience and low cost, KHT could be clinically used as a head posture-measuring system. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Psychometrics of the Home Safety Self-Assessment Tool (HSSAT) to prevent falls in community-dwelling older adults.

PubMed

Tomita, Machiko R; Saharan, Sumandeep; Rajendran, Sheela; Nochajski, Susan M; Schweitzer, Jo A

2014-01-01

OBJECTIVE. To identify psychometric properties of the Home Safety Self-Assessment Tool (HSSAT) to prevent falls in community-dwelling older adults. METHOD. We tested content validity, test-retest reliability, interrater reliability, construct validity, convergent and discriminant validity, and responsiveness to change. RESULTS. The content validity index was .98, the intraclass correlation coefficient for test-retest reliability was .97, and the interrater reliability was .89. The difference on identified risk factors between the use and nonuse of the HSSAT was significant (p = .005). Convergent validity with the Centers for Disease Control and Prevention Home Safety Checklist was high (r = .65), and discriminant validity with fear of falling was very low (r = .10). The responsiveness to change was moderate (standardized response mean = 0.57). CONCLUSION. The HSSAT is a reliable and valid instrument to identify fall risks in a home environment, and the HSSAT booklet is effective as educational material leading to improvement in home safety. Copyright © 2014 by the American Occupational Therapy Association, Inc.
Reliability and validity of television food advertising questionnaire in Malaysia.

PubMed

Zalma, Abdul Razak; Safiah, Md Yusof; Ajau, Danis; Khairil Anuar, Md Isa

2015-09-01

Interventions to counter the influence of television food advertising amongst children are important. Thus, reliable and valid instrument to assess its effect is needed. The objective of this study was to determine the reliability and validity of such a questionnaire. The questionnaire was administered twice on 32 primary schoolchildren aged 10-11 years in Selangor, Malaysia. The interval between the first and second administration was 2 weeks. Test-retest method was used to examine the reliability of the questionnaire. Intra-rater reliability was determined by kappa coefficient and internal consistency by Cronbach's alpha coefficient. Construct validity was evaluated using factor analysis. The test-retest correlation showed moderate-to-high reliability for all scores (r = 0.40*, p = 0.02 to r = 0.95**, p = 0.00), with one exception, consumption of fast foods (r = 0.24, p = 0.20). Kappa coefficient showed acceptable-to-strong intra-rater reliability (K = 0.40-0.92), except for two items under knowledge on television food advertising (K = 0.26 and K = 0.21) and one item under preference for healthier foods (K = 0.33). Cronbach's alpha coefficient indicated acceptable internal consistency for all scores (0.45-0.60). After deleting two items under Consumption of Commonly Advertised Food, the items showed moderate-to-high loading (0.52, 0.84, 0.42 and 0.42) with the Scree plot showing that there was only one factor. The Kaiser-Meyer-Olkin was 0.60, showing that the sample was adequate for factor analysis. The questionnaire on television food advertising is reliable and valid to assess the effect of media literacy education on television food advertising on schoolchildren. © The Author (2013). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Measuring occupational balance and its relationship to perceived stress and health: Mesurer l'équilibre occupationnel et sa relation avec le stress perçus et la santé.

PubMed

Yu, Yu; Manku, Mandeep; Backman, Catherine L

2018-04-01

There is an assumption that occupational balance is integrally related to health and well-being. This study aimed to investigate test-retest reliability of the English-translated Occupational Balance Questionnaire (OBQ), its relationship to measures of health (Short Form Health Survey-36 Version 2.0 [SF-36v2]) and stress (Perceived Stress Scale-10; PSS-10), and demographic differences in OBQ scores in Canadian adults. Test-retest reliability (2 weeks) was assessed using intraclass correlation (ICC) coefficients. Online surveys from 86 adults were analyzed using descriptive, correlational, and t test statistics. OBQ test-retest reliability was ICC = 0.74 (95% CI [0.34, 0.90]; p = .003) when excluding an influential case ( n = 20). OBQ correlations with PSS-10 were r = -.72; with SF-36v2 Mental Component Score, r = .65; and with Physical Component Score, r = .31; all p < .001. Age and gender had no impact on OBQ scores. Findings help elucidate relationships among health, stress, and occupational balance; however, further psychometric testing is warranted before using OBQ for clinical purposes.
Reliability of a device for the knee and ankle isometric and isokinetic strength testing in older adults

PubMed Central

Bergamin, Marco; Gobbo, Stefano; Bullo, Valentina; Vendramin, Barbara; Duregon, Federica; Frizziero, Antonio; Di Blasio, Andrea; Cugusi, Lucia; Zaccaria, Marco; Ermolao, Andrea

2017-01-01

Summary Background Lower extremity muscle mass, strength, power, and physical performance are critical determinants of independent functioning in later life. Isokinetic dynamometers are becoming very common in assessing different features of muscle strength, in both research and clinical practice; however, reliability studies are still needed to support the extended use of those devices. Objective The purpose of this study is to assess the test-retest reliability of knee and ankle isokinetic and isometric strength testing protocols in a sample of older healthy subjects, using a new and untested isokinetic multi-joint evaluation system. Methods Sixteen male and fourteen female older adults (mean age 65.2 ± 4.6 years) were assessed in two testing sessions. Each participant performed a randomized testing procedure that includes different isometric and isokinetic tests for knee and ankle joints. Results All participants concluded the trial safety and no subject reported any discomfort throughout the overall assessment. Coefficients of correlation between measures were calculated showing moderate to strong effects among all test-retest assessments and paired-sample t test showed only one significant difference (p<0.05) in the maximal isokinetic bilateral knee flexion torque. Conclusions The multi-joint evaluation system for the assessment of knee and ankle isokinetic and isometric strength provided reliable test-retest measures in healthy older adults. Level of evidence Ib. PMID:29264344
Development and validation of a German version of the joint protection behavior assessment in patients with rheumatoid arthritis.

PubMed

Niedermann, K; Forster, A; Hammond, A; Uebelhart, D; de Bie, R

2007-03-15

Joint protection (JP) is an important part of the treatment concept for patients with rheumatoid arthritis (RA). The Joint Protection Behavior Assessment short form (JPBA-S) assesses the use of hand JP methods by patients with RA while preparing a hot drink. The purpose of this study was to develop a German version of the JPBA-S (D-JPBA-S) and to test its validity and reliability. A manual was developed through consensus with 8 occupational therapist (OT) experts as the reference for assessing patients' JP behavior. Twenty-four patients with RA and 10 healthy individuals were videotaped while performing 10 tasks reflecting the activity of preparing instant coffee. Recordings were repeated after 3 months for test-retest analysis. One rater assessed all available patient recordings (n = 23, recorded twice) for test-retest reliability. The video recordings of 10 randomly selected patients and all healthy individuals were independently assessed for interrater reliability by 6 OTs who were explicitly asked to follow the manual. Rasch analysis was performed to test construct validity and transform ordinal raw data into interval data for reliability calculations. Nine of the 10 tasks fit the Rasch model. The D-JPBA-S, consisting of 9 valid tasks, had an intraclass correlation coefficient of 0.77 for interrater reliability and 0.71 for test-retest reliability. The D-JPBA-S provides a valid and reliable instrument for assessing JP behavior of patients with RA and can be used in German-speaking countries.
Reliability and validity of the test of incremental respiratory endurance measures of inspiratory muscle performance in COPD.

PubMed

Formiga, Magno F; Roach, Kathryn E; Vital, Isabel; Urdaneta, Gisel; Balestrini, Kira; Calderon-Candelario, Rafael A; Campos, Michael A; Cahalin, Lawrence P

2018-01-01

The Test of Incremental Respiratory Endurance (TIRE) provides a comprehensive assessment of inspiratory muscle performance by measuring maximal inspiratory pressure (MIP) over time. The integration of MIP over inspiratory duration (ID) provides the sustained maximal inspiratory pressure (SMIP). Evidence on the reliability and validity of these measurements in COPD is not currently available. Therefore, we assessed the reliability, responsiveness and construct validity of the TIRE measures of inspiratory muscle performance in subjects with COPD. Test-retest reliability, known-groups and convergent validity assessments were implemented simultaneously in 81 male subjects with mild to very severe COPD. TIRE measures were obtained using the portable PrO2 device, following standard guidelines. All TIRE measures were found to be highly reliable, with SMIP demonstrating the strongest test-retest reliability with a nearly perfect intraclass correlation coefficient (ICC) of 0.99, while MIP and ID clustered closely together behind SMIP with ICC values of about 0.97. Our findings also demonstrated known-groups validity of all TIRE measures, with SMIP and ID yielding larger effect sizes when compared to MIP in distinguishing between subjects of different COPD status. Finally, our analyses confirmed convergent validity for both SMIP and ID, but not MIP. The TIRE measures of MIP, SMIP and ID have excellent test-retest reliability and demonstrated known-groups validity in subjects with COPD. SMIP and ID also demonstrated evidence of moderate convergent validity and appear to be more stable measures in this patient population than the traditional MIP.
Reliability of the modified Gross Motor Function Measure-88 (GMFM-88) for children with both Spastic Cerebral Palsy and Cerebral Visual Impairment: A preliminary study.

PubMed

Salavati, M; Krijnen, W P; Rameckers, E A A; Looijestijn, P L; Maathuis, C G B; van der Schans, C P; Steenbergen, B

2015-01-01

The aims of this study were to adapt the Gross Motor Function Measure-88 (GMFM-88) for children with Cerebral Palsy (CP) and Cerebral Visual Impairment (CVI) and to determine the test-retest and interobserver reliability of the adapted version. Sixteen paediatric physical therapists familiar with CVI participated in the adaptation process. The Delphi method was used to gain consensus among a panel of experts. Seventy-seven children with CP and CVI (44 boys and 33 girls, aged between 50 and 144 months) participated in this study. To assess test-retest and interobserver reliability, the GMFM-88 was administered twice within three weeks (Mean=9 days, SD=6 days) by trained paediatric physical therapists, one of whom was familiar with the child and one who wasn't. Percentages of identical scores, Cronbach's alphas and intraclass correlation coefficients (ICC) were computed for each dimension level. All experts agreed on the proposed adaptations of the GMFM-88 for children with CP and CVI. Test-retest reliability ICCs for dimension scores were between 0.94 and 1.00, mean percentages of identical scores between 29 and 71, and interobserver reliability ICCs of the adapted GMFM-88 were 0.99-1.00 for dimension scores. Mean percentages of identical scores varied between 53 and 91. Test-retest and interobserver reliability of the GMFM-88-CVI for children with CP and CVI was excellent. Internal consistency of dimension scores lay between 0.97 and 1.00. The psychometric properties of the adapted GMFM-88 for children with CP and CVI are reliable and comparable to the original GMFM-88. Copyright © 2015 Elsevier Ltd. All rights reserved.
The test-retest reliability and minimal detectable change of spatial and temporal gait variability during usual over-ground walking for younger and older adults.

PubMed

Almarwani, Maha; Perera, Subashan; VanSwearingen, Jessie M; Sparto, Patrick J; Brach, Jennifer S

2016-02-01

Gait variability is a marker of gait performance and future mobility status in older adults. Reliability of gait variability has been examined mainly in community dwelling older adults who are likely to fluctuate over time. The purpose of this study was to compare test-retest reliability and determine minimal detectable change (MDC) of spatial and temporal gait variability in younger and older adults. Forty younger (mean age=26.6 ± 6.0 years) and 46 older adults (mean age=78.1 ± 6.2 years) were included in the study. Gait characteristics were measured twice, approximately 1 week apart, using a computerized walkway (GaitMat II). Participants completed 4 passes on the GaitMat II at their self-selected walking speed. Test-retest reliability was calculated using Intra-class correlation coefficients (ICCs(2,1)), 95% limits of agreement (95% LoA) in conjunction with Bland-Altman plots, relative limits of agreement (LoA%) and standard error of measurement (SEM). The MDC at 90% and 95% level were also calculated. ICCs of gait variability ranged 0.26-0.65 in younger and 0.28-0.74 in older adults. The LoA% and SEM were consistently higher (i.e. less reliable) for all gait variables in older compared to younger adults except SEM for step width. The MDC was consistently larger for all gait variables in older compared to younger adults except step width. ICCs were of limited utility due to restricted ranges in younger adults. Based on absolute reliability measures and MDC, younger had greater test-retest reliability and smaller MDC of spatial and temporal gait variability compared to older adults. Copyright © 2015 Elsevier B.V. All rights reserved.
The test-retest reliability and criterion validity of a high-intensity, netball-specific circuit test: The Net-Test.

PubMed

Mungovan, Sean F; Peralta, Paula J; Gass, Gregory C; Scanlan, Aaron T

2018-04-12

To examine the test-retest reliability and criterion validity of a high-intensity, netball-specific fitness test. Repeated measures, within-subject design. Eighteen female netball players competing in an international competition completed a trial of the Net-Test, which consists of 14 timed netball-specific movements. Players also completed a series of netball-relevant criterion fitness tests. Ten players completed an additional Net-Test trial one week later to assess test-retest reliability using intraclass correlation coefficient (ICC), typical error of measurement (TEM), and coefficient of variation (CV). The typical error of estimate expressed as CV and Pearson correlations were calculated between each criterion test and Net-Test performance to assess criterion validity. Five movements during the Net-Test displayed moderate ICC (0.84-0.90) and two movements displayed high ICC (0.91-0.93). Seven movements and heart rate taken during the Net-Test held low CV (<5%) with values ranging from 1.7 to 9.5% across measures. Total time (41.63±2.05s) during the Net-Test possessed low CV and significant (p<0.05) correlations with 10m sprint time (1.98±0.12s; CV=4.4%, r=0.72), 20m sprint time (3.38±0.19s; CV=3.9%, r=0.79), 505 Change-of-Direction time (2.47±0.08s; CV=2.0%, r=0.80); and maximum oxygen uptake (46.59±2.58 mLkg -1 min -1 ; CV=4.5%, r=-0.66). The Net-Test possesses acceptable reliability for the assessment of netball fitness. Further, the high criterion validity for the Net-Test suggests a range of important netball-specific fitness elements are assessed in combination. Copyright © 2018 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Reliability of a new test battery for fitness assessment of the European Astronaut corps.

PubMed

Petersen, Nora; Thieschäfer, Lutz; Ploutz-Snyder, Lori; Damann, Volker; Mester, Joachim

2015-01-01

To optimise health for space missions, European astronauts follow specific conditioning programs before, during and after their flights. To evaluate the effectiveness of these programs, the European Space Agency conducts an Astronaut Fitness Assessment (AFA), but the test-retest reliability of elements within it remains unexamined. The reliability study described here presents a scientific basis for implementing the AFA, but also highlights challenges faced by operational teams supporting humans in such unique environments, especially with respect to health and fitness monitoring of crew members travelling not only into space, but also across the world. The AFA tests assessed parameters known to be affected by prolonged exposure to microgravity: aerobic capacity (VO2max), muscular strength (one repetition max, 1 RM) and power (vertical jumps), core stability, flexibility and balance. Intraclass correlation coefficients (ICC3.1), standard error of measurement and coefficient of variation were used to assess relative and absolute test-retest reliability. Squat and bench 1 RM (ICC3.1 = 0.94-0.99), hip flexion (ICC3.1 = 0.99) and left and right handgrip strength (ICC3.1 = 0.95 and 0.97), showed the highest test-retest reliability, followed by VO2max (ICC3.1 = 0.91), core strength (ICC3.1 = 0.78-0.89), hip extension (ICC3.1 = 0.63), the countermeasure (ICC3.1 = 0.76) and squat (ICC3.1 = 0.63) jumps, and single right- and left-leg jump height (ICC3.1 = 0.51 and 0.14). For balance, relative reliability ranged from ICC3.1 = 0.78 for path length (two legs, head tilted back, eyes open) to ICC3.1 = 0.04 for average rotation velocity (one leg, eyes closed). In a small sample (n = 8) of young, healthy individuals, the AFA battery of tests demonstrated acceptable test-retest reliability for most parameters except some balance and single-leg jump tasks. These findings suggest that, for the application with astronauts, most AFA tests appear appropriate to be maintained in the test battery, but that some elements may be unreliable, and require either modification (duration, selection of task) or removal (single-leg jump, balance test on sphere) from the battery. The test battery is mobile and universally applicable for occupational and general fitness assessment by its comprehensive composition of tests covering many systems involved in whole body movement.

Reliability and responsiveness of the Self-Efficacy in Assessing, Training and Spotting wheelchair skills (SEATS) outcome measure.

PubMed

Rushton, Paula W; Smith, Emma M; Miller, William C; Kirby, R Lee; Daoust, Geneviève

2018-01-31

The aim of this study was to evaluate the internal consistency, test-retest reliability and responsiveness of the Self-Efficacy in Assessing, Training and Spotting manual wheelchair skills (SEATS-M) and Self-Efficacy in Assessing, Training and Spotting power wheelchair skills (SEATS-P). A 2-week test-retest design was used with a convenience sample of occupational and physical therapists who worked at a provincial rehabilitation centre (inpatient and outpatient services). Sixteen participants completed the SEATS-M and 18 participants completed the SEATS-P. For the SEATS-M assessment, training, spotting and documentation sections, Cronbach's alpha coefficients ranged from 0.90 to 0.97, the 2-week intraclass correlation coefficients (ICC 1,1 ) ranged from 0.81 to 0.95, the standard error of measurements (SEM) ranged from 5.06 to 8.70 and the smallest real differences (SRD) ranged from 6.24 to 8.18. For the SEATS-P assessment, training, spotting and documentation sections, Cronbach's alpha coefficients ranged from 0.83 to 0.92, the ICCs ranged from 0.72 to 0.86, the SEMs ranged from 4.54 to 8.91 and the SRDs ranged from 5.90 to 8.27. There is preliminary evidence that both the SEATS-M and the SEATS-P have high internal consistency, good test-retest reliability and support for responsiveness. These tools can be used in evaluating clinician self-efficacy with assessing, training, spotting and documenting wheelchair skills included on the Wheelchair Skills Test. Implications for Rehabilitation There is preliminary evidence that the SEATS-M and SEATS-P are reliable and responsive outcome measures that can be used to evaluate the self-efficacy of clinicians to administer the Wheelchair Skills Program. Measurement of clinicians' self-efficacy in this area of practice may enable an enhanced understanding of the areas in which clinicians lack self-efficacy, thereby informing the development of improved knowledge translation interventions.
Test-retest reliability and cross validation of the functioning everyday with a wheelchair instrument.

PubMed

Mills, Tamara L; Holm, Margo B; Schmeler, Mark

2007-01-01

The purpose of this study was to establish the test-retest reliability and content validity of an outcomes tool designed to measure the effectiveness of seating-mobility interventions on the functional performance of individuals who use wheelchairs or scooters as their primary seating-mobility device. The instrument, Functioning Everyday With a Wheelchair (FEW), is a questionnaire designed to measure perceived user function related to wheelchair/scooter use. Using consumer-generated items, FEW Beta Version 1.0 was developed and test-retest reliability was established. Cross-validation of FEW Beta Version 1.0 was then carried out with five samples of seating-mobility users to establish content validity. Based on the content validity study, FEW Version 2.0 was developed and administered to seating-mobility consumers to examine its test-retest reliability. FEW Beta Version 1.0 yielded an intraclass correlation coefficient (ICC) Model (3,k) of .92, p < .001, and the content validity results revealed that FEW Beta Version 1.0 captured 55% of seating-mobility goals reported by consumers across five samples. FEW Version 2.0 yielded ICC(3,k) = .86, p < .001, and captured 98.5% of consumers' seating-mobility goals. The cross-validation study identified new categories of seating-mobility goals for inclusion in FEW Version 2.0, and the content validity of FEW Version 2.0 was confirmed. FEW Beta Version 1.0 and FEW Version 2.0 were highly stable in their measurement of participants' seating-mobility goals over a 1-week interval.
Linguistic Validation and Cultural Adaptation of Bulgarian Version of Hospital Survey on Patient Safety Culture (HSOPSC).

PubMed

Stoyanova, Rumyana; Dimova, Rositsa; Tarnovska, Miglena; Boeva, Tatyana

2018-05-20

Patient safety (PS) is one of the essential elements of health care quality and a priority of healthcare systems in most countries. Thus the creation of validated instruments and the implementation of systems that measure patient safety are considered to be of great importance worldwide. The present paper aims to illustrate the process of linguistic validation, cross-cultural verification and adaptation of the Bulgarian version of the Hospital Survey on Patient Safety Culture (B-HSOPSC) and its test-retest reliability. The study design is cross-sectional. The HSOPSC questionnaire consists of 42 questions, grouped in 12 different subscales that measure patient safety culture. Internal con-sistency was assessed using Cronbach's alpha. The Wilcoxon signed-rank test and the split-half method were used; the Spear-man-Brown coefficient was calculated. The overall Cronbach's alpha for B-HSOPSC is 0.918. Subscales 7 Staffing and 12 Overall perceptions of safety had the lowest coefficients. The high reliability of the instrument was confirmed by the Split-half method (0.97) and ICC-coefficient (0.95). The lowest values of Spearmen-Broun coefficients were found in items A13 and A14. The study offers an analysis of the results of the linguistic validation of the B-HSOPSC and its test-retest reliability. The psychometric characteristics of the questions revealed good validity and reliability, except two questions. In the future, the instrument will be administered to the target population in the main study so that the psychometric properties of the instrument can be verified.
The Reliability and Validity of the Coopersmith Self-Esteem Inventory for a Sample of Filipino High School Girls.

ERIC Educational Resources Information Center

Watkins, David; Astilla, Estela

1980-01-01

Evidence is presented partially supporting the reliability and construct validity of the Coopersmith Self-Esteem Inventory with Filipino adolescent girls. A test-retest coefficient of 0.61 was found over a nine-month period. Self-esteem scores were significantly associated with IQ scores and teacher ratings of pupils' self-esteem. (Author/BW)
[Translation and validation in italian of the Moral Distress Scale for psychiatric nurses (MDS-P)].

PubMed

Canciani, Eleonora; Spotti, Daniela; Bonetti, Loris

2016-01-01

Moral distress (MD) is a painful feeling and/or psychological disequilibrium, which may lead to negative consequences into the wellness of a nurse's working life. Nurses who work in psychiatry are more likely to experience a different type of MD compared with nurses of other contexts. In Italy a tool to evaluate MD in nurses who work in psychiatry doesn't exist. The aim of this study is to validate the Moral Distress Scale for Psychiatric Nurses (MDS-P) in Italian language. For translation the forward and back-translation has been used; the effectiveness regarding content and face validity of the translated scale has been analyzed through a focus group with experts of the field. In order to check the reliability of the scale the test-retest method has been used, by means of the determination of Spearman's correlation coefficient, Intraclass Correlation Coefficient (ICC) and Cronbach's alpha. The forward and back-translation process was successful. During the focus group analysis, 8 items were added to the 15 items of the original scale, due to experts suggestions. 32 nurses took part in the test-retest phase. Spearman's correlation coefficient resulted to be 0,91, ICC > 0,9, Cronbach's alpha calculated on test and retest, was always >0,9. The Italian version of the MDS-P proves to be an effective, appropriate and reliable instrument to measure the MD phenomenon within the population of nurses who work in the psychia- tric field in Italy.
The Trojan Lifetime Champions Health Survey: Development, Validity, and Reliability

PubMed Central

Sorenson, Shawn C.; Romano, Russell; Scholefield, Robin M.; Schroeder, E. Todd; Azen, Stanley P.; Salem, George J.

2015-01-01

Context Self-report questionnaires are an important method of evaluating lifespan health, exercise, and health-related quality of life (HRQL) outcomes among elite, competitive athletes. Few instruments, however, have undergone formal characterization of their psychometric properties within this population. Objective To evaluate the validity and reliability of a novel health and exercise questionnaire, the Trojan Lifetime Champions (TLC) Health Survey. Design Descriptive laboratory study. Setting A large National Collegiate Athletic Association Division I university. Patients or Other Participants A total of 63 university alumni (age range, 24 to 84 years), including former varsity collegiate athletes and a control group of nonathletes. Intervention(s) Participants completed the TLC Health Survey twice at a mean interval of 23 days with randomization to the paper or electronic version of the instrument. Main Outcome Measure(s) Content validity, feasibility of administration, test-retest reliability, parallel-form reliability between paper and electronic forms, and estimates of systematic and typical error versus differences of clinical interest were assessed across a broad range of health, exercise, and HRQL measures. Results Correlation coefficients, including intraclass correlation coefficients (ICCs) for continuous variables and κ agreement statistics for ordinal variables, for test-retest reliability averaged 0.86, 0.90, 0.80, and 0.74 for HRQL, lifetime health, recent health, and exercise variables, respectively. Correlation coefficients, again ICCs and κ, for parallel-form reliability (ie, equivalence) between paper and electronic versions averaged 0.90, 0.85, 0.85, and 0.81 for HRQL, lifetime health, recent health, and exercise variables, respectively. Typical measurement error was less than the a priori thresholds of clinical interest, and we found minimal evidence of systematic test-retest error. We found strong evidence of content validity, convergent construct validity with the Short-Form 12 Version 2 HRQL instrument, and feasibility of administration in an elite, competitive athletic population. Conclusions These data suggest that the TLC Health Survey is a valid and reliable instrument for assessing lifetime and recent health, exercise, and HRQL, among elite competitive athletes. Generalizability of the instrument may be enhanced by additional, larger-scale studies in diverse populations. PMID:25611315
Test–retest reliability, validity, and minimum detectable change of visual analog, numerical rating, and verbal rating scales for measurement of osteoarthritic knee pain

PubMed Central

Alghadir, Ahmad H; Anwer, Shahnawaz; Iqbal, Amir; Iqbal, Zaheen Ahmed

2018-01-01

Objective Several scales are commonly used for assessing pain intensity. Among them, the numerical rating scale (NRS), visual analog scale (VAS), and verbal rating scale (VRS) are often used in clinical practice. However, no study has performed psychometric analyses of their reliability and validity in the measurement of osteoarthritic (OA) pain. Therefore, the present study examined the test–retest reliability, validity, and minimum detectable change (MDC) of the VAS, NRS, and VRS for the measurement of OA knee pain. In addition, the correlations of VAS, NRS, and VRS with demographic variables were evaluated. Methods The study included 121 subjects (65 women, 56 men; aged 40–80 years) with OA of the knee. Test–retest reliability of the VAS, NRS, and VRS was assessed during two consecutive visits in a 24 h interval. The validity was tested using Pearson’s correlation coefficients between the baseline scores of VAS, NRS, and VRS and the demographic variables (age, body mass index [BMI], sex, and OA grade). The standard error of measurement (SEM) and the MDC were calculated to assess statistically meaningful changes. Results The intraclass correlation coefficients of the VAS, NRS, and VRS were 0.97, 0.95, and 0.93, respectively. VAS, NRS, and VRS were significantly related to demographic variables (age, BMI, sex, and OA grade). The SEM of VAS, NRS, and VRS was 0.03, 0.48, and 0.21, respectively. The MDC of VAS, NRS, and VRS was 0.08, 1.33, and 0.58, respectively. Conclusion All the three scales had excellent test–retest reliability. However, the VAS was the most reliable, with the smallest errors in the measurement of OA knee pain. PMID:29731662
Validity and reliability of a Nigerian-Yoruba version of the stroke-specific quality of life scale 2.0.

PubMed

Odetunde, Marufat Oluyemisi; Akinpelu, Aderonke Omobonike; Odole, Adesola Christiana

2017-10-19

Psychometric evidence is necessary to establish scientific integrity and clinical usefulness of translations and cultural adaptations of the Stroke-Specific Quality of Life (SS-QoL) scale. However, the limited evidence on psychometrics of Yoruba version of SS-QoL 2.0 (SS-QoL(Y)) is a significant shortcoming. This study assessed the test-retest reliability, internal consistency, convergent, divergent, discriminant and known-group validity of the SS-QoL(Y). Yoruba version of the WHOQoL-BREF was used to test the convergent and divergent validity of the SS-QoL(Y) among 100 consenting stroke survivors. The WHOQoL-BREF and SS-QoL(Y) was administered randomly in order to eliminate bias. The test-retest reliability of the SS-QoL(Y) was carried out among 68 of the respondents within an interval of 7 days. All respondents were purposively recruited from selected secondary and tertiary health facilities in South-west Nigeria. Data were analysed using descriptive statistics of mean and standard deviation, and inferential statistics of Spearman correlation, Cronbach's alpha, Intra-class Correlation Coefficient (ICC), Independent t-test and One-way ANOVA. Alpha level was set at p < 0.05. The physical health, psychological health, social relationship and environment domains on WHOQoL-BREF with correlation coefficient that ranged from 0.214 to 0.360 showed significant correlation with similar domains on SS-QoL(Y). Dissimilar domains between the two scales had r values from 0.035 to 0.366. Discriminant validity of SS-QoL(Y) showed that items' r value ranged from 0.711 to 0.920 with their hypothesized domains. The scale demonstrated moderate to strong test-retest reliability with Intra-class correlation coefficient (ICC) for the domains and overall scores (r = 0.47 to 0.81) and moderate to high internal consistency (Cronbach's alpha =0.61 to 0.82) for domains scores. These correlations were also significant for the domains and overall scores (p < 0.05). There were no significant differences across different age groups or gender for the domains or overall scores of SS-QoL(Y). Discriminant and known-group validity, test-retest reliability and internal consistency of the Yoruba version of the Stroke Specific Quality of Life 2.0 are adequate while the convergent and divergent validity are low but acceptable. The SS-QoL(Y) is recommended for assessing health-related quality of life among Yoruba stroke survivors.
Cross-cultural Adaption and Validation of the Danish Voice Handicap Index.

PubMed

Sorensen, Jesper Roed; Printz, Trine; Mehlum, Camilla Slot; Heidemann, Christian Hamilton; Groentved, Aagot Moeller; Godballe, Christian

2018-02-02

We aimed to assess psychometric properties, including internal consistency, reliability, and clinical validity of the Danish version of the Voice Handicap Index (VHI). A cross-sectional survey study was carried out. For validation, the existing nonvalidated Danish version of the VHI was used. Data from 208 patients with voice disorders of different etiology (neurogenic, functional, and structural) and a control group of 85 vocally healthy individuals were included. A test-retest reliability analysis of 42 patients and 45 control persons was performed. The internal consistency, test-retest reliability, and clinical validity of the questionnaire were assessed. Internal consistency was high with a Cronbach α >0.90 for both the patient and control group. Test-retest reliability measured as intraclass correlation coefficient was good with 0.93 (95% confidence interval [95% confidence interval]: 0.87-0.96) for patients and 0.78 (95% confidence interval: 0.63-0.87) for the control group which indicates sufficient reliability of the questionnaire. The Danish VHI has good clinical validity as it has a strong correlation between patient's perception of the severity of their voice disorder and the VHI score from the Spearman correlation of 0.69. The existing Danish version of the VHI has been thoroughly validated and found to be in line with the original VHI from Jacobsen et al. It showed good internal consistency, test-retest reliability, and clinical validity. It is suitable for use in daily practice and in research projects as it is able to assess patients' perception of their voice disorder severity. Copyright © 2018 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Functional gait assessment and balance evaluation system test: reliability, validity, sensitivity, and specificity for identifying individuals with Parkinson disease who fall.

PubMed

Leddy, Abigail L; Crowner, Beth E; Earhart, Gammon M

2011-01-01

Gait impairments, balance impairments, and falls are prevalent in individuals with Parkinson disease (PD). Although the Berg Balance Scale (BBS) can be considered the reference standard for the determination of fall risk, it has a noted ceiling effect. Development of ceiling-free measures that can assess balance and are good at discriminating "fallers" from "nonfallers" is needed. The purpose of this study was to compare the Functional Gait Assessment (FGA) and the Balance Evaluation Systems Test (BESTest) with the BBS among individuals with PD and evaluate the tests' reliability, validity, and discriminatory sensitivity and specificity for fallers versus nonfallers. This was an observational study of community-dwelling individuals with idiopathic PD. The BBS, FGA, and BESTest were administered to 80 individuals with PD. Interrater reliability (n=15) was assessed by 3 raters. Test-retest reliability was based on 2 tests of participants (n=24), 2 weeks apart. Intraclass correlation coefficients (2,1) were used to calculate reliability, and Spearman correlation coefficients were used to assess validity. Cutoff points, sensitivity, and specificity were based on receiver operating characteristic plots. Test-retest reliability was .80 for the BBS, .91 for the FGA, and .88 for the BESTest. Interrater reliability was greater than .93 for all 3 tests. The FGA and BESTest were correlated with the BBS (r=.78 and r=.87, respectively). Cutoff scores to identify fallers were 47/56 for the BBS, 15/30 for the FGA, and 69% for the BESTest. The overall accuracy (area under the curve) for the BBS, FGA, and BESTest was .79, .80, and .85, respectively. Fall reports were retrospective. Both the FGA and the BESTest have reliability and validity for assessing balance in individuals with PD. The BESTest is most sensitive for identifying fallers.
Test-retest reliability of infant event related potentials evoked by faces.

PubMed

Munsters, N M; van Ravenswaaij, H; van den Boomen, C; Kemner, C

2017-04-05

Reliable measures are required to draw meaningful conclusions regarding developmental changes in longitudinal studies. Little is known, however, about the test-retest reliability of face-sensitive event related potentials (ERPs), a frequently used neural measure in infants. The aim of the current study is to investigate the test-retest reliability of ERPs typically evoked by faces in 9-10 month-old infants. The infants (N=31) were presented with neutral, fearful and happy faces that contained only the lower or higher spatial frequency information. They were tested twice within two weeks. The present results show that the test-retest reliability of the face-sensitive ERP components is moderate (P400 and Nc) to substantial (N290). However, there is low test-retest reliability for the effects of the specific experimental manipulations (i.e. emotion and spatial frequency) on the face-sensitive ERPs. To conclude, in infants the face-sensitive ERP components (i.e. N290, P400 and Nc) show adequate test-retest reliability, but not the effects of emotion and spatial frequency on these ERP components. We propose that further research focuses on investigating elements that might increase the test-retest reliability, as adequate test-retest reliability is necessary to draw meaningful conclusions on individual developmental trajectories of the face-sensitive ERPs in infants. Copyright © 2017 The Authors. Published by Elsevier Ltd.. All rights reserved.
Reliability of the modified Paediatric Evaluation of Disability Inventory, Dutch version (PEDI-NL) for children with cerebral palsy and cerebral visual impairment.

PubMed

Salavati, M; Waninge, A; Rameckers, E A A; de Blécourt, A C E; Krijnen, W P; Steenbergen, B; van der Schans, C P

2015-02-01

The aims of this study were to adapt the Paediatric Evaluation of Disability Inventory, Dutch version (PEDI-NL) for children with cerebral visual impairment (CVI) and cerebral palsy (CP) and determine test-retest and inter-respondent reliability. The Delphi method was used to gain consensus among twenty-one health experts familiar with CVI. Test-retest and inter-respondent reliability were assessed for parents and caregivers of 75 children (aged 50-144 months) with CP and CVI. The percentage identical scores of item scores were computed, as well as the interclass coefficients (ICC) and Cronbach's alphas of scale scores over the domains self-care, mobility, and social function. All experts agreed on the adaptation of the PEDI-NL for children with CVI. On item score, for the Functional Skills scale, mean percentage identical scores variations for test-retest reliability were 73-79 with Caregiver Assistance scale 73-81, and for inter-respondent reliability 21-76 with Caregiver Assistance scale 40-43. For all scales over all domains ICCs exceeded 0.87. For the domains self-care, mobility, and social function, the Functional Skills scale and the Caregiver Assistance scale have Cronbach's alpha above 0.88. The adapted PEDI-NL for children with CP and CVI is reliable and comparable to the original PEDI-NL. Copyright © 2014 Elsevier Ltd. All rights reserved.
Psychometric Characteristics of the Modified World Affairs Questionnaire.

ERIC Educational Resources Information Center

Mayton, Daniel M., II

1988-01-01

Subjected Modified World Affairs Questionnaire (MWAQ) to comparable common factor analysis which identified five factors: civil defense, escalation, nuclear war outcome, probability/worry, and patriotic. Alpha coefficients and test-retest reliability were determined to be adequate for the first four subscales. Acceptable discriminant validity and…
Investigation of four self-report instruments (FABT, TSK-HC, Back-PAQ, HC-PAIRS) to measure healthcare practitioners' attitudes and beliefs toward low back pain: Reliability, convergent validity and survey of New Zealand osteopaths and manipulative physiotherapists.

PubMed

Moran, Robert W; Rushworth, Wendy M; Mason, Jesse

2017-12-01

Healthcare practitioner beliefs influence advice and management provided to patients with back pain. Several instruments measuring practitioner beliefs have been developed but psychometric properties for some have not been investigated. To investigate internal consistency, test-retest reliability and convergent validity of the Fear Avoidance Beliefs Tool (FABT), the Tampa Scale of Kinesiophobia for Health Care Providers (TSK-HC), the Back Pain Attitudes Questionnaire (Back-PAQ), and the Health Care Pain and Impairment Relationship Scale (HC-PAIRS). A secondary aim was to explore beliefs of New Zealand osteopaths and physiotherapists regarding low back pain. FABT, TSK-HC, Back-PAQ, and HC-PAIRS were administered twice, 14 days apart. Data from 91 osteopaths and 35 physiotherapists were analysed. The FABT, TSK-HC and Back-PAQ each demonstrated excellent internal consistency, (Cronbach's α = 0.92, 0.91, and 0.91 respectively), and excellent test-retest reliability (lower limit of 95% CI for intraclass correlation coefficient >0.75). Correlations between instruments (Pearson's r = 0.51 to 0.77, p < 0.001) demonstrated good convergent validity. There was a medium to large effect (Cohen's d > 0.47) for mean differences in scores, for all instruments, between professions. This study found excellent internal consistency, test-retest reliability and good convergent validity for the FABT, TSK-HC, and Back-PAQ. Previously reported internal consistency, test-retest and convergent validity of the HC-PAIRS were confirmed, and test-retest reliability was excellent. There were significant scoring differences on each instrument between professions, and while both groups demonstrated fear avoidant beliefs, physiotherapist respondent scores indicated that as a group, they held fewer fear-avoidant beliefs than osteopath respondents. Copyright © 2017 Elsevier Ltd. All rights reserved.
Reliability and Validity of Food Frequency Questions to Assess Beverage and Food Group Intakes among Low-Income 2- to 4-Year-Old Children.

PubMed

Koleilat, Maria; Whaley, Shannon E

2016-06-01

Fruits, vegetables, sweetened foods, and beverages have been found to have positive and negative associations with obesity in early childhood, yet no rapid assessment tools are available to measure intake of these foods among preschoolers. This study examines the test-retest reliability and validity of a 10-item Child Food and Beverage Intake Questionnaire designed to assess fruits, vegetables, and sweetened foods and beverages intake among 2- to 4-year-old children. The Child Food and Beverage Intake Questionnaire was developed for use in periodic phone surveys conducted with low-income families with preschool-aged children. Seventy primary caregivers of 2- to 4-year-old children completed two Child Food and Beverage Intake Questionnaires within a 2-week period for test-retest reliability. Participants also completed three 24-hour recalls to allow assessment of validity. Intraclass correlations were used to examine test-retest reliability. Spearman rank correlation coefficients, Bland-Altman plots, and linear regression analyses were used to examine validity of the Child Food and Beverage Intake Questionnaire compared with three 24-hour recalls. Intraclass correlations between Child Food and Beverage Intake Questionnaire administrations ranged from 0.48 for sweetened drinks to 0.87 for regular sodas. Intraclass correlations for fruits, vegetables, and sweetened food were 0.56, 0.49, and 0.56, respectively. Spearman rank correlation coefficients ranged from 0.15 to 0.59 for beverages, with 0.46 for sugar-sweetened beverages. Spearman rank correlation coefficients for fruits, vegetables, and sweetened food were 0.30, 0.33, and 0.30, respectively. Although observation of the Bland-Altman plots and linear regression analyses showed a slight upward trend in mean differences, with increasing mean intake for five beverage groups, at least 90% of data plots fell within the limits of agreement for all food/beverage groups. The Child Food and Beverage Intake Questionnaire exhibited fair to substantial test-retest reliability and moderate to strong validity in ranking fruits, vegetables, sweetened food, and the majority of beverages consumed by children aged 2 to 4 years old. Although the Child Food and Beverage Intake Questionnaire might not be able to assess the absolute intake of foods and beverages, given the scarcity of an easily administered, valid, and reliable questionnaire to assess nutritional intake among 2- to 4-year-old low-income children, this tool is a useful means for measuring trends in dietary intake among low-income preschoolers. Copyright © 2016 Academy of Nutrition and Dietetics. Published by Elsevier Inc. All rights reserved.
A Psychometric Study of the Bayley Scales of Infant and Toddler Development in Persian Language Children.

PubMed

Azari, Nadia; Soleimani, Farin; Vameghi, Roshanak; Sajedi, Firoozeh; Shahshahani, Soheila; Karimi, Hossein; Kraskian, Adis; Shahrokhi, Amin; Teymouri, Robab; Gharib, Masoud

2017-01-01

Bayley Scales of infant & toddler development is a well-known diagnostic developmental assessment tool for children aged 1-42 months. Our aim was investigating the validity & reliability of this scale in Persian speaking children. The method was descriptive-analytic. Translation- back translation and cultural adaptation was done. Content & face validity of translated scale was determined by experts' opinions. Overall, 403 children aged 1 to 42 months were recruited from health centers of Tehran, during years of 2013-2014 for developmental assessment in cognitive, communicative (receptive & expressive) and motor (fine & gross) domains. Reliability of scale was calculated through three methods; internal consistency using Cronbach's alpha coefficient, test-retest and interrater methods. Construct validity was calculated using factor analysis and comparison of the mean scores methods. Cultural and linguistic changes were made in items of all domains especially on communication subscale. Content and face validity of the test were approved by experts' opinions. Cronbach's alpha coefficient was above 0.74 in all domains. Pearson correlation coefficient in various domains, were ≥ 0.982 in test retest method, and ≥0.993 in inter-rater method. Construct validity of the test was approved by factor analysis. Moreover, the mean scores for the different age groups were compared and statistically significant differences were observed between mean scores of different age groups, that confirms validity of the test. The Bayley Scales of Infant and Toddler Development is a valid and reliable tool for child developmental assessment in Persian language children.
Interrater and test-retest reliability and validity of the Norwegian version of the BESTest and mini-BESTest in people with increased risk of falling.

PubMed

Hamre, Charlotta; Botolfsen, Pernille; Tangen, Gro Gujord; Helbostad, Jorunn L

2017-04-20

The Balance Evaluation Systems Test (BESTest) was developed to assess underlying systems for balance control in order to be able to individually tailor rehabilitation interventions to people with balance disorders. A short form, the Mini-BESTest, was developed as a screening test. The study aimed to assess interrater and test-retest reliability of the Norwegian version of the BESTest and the Mini-BESTest in community-dwelling people with increased risk of falling and to assess concurrent validity with the Fall Efficacy Scale-International (FES-I), and it was an observational study with a cross-sectional design. Forty-two persons with increased risk of falling (elderly over 65 years of age, persons with a history of stroke or Multiple Sclerosis) were assessed twice by two raters. Relative reliability was analysed with Intraclass Correlation Coefficient (ICC), and absolute reliability with standard error of measurement (SEM) and smallest detectable change (SDC). Concurrent validity was assessed against the FES-I using Spearman's rho. The BESTest showed very good interrater reliability (ICC = 0.98, SEM = 1.79, SDC 95 = 5.0) and test-retest reliability (rater A/rater B = ICC = 0.89/0.89, SEM = 3.9/4.3, SDC 95 = 10.8/11.8). The Mini-BESTest also showed very good interrater reliability (ICC = 0.95, SEM = 1.19, SDC 95 = 3.3) and test-retest reliability (rater A/rater B = ICC = 0.85/0.84, SEM = 1.8/1.9, SDC 95 = 4.9/5.2). The correlations were moderate between the FES-I and both the BESTest and the Mini-BESTest (Spearman's rho -0.51 and-0.50, p < 0.01). The BESTest and its short form, the Mini-BESTest, showed very good interrater and test-retest reliability when assessed in a heterogeneous sample of people with increased risk of falling. The concurrent validity measured against the FES-I showed moderate correlation. The results are comparable with earlier studies and indicate that the Norwegian versions can be used in daily clinic and in research.
Reliability and validity of a Turkish version of the Global Pelvic Floor Bother Questionnaire.

PubMed

Doğan, Hanife; Özengin, Nuriye; Bakar, Yeşim; Duran, Bülent

2016-10-01

The aim of this study was to translate the Global Pelvic Floor Bother Questionnaire (GPFBQ) into Turkish and to assess its validity and reliability. The Turkish adaptation of the GPFBQ was created by following the stages of the intercultural adaptation process. A test-retest interval of 1 week was used to assess the reliability, which was examined by the intraclass correlation coefficient. The validity of the GPFBQ was assessed and compared with the Pelvic Floor Distress Inventory-20 (PFDI-20) and the Pelvic Floor Impact Questionnaire-7 (PFIQ-7) using Spearman's rank correlation coefficients. For construct validity, confirmatory factor analysis was performed. A total of 131 women, whose mean age was 46.83 years, were included in the study. The test-retest reliability of the GPFBQ was excellent (0.998, p < 0.0001). The GPFBQ correlated significantly with the PFDI-20 (r = 0.860, p = 0.00) and PFIQ-7 (r = 0.802, p = 0.00). Confirmatory factor analysis was performed to determine construct validity, and it was found that it had four dimensions. The Turkish version of the GPFBQ is a valid and reliable tool for assessing the symptoms of bother and severity in Turkish-speaking women with pelvic floor dysfunction.
Validation of the MISSCARE-BRASIL survey - A tool to assess missed nursing care.

PubMed

Siqueira, Lillian Dias Castilho; Caliri, Maria Helena Larcher; Haas, Vanderlei José; Kalisch, Beatrice; Dantas, Rosana Aparecida Spadoti

2017-12-21

to analyze the metric validity and reliability properties of the MISSCARE-BRASIL survey. methodological research conducted by assessing construct validity and reliability via confirmatory factor analysis, known-groups validation, convergent construct validation, analysis of internal consistency and test-retest reliability. The sample consisted of 330 nursing professionals, of whom 86 participated in the retest phase. of the 330 participants, 39.7% were aides, 33% technicians, 20.9% nurses, and 6.4% nurses with administrative roles. Confirmatory factorial analysis demonstrated that the Brazilian Portuguese version of the instrument is adequately adjusted to the dimensional structure the scale authors originally proposed. The correlation between "satisfaction with position/role" and "satisfaction with teamwork" and the survey's missed care variables was moderate (Spearman's coefficient =0.35; p<0.001). The results of the Student's t-test indicated known-group validity. Professionals from closed units reported lower levels of missed care in comparison with the other units. The reliability showed a strong correlation, with the exception of "institutional management/leadership style" (intraclass correlation coefficient (ICC)=0.15; p=0.04). The internal consistency was adequate (Cronbach's alpha was greater than 0.70). the MISSCARE-BRASIL was valid and reliable in the group studied. The application of the MISSCARE-BRASIL can contribute to identifying solutions for missed nursing care.
Reliability of the penetration aspiration scale with flexible endoscopic evaluation of swallowing.

PubMed

Butler, Susan G; Markley, Lisa; Sanders, Brian; Stuart, Andrew

2015-06-01

The Penetration Aspiration Scale (PAS), although designed for videofluoroscopy, has been utilized with flexible endoscopic evaluation of swallowing (FEES) in both research and clinical practice. The purpose of this investigation was to determine inter- and intrarater reliability of the PAS with FEES as a function of clinician FEES experience and retest interval. Three groups of 3 clinicians (N=9) with varying FEES experience (beginning, intermediate, and advanced) assigned PAS scores to 35 swallows. Initial ratings were repeated following short-term (ie, 1 day) and long-term (ie, 1 week) retest intervals. Intraclass correlation coefficients were calculated to assess interrater reliability on the first rating for each group. The coefficients were .91, .82, and .89 for the beginning, intermediate, and advanced clinicians, respectively. Overall interrater reliability across all 9 clinicians, irrespective of experience, was .85. Intraclass correlation coefficients were also calculated to assess intrarater reliability. The intrarater reliability for short- and long-term ratings was .90, .94, and .96 and .96, .97, and .94 for the beginning, intermediate, and advanced clinicians, respectively. Overall intrarater reliability across all 9 clinicians and all 3 ratings was .94. Excellent inter- and intrarater reliability was evidenced with the application of the PAS for FEES regardless of clinician experience and retest interval. © The Author(s) 2015.

Cross-cultural translation, validity, and reliability of the French version of the Neurophysiology of Pain Questionnaire.

PubMed

Demoulin, Christophe; Brasseur, Pauline; Roussel, Nathalie; Brereton, Clara; Humblet, Fabienne; Flynn, Daniel; Van Beveren, Julien; Osinsky, Thomas; Donneau, Anne-Françoise; Crielaard, Jean-Michel; Vanderthommen, Marc; Bruyère, Olivier

2017-11-01

Pain physiology education is an important component in the management of patients with chronic musculoskeletal pain. The Neurophysiology of Pain Questionnaire (NPQ) was developed in English to assess pain physiology knowledge in patients. This study aimed to translate the NPQ into French (NPQ-Fr) and to investigate the main psychometric properties of the NPQ-Fr. The translation was performed using the best practice translation guidelines. One hundred and one French-speaking patients with chronic non-specific spinal pain completed the NPQ-Fr to assess its acceptability and presence of floor/ceiling effects and test its dimensionality. The construct validity was tested by comparing the patients' NPQ-Fr scores to those of 17 physiotherapists and investigating its correlation with subscales of the Short Form-36 questionnaire. The reliability (i.e., internal consistency and test-retest reliability) was also investigated. To test the test-retest reliability, 70 patients were asked to complete the NPQ-Fr twice with one week in between. Regarding the NPQ-Fr psychometric properties: 1) acceptability was good; 2) internal consistency reached a Cronbach α-coefficient of 0.44; 3) no floor and ceiling effects were observed in patients; 4) a principal factor analysis generated three major factors; 5) construct validity was good; and 6) reliability was acceptable (intraclass correlation coefficient = 0.644; standard error of measurement = 1.5). The NPQ-Fr has satisfactory basic psychometric properties in patients with chronic spinal pain.
Reliability and validity of the Chinese version of the autoimmune bullous disease quality of life (ABQOL) questionnaire.

PubMed

Yang, Baoqi; Chen, Guo; Yang, Qing; Yan, Xiaoxiao; Zhang, Zhaoxia; Murrell, Dédée F; Zhang, Furen

2017-02-02

The autoimmune bullous diseases quality of life (ABQOL) questionnaire was recently developed by an Australian group and has been validated in Australian and North American patient cohorts. It is a 17-item, multidimensional, self-administered English questionnaire. The study aimed to validate the Chinese version of the ABQOL questionnaire and evaluate the reliability in Chinese patients. The Chinese version of the ABQOL questionnaire was produced by forward-backward translation and cross-cultural adaptation of the original English version. The ABQOL questionnaire was then distributed to a total of 101 patients with autoimmune bullous diseases (AIBDs) together with the Dermatology Life Quality Index (DLQI) and the 36-item Short Form Health Survey (SF-36). Validity was analyzed across a range of indices and reliability was assessed using internal consistency and test-retest methods. The Chinese version of the ABQOL questionnaire has a high internal consistency (Cronbach's alpha coefficient, 0.88) and test-retest reliability (the intraclass correlation coefficient, 0.87). Face and content validity were satisfactory. Convergent validity testing showed that the correlation coefficients for the ABQOL and DLQI was 0.77 and for the ABQOL and SF-36 was -0.62. In terms of discriminant validity, there was no significant difference between the proportions of insensitive items in ABQOL and DLQI (p = 0.236). There was no significant difference between the proportions of insensitive items in ABQOL and SF-36 (p = 0.823). The Chinese version of the ABQOL questionnaire has adequate validity and reliability. It may constitute a useful instrument to measure disease burden in Chinese patients with AIBDs.
Validity and reliability of a new tool to evaluate handwriting difficulties in Parkinson's disease.

PubMed

Nackaerts, Evelien; Heremans, Elke; Smits-Engelsman, Bouwien C M; Broeder, Sanne; Vandenberghe, Wim; Bergmans, Bruno; Nieuwboer, Alice

2017-01-01

Handwriting in Parkinson's disease (PD) features specific abnormalities which are difficult to assess in clinical practice since no specific tool for evaluation of spontaneous movement is currently available. This study aims to validate the 'Systematic Screening of Handwriting Difficulties' (SOS-test) in patients with PD. Handwriting performance of 87 patients and 26 healthy age-matched controls was examined using the SOS-test. Sixty-seven patients were tested a second time within a period of one month. Participants were asked to copy as much as possible of a text within 5 minutes with the instruction to write as neatly and quickly as in daily life. Writing speed (letters in 5 minutes), size (mm) and quality of handwriting were compared. Correlation analysis was performed between SOS outcomes and other fine motor skill measurements and disease characteristics. Intrarater, interrater and test-retest reliability were assessed using the intraclass correlation coefficient (ICC) and Spearman correlation coefficient. Patients with PD had a smaller (p = 0.043) and slower (p<0.001) handwriting and showed worse writing quality (p = 0.031) compared to controls. The outcomes of the SOS-test significantly correlated with fine motor skill performance and disease duration and severity. Furthermore, the test showed excellent intrarater, interrater and test-retest reliability (ICC > 0.769 for both groups). The SOS-test is a short and effective tool to detect handwriting problems in PD with excellent reliability. It can therefore be recommended as a clinical instrument for standardized screening of handwriting deficits in PD.
Reliability of the detailed assessment of speed of handwriting on Flemish children.

PubMed

Simons, Johan; Probst, Michel

2014-01-01

This study evaluates the reliability of the Detailed Assessment of Speed of Handwriting (DASH) in a Dutch-speaking sample of children. The sample included 650 boys and 513 girls (age range = 9-16 years). Handwriting speed measurements were obtained using the DASH. Interrater agreement, test-retest reliability, and internal consistency were calculated; gender and age effects were analyzed. Interrater agreement shows excellent reliability with intraclass correlation coefficients of at least 0.94. Test-retest correlations ranged from r = 0.65 to r = 0.81. The internal consistency measures, calculated with Cronbach's alpha, were between 0.88 and 0.94. Both gender and age have a significant effect on handwriting speed, with F (7.1144) = 17.43 (P < .001) for gender and F (7.1144) = 21.8 (P < .001) for age. The DASH is a reliable assessment tool to evaluate handwriting speed of Dutch-speaking children. There is a tendency of girls to write faster than boys.
Development and validation of a beverage and snack questionnaire for use in evaluation of school nutrition policies.

PubMed

Neuhouser, Marian L; Lilley, Sonya; Lund, Anne; Johnson, Donna B

2009-09-01

School nutrition policies limiting access to sweetened beverages, candy, and salty snacks have the potential to improve the health of children. To effectively evaluate policy success, appropriate and validated dietary assessment instruments are needed. The objective of this study was to develop and validate a beverage and snack questionnaire suitable for use among young adolescents. A new 19-item Beverage and Snack Questionnaire (BSQ) was administered to middle school students on two occasions, 2 weeks apart, to measure test-retest reliability. The questionnaire inquired about frequency of consumption, both at school and away from school, of soft drinks, salty snacks, sweets, milk, and fruits and vegetables. Students also completed 4-day food records. To assess validity, food-record data were compared with BSQ data. Forty-six students of diverse backgrounds from metropolitan Seattle, WA, participated in this study. Participants answered the BSQ during class time and completed the food record at home. Pearson correlation coefficients assessed test-retest reliability and validity. Using frequency per week data, the test-retest reliability coefficients were r=0.85 for fruits and vegetables consumed at school and r=0.74 and r=0.72 for beverages and sweets/snacks, respectively, consumed at school. Correlations ranged from r=0.73 to 0.77 for foods consumed outside of school. Compared with the criterion food record, validity coefficients were very good: r=0.69 to 0.71 for foods consumed at school and r=0.63 to 0.70 for foods consumed away from school. The validity coefficients for the 19 individual food items ranged from r=0.56 to 0.87. This easy-to-administer 19-item questionnaire captures data on sugar-sweetened beverages, salty snacks, sweets, milk, and fruit and vegetables as well as a more lengthy and expensive food record does. The BSQ can be used by nutrition researchers and practitioners to accurately evaluate student consumption of foods that are the focus of school nutrition policies.
Periorbital Biometric Measurements using ImageJ Software: Standardisation of Technique and Assessment Of Intra- and Interobserver Variability

PubMed Central

Rajyalakshmi, R.; Prakash, Winston D.; Ali, Mohammad Javed; Naik, Milind N.

2017-01-01

Purpose: To assess the reliability and repeatability of periorbital biometric measurements using ImageJ software and to assess if the horizontal visible iris diameter (HVID) serves as a reliable scale for facial measurements. Methods: This study was a prospective, single-blind, comparative study. Two clinicians performed 12 periorbital measurements on 100 standardised face photographs. Each individual’s HVID was determined by Orbscan IIz and used as a scale for measurements using ImageJ software. All measurements were repeated using the ‘average’ HVID of the study population as a measurement scale. Intraclass correlation coefficient (ICC) and Pearson product-moment coefficient were used as statistical tests to analyse the data. Results: The range of ICC for intra- and interobserver variability was 0.79–0.99 and 0.86–0.99, respectively. Test-retest reliability ranged from 0.66–1.0 to 0.77–0.98, respectively. When average HVID of the study population was used as scale, ICC ranged from 0.83 to 0.99, and the test-retest reliability ranged from 0.83 to 0.96 and the measurements correlated well with recordings done with individual Orbscan HVID measurements. Conclusion: Periorbital biometric measurements using ImageJ software are reproducible and repeatable. Average HVID of the population as measured by Orbscan is a reliable scale for facial measurements. PMID:29403183
Measurement of acute nonspecific low back pain perception in primary care physical therapy: reliability and validity of the brief illness perception questionnaire.

PubMed

Hallegraeff, Joannes M; van der Schans, Cees P; Krijnen, Wim P; de Greef, Mathieu H G

2013-02-01

The eight-item Brief Illness Perception Questionnaire is used as a screening instrument in physical therapy to assess mental defeat in patients with acute low back pain, besides patient perception might determine the course and risk for chronic low back pain. However, the psychometric properties of the Brief Illness Perception Questionnaire in common musculoskeletal disorders like acute low back pain have not been adequately studied. Patients' perceptions vary across different populations and affect coping styles. Thus, our aim was to determine the internal consistency, test-retest reliability and validity of the Dutch language version of the Brief Illness Perception Questionnaire in acute non-specific low back pain patients in primary care physical therapy. A non-experimental cross-sectional study with two measurements was performed. Eighty-four acute low back pain patients, in multidisciplinary health care center in Dutch primary care with a sample mean (SD) age of 42 (12) years, participated in the study. Internal consistency (Cronbach's α) and test-retest procedures (Intraclass Correlation Coefficients and limits of agreement) were evaluated at a one-week interval. The concurrent validity of the Brief Illness Perception Questionnaire was examined by using the Mental Health Component of the Short Form 36 Health Survey. The Cronbach's α for internal consistency was 0.73 (95% CI, 0.67 - 0.83); and the Intraclass Correlation Coefficient test-retest reliability was acceptable: 0.72 (95% CI, 0.53 - 0.82), however, the limits of agreement were large. The Intraclass Correlation Coefficient measuring concurrent validity 0.65 (95% CI, 0.46 - 0.80). The Dutch version of the Brief Illness Perception Questionnaire is an appropriate instrument for measuring patients' perceptions in acute low back pain patients, showing acceptable internal consistency and reliability. Concurrent validity is adequate, however, the instrument may be unsuitable for detecting changes in low back pain perception over time.
Reliability of a Computerized Neurocognitive Test in Baseline Concussion Testing of High School Athletes.

PubMed

MacDonald, James; Duerson, Drew

2015-07-01

Baseline assessments using computerized neurocognitive tests are frequently used in the management of sport-related concussions. Such testing is often done on an annual basis in a community setting. Reliability is a fundamental test characteristic that should be established for such tests. Our study examined the test-retest reliability of a computerized neurocognitive test in high school athletes over 1 year. Repeated measures design. Two American high schools. High school athletes (N = 117) participating in American football or soccer during the 2011-2012 and 2012-2013 academic years. All study participants completed 2 baseline computerized neurocognitive tests taken 1 year apart at their respective schools. The test measures performance on 4 cognitive tasks: identification speed (Attention), detection speed (Processing Speed), one card learning accuracy (Learning), and one back speed (Working Memory). Reliability was assessed by measuring the intraclass correlation coefficient (ICC) between the repeated measures of the 4 cognitive tasks. Pearson and Spearman correlation coefficients were calculated as a secondary outcome measure. The measure for identification speed performed best (ICC = 0.672; 95% confidence interval, 0.559-0.760) and the measure for one card learning accuracy performed worst (ICC = 0.401; 95% confidence interval, 0.237-0.542). All tests had marginal or low reliability. In a population of high school athletes, computerized neurocognitive testing performed in a community setting demonstrated low to marginal test-retest reliability on baseline assessments 1 year apart. Further investigation should focus on (1) improving the reliability of individual tasks tested, (2) controlling for external factors that might affect test performance, and (3) identifying the ideal time interval to repeat baseline testing in high school athletes. Computerized neurocognitive tests are used frequently in high school athletes, often within a model of baseline testing of asymptomatic individuals before the start of a sporting season. This study adds to the evidence that suggests in this population such testing may lack sufficient reliability to support clinical decision making.
Validation of the Chinese Version of the Quality of Nursing Work Life Scale

PubMed Central

Fu, Xia; Xu, Jiajia; Song, Li; Li, Hua; Wang, Jing; Wu, Xiaohua; Hu, Yani; Wei, Lijun; Gao, Lingling; Wang, Qiyi; Lin, Zhanyi; Huang, Huigen

2015-01-01

Quality of Nursing Work Life (QNWL) serves as a predictor of a nurse’s intent to leave and hospital nurse turnover. However, QNWL measurement tools that have been validated for use in China are lacking. The present study evaluated the construct validity of the QNWL scale in China. A cross-sectional study was conducted conveniently from June 2012 to January 2013 at five hospitals in Guangzhou, which employ 1938 nurses. The participants were asked to complete the QNWL scale and the World Health Organization Quality of Life abbreviated version (WHOQOL-BREF). A total of 1922 nurses provided the final data used for analyses. Sixty-five nurses from the first investigated division were re-measured two weeks later to assess the test-retest reliability of the scale. The internal consistency reliability of the QNWL scale was assessed using Cronbach’s α. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC). Criterion-relation validity was assessed using the correlation of the total scores of the QNWL and the WHOQOL-BREF. Construct validity was assessed with the following indices: χ2 statistics and degrees of freedom; relative mean square error of approximation (RMSEA); the Akaike information criterion (AIC); the consistent Akaike information criterion (CAIC); the goodness-of-fit index (GFI); the adjusted goodness of fit index; and the comparative fit index (CFI). The findings demonstrated high internal consistency (Cronbach’s α = 0.912) and test-retest reliability (interclass correlation coefficient = 0.74) for the QNWL scale. The chi-square test (χ2 = 13879.60, df [degree of freedom] = 813 P = 0.0001) was significant. The RMSEA value was 0.091, and AIC = 1806.00, CAIC = 7730.69, CFI = 0.93, and GFI = 0.74. The correlation coefficient between the QNWL total scores and the WHOQOL-BREF total scores was 0.605 (p<0.01). The QNWL scale was reliable and valid in Chinese-speaking nurses and could be used as a clinical and research instrument for measuring work-related factors among nurses in China. PMID:25950838
Age Band 1 of the Movement Assessment Battery for Children-Second Edition: Exploring Its Usefulness in Mainland China

ERIC Educational Resources Information Center

Hua, Jing; Gu, Guixiong; Meng, Wei; Wu, Zhuochun

2013-01-01

The aim of this paper was to examine the validity and reliability of age band 1 of the Movement Assessment Battery for Children-Second Edition (MABC-2) in preparation for its standardization in mainland China. Interrater and test-retest reliability of the MABC-2 was estimated using Intraclass Correlation Coefficient (ICC). Cronbach's alpha for…
[Reliability and validity of a Mexican version of the Pro Children Project questionnaire].

PubMed

Ochoa-Meza, Gerardo; Sierra, Juan Carlos; Pérez-Rodrigo, Carmen; Aranceta Bartrina, Javier; Esparza-Del Villar, Óscar A

2014-08-01

To determine the test-retest reliability, the internal consistency, and the predictive validity of the constructs of the Mexican version of the Pro Children Project questionnaire (PCHP) for assessing personal and environmental factors related to fruit and vegetable intake in 10-12 year-old schoolchildren. Test-retest design with a 14 days interval. A sample of 957 children completed the questionnaire with 82 items. The study was conducted at eight primary schools in 2012 in Ciudad Juarez, Chihuahua, Mexico. For all fruit constructs and vegetable constructs, the test-retest reliability was moderate (intraclass correlation coefficient (ICC) > 0.60). Cronbach s alpha values were from moderate to high (range of 0.54 to 0.92) similar to those in the original study. Values for predictive validity ranged from moderate to good with Spearman correlations between 0.23 and 0.60 for personal factors and between 0.14 and 0.40 for environmental factors. The results of the Mexican version of the PCHP questionnaire provide a sufficient reliability and validity for assessing personal and environmental factors of fruit and vegetable intake in 10-12 year old schoolchildren. Finally, implications to administer this instrument in scholar settings and guidelines for futures studies are discussed. Copyright AULA MEDICA EDICIONES 2014. Published by AULA MEDICA. All rights reserved.
Reliability and validity of the Iranian version of the QAPACE in adolescents.

PubMed

Amiri, Parisa; Jalali-Farahani, Sara; Zarkesh, Maryam; Barzin, Maryam; Kaviani, Robabeh; Ahmadizad, Sajad

2014-08-01

The aim of this study was to determine the reliability and validity of the Iranian version of the Quantification de l'Activite Physique en Altitude Chez les Enfants (QAPACE) in adolescents. After linguistic validation, the Iranian version of the QAPACE was completed by 359 (52.4 % girls) schoolchildren, aged 15-18 years. Test-retest reliability of the questionnaire was determined by intraclass correlation coefficients (ICCs). For validation purposes, two methods were used for (1) the correlation between VO2peak and the DEE and (2) known-group validity, which was examined by comparing the normal weight adolescents and those who were overweight/obese. ICCs for test-retest ranged from 0.79 to 0.98. The mean scores in test-retest surveys for total score and all of the subscores were significant (p < 0.05). Sex-specific analysis showed a significant correlation between VO2peak and DEE over 12-month, school, and vacation periods in girls (p < 0.05). The mean values for all activities except for transportation, other activities in school, personal artistic activities, sport competition, and home activities were significantly lower in overweight/obese group than normal group. Our results support the initial reliability and validity of the Iranian version of QAPACE as a daily physical activity measure in adolescents.
Study samples are too small to produce sufficiently precise reliability coefficients.

PubMed

Charter, Richard A

2003-04-01

In a survey of journal articles, test manuals, and test critique books, the author found that a mean sample size (N) of 260 participants had been used for reliability studies on 742 tests. The distribution was skewed because the median sample size for the total sample was only 90. The median sample sizes for the internal consistency, retest, and interjudge reliabilities were 182, 64, and 36, respectively. The author presented sample size statistics for the various internal consistency methods and types of tests. In general, the author found that the sample sizes that were used in the internal consistency studies were too small to produce sufficiently precise reliability coefficients, which in turn could cause imprecise estimates of examinee true-score confidence intervals. The results also suggest that larger sample sizes have been used in the last decade compared with those that were used in earlier decades.
A two-factor theory for concussion assessment using ImPACT: memory and speed.

PubMed

Schatz, Philip; Maerlender, Arthur

2013-12-01

We present the initial validation of a two-factor structure of Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) using ImPACT composite scores and document the reliability and validity of this factor structure. Factor analyses were conducted for baseline (N = 21,537) and post-concussion (N = 560) data, yielding "Memory" (Verbal and Visual) and "Speed" (Visual Motor Speed and Reaction Time) Factors; inclusion of Total Symptom Scores resulted in a third discrete factor. Speed and Memory z-scores were calculated, and test-retest reliability (using intra-class correlation coefficients) at 1 month (0.88/0.81), 1 year (0.85/0.75), and 2 years (0.76/0.74) were higher than published data using Composite scores. Speed and Memory scores yielded 89% sensitivity and 70% specificity, which was higher than composites (80%/62%) and comparable with subscales (91%/69%). This emergent two-factor structure has improved test-retest reliability with no loss of sensitivity/specificity and may improve understanding and interpretability of ImPACT test results.
The Jebsen Taylor Test of Hand Function: A Pilot Test-Retest Reliability Study in Typically Developing Children.

PubMed

Reedman, Sarah Elizabeth; Beagley, Simon; Sakzewski, Leanne; Boyd, Roslyn N

2016-08-01

The aim of this pilot study was to evaluate reproducibility of the Jebsen Taylor Test of Hand Function (JTTHF) in children. Eighty-seven typically developing children 5 to 10 years old were included from five Outside School Hours Care centers in the Greater Brisbane Region, Australia. Hand function was assessed on two occasions with a modified JTTHF, then reproducibility was assessed using Intraclass Correlation Coefficient (ICC [3,1]) and the Standard Error of Measurement (SEM). Total scores for male and female children were not significantly different. Five-year-old children were significantly different to all other age groups and were excluded from further analysis. Results for 71 children, 6 to 10 years old were analyzed (mean age 8.31 years (SD 1.32); 33 males). Test-retest reliability for total scores on the dominant and nondominant hands were ICC 0.74 (95% CI 0.61, 0.83) and ICC 0.72 (95% CI 0.59, 0.82), respectively. 'Writing' and 'Simulated Feeding' subtests demonstrated poor reproducibility. The Smallest Real Difference was 5.09 seconds for total score on the dominant hand. Findings indicate good test-retest reliability for the JTTHF total score to measure hand function in typically developing children aged 6 to 10 years.
Development and psychometric testing of a trans-professional evidence-based practice profile questionnaire.

PubMed

McEvoy, Maureen Patricia; Williams, Marie T; Olds, Timothy Stephen

2010-01-01

Previous survey tools operationalising knowledge, attitudes or beliefs about evidence-based practice (EBP) have shortcomings in content, psychometric properties and target audience. This study developed and psychometrically assessed a self-report trans-professional questionnaire to describe an EBP profile. Sixty-six items were collated from existing EBP questionnaires and administered to 526 academics and students from health and non-health backgrounds. Principal component factor analysis revealed the presence of five factors (Relevance, Terminology, Confidence, Practice and Sympathy). Following expert panel review and pilot testing, the 58-item final questionnaire was disseminated to 105 subjects on two occasions. Test-retest and internal reliability were quantified using intra-class correlation coefficients (ICCs) and Cronbach's alpha, convergent validity against a commonly used EBP questionnaire by Pearson's correlation coefficient and discriminative validity via analysis of variance (ANOVA) based on exposure to EBP training. The final questionnaire demonstrated acceptable internal consistency (Cronbach's alpha 0.96), test-retest reliability (ICCs range 0.77-0.94) and convergent validity (Practice 0.66, Confidence 0.80 and Sympathy 0.54). Three factors (Relevance, Terminology and Confidence) distinguished EBP exposure groups (ANOVA p < 0.001-0.004). The evidence-based practice profile (EBP(2)) questionnaire is a reliable instrument with the ability to discriminate for three factors, between respondents with differing EBP exposures.
Reliability of Triaxial Accelerometry for Measuring Load in Men's Collegiate Ice Hockey.

PubMed

Van Iterson, Erik H; Fitzgerald, John S; Dietz, Calvin C; Snyder, Eric M; Peterson, Ben J

2017-05-01

Van Iterson, EH, Fitzgerald, JS, Dietz, CC, Snyder, EM, and Peterson, BJ. Reliability of triaxial accelerometry for measuring load in men's collegiate ice hockey. J Strength Cond Res 31(5): 1305-1312, 2017-Wearable microsensor technology incorporating triaxial accelerometry is used to quantify an index of mechanical stress associated with sport-specific movements termed PlayerLoad. The test-retest reliability of PlayerLoad in the environmental setting of ice hockey is unknown. The primary aim of this study was to quantify the test-retest reliability of PlayerLoad in ice hockey players during performance of tasks simulating game conditions. Division I collegiate male ice hockey players (N = 8) wore Catapult Optimeye S5 monitors during repeat performance of 9 ice hockey tasks simulating game conditions. Ordered ice hockey tasks during repeated bouts included acceleration (forward or backward), 60% top-speed, top-speed (forward or backward), repeated shift circuit, ice coasting, slap shot, and bench sitting. Coefficient of variation (CV), intraclass correlation coefficient (ICC), and minimum difference (MD) were used to assess PlayerLoad reliability. Test-retest CVs and ICCs of PlayerLoad were as follows: 8.6% and 0.54 for forward acceleration, 13.8% and 0.78 for backward acceleration, 2.2% and 0.96 for 60% top-speed, 7.5% and 0.79 for forward top-speed, 2.8% and 0.96 for backward top-speed, 26.6% and 0.95 for repeated shift test, 3.9% and 0.68 for slap shot, 3.7% and 0.98 for coasting, and 4.1% and 0.98 for bench sitting, respectively. Raw differences between bouts were not significant for ice hockey tasks (p > 0.05). For each task, between-bout raw differences were lower vs. MD: 0.06 vs. 0.35 (forward acceleration), 0.07 vs. 0.36 (backward acceleration), 0.00 vs. 0.06 (60% top-speed), 0.03 vs. 0.20 (forward top-speed), 0.02 vs. 0.09 (backward top-speed), 0.18 vs. 0.64 (repeated shift test), 0.02 vs. 0.10 (slap shot), 0.00 vs. 0.10 (coasting), and 0.01 vs. 0.11 (bench sitting), respectively. These data suggest that PlayerLoad demonstrates moderate-to-large test-retest reliability in the environmental setting of male Division I collegiate ice hockey. Without previously testing reliability, these data are important as PlayerLoad is routinely quantified in male collegiate ice hockey to assess on ice physical activity.
Development and testing of the Youth Alcohol Norms Survey (YANS) instrument to measure youth alcohol norms and psychosocial influences.

PubMed

Burns, Sharyn K; Maycock, Bruce; Hildebrand, Janina; Zhao, Yun; Allsop, Steve; Lobo, Roanna; Howat, Peter

2018-05-14

This study aimed to develop and validate an online instrument to: (1) identify common alcohol-related social influences, norms and beliefs among adolescents; (2) clarify the process and pathways through which proalcohol norms are transmitted to adolescents; (3) describe the characteristics of social connections that contribute to the transmission of alcohol norms; and (4) identify the influence of alcohol marketing on adolescent norm development. The online Youth Alcohol Norms Survey (YANS) was administered in secondary schools in Western Australia PARTICIPANTS: Using a 2-week test-retest format, the YANS was administered to secondary school students (n=481, age=13-17 years, female 309, 64.2%). The development of the YANS was guided by social cognitive theory and comprised a systematic multistage process including evaluation of content and face validity. A 2-week test-retest format was employed. Exploratory factor analysis was conducted to determine the underlying factor structure of the instrument. Test-retest reliability was examined using intraclass correlation coefficient (ICC) and Cohen's kappa. A five-factor structure with meaningful components and robust factorial loads was identified, and the five factors were labelled as 'individual attitudes and beliefs', 'peer and community identity', 'sibling influences', 'school and community connectedness' and 'injunctive norms', respectively. The instrument demonstrated stability across the test-retest procedure (ICC=0.68-0.88, Cohen's kappa coefficient=0.69) for most variables. The results support the reliability and factorial validity of this instrument. The YANS presents a promising tool, which enables comprehensive assessment of reciprocal individual, behavioural and environmental factors that influence alcohol-related norms among adolescents. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Reliability of the Q Force; a mobile instrument for measuring isometric quadriceps muscle strength.

PubMed

Douma, K W; Regterschot, G R H; Krijnen, W P; Slager, G E C; van der Schans, C P; Zijlstra, W

2016-01-01

The ability to generate muscle strength is a pre-requisite for all human movement. Decreased quadriceps muscle strength is frequently observed in older adults and is associated with a decreased performance and activity limitations. To quantify the quadriceps muscle strength and to monitor changes over time, instruments and procedures with a sufficient reliability are needed. The Q Force is an innovative mobile muscle strength measurement instrument suitable to measure in various degrees of extension. Measurements between 110 and 130° extension present the highest values and the most significant increase after training. The objective of this study is to determine the test-retest reliability of muscle strength measurements by the Q Force in older adults in 110° extension. Forty-one healthy older adults, 13 males and 28 females were included in the study. Mean (SD) age was 81.9 (4.89) years. Isometric muscle strength of the Quadriceps muscle was assessed with the Q Force at 110° of knee extension. Participants were measured at two sessions with a three to eight day interval between sessions. To determine relative reliability, the intraclass correlation coefficient (ICC) was calculated. To determine absolute reliability, Bland and Altman Limits of Agreement (LOA) were calculated and t-tests were performed. Relative reliability of the Q Force is good to excellent as all ICC coefficients are higher than 0.75. Generally a large 95 % LOA, reflecting only moderate absolute reliability, is found as exemplified for the peak torque left leg of -18.6 N to 33.8 N and the right leg of -9.2 N to 26.4 N was between 15.7 and 23.6 Newton representing 25.2 % to 39.9 % of the size of the mean. Small systematic differences in mean were found between measurement session 1 and 2. The present study shows that the Q Force has excellent relative test-retest reliability, but limited absolute test-retest reliability. Since the Q Force is relatively cheap and mobile it is suitable for application in various clinical settings, however, its capability to detect changes in muscle force over time is limited but comparable to existing instruments.
The cross-cultural adaptation, reliability, and validity of the Copenhagen Neck Functional Disability Scale in patients with chronic neck pain: Turkish version study.

PubMed

Yapali, Gökmen; Günel, Mintaze Kerem; Karahan, Sevilay

2012-05-15

The study design was cross-cultural adaptation and investigation of reliability and validity of the Copenhagen Neck Functional Disability Scale (CNFDS). The aim of this study was to translate the CNFDS into Turkish language and assess its reliability and validity among patients with neck pain in Turkish population. The CNFDS is a reliable and valid evaluation instrument for disability, but there is no published the Turkish version of the CNFDS. One hundred one subjects who had chronic neck pain were included in this study. The CNFDS, Neck Pain and Disability Scale, and visual analogue scale were administered to all subjects. For investigating test-retest reliability, correlation between CNFDS scores, applied at 1-week interval, intraclass correlation coefficient score for test-retest reliability was 0.86 (95% confidence interval = 0.679-0.935). There was no difference between test-retest scores (P < 0.001). For investigating concurrent validity, correlation between total score of the CNFDS and the mean visual analogue scale was r = 0.73 (P < 0.001). Concurrent validity of the CNFDS was very good. For investigating construct validity, correlation between total score of the CNFDS and the Neck Pain and Disability Scale was r = 0.78 (P < 0.001). Construct validity of the CNFDS was also very good. Our results suggest that the Turkish version of the CNFDS is a reliable and valid instrument for Turkish people.

Test-retest reliability of evoked heat stimulation BOLD fMRI.

PubMed

Upadhyay, Jaymin; Lemme, Jordan; Anderson, Julie; Bleakman, David; Large, Thomas; Evelhoch, Jeffrey L; Hargreaves, Richard; Borsook, David; Becerra, Lino

2015-09-30

To date, the blood oxygenated-level dependent (BOLD) functional magnetic resonance imaging (fMRI) technique has enabled an objective and deeper understanding of pain processing mechanisms embedded within the human central nervous system (CNS). In order to further comprehend the benefits and limitations of BOLD fMRI in the context of pain as well as the corresponding subjective pain ratings, we evaluated the univariate response, test-retest reliability and confidence intervals (CIs) at the 95% level of both data types collected during evoked stimulation of 40°C (non-noxious), 44°C (mildly noxious) and a subject-specific temperature eliciting a 7/10 pain rating. The test-retest reliability between two scanning sessions was determined by calculating group-level interclass correlation coefficients (ICCs) and at the single-subject level. Across the three stimuli, we initially observed a graded response of increasing magnitude for both VAS (visual analog score) pain ratings and fMRI data. Test-retest reliability was observed to be highest for VAS pain ratings obtained during the 7/10 pain stimulation (ICC=0.938), while ICC values of pain fMRI data for a distribution of CNS structures ranged from 0.5 to 0.859 (p<0.05). Importantly, the upper and lower confidence interval CI bounds reported herein could be utilized in subsequent trials involving healthy volunteers to hypothesize the magnitude of effect required to overcome inherent variability of either VAS pain ratings or BOLD responses evoked during innocuous or noxious thermal stimulation. Copyright © 2015 Elsevier B.V. All rights reserved.
The MG Composite

PubMed Central

Burns, Ted M.; Conaway, Mark; Sanders, Donald B.

2010-01-01

Objective: To study the concurrent and construct validity and test-retest reliability in the practice setting of an outcome measure for myasthenia gravis (MG). Methods: Eleven centers participated in the validation study of the Myasthenia Gravis Composite (MGC) scale. Patients with MG were evaluated at 2 consecutive visits. Concurrent and construct validities of the MGC were assessed by evaluating MGC scores in the context of other MG-specific outcome measures. We used numerous potential indicators of clinical improvement to assess the sensitivity and specificity of the MGC for detecting clinical improvement. Test-retest reliability was performed on patients at the University of Virginia. Results: A total of 175 patients with MG were enrolled at 11 sites from July 1, 2008, to January 31, 2009. A total of 151 patients were seen in follow-up. Total MGC scores showed excellent concurrent validity with other MG-specific scales. Analyses of sensitivities and specificities of the MGC revealed that a 3-point improvement in total MGC score was optimal for signifying clinical improvement. A 3-point improvement in the MGC also appears to represent a meaningful improvement to most patients, as indicated by improved 15-item myasthenia gravis quality of life scale (MG-QOL15) scores. The psychometric properties were no better for an individualized subscore made up of the 2 functional domains that the patient identified as most important to treat. The test-retest reliability coefficient of the MGC was 98%, with a lower 95% confidence interval of 97%, indicating excellent test-retest reliability. Conclusions: The Myasthenia Gravis Composite is a reliable and valid instrument for measuring clinical status of patients with myasthenia gravis in the practice setting and in clinical trials. PMID:20439845
Test-Retest Reliability of Diffusion Tensor Imaging in Huntington's Disease.

PubMed

Cole, James H; Farmer, Ruth E; Rees, Elin M; Johnson, Hans J; Frost, Chris; Scahill, Rachael I; Hobbs, Nicola Z

2014-03-21

Diffusion tensor imaging (DTI) has shown microstructural abnormalities in patients with Huntington's Disease (HD) and work is underway to characterise how these abnormalities change with disease progression. Using methods that will be applied in longitudinal research, we sought to establish the reliability of DTI in early HD patients and controls. Test-retest reliability, quantified using the intraclass correlation coefficient (ICC), was assessed using region-of-interest (ROI)-based white matter atlas and voxelwise approaches on repeat scan data from 22 participants (10 early HD, 12 controls). T1 data was used to generate further ROIs for analysis in a reduced sample of 18 participants. The results suggest that fractional anisotropy (FA) and other diffusivity metrics are generally highly reliable, with ICCs indicating considerably lower within-subject compared to between-subject variability in both HD patients and controls. Where ICC was low, particularly for the diffusivity measures in the caudate and putamen, this was partly influenced by outliers. The analysis suggests that the specific DTI methods used here are appropriate for cross-sectional research in HD, and give confidence that they can also be applied longitudinally, although this requires further investigation. An important caveat for DTI studies is that test-retest reliability may not be evenly distributed throughout the brain whereby highly anisotropic white matter regions tended to show lower relative within-subject variability than other white or grey matter regions.
Reliability of measures of transient evoked otoacoustic emissions with contralateral suppression.

PubMed

Stuart, Andrew; Cobb, Kensi M

2015-01-01

The reliability of measures of transient evoked otoacoustic emissions (TEOAEs) with contralateral suppression was examined. The effect of test session (i.e., initial test; retest without probe removal; retest with probe removal; and retest 1-2 days post initial test), gender, and ear was examined in 14 young adult females and 14 young adult males. TEOAEs were obtained bilaterally with 60 dB peSPL linear click stimuli with and without a contralateral 65 dB SPL broadband noise suppressor. Absolute TEOAE suppression and a normalized index of TEOAE suppression (i.e., percentage of suppression) were examined. Reliability of these measures was assessed with repeated measures linear mixed model analysis of variance, a coefficient of reliability, and Bland-Altman analyses. There were no statistically significant (p>0.05) main effects of test, gender, and ear or interactions for both absolute dB and % TEOAE suppression values. Cronbach's α were greater than 0.90 across the four tests for both TEOAE measures. Mean test differences or bias (i.e., between the initial and subsequent tests) for absolute and % TEOAE suppression ranged from -0.05 to 0.11 dB and -1.5% to 1.1%, respectively. There was no proportional/systematic bias with the mean differences of the first and subsequent measurements. Data herein were consistent with the view that bilateral TEOAE suppression measures are reliable across test sessions of 1-2 days among females and males and may provide a method to monitor medial olivocochlear efferent reflex status over time. Copyright © 2015 Elsevier Inc. All rights reserved.
The Korean version of the Carpal Tunnel Questionnaire. Cross cultural adaptation, reliability, validity and responsiveness.

PubMed

Kim, J K; Lim, H M

2015-02-01

The purpose of this study was to translate and culturally adapt the Carpal Tunnel Questionnaire to produce an equivalent Korean version. A total of 53 patients completed the Korean version of the Carpal Tunnel Questionnaire pre-operatively and 3 months after open carpal tunnel release. All 53 also completed the Korean version of the Disabilities of Arm, Shoulder, and Hand questionnaire pre-operatively and 3 months post-operatively. Reliability was measured by determining the test-retest reliability and internal consistency. Test-retest reliability was assessed using intraclass correlation coefficients and paired t-tests, and internal consistency using Cronbach's alpha coefficients. Pearson correlation analysis was carried out on the Korean version of the Carpal Tunnel Questionnaire scores and the Korean version of the Disabilities of Arm, Shoulder, and Hand scores to assess construct validity. Responsiveness was evaluated using effect sizes and standardized response means. The reliability of the Korean version of the Carpal Tunnel Questionnaire was good. The scores in the Korean version of the Disabilities of Arm, Shoulder, and Hand strongly correlated with the scores in the Korean version of the Carpal Tunnel Questionnaire. Standardized response mean and effect size were both large for the Korean version of the Carpal Tunnel Questionnaire. The study shows that the Korean version of the Carpal Tunnel Questionnaire is a reliable, valid and responsive instrument for measuring outcomes in carpal tunnel syndrome. © The Author(s) 2014.
Validation of the Persian version of the dysphagia handicap index in patients with neurological disorders

PubMed Central

Barzegar-Bafrooei, Ebrahim; Bakhtiary, Jalal; Khatoonabadi, Ahmad Reza; Fatehi, Farzad; Maroufizadeh, Saman; Fathali, Mojtaba

2016-01-01

Background: Dysphagia as a common condition affecting many aspects of the patient’s life. The Dysphagia Handicap Index (DHI) is a reliable self-reported questionnaire developed specifically to measure the impact of dysphagia on the patient’s quality of life. The aim of this study was to translate the questionnaire to Persian and to measure its validity and reliability in patients with neurogenic oropharyngeal dysphagia. Methods: A formal forward-backward translation of DHI was performed based on the guidelines for the cross-cultural adaptation of self-report measures. A total of 57 patients with neurogenic dysphagia who were referred to the neurology clinics of Tehran University of Medical Sciences, Iran, participated in this study. Internal consistency reliability of the DHI was examined using Cronbach’s alpha, and test-retest reliability of the scale was evaluated using intraclass correlation coefficient (ICC). Results: The internal consistency of the Persian DHI (P-DHI) was considered to be good; Cronbach’s alpha coefficient for the total P-DHI was 0.88. The test-retest reliability for the total and three subscales of the P-DHI ranged from 0.95 to 0.98 using ICC. Conclusion: The P-DHI demonstrated a good reliability, and it can be a valid instrument for evaluating the dysphagia effects on quality of life among Persian language population. PMID:27648173
Validity and reliability of a Malay version of the Lawton instrumental activities of daily living scale among the Malay speaking elderly in Malaysia.

PubMed

Kadar, Masne; Ibrahim, Suhaili; Razaob, Nor Afifi; Chai, Siaw Chui; Harun, Dzalani

2018-02-01

The Lawton Instrumental Activities of Daily Living Scale is a tool often used to assess independence among elderly at home. Its suitability to be used with the elderly population in Malaysia has not been validated. This current study aimed to assess the validity and reliability of the Lawton Instrumental Activities of Daily Living Scale - Malay Version to Malay speaking elderly in Malaysia. This study was divided into three phases: (1) translation and linguistic validity involving both forward and backward translations; (2) establishment of face validity and content validity; and (3) establishment of reliability involving inter-rater, test-retest and internal consistency analyses. Data used for these analyses were obtained by interviewing 65 elderly respondents. Percentages of Content Validity Index for 4 criteria were from 88.89 to 100.0. The Cronbach α coefficient for internal consistency was 0.838. Intra-class Correlation Coefficient of inter-rater reliability and test-retest reliability was 0.957 and 0.950 respectively. The result shows that the Lawton Instrumental Activities of Daily Living Scale - Malay Version has excellent reliability and validity for use with the Malay speaking elderly people in Malaysia. This scale could be used by professionals to assess functional ability of elderly who live independently in community. © 2018 Occupational Therapy Australia.
Test-retest reliability of sensor-based sit-to-stand measures in young and older adults.

PubMed

Regterschot, G Ruben H; Zhang, Wei; Baldus, Heribert; Stevens, Martin; Zijlstra, Wiebren

2014-01-01

This study investigated test-retest reliability of sensor-based sit-to-stand (STS) peak power and other STS measures in young and older adults. In addition, test-retest reliability of the sensor method was compared to test-retest reliability of the Timed Up and Go Test (TUGT) and Five-Times-Sit-to-Stand Test (FTSST) in older adults. Ten healthy young female adults (20-23 years) and 31 older adults (21 females; 73-94 years) participated in two assessment sessions separated by 3-8 days. Vertical peak power was assessed during three (young adults) and five (older adults) normal and fast STS trials with a hybrid motion sensor worn on the hip. Older adults also performed the FTSST and TUGT. The average sensor-based STS peak power of the normal STS trials and the average sensor-based STS peak power of the fast STS trials showed excellent test-retest reliability in young adults (intra-class correlation (ICC)≥0.90; zero in 95% confidence interval of mean difference between test and retest (95%CI of D); standard error of measurement (SEM)≤6.7% of mean peak power) and older adults (ICC≥0.91; zero in 95%CI of D; SEM≤9.9%). Test-retest reliability of sensor-based STS peak power and TUGT (ICC=0.98; zero in 95%CI of D; SEM=8.5%) was comparable in older adults, test-retest reliability of the FTSST was lower (ICC=0.73; zero outside 95%CI of D; SEM=14.4%). Sensor-based STS peak power demonstrated excellent test-retest reliability and may therefore be useful for clinical assessment of functional status and fall risk. Copyright © 2014 Elsevier B.V. All rights reserved.
Transcultural adaptation to Brazilian Portuguese and reliability of the effort-reward imbalance in household and family work

PubMed Central

de Vasconcellos, Ilmeire Ramos Rosembach; Griep, Rosane Härter; Portela, Luciana; Alves, Márcia Guimarães de Mello; Rotenberg, Lúcia

2016-01-01

ABSTRACT OBJECTIVE To describe the steps in the transcultural adaptation of the scale in the Effort-reward imbalance model to household and family work to the Brazilian context. METHODS We performed the translation, back-translation, and initial psychometric evaluation of the questionnaire that comprised three dimensions: (i) effort (eight items, emphasizing quantitative workload), (ii) reward (11 items that seek to capture the intrinsic value of family and household work, societal esteem, recognition from the spouse/partner, and affection from the children), and (iii) overcommitment (four items related to intrinsic effort). The scale was included in a sectional study conducted with 1,045 nursing workers. A subsample of 222 subjects answered the questionnaire for a second time, seven to 15 days thereafter. The data were collected between October 2012 and May 2013. The internal consistency of the scale was evaluated using Cronbach’s alpha and test-retest reliability analysis, square weighted kappa, prevalence and bias adjusted Kappa, and intraclass correlation coefficient. RESULTS Prevalence and bias-adjusted Kappa (ka) of the scale dimensions ranged from 0.80-0.83 for overcommitment, 0.78-0.90 for effort, and 0.76-0.93 for reward. In most dimensions, the values of minimum and maximum scores, average, standard deviation, and Cronbach’s alpha were similar in test and retest scores. Only on societal esteem subdimension (reward) was there little variation in standard deviation (test score of 2.24 and retest score of 3.36) and in Cronbach’s alpha coefficient (test score of 0.38 and retest score of 0.59). CONCLUSIONS The Brazilian version of the scale was found to have proper reliability indices regarding time stability, which suggests adapting it to be used in population with characteristics that are similar to the one in this study. PMID:27355466
We need more replication research - A case for test-retest reliability.

PubMed

Leppink, Jimmie; Pérez-Fuster, Patricia

2017-06-01

Following debates in psychology on the importance of replication research, we have also started to see pleas for a more prominent role for replication research in medical education. To enable replication research, it is of paramount importance to carefully study the reliability of the instruments we use. Cronbach's alpha has been the most widely used estimator of reliability in the field of medical education, notably as some kind of quality label of test or questionnaire scores based on multiple items or of the reliability of assessment across exam stations. However, as this narrative review outlines, Cronbach's alpha or alternative reliability statistics may complement but not replace psychometric methods such as factor analysis. Moreover, multiple-item measurements should be preferred above single-item measurements, and when using single-item measurements, coefficients as Cronbach's alpha should not be interpreted as indicators of the reliability of a single item when that item is administered after fundamentally different activities, such as learning tasks that differ in content. Finally, if we want to follow up on recent pleas for more replication research, we have to start studying the test-retest reliability of the instruments we use.
Transcultural validation of the Oxford Shoulder Score for the French-speaking population.

PubMed

Tuton, D; Barbe, C; Salmon, J-H; Dramé, M; Nérot, C; Ohl, X

2016-09-01

Patient-reported outcome measures (PROMs) have been gaining in popularity over the last decade. The Oxford Shoulder Score (OSS) is a well-established self-administered questionnaire for shoulder evaluation adapted for the English-speaking population. The aim of the present study was to develop a translation and a transcultural adaptation of the OSS and to assess its validity in native French-speaker patients with shoulder pain. The translation process was carried out following a translation/back-translation methodology by two translators. All patients completed the French OSS, the Subjective Shoulder Value (SSV), and the Constant score. Internal consistency was tested using Cronbach's α coefficient. Validity was assessed by calculating the Pearson correlation coefficient between the OSS and the Constant score and the SSV. One hundred forty-four patients suffering from degenerative or inflammatory diseases of the shoulder were included in this study. The average time required to complete the French OSS was 2min and 45s. Seventy patients were asked to complete the questionnaire twice (test/retest reliability). Internal consistency was high with Cronbach's α coefficient=0.93. The intraclass correlation coefficient was 0.91 (95% CI: 0.88-0.94) for test/retest reliability. The French OSS score was significantly correlated with the Constant-Murley score (r=0.73 and P<0.0001) and with the SSV (r=0.68 and P<0.0001). The present study shows that the French version of the OSS is reliable, valid, and reproducible. The sensitivity to change now needs to be evaluated. This score was adapted to the French-speaking population for the self-assessment of patients with degenerative or inflammatory disorders of the shoulder. Level 1, Test of previously developed criteria, diagnostic test study. Copyright © 2016 Elsevier Masson SAS. All rights reserved.
The Yale-Brown Obsessive Compulsive Scale: A Reliability Generalization Meta-Analysis.

PubMed

López-Pina, José Antonio; Sánchez-Meca, Julio; López-López, José Antonio; Marín-Martínez, Fulgencio; Núñez-Núñez, Rosa Maria; Rosa-Alcázar, Ana I; Gómez-Conesa, Antonia; Ferrer-Requena, Josefa

2015-10-01

The Yale-Brown Obsessive Compulsive Scale (Y-BOCS) is the most frequently applied test to assess obsessive compulsive symptoms. We conducted a reliability generalization meta-analysis on the Y-BOCS to estimate the average reliability, examine the variability among the reliability estimates, search for moderators, and propose a predictive model that researchers and clinicians can use to estimate the expected reliability of the Y-BOCS. We included studies where the Y-BOCS was applied to a sample of adults and reliability estimate was reported. Out of the 11,490 references located, 144 studies met the selection criteria. For the total scale, the mean reliability was 0.866 for coefficients alpha, 0.848 for test-retest correlations, and 0.922 for intraclass correlations. The moderator analyses led to a predictive model where the standard deviation of the total test and the target population (clinical vs. nonclinical) explained 38.6% of the total variability among coefficients alpha. Finally, clinical implications of the results are discussed. © The Author(s) 2014.
Clinimetric properties of the Tinetti Mobility Test, Four Square Step Test, Activities-specific Balance Confidence Scale, and spatiotemporal gait measures in individuals with Huntington's disease.

PubMed

Kloos, Anne D; Fritz, Nora E; Kostyk, Sandra K; Young, Gregory S; Kegelmeyer, Deb A

2014-09-01

Individuals with Huntington's disease (HD) experience balance and gait problems that lead to falls. Clinicians currently have very little information about the reliability and validity of outcome measures to determine the efficacy of interventions that aim to reduce balance and gait impairments in HD. This study examined the reliability and concurrent validity of spatiotemporal gait measures, the Tinetti Mobility Test (TMT), Four Square Step Test (FSST), and Activities-specific Balance Confidence (ABC) Scale in individuals with HD. Participants with HD [n = 20; mean age ± SD=50.9 ± 13.7; 7 male] were tested on spatiotemporal gait measures and the TMT, FSST, and ABC Scale before and after a six week period to determine test-retest reliability and minimal detectable change (MDC) values. Linear relationships between gait and clinical measures were estimated using Pearson's correlation coefficients. Spatiotemporal gait measures, the TMT total and the FSST showed good to excellent test-retest reliability (ICC > 0.75). MDC values were 0.30 m/s and 0.17 m/s for velocity in forward and backward walking respectively, four points for the TMT, and 3s for the FSST. The TMT and FSST were highly correlated with most spatiotemporal measures. The ABC Scale demonstrated lower reliability and less concurrent validity than other measures. The high test-retest reliability over a six week period and concurrent validity between the TMT, FSST, and spatiotemporal gait measures suggest that the TMT and FSST may be useful outcome measures for future intervention studies in ambulatory individuals with HD. Copyright © 2014 Elsevier B.V. All rights reserved.
Adults' past-day recall of sedentary time: reliability, validity, and responsiveness.

PubMed

Clark, Bronwyn K; Winkler, Elisabeth; Healy, Genevieve N; Gardiner, Paul G; Dunstan, David W; Owen, Neville; Reeves, Marina M

2013-06-01

Past-day recall rather than recall of past week or a usual/typical day may improve the validity of self-reported sedentary time measures. This study examined the test-retest reliability, criterion validity, and responsiveness of the seven-item questionnaire, Past-day Adults' Sedentary Time (PAST). Participants (breast cancer survivors, n = 90, age = 33-75 yr, body mass index = 25-40 kg·m) in a 6-month randomized controlled trial of a lifestyle-based weight loss intervention completed the interviewer-administered PAST questionnaire about time spent sitting/lying on the previous day for work, transport, television viewing, nonwork computer use, reading, hobbies, and other purposes (summed for total sedentary time). The instrument was administered at baseline, 7 d later for test-retest reliability (n = 86), and at follow-up. ActivPAL3-assessed sit/lie time in bouts of ≥5 min during waking hours on the recall day was used as the validity criterion measure at both baseline (n = 72) and follow-up (n = 68). Analyses included intraclass correlation coefficients, Pearson's correlations (r), and Bland-Altman plots and responsiveness index. The PAST had fair to good test-retest reliability (intraclass correlation coefficient = 0.50, 95% confidence interval [CI] = 0.32-0.64). At baseline, the correlation between PAST and activPAL sit/lie time was r = 0.57 (95% CI = 0.39-0.71). The mean difference between PAST at baseline and retest was -25 min (5.2%), 95% limits of agreement = -5.9 to 5.0 h, and the activPAL sit/lie time was -9 min (1.8%), 95% limits of agreement = -4.9 to 4.6 h. The PAST showed small but significant responsiveness (-0.44, 95% CI = -0.92 to -0.04); responsiveness of activPAL sit/lie time was not significant. The PAST questionnaire provided an easy-to-administer measure of sedentary time in this sample. Validity and reliability findings compare favorably with other sedentary time questionnaires. Past-day recall of sedentary time shows promise for use in future health behavior, epidemiological, and population surveillance studies.
Test-retest reliability and agreement of the Satisfaction with the Assistive Technology Services (SATS) instrument in two Nordic countries.

PubMed

Sund, Terje; Iwarsson, Susanne; Anttila, Heidi; Helle, Tina; Brandt, Ase

2014-07-01

The purpose of this study was to investigate test-retest reliability, agreement, internal consistency, and floor- and ceiling effects of the Danish and Finnish versions of the Satisfaction with the Assistive Technology Services (SATS) instrument among adult users of powered wheelchairs (PWCs) or powered scooters (scooters). Test-retest design, two telephone interviews 7-18 days apart of 40 informants, with mean age of 67.5 (SD 13.09) years in the Danish; and 54 informants with mean age of 55.6 (SD 12.09) years in the Finnish sample. The intra-class correlation coefficient varied between 0.57 and 0.93 for items in the Danish and between 0.41 and 0.93 in the Finnish sample. The percentage agreement varied between 54.2 and 79.5 for items in the Danish and between 69.2 and 81.1 in the Finnish sample, while the Cronbach's alpha values varied between 0.87 and 0.96 in the two samples. A ceiling effect was found in all items of both samples. This study indicates that the SATS may be reliably administered for telephone interviews among adult PWC and scooter users, and give information about aspects of the service delivery process for quality development improvement purposes. Further psychometric testing of the SATS is required.
Development and validation of college students' tuberculosis knowledge, attitudes and practices questionnaire (CS-TBKAPQ).

PubMed

Jiang, Hualin; Zhang, Shaoru; Ding, Yi; Li, Yuelu; Zhang, Tianhua; Liu, Weiping; Fan, Yahui; Li, Yan; Zhang, Rongqiang; Ma, Xuexue

2017-12-12

China faces many challenges in controlling tuberculosis (TB). One significant challenge is the control of college students' TB. In particular, cross-sectional studies of college students' knowledge, attitudes and practices (KAP) in regard to TB have attracted substantial attention. However, few measurement tools have been developed to aid processes related to expert consultation, pre-testing, reliability and validity testing. Our study developed the College Students' TB Knowledge Attitudes and Practices Questionnaire (CS-TBKAPQ) following the scale development steps. The construction of the CS-TBKAPQ was based on the Theory of Knowledge, Attitude, Belief, and Practice (KABP or KAP). The item pool was compiled from literature reviews and individual interviews. The reliability validation was assessed by calculating Cronbach's α coefficient, the split-half reliability coefficient, and the test-retest reliability coefficient. Construct validity was assessed using exploratory factor analysis (EFA) and confirmatory factor analysis (CFA). The diagnostic accuracy was evaluated using the World Health Organization Advocacy, Communication and Social Mobilization KAP Survey Questionnaire (WHO-TBKAPQ) as the reference standard. A total of 31 questionnaire items were proposed. Cronbach's α coefficient, the split-half reliability coefficient and the test-retest reliability coefficient were 0.86, 0.78 and 0.91. Four factors that explained 62.52% of the total variance were also identified in EFA and confirmed in CFA. The CFA model fit indices were x 2 /df = 1.82 (p < 0.001), GFI = 0.925, AGFI = 0.900, RMR = 0.068, and RMSEA = 0.049. The CS-TBKAPQ was significantly correlated with the WHO-TBKAPQ and the Chinese Public TB KAP Questionnaire (CDC-TBKAPQ) developed by the Chinese Center for Disease Control and Prevention (r = 0.59, 0.60, p < 0.001). The receiver operating characteristics curve (ROC) analysis suggested a cut-off point of 47.5, with which the CS-TBKAPQ showed a sensitivity of 73.63% and a specificity of 80.51% in identifying students with low-level KAP. The positive and negative predictive values were 83.23% and 69.91%. The findings of this study demonstrate that the CS-TBKAPQ is a reliable and valid tool for measuring the KAP towards TB in college students.
Environmental education curriculum evaluation questionnaire: A reliability and validity study

NASA Astrophysics Data System (ADS)

Minner, Daphne Diane

The intention of this research project was to bridge the gap between social science research and application to the environmental domain through the development of a theoretically derived instrument designed to give educators a template by which to evaluate environmental education curricula. The theoretical base for instrument development was provided by several developmental theories such as Piaget's theory of cognitive development, Developmental Systems Theory, Life-span Perspective, as well as curriculum research within the area of environmental education. This theoretical base fueled the generation of a list of components which were then translated into a questionnaire with specific questions relevant to the environmental education domain. The specific research question for this project is: Can a valid assessment instrument based largely on human development and education theory be developed that reliably discriminates high, moderate, and low quality in environmental education curricula? The types of analyses conducted to answer this question were interrater reliability (percent agreement, Cohen's Kappa coefficient, Pearson's Product-Moment correlation coefficient), test-retest reliability (percent agreement, correlation), and criterion-related validity (correlation). Face validity and content validity were also assessed through thorough reviews. Overall results indicate that 29% of the questions on the questionnaire demonstrated a high level of interrater reliability and 43% of the questions demonstrated a moderate level of interrater reliability. Seventy-one percent of the questions demonstrated a high test-retest reliability and 5% a moderate level. Fifty-five percent of the questions on the questionnaire were reliable (high or moderate) both across time and raters. Only eight questions (8%) did not show either interrater or test-retest reliability. The global overall rating of high, medium, or low quality was reliable across both coders and time, indicating that the questionnaire can discriminate differences in quality of environmental education curricula. Of the 35 curricula evaluated, 6 were high quality, 14 were medium quality and 15 were low quality. The criterion-related validity of the instrument is at current time unable to be established due to the lack of comparable measures or a concretely usable set of multidisciplinary standards. Face and content validity were sufficiently demonstrated.
Biomechanical factors associated with time to complete a change of direction cutting maneuver.

PubMed

Marshall, Brendan M; Franklyn-Miller, Andrew D; King, Enda A; Moran, Kieran A; Strike, Siobhán C; Falvey, Éanna C

2014-10-01

Cutting ability is an important aspect of many team sports, however, the biomechanical determinants of cutting performance are not well understood. This study aimed to address this issue by identifying the kinetic and kinematic factors correlated with the time to complete a cutting maneuver. In addition, an analysis of the test-retest reliability of all biomechanical measures was performed. Fifteen (n = 15) elite multidirectional sports players (Gaelic hurling) were recruited, and a 3-dimensional motion capture analysis of a 75° cut was undertaken. The factors associated with cutting time were determined using bivariate Pearson's correlations. Intraclass correlation coefficients (ICCs) were used to examine the test-retest reliability of biomechanical measures. Five biomechanical factors were associated with cutting time (2.28 ± 0.11 seconds): peak ankle power (r = 0.77), peak ankle plantar flexor moment (r = 0.65), range of pelvis lateral tilt (r = -0.54), maximum thorax lateral rotation angle (r = 0.51), and total ground contact time (r = -0.48). Intraclass correlation coefficient scores for these 5 factors, and indeed for the majority of the other biomechanical measures, ranged from good to excellent (ICC >0.60). Explosive force production about the ankle, pelvic control during single-limb support, and torso rotation toward the desired direction of travel were all key factors associated with cutting time. These findings should assist in the development of more effective training programs aimed at improving similar cutting performances. In addition, test-retest reliability scores were generally strong, therefore, motion capture techniques seem well placed to further investigate the determinants of cutting ability.
Validity of the occupational sitting and physical activity questionnaire.

PubMed

Chau, Josephine Y; Van Der Ploeg, Hidde P; Dunn, Scott; Kurko, John; Bauman, Adrian E

2012-01-01

Sitting at work is an emerging occupational health risk. Few instruments designed for use in population-based research measure occupational sitting and standing as distinct behaviors. This study aimed to develop and validate brief measure of occupational sitting and physical activity. A convenience sample (n = 99, 61% female) was recruited from two medium-sized workplaces and by word-of-mouth in Sydney, Australia. Participants completed the newly developed Occupational Sitting and Physical Activity Questionnaire (OSPAQ) and a modified version of the MONICA Optional Study on Physical Activity Questionnaire (modified MOSPA-Q) twice, 1 wk apart. Participants also wore an ActiGraph accelerometer for the 7 d in between the test and retest. Analyses determined test-retest reliability with intraclass correlation coefficients and assessed criterion validity against accelerometers using the Spearman ρ. The test-retest intraclass correlation coefficients for occupational sitting, standing, and walking for OSPAQ ranged from 0.73 to 0.90, while that for the modified MOSPA-Q ranged from 0.54 to 0.89. Comparison of sitting measures with accelerometers showed higher Spearman correlations for the OSPAQ (r = 0.65) than for the modified MOSPA-Q (r = 0.52). Criterion validity correlations for occupational standing and walking measures were comparable for both instruments with accelerometers (standing: r = 0.49; walking: r = 0.27-0.29). The OSPAQ has excellent test-retest reliability and moderate validity for estimating time spent sitting and standing at work and is comparable to existing occupational physical activity measures for assessing time spent walking at work. The OSPAQ brief instrument measures sitting and standing at work as distinct behaviors and would be especially suitable in national health surveys, prospective cohort studies, and other studies that are limited by space constraints for questionnaire items.
Reliability of anthropometric measurements in young male and female artistic gymnasts.

PubMed

Siatras, Theophanis; Skaperda, Malamati; Mameletzi, Dimitra

2010-12-01

Body dimensions and body composition of children participating in artistic activities, such as gymnastics and many types of dancing, are important factors in performance improvement. The present study aimed to determine the reliability of a series of selected anthropometric measurements in young male and female gymnasts. Segment lengths, body breadths, circumferences, and skinfold thickness were measured in 20 young gymnasts by the same experienced examiner, using portable and easy-to-use instruments. All parameters were measured twice (test-retest) under the same conditions within a week's period. The high intra-class correlation coefficient (ICC) values ranging from 0.87 to 0.99, as well as the low coefficient of variation (CV) values (<5.3%), affirmed that the selected measurements were highly reliable. The technical error of measurement (TEM) values for lengths and breadths were 0.15 to 0.80 cm, for circumferences 0.22 to 1 cm, and for skinfold thickness 0.33 to 0.58 mm. The high test-retest ICC and the low CV and TEM values confirmed the reliability of all anthropometric measurements in young artistic gymnasts. Therefore, these measurements could contribute to further research in this field of investigation, helping to monitor young artistic gymnasts' growth status and identify specific characteristics for increased performance in this sport.

Test-retest reliability of biodex system 4 pro for isometric ankle-eversion and -inversion measurement.

PubMed

Tankevicius, Gediminas; Lankaite, Doanata; Krisciunas, Aleksandras

2013-08-01

The lack of knowledge about isometric ankle testing indicates the need for research in this area. to assess test-retest reliability and to determine the optimal position for isometric ankle-eversion and -inversion testing. Test-retest reliability study. Isometric ankle eversion and inversion were assessed in 3 different dynamometer foot-plate positions: 0°, 7°, and 14° of inversion. Two maximal repetitions were performed at each angle. Both limbs were tested (40 ankles in total). The test was performed 2 times with a period of 7 d between the tests. University hospital. The study was carried out on 20 healthy athletes with no history of ankle sprains. Reliability was assessed using intraclass correlation coefficient (ICC2,1); minimal detectable change (MDC) was calculated using a 95% confidence interval. Paired t test was used to measure statistically significant changes, and P <.05 was considered statistically significant. Eversion and inversion peak torques showed high ICCs in all 3 angles (ICC values .87-.96, MDC values 3.09-6.81 Nm). Eversion peak torque was the smallest when testing at the 0° angle and gradually increased, reaching maximum values at 14° angle. The increase of eversion peak torque was statistically significant at 7 ° and 14° of inversion. Inversion peak torque showed an opposite pattern-it was the smallest when measured at the 14° angle and increased at the other 2 angles; statistically significant changes were seen only between measures taken at 0° and 14°. Isometric eversion and inversion testing using the Biodex 4 Pro system is a reliable method. The authors suggest that the angle of 7° of inversion is the best for isometric eversion and inversion testing.
Factor validity and reliability of the aberrant behavior checklist-community (ABC-C) in an Indian population with intellectual disability.

PubMed

Lehotkay, R; Saraswathi Devi, T; Raju, M V R; Bada, P K; Nuti, S; Kempf, N; Carminati, G Galli

2015-03-01

In this study realised in collaboration with the department of psychology and parapsychology of Andhra University, validation of the Aberrant Behavior Checklist-Community (ABC-C) in Telugu, the official language of Andhra Pradesh, one of India's 28 states, was carried out. To assess the factor validity and reliability of this Telugu version, 120 participants with moderate to profound intellectual disability (94 men and 26 women, mean age 25.2, SD 7.1) were rated by the staff of the Lebenshilfe Institution for Mentally Handicapped in Visakhapatnam, Andhra Pradesh, India. Rating data were analysed with a confirmatory factor analysis. The internal consistency was estimated by Cronbach's alpha. To confirm the test-retest reliability, 50 participants were rated twice with an interval of 4 weeks, and 50 were rated by pairs of raters to assess inter-rater reliability. Confirmatory factor analysis revealed that the root mean square error of approximation (RMSEA) was equal to 0.06, the comparative fit index (CFI) was equal to 0.77, and the Tucker Lewis index (TLI) was equal to 0.77, which indicated that the model with five correlated factors had a good fit. Coefficient alpha ranged from 0.85 to 0.92 across the five subscales. Spearman's rank correlation coefficients for inter-rater reliability tests ranged from 0.65 to 0.75, and the correlations for test-retest reliability ranged from 0.58 to 0.76. All reliability coefficients were statistically significant (P < 0.01). The factor validity and reliability of Telugu version of the ABC-C evidenced factor validity and reliability comparable to the original English version and appears to be useful for assessing behaviour disorders in Indian people with intellectual disabilities. © 2014 MENCAP and International Association of the Scientific Study of Intellectual and Developmental Disabilities and John Wiley & Sons Ltd.
Toward a Common Language for Measuring Patient Mobility in the Hospital: Reliability and Construct Validity of Interprofessional Mobility Measures.

PubMed

Hoyer, Erik H; Young, Daniel L; Klein, Lisa M; Kreif, Julie; Shumock, Kara; Hiser, Stephanie; Friedman, Michael; Lavezza, Annette; Jette, Alan; Chan, Kitty S; Needham, Dale M

2018-02-01

The lack of common language among interprofessional inpatient clinical teams is an important barrier to achieving inpatient mobilization. In The Johns Hopkins Hospital, the Activity Measure for Post-Acute Care (AM-PAC) Inpatient Mobility Short Form (IMSF), also called "6-Clicks," and the Johns Hopkins Highest Level of Mobility (JH-HLM) are part of routine clinical practice. The measurement characteristics of these tools when used by both nurses and physical therapists for interprofessional communication or assessment are unknown. The purposes of this study were to evaluate the reliability and minimal detectable change of AM-PAC IMSF and JH-HLM when completed by nurses and physical therapists and to evaluate the construct validity of both measures when used by nurses. A prospective evaluation of a convenience sample was used. The test-retest reliability and the interrater reliability of AM-PAC IMSF and JH-HLM for inpatients in the neuroscience department (n = 118) of an academic medical center were evaluated. Each participant was independently scored twice by a team of 2 nurses and 1 physical therapist; a total of 4 physical therapists and 8 nurses participated in reliability testing. In a separate inpatient study protocol (n = 69), construct validity was evaluated via an assessment of convergent validity with other measures of function (grip strength, Katz Activities of Daily Living Scale, 2-minute walk test, 5-times sit-to-stand test) used by 5 nurses. The test-retest reliability values (intraclass correlation coefficients) for physical therapists and nurses were 0.91 and 0.97, respectively, for AM-PAC IMSF and 0.94 and 0.95, respectively, for JH-HLM. The interrater reliability values (intraclass correlation coefficients) between physical therapists and nurses were 0.96 for AM-PAC IMSF and 0.99 for JH-HLM. Construct validity (Spearman correlations) ranged from 0.25 between JH-HLM and right-hand grip strength to 0.80 between AM-PAC IMSF and the Katz Activities of Daily Living Scale. The results were obtained from inpatients in the neuroscience department of a single hospital. The AM-PAC IMSF and JH-HLM had excellent interrater reliability and test-retest reliability for both physical therapists and nurses. The evaluation of convergent validity suggested that AM-PAC IMSF and JH-HLM measured constructs of patient mobility and physical functioning. © 2017 American Physical Therapy Association
FACTOR ANALYSIS OF A SOCIAL SKILLS SCALE FOR HIGH SCHOOL STUDENTS.

PubMed

Wang, H-Y; Lin, C-K

2015-10-01

The objective of this study was to develop a social skills scale for high school students in Taiwan. This study adopted stratified random sampling. A total of 1,729 high school students were included. The students ranged in age from 16 to 18 years. A Social Skills Scale was developed for this study and was designed for classroom teachers to fill out. The test-retest reliability of this scale was tested by Pearson's correlation coefficient. Exploratory factor analysis was used to determine construct validity. The Social Skills Scale had good overall test-retest reliability of .92, and the internal consistency of the five subscales was above .90. The results of the factor analysis showed that the Social Skills Scale covered the five domains of classroom learning skills, communication skills, individual initiative skills, interaction skills, and job-related social skills, and the five factors explained 68.34% of the variance. Thus, the Social Skills Scale had good reliability and validity and would be applicable to and could be promoted for use in schools.
Test Performance and Test-Retest Reliability of the Vestibular/Ocular Motor Screening and King-Devick Test in Adolescent Athletes During a Competitive Sport Season.

PubMed

Worts, Phillip R; Schatz, Philip; Burkhart, Scott O

2018-05-01

The Vestibular/Ocular Motor Screening (VOMS) and King-Devick (K-D) test are tools designed to assess ocular or vestibular function after a sport-related concussion. To determine the test-retest reliability and rate of false-positive results of the VOMS and K-D test in a healthy athlete sample. Cohort study (diagnosis); Level of evidence, 2. Forty-five healthy high school student-athletes (mean age, 16.11 ± 1.43 years) completed self-reported demographics and medical history and were administered the VOMS and K-D test during rest on day 1 (baseline). The VOMS and K-D test were administered again once during rest (prepractice) and once within 5 minutes of removal from sport practice on day 2 (removal). The Borg rating of perceived exertion scale was administered at removal. Intraclass correlation coefficients were used to determine test-retest reliability on the K-D test and the average near point of convergence (NPC) distance on the VOMS. Level of agreement was used to examine VOMS symptom provocation over the 3 administration times. Multivariate base rates were used to determine the rate of false-positive results when simultaneously considering multiple clinical cutoffs. Test-retest reliability of total time on the K-D test (0.91 [95% CI, 0.86-0.95]) and NPC distance (0.91 [95% CI, 0.85-0.95]) was high across the 3 administration times. Level of agreement ranged from 48.9% to 88.9% across all 3 times for the VOMS items. Using established clinical cutoffs, false-positive results occurred in 2% of the sample using the VOMS at removal and 36% using the K-D test. The VOMS displayed a false-positive rate of 2% in this high school student-athlete cohort. The K-D test's false-positive rate was 36% while maintaining a high level of test-retest reliability (0.91). Results from this study support future investigation of VOMS administration in an acutely injured high school athletic sample. Going forward, the VOMS may be more stable than other neurological and symptom report screening measures and less vulnerable to false-positive results than the K-D test.
The role of test-retest reliability in measuring individual and group differences in executive functioning.

PubMed

Paap, Kenneth R; Sawi, Oliver

2016-12-01

Studies testing for individual or group differences in executive functioning can be compromised by unknown test-retest reliability. Test-retest reliabilities across an interval of about one week were obtained from performance in the antisaccade, flanker, Simon, and color-shape switching tasks. There is a general trade-off between the greater reliability of single mean RT measures, and the greater process purity of measures based on contrasts between mean RTs in two conditions. The individual differences in RT model recently developed by Miller and Ulrich was used to evaluate the trade-off. Test-retest reliability was statistically significant for 11 of the 12 measures, but was of moderate size, at best, for the difference scores. The test-retest reliabilities for the Simon and flanker interference scores were lower than those for switching costs. Standard practice evaluates the reliability of executive-functioning measures using split-half methods based on data obtained in a single day. Our test-retest measures of reliability are lower, especially for difference scores. These reliability measures must also take into account possible day effects that classical test theory assumes do not occur. Measures based on single mean RTs tend to have acceptable levels of reliability and convergent validity, but are "impure" measures of specific executive functions. The individual differences in RT model shows that the impurity problem is worse than typically assumed. However, the "purer" measures based on difference scores have low convergent validity that is partly caused by deficiencies in test-retest reliability. Copyright © 2016 Elsevier B.V. All rights reserved.
Evidence of Validity for the Japanese Version of the Foot and Ankle Ability Measure

PubMed Central

Uematsu, Daisuke; Suzuki, Hidetomo; Sasaki, Shogo; Nagano, Yasuharu; Shinozuka, Nobuyuki; Sunagawa, Norihiko; Fukubayashi, Toru

2015-01-01

Context: The Foot and Ankle Ability Measure (FAAM) is a valid, reliable, and self-reported outcome instrument for the foot and ankle region. Objective: To provide evidence for translation, cross-cultural adaptation, validity, and reliability of the Japanese version of the FAAM (FAAM-J). Design: Cross-sectional study. Setting: Collegiate athletic training/sports medicine clinical setting. Patients or Other Participants: Eighty-three collegiate athletes. Main Outcome Measure(s): All participants completed the Activities of Daily Living and Sports subscales of the FAAM-J and the Physical Functioning and Mental Health subscales of the Japanese version of the Short Form-36v2 (SF-36). Also, 19 participants (23%) whose conditions were expected to be stable completed another FAAM-J 2 to 6 days later for test-retest reliability. We analyzed the scores of those subscales for convergent and divergent validity, internal consistency, and test-retest reliability. Results: The Activities of Daily Living and Sports subscales of the FAAM-J had correlation coefficients of 0.86 and 0.75, respectively, with the Physical Functioning section of the SF-36 for convergent validity. For divergent validity, the correlation coefficients with Mental Health of the SF-36 were 0.29 and 0.27 for each subscale, respectively. Cronbach α for internal consistency was 0.99 for the Activities of Daily Living and 0.98 for the Sports subscale. A 95% confidence interval with a single measure was ±8.1 and ±14.0 points for each subscale. The test-retest reliability measures revealed intraclass correlation coefficient values of 0.87 for the Activities of Daily Living and 0.91 for the Sports subscales with minimal detectable changes of ±6.8 and ±13.7 for the respective subscales. Conclusions: The FAAM was successfully translated for a Japanese version, and the FAAM-J was adapted cross-culturally. Thus, the FAAM-J can be used as a self-reported outcome measure for Japanese-speaking individuals; however, the scores must be interpreted with caution, especially when applied to different populations and other types of injury than those included in this study. PMID:25310247
Spanish validation of Bad Sobernheim Stress Questionnaire (BSSQ (brace).es) for adolescents with braces

PubMed Central

2010-01-01

Background As a result of scientific and medical professionals gaining interest in Stress and Health Related Quality of Life (HRQL), the aim of our research is, thus, to validate into Spanish the German questionnaire Bad Sobernheim Stress Questionnaire (BSSQ) (mit Korsett), for adolescents wearing braces. Methods The methodology used adheres to literature on trans-cultural adaptation by doing a translation and a back translation; it involved 35 adolescents, ages ranging between 10 and 16, with Adolescent Idiopathic Scoliosis (AIS) and wearing the same kind of brace (Rigo System Chêneau Brace). The materials used were a socio-demographics data questionnaire, the SRS-22 and the Spanish version of BSSQ(brace).es. The statistical analysis calculated the reliability (test-retest reliability and internal consistency) and the validity (convergent and construct validity) of the BSSQ (brace).es. Results BSSQ(brace).es is reliable because of its satisfactory internal consistency (Cronbach's alpha coefficient was 0.809, p < 0.001) and temporal stability (test-retest method with a Pearson correlation coefficient of 0.902 (p < 0.01)). It demonstrated convergent validity with SRS-22 since the Pearson correlation coefficient was 0.656 (p < 0.01). By undertaking an Exploratory Principal Components Analysis, a latent structure was found based on two Components which explicate the variance at 60.8%. Conclusions BSSQ (brace).es is reliable and valid and can be used with Spanish adolescents to assess the stress level caused by the brace. PMID:20633253
Translation, cross-cultural adaptation and reliability of the German version of the migraine disability assessment (MIDAS) questionnaire.

PubMed

Benz, Thomas; Lehmann, Susanne; Gantenbein, Andreas R; Sandor, Peter S; Stewart, Walter F; Elfering, Achim; Aeschlimann, André G; Angst, Felix

2018-03-09

The Migraine Disability Assessment (MIDAS) is a brief questionnaire and measures headache-related disability. This study aimed to translate and cross-culturally adapt the original English version of the MIDAS to German and to test its reliability. The standardized translation process followed international guidelines. The pre-final version was tested for clarity and comprehensibility by 34 headache sufferers. Test-retest reliability of the final version was quantified by 36 headache patients completing the MIDAS twice with an interval of 48 h. Reliability was determined by intraclass correlation coefficients and internal consistency by Cronbach's α. All steps of the translation process were followed, documented and approved by the developer of the MIDAS. The expert committee discussed in detail the complex phrasing of the questions that refer to one to another, especially exclusion of headache-days from one item to the next. The German version contains more active verb sentences and prefers the perfect to the imperfect tense. The MIDAS scales intraclass correlation coefficients ranged from 0.884 to 0.994 and was 0.991 (95% CI: 0.982-0.995) for the MIDAS total score. Cronbach's α for the MIDAS as a whole was 0.69 at test and 0.67 at retest. The translation process was challenged by the comprehensibility of the questionnaire. The German version of the MIDAS is a highly reliable instrument for assessing headache related disability with moderate internal consistency. Provided validity testing of the German MIDAS is successful, it can be recommended for use in clinical practice as well as in research.
A Psychometric Study of the Bayley Scales of Infant and Toddler Development in Persian Language Children

PubMed Central

AZARI, Nadia; SOLEIMANI, Farin; VAMEGHI, Roshanak; SAJEDI, Firoozeh; SHAHSHAHANI, Soheila; KARIMI, Hossein; KRASKIAN, Adis; SHAHROKHI, Amin; TEYMOURI, Robab; GHARIB, Masoud

2017-01-01

Objective Bayley Scales of infant & toddler development is a well-known diagnostic developmental assessment tool for children aged 1–42 months. Our aim was investigating the validity & reliability of this scale in Persian speaking children. Materials & Methods The method was descriptive-analytic. Translation- back translation and cultural adaptation was done. Content & face validity of translated scale was determined by experts’ opinions. Overall, 403 children aged 1 to 42 months were recruited from health centers of Tehran, during years of 2013-2014 for developmental assessment in cognitive, communicative (receptive & expressive) and motor (fine & gross) domains. Reliability of scale was calculated through three methods; internal consistency using Cronbach’s alpha coefficient, test-retest and interrater methods. Construct validity was calculated using factor analysis and comparison of the mean scores methods. Results Cultural and linguistic changes were made in items of all domains especially on communication subscale. Content and face validity of the test were approved by experts’ opinions. Cronbach’s alpha coefficient was above 0.74 in all domains. Pearson correlation coefficient in various domains, were ≥ 0.982 in test retest method, and ≥0.993 in inter-rater method. Construct validity of the test was approved by factor analysis. Moreover, the mean scores for the different age groups were compared and statistically significant differences were observed between mean scores of different age groups, that confirms validity of the test. Conclusion The Bayley Scales of Infant and Toddler Development is a valid and reliable tool for child developmental assessment in Persian language children. PMID:28277556
Translation, reliability, and clinical utility of the Melbourne Assessment 2.

PubMed

Gerber, Corinna N; Plebani, Anael; Labruyère, Rob

2017-10-12

The aims were to (i) provide a German translation of the Melbourne Assessment 2 (MA2), a quantitative test to measure unilateral upper limb function in children with neurological disabilities and (ii) to evaluate its reliability and aspects of clinical utility. After its translation into German and approval of the back translation by the original authors, the MA2 was performed and videotaped twice with 30 children with neuromotor disorders. For each participant, two raters scored the video of the first test for inter-rater reliability. To determine test-retest reliability, one rater additionally scored the video of the second test while the other rater repeated the scoring of the first video to evaluate intra-rater reliability. Time needed for rater training, test administration, and scoring was recorded. The four subscale scores showed excellent intra-, inter-rater, and test-retest reliability with intraclass correlation coefficients of 0.90-1.00 (95%-confidence intervals 0.78-1.00). Score items revealed substantial to almost perfect intra-rater reliability (weighted kappa k w = 0.66-1.00) for the more affected side. Score item inter-rater and test-retest reliability of the same extremity were, with one exception, moderate to almost perfect (k w = 0.42-0.97; k w = 0.40-0.89). Furthermore, the MA2 was feasible and acceptable for patients and clinicians. The MA2 showed excellent subscale and moderate to almost perfect score item reliability. Implications for Rehabilitation There is a lack of high-quality studies about psychometric properties of upper limb measurement tools in the neuropediatric population. The Melbourne Assessment 2 is a promising tool for reliable measurement of unilateral upper limb movement quality in the neuropediatric population. The Melbourne Assessment 2 is acceptable and practicable to therapists and patients for routine use in clinical care.
Threat distractor and perceptual load modulate test-retest reliability of anterior cingulate cortex response.

PubMed

Bunford, Nora; Kinney, Kerry L; Michael, Jamie; Klumpp, Heide

2017-07-03

Accumulating data from fMRI studies implicate the rostral anterior cingulate cortex (rACC) in inhibition of attention to threat distractors that compete with task-relevant goals for processing resources. However, little data is available on the reliability of rACC activation. Our aim in the current study was to examine test-retest reliability of rACC activation over a 12-week period, in the context of a validated emotional interference paradigm that varied in perceptual load. During functional MRI, 23 healthy volunteers completed a task involving a target letter in a string of identical letters (low load) or in a string of mixed letters (high load) superimposed on angry, fearful, and neutral face distractors. Intraclass correlation coefficients (ICCs) indicated that under low, but not high perceptual load, rACC activation to fearful vs. neutral distractors was moderately reliable. Conversely, regardless of perceptual load, rACC activation to angry vs. neutral distractors was not reliable. Regarding behavioral performance, ICCs indicated that accuracy was not reliable regardless of distractor type or perceptual load. Although reaction time (RT) was similarly not reliable regardless of distractor type under low perceptual load, RT to angry vs. neutral distractors and to fearful vs. neutral distractors was reliable under high perceptual load. Together, results indicate the test-retest reliability of rACC activation and corresponding behavioral performance are context dependent; reliability of the former varies as a function of distractor type and level of cognitive demand, whereas reliability of the latter depends on behavioral index (accuracy vs. RT) and level of cognitive demand but not distractor type. Copyright © 2017 Elsevier Inc. All rights reserved.
Adaptation, reliability and validity testing of a Persian version of the Health Assessment Questionnaire-Disability Index in Iranian patients with rheumatoid arthritis.

PubMed

Nazary-Moghadam, Salman; Zeinalzadeh, Afsaneh; Salavati, Mahyar; Almasi, Simin; Negahban, Hossein

2017-01-01

The aim of the present study was to culturally adapt and evaluate reliability and validity of Health Assessment Questionnaire-Disability Index (HAQ-DI) in Iranian patients with rheumatoid arthritis (RA). 234 patients with RA for validation study, Eighty-six participants for reliability study. Test-retest relative reliability and internal consistency of Persian version of HAQ-DI were examined by intraclass correlation coefficient (ICC) and Cronbach's alpha, respectively. Additionally, HAQ-DI construct validity (Spearman's correlation) was examined using Persian version of Short-Form 36 Health survey (SF-36), activity and severity parameters. Persian version of HAQ-DI total score showed excellent test-retest reliability (ICC = 0.98) and internal consistency (Cronbach's alpha = 0.95). Spearman's correlations between the total PHAQ-DI score and activity and severity parameters were above 0.55. Correlation between PHAQ-DI and SF-36 Physical Health were higher as compared with SF-36 Mental Health. Persian version of HAQ-DI is a reliable and valid culturally-adapted instrument in order to measure functional limitations in Iranian people with RA. Copyright © 2016 Elsevier Ltd. All rights reserved.
Validation of EncephalApp, Smartphone-Based Stroop Test, for the Diagnosis of Covert Hepatic Encephalopathy.

PubMed

Bajaj, Jasmohan S; Heuman, Douglas M; Sterling, Richard K; Sanyal, Arun J; Siddiqui, Muhammad; Matherly, Scott; Luketic, Velimir; Stravitz, R Todd; Fuchs, Michael; Thacker, Leroy R; Gilles, HoChong; White, Melanie B; Unser, Ariel; Hovermale, James; Gavis, Edith; Noble, Nicole A; Wade, James B

2015-10-01

Detection of covert hepatic encephalopathy (CHE) is difficult, but point-of-care testing could increase rates of diagnosis. We aimed to validate the ability of the smartphone app EncephalApp, a streamlined version of Stroop App, to detect CHE. We evaluated face validity, test-retest reliability, and external validity. Patients with cirrhosis (n = 167; 38% with overt HE [OHE]; mean age, 55 years; mean Model for End-Stage Liver Disease score, 12) and controls (n = 114) were each given a paper and pencil cognitive battery (standard) along with EncephalApp. EncephalApp has Off and On states; results measured were OffTime, OnTime, OffTime+OnTime, and number of runs required to complete 5 off and on runs. Thirty-six patients with cirrhosis underwent driving simulation tests, and EncephalApp results were correlated with results. Test-retest reliability was analyzed in a subgroup of patients. The test was performed before and after transjugular intrahepatic portosystemic shunt placement, and before and after correction for hyponatremia, to determine external validity. All patients with cirrhosis performed worse on paper and pencil and EncephalApp tests than controls. Patients with cirrhosis and OHE performed worse than those without OHE. Age-dependent EncephalApp cutoffs (younger or older than 45 years) were set. An OffTime+OnTime value of >190 seconds identified all patients with CHE with an area under the receiver operator characteristic value of 0.91; the area under the receiver operator characteristic value was 0.88 for diagnosis of CHE in those without OHE. EncephalApp times correlated with crashes and illegal turns in driving simulation tests. Test-retest reliability was high (intraclass coefficient, 0.83) among 30 patients retested 1-3 months apart. OffTime+OnTime increased significantly (206 vs 255 seconds, P = .007) among 10 patients retested 33 ± 7 days after transjugular intrahepatic portosystemic shunt placement. OffTime+OnTime decreased significantly (242 vs 225 seconds, P = .03) in 7 patients tested before and after correction for hyponatremia (126 ± 3 to 132 ± 4 meq/L, P = .01) 10 ± 5 days apart. A smartphone app called EncephalApp has good face validity, test-retest reliability, and external validity for the diagnosis of CHE. Copyright © 2015 AGA Institute. Published by Elsevier Inc. All rights reserved.
Validity and reliability of a new tool to evaluate handwriting difficulties in Parkinson’s disease

PubMed Central

Nackaerts, Evelien; Heremans, Elke; Smits-Engelsman, Bouwien C. M.; Broeder, Sanne; Vandenberghe, Wim; Bergmans, Bruno; Nieuwboer, Alice

2017-01-01

Background Handwriting in Parkinson’s disease (PD) features specific abnormalities which are difficult to assess in clinical practice since no specific tool for evaluation of spontaneous movement is currently available. Objective This study aims to validate the ‘Systematic Screening of Handwriting Difficulties’ (SOS-test) in patients with PD. Methods Handwriting performance of 87 patients and 26 healthy age-matched controls was examined using the SOS-test. Sixty-seven patients were tested a second time within a period of one month. Participants were asked to copy as much as possible of a text within 5 minutes with the instruction to write as neatly and quickly as in daily life. Writing speed (letters in 5 minutes), size (mm) and quality of handwriting were compared. Correlation analysis was performed between SOS outcomes and other fine motor skill measurements and disease characteristics. Intrarater, interrater and test-retest reliability were assessed using the intraclass correlation coefficient (ICC) and Spearman correlation coefficient. Results Patients with PD had a smaller (p = 0.043) and slower (p<0.001) handwriting and showed worse writing quality (p = 0.031) compared to controls. The outcomes of the SOS-test significantly correlated with fine motor skill performance and disease duration and severity. Furthermore, the test showed excellent intrarater, interrater and test-retest reliability (ICC > 0.769 for both groups). Conclusion The SOS-test is a short and effective tool to detect handwriting problems in PD with excellent reliability. It can therefore be recommended as a clinical instrument for standardized screening of handwriting deficits in PD. PMID:28253374
Cross-Cultural adaption, validity and reliability of a Hindi version of the Corah’s Dental Anxiety Scale

PubMed Central

Jain, Meena; Tandon, Shourya; Sharma, Ankur; Jain, Vishal; Rani Yadav, Nisha

2018-01-01

Background: An appropriate scale to assess the dental anxiety of Hindi speaking population is lacking. This study, therefore, aims to evaluate the psychometric properties of Hindi version of one of the oldest dental anxiety scale, Corah’s Dental Anxiety Scale (CDAS) in Hindi speaking Indian adults. Methods: A total of 348 subjects from the outpatient department of a dental hospital in India participated in this cross-sectional study. The scale was cross-culturally adapted by forward and backward translation, committee review and pretesting method. The construct validity of the translated scale was explored with exploratory factor analysis. The correlation of the Hindi version of CDAS with visual analogue scale (VAS) was used to measure the convergent validity. Reliability was assessed through calculations of Cronbach’s alpha and intra class correlation 48 forms were completed for test-retest. Results: Prevalence of dental anxiety in the sample within the age range of 18-80 years was 85.63% [95% CI: 0.815-0.891]. The response rate was 100 %. Kaiser-Meyer-Olkin (KMO) test value was 0.776. After factor analysis, a single factor (dental anxiety) was obtained with 4 items.The single factor model explained 61% variance. Pearson correlation coefficient between CDASand VAS was 0.494. Test-retest showed the Cronbach’s alpha value of 0.814. The test-retest intraclass correlation coefficient of the total CDAS score was 0.881 [95% CI: 0.318-0.554]. Conclusion: Hindi version of CDAS is a valid and reliable scale to assess dental anxiety in Hindi speaking population. Convergent validity is well recognized but discriminant validity is limited and requires further study. PMID:29744307
Cross-Cultural adaption, validity and reliability of a Hindi version of the Corah's Dental Anxiety Scale.

PubMed

Jain, Meena; Tandon, Shourya; Sharma, Ankur; Jain, Vishal; Rani Yadav, Nisha

2018-01-01

Background: An appropriate scale to assess the dental anxiety of Hindi speaking population is lacking. This study, therefore, aims to evaluate the psychometric properties of Hindi version of one of the oldest dental anxiety scale, Corah's Dental Anxiety Scale (CDAS) in Hindi speaking Indian adults. Methods: A total of 348 subjects from the outpatient department of a dental hospital in India participated in this cross-sectional study. The scale was cross-culturally adapted by forward and backward translation, committee review and pretesting method. The construct validity of the translated scale was explored with exploratory factor analysis. The correlation of the Hindi version of CDAS with visual analogue scale (VAS) was used to measure the convergent validity. Reliability was assessed through calculations of Cronbach's alpha and intra class correlation 48 forms were completed for test-retest. Results: Prevalence of dental anxiety in the sample within the age range of 18-80 years was 85.63% [95% CI: 0.815-0.891]. The response rate was 100 %. Kaiser-Meyer-Olkin (KMO) test value was 0.776. After factor analysis, a single factor (dental anxiety) was obtained with 4 items.The single factor model explained 61% variance. Pearson correlation coefficient between CDASand VAS was 0.494. Test-retest showed the Cronbach's alpha value of 0.814. The test-retest intraclass correlation coefficient of the total CDAS score was 0.881 [95% CI: 0.318-0.554]. Conclusion: Hindi version of CDAS is a valid and reliable scale to assess dental anxiety in Hindi speaking population. Convergent validity is well recognized but discriminant validity is limited and requires further study.
Cultural adaptation and validation of the Filipino version of Kidney Disease Quality of Life--Short Form (KDQOL-SF version 1.3).

PubMed

Bataclan, Rommel P; Dial, Ma Antonietta D

2009-10-01

Chronic kidney disease is the 10th leading cause of death among Filipinos. Those with chronic kidney disease are exposed to stressors which effect their daily lives. Therefore, assessment of health-related quality of life is important in these patients. The objective of the present study was to translate the Kidney Disease Quality of Life--Short Form version 1.3 (KDQOL-SF ver. 1.3) into Filipino and measure its validity and reliability. Translation and cultural adaptation began with two translations into Filipino, with reconciliation of the forward translators. Pretesting with 10 renal patients, review by experts (nephrologist, translator and dialysis nurse) and back-translation was also done. The final questionnaire was administered to 80 patients with chronic renal disease undergoing haemodialysis for at least 3 months, who could understand Filipino, and were without life-threatening or terminal conditions at the time of the test. A convenience sample of 30 patients from the group had a repeat test 10-14 days after to determine test-retest reliability. Test-retest reliability was assessed by intraclass correlation coefficient and internal consistency reliability was measured by determining the Cronbach's alpha value. Validity was measured using Pearson's correlation between the overall health rating scale and the items from the questionnaire. All of the items showed good test-retest reliability (intraclass correlation coefficient >0.40), ranging from 0.58 (social interaction) to 0.98 (role--emotional). Internal consistency reliability values were acceptable, with Cronbach's alpha ranging from 0.60 (cognitive function) to 0.80 (physical functioning and role--physical). Regarding construct validity, overall health rating in kidney disease-targeted scales was significantly correlated with symptoms/problems, effects of kidney disease and burden of kidney disease. All items in the SF 36 scales had significant correlation with overall health rating (P < 0.05) except for role--emotional. The Filipino version of the Kidney Disease Quality of Life--Short Form can be used to evaluate the health-related quality of life of Filipinos with chronic renal disease on haemodialysis.
Test-Retest Reliability of the Short-Form Survivor Unmet Needs Survey.

PubMed

Taylor, Karen; Bulsara, Max; Monterosso, Leanne

2018-01-01

Reliable and valid needs assessment measures are important assessment tools in cancer survivorship care. A new 30-item short-form version of the Survivor Unmet Needs Survey (SF-SUNS) was developed and validated with cancer survivors, including hematology cancer survivors; however, test-retest reliability has not been established. The objective of this study was to assess the test-retest reliability of the SF-SUNS with a cohort of lymphoma survivors ( n = 40). Test-retest reliability of the SF-SUNS was conducted at two time points: baseline (time 1) and 5 days later (time 2). Test-retest data were collected from lymphoma cancer survivors ( n = 40) in a large tertiary cancer center in Western Australia. Intraclass correlation analyses compared data at time 1 (baseline) and time 2 (5 days later). Cronbach's alpha analyses were performed to assess the internal consistency at both time points. The majority (23/30, 77%) of items achieved test-retest reliability scores 0.45-0.74 (fair to good). A high degree of overall internal consistency was demonstrated (time 1 = 0.92, time 2 = 0.95), with scores 0.65-0.94 across subscales for both time points. Mixed test-retest reliability of the SF-SUNS was established. Our results indicate the SF-SUNS is responsive to the changing needs of lymphoma cancer survivors. Routine use of cancer survivorship specific needs-based assessments is required in oncology care today. Nurses are well placed to administer these assessments and provide tailored information and resources. Further assessment of test-retest reliability in hematology and other cancer cohorts is warranted.
Test-retest reliability of the eating disorder examination-questionnaire (EDE-Q) in a college sample

PubMed Central

2013-01-01

Background The Eating Disorder Examination-Questionnaire (EDE-Q), a widely used self-report instrument, is often used for measuring change in eating disorder symptoms over the course of treatment. However, limited data exist about test-retest reliability, particularly for men. The current study evaluated EDE-Q 7-day test-retest reliability in male (n = 47) and female (n = 44) undergraduate students together and separately by gender. Results Internal consistency was consistently higher for women and at Time 2, but remained acceptable for both men and women at both time points. Cronbach’s α ranged from .75 (Restraint at Time 1) to .93 (Shape Concern at Time 2) for women and from .73 (Eating Concern at Time 2) to .89 (Shape Concern at Time 2) for men. With the exception of some of the eating disorder behaviors, test re-test reliability was fairly strong for both men and women. Shape Concern and the global EDE-Q score were highest for both men and women (Spearman’s rho > 0.89 with the exception of Shape Concern for women for which Spearman’s rho = .86). Test re-test reliability was lower for the eating disorder behavior measures, particularly for men, for whom Kendall’s tau-b for frequency and phi for occurrence was less than 0.70 for all but objective bulimic episodes. Conclusions Results were consistent with past research for women, indicating strong test re-test reliability in attitudinal features of eating disorders, but lower test re-test reliability in behavioral features. Internal consistency and test re-test reliability was good for the attitudinal features of eating disorder in men, but tended to be lower for men compared to women. The EDE-Q appears to be a reliable instrument for assessing eating disorder attitudes in both male and female undergraduate students, but is less reliable for assessing ED behaviors, particularly in men. PMID:24999420

High test-retest-reliability of pain-related evoked potentials (PREP) in healthy subjects.

PubMed

Özgül, Özüm Simal; Maier, Christoph; Enax-Krumova, Elena K; Vollert, Jan; Fischer, Marc; Tegenthoff, Martin; Höffken, Oliver

2017-04-24

Pain-related evoked potentials (PREP) is an established electrophysiological method to evaluate the signal transmission of electrically stimulated A-delta fibres. Although prerequisite for its clinical use, test-retest-reliability and side-to-side differences of bilateral stimulation in healthy subjects have not been examined yet. We performed PREP twice within 3-14days in 33 healthy subjects bilaterally by stimulating the dorsal hand. Detection (DT) and pain thresholds (PT) after electrical stimulation, the corresponding pain ratings, latencies of P0, N1, P1 and N2 components and the corresponding amplitudes were assessed. Impact of electrically induced pain intensity, age, sex, and arm length on PREP was analysed. MANOVA, t-Test, interclass correlation coefficient (ICC), standard error of measurement (SEM), smallest real difference (SRD), Bland-Altmann-Analysis as well as ANCOVA were used for statistical analysis. Measurement from both sides on both days resulted in mean N1-latencies from 142.39±18.12ms to 144.03±16.62ms and in mean N1P1-amplitudes from 39.04±12.26μV to 40.53±12.9μV. Analysis of a side-to-side effect showed for the N1-latency a F-value of 0.038 and for the N1P1-amplitude of 0.004 (p>0.8). We found intraclass correlation coefficients (ICC) from 0.88 to 0.93 and a standard error of measurement (SEM)<10% of mean values for all measurements concerning the N1-Latency and N1P1-amplitude. Intraclass correlation coefficients, standard error of measurement and Bland-Altman-Analyses revealed excellent test-retest-reliability for N1-latency and N1P1-amplitude without systematic error and there was no side-to-side effect on PREP. N1-latency (r=0.35, p<0.05) and N1P1-amplitude (r=-0.45, p<0.05) correlated with age and additionally N1-latency correlated with arm length (r=0.45, p<0.001). In contrast, pain intensity during the stimulation had no effect on both N1-latency and N1P1-amplitude. In summary, PREP showed high test-retest-reliability and negligible side-to-side differences concerning the commonly used parameters N1-latency and N1P1-amplitude. Copyright © 2017 Elsevier B.V. All rights reserved.
Validation of an instrument to measure quality of life in British children with inflammatory bowel disease.

PubMed

Ogden, C A; Akobeng, A K; Abbott, J; Aggett, P; Sood, M R; Thomas, A G

2011-09-01

To validate IMPACT-III (UK), a health-related quality of life (HRQoL) instrument, in British children with inflammatory bowel disease (IBD). One hundred six children and parents were invited to participate. IMPACT-III (UK) was validated by inspection by health professionals and children to assess face and content validity, factor analysis to determine optimum domain structure, use of Cronbach alpha coefficients to test internal reliability, ANOVA to assess discriminant validity, correlation with the Child Health Questionnaire to assess concurrent validity, and use of intraclass correlation coefficients to assess test-retest reliability. The independent samples t test was used to measure differences between sexes and age groups, and between paper and computerised versions of IMPACT-III (UK). IMPACT-III (UK) had good face and content validity. The most robust factor solution was a 5-domain structure: body image, embarrassment, energy, IBD symptoms, and worries/concerns about IBD, all of which demonstrated good internal reliability (α = 0.74-0.88). Discriminant validity was demonstrated by significant (P < 0.05, P < 0.01) differences in HRQoL scores between the severe, moderate, and inactive/mild symptom severity groups for the embarrassment scale (63.7 vs 81.0 vs 81.2), IBD symptom scale (45.0 vs 64.2 vs 80.6), and the energy scale (46.4 vs 62.1 vs 77.7). Concurrent validity of IMPACT-III (UK) with comparable domains of the Child Health Questionnaire was confirmed. Test-retest reliability was confirmed with good intraclass correlation coefficients of 0.66 to 0.84. Paper and computer versions of IMPACT-III (UK) collected comparable scores, and there were no differences between the sexes and age groups. IMPACT-III (UK) appears to be a useful tool to measure HRQoL in British children with IBD.
Reliability and feasibility of physical fitness tests in female fibromyalgia patients.

PubMed

Carbonell-Baeza, A; Álvarez-Gallardo, I C; Segura-Jiménez, V; Castro-Piñero, J; Ruiz, J R; Delgado-Fernández, M; Aparicio, V A

2015-02-01

The aim of the present study was to determine the reliability and feasibility of physical fitness tests in female fibromyalgia patients. 100 female fibromyalgia patients (aged 50.6±8.6 years) performed the following tests twice (7 days interval test-retest): chair sit and reach, back scratch, handgrip strength, arm curl, chair stand, 8 feet up and go, and 6-min walk. Significant differences between test and retest were found in the arm curl (mean difference: 1.25±2.16 repetitions, Cohen d=0.251), chair stand (0.99±1.7 repetitions, Cohen d=0.254) and 8 feet up and go (-0.38±1.09 s, Cohen d=0.111) tests. Intraclass correlation coefficients (ICC) range from 0.92 in the arm curl test to 0.96 in the back scratch test. The feasibility of the tests (patients able to complete the test) ranged from 89% in the arm curl test to 100% in the handgrip strength test. Therefore, the reliability and feasibility of the physical fitness tests examined is acceptable for female fibromyalgia patients. © Georg Thieme Verlag KG Stuttgart · New York.
Test-Retest Reliability of a Survey to Measure Transport-Related Physical Activity in Adults

ERIC Educational Resources Information Center

Badland, Hannah; Schofield, Grant

2006-01-01

The present research details test-retest reliability of a newly developed, telephone-administered TPA survey for adults. This instrument examines barriers, perceptions, and current travel behaviors to place of work/study and local convenience shops. Demonstrated test-retest reliability of the Active Friendly Environments-Transport-Related Physical…
Evaluating the validity and reliability of the V-scale instrument (Turkish version) used to determine nurses' attitudes towards vital sign monitoring.

PubMed

Ertuğ, Nurcan

2018-06-01

The aim of this study was to determine the validity and reliability of the Turkish version of the V-scale, which measures nurses' attitudes towards vital signs monitoring in the detection of clinical deterioration. This validity and reliability study was conducted at a tertiary hospital in Ankara, Turkey, in 2016. A total of 169 ward nurses participated in the study. Exploratory factor analysis, Cronbach's alpha coefficient, and the intraclass correlation coefficient were used to determine the validity and reliability of the scale. A 5-factor, 16-item scale explained 60.823% of the total variance according to the validity analysis. Our version matched the original scale in terms of the number of items and factor structure. Cronbach's alpha coefficient of the Turkish version of the V-scale was 0.764. The test-retest reliability results were 0.855 for the overall intraclass correlation coefficient, and the t-test result was P > 0.05. The V-scale is a reliable and valid instrument to measure Turkish nurses' attitudes towards vital signs monitoring in the detection of clinical deterioration. © 2018 John Wiley & Sons Australia, Ltd.
Reliability of tensiomyography and myotonometry in detecting mechanical and contractile characteristics of the lumbar erector spinae in healthy volunteers.

PubMed

Lohr, Christine; Braumann, Klaus-Michael; Reer, Ruediger; Schroeder, Jan; Schmidt, Tobias

2018-04-20

Tensiomyography™ (TMG) and MyotonPRO ® (MMT) are two non-invasive devices for monitoring muscle contractile and mechanical characteristics. This study aimed to evaluate the test-retest reliability of TMG and MMT parameters for measuring (TMG:) muscle displacement (D m ), contraction time (T c ), and velocity (V c ) and (MMT:) frequency (F), stiffness (S), and decrement (D) of the erector spinae muscles (ES) in healthy adults. A particular focus was set on the establishment of reliability measures for the previously barely evaluated secondary TMG parameter V c . Twenty-four subjects (13 female and 11 male, mean ± SD, 38.0 ± 12.0 years) were measured using TMG and MMT over 2 consecutive days. Absolute and relative reliability was calculated by standard error of measurement (SEM, SEM%), Minimum detectable change (MDC, MDC%), coefficient of variation (CV%) and intraclass correlation coefficient (ICC, 3.1) with a 95% confidence interval (CI). The ICCs for all variables and test-retest intervals ranged from 0.75 to 0.99 indicating a good to excellent relative reliability for both TMG and MMT, demonstrating the lowest values for TMG T c and between-day MMT D (ICC < 0.90). Absolute reliability was suitable for all parameters (CV 2-8%) except for D m (10-12%). V c demonstrated to be the most reliable and repeatable TMG parameter (ICC > 0.95, CV < 8%). The reliability for TMG V c could be established successfully. Its further applicability needs to be confirmed in future studies. MMT was found to be more reliable on repeated testing than the two other TMG parameters D m and T c .
Validity and Reliability of a Portable Balance Tracking System, BTrackS, in Older Adults.

PubMed

Levy, Susan S; Thralls, Katie J; Kviatkovsky, Shiloah A

Falls are the leading cause of disability, injury, hospital admission, and injury-related death among older adults. Balance limitations have consistently been identified as predictors of falls and increased fall risk. Field measures of balance are limited by issues of subjectivity, ceiling effects, and low sensitivity to change. The gold standard for measuring balance is the force plate; however, its field use is untenable due to high cost and lack of portability. Thus, a critical need is observed for valid objective field measures of balance to accurately assess balance and identify limitations over time. The purpose of this study was to examine the concurrent validity and 3-day test-retest reliability of Balance Tracking System (BTrackS) in community-dwelling older adults. Minimal detectable change values were also calculated to reflect changes in balance beyond measurement error. Postural sway data were collected from community-dwelling older adults (N = 49, mean [SD] age = 71.3 [7.3] years) with a force plate and BTrackS in multitrial eyes open (EO) and eyes closed (EC) static balance conditions. Force sensors transmitted BTrackS data via a USB to a computer running custom software. Three approaches to concurrent validity were taken including calculation of Pearson product moment correlation coefficients, repeated-measures ANOVAs, and Bland-Altman plots. Three-day test-retest reliability of BTrackS was examined in a second sample of 47 community-dwelling older adults (mean [SD] age = 75.8 [7.7] years) using intraclass correlation coefficients and MDC values at 95% CI (MDC95) were calculated. BTrackS demonstrated good validity using Pearson product moment correlations (r > 0.90). Repeated-measures ANOVA and Bland-Altman plots indicated some BTrackS bias with center of pressure (COP) values higher than FP COP values in the EO (mean [SD] bias = 4.0 [6.8]) and EC (mean [SD] bias = 9.6 [12.3]) conditions. Test-retest reliability using intraclass correlation coefficients (ICC2.1 was excellent (0.83) and calculated MDC95 for EO (9.6 cm) and EC (19.4 cm) and suggested that postural sway changes of these amounts are meaningful. BTrackS showed some bias with values exceeding force plate values in both EO and EC conditions. Excellent test-retest reliability and resulting MDC95 values indicated that BTrackS has the potential to identify meaningful changes in balance that may warrant intervention. BTrackS is an objective measure of balance that can be used to monitor balance in community-dwelling older adults over time. It can reliably identify changes that may require further attention (eg, fall-prevention strategies, declines in physical function) and shows promise for assessing intervention efficacy in this growing segment of the population.
The reliability of WorkWell Systems Functional Capacity Evaluation: a systematic review

PubMed Central

2014-01-01

Background Functional capacity evaluation (FCE) determines a person’s ability to perform work-related tasks and is a major component of the rehabilitation process. The WorkWell Systems (WWS) FCE (formerly known as Isernhagen Work Systems FCE) is currently the most commonly used FCE tool in German rehabilitation centres. Our systematic review investigated the inter-rater, intra-rater and test-retest reliability of the WWS FCE. Methods We performed a systematic literature search of studies on the reliability of the WWS FCE and extracted item-specific measures of inter-rater, intra-rater and test-retest reliability from the identified studies. Intraclass correlation coefficients ≥ 0.75, percentages of agreement ≥ 80%, and kappa coefficients ≥ 0.60 were categorised as acceptable, otherwise they were considered non-acceptable. The extracted values were summarised for the five performance categories of the WWS FCE, and the results were classified as either consistent or inconsistent. Results From 11 identified studies, 150 item-specific reliability measures were extracted. 89% of the extracted inter-rater reliability measures, all of the intra-rater reliability measures and 96% of the test-retest reliability measures of the weight handling and strength tests had an acceptable level of reliability, compared to only 67% of the test-retest reliability measures of the posture/mobility tests and 56% of the test-retest reliability measures of the locomotion tests. Both of the extracted test-retest reliability measures of the balance test were acceptable. Conclusions Weight handling and strength tests were found to have consistently acceptable reliability. Further research is needed to explore the reliability of the other tests as inconsistent findings or a lack of data prevented definitive conclusions. PMID:24674029
Test-retest reliability of knee extensor rate of velocity and power development in older adults using the isotonic mode on a Biodex System 3 dynamometer.

PubMed

Van Driessche, Stijn; Van Roie, Evelien; Vanwanseele, Benedicte; Delecluse, Christophe

2018-01-01

Isotonic testing and measures of rapid power production are emerging as functionally relevant test methods for detection of muscle aging. Our objective was to assess reliability of rapid velocity and power measures in older adults using the isotonic mode of an isokinetic dynamometer. Sixty-three participants (aged 65 to 82 years) underwent a test-retest protocol with one week time interval. Isotonic knee extension tests were performed at four different loads: 0%, 25%, 50% and 75% of maximal isometric strength. Peak velocity (pV) and power (pP) were determined as the highest values of the velocity and power curve. Rate of velocity (RVD) and power development (RPD) were calculated as the linear slopes of the velocity- and power-time curve. Relative and absolute measures of test-retest reliability were analyzed using intraclass correlation coefficients (ICC), standard error of measurement (SEM) and Bland-Altman analyses. Overall, reliability was high for pV, pP, RVD and RPD at 0%, 25% and 50% load (ICC: .85 - .98, SEM: 3% - 10%). A trend for increased reliability at lower loads seemed apparent. The tests at 75% load led to range of motion failure and should be avoided. In addition, results demonstrated that caution is advised when interpreting early phase results (first 50ms). To conclude, our results support the use of the isotonic mode of an isokinetic dynamometer for testing rapid power and velocity characteristics in older adults, which is of high clinical relevance given that these muscle characteristics are emerging as the primary outcomes for preventive and rehabilitative interventions in aging research.
Defining physicians' readiness to screen and manage intimate partner violence in Greek primary care settings.

PubMed

Papadakaki, Maria; Prokopiadou, Dimitra; Petridou, Eleni; Kogevinas, Manolis; Lionis, Christos

2012-06-01

The current article aims to translate the PREMIS (Physician Readiness to Manage Intimate Partner Violence) survey into the Greek language and test its validity and reliability in a sample of primary care physicians. The validation study was conducted in 2010 and involved all the general practitioners serving two adjacent prefectures of Greece (n = 80). Maximum-likelihood factor analysis (MLF) was used to extract key survey factors. The instrument was further assessed for the following psychometric properties: (a) scale reliability, (b) item-specific reliability, (c) test-retest reliability, (d) scale construct validity, and (e) internal predictive validity. The MLF analysis of 23 opinion items revealed a seven-factor solution (preparation, constraint, workplace issues, screening, self-efficacy, alcohol/drugs, victim understanding), which was statistically sound (p = .293). Most of the newly derived scales displayed satisfactory internal consistency (α ≥ .60), high item-specific reliability, strong construct, and internal predictive validity (F = 2.82; p = .004), and high repeatability when retested with 20 individuals (intraclass correlation coefficient [ICC] > .70). The tool was found appropriate to facilitate the identification of competence deficits and the evaluation of training initiatives.
Measuring the Characteristic Topography of Brain Stiffness with Magnetic Resonance Elastography

PubMed Central

Murphy, Matthew C.; Huston, John; Jack, Clifford R.; Glaser, Kevin J.; Senjem, Matthew L.; Chen, Jun; Manduca, Armando; Felmlee, Joel P.; Ehman, Richard L.

2013-01-01

Purpose To develop a reliable magnetic resonance elastography (MRE)-based method for measuring regional brain stiffness. Methods First, simulation studies were used to demonstrate how stiffness measurements can be biased by changes in brain morphometry, such as those due to atrophy. Adaptive postprocessing methods were created that significantly reduce the spatial extent of edge artifacts and eliminate atrophy-related bias. Second, a pipeline for regional brain stiffness measurement was developed and evaluated for test-retest reliability in 10 healthy control subjects. Results This technique indicates high test-retest repeatability with a typical coefficient of variation of less than 1% for global brain stiffness and less than 2% for the lobes of the brain and the cerebellum. Furthermore, this study reveals that the brain possesses a characteristic topography of mechanical properties, and also that lobar stiffness measurements tend to correlate with one another within an individual. Conclusion The methods presented in this work are resistant to noise- and edge-related biases that are common in the field of brain MRE, demonstrate high test-retest reliability, and provide independent regional stiffness measurements. This pipeline will allow future investigations to measure changes to the brain’s mechanical properties and how they relate to the characteristic topographies that are typical of many neurologic diseases. PMID:24312570
Reliability and validity of a talent identification test battery for seated and standing Paralympic throws.

PubMed

Spathis, Jemima Grace; Connick, Mark James; Beckman, Emma Maree; Newcombe, Peter Anthony; Tweedy, Sean Michael

2015-01-01

Paralympic throwing events for athletes with physical impairments comprise seated and standing javelin, shot put, discus and seated club throwing. Identification of talented throwers would enable prediction of future success and promote participation; however, a valid and reliable talent identification battery for Paralympic throwing has not been reported. This study evaluates the reliability and validity of a talent identification battery for Paralympic throws. Participants were non-disabled so that impairment would not confound analyses, and results would provide an indication of normative performance. Twenty-eight non-disabled participants (13 M; 15 F) aged 23.6 years (±5.44) performed five kinematically distinct criterion throws (three seated, two standing) and nine talent identification tests (three anthropometric, six motor); 23 were tested a second time to evaluate test-retest reliability. Talent identification test-retest reliability was evaluated using Intra-class Correlation Coefficient (ICC) and Bland-Altman plots (Limits of Agreement). Spearman's correlation assessed strength of association between criterion throws and talent identification tests. Reliability was generally acceptable (mean ICC = 0.89), but two seated talent identification tests require more extensive familiarisation. Correlation strength (mean rs = 0.76) indicated that the talent identification tests can be used to validly identify individuals with competitively advantageous attributes for each of the five kinematically distinct throwing activities. Results facilitate further research in this understudied area.
The Malay Version of the Perceived Stress Scale (PSS)-10 is a Reliable and Valid Measure for Stress among Nurses in Malaysia.

PubMed

Sandhu, Sukhvinder Singh; Ismail, Noor Hassim; Rampal, Krishna Gopal

2015-11-01

The Perceived Stress Scale-10 (PSS-10) is widely used to assess stress perception. The aim of this study was to translate the original PSS-10 into Malay and assess the reliability and validity of the Malay version among nurses. The Malay version of the PSS-10 was distributed among 229 nurses from four government hospitals in Selangor State. Test-retest reliability and concurrent validity was conducted with 25 nurses with the Malay version of the Depression Anxiety Stress Scales (DASS) 21. Cronbach's alpha, confirmatory factor analysis (CFA), intraclass correlation coefficient and Pearson's r correlation coefficient were used to determine the psychometric properties of the Malay PSS-10. Two factor components were yielded through exploratory factor analysis with eigenvalues of 3.37 and 2.10, respectively. Both of the factors accounted for 54.6% of the variance. CFA yielded a two-factor structure with satisfactory goodness-of-fit indices [x 2 /df = 2.43; comparative fit index (CFI) = 0.92, goodness-of-fit Index (GFI) = 0.94; standardised root mean square residual (SRMR) = 0.07 and root mean square error of approximation (RMSEA) = 0.08 (90% CI = 0.07-0.09)]. The Cronbach's alpha coefficient for the total items was 0.63 (0.82 for factor 1 and 0.72 for factor 2). The intraclass correlation coefficient (ICC) was 0.81 (95% CI: 0.62-0.91) for test-retest reliability testing after seven days. The total score and the negative component of the PSS-10 correlated significantly with the stress component of the DASS-21: (r = 0.61, P < 0.001) and (r = 0.56, P < 0.004), respectively. The Malay version of the PSS-10 demonstrated a satisfactory level of validity and reliability to assess stress perception. Therefore, this questionnaire is valid in assessing stress perception among nurses in Malaysia.
Test-retest reliability and minimal detectable change of two simplified 3-point balance measures in patients with stroke.

PubMed

Chen, Yi-Miau; Huang, Yi-Jing; Huang, Chien-Yu; Lin, Gong-Hong; Liaw, Lih-Jiun; Lee, Shih-Chieh; Hsieh, Ching-Lin

2017-10-01

The 3-point Berg Balance Scale (BBS-3P) and 3-point Postural Assessment Scale for Stroke Patients (PASS-3P) were simplified from the BBS and PASS to overcome the complex scoring systems. The BBS-3P and PASS-3P were more feasible in busy clinical practice and showed similarly sound validity and responsiveness to the original measures. However, the reliability of the BBS-3P and PASS-3P is unknown limiting their utility and the interpretability of scores. We aimed to examine the test-retest reliability and minimal detectable change (MDC) of the BBS-3P and PASS-3P in patients with stroke. Cross-sectional study. The rehabilitation departments of a medical center and a community hospital. A total of 51 chronic stroke patients (64.7% male). Both balance measures were administered twice 7 days apart. The test-retest reliability of both the BBS-3P and PASS-3P were examined by intraclass correlation coefficients (ICC). The MDC and its percentage over the total score (MDC%) of each measure was calculated for examining the random measurement errors. The ICC values of the BBS-3P and PASS-3P were 0.99 and 0.97, respectively. The MDC% (MDC) of the BBS-3P and PASS-3P were 9.1% (5.1 points) and 8.4% (3.0 points), respectively, indicating that both measures had small and acceptable random measurement errors. Our results showed that both the BBS-3P and the PASS-3P had good test-retest reliability, with small and acceptable random measurement error. These two simplified 3-level balance measures can provide reliable results over time. Our findings support the repeated administration of the BBS-3P and PASS-3P to monitor the balance of patients with stroke. The MDC values can help clinicians and researchers interpret the change scores more precisely.
The psychometric properties of an Arabic numeric pain rating scale for measuring osteoarthritis knee pain.

PubMed

Alghadir, Ahmad H; Anwer, Shahnawaz; Iqbal, Zaheen Ahmed

2016-12-01

The aims of this study were to translate the numeric rating scale (NRS) into Arabic and to evaluate the test-retest reliability and convergent validity of an Arabic Numeric Pain Rating Scale (ANPRS) for measuring pain in osteoarthritis (OA) of the knee. The English version of the NRS was translated into Arabic as per the translation process guidelines for patient-rated outcome scales. One hundred twenty-one consecutive patients with OA of the knee who had experienced pain for more than 6 months were asked to report their pain levels on the ANPRS, visual analogue scale (VAS), and verbal rating scale (VRS). A second assessment was performed 48 h after the first to assess test-retest reliability. The test-retest reliability was calculated using the intraclass correlation coefficient (ICC2,1). The convergent validity was assessed using Spearman rank correlation coefficient. In addition, the minimum detectable change (MDC) and standard error of measurement (SEM) were also assessed. The repeatability of ANPRS was good to excellent (ICC 0.89). The SEM and MDC were 0.71 and 1.96, respectively. Significant correlations were found with the VAS and VRS scores (p <0.01). The Arabic numeric pain rating scale is a valid and reliable scale for measuring pain levels in OA of the knee. Implications for Rehabilitation The Arabic Numeric Pain Rating Scale (ANPRS) is a reliable and valid instrument for measuring pain in osteoarthritis (OA) of the knee, with psychometric properties in agreement with other widely used scales. The ANPRS is well correlated with the VAS and NRS scores in patients with OA of the knee. The ANPRS appears to measure pain intensity similar to the VAS, NRS, and VRS and may provide additional advantages to Arab populations, as Arabic numbers are easily understood by this population.
The test-retest reliability of the latent construct of executive function depends on whether tasks are represented as formative or reflective indicators.

PubMed

Willoughby, Michael T; Kuhn, Laura J; Blair, Clancy B; Samek, Anya; List, John A

2017-10-01

This study investigates the test-retest reliability of a battery of executive function (EF) tasks with a specific interest in testing whether the method that is used to create a battery-wide score would result in differences in the apparent test-retest reliability of children's performance. A total of 188 4-year-olds completed a battery of computerized EF tasks twice across a period of approximately two weeks. Two different approaches were used to create a score that indexed children's overall performance on the battery-i.e., (1) the mean score of all completed tasks and (2) a factor score estimate which used confirmatory factor analysis (CFA). Pearson and intra-class correlations were used to investigate the test-retest reliability of individual EF tasks, as well as an overall battery score. Consistent with previous studies, the test-retest reliability of individual tasks was modest (rs ≈ .60). The test-retest reliability of the overall battery scores differed depending on the scoring approach (r mean = .72; r factor_ score = .99). It is concluded that the children's performance on individual EF tasks exhibit modest levels of test-retest reliability. This underscores the importance of administering multiple tasks and aggregating performance across these tasks in order to improve precision of measurement. However, the specific strategy that is used has a large impact on the apparent test-retest reliability of the overall score. These results replicate our earlier findings and provide additional cautionary evidence against the routine use of factor analytic approaches for representing individual performance across a battery of EF tasks.
Short-term test-retest-reliability of conditioned pain modulation using the cold-heat-pain method in healthy subjects and its correlation to parameters of standardized quantitative sensory testing.

PubMed

Gehling, Julia; Mainka, Tina; Vollert, Jan; Pogatzki-Zahn, Esther M; Maier, Christoph; Enax-Krumova, Elena K

2016-08-05

Conditioned Pain Modulation (CPM) is often used to assess human descending pain inhibition. Nine different studies on the test-retest-reliability of different CPM paradigms have been published, but none of them has investigated the commonly used heat-cold-pain method. The results vary widely and therefore, reliability measures cannot be extrapolated from one CPM paradigm to another. Aim of the present study was to analyse the test-retest-reliability of the common heat-cold-pain method and its correlation to pain thresholds. We tested the short-term test-retest-reliability within 40 ± 19.9 h using a cold-water immersion (10 °C, left hand) as conditioning stimulus (CS) and heat pain (43-49 °C, pain intensity 60 ± 5 on the 101-point numeric rating scale, right forearm) as test stimulus (TS) in 25 healthy right-handed subjects (12females, 31.6 ± 14.1 years). The TS was applied 30s before (TSbefore), during (TSduring) and after (TSafter) the 60s CS. The difference between the pain ratings for TSbefore and TSduring represents the early CPM-effect, between TSbefore and TSafter the late CPM-effect. Quantitative sensory testing (QST, DFNS protocol) was performed on both sessions before the CPM assessment. paired t-tests, Intraclass correlation coefficient (ICC), standard error of measurement (SEM), smallest real difference (SRD), Pearson's correlation, Bland-Altman analysis, significance level p < 0.05 with Bonferroni correction for multiple comparisons, when necessary. Pain ratings during CPM correlated significantly (ICC: 0.411…0.962) between both days, though ratings for TSafter were lower on day 2 (p < 0.005). The early (day 1: 16.7 ± 11.7; day 2: 19.5 ± 11.9; ICC: 0.618, SRD: 20.2) and late (day 1: 1.7 ± 9.2; day 2: 7.6 ± 11.5; ICC: 0.178, SRD: 27.0) CPM effect did not differ significantly between both days. Both early and late CPM-effects did not correlate with the pain thresholds. The short-term test-retest-reliability of the early CPM-effect using the heat-cold-pain method in healthy subjects achieved satisfying results in terms of the ICC. The SRD of the early CPM effect showed that an individual change of > 20 NRS can be attributed to a real change rather than chance. The late CPM-effect was weaker and not reliable.
Repeatability of self-report measures of physical activity, sedentary and travel behaviour in Hong Kong adolescents for the iHealt(H) and IPEN - Adolescent studies.

PubMed

Cerin, Ester; Sit, Cindy H P; Huang, Ya-Jun; Barnett, Anthony; Macfarlane, Duncan J; Wong, Stephen S H

2014-06-06

Physical activity and sedentary behaviour are important contributors to adolescents' health. These behaviours may be affected by the school and neighbourhood built environments. However, current evidence on such effects is mainly limited to Western countries. The International Physical Activity and the Environment Network (IPEN)-Adolescent study aims to examine associations of the built environment with adolescent physical activity and sedentary behaviour across five continents.We report on the repeatability of measures of in-school and out-of school physical activity, plus measures of out-of-school sedentary and travel behaviours adopted by the IPEN - Adolescent study and adapted for Chinese-speaking Hong Kong adolescents participating in the international Healthy environments and active living in teenagers-(Hong Kong) [iHealt(H)] study, which is part of IPEN-Adolescent. Items gauging in-school physical activity and out-of-school physical activity, and out-of-school sedentary and travel behaviours developed for the IPEN - Adolescent study were translated from English into Chinese, adapted, and pilot tested. Sixty-eight Chinese-speaking 12-17 year old secondary school students (36 boys; 32 girls) residing in areas of Hong Kong differing in transport-related walkability were recruited. They self-completed the survey items twice, 8-16 days apart. Test-retest reliability was assessed for the whole sample and by gender using one-way random effects intra-class correlation coefficients (ICC). Test-retest reliability of items with restricted variability was assessed using percentage agreement. Overall test-retest reliability of items and scales was moderate to excellent (ICC = 0.47-0.92). Items with restricted variability in responses had a high percentage agreement (92%-100%). Test-retest reliability was similar in girls and boys, with the exception of daily hours of homework (reliability higher in girls) and number of school-based sports teams or after-school physical activity classes (reliability higher in boys). The translated and adapted self-report measures of physical activity, sedentary and travel behaviours used in the iHealt(H) study are sufficiently reliable. Levels of reliability are comparable or slightly higher than those observed for the original measures.
Test-retest reliability of Physical Activity Neighborhood Environment Scale among urban men and women in Nanjing, China.

PubMed

Zhao, L; Wang, Z; Qin, Z; Leslie, E; He, J; Xiong, Y; Xu, F

2018-03-01

The identification of physical-activity-friendly built environment (BE) constructs is highly useful for physical activity promotion and maintenance. The Physical Activity Neighborhood Environment Scale (PANES) was developed for assessing BE correlates. However, PANES reliability has not been investigated among adults in China. A cross-sectional study. With multistage sampling approaches, 1568 urban adults (aged 35-74 years) were recruited for the initial survey on all 17 items of PANES Chinese version (PANES-CHN), with the survey repeated 7 days later for each participant. Intraclass correlation coefficient (ICC) was used to assess the test-retest reliability of PANES-CHN for each item. Totally, 1551 participants completed both surveys (follow-up rate = 98.9%). Among participants (mean age: 54.7 ± 11.1 years), 47.8% were men, 22.1% were elders, and 22.7% had ≥13 years of education. Overall, the PANES-CHN demonstrated at least substantial reliability with ICCs ranging from 0.66 to 0.95 (core items), from 0.75 to 0.95 (recommended items), and from 0.78 to 0.87 (optional items). Similar outcomes were observed when data were analyzed by gender or age groups. The PANES-CHN has excellent test-retest reliability and thus has valuable utility for assessing urban BE attributes among Chinese adults. Copyright © 2017 The Royal Society for Public Health. Published by Elsevier Ltd. All rights reserved.
Reliability and criterion validity of measurements using a smart phone-based measurement tool for the transverse rotation angle of the pelvis during single-leg lifting.

PubMed

Jung, Sung-Hoon; Kwon, Oh-Yun; Jeon, In-Cheol; Hwang, Ui-Jae; Weon, Jong-Hyuck

2018-01-01

The purposes of this study were to determine the intra-rater test-retest reliability of a smart phone-based measurement tool (SBMT) and a three-dimensional (3D) motion analysis system for measuring the transverse rotation angle of the pelvis during single-leg lifting (SLL) and the criterion validity of the transverse rotation angle of the pelvis measurement using SBMT compared with a 3D motion analysis system (3DMAS). Seventeen healthy volunteers performed SLL with their dominant leg without bending the knee until they reached a target placed 20 cm above the table. This study used a 3DMAS, considered the gold standard, to measure the transverse rotation angle of the pelvis to assess the criterion validity of the SBMT measurement. Intra-rater test-retest reliability was determined using the SBMT and 3DMAS using intra-class correlation coefficient (ICC) [3,1] values. The criterion validity of the SBMT was assessed with ICC [3,1] values. Both the 3DMAS (ICC = 0.77) and SBMT (ICC = 0.83) showed excellent intra-rater test-retest reliability in the measurement of the transverse rotation angle of the pelvis during SLL in a supine position. Moreover, the SBMT showed an excellent correlation with the 3DMAS (ICC = 0.99). Measurement of the transverse rotation angle of the pelvis using the SBMT showed excellent reliability and criterion validity compared with the 3DMAS.

Reliability of sonographic assessment of tendinopathy in tennis elbow.

PubMed

Poltawski, Leon; Ali, Syed; Jayaram, Vijay; Watson, Tim

2012-01-01

To assess the reliability and compute the minimum detectable change using sonographic scales to quantify the extent of pathology and hyperaemia in the common extensor tendon in people with tennis elbow. The lateral elbows of 19 people with tennis elbow were assessed sonographically twice, 1-2 weeks apart. Greyscale and power Doppler images were recorded for subsequent rating of abnormalities. Tendon thickening, hypoechogenicity, fibrillar disruption and calcification were each rated on four-point scales, and scores were summed to provide an overall rating of structural abnormality; hyperaemia was scored on a five point scale. Inter-rater reliability was established using the intraclass correlation coefficient (ICC) to compare scores assigned independently to the same set of images by a radiologist and a physiotherapist with training in musculoskeletal imaging. Test-retest reliability was assessed by comparing scores assigned by the physiotherapist to images recorded at the two sessions. The minimum detectable change (MDC) was calculated from the test-retest reliability data. ICC values for inter-rater reliability ranged from 0.35 (95% CI: 0.05, 0.60) for fibrillar disruption to 0.77 (0.55, 0.88) for overall greyscale score, and 0.89 (0.79, 0.95) for hyperaemia. Test-retest reliability ranged from 0.70 (0.48, 0.84) for tendon thickening to 0.82 (0.66, 0.90) for overall greyscale score and 0.86 (0.73, 0.93) for calcification. The MDC for the greyscale total score was 2.0/12 and for the hyperaemia score was 1.1/5. The sonographic scoring system used in this study may be used reliably to quantify tendon abnormalities and change over time. A relatively inexperienced imager can conduct the assessment and use the rating scales reliably.
Development and clinical application of a computer-aided real-time feedback system for detecting in-bed physical activities.

PubMed

Lu, Liang-Hsuan; Chiang, Shang-Lin; Wei, Shun-Hwa; Lin, Chueh-Ho; Sung, Wen-Hsu

2017-08-01

Being bedridden long-term can cause deterioration in patients' physiological function and performance, limiting daily activities and increasing the incidence of falls and other accidental injuries. Little research has been carried out in designing effective detecting systems to monitor the posture and status of bedridden patients and to provide accurate real-time feedback on posture. The purposes of this research were to develop a computer-aided system for real-time detection of physical activities in bed and to validate the system's validity and test-retest reliability in determining eight postures: motion leftward/rightward, turning over leftward/rightward, getting up leftward/rightward, and getting off the bed leftward/rightward. The in-bed physical activity detecting system consists mainly of a clinical sickbed, signal amplifier, a data acquisition (DAQ) system, and operating software for computing and determining postural changes associated with four load cell sensing components. Thirty healthy subjects (15 males and 15 females, mean age = 27.8 ± 5.3 years) participated in the study. All subjects were asked to execute eight in-bed activities in a random order and to participate in an evaluation of the test-retest reliability of the results 14 days later. Spearman's rank correlation coefficient was used to compare the system's determinations of postural states with researchers' recordings of postural changes. The test-retest reliability of the system's ability to determine postures was analyzed using the interclass correlation coefficient ICC(3,1). The system was found to exhibit high validity and accuracy (r = 0.928, p < 0.001; accuracy rate: 87.9%) in determining in-bed displacement, turning over, sitting up, and getting off the bed. The system was particularly accurate in detecting motion rightward (90%), turning over leftward (83%), sitting up leftward or rightward (87-93%), and getting off the bed (100%). The test-retest reliability ICC(3,1) value was 0.968 (p < 0.001). The system developed in this study exhibits satisfactory validity and reliability in detecting changes in-bed body postures and can be beneficial in assisting caregivers and clinical nursing staff in detecting the in-bed physical activities of bedridden patients and in developing fall prevention warning systems. Copyright © 2017 Elsevier B.V. All rights reserved.
The Validity and Reliability Test of the Indonesian Version of Gastroesophageal Reflux Disease Quality of Life (GERD-QOL) Questionnaire.

PubMed

Siahaan, Laura A; Syam, Ari F; Simadibrata, Marcellus; Setiati, Siti

2017-01-01

to obtain a valid and reliable GERD-QOL questionnaire for Indonesian application. at the initial stage, the GERD-QOL questionnaire was first translated into Indonesian language and the translated questionnaire was subsequently translated back into the original language (back-to-back translation). The results were evaluated by the researcher team and therefore, an Indonesian version of GERD-QOL questionnaire was developed. Ninety-one patients who had been clinically diagnosed with GERD based on the Montreal criteria were interviewed using the Indonesian version of GERD-QOL questionnaire and the SF 36 questionnaire. The validity was evaluated using a method of construct validity and external validity, and reliability can be tested by the method of internal consistency and test retest. the Indonesian version of GERD-QOL questionnaire had a good internal consistency reliability with a Cronbach Alpha of 0.687-0.842 and a good test retest reliability with an intra-class correlation coefficient of 0.756-0.936; p<0.05). The questionnaire had also been demonstrated to have a good validity with a proven high correlation to each question of SF-36 (p<0.05). the Indonesian version of GERD-QOL questionnaire has been proven valid and reliable to evaluate the quality of life of GERD patients.
Development of a clinical static and dynamic standing balance measurement tool appropriate for use in adolescents.

PubMed

Emery, Carolyn A; Cassidy, J David; Klassen, Terry P; Rosychuk, Rhonda J; Rowe, Brian B

2005-06-01

There is a need in sports medicine for a static and dynamic standing balance measure to quantify balance ability in adolescents. The purposes of this study were to determine the test-retest reliability of timed static (eyes open) and dynamic (eyes open and eyes closed) unipedal balance measurements and to examine factors associated with balance. Adolescents (n=123) were randomly selected from 10 Calgary high schools. This study used a repeated-measures design. One rater measured unipedal standing balance, including timed eyes-closed static (ECS), eyes-open dynamic (EOD), and eyes-closed dynamic (ECD) balance at baseline and 1 week later. Dynamic balance was measured on a foam surface. Reliability was examined using both intraclass correlation coefficients (ICCs) and Bland and Altman statistical techniques. Multiple linear regressions were used to examine other potentially influencing factors. Based on ICCs, test-retest reliability was adequate for ECS, EOD, and ECD balance (ICC=.69, .59, and .46, respectively). The results of Bland and Altman methods, however, suggest that caution is required in interpreting reliability based on ICCs alone. Although both ECS balance and ECD balance appear to demonstrate adequate test-retest reliability by ICC, Bland and Altman methods of agreement demonstrate sufficient reliability for ECD balance only. Thirty percent of the subjects reached the 180-second maximum on EOD balance, suggesting that this test is not appropriate for use in this population. Balance ability (ECS and ECD) was better in adolescents with no past history of lower-extremity injury. Timed ECD balance is an appropriate and reliable clinical measurement for use in adolescents and is influenced by previous injury.
Isokinetic Strength and Endurance Tests used Pre- and Post-Spaceflight: Test-Retest Reliability

NASA Technical Reports Server (NTRS)

Laughlin, Mitzi S.; Lee, Stuart M. C.; Loehr, James A.; Amonette, William E.

2009-01-01

To assess changes in muscular strength and endurance after microgravity exposure, NASA measures isokinetic strength and endurance across multiple sessions before and after long-duration space flight. Accurate interpretation of pre- and post-flight measures depends upon the reliability of each measure. The purpose of this study was to evaluate the test-retest reliability of the NASA International Space Station (ISS) isokinetic protocol. Twenty-four healthy subjects (12 M/12 F, 32.0 +/- 5.6 years) volunteered to participate. Isokinetic knee, ankle, and trunk flexion and extension strength as well as endurance of the knee flexors and extensors were measured using a Cybex NORM isokinetic dynamometer. The first weekly session was considered a familiarization session. Data were collected and analyzed for weeks 2-4. Repeated measures analysis of variance (alpha=0.05) was used to identify weekly differences in isokinetic measures. Test-retest reliability was evaluated by intraclass correlation coefficients (ICC) (3,1). No significant differences were found between weeks in any of the strength measures and the reliability of the strength measures were all considered excellent (ICC greater than 0.9), except for concentric ankle dorsi-flexion (ICC=0.67). Although a significant difference was noted in weekly endurance measures of knee extension (p less than 0.01), the reliability of endurance measure by week were considered excellent for knee flexion (ICC=0.97) and knee extension (ICC=0.96). Except for concentric ankle dorsi-flexion, the isokinetic strength and endurance measures are highly reliable when following the NASA ISS protocol. This protocol should allow accurate interpretation isokinetic data even with a small number of crew members.
Clinimetric properties of the Tinetti Mobility Test, Four Square Step Test, Activities-specific Balance Confidence Scale, and spatiotemporal gait measures in individuals with Huntington's disease

PubMed Central

Kloos, Anne D.; Fritz, Nora E.; Kostyk, Sandra K.; Young, Gregory S.; Kegelmeyer, Deb A.

2014-01-01

Background and purpose Individuals with Huntington's disease (HD) experience balance and gait problems that lead to falls. Clinicians currently have very little information about the reliability and validity of outcome measures to determine the efficacy of interventions that aim to reduce balance and gait impairments in HD. This study examined the reliability and concurrent validity of spatiotemporal gait measures, the Tinetti Mobility Test (TMT), Four Square Step Test (FSST), and Activities-specific Balance Confidence (ABC) Scale in individuals with HD. Methods Participants with HD [n = 20; mean age ± SD = 50.9 ± 13.7; 7 male] were tested on spatiotemporal gait measures the TMT, FSST, and ABC Scale before and after a six week period to determine test–retest reliability and minimal detectable change (MDC) values. Linear relationships between gait and clinical measures were estimated using Pearson's correlation coefficients. Results Spatiotemporal gait measures, the TMT total and the FSST showed good to excellent test–retest reliability (ICC > 0.75). MDC values were 0.30 m/s and 0.17 m/s for velocity in forward and backward walking respectively, four points for the TMT, and 3 s for the FSST. The TMT and FSST were highly correlated with most spatiotemporal measures. The ABC Scale demonstrated lower reliability and less concurrent validity than other measures. Conclusions The high test–retest reliability over a six week period and concurrent validity between the TMT, FSST, and spatiotemporal gait measures suggest that the TMT and FSST may be useful outcome measures for future intervention studies in ambulatory individuals with HD. PMID:25128156
Test-retest reliability and practice effects of a rapid screen of mild traumatic brain injury.

PubMed

De Monte, Veronica Eileen; Geffen, Gina Malke; Kwapil, Karleigh

2005-07-01

Test-retest reliabilities and practice effects of measures from the Rapid Screen of Concussion (RSC), in addition to the Digit Symbol Substitution Test (Digit Symbol), were examined. Twenty five male participants were tested three times; each testing session scheduled a week apart. The test-retest reliability estimates for most measures were reasonably good, ranging from .79 to .97. An exception was the delayed word recall test, which has had a reliability estimate of .66 for the first retest, and .59 for the second retest. Practice effects were evident from Times 1 to 2 on the sentence comprehension and delayed recall subtests of the RSC, Digit Symbol and a composite score. There was also a practice effect of the same magnitude found from Time 2 to Time 3 on Digit Symbol, delayed recall and the composite score. Statistics on measures for both the first and second retest intervals, with associated practice effects, are presented to enable the calculation of reliable change indices (RCI). The RCI may be used to assess any improvement in cognitive functioning after mild Traumatic Brain Injury.
Hypertension Knowledge-Level Scale (HK-LS): a study on development, validity and reliability.

PubMed

Erkoc, Sultan Baliz; Isikli, Burhanettin; Metintas, Selma; Kalyoncu, Cemalettin

2012-03-01

This study was conducted to develop a scale to measure knowledge about hypertension among Turkish adults. The Hypertension Knowledge-Level Scale (HK-LS) was generated based on content, face, and construct validity, internal consistency, test re-test reliability, and discriminative validity procedures. The final scale had 22 items with six sub-dimensions. The scale was applied to 457 individuals aged ≥ 18 years, and 414 of them were re-evaluated for test-retest reliability. The six sub-dimensions encompassed 60.3% of the total variance. Cronbach alpha coefficients were 0.82 for the entire scale and 0.92, 0.59, 0.67, 0.77, 0.72, and 0.76 for the sub-dimensions of definition, medical treatment, drug compliance, lifestyle, diet, and complications, respectively. The scale ensured internal consistency in reliability and construct validity, as well as stability over time. Significant relationships were found between knowledge score and age, gender, educational level, and history of hypertension of the participants. No correlation was found between knowledge score and working at an income-generating job. The present scale, developed to measure the knowledge level of hypertension among Turkish adults, was found to be valid and reliable.
Translation and validation of the Spanish version of the Health of the Nation Outcome Scales for People with Learning Disabilities (HoNOS-LD).

PubMed

Esteba-Castillo, Susanna; Torrents-Rodas, David; García-Alba, Javier; Ribas-Vidal, Núria; Novell-Alsina, Ramon

2016-12-21

The Health of the Nation Outcome Scales for People with Learning Disabilities (HoNOS-LD) is a brief instrument that assesses functioning in people with intellectual development disorder and mental health problems/behaviour disorders. The aim of the present study was to examine the evidence on the validity of the scores based on the Spanish version of the HoNOS-LD. The study included 111 participants that were assessed by the Spanish version of the HoNOS-LD and other questionnaires that measured different variables related to the scale. Thirty-three participants were assessed by 2 examiners, and retested 7 days later, in order to study inter-examiner reliability and test-retest reliabilities. Based on clinical and conceptual criteria, and on the results of the parallel analysis, a factorial solution with one factor was selected. Internal consistency was good (Omega coefficient of 0.87). Inter-examiner and test-retest reliabilities were excellent (intraclass correlation coefficients of 0.95 and 0.98, respectively). Correlations between sections of the HoNOS-LD and the related instruments showed the expected direction, and were highly significant (P<.001), and the HoNOS-LD score increased with the intensity of the support required by the participants. These results showed evidence of the validity of association with other external variables. The Spanish version of the HoNOS-LD is a brief, valid and reliable instrument, which will enable a routine assessment of functioning for different uses, including diagnosis and intervention. Copyright © 2016 SEP y SEPB. Publicado por Elsevier España, S.L.U. All rights reserved.
The validity and reliability of the Thai version of the Kujala score for patients with patellofemoral pain syndrome.

PubMed

Apivatgaroon, Adinun; Angthong, Chayanin; Sanguanjit, Prakasit; Chernchujit, Bancha

2016-10-01

To develop a Thai version of the Kujala score and show the evaluation of the validity and reliability of the score. The Thai version of the Kujala score was developed using the forward-backward translation protocol. The 49 PFPS patients answered the Thai version of questionnaires including the Kujala score, Short Form-36 (SF-36) and International Knee Documentation Committee (IKDC) Subjective Knee Form. The validity between the scores has been tested. The reliability was assessed using test-retest reliability and internal consistency. The Thai version of the Kujala score showed a good correlation with Thai IKDC Subjective Knee Form (Pearson's correlation coefficient; r = 0.74: p < 0.01) and moderate correlation with the Thai SF-36 subscales of physical component summary, total score and role physical (r = 0.586, 0.571 and 0.524, respectively: p < 0.01). The test-retest reliability was excellent with an intra-class correlation coefficient of 0.908 (p < 0.001; 95% CI [0.842-0.947]). The internal consistency was strong with Cronbach's alpha of 0.952 (p < 0.001). No floor and ceiling effects were observed. The Thai version of the Kujala score has shown good validity and reliability. This score can be effectively used for evaluating Thai patients with patellofemoral pain syndrome. Implications for Rehabilitation The Kujala score is a self-administered questionnaire for patients with patellofemoral pain syndrome (PFPS). The validity and reliability of the Thai version of Kujala are compatible with other versions (Turkish, Chinese and Persian version). The Thai version of Kujala has been shown to have validity and reliability in Thai PFPS patients and can be used for clinical evaluation and also in the research work.
The Healthy Brain Network Serial Scanning Initiative: a resource for evaluating inter-individual differences and their reliabilities across scan conditions and sessions

PubMed Central

O’Connor, David; Potler, Natan Vega; Kovacs, Meagan; Xu, Ting; Ai, Lei; Pellman, John; Vanderwal, Tamara; Parra, Lucas C.; Cohen, Samantha; Ghosh, Satrajit; Escalera, Jasmine; Grant-Villegas, Natalie; Osman, Yael; Bui, Anastasia; Craddock, R. Cameron

2017-01-01

Abstract Background: Although typically measured during the resting state, a growing literature is illustrating the ability to map intrinsic connectivity with functional MRI during task and naturalistic viewing conditions. These paradigms are drawing excitement due to their greater tolerability in clinical and developing populations and because they enable a wider range of analyses (e.g., inter-subject correlations). To be clinically useful, the test-retest reliability of connectivity measured during these paradigms needs to be established. This resource provides data for evaluating test-retest reliability for full-brain connectivity patterns detected during each of four scan conditions that differ with respect to level of engagement (rest, abstract animations, movie clips, flanker task). Data are provided for 13 participants, each scanned in 12 sessions with 10 minutes for each scan of the four conditions. Diffusion kurtosis imaging data was also obtained at each session. Findings: Technical validation and demonstrative reliability analyses were carried out at the connection-level using the Intraclass Correlation Coefficient and at network-level representations of the data using the Image Intraclass Correlation Coefficient. Variation in intrinsic functional connectivity across sessions was generally found to be greater than that attributable to scan condition. Between-condition reliability was generally high, particularly for the frontoparietal and default networks. Between-session reliabilities obtained separately for the different scan conditions were comparable, though notably lower than between-condition reliabilities. Conclusions: This resource provides a test-bed for quantifying the reliability of connectivity indices across subjects, conditions and time. The resource can be used to compare and optimize different frameworks for measuring connectivity and data collection parameters such as scan length. Additionally, investigators can explore the unique perspectives of the brain's functional architecture offered by each of the scan conditions. PMID:28369458
A Structured Clinical Interview for Kleptomania (SCI-K): preliminary validity and reliability testing.

PubMed

Grant, Jon E; Kim, Suck Won; McCabe, James S

2006-06-01

Kleptomania presents difficulties in diagnosis for clinicians. This study aimed to develop and test a DSM-IV-based diagnostic instrument for kleptomania. To assess for current kleptomania the Structured Clinical Interview for Kleptomania (SCI-K) was administered to 112 consecutive subjects requesting psychiatric outpatient treatment for a variety of disorders. Reliability and validity were determined. Classification accuracy was examined using the longitudinal course of illness. The SCI-K demonstrated excellent test-retest (Phi coefficient = 0.956 (95% CI = 0.937, 0.970)) and inter-rater reliability (phi coefficient = 0.718 (95% CI = 0.506, 0.848)) in the diagnosis of kleptomania. Concurrent validity was observed with a self-report measure using DSM-IV kleptomania criteria (phi coefficient = 0.769 (95% CI = 0.653, 0.850)). Discriminant validity was observed with a measure of depression (point biserial coefficient = -0.020 (95% CI = -0.205, 0.166)). The SCI-K demonstrated both high sensitivity and specificity based on longitudinal assessment. The SCI-K demonstrated excellent reliability and validity in diagnosing kleptomania in subjects presenting with various psychiatric problems. These findings require replication in larger groups, including non-psychiatric populations, to examine their generalizability. Copyright (c) 2006 John Wiley & Sons, Ltd.
An Indian adaptation of the Involvement Evaluation Questionnaire: similarities and differences in assessment of caregiver burden.

PubMed

Grover, S; Chakrabarti, S; Ghormode, D; Dutt, A; Kate, N; Kulhara, P

2011-12-01

The Involvement Evaluation Questionnaire (IEQ) is a comprehensive, conceptually valid and reliable means of assessing caregiver burden. However, its psychometric properties have rarely been examined in non-European settings. The aim of the present study was to evaluate the psychometric properties of an Indian translation of the IEQ (Hindi-IEQ). The European Union (English) version of IEQ was translated into Hindi and reviewed by a group of experts and caregivers for translation accuracy, cultural appropriateness, and for relevance and acceptability of items and constructs. The Hindi-IEQ was then administered to 162 primary caregivers of patients with severe mental illnesses. Eighteen caregivers completed both the English and Hindi versions to check the level of agreement between them. Another 27 completed the Hindi-IEQ twice, a week apart, to evaluate its test-retest reliability. Factor structure of the Hindi-IEQ was examined using an exploratory, principal components and factor analysis. Pearson's correlation coefficients were significant for 24 items, while intraclass correlation coefficients were significant for 28 of the 31 items (P < 0.05), indicating a satisfactory level of agreement between the Hindi and English versions. Test-retest reliability for all items of the Hindi-IEQ was adequate, with kappa values ranging from 0.46 to 0.95 and intraclass correlation coefficients from 0.76 to 1.00. Internal consistency (Cronbach's alpha = 0.89) and the split-half reliability (Spearman-Brown coefficient = 0.68) of the Hindi-IEQ were also satisfactory. However, several differences were noted in the factor structure and distribution of scores of the Hindi-IEQ, which were quite unlike that of the European Union version. The similarities and differences between the 2 versions of the IEQ indicated that sociocultural factors could influence assessment of caregiver burden across different cultures.
Turkish version of the modified Constant-Murley score and standardized test protocol: reliability and validity.

PubMed

Çelik, Derya

2016-01-01

The Constant-Murley score (CMS) is widely used to evaluate disabilities associated with shoulder injuries, but it has been criticized for relying on imprecise terminology and a lack of standardized methodology. A modified guideline, therefore, was published in 2008 with several recommendations. This new version has not yet been translated or culturally adapted for Turkish-speaking populations. The purpose of this study was to translate and cross-culturally adapt the modified CMS and its test protocol, as well as define and measure its reliability and validity. The modified CMS was translated into Turkish, consistent with published methodological guidelines. The measurement properties of the Turkish version of the modified CMS were tested in 30 patients (12 males, 18 females; mean age: 59.5±13.5 years) with a variety of shoulder pathologies. Intraclass correlation coefficients (ICC) were used to estimate test-retest reliability. Construct validity was analyzed with the Turkish version of the American Shoulder and Elbow Surgeons (ASES) Standardized Shoulder Assessment Form and Short-Form Health Survey (SF-12). No difficulties were found in the translation process. The Turkish version of the modified CMS showed excellent test-retest reliability (ICC=0.86). The correlation coefficients between the Turkish version of the modified CMS and the ASES, SF-12-physical component score, and SF-12 mental component scores were found to be 0.48, 0.35, and 0.05, respectively. No floor or ceiling effects were found. The translation and cultural adaptation of the modified CMS and its standardized test protocol into Turkish were successful. The Turkish version of the modified CMS has sufficient reliability and validity to measure a variety of shoulder disorders for Turkish-speaking individuals.
Test-retest reliability of posture measurements in adolescents with idiopathic scoliosis.

PubMed

Heitz, Pierre-Henri; Aubin-Fournier, Jean-François; Parent, Éric; Fortin, Carole

2018-05-07

Posture changes are a major consequence of IS (IS). Posture changes can lead to psychosocial and physical impairments in adolescents with IS. Therefore, it is important to assess posture but the test-retest reliability of posture measurements still remains unknown in this population. The primary objective was to determine the test-retest reliability of 25 head and trunk posture indices using the Clinical Photographic Postural Assessment Tool (CPPAT) in adolescents with IS. The secondary objective was to determine the standard error of measurement and the minimal detectable change. This is a prospective test-retest reliability study carried out at two tertiary university hospital centers. Forty-one adolescents with IS, aged 10 to 16 years old with curves 10 to 45 o and treated non-operatively were recruited. Two posture assessments were done using the CPPAT five to 10 days apart following a standardized procedure. Photographs were analyzed with the CPPAT software by digitizing reference landmarks placed on the participant by a physiotherapist evaluator. Generalizability theory was used to obtain a coefficient of dependability, standard error of measurement and the minimal detectable change at the 90% confidence interval. This project was supported by the Canadian Pediatric Spine Society (CPSS: 10000$). There is no study-specific conflicts of interest-associated biases. Fourteen of 25 posture indices had a good reliability (ϕ ≥ 0.78), ten of 25 had moderate reliability (ϕ = 0.55 to 0.74) and one had poor reliability (ϕ = 0.45). The most reliable posture indices were waist angles asymmetry (ϕ = 0.93), right waist angle (ϕ = 0.91) and frontal trunk list (ϕ = 0.92). Right sagittal trunk list was the least reliable posture index (ϕ = 0.45). The MDC 90 values ranged from 2.6 to 10.3° for angular measurements and from 8.4 to 35.1 mm for linear measurements. This study demonstrates that most posture indices, especially the trunk posture indices, are reproducible in time among adolescents with IS and provides reference values. Clinicians and researchers can use these reference values in order to assess change in posture over time attributable to treatment effectiveness. Copyright © 2018. Published by Elsevier Inc.
Psychometric Evaluation of the Revised Michigan Diabetes Knowledge Test (V.2016) in Arabic: Translation and Validation

PubMed Central

Alhaiti, Ali Hassan; Alotaibi, Alanod Raffa; Jones, Linda Katherine; DaCosta, Cliff

2016-01-01

Objective. To translate the revised Michigan Diabetes Knowledge Test into the Arabic language and examine its psychometric properties. Setting. Of the 139 participants recruited through King Fahad Medical City in Riyadh, Saudi Arabia, 34 agreed to the second-round sample for retesting purposes. Methods. The translation process followed the World Health Organization's guidelines for the translation and adaptation of instruments. All translations were examined for their validity and reliability. Results. The translation process revealed excellent results throughout all stages. The Arabic version received 0.75 for internal consistency via Cronbach's alpha test and excellent outcomes in terms of the test-retest reliability of the instrument with a mean of 0.90 infraclass correlation coefficient. It also received positive content validity index scores. The item-level content validity index for all instrument scales fell between 0.83 and 1 with a mean scale-level index of 0.96. Conclusion. The Arabic version is proven to be a reliable and valid measure of patient's knowledge that is ready to be used in clinical practices. PMID:27995149
Development of the Facial Skin Care Index: A Health-Related Outcomes Index for Skin Cancer Patients

PubMed Central

Matthews, B. Alex; Rhee, John S.; Neuburg, Marcy; Burzynski, Mary L.; Nattinger, Ann B.

2006-01-01

BACKGROUND Existing health-related quality-of-life (HRQOL) tools do not appear to capture patients' specific skin cancer concerns. OBJECTIVE To describe the conceptual foundation, item generation, reduction process, and reliability testing for the Facial Skin Cancer Index (FSCI), a HRQOL outcomes tool for skin cancer researchers and clinicians. METHODS Participants in Phases I to III consisted of adult patients (N = 134) diagnosed with biopsy-proven nonmelanoma cervicofacial skin cancer. Data were collected via self-report surveys and clinical records. RESULTS Seventy-one distinct items were generated in Phase I and rated for their importance by an independent sample during Phase II; 36 items representing six theoretical HRQOL domains were retained. Test–retest I results indicated that four subscales showed adequate reliability coefficients (α = 0.60 to 0.91). Twenty-six items remained for test–retest II. Results indicated excellent internal consistency for emotional, social, appearance, and modified financial/work subscales (range 0.79 to 0.95); test–retest correlation coefficients were consistent across time (range 0.81 to 0.97; lifestyle omitted). CONCLUSION Pretesting afforded the opportunity to select items that optimally met our a priori conceptual and psychometric criteria for high data quality. Phase IV testing (validity and sensitivity before surgery and 4 months after Mohs micrographic surgery) for the 20-item FSCI is under way. PMID:16875475
Test-retest reliability of a handheld dynamometer for measurement of isometric cervical muscle strength.

PubMed

Vannebo, Katrine Tranaas; Iversen, Vegard Moe; Fimland, Marius Steiro; Mork, Paul Jarle

2018-03-02

There is a lack of test-retest reliability studies of measurements of cervical muscle strength, taking into account gender and possible learning effects. To investigate test-retest reliability of measurement of maximal isometric cervical muscle strength by handheld dynamometry. Thirty women (age 20-58 years) and 28 men (age 20-60 years) participated in the study. Maximal isometric strength (neck flexion, neck extension, and right/left lateral flexion) was measured on three separate days at least five days apart by one evaluator. Intra-rater consistency tended to improve from day 1-2 measurements to day 2-3 measurements in both women and men. In women, the intra-class correlation coefficients (ICC) for day 2 to day 3 measurements were 0.91 (95% confidence interval [CI], 0.82-0.95) for neck flexion, 0.88 (95% CI, 0.76-0.94) for neck extension, 0.84 (95% CI, 0.68-0.92) for right lateral flexion, and 0.89 (95% CI, 0.78-0.95) for left lateral flexion. The corresponding ICCs among men were 0.86 (95% CI, 0.72-0.93) for neck flexion, 0.93 (95% CI, 0.85-0.97) for neck extension, 0.82 (95% CI, 0.65-0.91) for right lateral flexion and 0.73 (95% CI, 0.50-0.87) for left lateral flexion. This study describes a reliable and easy-to-administer test for assessing maximal isometric cervical muscle strength.
Reliability testing of a portfolio assessment tool for postgraduate family medicine training in South Africa

PubMed Central

Mash, Bob; Derese, Anselme

2013-01-01

Abstract Background Competency-based education and the validity and reliability of workplace-based assessment of postgraduate trainees have received increasing attention worldwide. Family medicine was recognised as a speciality in South Africa six years ago and a satisfactory portfolio of learning is a prerequisite to sit the national exit exam. A massive scaling up of the number of family physicians is needed in order to meet the health needs of the country. Aim The aim of this study was to develop a reliable, robust and feasible portfolio assessment tool (PAT) for South Africa. Methods Six raters each rated nine portfolios from the Stellenbosch University programme, using the PAT, to test for inter-rater reliability. This rating was repeated three months later to determine test–retest reliability. Following initial analysis and feedback the PAT was modified and the inter-rater reliability again assessed on nine new portfolios. An acceptable intra-class correlation was considered to be > 0.80. Results The total score was found to be reliable, with a coefficient of 0.92. For test–retest reliability, the difference in mean total score was 1.7%, which was not statistically significant. Amongst the subsections, only assessment of the educational meetings and the logbook showed reliability coefficients > 0.80. Conclusion This was the first attempt to develop a reliable, robust and feasible national portfolio assessment tool to assess postgraduate family medicine training in the South African context. The tool was reliable for the total score, but the low reliability of several sections in the PAT helped us to develop 12 recommendations regarding the use of the portfolio, the design of the PAT and the training of raters.
The maximal width of the base of support (BSW): clinical applicability and reliability of a preferred-standing test for measuring the risk of falling.

PubMed

Swanenburg, Jaap; Nevzati, Arian; Mittaz Hager, Anne Gabrielle; de Bruin, Eling D; Klipstein, Andreas

2013-01-01

The aim of this study was to test the reliability and validity of a preferred-standing test for measuring the risk of falling. The preferred-standing position of elderly fallers and non-fallers and healthy young adults was measured. The maximal BSW was measured. The absolute and relative reliability and discriminant validity were assessed. The expanded timed get-up-and-go test (ETGUG), one-leg stance test (OS), tandem stance (TS), and falls efficacy scale international version (FES-I) were used to determine criterion validity. In total, 146 persons (102 females, 44 males; mean age 55±22 years, range 20-94) were recruited. Forty elderly community dwellers (8 fallers) and 26 young adults were tested twice to determine the test-retest reliability. The BSW showed acceptable test-retest reliability (Intraclass correlation coefficient, ICC2,1=0.77-0.83) and inter-rater reliability (ICC3,1=0.77-0.95) for all groups. The standard error of measurement (SEM) was between 0.77 and 1.87, and the smallest detectable change (SDC) was between 2.14cm and 5.19cm. The Bland-Altman plot revealed no systematic errors. There was significant difference between elderly fallers and non-fallers (F(1/75)=11.951; p=0.001. Spearman's rho coefficient values showed no correlation between the BSW and the ETGUG (-0.17, p=0.47), OLS (-0.04, p=0.65), TS (-0.11, p=0.21), and FES-I (-0.10; p=0.27). Only the BSW was a significant predictor for falling (odds ratio=0.736, p=0.007). The reliability and validity of the BSW protocol were acceptable overall. Prospective studies are warranted to evaluate the predictive value of the BSW for determining the risk of falling. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

Design and development of food safety knowledge and attitude scales for consumer food safety education.

PubMed

Medeiros, Lydia C; Hillers, Virginia N; Chen, Gang; Bergmann, Verna; Kendall, Patricia; Schroeder, Mary

2004-11-01

The objective of this study was to design and develop food safety knowledge and attitude scales based on food-handling guidelines developed by a national panel of food safety experts. Knowledge (n=43) and attitude (n=49) questions were developed and pilot-tested with a variety of consumer groups. Final questions were selected based on item analysis and on validity and reliability statistical tests. Knowledge questions were tested in Washington State with participants in low-income nutrition education programs (pretest/posttest n=58, test/retest n=19) and college students (pretest/posttest n=34). Attitude questions were tested in Ohio with nutrition education program participants (n=30) and college students (non-nutrition majors n=138, nutrition majors n=57). Item analysis, paired sample t tests, Pearson's correlation coefficients, and Cronbach's alpha were used. Reliability and validity tests of individual items and the question sets were used to reduce the scales to 18 knowledge questions and 10 attitude questions. The knowledge and attitude scales covered topics ranked as important by a national panel of experts and met most validity and reliability standards. The 18-item knowledge questionnaire had instructional sensitivity (mean score increase of more than three points after instruction), internal reliability (Cronbach's alpha >.75), and produced similar results in test-retest without intervention (coefficient of stability=.81). Knowledge of correct procedures for hand washing and avoiding cross-contamination was widespread before instruction. Knowledge was limited regarding avoiding food preparation while ill, cooking hamburgers, high-risk foods, and whether cooked rice and potatoes could be stored at room temperature. The 10-item attitude scale had an appropriate range of responses (item difficulty) and produced similar results in test-retest ( P
Reliability and Validity of the Chinese Version of FACIT-AI, a New Tool for Assessing Quality of Life in Patients with Malignant Ascites.

PubMed

Lou, Yanni; Lu, Linghui; Li, Yuan; Liu, Meng; Bredle, Jason M; Jia, Liqun

2015-10-01

The study objective was to determine the reliability and validity of the Chinese version of the Functional Assessment of Chronic Illness Therapy - Ascites Index (FACIT-AI). A forward-backward translation procedure was adopted to develop the Chinese version of the FACIT-AI, which was tested in 69 patients with malignant ascites. Cronbach's α, split-half reliability, and test-retest reliability were used to assess the reliability of the scale. The content validity index was used to assess the content validity, while factor analysis was used for construct validity and correlation analysis was used for criterion validity. The Cronbach's α was 0.772 for the total scale, and the split-half reliability was 0.693. The test-retest correlation was 0.972. The content validity index for the scale was 0.8-1.0. Four factors were extracted by factor analysis, and these contributed 63.51% of the total variance. Item-total correlations ranged from 0.591 to 0.897, and these were correlated with visual analog scale scores (correlation coefficient, 0.889; P<0.01). The Chinese version of the FACIT-AI has good reliability and validity and can be used as a tool to measure quality of life in Chinese patients with malignant ascites.
Psychometric Properties of the Persian Version of the Simple Shoulder Test (SST) Questionnaire.

PubMed

Ebrahimzadeh, Mohammad H; Vahedi, Ehsan; Baradaran, Aslan; Birjandinejad, Ali; Seyyed-Hoseinian, Seyyed-Hadi; Bagheri, Farshid; Kachooei, Amir Reza

2016-10-01

To validate the Persian version of the simple shoulder test in patients with shoulder joint problems. Following Beaton`s guideline, translation and back translation was conducted. We reached to a consensus on the Persian version of SST. To test the face validity in a pilot study, the Persian SST was administered to 20 individuals with shoulder joint conditions. We enrolled 148 consecutive patients with shoulder problem to fill the Persian SST, shoulder specific measure including Oxford shoulder score (OSS) and two general measures including DASH and SF-36. To measure the test-retest reliability, 42 patients were randomly asked to fill the Persian-SST for the second time after one week. Cronbach's alpha coefficient was used to demonstrate internal consistency over the 12 items of Persian-SST. ICC for the total questionnaire was 0.61 showing good and acceptable test-retest reliability. ICC for individual items ranged from 0.32 to 0.79. The total Cronbach's alpha was 0.84 showing good internal consistency over the 12 items of the Persian-SST. Validity testing showed strong correlation between SST and OSS and DASH. The correlation with OSS was positive while with DASH scores was negative. The correlation was also good to strong with all physical and most mental subscales of the SF-36. Correlation coefficient was higher with DASH and OSS in compare to SF-36. Persian version of SST found to be valid and reliable instrument for shoulder joint pain and function assessment in Iranian population.
Multiple Sclerosis Walking Scale-12, translation, adaptation and validation for the Persian language population.

PubMed

Nakhostin Ansari, Noureddin; Naghdi, Soofia; Mohammadi, Roghaye; Hasson, Scott

2015-02-01

The Multiple Sclerosis Walking Scale-12 (MSWS-12) is a multi-item rating scale used to assess the perspectives of patients about the impact of MS on their walking ability. The aim of this study was to examine the reliability and validity of the MSWS-12 in Persian speaking patients with MS. The MSWS-12 questionnaire was translated into Persian language according to internationally adopted standards involving forward-backward translation, reviewed by an expert committee and tested on the pre-final version. In this cross-sectional study, 100 participants (50 patients with MS and 50 healthy subjects) were included. The MSWS-12 was administered twice 7 days apart to 30 patients with MS for test and retest reliability. Internal consistency reliability was Cronbach's α 0.96 for test and 0.97 for retest. There were no significant floor or ceiling effects. Test-retest reliability was excellent (intraclass correlation coefficient [ICC] agreement of 0.98, 95% CI, 0.95-0.99) confirming the reproducibility of the Persian MSWS-12. Construct validity using known group methods was demonstrated through a significant difference in the Persian MSWS-12 total score between the patients with MS and healthy subjects. Factor analysis extracted 2 latent factors (79.24% of the total variance). A second factor analysis suggested the 9-item Persian MSWS as a unidimensional scale for patients with MS. The Persian MSWS-12 was found to be valid and reliable for assessing walking ability in Persian speaking patients with MS. Copyright © 2014 Elsevier B.V. All rights reserved.
Test-retest reliability and responsiveness of gaze stability and dynamic visual acuity in high school and college football players.

PubMed

Kaufman, Denise R; Puckett, Mallory J; Smith, Mitchell J; Wilson, Kyle S; Cheema, Rebecca; Landers, Merrill R

2014-08-01

The purpose of this study was to establish reliability and responsiveness of the dynamic visual acuity test (DVAT) at head speeds of 150-200 degrees per second (deg/s) and the gaze stabilization test (GST) in high school and college football players. Reliability design. Fifty high school and college football athletes completed the DVAT and GST in both the yaw (horizontal) and pitch (vertical) planes twice within two weeks. Test-retest reliability for the DVAT was good in yaw, Intraclass Correlation Coefficient (ICC) = 0.770, and moderate/good in pitch, ICC = 0.725. Minimal detectable change (MDC) was 0.16 logMAR for yaw and 0.21 logMAR for pitch. GST reliability was moderate in yaw, ICC = 0.634, and poor in pitch, ICC = 0.411. MDCs were 73.4 deg/s (yaw) and 81.2 deg/s (pitch). The DVAT is reliable at high head speeds in high school and college football athletes in both yaw and pitch. GST head speeds were higher than previously reported in the literature, but reliability of this tool for this population was poor to moderate. From a clinical perspective, DVAT may be reliably used in the assessment of high school and college football athletes; however, GST requires further evaluation. Copyright © 2013 Elsevier Ltd. All rights reserved.
The Pittsburgh Sleep Quality Index: validation of the Urdu translation.

PubMed

Hashmi, Ali Madeeh; Khawaja, Imran Shuja; Butt, Zeeshan; Umair, Muhammad; Naqvi, Suhaib Haider; Jawad-Ul-Haq

2014-02-01

To translate and validate the Pittsburgh Sleep Quality Index (PSQI), a standardized self-administered questionnaire for the assessment of subjective sleep quality into the Urdu language. Validation study. Mayo Hospital, Lahore, from March to April 2012. The PSQI was translated into Urdu following standard guidelines. The final Urdu version (PSQI-U) was administered to 200 healthy volunteers comprising medical students, nursing staff and doctors. Inter-item correlation was assessed by calculating Cronbach alpha. Correlation of component scores with global score was assessed by calculating Spearman correlation coefficient. Correlation between global PSQI-U scores at baseline with global scores for each PSQI-U and PSQI-E at 4-week interval was evaluated by calculating Spearman correlation coefficient. Moreover, scores on individual items of the scale at baseline were compared with respective scores after 4-week by t-test. One hundred and eighty five (185) participants completed the PSQI-U at baseline. The Cronbach alpha for PSQI-U was 0.56. Scores on individual components of the PSQI-U and composite scores were all highly correlated with each other (all p-values < 0.01). Composite scores for PSQI-U at baseline and PSQI-E at 4-week interval were also highly correlated with each other (Spearman correlation coefficient 0.74, p-value < 0.01) indicating good linguistic interchangeability. Composite scores for PSQI-U at baseline and at 4-week interval were positively correlated with each other (Spearman correlation coefficient 0.70, p < 0.01) indicating good test-retest reliability. The PSQI-U is a valid and reliable instrument for the assessment of sleep quality. It shows good linguistic interchangeability and test-retest reliability in comparison to the original English version when applied to individuals who speak the Urdu language. The PSQI-U can be a tool either for clinical management or research.
Test-retest reliability of sudden ankle inversion measurements in subjects with healthy ankle joints.

PubMed

Eechaute, Christophe; Vaes, Peter; Duquet, William; Van Gheluwe, Bart

2007-01-01

Sudden ankle inversion tests have been used to investigate whether the onset of peroneal muscle activity is delayed in patients with chronically unstable ankle joints. Before interpreting test results of latency times in patients with chronic ankle instability and healthy subjects, the reliability of these measures must be first demonstrated. To investigate the test-retest reliability of variables measured during a sudden ankle inversion movement in standing subjects with healthy ankle joints. Validation study. Research laboratory. 15 subjects with healthy ankle joints (30 ankles). Subjects stood on an ankle inversion platform with both feet tightly fixed to independently moveable trapdoors. An unexpected sudden ankle inversion of 50 degrees was imposed. We measured latency and motor response times and electromechanical delay of the peroneus longus muscle, along with the time and angular position of the first and second decelerating moments, the mean and maximum inversion speed, and the total inversion time. Correlation coefficients and standard error of measurements were calculated. Intraclass correlation coefficients ranged from 0.17 for the electromechanical delay of the peroneus longus muscle (standard error of measurement = 2.7 milliseconds) to 0.89 for the maximum inversion speed (standard error of measurement = 34.8 milliseconds). The reliability of the latency and motor response times of the peroneus longus muscle, the time of the first and second decelerating moments, and the mean and maximum inversion speed was acceptable in subjects with healthy ankle joints and supports the investigation of the reliability of these measures in subjects with chronic ankle instability. The lower reliability of the electromechanical delay of the peroneus longus muscle and the angular positions of both decelerating moments calls the use of these variables into question.
Reliable change indices and standardized regression-based change score norms for evaluating neuropsychological change in children with epilepsy.

PubMed

Busch, Robyn M; Lineweaver, Tara T; Ferguson, Lisa; Haut, Jennifer S

2015-06-01

Reliable change indices (RCIs) and standardized regression-based (SRB) change score norms permit evaluation of meaningful changes in test scores following treatment interventions, like epilepsy surgery, while accounting for test-retest reliability, practice effects, score fluctuations due to error, and relevant clinical and demographic factors. Although these methods are frequently used to assess cognitive change after epilepsy surgery in adults, they have not been widely applied to examine cognitive change in children with epilepsy. The goal of the current study was to develop RCIs and SRB change score norms for use in children with epilepsy. Sixty-three children with epilepsy (age range: 6-16; M=10.19, SD=2.58) underwent comprehensive neuropsychological evaluations at two time points an average of 12 months apart. Practice effect-adjusted RCIs and SRB change score norms were calculated for all cognitive measures in the battery. Practice effects were quite variable across the neuropsychological measures, with the greatest differences observed among older children, particularly on the Children's Memory Scale and Wisconsin Card Sorting Test. There was also notable variability in test-retest reliabilities across measures in the battery, with coefficients ranging from 0.14 to 0.92. Reliable change indices and SRB change score norms for use in assessing meaningful cognitive change in children following epilepsy surgery are provided for measures with reliability coefficients above 0.50. This is the first study to provide RCIs and SRB change score norms for a comprehensive neuropsychological battery based on a large sample of children with epilepsy. Tables to aid in evaluating cognitive changes in children who have undergone epilepsy surgery are provided for clinical use. An Excel sheet to perform all relevant calculations is also available to interested clinicians or researchers. Copyright © 2015 Elsevier Inc. All rights reserved.
Validity and Reliability of a General Nutrition Knowledge Questionnaire for Japanese Adults.

PubMed

Matsumoto, Mai; Tanaka, Rie; Ikemoto, Shinji

2017-01-01

Nutrition knowledge is necessary for individuals to adopt appropriate dietary habits, and needs to be evaluated before nutrition education is provided. However, there is no tool to assess general nutrition knowledge of adults in Japan. Our aims were to determine the validity and reliability of a general nutrition knowledge questionnaire for Japanese adults. We developed the pilot version of the Japanese general nutrition knowledge questionnaire (JGNKQ) and administered the pilot study to assess content validity and internal reliability to 1,182 Japanese adults aged 18-64 y. The JGNKQ was further modified based on the pilot study and the final version consisted of 5 sections and 147 items. The JGNKQ was administered to female undergraduate Japanese students in their senior year twice in 2015 to assess construct validity and test-retest reliability. Ninety-six students majoring in nutrition and 44 students in other majors who studied at the same university completed the first questionnaire. Seventy-five students completed the questionnaire twice. The responses from the first questionnaire and both questionnaires were used to assess construct validity and test-retest reliability, respectively. The students in nutrition major had significantly higher scores than the students in other majors on all sections of the questionnaire (p=0.000); therefore, the questionnaire had good construct validity. The test-retest reliability correlation coefficient value of overall and each section except "The use of dietary information to make dietary choices" were 0.75, 0.67, 0.67, 0.68 and 0.61, respectively. We suggest that the JGNKQ is an effective tool to assess the nutrition knowledge level of Japanese adults.
Validation of EncephalApp, Smartphone-based Stroop Test, for the Diagnosis of Covert Hepatic Encephalopathy

PubMed Central

Bajaj, Jasmohan S; Heuman, Douglas M; Sterling, Richard K; Sanyal, Arun J; Siddiqui, Muhammad; Matherly, Scott; Luketic, Velimir; Stravitz, R Todd; Fuchs, Michael; Thacker, Leroy R; Gilles, HoChong; White, Melanie B; Unser, Ariel; Hovermale, James; Gavis, Edith; Noble, Nicole A; Wade, James B

2014-01-01

Background & Aims Detection of covert hepatic encephalopathy (CHE) is difficult but point of care testing could increase rates of diagnosis. We aimed to validate the ability of the smartphone app EncephalApp, a streamlined version of Stroop App, to detect CHE. We evaluated face validity, test–retest reliability, and external validity. Methods Patients with cirrhosis (n=167; 38% with overt HE [OHE]; mean age, 55 years; mean model for end-stage liver disease score, 12) and controls (n=114) were each given a paper and pencil cognitive battery (standard) along with EncephalApp. EncephalApp has Off and On states; results measured were: OffTime, OnTime, OffTime+OnTime, and number of runs required to complete 5 off and on runs. Thirty-six patients with cirrhosis underwent driving simulation tests, and EncephalApp results were correlated with results. Test–retest reliability was analyzed in a subgroup of patients. The test was performed before and after transjugular intra-hepatic portosystemic shunt placement, before and after correction for hyponatremia, to determine external validity. Results All patients with cirrhosis performed worse on paper and pencil and EncephalApp tests than controls. Patients with cirrhosis and OHE performed worse than those without OHE. Age-dependent EncephalApp cut-offs (younger or older than 45 years) were set. An OffTime+OnTime value of >190 seconds identified all patients with CHE with an area under the receiver operator characteristic (AUROC) value of 0.91; the AUROC value was 0.88 for diagnosis of CHE in those without OHE. EncephalApp times correlated with crashes and illegal turns in driving simulation tests. Test–retest reliability was high (intra-class coefficient, 0.83) among 30 patients retested 1–3 months apart. OffTime+OnTime increased significantly (206 vs 255, P=.007) among 10 patients retested 33±7 days after transjugular intra-hepatic portosystemic shunt placement. OffTime+OnTime decreased significantly (242 vs 225, P=.03) in 7 patients tested before and after correction for hyponatremia (126±3 to 132±4 meq/L, P=.01), 10±5 days apart. Conclusions A smartphone app called EncephalApp has good face validity, test–retest reliability, and external validity for the diagnosis of CHE. PMID:24846278
Reliability of temporal summation and diffuse noxious inhibitory control

PubMed Central

Cathcart, Stuart; Winefield, Anthony H; Rolan, Paul; Lushington, Kurt

2009-01-01

BACKGROUND: The test-retest reliability of temporal summation (TS) and diffuse noxious inhibitory control (DNIC) has not been reported to date. Establishing such reliability would support the possibility of future experimental studies examining factors affecting TS and DNIC. Similarly, the use of manual algometry to induce TS, or an occlusion cuff to induce DNIC of TS to mechanical stimuli, has not been reported to date. Such devices may offer a simpler method than current techniques for inducing TS and DNIC, affording assessment at more anatomical locations and in more varied research settings. METHOD: The present study assessed the test-retest reliability of TS and DNIC using the above techniques. Sex differences on these measures were also investigated. RESULTS: Repeated measures ANOVA indicated successful induction of TS and DNIC, with no significant differences across test-retest occasions. Sex effects were not significant for any measure or interaction. Intraclass correlations indicated high test-retest reliability for all measures; however, there was large interindividual variation between test and retest measurements. CONCLUSION: The present results indicate acceptable within-session test-retest reliability of TS and DNIC. The results support the possibility of future experimental studies examining factors affecting TS and DNIC. PMID:20011713
THE DYNAMIC LEAP AND BALANCE TEST (DLBT): A TEST-RETEST RELIABILITY STUDY

PubMed Central

Newman, Thomas M.; Smith, Brent I.; John Miller, Sayers

2017-01-01

Background There is a need for new clinical assessment tools to test dynamic balance during typical functional movements. Common methods for assessing dynamic balance, such as the Star Excursion Balance Test, which requires controlled movement of body segments over an unchanged base of support, may not be an adequate measure for testing typical functional movements that involve controlled movement of body segments along with a change in base of support. Purpose/hypothesis The purpose of this study was to determine the reliability of the Dynamic Leap and Balance Test (DLBT) by assessing its test-retest reliability. It was hypothesized that there would be no statistically significant differences between testing days in time taken to complete the test. Study Design Reliability study Methods Thirty healthy college aged individuals participated in this study. Participants performed a series of leaps in a prescribed sequence, unique to the DLBT test. Time required by the participants to complete the 20-leap task was the dependent variable. Subjects leaped back and forth from peripheral to central targets alternating weight bearing from one leg to the other. Participants landed on the central target with the tested limb and were required to stabilize for two seconds before leaping to the next target. Stability was based upon qualitative measures similar to Balance Error Scoring System. Each assessment was comprised of three trials and performed on two days with a separation of at least six days. Results Two-way mixed ANOVA was used to analyze the differences in time to complete the sequence between the three trial averages of the two testing sessions. Intraclass Correlation Coefficient (ICC3,1) was used to establish between session test-retest reliability of the test trial averages. Significance was set a priori at p ≤ 0.05. No significant differences (p > 0.05) were detected between the two testing sessions. The ICC was 0.93 with a 95% confidence interval from 0.84 to 0.96. Conclusion This test is a cost-effective, easy to administer and clinically relevant novel measure for assessing dynamic balance that has excellent test-retest reliability. Clinical relevance As a new measure of dynamic balance, the DLBT has the potential to be a cost-effective, challenging and functional tool for clinicians. Level of Evidence 2b PMID:28900556
Test-retest reliability of resting-state magnetoencephalography power in sensor and source space.

PubMed

Martín-Buro, María Carmen; Garcés, Pilar; Maestú, Fernando

2016-01-01

Several studies have reported changes in spontaneous brain rhythms that could be used as clinical biomarkers or in the evaluation of neuropsychological and drug treatments in longitudinal studies using magnetoencephalography (MEG). There is an increasing necessity to use these measures in early diagnosis and pathology progression; however, there is a lack of studies addressing how reliable they are. Here, we provide the first test-retest reliability estimate of MEG power in resting-state at sensor and source space. In this study, we recorded 3 sessions of resting-state MEG activity from 24 healthy subjects with an interval of a week between each session. Power values were estimated at sensor and source space with beamforming for classical frequency bands: delta (2-4 Hz), theta (4-8 Hz), alpha (8-13 Hz), low beta (13-20 Hz), high beta (20-30 Hz), and gamma (30-45 Hz). Then, test-retest reliability was evaluated using the intraclass correlation coefficient (ICC). We also evaluated the relation between source power and the within-subject variability. In general, ICC of theta, alpha, and low beta power was fairly high (ICC > 0.6) while in delta and gamma power was lower. In source space, fronto-posterior alpha, frontal beta, and medial temporal theta showed the most reliable profiles. Signal-to-noise ratio could be partially responsible for reliability as low signal intensity resulted in high within-subject variability, but also the inherent nature of some brain rhythms in resting-state might be driving these reliability patterns. In conclusion, our results described the reliability of MEG power estimates in each frequency band, which could be considered in disease characterization or clinical trials. © 2015 Wiley Periodicals, Inc.
Test-Retest Reliability of the Self-Reported Impairments in Persons With Late Effects of Polio (SIPP) Rating Scale.

PubMed

Brogårdh, Christina; Lexell, Jan

2016-05-01

A new 13-item rating scale, the Self-Reported Impairments in Persons with Late Effects of Polio (SIPP), has been developed. The SIPP has been analyzed using the Rasch method and has shown good construct validity and internal consistency. To establish its clinical utility, further evaluation of its psychometric properties is needed. To evaluate the test-retest reliability of the SIPP and to define limits for the smallest change that indicates a real change, both for a group of persons and a single individual. A postal survey. University Hospital. Fifty-one persons (31 men and 20 women; mean age, 72 years) with clinically verified late effects of polio. Not applicable. The participants completed the SIPP twice, 2 weeks apart. The response frequencies at test occasion 1 (T1) and test occasion 2 (T2) were calculated. Test-retest reliability was analyzed using the percentage agreement of each item, the intraclass correlation coefficient, and the mean difference between the test occasions (đ), together with the 95% confidence intervals for đ, the standard error of measurement, the smallest real difference, and a Bland-Altman plot. The percentage agreement (ie, the same scoring at both test occasions) was >70% for 10 of 13 items. The mean score (standard deviation) was 27.9 (5.7) points at T1 and 28.2 (6.0) points at T2, with no systematic difference between the test occasions. The intraclass correlation coefficient was 0.88, the standard error of measurement (the smallest change for a group of persons) was 2.0 points, and the smallest real difference (the smallest change for a single individual) was 5.6 points, respectively. The SIPP is a reliable rating scale in persons with late effects of polio and can be used to evaluate effects of rehabilitation interventions and changes of perceived impairments over time both for a group of persons and for a single individual. Copyright © 2016 American Academy of Physical Medicine and Rehabilitation. Published by Elsevier Inc. All rights reserved.
Translation, adaptation and validation of a Portuguese version of the Moorehead-Ardelt Quality of Life Questionnaire II.

PubMed

Maciel, João; Infante, Paulo; Ribeiro, Susana; Ferreira, André; Silva, Artur C; Caravana, Jorge; Carvalho, Manuel G

2014-11-01

The prevalence of obesity has increased worldwide. An assessment of the impact of obesity on health-related quality of life (HRQoL) requires specific instruments. The Moorehead-Ardelt Quality of Life Questionnaire II (MA-II) is a widely used instrument to assess HRQoL in morbidly obese patients. The objective of this study was to translate and validate a Portuguese version of the MA-II.The study included forward and backward translations of the original MA-II. The reliability of the Portuguese MA-II was estimated using the internal consistency and test-retest methods. For validation purposes, the Spearman's rank correlation coefficient was used to evaluate the correlation between the Portuguese MA-II and the Portuguese versions of two other questionnaires, the 36-item Short Form Health Survey (SF-36) and the Impact of Weight on Quality of Life-Lite (IWQOL-Lite).One hundred and fifty morbidly obese patients were randomly assigned to test the reliability and validity of the Portuguese MA-II. Good internal consistency was demonstrated by a Cronbach's alpha coefficient of 0.80, and a very good agreement in terms of test-retest reliability was recorded, with an overall intraclass correlation coefficient (ICC) of 0.88. The total sums of MA-II scores and each item of MA-II were significantly correlated with all domains of SF-36 and IWQOL-Lite. A statistically significant negative correlation was found between the MA-II total score and BMI. Moreover, age, gender and surgical status were independent predictors of MA-II total score.A reliable and valid Portuguese version of the MA-II was produced, thus enabling the routine use of MA-II in the morbidly obese Portuguese population.
Translation, Cross-Cultural Adaptation, and Validation of the Activity Rating Scale for Disorders of the Knee.

PubMed

Flosadottir, Vala; Roos, Ewa M; Ageberg, Eva

2017-09-01

The Activity Rating Scale (ARS) for disorders of the knee evaluates the level of activity by the frequency of participation in 4 separate activities with high demands on knee function, with a score ranging from 0 (none) to 16 (pivoting activities 4 times/wk). To translate and cross-culturally adapt the ARS into Swedish and to assess measurement properties of the Swedish version of the ARS. Cohort study (diagnosis); Level of evidence, 2. The COSMIN guidelines were followed. Participants (N = 100 [55 women]; mean age, 27 years) who were undergoing rehabilitation for a knee injury completed the ARS twice for test-retest reliability. The Knee injury and Osteoarthritis Outcome Score (KOOS), Tegner Activity Scale (TAS), and modernized Saltin-Grimby Physical Activity Level Scale (SGPALS) were administered at baseline to validate the ARS. Construct validity and responsiveness of the ARS were evaluated by testing predefined hypotheses regarding correlations between the ARS, KOOS, TAS, and SGPALS. The Cronbach alpha, intraclass correlation coefficients, absolute reliability, standard error of measurement, smallest detectable change, and Spearman rank-order correlation coefficients were calculated. The ARS showed good internal consistency (α ≈ 0.96), good test-retest reliability (intraclass correlation coefficient >0.9), and no systematic bias between measurements. The standard error of measurement was less than 2 points, and the smallest detectable change was less than 1 point at the group level and less than 5 points at the individual level. More than 75% of the hypotheses were confirmed, indicating good construct validity and good responsiveness of the ARS. The Swedish version of the ARS is valid, reliable, and responsive for evaluating the level of activity based on the frequency of participation in high-demand knee sports activities in young adults with a knee injury.
Facial Angiofibroma Severity Index (FASI): reliability assessment of a new tool developed to measure severity and responsiveness to therapy in tuberous sclerosis-associated facial angiofibroma.

PubMed

Salido-Vallejo, R; Ruano, J; Garnacho-Saucedo, G; Godoy-Gijón, E; Llorca, D; Gómez-Fernández, C; Moreno-Giménez, J C

2014-12-01

Tuberous sclerosis complex (TSC) is an autosomal dominant neurocutaneous disorder characterized by the development of multisystem hamartomatous tumours. Topical sirolimus has recently been suggested as a potential treatment for TSC-associated facial angiofibroma (FA). To validate a reproducible scale created for the assessment of clinical severity and treatment response in these patients. We developed a new tool, the Facial Angiofibroma Severity Index (FASI) to evaluate the grade of erythema and the size and extent of FAs. In total, 30 different photographs of patients with TSC were shown to 56 dermatologists at each evaluation. Three evaluations using the same photographs but in a different random order were performed 1 week apart. Test and retest reliability and interobserver reproducibility were determined. There was good agreement between the investigators. Inter-rater reliability showed strong correlations (> 0.98; range 0.97-0.99) with inter-rater correlation coefficients (ICCs) for the FASI. The global estimated kappa coefficient for the degree of intra-rater agreement (test-retest) was 0.94 (range 0.91-0.97). The FASI is a valid and reliable tool for measuring the clinical severity of TSC-associated FAs, which can be applied in clinical practice to evaluate the response to treatment in these patients. © 2014 British Association of Dermatologists.
CONSISTENCY OF FIELD-BASED MEASURES OF NEUROMUSCULAR CONTROL USING FORCE PLATE DIAGNOSTICS IN ELITE MALE YOUTH SOCCER PLAYERS

PubMed Central

READ, PAUL; OLIVER, JON L.; DE STE CROIX, MARK B.A.; MYER, GREGORY D.; LLOYD, RHODRI S.

2016-01-01

Deficits in neuromuscular control during movement patterns such as landing are suggested pathomechanics that underlie sport-related injury. A common mode of assessment is measurement of landing forces during jumping tasks; however, these measures have been used less frequently in male youth soccer players and reliability data is sparse. The aim of this study was to examine the reliability of a field-based neuromuscular control screening battery using force plate diagnostics in this cohort. Twenty six pre-peak height velocity (PHV) and twenty five post-PHV elite male youth soccer players completed a drop vertical jump (DVJ), single leg 75% horizontal hop and stick (75%HOP) and single leg countermovement jump (SLCMJ). Measures of peak landing vertical ground reaction force (pVGRF), time to stabilisation (TTS), time to pVGRF, and pVGRF asymmetry were recorded. A test, re-test design was used and reliability statistics included: change in mean, intraclass correlation coefficient (ICC) and coefficient of variation (CV). No significant differences in mean score were reported for any of the assessed variables between test sessions. In both groups, pVGRF and asymmetry during the 75%HOP and SLCMJ demonstrated largely acceptable reliability (CV ≤ 10%). Greater variability was evident in DVJ pVGRF and all other assessed variables, across the three protocols (CV range = 13.8 – 49.7%). ICC values ranged from small to large and were generally higher in the post-PHV players. The results of this study suggest that pVGRF and asymmetry can be reliably assessed using a 75%HOP and SLCMJ in this cohort. These measures could be utilized to support a screening battery for elite male youth soccer players and for test re-test comparison. PMID:27075641
Translation and validation of a Nepalese version of the Psychosocial Impact of Dental Aesthetic Questionnaire (PIDAQ).

PubMed

Singh, Varun Pratap; Singh, Rajkumar

2014-03-01

The aim of this study was to develop a reliable and valid Nepali version of the Psychosocial Impact of Dental Aesthetic Questionnaire (PIDAQ). Cross-sectional descriptive validation study. B.P. Koirala Institute of Health Sciences, Dharan, Nepal. A rigorous translation process including conceptual and semantic evaluation, translation, back translation and pre-testing was carried out. Two hundred and fifty-two undergraduates, including equal numbers of males and females with an age ranging from 18 to 29 years (mean age: 22·33±2·114 years), participated in this study. Reliability was assessed by Cronbach's alpha coefficient and the coefficient of correlation was used to assess correlation between items and test-retest reliability. The construct validity was tested by factorial analysis. Convergent construct validity was tested by comparison of PIDAQ scores with the aesthetic component of the index of orthodontic treatment needs (IOTN-AC) and perception of occlusion scale (POS), respectively. Discriminant construct validity was assessed by differences in score for those who demand treatment and those who did not. The response rate was 100%. One hundred and twenty-three individuals had a demand for orthodontic treatment. The Nepali PIDAQ had excellent reliability with Cronbach's alpha of 0·945, corrected item correlation between 0·525 and 0·790 and overall test-retest reliability of 0·978. The construct validity was good with formation of a new sub-domain 'Dental self-consciousness'. The scale had good correlation with IOTN-AC and POS fulfilling convergent construct validity. The discriminant construct validity was proved by significant differences in scores for subjects with demand and without demand for treatment. To conclude, Nepali version of PIDAQ has good psychometric properties and can be used effectively in this population group for further research.
Reliability and Agreement of Neck Functional Capacity Evaluation Tests in Patients With Chronic Multifactorial Neck Pain.

PubMed

Reneman, M F; Roelofs, M; Schiphorst Preuper, H R

2017-07-01

To analyze test-retest reliability and agreement, and to explore the safety of neck functional capacity evaluation (Neck-FCE) tests in patients with chronic multifactorial neck pain. Test-retest; 2 FCE sessions were held with a 2-week interval. University-based outpatient rehabilitation center. Individuals (N=18; 14 women) with a mean age of 34 years. Not applicable. The Neck-FCE protocol consists of 6 tests: lifting waist to overhead (kg), 2-handed carrying (kg), overhead working (s), bending and overhead reaching (s), and repetitive side reaching (left and right) (s). Intraclass correlation coefficients (ICCs) and limits of agreement (LoA) were calculated. ICC point estimates between .75 and .90 were considered as good, and >.90 were considered as excellent reliability. ICC point estimates ranged between .39 and .96. Ratios of the LoA ranged between 32.0% and 56.5%. Mean ± SD numeric rating scale pain scores in the neck and shoulder 24 hours after the test were 6.7±2.6 and 6.3±3.0, respectively. Based on ICC point estimates and 95% confidence intervals, 3 tests had excellent reliability and 3 had poor reliability. LoA were substantial in all 6 tests. Safety was confirmed. Copyright © 2016 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.

The european organization for research and treatment of cancer quality of life questionnaire-BR 23 breast cancer-specific quality of life questionnaire: psychometric properties in a Moroccan sample of breast cancer patients

PubMed Central

2014-01-01

Background Quality of life (QOL) and its measurement in cancer patients is becoming increasingly important. Breast cancer diagnosis and treatment are often associated with psychological distress and reduced QoL. In Arabic-speaking countries, QoL of patients with cancer is inadequately studied. The aim of this study was to test the reliability and validity of the Moroccan Arabic version of the European Organization for Research and Treatment of Cancer (EORTC) Breast Cancer-Specific Quality of Life Questionnaire (QLQ-BR23). Methods After translation and cross-cultural adaptation, the questionnaire was tested on breast cancer patients. The participants’ number for the test and the retest were 105 and 37 respectively. Internal consistency was tested using Cronbach’s alpha coefficient (α), the test-retest reliability using intraclass correlation coefficients (ICC). Construct validity was assessed by examining item-convergent and divergent validity. Results The questionnaire was administered to 105 patients. The mean age of patients was 48 years (SD: 16), 62.9% were married. 68.6% of all participants lived in urban area. The average time to complete the QLQ- BR23 was 15 min. Cronbach’s alpha coefficient, were all >0.7, with the exception of breast symptoms and arm symptoms. All items exceeded the 0.4 criterion for convergent validity except item 20 and 23 related to pain and skin problems in the affected breast respectively. Conclusion In general, the findings of this study indicated that the Moroccan Arabic version of the EORTC QLQ-BR23 is a reliable and valid supplementary measure of the QOL in breast cancer patients and can be used in clinical trials and studies of outcome research in oncology. PMID:24447401
The European Organization for Research and Treatment of Cancer quality of life questionnaire-BR23 Breast Cancer-Specific Quality of Life Questionnaire: psychometric properties in a Moroccan sample of breast cancer patients.

PubMed

El Fakir, Samira; Abda, Naima; Bendahhou, Karima; Zidouh, Ahmed; Bennani, Maria; Errihani, Hassan; Benider, Abdelatif; Bekkali, Rachid; Nejjari, Chakib

2014-01-21

Quality of life (QOL) and its measurement in cancer patients is becoming increasingly important. Breast cancer diagnosis and treatment are often associated with psychological distress and reduced QoL. In Arabic-speaking countries, QoL of patients with cancer is inadequately studied.The aim of this study was to test the reliability and validity of the Moroccan Arabic version of the European Organization for Research and Treatment of Cancer (EORTC) Breast Cancer-Specific Quality of Life Questionnaire (QLQ-BR23). After translation and cross-cultural adaptation, the questionnaire was tested on breast cancer patients. The participants' number for the test and the retest were 105 and 37 respectively. Internal consistency was tested using Cronbach's alpha coefficient (α), the test-retest reliability using intraclass correlation coefficients (ICC). Construct validity was assessed by examining item-convergent and divergent validity. The questionnaire was administered to 105 patients. The mean age of patients was 48 years (SD: 16), 62.9% were married. 68.6% of all participants lived in urban area.The average time to complete the QLQ- BR23 was 15 min. Cronbach's alpha coefficient, were all >0.7, with the exception of breast symptoms and arm symptoms. All items exceeded the 0.4 criterion for convergent validity except item 20 and 23 related to pain and skin problems in the affected breast respectively. In general, the findings of this study indicated that the Moroccan Arabic version of the EORTC QLQ-BR23 is a reliable and valid supplementary measure of the QOL in breast cancer patients and can be used in clinical trials and studies of outcome research in oncology.
Development and validation of the Myasthenia Gravis Impairment Index.

PubMed

Barnett, Carolina; Bril, Vera; Kapral, Moira; Kulkarni, Abhaya; Davis, Aileen M

2016-08-30

We aimed to develop a measure of myasthenia gravis impairment using a previously developed framework and to evaluate reliability and validity, specifically face, content, and construct validity. The first draft of the Myasthenia Gravis Impairment Index (MGII) included examination items from available measures enriched with newly developed, patient-reported items, modified after patient input. International neuromuscular specialists evaluated face and content validity via an e-mail survey. Test-retest reliability was assessed in stable patients at a 3-week interval and interrater reliability was evaluated in the same day. Construct validity was assessed through correlations between the MGII and other measures and by comparing scores in different patient groups. The first draft was assessed by 18 patients, and 72 specialists answered the survey. The second draft had 7 examination and 22 patient-reported items. Field testing included 200 patients, with 54 patients completing the reliability studies. Test-retest reliability of the total score was good (intraclass correlation coefficient 0.92; 95% confidence interval 0.79-0.94), as was interrater reliability of the examination component (intraclass correlation coefficient 0.81; 95% confidence interval 0.79-0.94). The MGII correlated well with comparison measures, with higher correlations with the MG-activities of daily living (r = 0.91) and MG-specific quality of life 15-item scale (r = 0.78). When assessing different patient groups, the scores followed expected patterns. The MGII was developed using a patient-centered framework of myasthenia-related impairments and incorporating patient input throughout the development process. It is reliable in an outpatient setting and has demonstrated construct validity. Responsiveness studies are under way. © 2016 American Academy of Neurology.
Reliability and validity of the Turkish version of the Berg Balance Scale.

PubMed

Sahin, Fusun; Yilmaz, Figen; Ozmaden, Asli; Kotevolu, Nurdan; Sahin, Tulay; Kuran, Banu

2008-01-01

The purpose of this study was to develop a Turkish version of the Berg Balance Scale (BBS) and assess its reliability and validity. Sixty healthy volunteers older than 65 years were included in to the study. Subjects who had lower extremity amputation, or were armchair or bedridden were excluded. After translation process, the Turkish version of the scale was administered to each participant twice with an interval of 2 weeks. The intraclass correlation coefficient (ICC) was calculated to assess intra- and inter-observer reliability. Chronbach alpha was calculated to evaluate internal consistency of the total BBS score. Interclass correlation coefficient was calcuated to examine test-retest reliability. Convergent validity was assessed by correlating the scale with Modified Barthel Index (MBI) and Timed Up and Go Test (TUG). Construct validity was assessed with factor analysis. The mean age in years of the participants were 77.00+/-5.67 (range: 67-92 yrs). The ICC for intra- and inter- observer reliability was 0.98 (p<0.0001) and 0.97 (p<0.0001), respectively. Chronbach alpha of the Turkish version of the BBS was 0.98. The test-retest reliability (ICC) of the Turkish version of the BBS was determined as 0.98 for the total score, and ranged from 0.86-0.99 for individual items. In terms of validity, the Turkish version of the BBS was correlated with the MBI (in positive direction) and TUG (in negative direction) (r=0.67 p<0.0001; r=-0.75 p<0.0001, respectively). The Turkish version of the BBS is a reliable and valid scale to be used in balance assessment of Turkish older adults.
[Cross-cultural adaptation and validation of the Health and Taste Attitude Scale (HTAS) in Portuguese].

PubMed

Koritar, Priscila; Philippi, Sonia Tucunduva; Alvarenga, Marle dos Santos; Santos, Bernardo dos

2014-08-01

The scope of this study was to show the cross-cultural adaptation and validation of the Health and Taste Attitude Scale in Portuguese. The methodology included translation of the scale; evaluation of conceptual, operational and item-based equivalence by 14 experts and 51 female undergraduates; semantic equivalence and measurement assessment by 12 bilingual women by the paired t-test, the Pearson correlation coefficient and the coefficient intraclass correlation; internal consistency and test-retest reliability by Cronbach's alpha and intraclass correlation coefficient, respectively, after application on 216 female undergraduates; assessment of discriminant and concurrent validity via the t-test and Spearman's correlation coefficient, respectively, in addition to Confirmatory Factor and Exploratory Factor Analysis. The scale was considered adequate and easily understood by the experts and university students and presented good internal consistency and reliability (µ 0.86, ICC 0.84). The results show that the scale is valid and can be used in studies with women to better understand attitudes related to taste.
The Unsupported Upper Limb Exercise Test in People Without Disabilities: Assessing the Within-Day Test-Retest Reliability and the Effects of Age and Gender.

PubMed

Oliveira, Ana; Cruz, Joana; Jácome, Cristina; Marques, Alda

2018-01-01

Purpose: To estimate the within-day test-retest reliability and standard error of measurement (SEM) of the unsupported upper limb exercise test (UULEX) in adults without disabilities and to determine the effects of age and gender on performance of the UULEX. Method: A cross-sectional study was conducted with 100 adults without disabilities (44 men, mean age 44.2 [SD 26] y; 56 women, mean age 38.1 [SD 24.1] y). Participants performed three UULEX tests to establish within-day reliability, measured using an intra-class correlation coefficient (ICC) model 2 (two-way random effects) with a single rater (ICC[2,1]) and SEM. The effects of age and gender were examined using two-factor mixed-design analysis of variance (ANOVA) and one-way repeated-measures ANOVA. For analysis purposes, four sub-groups were created: younger adults, older adults, men, and women. Results: Excellent within-day reliability and a small SEM were found in the four sub-groups (younger adults: ICC[2,1]=0.88; 95% CI: 0.82, 0.92; SEM∼40 s; older adults: ICC[2,1]=0.82; 95% CI: 0.72, 0.90; SEM∼50 s; men: ICC[2,1]=0.93; 95% CI: 0.88, 0.96; SEM∼30 s; women: ICC[2,1]=0.85; 95% CI: 0.78, 0.91; SEM∼45 s). Younger adults took, on average, 308.24 seconds longer than older adults to perform the test; older adults performed significantly better on the third test ( p <0.0001; η 2 =0.096). Gender effects were not found ( p >0.05). Conclusion: The within-day test-retest reliability and SEM values of the UULEX may be used to define the magnitude of the error obtained with repeated measures. One UULEX test seems to be adequate for younger adults to achieve reliable results, whereas three tests seem to be needed for older adults.
Reliability and Construct Validity of the 6-Minute Racerunner Test in Children and Youth with Cerebral Palsy, GMFCS Levels III and IV.

PubMed

Bolster, Eline A M; Dallmeijer, Annet J; de Wolf, G Sander; Versteegt, Marieke; Schie, Petra E M van

2017-05-01

To determine the test-retest reliability and construct validity of a novel 6-Minute Racerunner Test (6MRT) in children and youth with cerebral palsy (CP) classified as Gross Motor Function Classification System (GMFCS) levels III and IV. The racerunner is a step-propelled tricycle. The participants were 38 children and youth with CP (mean age 11 y 2 m, SD 3 y 7 m; GMFCS III, n = 19; IV, n = 19). Racerunner capability was determined as the distance covered during the 6MRT on three occasions. The intraclass correlation coefficient (ICC), standard error of measurement (SEM), and smallest detectable differences (SDD) were calculated to assess test-retest reliability. The ICC for tests 2 and 3 were 0.89 (SDD 37%; 147 m) for children in level III and 0.91 for children in level IV (SDD 52%; 118 m). When the average of two separate test occasions was used, the SDDs were reduced to 26% (104 m; level III) and 37% (118 m; level IV). For tests 1 to 3, the mean distance covered increased from 345 m (SD 148 m) to 413 m (SD 137 m) for children in level III, and from 193 m (SD 100 m) to 239 m (SD 148 m) for children in level IV. Results suggest high test-retest reliability. However, large SDDs indicate that a single 6MRT measurement is only useful for individual evaluation when large improvements are expected, or when taking the average of two tests. The 6MRT discriminated the distance covered between children and youth in levels III and IV, supporting construct validity.
Cigarette dependence questionnaire: development and psychometric testing with male smokers.

PubMed

Huang, Chih-Ling; Lin, Hsi-Hui; Wang, Hsiu-Hung

2010-10-01

This paper is a report of a study conducted to develop and test a theoretically derived Cigarette Dependence Questionnaire for adult male smokers. Fagerstrom questionnaires have been used worldwide to assess cigarette dependence. However, these assessments lack any theoretical perspective. A theory-based approach is needed to ensure valid assessment. In 2007, an initial pool of 103 Cigarette Dependence Questionnaire items was distributed to 109 adult smokers in Taiwan. Item analysis was conducted to select items for inclusion in the refined scale. The psychometric properties of the Cigarette Dependence Questionnaire were further evaluated 2007-08, when it was administered to 256 respondents and their saliva was collected and analysed for cotinine levels. Criterion validity was established through the Pearson correlation between the scale and saliva cotinine levels. Exploratory factor analysis was used to test construct validity. Reliability was determined with Cronbach's alpha coefficient and a 2-week test-retest coefficient. The selection of 30 items for seven perspectives was based on item analysis. One factor accounting for 44.9% of the variance emerged from the factor analysis. The factor was named as cigarette dependence. Cigarette Dependence Questionnaire scores were statistically significantly correlated with saliva cotinine levels (r = 0.21, P = 0.01). Cronbach's alpha was 0.95 and test-retest reliability using an intra-class correlation was 0.92. The Cigarette Dependence Questionnaire showed sound reliability and validity and could be used by nurses to set up smoking cessation interventions based on assessment of cigarette dependence. © 2010 Blackwell Publishing Ltd.
Reliability and Normative Data for the Dynamic Visual Acuity Test for Vestibular Screening.

PubMed

Riska, Kristal M; Hall, Courtney D

2016-06-01

The purpose of this study was to determine reliability of computerized dynamic visual acuity (DVA) testing and to determine reference values for younger and older adults. A primary function of the vestibular system is to maintain gaze stability during head motion. The DVA test quantifies gaze stabilization with the head moving versus stationary. Commercially available computerized systems allow clinicians to incorporate DVA into their assessment; however, information regarding reliability and normative values of these systems is sparse. Forty-six healthy adults, grouped by age, with normal vestibular function were recruited. Each participant completed computerized DVA testing including static visual acuity, minimum perception time, and DVA using the NeuroCom inVision System. Testing was performed by two examiners in the same session and then repeated at a follow-up session 3 to 14 days later. Intraclass correlation coefficients (ICCs) were used to determine inter-rater and test-retest reliability. ICCs for inter-rater reliability ranged from 0.323 to 0.937 and from 0.434 to 0.909 for horizontal and vertical head movements, respectively. ICCs for test-retest reliability ranged from 0.154 to 0.856 and from 0.377 to 0.9062 for horizontal and vertical head movements, respectively. Overall, raw scores (left/right DVA and up/down DVA) were more reliable than DVA loss scores. Reliability of a commercially available DVA system has poor-to-fair reliability for DVA loss scores. The use of a convergence paradigm and not incorporating the forced choice paradigm may contribute to poor reliability.
Validity and Reliability of Persian Version of HIV/AIDS Related Stigma Scale for People Living With HIV/AIDS in Iran.

PubMed

Pourmarzi, Davoud; Khoramirad, Ashraf; Ahmari Tehran, Hoda; Abedini, Zahra

2015-11-01

To assess the perceived HIV/AIDS related stigma a comprehensive and well developed stigma instrument is necessary. This study aimed to assess validity and reliability of the Persian version of HIV/AIDS related stigma scale which was developed by Kang et al for people living with HIV/AIDS in Iran. Thescale was forward translatedby two bilingual academic members then both translations were discussed by expert team. Back-translation was done by two other bilingual translators then we carried out discussion with both of them. To evaluate understandability the scale was administered to 10 Persons Living with HIV/AIDS (PLWHA). Final Persian version was administered to 80 PLWHA in Qom, Iran in 2014. Test-retest reliability was assessed in a sample of 20 PLWHA after a week by intra-class correlation coefficient (ICC). Cronbach's alpha coefficient for overall scale was 0.85. Also Cronbach's alpha coefficients for the five subscales were as follows: social rejection (9 items, α = 0.84), negative self-worth (4 items, α = 0.70), perceived interpersonal insecurity (2 items, α = 0.57), financial insecurity (3 items, α = 0.70), discretionary disclosure (2 items, α = 0.83). Test-retest reliability was also approved with ICC = 0.78. Correlation between items and their hypothesized subscale is greater than 0.5. Correlation between an item and its own subscale was significantly higher than its correlation with other subscales. This study demonstrate that the Persian version of HIV/AIDS related stigma scale is valid and reliable to assess HIV/AIDS related stigma perceived by people living whit HIV/AIDS in Iran.
Test-retest reliability of barbell velocity during the free-weight bench-press exercise.

PubMed

Stock, Matt S; Beck, Travis W; DeFreitas, Jason M; Dillon, Michael A

2011-01-01

The purpose of this study was to calculate test-retest reliability statistics for peak barbell velocity during the free-weight bench-press exercise for loads corresponding to 10-90% of the 1-repetition maximum (1RM). Twenty-one healthy, resistance-trained men (mean ± SD age = 23.5 ± 2.7 years; body mass = 90.5 ± 14.6 kg; 1RM bench press = 125.4 ± 18.4 kg) volunteered for this study. A minimum of 48 hours after a maximal strength testing and familiarization session, the subjects performed single repetitions of the free-weight bench-press exercise at each tenth percentile (10-90%) of the 1RM on 2 separate occasions. For each repetition, the subjects were instructed to press the barbell as rapidly as possible, and peak barbell velocity was measured with a Tendo Weightlifting Analyzer. The test-retest intraclass correlation coefficients (model 2,1) and corresponding standard errors of measurement (expressed as percentages of the mean barbell velocity values) were 0.717 (4.2%), 0.572 (5.0%), 0.805 (3.1%), 0.669 (4.7%), 0.790 (4.6%), 0.785 (4.8%), 0.811 (5.8%), 0.714 (10.3%), and 0.594 (12.6%) for the weights corresponding to 10-90% 1RM. There were no mean differences between the barbell velocity values from trials 1 and 2. These results indicated moderate to high test-retest reliability for barbell velocity from 10 to 70% 1RM but decreased consistency at 80 and 90% 1RM. When examining barbell velocity during the free-weight bench-press exercise, greater measurement error must be overcome at 80 and 90% 1RM to be confident that an observed change is meaningful.
Chinese adaptation and validation of the patellofemoral pain severity scale.

PubMed

Cheung, Roy T H; Ngai, Shirley P C; Lam, Priscillia L; Chiu, Joseph K W; Fung, Eric Y H

2013-05-01

This study validated the Patellofemoral Pain Severity Scale translated into Chinese. The Chinese Patellofemoral Pain Severity Scale was translated from the original English version following standard forward and backward translation procedures recommended by the International Society for Pharmacoeconomics and Outcomes Research. The survey was then conducted in clinical settings by a questionnaire comprising the Chinese Patellofemoral Pain Severity Scale, Kujala Scale and Western Ontario and McMaster Universities (WOMAC) Osteoarthritis Index. Eighty-four Chinese reading patients with patellofemoral pain were recruited from physical therapy clinics. Internal consistency of the translated instrument was measured by Cronbach alpha. Convergent validity was examined by Spearman rank correlation coefficient (rho) tests by comparing its score with the validated Chinese version of the Kujala Scale and the WOMAC Osteoarthritis Index while the test-retest reliability was evaluated by administering the questionnaires twice. Cronbach alpha values of individual questions and their overall value were above 0.85. Strong association was found between the Chinese Patellofemoral Pain Severity Scale and the Kujala Scale (rho = -0.72, p < 0.001). Moderate correlation was also found between Chinese Patellofemoral Pain Severity Scale with the WOMAC Osteoarthritis Index (rho = 0.63, p < 0.001). Excellent test-retest reliability (Intraclass correlation coefficient = 0.98) was demonstrated. The Chinese translated version of the Patellofemoral Pain Severity Scale is a reliable and valid instrument for patients with patellofemoral pain.
Arabic validation of the Urogenital Distress Inventory and Adapted Incontinence Impact Questionnaires--short forms.

PubMed

El-Azab, Ahmed S; Mascha, Edward J

2009-01-01

The purpose of this study was to adapt the IIQ-7 to suit the Egyptian culture and then to assess validity and reliability of the adapted and translated IIQ-7 and UDI-6. IIQ-7 was modified to suit Egyptian culture. Linguistic validation of the two questionnaires was done. Initial test-retest reliability and internal consistency of adapted translated questionnaires were done in a pilot study. The final validity, test-retest reliability and internal consistency study included 204 women with urinary incontinence (UI). Participants completed the two questionnaires at enrollment and after 2 weeks. All participants underwent urodynamics. Baseline urodynamic diagnosis was compared with diagnoses made by questionnaires to assess validity. Test-retest reliability was excellent for both the IIQ-7 and UDI-6. For the UDI-6, the mean difference (SD) between first and second visits was -1.63 (7.0), and the 95% CI for the mean difference was -2.6 and -0.68. The 95% limits of agreement were -15.3 and 12.0. Lin's concordance correlation coefficient (LCCC) (95% CI) for the UDI was 0.89 (0.85 and 0.91). For the IIQ-7, the mean difference (SD) was 0.37 (7.1), and the 95% CI for the mean difference was -0.60 and 1.3. The 95% limits of agreement were -13.5 and 14.2. LCCC (95% CI) for the IIQ was 0.90 (0.87 and 0.92). Internal consistency as assessed using Cronbach's alpha was 0.32 and 0.31 for the UDI-6 and IIQ-7, respectively. Validity assessments indicated that both IIQ and UDI scales can distinguish objective disease states. UDI-6 and the modified IIQ-7 are easy to administer, test-retest reliable, and valid questionnaires, with relatively low internal consistency. (c) 2008 Wiley-Liss, Inc.
Falls and confidence related quality of life outcome measures in an older British cohort

PubMed Central

Parry, S; Steen, N; Galloway, S; Kenny, R; Bond, J

2001-01-01

Falls are common in older subjects and result in loss of confidence and independence. The Falls Efficacy Scale (FES) and the Activities-specific Balance Confidence scale (ABC) were developed in North America to quantify these entities, but contain idiom unfamiliar to an older British population. Neither has been validated in the UK. The FES and the ABC were modified for use within British culture and the internal consistency and test-retest reliability of the modified scales (FES-UK and ABC-UK) assessed. A total of 193 consecutive, ambulant, new, and return patients (n=119; 62%) and their friends and relatives ("visitors", n=74; 38%) were tested on both scales, while the last 60 subjects were retested within one week. Internal reliability was excellent for both scales (Cronbach's alpha 0.97 (FES-UK), and 0.98 (ABC-UK)). Test-retest reliability was good for both scales, though superior for the ABC-UK (intraclass correlation coefficient 0.58 (FES-UK), 0.89 (ABC-UK)). There was evidence to suggest that the ABC-UK was better than the FES-UK at distinguishing between older patients and younger patients (|tABC| = 4.4; |tFES| = 2.3); and between fallers and non-fallers (|tABC| = 8.7; |tFES| = 5.0) where the t statistics are based on the comparison of two independent samples. The ABC-UK and FES-UK are both reliable and valid measures for the assessment of falls and balance related confidence in older adults. However, better test-retest reliability and more robust differentiation of subgroups in whom falls related quality of life would be expected to be different make the ABC-UK the current instrument of choice in assessing this entity in older British subjects.   Keywords: quality of life; falls; elderly; health status measurement PMID:11161077
[Turkish validity and reliability study of fear of pain questionnaire-III].

PubMed

Ünver, Seher; Turan, Fatma Nesrin

2018-01-01

This study aimed to develop a Turkish version of the Fear of Pain Questionnaire-III developed by McNeil and Rainwater (1998) and examine its validity and reliability indicators. The study was conducted with 459 university students studying in the nursing department. The Turkish translation of the scale was conducted by language experts and the original scale owner. Expert opinions were taken for language validity, and the Lawshe's content validity ratio formula was used to calculate the content validity. Exploratory factor analysis was used to assess the construct validity. The factors were rotated using the Varimax rotation (orthogonal) method. For reliability indicators of the questionnaire, the internal consistency coefficient and test re-test reliability were utilized. Explanatory factor analyses using the three-factor model (explaining 50.5% of the total variance) revealed that the item factor loads varied were above the limit value of 0.30 which indicated that the questionnaire had good construct validity. The Cronbach's alpha value for the total questionnaire was 0.938, and test re-test value was 0.846 for the total scale. The Turkish version of the Fear of Pain Questionnaire-III had sufficiently high reliability and validity to be used as a tool in evaluating the fear of pain among the young Turkish population.
Reliability and validity of a low load endurance strength test for upper and lower extremities in patients with fibromyalgia.

PubMed

Munguía-Izquierdo, Diego; Legaz-Arrese, Alejandro

2012-11-01

To evaluate the reliability, standard error of the mean (SEM), clinical significant change, and known group validity of 2 assessments of endurance strength to low loads in patients with fibromyalgia syndrome (FS). Cross-sectional reliability and comparative study. University Pablo de Olavide, Seville, Spain. Middle-aged women with FS (n=95) and healthy women (n=64) matched for age, weight, and body mass index (BMI) were recruited for the study. Not applicable. The endurance strength to low loads tests of the upper and lower extremities and anthropometric measures (BMI) were used for the evaluations. The differences between the readings (tests 1 and 2) and the SDs of the differences, intraclass correlation coefficient (ICC) model (2,1), 95% confidence interval for the ICC, coefficient of repeatability, intrapatient SD, SEM, Wilcoxon signed-rank test, and Bland-Altman plots were used to examine reliability. A Mann-Whitney U test was used to analyze the differences in test values between the patient group and the control group. We hypothesized that patients with FS would have an endurance strength to low loads performance in lower and upper extremities at least twice as low as that of the healthy controls. Satisfactory test-retest reliability and SEMs were found for the lower extremity, dominant arm, and nondominant arm tests (ICC=.973-.979; P<.001; SEMs=1.44-1.66 repetitions). The differences in the mean between the test and retest were lower than the SEM for all performed tests, varying from -.10 to .29 repetitions. No significant differences were found between the test and retest (P>.05 for all). The Bland-Altman plots showed 95% limits of agreement for the lower extremity (4.7 to -4.5), dominant arm (3.8 to -4.4), and nondominant arm (3.9 to -4.1) tests. The endurance strength to low loads test scores for the patients with FS were 4-fold lower than for the controls in all performed tests (P<.001 for all). The endurance strength to low loads tests showed good reliability and known group validity and can be recommended for evaluating endurance strength to low loads in patients with FS. For individual evaluation, however, an improved score of at least 4 and 5 repetitions for the upper and lower extremities, respectively, was required for the differences to be considered as substantial clinical change. Patients with FS showed impaired endurance strength to low loads performance when compared with the general population. Copyright © 2012 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Parental self-efficacy in childhood overweight: validation of the Lifestyle Behavior Checklist in the Netherlands.

PubMed

Gerards, Sanne M P L; Hummel, Karin; Dagnelie, Pieter C; de Vries, Nanne K; Kremers, Stef P J

2013-01-18

Evaluating whether parental challenges and self-efficacy toward managing children's lifestyle behaviors are successfully addressed by interventions requires valid instruments. The Lifestyle Behavior Checklist (LBC) has recently been developed in the Australian context. It consists of two subscales: the Problem scale, which measures parental perceptions of children's behavioral problems related to overweight and obesity, and the Confidence scale, measuring parental self-efficacy in dealing with these problems. The aim of the current study was to systematically translate the questionnaire into Dutch and to evaluate its internal consistency, construct validity and test-retest reliability. The LBC was systematically translated by four experts at Maastricht University. In total, 392 parents of 3-to13-year-old children were invited to fill out two successive online questionnaires with a two-week interval. Of these, 273 parents responded to the first questionnaire (test, response rate = 69.6%), and of the 202 who could be invited for the second questionnaire (retest), 100 responded (response rate = 49.5%). We assessed the questionnaire's internal consistency (Cronbach's α), construct validity (Spearman's Rho correlation tests, using the criterion measures: restrictiveness, nurturance, and psychological control), and test-retest reliability (Spearman's Rho correlation tests). Both scales had high internal consistency (Cronbach's α ≥ 0.90). Spearman correlation coefficients indicated acceptable test-retest reliability for both the Problem scale (rs = 0.74) and the Confidence scale (rs = 0.70). The LBC Problem scale was significantly correlated to all criterion scales (nurturance, restrictiveness, psychological control) in the hypothesized direction, and the LBC Confidence scale was significantly correlated with nurturance and psychological control in the hypothesized direction, but not with restrictiveness. The Dutch translation of the LBC was found to be a reliable and reasonably valid questionnaire to measure parental perceptions of children's weight-related problem behavior and the extent to which parents feel confident to manage these problems.
Validation of an adapted arabic version of fibromyalgia syndrome impact questionnaire.

PubMed

El-Naby, Mai Abd; Hefny, Mohamed Ahmed; Fahim, Ayman Ekram; Awadalla, Magdy Ahmed

2013-10-01

Fibromyalgia (FM) is the most common chronic pain syndrome encountered in medical practice, affecting females more than males, and the estimated prevalence of FM in Egypt is 1.3 %. The aim was to translate and adapt the Fibromyalgia Impact Questionnaire (FIQ) into Arabic and assess reliability and validity. The Arabic version of Fibromyalgia Impact Questionnaire (FIQ-A) was adapted following the forward/backward translation approach. Fifty-one female patients with FM were studied to assess psychometric properties of the FIQ-A. Reliability was analyzed by the correlation coefficient between test and retest. Internal consistency was checked by the Cronbach's alpha coefficient. Construct validity was assessed comparing FIQ-A with Health Assessment Questionnaire (HAQ), Health Assessment Questionnaire of Fibromyalgia (FHAQ), The Medical Outcome Survey Short-Form-36 (SF-36), and the Total Visual Analog Scale (TVAS) for FM symptom, and feasibility was assessed by the time taken in completing the FIQ-A and the proportion of patients completed the questionnaire. Patients studied were 33.2 ± 9.8 years old. Translation was concordant. Adaptation affected 4 sub-items of physical function. Test-retest correlation coefficient was 0.89 for total FIQ-A and Cronbach's alpha was 0.76. Excellent to good statistically significant correlations (p < 0.05) were found between the FIQ-A items and HAQ, FHAQ, and SF-36. The FIQ-A is a reliable, valid for measuring health status and physical function in Arabic-speaking FM patients.
Reliability and Construct Validity of Yo-Yo Tests in Untrained and Soccer-Trained Schoolgirls Aged 9-16.

PubMed

Póvoas, Susana C; Castagna, Carlo; da Costa Soares, José Manuel; Silva, Pedro; Coelho-E-Silva, Manuel João; Matos, Fernando; Krustrup, Peter

2016-05-01

The reliability and construct validity of three age-adapted-intensity Yo-Yo tests were evaluated in untrained (n = 67) vs. soccer-trained (n = 65) 9- to 16-year-old schoolgirls. Tests were performed 7 days apart for reliability (9- to 11-year-old: Yo-Yo intermittent recovery level 1 children's test; 12- to 13-yearold: Yo-Yo intermittent endurance level 1; and 14- to 16-year-old: Yo-Yo intermittent endurance level 2). Yo-Yo distance covered was 40% (776 ± 324 vs. 556 ± 156 m), 85% (1252 ± 484 vs. 675 ± 252 m) and 138% (674 ± 336 vs. 283 ± 66 m) greater (p ≤ .010) for the soccer-trained than for the untrained girls aged 9-11, 12-13 and 14-16 years, respectively. Typical errors of measurement for Yo-Yo distance covered, expressed as a percentage of the coefficient of variation (confidence limits), were 10.1% (8.1-13.7%), 11.0% (8.6-15.4%) and 11.6% (9.2-16.1%) for soccer players, and 11.5% (9.1-15.8%), 14.1% (11.0-19.8%) and 10.6% (8.5-14.2%) for untrained girls, aged 9-11, 12-13 and 14-16, respectively. Intraclass correlation coefficient values for test-retest were excellent (0.795-0.973) in both groups. No significant differences were observed in relative exercise peak heart rate (%HRpeak) between groups during test and retest. The Yo-Yo tests are reliable for determining intermittent-exercise capacity and %HRpeak for soccer players and untrained 9- to 16-year-old girls. They also possess construct validity with better performances for soccer players compared with untrained age-matched girls, despite similar %HRpeak.
The Brazilian version of the effort-reward imbalance questionnaire to assess job stress.

PubMed

Chor, Dóra; Werneck, Guilherme Loureiro; Faerstein, Eduardo; Alves, Márcia Guimarães de Mello; Rotenberg, Lúcia

2008-01-01

The effort-reward imbalance (ERI) model has been used to assess the health impact of job stress. We aimed at describing the cross-cultural adaptation of the ERI questionnaire into Portuguese and some psychometric properties, in particular internal consistency, test-retest reliability, and factorial structure. We developed a Brazilian version of the ERI using a back-translation method and tested its reliability. The test-retest reliability study was conducted with 111 health workers and University staff. The current analyses are based on 89 participants, after exclusion of those with missing data. Reproducibility (interclass correlation coefficients) for the "effort", "'reward", and "'overcommitment"' dimensions of the scale was estimated at 0.76, 0.86, and 0.78, respectively. Internal consistency (Cronbach's alpha) estimates for these same dimensions were 0.68, 0.78, and 0.78, respectively. The exploratory factorial structure was fairly consistent with the model's theoretical components. We conclude that the results of this study represent the first evidence in favor of the application of the Brazilian Portuguese version of the ERI scale in health research in populations with similar socioeconomic characteristics.

Validation of the Fatigue Impact Scale in Hungarian patients with multiple sclerosis.

PubMed

Losonczi, Erika; Bencsik, Krisztina; Rajda, Cecília; Lencsés, Gyula; Török, Margit; Vécsei, László

2011-03-01

Fatigue is one of the most frequent complaints of patients with multiple sclerosis (MS). The Fatigue Impact Scale (FIS), one of the 30 available fatigue questionnaires, is commonly applied because it evaluates multidimensional aspects of fatigue. The main purposes of this study were to test the validity, test-retest reliability, and internal consistency of the Hungarian version of the FIS. One hundred and eleven MS patients and 85 healthy control (HC) subjects completed the FIS and the Beck Depression Inventory, a large majority of them on two occasions, 3 months apart. The total FIS score and subscale scores differed statistically between the MS patients and the HC subjects in both FIS sessions. In the test-retest reliability assessment, statistically, the intraclass correlation coefficients were high in both the MS and HC groups. Cronbach's alpha values were also notably high. The results of this study indicate that the FIS can be regarded as a valid and reliable scale with which to improve our understanding of the impact of fatigue on the health-related quality of life in MS patients without severe disability.
Psychometric Properties of Performance-based Measurements of Functional Capacity: Test-Retest Reliability, Practice Effects, and Potential Sensitivity to Change

PubMed Central

Leifker, Feea R.; Patterson, Thomas L.; Bowie, Christopher R.; Mausbach, Brent T.; Harvey, Philip D.

2010-01-01

Performance-based measures of the ability to perform social and everyday living skills are being more widely used to assess functional capacity in people with serious mental illnesses such as schizophrenia and bipolar disorder. Since they are also being used as outcome measures in pharmacological and cognitive remediation studies aimed at cognitive impairments in schizophrenia, understanding their measurement properties and potential sensitivity to change is important. In this study, the test-retest reliability, practice effects, and reliable change indices of two different performance-based functional capacity measures, the UCSD Performance-based skills assessment (UPSA) and Social skills performance assessment (SSPA) were examined over several different retest intervals in two different samples of people with schizophrenia (n’s=238 and 116) and a healthy comparison sample (n=109). These psychometric properties were compared to those of a neuropsychological assessment battery. Test-retest reliabilities of the long form of the UPSA ranged from r=.63 to r=.80 over follow-up periods up to 36 months in people with schizophrenia, while brief UPSA reliabilities ranged from r=.66 to r=.81. Test-retest reliability of the NP performance scores ranged from r=.77 to r=.79. Test-retest reliabilities of the UPSA were lower in healthy controls, while NP performance was slightly more reliable. SSPA test-retest reliability was lower. Practice effect sizes ranged from .05 to .16 for the UPSA and .07 to .19 for the NP assessment in patients, with HC having more practice effects. Reliable change intervals were consistent across NP and both FC measures, indicating equal potential for detection of change. These performance-based measures of functional capacity appear to have similar potential to be sensitive to change compared to NP performance in people with schizophrenia. PMID:20399613
Reliability and Validity of the Turkish Version of the Gastrointestinal Symptom Rating Scale.

PubMed

Turan, Nuray; Aşt, Türkinaz Atabek; Kaya, Nurten

The purpose of this methodological study is to investigate the validity and reliability of the Turkish version of the Gastrointestinal Symptom Rating Scale (GSRS). The scale was adapted to the Turkish language via backward translation. Content validity was examined by referring to experts. Reliability was examined via test-retest reliability and internal consistency, and validity was examined with divergent and convergent validity. The Epworth Sleepiness Scale (ESS) and the Marlowe-Crowne Social Desirability Scale (MCSDS) were used for divergent validity. As for convergent validity, the Constipation Severity Instrument (CSI) and the Patient Assessment of Constipation Quality of Life Scale (PAC-QOLQ) were utilized. The relationship between the GSRS and the health-related quality of life (36-item short-form health survey [SF-36]) was also analyzed. The study population consisted of patients in orthopedic clinic who volunteered to participate. Test-retest reliability was examined with the participation of 30 patients; internal consistency and validity were examined with 150 patients. Test-retest reliability correlation coefficients of the GSRS varied from 0.39 to 0.87 for all items. For internal consistency, the GSRS's item total correlation was found to be 0.17-0.67, and Cronbach α was 0.82 for all items. There was a positive linear significant correlation between the GSRS, CSI, and PAC-QOLQ. There was no significant correlation between the GSRS, MCSDS, and ESS. Higher GSRS scores inversely correlated with general quality of life (SF-36). The Turkish version of the GSRS has been found to be a reliable and valid instrument for assessing patients' gastrointestinal symptoms. Therefore, this instrument can be confidently used with Turkish individuals.
Intraobserver reliability of contact pachymetry in children.

PubMed

Weise, Katherine K; Kaminski, Brett; Melia, Michele; Repka, Michael X; Bradfield, Yasmin S; Davitt, Bradley V; Johnson, David A; Kraker, Raymond T; Manny, Ruth E; Matta, Noelle S; Schloff, Susan

2013-04-01

Central corneal thickness (CCT) is an important measurement in the treatment and management of pediatric glaucoma and potentially of refractive error, but data regarding reliability of CCT measurement in children are limited. The purpose of this study was to evaluate the reliability of CCT measurement with the use of handheld contact pachymetry in children. We conducted a multicenter intraobserver test-retest reliability study of more than 3,400 healthy eyes in children aged from newborn to 17 years by using a handheld contact pachymeter (Pachmate DGH55; DGH Technology Inc, Exton, PA) in 2 clinical settings--with the use of topical anesthesia in the office and with the patient under general anesthesia in a surgical facility. The overall standard error of measurement, including only measurements with standard deviation ≤5 μm, was 8 μm; the corresponding coefficient of repeatability, or limits within which 95% of test-retest differences fell, was ±22.3 μm. However, standard error of measurement increased as CCT increased, from 6.8 μm for CCT less than 525 μm, to 12.9 μm for CCT 625 μm and greater. The standard error of measurement including measurements with standard deviation >5 μm was 10.5 μm. Age, sex, race/ethnicity group, and examination setting did not influence the magnitude of test-retest differences. CCT measurement reliability in children via the Pachmate DGH55 handheld contact pachymeter is similar to that reported for adults. Because thicker CCT measurements are less reliable than thinner measurements, a second measure may be helpful when the first exceeds 575 μm. Reliability is also improved by disregarding measurements with instrument-reported standard deviations >5 μm. Copyright © 2013 American Association for Pediatric Ophthalmology and Strabismus. Published by Mosby, Inc. All rights reserved.
Validation and cross-cultural pilot testing of compliance with standard precautions scale: self-administered instrument for clinical nurses.

PubMed

Lam, Simon C

2014-05-01

To perform detailed psychometric testing of the compliance with standard precautions scale (CSPS) in measuring compliance with standard precautions of clinical nurses and to conduct cross-cultural pilot testing and assess the relevance of the CSPS on an international platform. A cross-sectional and correlational design with repeated measures. Nursing students from a local registered nurse training university, nurses from different hospitals in Hong Kong, and experts in an international conference. The psychometric properties of the CSPS were evaluated via internal consistency, 2-week and 3-month test-retest reliability, concurrent validation, and construct validation. The cross-cultural pilot testing and relevance check was examined by experts on infection control from various developed and developing regions. Among 453 participants, 193 were nursing students, 165 were enrolled nurses, and 95 were registered nurses. The results showed that the CSPS had satisfactory reliability (Cronbach α = 0.73; intraclass correlation coefficient, 0.79 for 2-week test-retest and 0.74 for 3-month test-retest) and validity (optimum correlation with criterion measure; r = 0.76, P < .001; satisfactory results on known-group method and hypothesis testing). A total of 19 experts from 16 countries assured that most of the CSPS findings were relevant and globally applicable. The CSPS demonstrated satisfactory results on the basis of the standard international criteria on psychometric testing, which ascertained the reliability and validity of this instrument in measuring the compliance of clinical nurses with standard precautions. The cross-cultural pilot testing further reinforced the instrument's relevance and applicability in most developed and developing regions.
The Pareidolia Test: A Simple Neuropsychological Test Measuring Visual Hallucination-Like Illusions.

PubMed

Mamiya, Yasuyuki; Nishio, Yoshiyuki; Watanabe, Hiroyuki; Yokoi, Kayoko; Uchiyama, Makoto; Baba, Toru; Iizuka, Osamu; Kanno, Shigenori; Kamimura, Naoto; Kazui, Hiroaki; Hashimoto, Mamoru; Ikeda, Manabu; Takeshita, Chieko; Shimomura, Tatsuo; Mori, Etsuro

2016-01-01

Visual hallucinations are a core clinical feature of dementia with Lewy bodies (DLB), and this symptom is important in the differential diagnosis and prediction of treatment response. The pareidolia test is a tool that evokes visual hallucination-like illusions, and these illusions may be a surrogate marker of visual hallucinations in DLB. We created a simplified version of the pareidolia test and examined its validity and reliability to establish the clinical utility of this test. The pareidolia test was administered to 52 patients with DLB, 52 patients with Alzheimer's disease (AD) and 20 healthy controls (HCs). We assessed the test-retest/inter-rater reliability using the intra-class correlation coefficient (ICC) and the concurrent validity using the Neuropsychiatric Inventory (NPI) hallucinations score as a reference. A receiver operating characteristic (ROC) analysis was used to evaluate the sensitivity and specificity of the pareidolia test to differentiate DLB from AD and HCs. The pareidolia test required approximately 15 minutes to administer, exhibited good test-retest/inter-rater reliability (ICC of 0.82), and moderately correlated with the NPI hallucinations score (rs = 0.42). Using an optimal cut-off score set according to the ROC analysis, and the pareidolia test differentiated DLB from AD with a sensitivity of 81% and a specificity of 92%. Our study suggests that the simplified version of the pareidolia test is a valid and reliable surrogate marker of visual hallucinations in DLB.
Individualized quality of life in patients with low back pain: reliability and validity of the Patient Generated Index.

PubMed

Løchting, Ida; Grotle, Margreth; Storheim, Kjersti; Werner, Erik L; Garratt, Andrew M

2014-09-01

To evaluate the reliability and validity of the improved version of the Patient Generated Index (PGI) in patients with low back pain. The PGI was administered to 90 patients attending care in 1 of 6 institutions in Norway and evaluated for reliability and validity. The questionnaire was given out to 61 patients for re-test purposes. The PGI was completed correctly by 80 (88.9%) patients and, of the 61 patients responding to the re-test, 50 (82.0%) completed both surveys correctly. PGI scores were approximately normally distributed, with a median of 40 (range 80), where 100 is the best possible quality of life. There were no floor or ceiling effects. The 5 most frequently listed areas affecting quality of life were pain, sleep, stiffness, socializing and housework. The test-retest intraclass correlation coefficient was 0.73. The smallest detectable changes for individual and group purposes were 32.8 and 4.6, respectively. The correlations between PGI scores and other instrument scores followed a priori hypotheses of low to moderate correlations. The PGI has evidence for reliability and validity in Norwegian patients with low back pain at the group level and may be considered for application in intervention studies when a comprehensive evaluation of quality of life is important. However, the smallest detectable change, of approximately 30 points, may be considered too large for individual purposes in clinical applications.
Exercise-Induced Hypoalgesia After Isometric Wall Squat Exercise: A Test-Retest Reliabilty Study.

PubMed

Vaegter, Henrik Bjarke; Lyng, Kristian Damgaard; Yttereng, Fredrik Wannebo; Christensen, Mads Holst; Sørensen, Mathias Brandhøj; Graven-Nielsen, Thomas

2018-05-19

Isometric exercises decrease pressure pain sensitivity in exercising and nonexercising muscles known as exercise-induced hypoalgesia (EIH). No studies have assessed the test-retest reliability of EIH after isometric exercise. This study investigated the EIH on pressure pain thresholds (PPTs) after an isometric wall squat exercise. The relative and absolute test-retest reliability of the PPT as a test stimulus and the EIH response in exercising and nonexercising muscles were calculated. In two identical sessions, PPTs of the thigh and shoulder were assessed before and after three minutes of quiet rest and three minutes of wall squat exercise, respectively, in 35 healthy subjects. The relative test-retest reliability of PPT and EIH was determined using analysis of variance models, Person's r, and intraclass correlations (ICCs). The absolute test-retest reliability of EIH was determined based on PPT standard error of measurements and Cohen's kappa for agreement between sessions. Squat increased PPTs of exercising and nonexercising muscles by 16.8% ± 16.9% and 6.7% ± 12.9%, respectively (P < 0.001), with no significant differences between sessions. PPTs within and between sessions showed moderately strong correlations (r ≥ 0.74) and excellent (ICC ≥ 0.84) within-session (rest) and between-session test-retest reliability. EIH responses of exercising and nonexercising muscles showed no systematic errors between sessions; however, the relative test-retest reliability was low (ICCs = 0.03-0.43), and agreement in EIH responders and nonresponders between sessions was not significant (κ < 0.13, P > 0.43). A wall squat exercise increased PPTs compared with quiet rest; however, the relative and absolute reliability of the EIH response was poor. Future research is warranted to investigate the reliability of EIH in clinical pain populations.
Development of the Seasonal Migrant Agricultural Worker Stress Scale in Sanliurfa, Southeast Turkey.

PubMed

Simsek, Zeynep; Ersin, Fatma; Kirmizitoprak, Evin

2016-01-01

Stress is one of the main causes of health problems, especially mental disorders. These health problems cause a significant amount of ability loss and increase cost. It is estimated that by 2020, mental disorders will constitute 15% of the total disease burden, and depression will rank second only after ischemic heart disease. Environmental experiences are paramount in increasing the liability of mental disorders in those who constantly face sustained high levels of stress. The objective of this study was to develop a stress scale for seasonal migrant agricultural workers aged 18 years and older. The sample consisted of 270 randomly selected seasonal migrant agricultural workers. The average age of the participants was 33.1 ± 14, and 50.7% were male. The Cronbach alpha coefficient and test-retest methods were used for reliability analyses. Although the factor analysis was performed for the structure validity of the scale, the Kaiser-Meyer-Olkin coefficient and Bartlett test were used to determine the convenience of the data for the factor analysis. In the reliability analyses, the Cronbach alpha coefficient of internal consistency was calculated as .96, and the test-retest reliability coefficient was .81. In the exploratory factor analysis for validity of the scale, four factors were obtained, and the factors represented workplace physical conditions (25.7% of the total variance), workplace psychosocial and economic factors (19.3% of the total variance), workplace health problems (15.2% of the total variance), and school problems (10.1% of the total variance). The four factors explained 70.3% of the total variance. As a result of the expert opinions and analyses, a stress scale with 48 items was developed. The highest score to be obtained from the scale was 144, and the lowest score was 0. The increase in the score indicates the increase in the stress levels. The findings show that the scale is a valid and reliable assessment instrument that can be used in epidemiological research and planning interventions.
Inter-rater and test-retest reliability of quality assessments by novice student raters using the Jadad and Newcastle-Ottawa Scales.

PubMed

Oremus, Mark; Oremus, Carolina; Hall, Geoffrey B C; McKinnon, Margaret C

2012-01-01

Quality assessment of included studies is an important component of systematic reviews. The authors investigated inter-rater and test-retest reliability for quality assessments conducted by inexperienced student raters. Student raters received a training session on quality assessment using the Jadad Scale for randomised controlled trials and the Newcastle-Ottawa Scale (NOS) for observational studies. Raters were randomly assigned into five pairs and they each independently rated the quality of 13-20 articles. These articles were drawn from a pool of 78 papers examining cognitive impairment following electroconvulsive therapy to treat major depressive disorder. The articles were randomly distributed to the raters. Two months later, each rater re-assessed the quality of half of their assigned articles. McMaster Integrative Neuroscience Discovery and Study Program. 10 students taking McMaster Integrative Neuroscience Discovery and Study Program courses. The authors measured inter-rater reliability using κ and the intraclass correlation coefficient type 2,1 or ICC(2,1). The authors measured test-retest reliability using ICC(2,1). Inter-rater reliability varied by scale question. For the six-item Jadad Scale, question-specific κs ranged from 0.13 (95% CI -0.11 to 0.37) to 0.56 (95% CI 0.29 to 0.83). The ranges were -0.14 (95% CI -0.28 to 0.00) to 0.39 (95% CI -0.02 to 0.81) for the NOS cohort and -0.20 (95% CI -0.49 to 0.09) to 1.00 (95% CI 1.00 to 1.00) for the NOS case-control. For overall scores on the six-item Jadad Scale, ICC(2,1)s for inter-rater and test-retest reliability (accounting for systematic differences between raters) were 0.32 (95% CI 0.08 to 0.52) and 0.55 (95% CI 0.41 to 0.67), respectively. Corresponding ICC(2,1)s for the NOS cohort were -0.19 (95% CI -0.67 to 0.35) and 0.62 (95% CI 0.25 to 0.83), and for the NOS case-control, the ICC(2,1)s were 0.46 (95% CI -0.13 to 0.92) and 0.83 (95% CI 0.48 to 0.95). Inter-rater reliability was generally poor to fair and test-retest reliability was fair to excellent. A pilot rating phase following rater training may be one way to improve agreement.
Development and validation of the Japanese version of cognitive flexibility scale.

PubMed

Oshiro, Keiko; Nagaoka, Sawako; Shimizu, Eiji

2016-05-17

Various instruments have been developed to assess cognitive flexibility, which is an important construct in psychology. Among these, the self-report cognitive flexibility scale (CFS) is particularly popular for use with English speakers; however, there is not yet a Japanese version of this scale. This study reports on the development of a Japanese version of the cognitive flexibility scale (CFS-J), and the assessment of its internal consistency, test-retest reliability, and validities. We used the standard translation-back-translation process to develop the Japanese wording of the items and tested these using a sample of 335 eligible participants who did not have a mental illness, were aged 18 years or older, and lived in the suburbs of Tokyo. Participants included office workers, public servants, and college students; 71.6 % were women and 64.8 % were students. The translated scale's internal consistency reliability was assessed by calculating Cronbach's alpha and McDonald's omega, and test-retest reliability was assessed with 107 eligible participants via intra-class correlation coefficient (ICC) and Spearman's correlation of coefficient. Exploratory factory analysis (EFA) and correlations with other scales were used to examine the factor-based and concurrent validities of the CFS-J. Results indicated that the CFS-J has good internal consistency (Cronbach's alpha = 0.847, McDonald's omega = 0.871) and acceptable test-retest reliability (Spearman's = 0.687, ICC = 0.689). EFA provided evidence that the CFS-J has a one-factor structure and factor loadings were generally appropriate. The total CFS-J score was significantly and positively correlated with the cognitive flexibility inventory-Japanese version and its two subscales, along with the cognitive control scale and the positive subscale of the short Japanese version of the automatic thought questionnaire-revised (ATQ-R); further, it had a significantly negative correlation with the negative subscale of the ATQ-R (ps < 0.001). This study developed a Japanese version of the cognitive flexibility scale and confirmed its reliability and validity among a sample of people with no current mental illness, who were living in the suburbs of Tokyo.
Reliability and validity of the test of incremental respiratory endurance measures of inspiratory muscle performance in COPD

PubMed Central

Formiga, Magno F; Roach, Kathryn E; Vital, Isabel; Urdaneta, Gisel; Balestrini, Kira; Calderon-Candelario, Rafael A

2018-01-01

Purpose The Test of Incremental Respiratory Endurance (TIRE) provides a comprehensive assessment of inspiratory muscle performance by measuring maximal inspiratory pressure (MIP) over time. The integration of MIP over inspiratory duration (ID) provides the sustained maximal inspiratory pressure (SMIP). Evidence on the reliability and validity of these measurements in COPD is not currently available. Therefore, we assessed the reliability, responsiveness and construct validity of the TIRE measures of inspiratory muscle performance in subjects with COPD. Patients and methods Test–retest reliability, known-groups and convergent validity assessments were implemented simultaneously in 81 male subjects with mild to very severe COPD. TIRE measures were obtained using the portable PrO2 device, following standard guidelines. Results All TIRE measures were found to be highly reliable, with SMIP demonstrating the strongest test–retest reliability with a nearly perfect intraclass correlation coefficient (ICC) of 0.99, while MIP and ID clustered closely together behind SMIP with ICC values of about 0.97. Our findings also demonstrated known-groups validity of all TIRE measures, with SMIP and ID yielding larger effect sizes when compared to MIP in distinguishing between subjects of different COPD status. Finally, our analyses confirmed convergent validity for both SMIP and ID, but not MIP. Conclusion The TIRE measures of MIP, SMIP and ID have excellent test–retest reliability and demonstrated known-groups validity in subjects with COPD. SMIP and ID also demonstrated evidence of moderate convergent validity and appear to be more stable measures in this patient population than the traditional MIP. PMID:29805255
Development and assessment of the validity and reliability of a scale for measuring the mentoring competencies of Japanese clinical midwives: An exploratory quantitative research study.

PubMed

Hishinuma, Yuri; Horiuchi, Shigeko; Yanai, Haruo

2016-06-01

Midwives are always involved in educational activities whenever novice midwives are present. Although various scales for measuring the educational competencies of nurses have already been developed in previous studies, a scale for the educational competencies particular to midwives has yet to be developed, or even no previous studies have revealed their functions as clinical educators. The purpose of this study was to develop a scale to measure the mentoring competencies of clinical midwives (MCCM Scale) and to confirm its validity and reliability. An exploratory quantitative research study. Questionnaires were distributed to 1,645 midwives at 148 facilities who had previously instructed novice midwives. 1,004 midwives (61.0%) voluntarily returned valid responses and 296 (18.0%) voluntarily agreed to participate in the survey for test-retest reliability. Exploratory factor analyses were performed over 41 items and the following seven factors were extracted with a reliability coefficient (Cronbach's α) of 0.953: (i) supporting experimental study, (ii) personal characteristics particularly in clinical educators, (iii) thoughtfulness and empathy for new midwives, (iv) self-awareness and self-reflection for finding confidence, (v) making effective use of the new midwives' own experience, (vi) commitment to educational activities, and (vii) sharing their midwifery practice. Test-retest reliability was measured based on a convenience sample of 246 (83.1%). Pearson's test-retest correlation coefficient for the entire scale was r=0.863. The factor loadings of each item on its respective factor were 0.313-0.925. The total score of the MCCM Scale was positively correlated with that of the Quality of Nurses' Occupational Experience Scale (r=0.641, p=0.000) and was negatively correlated with the total score of the Japanese Burnout Scale (r=-0.480, p=0.000). The MCCM Scale is composed of 41 items and three subscales measured from a total of seven factors. The validity and reliability of the MCCM Scale was supported by the statistical analyses. Copyright © 2016 Elsevier Ltd. All rights reserved.
Reliability, validity, and responsiveness of the Persian version of Shoulder Activity Scale in a group of patients with shoulder disorders.

PubMed

Negahban, Hossein; Mohtasebi, Elham; Goharpey, Shahin

2015-01-01

The aim of this methodological study was to cross-culturally translate the Shoulder Activity Scale (SAS) into the Persian and determine its clinimetric properties including reliability, validity, and responsiveness in patients with shoulder disorders. Persian version of the SAS was obtained after standard forward-backward translation. Three questionnaires were completed by the respondents: SAS, shoulder pain and disability index (SPADI), and Short-Form 36 Health Survey (SF-36). The patients completed the SAS, 1 week after the first visit to evaluate the test-retest reliability. Construct validity was evaluated by examining the associations between the scores on the SAS and the scores obtained from the SPADI, SF-36, and age of the patients. To assess responsiveness, data were collected in the first visit and then again after 4 weeks physiotherapy intervention. Test-retest reliability and internal consistency were assessed using Intra-class Correlation Coefficient (ICC) and Cronbach's alpha, respectively. To evaluate construct validity, Spearman's rank correlation was used. The ability of the SAS to detect changes was evaluated by the receiver-operating characteristics method. No problem or language difficulties were reported during translation process. Test-retest reliability of the SAS was excellent with an ICC of 0.98. Also, the marginal Cronbach's alpha level of 0.64 was obtained. The correlation between the SAS and the SPADI was low, proving divergent validity, whereas the correlations between the SAS and the SF-36/age were moderate proving convergent validity. A marginally acceptable responsiveness was achieved for the Persian SAS. The study provides some evidences to support the test-retest reliability, internal consistency, construct validity, and responsiveness of the Persian version of the SAS in patients with shoulder disorders. Therefore, it seems that this instrument is a useful measure of shoulder activity level in research setting and clinical practice. The shoulder activity scale (SAS) is a reliable, valid, and responsive measure of shoulder activity level in Persian-speaking patients with different shoulder disorders. The results on clinimetric properties of the Persian SAS are comparable with its original, English version. Persian version of the SAS can be used in "clinical" and "research" settings of patients with shoulder disorders.
Cross-cultural adaptation and validation of the Saudi Arabic version of the Knee Injury and Osteoarthritis Outcome Score (KOOS).

PubMed

Alfadhel, Saud A; Vennu, Vishal; Alnahdi, Ali H; Omar, Mohammed T; Alasmari, Saeed H; AlJafri, Zahra; Bindawas, Saad M

2018-06-07

The Knee Injury Osteoarthritis Outcome Score (KOOS) is a widely used joint-specific measure employed to evaluate pain, symptoms, activities of daily living, recreational activities, and quality of life in patients with knee osteoarthritis (OA). Although the original KOOS has been translated into many languages, a Saudi Arabic version is not available. This study aimed to culturally adapt and evaluate the psychometric properties of the Saudi Arabic version of the KOOS in patients with knee OA. The original KOOS was translated and adapted into Saudi Arabic version over six stages according to the guidelines suggested by Beaton and recommended by the American Association of Orthopedic Surgeons Outcome Committee. Patients diagnosed with knee OA (n = 136) were recruited to examine the psychometric properties, such as internal consistency that was tested using Cronbach's alpha, test-retest reliability that was analyzed using the intra-class correlation coefficient (ICC 2,1 ), and construct validity that examined by testing the correlations between the new version subscales, Form 36 Health Survey subscales, and the Visual Analog Scale, Spearman's correlation coefficient (r s ) was used to measure the correlations. A total of 122 (89.7%) of the 136 participants with knee OA completed the second re-test of new Saudi Arabic version. Excellent internal consistency (Cronbach's alpha = 0.87-0.92) was detected in the subscales of the adapted version, as well as excellent test-retest reliability (ICC 2,1 = 0.92-0.94). The pattern of correlation between the subscales of the Saudi Arabic version of the KOOS, SF-36 domains and the Visual Analog Scale for pain supported the construct validity of the adapted version. The Saudi Arabic version of the KOOS was well accepted and exhibited excellent reliability, internal consistency, and construct validity in Saudi patients with knee OA.
Three-dimensional assessment of the asymptomatic and post-stroke shoulder: intra-rater test-retest reliability and within-subject repeatability of the palpation and digitization approach.

PubMed

Pain, Liza A M; Baker, Ross; Sohail, Qazi Zain; Richardson, Denyse; Zabjek, Karl; Mogk, Jeremy P M; Agur, Anne M R

2018-03-23

Altered three-dimensional (3D) joint kinematics can contribute to shoulder pathology, including post-stroke shoulder pain. Reliable assessment methods enable comparative studies between asymptomatic shoulders of healthy subjects and painful shoulders of post-stroke subjects, and could inform treatment planning for post-stroke shoulder pain. The study purpose was to establish intra-rater test-retest reliability and within-subject repeatability of a palpation/digitization protocol, which assesses 3D clavicular/scapular/humeral rotations, in asymptomatic and painful post-stroke shoulders. Repeated measurements of 3D clavicular/scapular/humeral joint/segment rotations were obtained using palpation/digitization in 32 asymptomatic and six painful post-stroke shoulders during four reaching postures (rest/flexion/abduction/external rotation). Intra-class correlation coefficients (ICCs), standard error of the measurement and 95% confidence intervals were calculated. All ICC values indicated high to very high test-retest reliability (≥0.70), with lower reliability for scapular anterior/posterior tilt during external rotation in asymptomatic subjects, and scapular medial/lateral rotation, humeral horizontal abduction/adduction and axial rotation during abduction in post-stroke subjects. All standard error of measurement values demonstrated within-subject repeatability error ≤5° for all clavicular/scapular/humeral joint/segment rotations (asymptomatic ≤3.75°; post-stroke ≤5.0°), except for humeral axial rotation (asymptomatic ≤5°; post-stroke ≤15°). This noninvasive, clinically feasible palpation/digitization protocol was reliable and repeatable in asymptomatic shoulders, and in a smaller sample of painful post-stroke shoulders. Implications for Rehabilitation In the clinical setting, a reliable and repeatable noninvasive method for assessment of three-dimensional (3D) clavicular/scapular/humeral joint orientation and range of motion (ROM) is currently required. The established reliability and repeatability of this proposed palpation/digitization protocol will enable comparative 3D ROM studies between asymptomatic and post-stroke shoulders, which will further inform treatment planning. Intra-rater test-retest repeatability, which is measured by the standard error of the measure, indicates the range of error associated with a single test measure. Therefore, clinicians can use the standard error of the measure to determine the "true" differences between pre-treatment and post-treatment test scores.
Reliable Change Indices and Standardized Regression-Based Change Score Norms for Evaluating Neuropsychological Change in Children with Epilepsy

PubMed Central

Busch, Robyn M.; Lineweaver, Tara T.; Ferguson, Lisa; Haut, Jennifer S.

2015-01-01

Reliable change index scores (RCIs) and standardized regression-based change score norms (SRBs) permit evaluation of meaningful changes in test scores following treatment interventions, like epilepsy surgery, while accounting for test-retest reliability, practice effects, score fluctuations due to error, and relevant clinical and demographic factors. Although these methods are frequently used to assess cognitive change after epilepsy surgery in adults, they have not been widely applied to examine cognitive change in children with epilepsy. The goal of the current study was to develop RCIs and SRBs for use in children with epilepsy. Sixty-three children with epilepsy (age range 6–16; M=10.19, SD=2.58) underwent comprehensive neuropsychological evaluations at two time points an average of 12 months apart. Practice adjusted RCIs and SRBs were calculated for all cognitive measures in the battery. Practice effects were quite variable across the neuropsychological measures, with the greatest differences observed among older children, particularly on the Children’s Memory Scale and Wisconsin Card Sorting Test. There was also notable variability in test-retest reliabilities across measures in the battery, with coefficients ranging from 0.14 to 0.92. RCIs and SRBs for use in assessing meaningful cognitive change in children following epilepsy surgery are provided for measures with reliability coefficients above 0.50. This is the first study to provide RCIs and SRBs for a comprehensive neuropsychological battery based on a large sample of children with epilepsy. Tables to aid in evaluating cognitive changes in children who have undergone epilepsy surgery are provided for clinical use. An excel sheet to perform all relevant calculations is also available to interested clinicians or researchers. PMID:26043163
Assessing the psychometric properties of two food addiction scales.

PubMed

Lemeshow, Adina R; Gearhardt, Ashley N; Genkinger, Jeanine M; Corbin, William R

2016-12-01

While food addiction is well accepted in popular culture and mainstream media, its scientific validity as an addictive behavior is still under investigation. This study evaluated the reliability and validity of the Yale Food Addiction Scale and Modified Yale Food Addiction Scale using data from two community-based convenience samples. We assessed the internal and test-retest reliability of the Yale Food Addiction Scale and Modified Yale Food Addiction Scale, and estimated the sensitivity and negative predictive value of the Modified Yale Food Addiction Scale using the Yale Food Addiction Scale as the benchmark. We calculated Cronbach's alphas and 95% confidence intervals (CIs) for internal reliability and Cohen's Kappa coefficients and 95% CIs for test-retest reliability. Internal consistency (n=232) was marginal to good, ranging from α=0.63 to 0.84. The test-retest reliability (n=45) for food addiction diagnosis was substantial, with Kappa=0.73 (95% CI, 0.48-0.88) (Yale Food Addiction Scale) and 0.79 (95% CI, 0.66-1.00) (Modified Yale Food Addiction Scale). Sensitivity and negative predictive value for classifying food addiction status were excellent: compared to the Yale Food Addiction Scale, the Modified Yale Food Addiction Scale's sensitivity was 92.3% (95% CI, 64%-99.8%), and the negative predictive value was 99.5% (95% CI, 97.5%-100%). Our analyses suggest that the Modified Yale Food Addiction Scale may be an appropriate substitute for the Yale Food Addiction Scale when a brief measure is needed, and support the continued use of both scales to investigate food addiction. Copyright Â© 2016 Elsevier Ltd. All rights reserved.
Bruininks-Oseretsky Test of Motor Proficiency: Further Verification with 3- to 5- yr. -old Children.

ERIC Educational Resources Information Center

Beitel, Patricia A.; Mead, Barbara J.

1982-01-01

The Bruininks-Oseretsky Test of Motor Proficiency was evaluated to determine test-retest reliability and if there were presensitizing effects at retest for four- to five-year olds. Test reliability was significantly high. No significant test sensitization of the short form to retesting with the short form or subtests was found. (Author/RD)
Psychometric Properties of the Chinese Version of the Occupational Fatigue Exhaustion/Recovery Scale: A Test in a Nursing Population.

PubMed

Fang, Jin-Bo; Zhou, Chun-Fen; Huang, Jing; Qiu, Chang-Jian

2018-06-01

The Occupational Fatigue Exhaustion/Recovery Scale (OFER) was designed to assess occupational fatigue in nurses. Although the original English version of this instrument has shown high degrees of reliability and validity, a Chinese version of this scale has yet to be verified. The aim of this study was to evaluate the psychometric properties of the OFER in a population of Chinese nurses. The scale was translated using translation and back-translation. The validities and reliabilities were evaluated on 923 qualified participants using content validity index, concurrent validity, factorial validity, internal consistency reliability, and test-retest reliability. The content validity index for the OFER was .92. The correlation coefficients between the scores of the OFER subscales and the criteria in this study (varying from -.498 to .705) verified that the OFER has acceptable concurrent validity. Principal component analysis and confirmatory factor analysis revealed that three factors correspond to the structure of the original instrument and that recovery mediates the relationship between acute and chronic fatigue. The Cronbach's alpha for the chronic fatigue, acute fatigue, and intershift recovery subscales were .83, .85, and .86, respectively. Test-retest reliabilities with correlation coefficients from .61 to .78 were found in the three subscales. OFER is a reliable and valid instrument for assessing work-related fatigue in Chinese nurses. However, further improvement of the acute fatigue subscale is recommended. The OFER has the potential to elicit information that is useful for assessing fatigue in nurses in China. Furthermore, as it differentiates between acute and chronic fatigue, OFER may be an effective tool for guiding the development and implementation of various, related intervention measures.

Test-retest reliability of memory task functional magnetic resonance imaging in Alzheimer disease clinical trials.

PubMed

Atri, Alireza; O'Brien, Jacqueline L; Sreenivasan, Aishwarya; Rastegar, Sarah; Salisbury, Sibyl; DeLuca, Amy N; O'Keefe, Kelly M; LaViolette, Peter S; Rentz, Dorene M; Locascio, Joseph J; Sperling, Reisa A

2011-05-01

To examine the feasibility and test-retest reliability of encoding-task functional magnetic resonance imaging (fMRI) in mild Alzheimer disease (AD). Randomized, double-blind, placebo-controlled study. Memory clinical trials unit. We studied 12 patients with mild AD (mean [SEM] Mini-Mental State Examination score, 24.0 [0.7]; mean Clinical Dementia Rating score, 1.0) who had been taking donepezil hydrochloride for more than 6 months from the placebo arm of a larger 24-week study (n = 24, 4 scans on weeks 0, 6, 12, and 24, respectively). Placebo and 3 face-name, paired-associate encoding, block-design blood oxygenation level-dependent fMRI scans in 12 weeks. We performed whole-brain t maps (P < .001, 5 contiguous voxels) and hippocampal regions-of-interest analyses of extent (percentage of active voxels) and magnitude (percentage of signal change) for novel-greater-than-repeated face-name contrasts. We also calculated intraclass correlation coefficients and power estimates for hippocampal regions of interest. Task tolerability and data yield were high (95 of 96 scans yielded favorable-quality data). Whole-brain maps were stable. Right and left hippocampal regions-of-interest intraclass correlation coefficients were 0.59 to 0.87 and 0.67 to 0.74, respectively. To detect 25.0% to 50.0% changes in week-0 to week-12 hippocampal activity using left-right extent or right magnitude with 80.0% power (2-sided α = .05) requires 14 to 51 patients. Using left magnitude requires 125 patients because of relatively small signal to variance ratios. Encoding-task fMRI was successfully implemented in a single-site, 24-week, AD randomized controlled trial. Week 0 to 12 whole-brain t maps were stable, and test-retest reliability of hippocampal fMRI measures ranged from moderate to substantial. Right hippocampal magnitude may be the most promising of these candidate measures in a leveraged context. These initial estimates of test-retest reliability and power justify evaluation of encoding-task fMRI as a potential biomarker for signal of effect in exploratory and proof-of-concept trials in mild AD. Validation of these results with larger sample sizes and assessment in multisite studies is warranted.
The validity and reliability of a dynamic neuromuscular stabilization-heel sliding test for core stability.

PubMed

Cha, Young Joo; Lee, Jae Jin; Kim, Do Hyun; You, Joshua Sung H

2017-10-23

Core stabilization plays an important role in the regulation of postural stability. To overcome shortcomings associated with pain and severe core instability during conventional core stabilization tests, we recently developed the dynamic neuromuscular stabilization-based heel sliding (DNS-HS) test. The purpose of this study was to establish the criterion validity and test-retest reliability of the novel DNS-HS test. Twenty young adults with core instability completed both the bilateral straight leg lowering test (BSLLT) and DNS-HS test for the criterion validity study and repeated the DNS-HS test for the test-retest reliability study. Criterion validity was determined by comparing hip joint angle data that were obtained from BSLLT and DNS-HS measures. The test-retest reliability was determined by comparing hip joint angle data. Criterion validity was (ICC2,3) = 0.700 (p< 0.05), suggesting a good relationship between the two core stability measures. Test-retest reliability was (ICC3,3) = 0.953 (p< 0.05), indicating excellent consistency between the repeated DNS-HS measurements. Criterion validity data demonstrated a good relationship between the gold standard BSLLT and DNS-HS core stability measures. Test-retest reliability data suggests that DNS-HS core stability was a reliable test for core stability. Clinically, the DNS-HS test is useful to objectively quantify core instability and allow early detection and evaluation.
Validation of the Simple Shoulder Test in a Portuguese-Brazilian population. Is the latent variable structure and validation of the Simple Shoulder Test Stable across cultures?

PubMed

Neto, Jose Osni Bruggemann; Gesser, Rafael Lehmkuhl; Steglich, Valdir; Bonilauri Ferreira, Ana Paula; Gandhi, Mihir; Vissoci, João Ricardo Nickenig; Pietrobon, Ricardo

2013-01-01

The validation of widely used scales facilitates the comparison across international patient samples. The objective of this study was to translate, culturally adapt and validate the Simple Shoulder Test into Brazilian Portuguese. Also we test the stability of factor analysis across different cultures. The objective of this study was to translate, culturally adapt and validate the Simple Shoulder Test into Brazilian Portuguese. Also we test the stability of factor analysis across different cultures. The Simple Shoulder Test was translated from English into Brazilian Portuguese, translated back into English, and evaluated for accuracy by an expert committee. It was then administered to 100 patients with shoulder conditions. Psychometric properties were analyzed including factor analysis, internal reliability, test-retest reliability at seven days, and construct validity in relation to the Short Form 36 health survey (SF-36). Factor analysis demonstrated a three factor solution. Cronbach's alpha was 0.82. Test-retest reliability index as measured by intra-class correlation coefficient (ICC) was 0.84. Associations were observed in the hypothesized direction with all subscales of SF-36 questionnaire. The Simple Shoulder Test translation and cultural adaptation to Brazilian-Portuguese demonstrated adequate factor structure, internal reliability, and validity, ultimately allowing for its use in the comparison with international patient samples.
Validation of the Simple Shoulder Test in a Portuguese-Brazilian Population. Is the Latent Variable Structure and Validation of the Simple Shoulder Test Stable across Cultures?

PubMed Central

Neto, Jose Osni Bruggemann; Gesser, Rafael Lehmkuhl; Steglich, Valdir; Bonilauri Ferreira, Ana Paula; Gandhi, Mihir; Vissoci, João Ricardo Nickenig; Pietrobon, Ricardo

2013-01-01

Background The validation of widely used scales facilitates the comparison across international patient samples. The objective of this study was to translate, culturally adapt and validate the Simple Shoulder Test into Brazilian Portuguese. Also we test the stability of factor analysis across different cultures. Objective The objective of this study was to translate, culturally adapt and validate the Simple Shoulder Test into Brazilian Portuguese. Also we test the stability of factor analysis across different cultures. Methods The Simple Shoulder Test was translated from English into Brazilian Portuguese, translated back into English, and evaluated for accuracy by an expert committee. It was then administered to 100 patients with shoulder conditions. Psychometric properties were analyzed including factor analysis, internal reliability, test-retest reliability at seven days, and construct validity in relation to the Short Form 36 health survey (SF-36). Results Factor analysis demonstrated a three factor solution. Cronbach’s alpha was 0.82. Test-retest reliability index as measured by intra-class correlation coefficient (ICC) was 0.84. Associations were observed in the hypothesized direction with all subscales of SF-36 questionnaire. Conclusion The Simple Shoulder Test translation and cultural adaptation to Brazilian-Portuguese demonstrated adequate factor structure, internal reliability, and validity, ultimately allowing for its use in the comparison with international patient samples. PMID:23675436
Test-retest reliability of the Progressive Isoinertial Lifting Evaluation (PILE).

PubMed

Lygren, Hildegunn; Dragesund, Tove; Joensen, Jón; Ask, Tove; Moe-Nilssen, Rolf

2005-05-01

A repeated measures single group design. To investigate test-retest reliability of Progressive Isoinertial Lifting Evaluation on patients with long lasting musculoskeletal problems related to the lumbar spine. Test-retest reliability has been satisfactory in healthy men. Test-retest reliability for clinical populations has not been reported. A total of 31 patients (17 women and 14 men) with long lasting low back pain participated in the study. The patients were tested twice at an interval of 2 days and at the same time of the day. The heaviest load that the patient could lift 4 times was used as outcome measure. The error of measurement indicates that the true result in 95% of cases will be within +/-4.5 kg from the measured value, while the difference between 2 measurements in 95% of cases will be less than 6.4 kg. Intra-class correlation (1,1) was 0.91. Relative test-retest reliability was high assessed by intra-class correlation, but absolute measurement variability reported as the smallest detectable difference has relevance for the interpretation of clinical test results and should also be considered.
Improving the Test-Retest Reliability of Resting State fMRI by Removing the Impact of Sleep.

PubMed

Wang, Jiahui; Han, Junwei; Nguyen, Vinh T; Guo, Lei; Guo, Christine C

2017-01-01

Resting state functional magnetic resonance imaging (rs-fMRI) provides a powerful tool to examine large-scale neural networks in the human brain and their disturbances in neuropsychiatric disorders. Thanks to its low demand and high tolerance, resting state paradigms can be easily acquired from clinical population. However, due to the unconstrained nature, resting state paradigm is associated with excessive head movement and proneness to sleep. Consequently, the test-retest reliability of rs-fMRI measures is moderate at best, falling short of widespread use in the clinic. Here, we characterized the effect of sleep on the test-retest reliability of rs-fMRI. Using measures of heart rate variability (HRV) derived from simultaneous electrocardiogram (ECG) recording, we identified portions of fMRI data when subjects were more alert or sleepy, and examined their effects on the test-retest reliability of functional connectivity measures. When volumes of sleep were excluded, the reliability of rs-fMRI is significantly improved, and the improvement appears to be general across brain networks. The amount of improvement is robust with the removal of as much as 60% volumes of sleepiness. Therefore, test-retest reliability of rs-fMRI is affected by sleep and could be improved by excluding volumes of sleepiness as indexed by HRV. Our results suggest a novel and practical method to improve test-retest reliability of rs-fMRI measures.
The many ways sputum flows - Dealing with high within-subject variability in cystic fibrosis sputum rheology.

PubMed

Radtke, Thomas; Böni, Lukas; Bohnacker, Peter; Fischer, Peter; Benden, Christian; Dressel, Holger

2018-04-21

We evaluated test-retest reliability of sputum viscoelastic properties in clinically stable patients with cystic fibrosis (CF). Data from a prospective, randomized crossover study was used to determine within-subject variability of sputum viscoelasticity (G', storage modulus and G", loss modulus at 1 and 10 rad s -1 ) and solids content over three consecutive visits. Precision of sputum properties was quantified by within-subject standard deviation (SD ws ), coefficient of variation (CV) and intraclass correlation coefficients (ICC). Fifteen clinically stable adults with CF (FEV 1 range 24-94% predicted) were included. No differences between study visits (mean ± SD 8 ± 2 days) were observed for any sputum rheology measure. CV's for G', G" and solids content ranged between 40.3-45.3% and ICC's between 0.21-0.42 indicating poor to fair test-retest reliability. Short-term within-subject variability of sputum properties is high in clinically stable adults with CF. Investigators applying shear rheology experiments in future prospective studies should consider using multiple measurements aiming to increase precision of sputum rheological outcomes. Copyright © 2018 Elsevier B.V. All rights reserved.
[Desing and validation of a scale to measure caregiving dedication in caregivers of dependent older people].

PubMed

Serrano-Ortega, Natalia; Frías-Osuna, Antonio; Recio-Gómez, Juan M; Del-Pino-Casado, Rafael

2015-11-01

To develop and validate a scale to measure caregiving dedication regarding activities of daily living in caregivers of dependent older people. Cross-sectional study. Primary Health Care (Andalusia, Spain). a probabilistic sample of 200 caregivers of older relatives from Córdoba, Spain. Content validation by experts, construct validity (by exploratory factor analysis), divergent validity and reliability (internal consistency, test-retest reliability and inter-observers reliability). Cronbach's alpha was 0.86. Intraclass Correlation Coefficient was 0.96 for test-retest reliability and 0.88 for inter-observers reliability. When the sample was divided in two groups according to perceived burden level (presence and absence), the perceived burden was significantly different in each group (P=.001). The factor analysis revealed one only factor that explained 64% of the variance. The scale allows a suitable measure of caregiving dedication regarding activities of daily living in caregivers of older people, because this scale allows a quickly, easy administration, is well accepted by caregivers, has acceptable psychometric results and includes the frequency of caregiving, the kind of attended need and the dependence level in each need. Copyright © 2014 Elsevier España, S.L.U. All rights reserved.
Reliability and validity of Yo-Yo tests in 9- to 16-year-old football players and matched non-sports active schoolboys.

PubMed

Póvoas, Susana C A; Castagna, Carlo; Soares, José M C; Silva, Pedro M R; Lopes, Mariana V M F; Krustrup, Peter

2016-10-01

The purpose of this study was to examine the test-retest reliability and construct validity of three age-adapted Yo-Yo intermittent tests in football players aged 9-16 years (n = 70) and in age-matched non-sports active boys (n = 72). Within 7 days, each participant performed two repetitions of an age-related intensity-adapted Yo-Yo intermittent test, i.e. the Yo-Yo intermittent recovery level 1 children's test for 9- to 11-year-olds; the Yo-Yo intermittent endurance level 1 for 12- to 13-year-olds and the Yo-Yo intermittent endurance level 2 test for 14- to 16-year-olds. Peak heart rate (HRpeak) was determined for all tests. The distance covered in the tests was 57% (1098 ± 680 vs. 700 ± 272 m), 119% (2325 ± 778 vs. 1062 ± 285 m) and 238% (1743 ± 460 vs. 515 ± 113 m) higher (p ≤ .016), respectively for football-trained than for non-sports active boys aged 9-11, 12-13 and 14-16 years. The typical errors of measurement for Yo-Yo distance, expressed as a percentage of the coefficient of variation (confidence interval), were 11.1% (9.0-14.7%), 10.1% (8.1-13.7%) and 8.5% (6.7-11.7%) for football players aged 9-11, 12-13 and 14-16 years, respectively, with corresponding values of 9.3% (7.4-12.8%), 10.2% (8.1-14.0%) and 8.5% (6.8-11.3%) for non-sports active boys. Intraclass correlation coefficient values for test-retest were excellent in both groups (range: 0.844-0.981). Relative HRpeak did not differ significantly between the groups in test and retest. In conclusion, Yo-Yo intermittent test performances and HRpeak are reliable for 9- to 16-year-old footballers and non-sports active boys. Additionally, performances of the three Yo-Yo tests were seemingly better for football-trained than for non-sports active boys, providing evidence of construct validity.
Analysis of the reliability and validity of the Turkish version of the intermittent and constant osteoarthritis pain questionnaire.

PubMed

Erel, Suat; Şimşek, İbrahim Engin; Özkan, Hüseyin

2015-01-01

The aim of this study was to analyze the validity and reliability of the Turkish version (ICOAP-TR) of the intermittent and constant osteoarthritis pain (ICOAP) questionnaire in patients with knee osteoarthritis (OA). Thirty-eight volunteer patients diagnosed with knee OA answered the questionnaire twice with an interval of 2-4 days. The reliability of the measurement was assessed using Cronbach's alpha coefficient and intraclass correlation (ICC) for test-retest reliability. Criterion validity was tested against the Western Ontario and McMaster Universities Arthritis Index (WOMAC) pain score and visual analog scale (VAS) designed to assess the perceived discomfort rated by the patient. Test-retest reliability was found to be ICC=0.942 for total score, 0.902 for constant pain subscale, and 0.945 for intermittent pain subscale. Internal consistency was tested using Cronbach's alpha and was found to be 0.970 for total score, 0.948 for constant pain subscale, and 0.972 for intermittent pain subscale. For criterion validity, the correlation between the total score of ICOAP-TR and WOMAC pain subscale was r=0.779 (p<0.05), and correlation between total score of ICOAP-TR and VAS was r=0.570 (p<0.05). The ICOAP-TR is a reliable and valid instrument to be used with patients with knee OA.
Test-retest reliability of automated whole body and compartmental muscle volume measurements on a wide bore 3T MR system.

PubMed

Thomas, Marianna S; Newman, David; Leinhard, Olof Dahlqvist; Kasmai, Bahman; Greenwood, Richard; Malcolm, Paul N; Karlsson, Anette; Rosander, Johannes; Borga, Magnus; Toms, Andoni P

2014-09-01

To measure the test-retest reproducibility of an automated system for quantifying whole body and compartmental muscle volumes using wide bore 3 T MRI. Thirty volunteers stratified by body mass index underwent whole body 3 T MRI, two-point Dixon sequences, on two separate occasions. Water-fat separation was performed, with automated segmentation of whole body, torso, upper and lower leg volumes, and manually segmented lower leg muscle volumes. Mean automated total body muscle volume was 19·32 L (SD9·1) and 19·28 L (SD9·12) for first and second acquisitions (Intraclass correlation coefficient (ICC) = 1·0, 95% level of agreement -0·32-0·2 L). ICC for all automated test-retest muscle volumes were almost perfect (0·99-1·0) with 95% levels of agreement 1.8-6.6% of mean volume. Automated muscle volume measurements correlate closely with manual quantification (right lower leg: manual 1·68 L (2SD0·6) compared to automated 1·64 L (2SD 0·6), left lower leg: manual 1·69 L (2SD 0·64) compared to automated 1·63 L (SD0·61), correlation coefficients for automated and manual segmentation were 0·94-0·96). Fully automated whole body and compartmental muscle volume quantification can be achieved rapidly on a 3 T wide bore system with very low margins of error, excellent test-retest reliability and excellent correlation to manual segmentation in the lower leg. Sarcopaenia is an important reversible complication of a number of diseases. Manual quantification of muscle volume is time-consuming and expensive. Muscles can be imaged using in and out of phase MRI. Automated atlas-based segmentation can identify muscle groups. Automated muscle volume segmentation is reproducible and can replace manual measurements.
Preliminary psychometric properties of the chinese version of the work-related quality of life scale-2 in the nursing profession.

PubMed

Lin, Shike; Chaiear, Naesinee; Khiewyoo, Jiraporn; Wu, Bin; Johns, Nutjaree Pratheepawanit

2013-03-01

As quality of work-life (QWL) among nurses affects both patient care and institutional standards, assessment regarding QWL for the profession is important. Work-related Quality of Life Scale (WRQOLS) is a reliable QWL assessment tool for the nursing profession. To develop a Chinese version of the WRQOLS-2 and to examine its psychometric properties as an instrument to assess QWL for the nursing profession in China. Forward and back translating procedures were used to develop the Chinese version of WRQOLS-2. Six nursing experts participated in content validity evaluation and 352 registered nurses (RNs) participated in the tests. After a two-week interval, 70 of the RNs were retested. Structural validity was examined by principal components analysis and the Cronbach's alphas calculated. The respective independent sample t-test and intra-class correlation coefficient were used to analyze known-group validity and test-retest reliability. One item was rephrased for adaptation to Chinese organizational cultures. The content validity index of the scale was 0.98. Principal components analysis resulted in a seven-factor model, accounting for 62% of total variance, with Cronbach's alphas for subscales ranging from 0.71 to 0.88. Known-group validity was established in the assessment results of the participants in permanent employment vs. contract employment (t = 2.895, p < 0.01). Good test-retest reliability was observed (r = 0.88, p < 0.01). The translated Chinese version of the WRQOLS-2 has sufficient validity and reliability so that it can be used to evaluate the QWL among nurses in mainland China.
Age-Related Differences in Test-Retest Reliability in Resting-State Brain Functional Connectivity

PubMed Central

Song, Jie; Desphande, Alok S.; Meier, Timothy B.; Tudorascu, Dana L.; Vergun, Svyatoslav; Nair, Veena A.; Biswal, Bharat B.; Meyerand, Mary E.; Birn, Rasmus M.; Bellec, Pierre; Prabhakaran, Vivek

2012-01-01

Resting-state functional MRI (rs-fMRI) has emerged as a powerful tool for investigating brain functional connectivity (FC). Research in recent years has focused on assessing the reliability of FC across younger subjects within and between scan-sessions. Test-retest reliability in resting-state functional connectivity (RSFC) has not yet been examined in older adults. In this study, we investigated age-related differences in reliability and stability of RSFC across scans. In addition, we examined how global signal regression (GSR) affects RSFC reliability and stability. Three separate resting-state scans from 29 younger adults (18–35 yrs) and 26 older adults (55–85 yrs) were obtained from the International Consortium for Brain Mapping (ICBM) dataset made publically available as part of the 1000 Functional Connectomes project www.nitrc.org/projects/fcon_1000. 92 regions of interest (ROIs) with 5 cubic mm radius, derived from the default, cingulo-opercular, fronto-parietal and sensorimotor networks, were previously defined based on a recent study. Mean time series were extracted from each of the 92 ROIs from each scan and three matrices of z-transformed correlation coefficients were created for each subject, which were then used for evaluation of multi-scan reliability and stability. The young group showed higher reliability of RSFC than the old group with GSR (p-value = 0.028) and without GSR (p-value <0.001). Both groups showed a high degree of multi-scan stability of RSFC and no significant differences were found between groups. By comparing the test-retest reliability of RSFC with and without GSR across scans, we found significantly higher proportion of reliable connections in both groups without GSR, but decreased stability. Our results suggest that aging is associated with reduced reliability of RSFC which itself is highly stable within-subject across scans for both groups, and that GSR reduces the overall reliability but increases the stability in both age groups and could potentially alter group differences of RSFC. PMID:23227153
Reliability of the Dutch translation of the Kujala Patellofemoral Score Questionnaire.

PubMed

Ummels, P E J; Lenssen, A F; Barendrecht, M; Beurskens, A J H M

2017-01-01

There are no Dutch language disease-specific questionnaires for patients with patellofemoral pain syndrome available that could help Dutch physiotherapists to assess and monitor these symptoms and functional limitations. The aim of this study was to translate the original disease-specific Kujala Patellofemoral Score into Dutch and evaluate its reliability. The questionnaire was translated from English into Dutch in accordance with internationally recommended guidelines. Reliability was determined in 50 stable subjects with an interval of 1 week. The patient inclusion criteria were age between 14 and 60 years; knowledge of the Dutch language; and the presence of at least three of the following symptoms: pain while taking the stairs, pain when squatting, pain when running, pain when cycling, pain when sitting with knees flexed for a prolonged period, grinding of the patella and a positive clinical patella test. The internal consistency, test-retest reliability, measurement error and limits of agreement were calculated. Internal consistency was 0.78 for the first assessment and 0.80 for the second assessment. The intraclass correlation coefficient (ICC agreement ) between the first and second assessments was 0.98. The mean difference between the first and second measurements was 0.64, and standard deviation was 5.51. The standard error measurement was 3.9, and the smallest detectable change was 11. The Bland and Altman plot shows that the limits of agreement are -10.37 and 11.65. The results of the present study indicated that the test-retest reliability translated Dutch version of the Kujala Patellofemoral Score questionnaire is equivalent of the test-retest original English language version and has good internal consistency. Trial registration NTR (TC = 3258). Copyright © 2015 John Wiley & Sons, Ltd. Copyright © 2015 John Wiley & Sons, Ltd.
Reliability and validity of a smartphone pulse rate application for the assessment of resting and elevated pulse rate.

PubMed

Mitchell, Katy; Graff, Megan; Hedt, Corbin; Simmons, James

2016-08-01

Purpose/hypothesis: This study was designed to investigate the test-retest reliability, concurrent validity, and the standard error of measurement (SEm) of a pulse rate assessment application (Azumio®'s Instant Heart Rate) on both Android® and iOS® (iphone operating system) smartphones as compared to a FT7 Polar® Heart Rate monitor. Number of subjects: 111. Resting (sitting) pulse rate was assessed twice and then the participants were asked to complete a 1-min standing step test and then immediately re-assessed. The smartphone assessors were blinded to their measurements. Test-retest reliability (intraclass correlation coefficient [ICC 2,1] and 95% confidence interval) for the three tools at rest (time 1/time 2): iOS® (0.76 [0.67-0.83]); Polar® (0.84 [0.78-0.89]); and Android® (0.82 [0.75-0.88]). Concurrent validity at rest time 2 (ICC 2,1) with the Polar® device: IOS® (0.92 [0.88-0.94]) and Android® (0.95 [0.92-0.96]). Concurrent validity post-exercise (time 3) (ICC) with the Polar® device: iOS® (0.90 [0.86-0.93]) and Android® (0.94 [0.91-0.96]). The SEm values for the three devices at rest: iOS® (5.77 beats per minute [BPM]), Polar® (4.56 BPM) and Android® (4.96 BPM). The Android®, iOS®, and Polar® devices showed acceptable test-retest reliability at rest and post-exercise. Both the smartphone platforms demonstrated concurrent validity with the Polar® at rest and post-exercise. The Azumio® Instant Heart Rate application when used by either platform appears to be a reliable and valid tool to assess pulse rate in healthy individuals.
Cross-cultural adaptation of VISA-P score for patellar tendinopathy in Turkish population.

PubMed

Çelebi, Mehmet Mesut; Köse, Serdal Kenan; Akkaya, Zehra; Zergeroglu, Ali Murat

2016-01-01

VISA-P questionnaire assesses to severity of symptoms and treatment effects in athletes with patellar tendinopathy. The purpose of this study was to translated VISA-P questionnaire into Turkish language and to determine its validity and reliability. The English version of VISA-P questionnaire was translated into Turkish according to the internationally recommended guidelines. Test-retest reliability was determined on 89 participants with time interval 24 h. To determine validity of Turkish VISA-P, 31 (17 male, 14 female) healthy students, 34 (20 male, 14 female) patients with patellar tendinopathy (diagnosed by physical examination and ultrasonography) and 24 (16 male, 8 female) volleyball players (at risk populations) were completed VISA-P-Tr. Internal consistency was determined with Cronbach's alpha. Intraclass correlation coefficients (ICCs) were calculated to analyse test-retest reliability. To assessment of discrimination, VISA-P-Tr scores compared all groups using the Mann-Whitney-U test. The VISA-P-Tr questionnaire showed good test-retest reliability (The Cronbach's alpha was 0.79 and 0.78 respectively and ICC was 0.96). The VISA-P-Tr score (mean ± SD) were 93.7 ± 8.9 and 94.0 ± 8.1 for healthy students, 81.1 ± 13.7 and 80.7 ± 13.4 for volleyball players, 58.8 ± 12.1 and 58.5 ± 11.0 for athletes with patellar tendinopathy. The translated Turkish version of VISA-P has good internal consistency and good reliability and validity. Therefore VISA-P-Tr is useful to evaluate symptoms and follow the treatment effect in athletes with patellar tendinopathy.
Reliability and validity of a Chinese version of the Diagnostic Interview for Borderlines-Revised.

PubMed

Wang, Lanlan; Yuan, Chenmei; Qiu, Jianying; Gunderson, John; Zhang, Min; Jiang, Kaida; Leung, Freedom; Zhong, Jie; Xiao, Zeping

2014-09-01

Borderline personality disorder (BPD) is the most studied of the axis II disorders. One of the most widely used diagnostic instruments is the Diagnostic Interview for Borderline Patients-Revised (DIB-R). The aim of this study was to test the reliability and validity of DIB-R for use in the Chinese culture. The reliability and validity of the DIB-R Chinese version were assessed in a sample of 236 outpatients with a probable BPD diagnosis. The Structured Clinical Interview for DSM-IV Personality Disorders (SCID-II) was used as a standard. Test-retest reliability was tested six months later with 20 patients, and inter-rater reliability was tested on 32 patients. The Chinese version of the DIB-R showed good internal global consistency (Cronbach's α of 0.916), good test-retest reliability (Pearson correlation of 0.704), good inter-rater reliability (intra-class correlation coefficient of 0.892 and kappa of 0.861). When compared with the DSM-IV diagnosis as measured by the SCID-II, the DIB-R showed relatively good sensitivity (0.768) and specificity (0.891) at the cutoff of 7, moderate diagnostic convergence (kappa of 0.631), as well as good discriminating validity. The Chinese version of the DIB-R has good psychometric properties, which renders it a valuable method for examining the presence, the severity, and component phenotypes of BPD in Chinese samples. © 2013 Wiley Publishing Asia Pty Ltd.
Reliability, validity, and sensitivity to change of the lower extremity functional scale in individuals affected by stroke.

PubMed

Verheijde, Joseph L; White, Fred; Tompkins, James; Dahl, Peder; Hentz, Joseph G; Lebec, Michael T; Cornwall, Mark

2013-12-01

To investigate reliability, validity, and sensitivity to change of the Lower Extremity Functional Scale (LEFS) in individuals affected by stroke. The secondary objective was to test the validity and sensitivity of a single-item linear analog scale (LAS) of function. Prospective cohort reliability and validation study. A single rehabilitation department in an academic medical center. Forty-three individuals receiving neurorehabilitation for lower extremity dysfunction after stroke were studied. Their ages ranged from 32 to 95 years, with a mean of 70 years; 77% were men. Test-retest reliability was assessed by calculating the classical intraclass correlation coefficient, and the Bland-Altman limits of agreement. Validity was assessed by calculating the Pearson correlation coefficient between the instruments. Sensitivity to change was assessed by comparing baseline scores with end of treatment scores. Measurements were taken at baseline, after 1-3 days, and at 4 and 8 weeks. The LEFS, Short-Form-36 Physical Function Scale, Berg Balance Scale, Six-Minute Walk Test, Five-Meter Walk Test, Timed Up-and-Go test, and the LAS of function were used. The test-retest reliability of the LEFS was found to be excellent (ICC = 0.96). Correlated with the 6 other measures of function studied, the validity of the LEFS was found to be moderate to high (r = 0.40-0.71). Regarding the sensitivity to change, the mean LEFS scores from baseline to study end increased 1.2 SD and for LAS 1.1 SD. LEFS exhibits good reliability, validity, and sensitivity to change in patients with lower extremity impairments secondary to stroke. Therefore, the LEFS can be a clinically efficient outcome measure in the rehabilitation of patients with subacute stroke. The LAS is shown to be a time-saving and reasonable option to track changes in a patient's functional status. Copyright © 2013 American Academy of Physical Medicine and Rehabilitation. Published by Elsevier Inc. All rights reserved.
A Chinese Mandarin translation and validation of the Myocardial Infarction Dimensional Assessment Scale (MIDAS).

PubMed

Wang, W; Lopez, V; Thompson, D R

2006-09-01

To evaluate the validity, reliability, and cultural relevance of the Chinese Mandarin version of Myocardial Infarction Dimensional Assessment Scale (MIDAS) as a disease-specific quality of life measure. The cultural relevance and content validity of the Chinese Mandarin version of the MIDAS (CM-MIDAS) was evaluated by an expert panel. Measurement performance was tested on 180 randomly selected Chinese MI patents. Thirty participants from the primary group completed the CM-MIDAS for test-retest reliability after 2 weeks. Reliability, validity and discriminatory power of the CM-MIDAS were calculated. Two items were modified as suggested by the expert panel. The overall CM-MIDAS had acceptable internal consistency with Cronbach's alpha coefficient 0.93 for the scale and 0.71-0.94 for the seven domains. Test-retest reliability by intraclass correlations was 0.85 for the overall scale and 0.74-0.94 for the seven domains. There was acceptable concurrent validity with significant (p < 0.05) correlations between the CM-MDAS and the Chinese Version of the Short Form 36. The principal components analysis extracted seven factors that explained 67.18% of the variance with high factor loading indicating good construct validity. Empirical data support CM-MIDAS as a valid and reliable disease-specific quality of life measure for Chinese Mandarin speaking patients with myocardial infarction.
Measuring deception: test-retest reliability of physicians' self-reported manipulation of reimbursement rules for patients.

PubMed

VanGeest, Jonathan B; Wynia, Matthew K; Cummins, Deborah S; Wilson, Ira B

2002-06-01

This study examined the test-retest reliability of physicians' self-reported manipulation of reimbursement rules for patients. The test-retest reliability of self-report of three specific tactics were examined: (1) exaggerating the severity of patients' conditions, (2) changing a patient's official (billing) diagnosis, and (3) reporting signs or symptoms that patients did not have. The reliability of a scaled summary measure of physicians' manipulation of reimbursement rules was also assessed. Overall, the authors found high levels of test-retest agreement across all three items and the summary measure. These findings suggest that self-report can be used to produce reliable data on this controversial issue. Specifically, the three items reported here can be used to produce a reliable summary measure of physicians' manipulation of reimbursement rules to help patients obtain care that physicians perceive as necessary.

Comparison of the WOMAC (Western Ontario and McMaster Universities) osteoarthritis index and a self-report format of the self-administered Lequesne-Algofunctional index in patients with knee and hip osteoarthritis.

PubMed

Stucki, G; Sangha, O; Stucki, S; Michel, B A; Tyndall, A; Dick, W; Theiler, R

1998-03-01

To compare the metric properties and validity of German versions of the WOMAC (Western Ontario and McMaster Universities) and a self-administered questionnaire-format of the Lequesne-Algofunctional-Index in patients with osteoarthritis (OA) of the lower extremities. Cross-sectional analysis of the instruments' internal consistency (Cronbach's coefficient alpha) and construct validity (correlation with radiological OA-severity and limitation in range-of-motion) in ambulatory patients and patients before hip arthroplasty. Test-retest reliability was assessed on a subsample after 10 days. Data from 51 patients out of 91 contacted could be analyzed. Twenty-nine patients had knee and 22 patients had hip OA. Both the WOMAC and Lequesne OA-indices and their scales or sections had a satisfactory test-retest reliability (Intraclass correlation coefficient 0.43-0.96). All scales of the WOMAC were internally consistent (Cronbach's coefficient alpha 0.81-0.96) and associated with radiological OA-severity and joint range of motion. However, only the function but not the symptom sections (Cronbach's coefficient alpha knee: 0.55; hip: 0.63) of the self-administered Lequesne OA index were internally consistent for both, patients with knee and hip OA. Also, the symptom components were not or only weakly associated with radiological OA-severity and joint range of motion. Although our results are based on a German version using a self-report format we may caution using the self-administered Lequesne OA index without prior testing of its metric properties and validity.
Validity and Reliability of Thai Version of the Foot and Ankle Ability Measure (FAAM) Subjective Form.

PubMed

Arunakul, Marut; Arunakul, Preeyaphan; Suesiritumrong, Chakhrist; Angthong, Chayanin; Chernchujit, Bancha

2015-06-01

Self-administered questionnaires have become an important aspect for clinical outcome assessment of foot and ankle-related problems. The Foot and Ankle Ability Measure (FAAM) subjective form is a region-specific questionnaire that is widely used and has sufficient validity and reliability from previous studies. Translate the original English version of FAAM into a Thai version and evaluate the validity and reliability of Thai FAAM in patients with foot and ankle-related problems. The FAAM subjective form was translated into Thai using forward-backward translation protocol. Afterward, reliability and validity were tested. Following responses from 60 consecutive patients on two questionnaires, the Thai FAAM subjective form and the short form (SF)-36, were used. The validity was tested by correlating the scores from both questionnaires. The reliability was adopted by measuring the test-retest reliability and internal consistency. Thai FAAM score including activity of daily life (ADL) and Sport subscale demonstrated the sufficient correlations with physical functioning (PF) and physical composite score (PCS) domains of the SF-36 (statistically significant with p < 0.001 level and ≥ 0.5 values). The result of reliability revealed highly intra-class correlation coefficient as 0.8 and 0.77, respectively from test-retest study. The internal consistency was strong (Cronbach alpha = 0.94 and 0.88, respectively). The Thai version of FAAM subjective form retained the characteristics of the original version and has proved a reliable evaluation instrument for patients with foot and ankle-related problems.
Reliability and validity of pendulum test measures of spasticity obtained with the Polhemus tracking system from patients with chronic stroke

PubMed Central

Bohannon, Richard W; Harrison, Steven; Kinsella-Shaw, Jeffrey

2009-01-01

Background Spasticity is a common impairment accompanying stroke. Spasticity of the quadriceps femoris muscle can be quantified using the pendulum test. The measurement properties of pendular kinematics captured using a magnetic tracking system has not been studied among patients who have experienced a stroke. Therefore, this study describes the test-retest reliability and known groups and convergent validity of the pendulum test measures obtained with the Polhemus tracking system. Methods Eight patients with chronic stroke underwent pendulum tests with their affected and unaffected lower limbs, with and without the addition of a 2.2 kg cuff weight at the ankle, using the Polhemus magnetic tracking system. Also measured bilaterally were knee resting angles, Ashworth scores (grades 0–4) of quadriceps femoris muscles, patellar tendon (knee jerk) reflexes (grades 0–4), and isometric knee extension force. Results Three measures obtained from pendular traces of the affected side were reliable (intraclass correlation coefficient ≥ .844). Known groups validity was confirmed by demonstration of a significant difference in the measurements between sides. Convergent validity was supported by correlations ≥ .57 between pendulum test measures and other measures reflective of spasticity. Conclusion Pendulum test measures obtained with the Polhemus tracking system from the affected side of patients with stroke have good test-retest reliability and both known groups and convergent validity. PMID:19642989
Reliability and validity of pendulum test measures of spasticity obtained with the Polhemus tracking system from patients with chronic stroke.

PubMed

Bohannon, Richard W; Harrison, Steven; Kinsella-Shaw, Jeffrey

2009-07-30

Spasticity is a common impairment accompanying stroke. Spasticity of the quadriceps femoris muscle can be quantified using the pendulum test. The measurement properties of pendular kinematics captured using a magnetic tracking system has not been studied among patients who have experienced a stroke. Therefore, this study describes the test-retest reliability and known groups and convergent validity of the pendulum test measures obtained with the Polhemus tracking system. Eight patients with chronic stroke underwent pendulum tests with their affected and unaffected lower limbs, with and without the addition of a 2.2 kg cuff weight at the ankle, using the Polhemus magnetic tracking system. Also measured bilaterally were knee resting angles, Ashworth scores (grades 0-4) of quadriceps femoris muscles, patellar tendon (knee jerk) reflexes (grades 0-4), and isometric knee extension force. Three measures obtained from pendular traces of the affected side were reliable (intraclass correlation coefficient > or = .844). Known groups validity was confirmed by demonstration of a significant difference in the measurements between sides. Convergent validity was supported by correlations > or = .57 between pendulum test measures and other measures reflective of spasticity. Pendulum test measures obtained with the Polhemus tracking system from the affected side of patients with stroke have good test-retest reliability and both known groups and convergent validity.
Spanish validation of the Exercise Therapy Burden Questionnaire (ETBQ) for the assessment of barriers associated to doing physical therapy for the treatment of chronic illness.

PubMed

Navarro-Albarracín, César; Poiraudeau, Serge; Chico-Matallana, Noelia; Vergara-Martín, Jesús; Martin, William; Castro-Sánchez, Adelaida María; Matarán-Peñarrocha, Guillermo A

2018-06-08

To validate the Spanish version of the Exercise Therapy Burden Questionnaire (ETBQ) for the assessment of barriers associated to doing physical therapy for the treatment of chronic ailments. A sample of 177 patients, 55.93% men and 44.07% women, with an average age of 51.03±14.91 was recruited. The reliability of the questionnaire was tested with Cronbach's alpha coefficient, and the validity of the instrument was assessed through the divergent validation process and factor analysis. The factor analysis was different to the original questionnaire, composed of a dimension, in this case determined three dimensions: (1) General limitations for doing physical exercise. (2) Physical limitations for doing physical exercise. (3) Limitations caused by the patients' predisposition to their exercises. The reliability of the test-retest was measured through the intraclass correlation coefficient (ICC) and the Bland-Altman plot. Cronbach's alpha was 0.8715 for the total ETBQ. The ICC of the test-retest was 0.745 and the Bland-Altman plot showed no systematic trend. We have obtained the translated version in Spanish of the ETBQ questionnaire. Copyright © 2017 Elsevier España, S.L.U. All rights reserved.
Synkinesis assessment in facial palsy: validation of the Dutch Synkinesis Assessment Questionnaire.

PubMed

Kleiss, Ingrid J; Beurskens, Carien H G; Stalmeier, Peep F M; Ingels, Koen J A O; Marres, Henri A M

2016-06-01

The objective of this study is to validate an existing health-related quality of life questionnaire for patients with synkinesis in facial palsy for implementation in the Dutch language and culture. The Synkinesis Assessment Questionnaire was translated into the Dutch language using a forward-backward translation method. A pilot test with the translated questionnaire was performed in 10 patients with facial palsy and 10 normal subjects. Finally, cross-cultural adaption was accomplished at our outpatient clinic for facial palsy. Analyses for internal consistency, test-retest reliability, and construct validity were performed. Sixty-six patients completed the Dutch Synkinesis Assessment Questionnaire and the Dutch Facial Disability Index. Cronbach's α, representing internal consistency, was 0.80. Test-retest reliability was 0.53 (Spearman's correlation coefficient, P < 0.01). Correlations with the House-Brackmann score, Sunnybrook score, Facial Disability Index physical function, and social/well-being function were -0.29, 0.20, -0.29, and -0.32, respectively. Correlation with the Sunnybrook synkinesis subscore was 0.50 (Spearman's correlation coefficient). The Dutch Synkinesis Assessment Questionnaire shows good psychometric values and can be implemented in the management of Dutch-speaking patients with facial palsy and synkinesis in the Netherlands. Translation of the instrument into other languages may lead to widespread use, making evaluation, and comparison possible among different providers.
Reliability, standard error, and minimum detectable change of clinical pressure pain threshold testing in people with and without acute neck pain.

PubMed

Walton, David M; Macdermid, Joy C; Nielson, Warren; Teasell, Robert W; Chiasson, Marco; Brown, Lauren

2011-09-01

Clinical measurement. To evaluate the intrarater, interrater, and test-retest reliability of an accessible digital algometer, and to determine the minimum detectable change in normal healthy individuals and a clinical population with neck pain. Pressure pain threshold testing may be a valuable assessment and prognostic indicator for people with neck pain. To date, most of this research has been completed using algometers that are too resource intensive for routine clinical use. Novice raters (physiotherapy students or clinical physiotherapists) were trained to perform algometry testing over 2 clinically relevant sites: the angle of the upper trapezius and the belly of the tibialis anterior. A convenience sample of normal healthy individuals and a clinical sample of people with neck pain were tested by 2 different raters (all participants) and on 2 different days (healthy participants only). Intraclass correlation coefficient (ICC), standard error of measurement, and minimum detectable change were calculated. A total of 60 healthy volunteers and 40 people with neck pain were recruited. Intrarater reliability was almost perfect (ICC = 0.94-0.97), interrater reliability was substantial to near perfect (ICC = 0.79-0.90), and test-retest reliability was substantial (ICC = 0.76-0.79). Smaller change was detectable in the trapezius compared to the tibialis anterior. This study provides evidence that novice raters can perform digital algometry with adequate reliability for research and clinical use in people with and without neck pain.
The Chinese version of Instrument of Professional Attitude for Student Nurses (IPASN): Assessment of reliability and validity.

PubMed

Xiao, Yu-Ying; Li, Ting; Xiao, Lin; Wang, Su-Wei; Wang, Si-Qi; Wang, Han-Xiao; Wang, Bei-Bei; Gao, Yu-Lin

2017-02-01

Professional attitude is of great importance for nursing talents in the modern society. To develop an effective educational program for student nurses in China, an appropriate instrument is required for the assessment of their professional attitude. To assess the validity and reliability of the Instrument of Professional Attitude for Student Nurses (IPASN) in Chinese version. The original version of IPASN was translated through Brislin model (translation, back translation, culture adaption and pilot study) with the authorization from the developer. A total of 681 nursing students were chosen by stratified convenience sampling to assess construct validity using exploratory factor analysis (EFA). Besides, item analysis, Cronbach's alpha coefficients, test-retest reliability were conducted to test the psychometric properties in this part. A total of 204 nursing undergraduate trainees were selected by cluster convenience sampling to confirm the structure using confirmatory factor analysis (CFA) in another time. Corrected item-total correlations, alpha if item deleted were between 0.33 and 0.69, 0.906 and 0.913, respectively, indicating no item should be deleted. Cronbach alpha value was 0.91 for the total scale and Cronbach alpha coefficient for subscales ranged from 0.67 to 0.89. Test-retest reliability estimated from intraclass correlation coefficient (ICC) was 0.74 (P<0.05). Differences in item scores between the high-score group (the first 27%) and low-score group (the last 27%) were significant (P<0.001), indicating that the item discrimination ability was good. Seven subscales (contribution to increase of scientific information load, autonomy, community service, continuous education, to promote professional development, cooperation and theory guiding practice) were identified in EFA and confirmed in CFA, and explained 65.5% of the total variance. It indicated that the Chinese version of IPASN was valid and reliable for the evaluation of nursing students' professional attitude. Copyright © 2016 Elsevier Ltd. All rights reserved.
Consistency of Field-Based Measures of Neuromuscular Control Using Force-Plate Diagnostics in Elite Male Youth Soccer Players.

PubMed

Read, Paul J; Oliver, Jon L; Croix, Mark Ba De Ste; Myer, Gregory D; Lloyd, Rhodri S

2016-12-01

Read, P, Oliver, JL, Croix, MD, Myer, GD, and Lloyd, RS. Consistency of field-based measures of neuromuscular control using force-plate diagnostics in elite male youth soccer players. J Strength Cond Res 30(12): 3304-3311, 2016-Deficits in neuromuscular control during movement patterns such as landing are suggested pathomechanics that underlie sport-related injury. A common mode of assessment is measurement of landing forces during jumping tasks; however, these measures have been used less frequently in male youth soccer players, and reliability data are sparse. The aim of this study was to examine the reliability of a field-based neuromuscular control screening battery using force-plate diagnostics in this cohort. Twenty-six pre-peak height velocity (PHV) and 25 post-PHV elite male youth soccer players completed a drop vertical jump (DVJ), single-leg 75% horizontal hop and stick (75%HOP), and single-leg countermovement jump (SLCMJ). Measures of peak landing vertical ground reaction force (pVGRF), time to stabilization, time to pVGRF, and pVGRF asymmetry were recorded. A test-retest design was used, and reliability statistics included change in mean, intraclass correlation coefficient, and coefficient of variation (CV). No significant differences in mean score were reported for any of the assessed variables between test sessions. In both groups, pVGRF and asymmetry during the 75%HOP and SLCMJ demonstrated largely acceptable reliability (CV ≤ 10%). Greater variability was evident in DVJ pVGRF and all other assessed variables, across the 3 protocols (CV range = 13.8-49.7%). Intraclass correlation coefficient values ranged from small to large and were generally higher in the post-PHV players. The results of this study suggest that pVGRF and asymmetry can be reliably assessed using a 75%HOP and SLCMJ in this cohort. These measures could be used to support a screening battery for elite male youth soccer players and for test-retest comparison.
Translation and validation of European organization for research and treatment of cancer quality of life Questionnaire -C30 into Moroccan version for cancer patients in Morocco

PubMed Central

2014-01-01

Background Understanding the effects of cancer on the quality of life of affected patients is critical to clinical research as well as to optimal management and care. The aim of this study was to adapt the European Organization for Research and Treatment of Cancer Quality of Life Questionnaire-C30 (EORTC QLQ-C30) questionnaire into Moroccan Arabic and to determine its psychometric properties. After translation, back translation and pretesting of the pre-final version, the translated version was submitted to a committee of professionals composed by oncologists and epidemiologists. The psychometric properties were tested in patients with cancer. Internal consistency was tested using Cronbach’s alpha and the test-retest reliability using interclass correlation coefficients. Construct validity was assessed by examining item-convergent and divergent validity. It was also tested using Spearman’s correlation between QLQ-C30 scales and EQ-5D. Results The study was conducted in 125 patients. The Moroccan version was internally reliable, Cronbach’s α was 0.87 for the total scale and ranged from 0.34 to 0.97 for the subscales. The intraclass correlation coefficient of the test-retest reliability ranged from 0.64 for “social functioning” to 0.89 for “physical activities” subscales. The instrument demonstrated a good construct and concomitant validity. Conclusions We have developed a semantically equivalent translation with cultural adaptation of EORTC QLQ-C30 questionnaire. The assessment of its measurement properties showed that it is quite reliable and a valid measure of the effect of cancer on the quality of life in Moroccan patients. PMID:24721384
Spanish Validation of the Care Evaluation Scale for Measuring the Quality of Structure and Process of Palliative Care From the Family Perspective.

PubMed

Benitez-Rosario, Miguel Angel; Caceres-Miranda, Raquel; Aguirre-Jaime, Armando

2016-03-01

A reliable and valid measure of the structure and process of end-of-life care is important for improving the outcomes of care. This study evaluated the validity and reliability of the Spanish adaptation of a satisfaction tool of the Care Evaluation Scale (CES), which was developed in Japan to evaluate palliative care structure and process from the perspective of family members. Standard forward-backward translation and a pilot test were conducted. A multicenter survey was conducted with the relatives of patients admitted to palliative care units for symptom control. The dimensional structure was assessed using confirmatory factor analyses. Concurrent and discriminant validity were tested by correlation with the SERQVHOS, a Spanish hospital care satisfaction scale and with an 11-point rating scale on satisfaction with care. The reliability of the CES was tested by Cronbach α and by test-retest correlation. A total of 284 primary caregivers completed the CES, with low missing response rates. The results of the factor analysis suggested a six-factor solution explaining 69% of the total variance. The CES moderately correlated with the SERQVHOS and with the overall satisfaction scale (intraclass correlation coefficients of 0.66 and 0.44, respectively; P = 0.001). Cronbach α was 0.90 overall and ranged from 0.85 to 0.89 for subdomains. Intraclass correlation coefficient was 0.88 (P = 0.001) for test-retest analysis. The Spanish CES was found to be a reliable and valid measure of the satisfaction with end-of-life care structure and process from family members' perspectives. Copyright © 2016 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.
Reliability, validity, and clinical use of the Dominic Interactive: a DSM-based, self-report screen for school-aged children.

PubMed

Bergeron, Lise; Berthiaume, Claude; St-Georges, Marie; Piché, Geneviève; Smolla, Nicole

2013-08-01

As no single informant can be considered the gold standard of child psychopathology, interviewing of children regarding their own symptoms is necessary. Our study focused on the reliability, validity, and clinical use of the Dominic Interactive (DI), a multimedia self-report screen to assess symptoms for the most frequent Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, Text Revision, mental disorders in school-aged children. A sample of 585 children aged 6 to 11 years from the community and psychiatric clinics was used to analyze the internal consistency, the test-retest estimate of reliability, and the criterion-related validity of the DI against the referral status. In addition, cross-informant correlation coefficients between this instrument (child report) and the Child Symptom Inventory (parent report) were explored in a subsample of 292 participants. For the total sample, Cronbach alpha coefficients ranged from 0.63 to 0.91. Test-retest kappas varied from 0.42 to 0.62 for categories based on cut-off points, except for specific phobias. Intraclass correlation coefficients ranged from 0.70 to 0.81 for symptom scales. The DI discriminated between referred and non-referred children in psychiatric clinics for all symptom scales. Significant cross-informant correlation coefficients were higher for the externalizing symptoms (0.35 to 0.48) than the internalizing symptoms (0.14 to 0.27). Findings of our study reasonably support adequate psychometric properties of the DI. This instrument offers a developmentally sensitive screening method to obtain unique information from young children about their mental health problems in front-line services, psychiatric clinics, and research settings.
Adaptation and Validation of the Kannada Version of the Singing Voice Handicap Index.

PubMed

Gunjawate, Dhanshree R; Aithal, Venkataraja U; Guddattu, Vasudeva; Bellur, Rajashekhar

2017-07-01

The present study aimed to adapt and validate the Singing Voice Handicap Index (SVHI) into Kannada language using standard procedures. This is a cross-sectional study. The original English version of SVHI was translated into Kannada. It was administered on 106 Indian classical singers, of whom 22 complained of voice problems. Its internal consistency was determined using Cronbach's alpha coefficient (α), test-retest reliability using Pearson's product moment correlation and paired t test, and the difference in mean scores by independent sample t test. The results revealed that the Kannada SVHI exhibited an excellent internal consistency (α = 0.96) with a high item-to-total correlation. Further, excellent test-retest reliability (r = 0.99) and significant differences in SVHI scores were also obtained by singers with and without a voice problem (t = 12.93, df = 104, P = 0.005). The Kannada SVHI is a valid and reliable tool for self-reported assessment of singers with voice problems. It will provide a valuable insight into the singing-related voice problems as perceived by the singers themselves. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Two-year Test-Retest Reliability in High School Athletes Using the Four- and Two-Factor ImPACT Composite Structures: The Effects of Learning Disorders and Headache/Migraine Treatment History.

PubMed

Brett, Benjamin L; Solomon, Gary S; Hill, Jennifer; Schatz, Philip

2018-03-01

This study examined the test-retest reliability of the four- and two-factor structures (i.e., Memory and Speed) of ImPACT over a 2-year interval across multiple groups with premorbid conditions, including those with a history of special education or learning disorders (LD; n = 114), treatment history for headache/migraine (n = 81), and a control group (n = 792). Nine hundred and eighty seven high school athletes completed baseline testing using online ImPACT across a 2-year interval. Paired-samples t-tests documented improvement from initial to follow-up assessments. Test stability was examined using Regression-based measures (RBM) and Reliable change indices (RCI). Reliability was examined using intraclass correlation coefficients (ICC). Significant improvement on all four composites were observed for the control group over a 2-year interval; whereas significant differences were observed only on Visual Motor Speed for the LD and headache/migraine treatment history groups. ICCs ranges were similar across groups and greater or comparable reliability was observed for the two-factor structure on Memory (0.67-0.73) and Speed (0.76-0.78) composites. RCIs and RBMs demonstrated stability for the four- and two-factor structures, with few cases falling outside the range of expected change within a healthy sample at the 90% and 95% CIs. Typical practices of obtaining new baselines every 2 years in the high school population can be applied to athletes with a history of special education or LD and headache/migraine treatment. The two-factor structure has potential to increase test-retest reliability. Further research regarding clinical utility is needed. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Long-term Stability and Reliability of Baseline Cognitive Assessments in High School Athletes Using ImPACT at 1-, 2-, and 3-year Test-Retest Intervals.

PubMed

Brett, Benjamin L; Smyk, Nathan; Solomon, Gary; Baughman, Brandon C; Schatz, Philip

2016-08-18

The ImPACT (Immediate Post-Concussion Assessment and Cognitive Testing) neurocognitive testing battery is a widely used tool used for the assessment and management of sports-related concussion. Research on the stability of ImPACT in high school athletes at a 1- and 2-year intervals have been inconsistent, requiring further investigation. We documented 1-, 2-, and 3-year test-retest reliability of repeated ImPACT baseline assessments in a sample of high school athletes, using multiple statistical methods for examining stability. A total of 1,510 high school athletes completed baseline cognitive testing using online ImPACT test battery at three time periods of approximately 1- (N = 250), 2- (N = 1146), and 3-year (N = 114) intervals. No participant sustained a concussion between assessments. Intraclass correlation coefficients (ICCs) ranged in composite scores from 0.36 to 0.90 and showed little change as intervals between assessments increased. Reliable change indices and regression-based measures (RBMs) examining the test-retest stability demonstrated a lack of significant change in composite scores across the various time intervals, with very few cases (0%-6%) falling outside of 95% confidence intervals. The results suggest ImPACT composites scores remain considerably stability across 1-, 2-, and 3-year test-retest intervals in high school athletes, when considering both ICCs and RBM. Annually ascertaining baseline scores continues to be optimal for ensuring accurate and individualized management of injury for concussed athletes. For instances in which more recent baselines are not available (1-2 years), clinicians should seek to utilize more conservative range estimates in determining the presence of clinically meaningful change in cognitive performance. © The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
The usability of a WeChat-based electronic questionnaire for collecting participant-reported data in female pelvic floor disorders: a comparison with the traditional paper-administered format.

PubMed

Sun, Zhi-Jing; Zhu, Lan; Liang, Maolian; Xu, Tao; Lang, Jing-He

2016-08-01

WeChat is a promising tool for capturing electronic data; however, no research has examined its use. This study evaluates the reliability and feasibility of WeChat for administering the Pelvic Floor Impact Questionnaire Short Form 7 questionnaire to women with pelvic floor disorders. Sixty-eight pelvic floor rehabilitation women were recruited between June and December 2015 and crossover randomized to two groups. All participants completed two questionnaire formats. One group completed the paper version followed by the WeChat version; the other group completed the questionnaires in reverse order. Two weeks later, each group completed the two versions in reverse order. The WeChat version's reliability was assessed using intraclass correlation coefficients and test-retest reliability. Forty-two women (61.8%) preferred the WeChat to the paper format, eight (11.8%) preferred the paper format, and 18 (26.5%) had no preference. The younger women preferred WeChat. Completion time was 116.5 (61.3) seconds for the WeChat version and 133.4 (107.0) seconds for the paper version, with no significant difference (P = 0.145). Age and education did not impact completion time (P > 0.05). Consistency between the WeChat and paper versions was excellent. The intraclass correlation coefficients of the Pelvic Floor Impact Questionnaire Short Form 7 and the three subscales ranged from 0.915 to 0.980. The Bland-Altman analysis and linear regression results also showed high consistency. The test-retest study had a Pearson's correlation coefficient of 0.908, demonstrating a strong correlation. WeChat-based questionnaires were well accepted by women with pelvic floor disorders and had good data quality and reliability.
Methodology for Developing a New EFNEP Food and Physical Activity Behaviors Questionnaire.

PubMed

Murray, Erin K; Auld, Garry; Baker, Susan S; Barale, Karen; Franck, Karen; Khan, Tarana; Palmer-Keenan, Debra; Walsh, Jennifer

2017-10-01

Research methods are described for developing a food and physical activity behaviors questionnaire for the Expanded Food and Nutrition Education Program (EFNEP), a US Department of Agriculture nutrition education program serving low-income families. Mixed-methods observational study. The questionnaire will include 5 domains: (1) diet quality, (2) physical activity, (3) food safety, (4) food security, and (5) food resource management. A 5-stage process will be used to assess the questionnaire's test-retest reliability and content, face, and construct validity. Research teams across the US will coordinate questionnaire development and testing nationally. Convenience samples of low-income EFNEP, or EFNEP-eligible, adult participants across the US. A 5-stage process: (1) prioritize domain concepts to evaluate (2) question generation and content analysis panel, (3) question pretesting using cognitive interviews, (4) test-retest reliability assessment, and (5) construct validity testing. A nationally tested valid and reliable food and physical activity behaviors questionnaire for low-income adults to evaluate EFNEP's effectiveness. Cognitive interviews will be summarized to identify themes and dominant trends. Paired t tests (P ≤ .05) and Spearman and intra-class correlation coefficients (r > .5) will be conducted to assess reliability. Construct validity will be assessed using Wilcoxon t test (P ≤ .05), Spearman correlations, and Bland-Altman plots. Copyright © 2017 Society for Nutrition Education and Behavior. Published by Elsevier Inc. All rights reserved.
Reliability and validity of a self-administered tool for online neuropsychological testing: The Amsterdam Cognition Scan.

PubMed

Feenstra, Heleen E M; Murre, Jaap M J; Vermeulen, Ivar E; Kieffer, Jacobien M; Schagen, Sanne B

2018-04-01

To facilitate large-scale assessment of a variety of cognitive abilities in clinical studies, we developed a self-administered online neuropsychological test battery: the Amsterdam Cognition Scan (ACS). The current studies evaluate in a group of adult cancer patients: test-retest reliability of the ACS and the influence of test setting (home or hospital), and the relationship between our online and a traditional test battery (concurrent validity). Test-retest reliability was studied in 96 cancer patients (57 female; M age = 51.8 years) who completed the ACS twice. Intraclass correlation coefficients (ICCs) were used to assess consistency over time. The test setting was counterbalanced between home and hospital; influence on test performance was assessed by repeated measures analyses of variance. Concurrent validity was studied in 201 cancer patients (112 female; M age = 53.5 years) who completed both the online and an equivalent traditional neuropsychological test battery. Spearman or Pearson correlations were used to assess consistency between online and traditional tests. ICCs of the online tests ranged from .29 to .76, with an ICC of .78 for the ACS total score. These correlations are generally comparable with the test-retest correlations of the traditional tests as reported in the literature. Correlating online and traditional test scores, we observed medium to large concurrent validity (r/ρ = .42 to .70; total score r = .78), except for a visuospatial memory test (ρ = .36). Correlations were affected-as expected-by design differences between online tests and their offline counterparts. Although development and optimization of the ACS is an ongoing process, and reliability can be optimized for several tests, our results indicate that it is a highly usable tool to obtain (online) measures of various cognitive abilities. The ACS is expected to facilitate efficient gathering of data on cognitive functioning in the near future.
TEST-RETEST RELIABILITY OF THE CLOSED KINETIC CHAIN UPPER EXTREMITY STABILITY TEST (CKCUEST) IN ADOLESCENTS: RELIABILITY OF CKCUEST IN ADOLESCENTS.

PubMed

de Oliveira, Valéria M A; Pitangui, Ana C R; Nascimento, Vinícius Y S; da Silva, Hítalo A; Dos Passos, Muana H P; de Araújo, Rodrigo C

2017-02-01

The Closed Kinetic Chain Upper Extremity Stability Test (CKCUEST) has been proposed as an option to assess upper limb function and stability; however, there are few studies that support the use of this test in adolescents. The purpose of the present study was to investigate the intersession reliability and agreement of three CKCUEST scores in adolescents and establish clinimetric values for this test. Test-retest reliability. Twenty-five healthy adolescents of both sexes were evaluated. The subjects performed two CKCUEST with an interval of one week between the tests. An intraclass correlation coefficient (ICC 3,3 ) two-way mixed model with a 95% interval of confidence was utilized to determine intersession reliability. A Bland-Altman graph was plotted to analyze the agreement between assessments. The presence of systematic error was evaluated by a one-sample t test. The difference between the evaluation and reevaluation was observed using a paired-sample t test. The level of significance was set at 0.05. Standard error of measurements and minimum detectable changes were calculated. The intersession reliability of the average touches score, normalized score, and power score were 0.68, 0.68 and 0.87, the standard error of measurement were 2.17, 1.35 and 6.49, and the minimal detectable change was 6.01, 3.74 and 17.98, respectively. The presence of systematic error (p < 0.014), the significant difference between the measurements (p < 0.05), and the analysis of the Bland-Altman graph infer that CKCUEST is a discordant test with moderate to excellent reliability when used with adolescents. The CKCUEST is a measurement with moderate to excellent reliability for adolescents. 2b.
Validity and reliability of a new ankle dorsiflexion measurement device.

PubMed

Gatt, Alfred; Chockalingam, Nachiappan

2013-08-01

The assessment of the maximum ankle dorsiflexion angle is an important clinical examination procedure. Evidence shows that the traditional goniometer is highly unreliable, and various designs of goniometers to measure the maximum ankle dorsiflexion angle rely on the application of a known force to obtain reliable results. Hence, an innovative ankle dorsiflexion measurement device was designed to make this measurement more reliable by holding the foot in a selected posture without the application of a known moment. To report on the comprehensive validity and reliability testing carried out on the new device. Following validity testing, four different trials to test reliability of the ankle dorsiflexion measurement device were performed. These trials included inter-rater and intra-rater testings with a controlled moment, intra-rater reliability testing with knees flexed and extended without a controlled moment, intra-rater testing with a patient population, and inter-rater reliability testing between four raters of varying experience without controlling moment. All raters were blinded. A series of trials to test intra-rater and inter-rater reliabilities. Intra-rater reliability intraclass correlation coefficient was 0.98 and inter-rater reliability intraclass correlation coefficient (2,1) was 0.953 with a controlled moment. With uncontrolled moment, very high reliability for intra-tester was also achieved (intraclass correlation coefficient = 0.94 with knees extended and intraclass correlation coefficient = 0.95 with knees flexed). For the trial investigating test-retest reliability with actual patients, intraclass correlation coefficient of 0.99 was obtained. In the trial investigating four different raters with uncontrolled moment, intraclass correlation coefficient of 0.91 was achieved. The new ankle dorsiflexion measurement device is a valid and reliable device for measuring ankle dorsiflexion in both healthy subjects and patients, with both controlled and uncontrolled moments, even by multiple raters of varying experience when the foot is dorsiflexed to its end of range of motion. An ankle dorsiflexion measuring device has been designed to increase the reliability of ankle dorsiflexion measurement and replace the traditional goniometer. While the majority of similar devices rely on application of a known moment to perform this measurement, it has been shown that this is not required with the new ankle dorsiflexion measurement device and, rather, foot posture should be taken into consideration as this affects the maximum ankle dorsiflexion angle.

Development and validation of an Infertility Stigma Scale for Chinese women.

PubMed

Fu, Bing; Qin, Nan; Cheng, Li; Tang, Guanxiu; Cao, Yi; Yan, Chunli; Huang, Xin; Yan, Pingping; Zhu, Shujuan; Lei, Jun

2015-07-01

To develop and validate a scale of stigma for infertile Chinese women. Infertile women admitted to the Xiangya Hospital, the Second Xiangya Hospital, and the Third Xiangya Hospital of Central South University for treatment were approached to participate in this study. The Infertility Stigma Scale (ISS) development involved: [1] item generation based on literature, interview (experts/patients: N=5/N=20) and related scale; [2] pre-test questionnaire formation with both experts' ratings (N=9) and infertile women's feedbacks (N=30); [3] the component structure assessed by principal components analysis with varimax rotation (N=334); [4] convergent validity assessed with Social Support Rating scale, Self-Esteem scale, Family APGAR Index (N=334); and [5] reliability identified by internal consistency Cronbach's α (N=334), split-half reliability (N=334), test-retest reliability (N=20). This study yielded a 27-item ISS with 4 factors (self-devaluation, social withdrawal, public stigma, and family stigma). Exploratory factor analysis indicated that these 4 factors accounted for 58.17% of total variances. The Cronbach's α, split-half coefficient and test-retest correlation coefficient for the whole scale was 0.94, 0.90, and 0.91, respectively. The associations of the ISS with other measures suggested good convergent validity. The Content Validity Index (CVI) was 0.92. The ISS appears to be a reliable and valid measure to assess levels of stigma experienced by infertile Chinese women. It may be a useful tool to help identify infertile women at greater risks of distress. Copyright © 2014 Elsevier Inc. All rights reserved.
Validation of the Spanish version of the "Questionnaire on the treatment of approximal and occlusal caries".

PubMed

Ruiz, Begoña; Urzúa, Iván; Cabello, Rodrigo; Rodríguez, Gonzalo; Espelid, Ivar

2013-01-01

To translate and validate a Spanish version of the "Questionnaire on the treatment of approximal and occlusal caries" as a method of collecting information about treatment decisions on caries management in Chilean primary health care services. The original questionnaire proposed by Espelid et al. was translated into Spanish using the forward-backward translation technique. Subsequently, validation of the Spanish version was undertaken. Data were collected from two separate samples; first, from 132 Spanish-speaking dentists recruited from primary health care services and second, from 21 individuals characterised as cariologists. Internal consistency was evaluated by the generation of Cronbach's alpha, test-retest reliability was evaluated by Cohen's kappa, convergent validity was evaluated by comparing the total scale scores to a global evaluation of treatment trends and discriminant validity was evaluated by investigating the differences in total scale scores between the Spanish-speaking dentist and cariologist samples. Cronbach's alpha indicated an internal consistency of 0.63 for the entire scale. Cohen's kappa correlation coefficient expressed a test-retest reliability of 0.83. Convergent validity determined a Pearson's correlation coefficient of 0.24 (p < 0.01). The comparison of proportions (chi-squared) indicated that discriminant validity was statistically significant (p < 0.01), using a one-tailed test. The Spanish version of the "Questionnaire on the treatment of approximal and occlusal caries" is a valid and reliable instrument for collecting information regarding treatment decisions in cariology. The clinical relevance of this study is to acquire a reliable instrument that allows for the determination of treatment decisions in Spanish-speaking dentists.
Validation and reliability of the Physical Activity Scale for the Elderly in Chinese population.

PubMed

Ngai, Shirley P C; Cheung, Roy T H; Lam, Priscillia L; Chiu, Joseph K W; Fung, Eric Y H

2012-05-01

Physical Activity Scale for the Elderly (PASE) is a widely used questionnaire in epidemiological studies for assessing the physical activity level of elderly. This study aims to translate and validate PASE in Chinese population. Cross-sectional study. Chinese elderly aged 65 or above. The original English version of PASE was translated into Chinese (PASE-C) following standardized translation procedures. Ninety Chinese elderly aged 65 or above were recruited in the community. Test-retest reliability was determined by comparing the scores obtained from two separate administrations by the intraclass correlation coefficient. Validity was evaluated by Spearman's rank correlation coefficients between PASE and Medical Outcome Survey 36-Item Short Form Health Survey (SF-36), grip strength, single-leg-stance, 5 times sit-to-stand and 10-m walk. PASE-C demonstrated good test-retest reliability (intraclass correlation coefficient = 0.81). Fair to moderate association were found between PASE-C and most of the subscales of SF-36 (rs = 0.285 to 0.578, p < 0.01), grip strength (rs = 0.405 to 0.426, p < 0.001), single-leg-stance (rs = 0.470 to 0.548, p < 0.001), 5 times sit-to-stand (rs = -0.33, p = 0.001) and 10-m walk (rs = -0.281, p = 0.007). PASE-C is a reliable and valid instrument for assessing the physical activity level of elderly in Chinese population.
The reliability of a VISION COACH task as a measure of psychomotor skills.

PubMed

Xi, Yubin; Rosopa, Patrick J; Mossey, Mary; Crisler, Matthew C; Drouin, Nathalie; Kopera, Kevin; Brooks, Johnell O

2014-10-01

The VISION COACH™ interactive light board is designed to test and enhance participants' psychomotor skills. The primary goal of this study was to examine the test-retest reliability of the Full Field 120 VISION COACH task. One hundred eleven male and 131 female adult participants completed six trials where they responded to 120 randomly distributed lights displayed on the VISION COACH interactive light board. The mean time required for a participant to complete a trial was 101 seconds. Intraclass correlation coefficients, ranging from 0.962 to 0.987 suggest the VISION COACH Full Field 120 task was a reliable task. Cohen's d's of adjacent pairs of trials suggest learning effects did not negatively affect reliability after the third trial.
Validation of the Hebrew version of the Burn Specific Health Scale-Brief questionnaire.

PubMed

Stavrou, Demetris; Haik, Josef; Wiser, Itay; Winkler, Eyal; Liran, Alon; Holloway, Samantha; Boyd, Julie; Zilinsky, Isaac; Weissman, Oren

2015-02-01

The Burns Specific Health Scale-Brief (BSHS-B) questionnaire is a suitable measurement tool for the assessment of general, physical, mental, and social health aspects of the burn survivor. To translate, culturally adapt and validate the BSHS-B to Hebrew (BSHS-H), and to investigate its psychometric properties. Eighty-six Hebrew speaking burn survivors filled out the BSHS-B and SF-36 questionnaires. Ten of them (11.63%) completed a retest. The psychometric properties of the scale were evaluated. Internal consistency, criterion validity, and construct validity were assessed using interclass correlation coefficient, Cronbach's alpha statistic, Spearman rank test, and Mann-Whitney U test respectively. BSHS-H Cronbach's alpha coefficient was 0.97. Test-retest interclass coefficients were between 0.81 and 0.98. BSHS-H was able to discriminate between facial burns, hand burns and burns >10% body surface area (p<0.05). BSHS-H and SF-36 were positively correlated (r(2)=0.667, p<0.01). BSHS-H is a reliable and valid instrument for use in the Israeli burn survivor population. The translation and cross-cultural adaptation of this disease specific scale allows future comparative international studies. Copyright © 2014 Elsevier Ltd and ISBI. All rights reserved.
Validation of the French version of the Burn Specific Health Scale-Brief (BSHS-B) questionnaire.

PubMed

Gandolfi, S; Auquit-Auckbur, I; Panunzi, S; Mici, E; Grolleau, J-L; Chaput, B

2016-11-01

The Burn Specific Health Scale-Brief questionnaire is a widely validated tool for estimating the health related quality of life and for assessing the best multidisciplinary management of burn patients. The aim of this study was to translate the BSHS-B into French and to investigate its reliability and validity. According to the procedure proposed by the Scientific Advisory Committee of the Medical Outcomes Trust, the Burn Specific Health Scale-Brief (BSHS-B) was translated from the English version into French. In order to test the reliability of the French version of the BSHS-B, 53 burn patients French speakers completed the BSHS-B and SF-36 questionnaires from two to four years after burn. Ten of them have been re-tested at 6 months after the first evaluation. To evaluate clinical utility of the BSHS-F, internal consistency, construct validity (using SF-36) and stability in time were assessed using Cronbach's alpha statistic, Spearman rank test, and intra-class correlation coefficient respectively. The French version of the BSHS-B Cronbach's alpha coefficient was 0.93 and was >0.80 for all the sub-domains. French version of the BSHS-B and the SF-36 were positively correlated, all the associations were statistically significant (p<0.01). Intra-class correlation coefficients for test-retest ranged between 0.95 and 0.99 for the sub-domains. The intra-class correlation coefficient (ICC) for the total score was 0.98. The French version of the BSHS-B shows a robust rate of internal consistency, construct validity and stability in time, supporting its application in routine clinical practice as well as in international studies. Copyright © 2016 Elsevier Ltd and ISBI. All rights reserved.
A New Protocol to Evaluate the Effect of Topical Anesthesia

PubMed Central

List, Thomas; Mojir, Katerina; Svensson, Peter; Pigg, Maria

2014-01-01

This double-blind, placebo-controlled, randomized cross-over clinical experimental study tested the reliability, validity, and sensitivity to change of punctuate pain thresholds and self-reported pain on needle penetration. Female subjects without orofacial pain were tested in 2 sessions at 1- to 2-week intervals. The test site was the mucobuccal fold adjacent to the first upper right premolar. Active lidocaine hydrochloride 2% (Dynexan) or placebo gel was applied for 5 minutes, and sensory testing was performed before and after application. The standardized quantitative sensory test protocol included mechanical pain threshold (MPT), pressure pain threshold (PPT), mechanical pain sensitivity (MPS), and needle penetration sensitivity (NPS) assessments. Twenty-nine subjects, mean (SD) age 29.0 (10.2) years, completed the study. Test-retest reliability intraclass correlation coefficient at 10-minute intervals between examinations was MPT 0.69, PPT 0.79, MPS 0.72, and NPS 0.86. A high correlation was found between NPS and MPS (r = 0.84; P < .001), whereas NPS and PPT were not significantly correlated. The study found good to excellent test-retest reliability for all measures. None of the sensory measures detected changes in sensitivity following lidocaine 2% or placebo gel. Electronic von Frey assessments of MPT/MPS on oral mucosa have good validity. PMID:25517548
Validation of the Japanese version of the Pediatric Quality of Life Inventory (PedsQL) Cancer Module.

PubMed

Tsuji, Naoko; Kakee, Naoko; Ishida, Yasushi; Asami, Keiko; Tabuchi, Ken; Nakadate, Hisaya; Iwai, Tsuyako; Maeda, Miho; Okamura, Jun; Kazama, Takuro; Terao, Yoko; Ohyama, Wataru; Yuza, Yuki; Kaneko, Takashi; Manabe, Atsushi; Kobayashi, Kyoko; Kamibeppu, Kiyoko; Matsushima, Eisuke

2011-04-10

The PedsQL 3.0 Cancer Module is a widely used instrument to measure pediatric cancer specific health-related quality of life (HRQOL) for children aged 2 to 18 years. We developed the Japanese version of the PedsQL Cancer Module and investigated its reliability and validity among Japanese children and their parents. Participants were 212 children with cancer and 253 of their parents. Reliability was determined by internal consistency using Cronbach's coefficient alpha and test-retest reliability using intra-class correlation coefficient (ICC). Validity was assessed through factor validity, convergent and discriminant validity, concurrent validity, and clinical validity. Factor validity was examined by exploratory factor analysis. Convergent and discriminant validity were examined by multitrait scaling analysis. Concurrent validity was assessed using Spearman's correlation coefficients between the Cancer Module and Generic Core Scales, and the comparison of the scores of child self-reports with those of other self-rating depression scales for children. Clinical validity was assessed by comparing the on- and off- treatment scores using Kruskal-Wallis and Mann-Whitney U tests. Cronbach's coefficient alpha was over 0.70 for the total scale and over 0.60 for each subscale by age except for the 'pain and hurt' subscale for children aged 5 to 7 years. For test-retest reliability, the ICC exceeded 0.70 for the total scale for each age. Exploratory factor analysis demonstrated sufficient factorial validity. Multitrait scaling analysis showed high success rates. Strong correlations were found between the reports by children and their parents, and the scores of the Cancer Module and the Generic Core Scales except for 'treatment anxiety' subscales for child reports. The Depression Self-Rating Scale for Children (DSRS-C) scores were significantly correlated with emotional domains and the total score of the cancer module. Children who had been off treatment over 12 months demonstrated significantly higher scores than those on treatment. The results demonstrate the reliability and validity of the Japanese version of the PedsQL Cancer Module among Japanese children.
Measurement Properties of the Modified Spinal Function Sort (M-SFS): Is It Reliable and Valid in Workers with Chronic Musculoskeletal Pain?

PubMed

Trippolini, Maurizio Alen; Janssen, Svenja; Hilfiker, Roger; Oesch, Peter

2018-06-01

Purpose To analyze the reliability and validity of a picture-based questionnaire, the Modified Spinal Function Sort (M-SFS). Methods Sixty-two injured workers with chronic musculoskeletal disorders (MSD) were recruited from two work rehabilitation centers. Internal consistency was assessed by Cronbach's alpha. Construct validity was tested based on four a priori hypotheses. Structural validity was measured with principal component analysis (PCA). Test-retest reliability and agreement was evaluated using intraclass correlation coefficient (ICC) and measurement error with the limits of agreement (LoA). Results Total score of the M-SFS was 54.4 (SD 16.4) and 56.1 (16.4) for test and retest, respectively. Item distribution showed no ceiling effects. Cronbach's alpha was 0.94 and 0.95 for test and retest, respectively. PCA showed the presence of four components explaining a total of 74% of the variance. Item communalities were >0.6 in 17 out of 20 items. ICC was 0.90, LoA was ±12.6/16.2 points. The correlations between the M-SFS were 0.89 with the original SFS, 0.49 with the Pain Disability Index, -0.37 and -0.33 with the Numeric Rating Scale for actual pain, -0.52 for selfreported disability due to chronic low back pain, and 0.50, 0.56-0.59 with three distinct lifting tests. No a priori defined hypothesis for construct validity was rejected. Conclusions The M-SFS allows reliable and valid assessment of perceived self-efficacy for work-related tasks and can be recommended for use in patients with chronic MSD. Further research should investigate the proposed M-SFS score of <56 for its predictive validity for non-return to work.
Development and psychometric testing of the Cancer Knowledge Scale for Elders.

PubMed

Su, Ching-Ching; Chen, Yuh-Min; Kuo, Bo-Jein

2009-03-01

To develop the Cancer Knowledge Scale for Elders and test its validity and reliability. The number of elders suffering from cancer is increasing. To facilitate cancer prevention behaviours among elders, they shall be educated about cancer-related knowledge. Prior to designing a programme that would respond to the special needs of elders, understanding the cancer-related knowledge within this population was necessary. However, extensive review of the literature revealed a lack of appropriate instruments for measuring cancer-related knowledge. A valid and reliable cancer knowledge scale for elders is necessary. A non-experimental methodological design was used to test the psychometric properties of the Cancer Knowledge Scale for Elders. Item analysis was first performed to screen out items that had low corrected item-total correlation coefficients. Construct validity was examined with a principle component method of exploratory factor analysis. Cancer-related health behaviour was used as the criterion variable to evaluate criterion-related validity. Internal consistency reliability was assessed by the KR-20. Stability was determined by two-week test-retest reliability. The factor analysis yielded a four-factor solution accounting for 49.5% of the variance. For criterion-related validity, cancer knowledge was positively correlated with cancer-related health behaviour (r = 0.78, p < 0.001). The KR-20 coefficients of each factor were 0.85, 0.76, 0.79 and 0.67 and 0.87 for the total scale. Test-retest reliability over a two-week period was 0.83 (p < 0.001). This study provides evidence for content validity, construct validity, criterion-related validity, internal consistency and stability of the Cancer Knowledge Scale for Elders. The results show that this scale is an easy-to-use instrument for elders and has adequate validity and reliability. The scale can be used as an assessment instrument when implementing cancer education programmes for elders. It can also be used to evaluate the effects of education programmes.
Reliability of two social cognition tests: The combined stories test and the social knowledge test.

PubMed

Thibaudeau, Élisabeth; Cellard, Caroline; Legendre, Maxime; Villeneuve, Karèle; Achim, Amélie M

2018-04-01

Deficits in social cognition are common in psychiatric disorders. Validated social cognition measures with good psychometric properties are necessary to assess and target social cognitive deficits. Two recent social cognition tests, the Combined Stories Test (COST) and the Social Knowledge Test (SKT), respectively assess theory of mind and social knowledge. Previous studies have shown good psychometric properties for these tests, but the test-retest reliability has never been documented. The aim of this study was to evaluate the test-retest reliability and the inter-rater reliability of the COST and the SKT. The COST and the SKT were administered twice to a group of forty-two healthy adults, with a delay of approximately four weeks between the assessments. Excellent test-retest reliability was observed for the COST, and a good test-retest reliability was observed for the SKT. There was no evidence of practice effect. Furthermore, an excellent inter-rater reliability was observed for both tests. This study shows a good reliability of the COST and the SKT that adds to the good validity previously reported for these two tests. These good psychometrics properties thus support that the COST and the SKT are adequate measures for the assessment of social cognition. Copyright © 2018. Published by Elsevier B.V.
Psychometric properties of the Mayo Elbow Performance Score.

PubMed

Celik, Derya

2015-06-01

To translate and culturally adapt the Mayo Elbow Performance Score (MEPS), a widely used instrument for evaluating disability associated with elbow injuries, into Turkish (MEPS-T) and to determine psychometric properties of the translated version. The MEPS was translated into Turkish using published methodological guidelines. The measurement properties of the MEPS-T (construct validity and floor and ceiling effects) were tested in 91 patients with elbow pathology. The reproducibility of the MEPS-T was tested in 59 patients over 7-14 days. The responsiveness of the MEPS-T was tested in a subgroup of 46 patients diagnosed with lateral epicondylitis and who received conservative treatment for 6 weeks. The interclass correlation coefficient (ICC) was used to estimate the test-retest reliability. The construct validity was analyzed with the disabilities of the arm, shoulder and hand (DASH), Visual Analog Scale (VAS) and the Short Form 36 (SF-36). Effect size (ES) was used to assess the responsiveness. The distribution of floor and ceiling effects was determined. The MEPS-T showed very good test-retest reliability (ICC 0.89). The correlation coefficients between the MEPS-T and DASH and VAS were -0.61 and -0.53, respectively (p < 0.001). The highest correlations were between the MEPS-T and the mental component summary (r = 0.47, p = 0.001) and role emotional (r = 0.45, p = 0.001). The MEPS-T ES, 0.50, was moderate (95% CI 0.33-0.62). We observed no ceiling or floor effects. The MEPS-T represents a valid, reliable and moderately responsive instrument for evaluating patients with elbow disease.
Test-Retest Reliability and Predictive Validity of the Implicit Association Test in Children

ERIC Educational Resources Information Center

Rae, James R.; Olson, Kristina R.

2018-01-01

The Implicit Association Test (IAT) is increasingly used in developmental research despite minimal evidence of whether children's IAT scores are reliable across time or predictive of behavior. When test-retest reliability and predictive validity have been assessed, the results have been mixed, and because these studies have differed on many…
Sino-Nasal Outcome Test-22: Translation, Cross-cultural Adaptation, and Validation in Hebrew-Speaking Patients.

PubMed

Shapira Galitz, Yael; Halperin, Doron; Bavnik, Yosef; Warman, Meir

2016-05-01

To perform the translation, cross-cultural adaptation, and validation of the Sino-Nasal Outcome Test-22 (SNOT-22) questionnaire to the Hebrew language. A single-center prospective cross-sectional study. Seventy-three chronic rhinosinusitis (CRS) patients and 73 patients without sinonasal disease filled the Hebrew version of the SNOT-22 questionnaire. Fifty-one CRS patients underwent endoscopic sinus surgery, out of which 28 filled a postoperative questionnaire. Seventy-three healthy volunteers without sinonasal disease also answered the questionnaire. Internal consistency, test-retest reproducibility, validity, and responsiveness of the questionnaire were evaluated. Questionnaire reliability was excellent, with a high internal consistency (Cronbach's alpha coefficient, 0.91-0.936) and test-retest reproducibility (Spearman's coefficient, 0.962). Mean scores for the preoperative, postoperative, and control groups were 50.44, 29.64, and 13.15, respectively (P < .0001 for CRS vs controls, P < .001 for preoperative vs postoperative), showing validity and responsiveness of the questionnaire. The Hebrew version of SNOT-22 questionnaire is a valid outcome measure for patients with CRS with or without nasal polyps. © American Academy of Otolaryngology—Head and Neck Surgery Foundation 2016.
Test-retest reliability and repeatability of renal diffusion tensor MRI in healthy subjects.

PubMed

Cutajar, Marica; Clayden, Jonathan D; Clark, Christopher A; Gordon, Isky

2011-12-01

This study assessed test-retest reliability and repeatability of diffusion tensor imaging (DTI) in the kidneys. Seven healthy volunteers (age range, 19-31 years), were imaged three consecutive times on the same day (short-term reliability) and the same imaging protocol was repeated after a month (long-term reliability). Diffusion-weighted magnetic resonance imaging scans in the coronal-oblique projection of the kidney were acquired on a 1.5 T scanner using a multi-section echo-planar sequence; six contiguous slices each 5 mm thick, diffusion sensitisation along 20 non-collinear directions, TR=730 ms, TE=73 ms and 2 b-values (0 and 400 s mm(-2)). Volunteers were asked to hold their breath throughout each data acquisition (approx. 20 s). The apparent diffusion coefficient (ADC) and fractional anisotropy (FA) values were obtained from maps generated using dedicated software MIStar (Apollo Medical Imaging, Melbourne, Australia). Statistical analyses of both short- and long-term repeats were carried out from which the within-subject coefficient of variation (wsCV) was calculated. The wsCV obtained for both the ADC and FA values were less than 10% in all the analyses carried out. In addition, paired (repeated measures) t-test was used to measure the variation between the diffusion parameters collected from the two scanning sessions a month apart. It showed no significant difference and the wsCV obtained after comparing the first and second scans were found to be smaller than 15% for both ADC and FA. Renal DTI produces reliable and repeatable results which make longitudinal investigation of patients viable. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.
Sexual function in cervical cancer patients: Psychometric properties and performance of a Chinese version of the Female Sexual Function Index.

PubMed

Liu, Huayun; Yu, Juping; Chen, Yongyi; He, Pingping; Zhou, Lianqing; Tang, Xinhui; Liu, Xiangyu; Li, Xuying; Wu, Yanping; Wang, Yuhua

2016-02-01

This study aimed to examine the psychometric properties and performance of a Chinese version of the Female Sexual Function Index (FSFI) among a sample of Chinese women with cervical cancer. A cross-sectional survey design was used. The respondents included 215 women with cervical cancer in an oncology hospital in China. A translated Chinese version of the FSFI was used to investigate their sexual functioning. Psychometric testing included internal consistency reliability (Cronbach's alpha coefficient and item-total correlations), test-retest reliability, construct validity (principal component analysis via oblique rotation and confirmatory factor analysis), and variability (floor and ceiling effects). The mean score of the total scale was 20.65 ± 4.77. The Cronbach values were .94 for the total scale, .72-.90 for the domains. Test-retest correlation coefficients over 2-4 weeks were .84 (p < .05) for the total scale, .68-.83 for the subscales. Item-total correlation coefficients ranged between .47 and .83 (p < .05). A five-factor model was identified via principal component analysis and established by confirmatory factor analysis, including desire/arousal, lubrication, orgasm, satisfaction, and pain. There was no evidence of floor or ceiling effects. With good psychometric properties similar to its original English version, this Chinese version of the FSFI is demonstrated to be a reliable and valid instrument that can be used to assess sexual functioning of women with cervical cancer in China. Future research is still needed to confirm its psychometric properties and performance among a large sample. Copyright © 2015 Elsevier Ltd. All rights reserved.
Health-related quality of life in young adults in education, employment, or training: development of the Japanese version of Pediatric Quality of Life Inventory (PedsQL) Generic Core Scales Young Adult Version.

PubMed

Kaneko, Mei; Sato, Iori; Soejima, Takafumi; Kamibeppu, Kiyoko

2014-09-01

The purpose of the study is to develop a Japanese version of the Pediatric Quality of Life Inventory (PedsQL) Generic Core Scales Young Adult Version (PedsQL-YA-J) and determine the feasibility, reliability, and validity of the scales. Translation equivalence and content validity were verified using back-translation and cognitive debriefing tests. A total of 428 young adults recruited from one university, two vocational schools, or five companies completed questionnaires. We determined questionnaire feasibility, internal consistency, and test-retest reliability; checked concurrent validity against the Center for Epidemiologic Studies Depression Scale (CES-D); determined convergent and discriminant validity with the Medical Outcome Study 36-item Short Form Health Survey (SF-36); described known-groups validity with regard to subjective symptoms, illness or injury requiring regular medical visits, and depression; and verified factorial validity. All scales were internally consistent (Cronbach's coefficient alpha = 0.77-0.86); test-retest reliability was acceptable (intraclass correlation coefficient = 0.57-0.69); and all scales were concurrently valid with depression (Pearson's correlation coefficient = 0.43-0.57). The scales convergent and discriminant validity with the SF-36 and CES-D were acceptable. Evaluation of known-groups validity confirmed that the Physical Functioning scale was sensitive for subjective symptoms, the Emotional Functioning scale for depression, and the Work/School Functioning scale for illness or injury requiring regular medical visits. Exploratory factor analysis found a six-factor structure consistent with the assumed structure (cumulative proportion = 57.0%). The PedsQL-YA-J is suitable for assessing health-related quality of life in young adults in education, employment, or training, and for clinical trials and epidemiological research.
General inattentiveness is a long-term reliable trait independently predictive of psychological health: Danish validation studies of the Mindful Attention Awareness Scale.

PubMed

Jensen, Christian Gaden; Niclasen, Janni; Vangkilde, Signe Allerup; Petersen, Anders; Hasselbalch, Steen Gregers

2016-05-01

The Mindful Attention Awareness Scale (MAAS) measures perceived degree of inattentiveness in different contexts and is often used as a reversed indicator of mindfulness. MAAS is hypothesized to reflect a psychological trait or disposition when used outside attentional training contexts, but the long-term test-retest reliability of MAAS scores is virtually untested. It is unknown whether MAAS predicts psychological health after controlling for standardized socioeconomic status classifications. First, MAAS translated to Danish was validated psychometrically within a randomly invited healthy adult community sample (N = 490). Factor analysis confirmed that MAAS scores quantified a unifactorial construct of excellent composite reliability and consistent convergent validity. Structural equation modeling revealed that MAAS scores contributed independently to predicting psychological distress and mental health, after controlling for age, gender, income, socioeconomic occupational class, stressful life events, and social desirability (β = 0.32-.42, ps < .001). Second, MAAS scores showed satisfactory short-term test-retest reliability in 100 retested healthy university students. Finally, MAAS sample mean scores as well as individuals' scores demonstrated satisfactory test-retest reliability across a 6 months interval in the adult community (retested N = 407), intraclass correlations ≥ .74. MAAS scores displayed significantly stronger long-term test-retest reliability than scores measuring psychological distress (z = 2.78, p = .005). Test-retest reliability estimates did not differ within demographic and socioeconomic strata. Scores on the Danish MAAS were psychometrically validated in healthy adults. MAAS's inattentiveness scores reflected a unidimensional construct, long-term reliable disposition, and a factor of independent significance for predicting psychological health. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Reliability of the Cooking Task in adults with acquired brain injury.

PubMed

Poncet, Frédérique; Swaine, Bonnie; Taillefer, Chantal; Lamoureux, Julie; Pradat-Diehl, Pascale; Chevignard, Mathilde

2015-01-01

Acquired brain injury (ABI) often leads to deficits in executive functioning (EF) responsible for severe and long-standing disabilities in daily life activities. The Cooking Task is an ecological and valid test of EF involving multi-tasking in a real environment. Given its complex scoring system, it is important to establish the tool's reliability. The objective of the study was to examine the reliability of the Cooking Task (internal consistency, inter-rater and test-retest reliability). A total of 160 patients with ABI (113 men, mean age 37 years, SD = 14.3) were tested using the Cooking Task. For test-retest reliability, patients were assessed by the same rater on two occasions (mean interval 11 days) while two raters independently and simultaneously observed and scored patients' performances to estimate inter-rater reliability. Internal consistency was high for the global scale (Cronbach α = .74). Inter-rater reliability (n = 66) for total errors was also high (ICC = .93), however the test-retest reliability (n = 11) was poor (ICC = .36). In general the Cooking Task appears to be a reliable tool. The low test-retest results were expected given the importance of EF in the performance of novel tasks.
Reliability and convergent validity of the five-step test in people with chronic stroke.

PubMed

Ng, Shamay S M; Tse, Mimi M Y; Tam, Eric W C; Lai, Cynthia Y Y

2018-01-10

(i) To estimate the intra-rater, inter-rater and test-retest reliabilities of the Five-Step Test (FST), as well as the minimum detectable change in FST completion times in people with stroke. (ii) To estimate the convergent validity of the FST with other measures of stroke-specific impairments. (iii) To identify the best cut-off times for distinguishing FST performance in people with stroke from that of healthy older adults. A cross-sectional study. University-based rehabilitation centre. Forty-eight people with stroke and 39 healthy controls. None. The FST, along with (for the stroke survivors only) scores on the Fugl-Meyer Lower Extremity Assessment (FMA-LE), the Berg Balance Scale (BBS), Limits of Stability (LOS) tests, and Activities-specific Balance Confidence (ABC) scale were tested. The FST showed excellent intra-rater (intra-class correlation coefficient; ICC = 0.866-0.905), inter-rater (ICC = 0.998), and test-retest (ICC = 0.838-0.842) reliabilities. A minimum detectable change of 9.16 s was found for the FST in people with stroke. The FST correlated significantly with the FMA-LE, BBS, and LOS results in the forward and sideways directions (r = -0.411 to -0.716, p < 0.004). The FST completion time of 13.35 s was shown to discriminate reliably between people with stroke and healthy older adults. The FST is a reliable, easy-to-administer clinical test for assessing stroke survivors' ability to negotiate steps and stairs.

Reliability and validity of an audio signal modified shuttle walk test.

PubMed

Singla, Rupak; Rai, Richa; Faye, Abhishek Anil; Jain, Anil Kumar; Chowdhury, Ranadip; Bandyopadhyay, Debdutta

2017-01-01

The audio signal in the conventionally accepted protocol of shuttle walk test (SWT) is not well-understood by the patients and modification of the audio signal may improve the performance of the test. The aim of this study is to study the validity and reliability of an audio signal modified SWT, called the Singla-Richa modified SWT (SWTSR), in healthy normal adults. In SWTSR, the audio signal was modified with the addition of reverse counting to it. A total of 54 healthy normal adults underwent conventional SWT (CSWT) at one instance and two times SWTSRon the same day. The validity was assessed by comparing outcomes of the SWTSRto outcomes of CSWT using the Pearson correlation coefficient and Bland-Altman plot. Test-retest reliability of SWTSRwas assessed using the intraclass correlation coefficient (ICC). The acceptability of the modified test in comparison to the conventional test was assessed using Likert scale. The distance walked (mean ± standard deviation) in the CSWT and SWTSRtest was 853.33 ± 217.33 m and 857.22 ± 219.56 m, respectively (Pearson correlation coefficient - 0.98; P < 0.001) indicating SWTSRto be a valid test. The SWTSRwas found to be a reliable test with ICC of 0.98 (95% confidence interval: 0.97-0.99). The acceptability of SWTSRwas significantly higher than CSWT. The SWTSRwith modified audio signal with reverse counting is a reliable as well as a valid test when compared with CSWT in healthy normal adults. It better understood by subjects compared to CSWT.
Reliability of plasma lipopolysaccharide-binding protein (LBP) from repeated measures in healthy adults.

PubMed

Citronberg, Jessica S; Wilkens, Lynne R; Lim, Unhee; Hullar, Meredith A J; White, Emily; Newcomb, Polly A; Le Marchand, Loïc; Lampe, Johanna W

2016-09-01

Plasma lipopolysaccharide-binding protein (LBP), a measure of internal exposure to bacterial lipopolysaccharide, has been associated with several chronic conditions and may be a marker of chronic inflammation; however, no studies have examined the reliability of this biomarker in a healthy population. We examined the temporal reliability of LBP measured in archived samples from participants in two studies. In Study one, 60 healthy participants had blood drawn at two time points: baseline and follow-up (either three, six, or nine months). In Study two, 24 individuals had blood drawn three to four times over a seven-month period. We measured LBP in archived plasma by ELISA. Test-retest reliability was estimated by calculating the intraclass correlation coefficient (ICC). Plasma LBP concentrations showed moderate reliability in Study one (ICC 0.60, 95 % CI 0.43-0.75) and Study two (ICC 0.46, 95 % CI 0.26-0.69). Restricting the follow-up period improved reliability. In Study one, the reliability of LBP over a three-month period was 0.68 (95 % CI: 0.41-0.87). In Study two, the ICC of samples taken ≤seven days apart was 0.61 (95 % CI 0.29-0.86). Plasma LBP concentrations demonstrated moderate test-retest reliability in healthy individuals with reliability improving over a shorter follow-up period.
Evaluating the reliability of an injury prevention screening tool: Test-retest study.

PubMed

Gittelman, Michael A; Kincaid, Madeline; Denny, Sarah; Wervey Arnold, Melissa; FitzGerald, Michael; Carle, Adam C; Mara, Constance A

2016-10-01

A standardized injury prevention (IP) screening tool can identify family risks and allow pediatricians to address behaviors. To assess behavior changes on later screens, the tool must be reliable for an individual and ideally between household members. Little research has examined the reliability of safety screening tool questions. This study utilized test-retest reliability of parent responses on an existing IP questionnaire and also compared responses between household parents. Investigators recruited parents of children 0 to 1 year of age during admission to a tertiary care children's hospital. When both parents were present, one was chosen as the "primary" respondent. Primary respondents completed the 30-question IP screening tool after consent, and they were re-screened approximately 4 hours later to test individual reliability. The "second" parent, when present, only completed the tool once. All participants received a 10-dollar gift card. Cohen's Kappa was used to estimate test-retest reliability and inter-rater agreement. Standard test-retest criteria consider Kappa values: 0.0 to 0.40 poor to fair, 0.41 to 0.60 moderate, 0.61 to 0.80 substantial, and 0.81 to 1.00 as almost perfect reliability. One hundred five families participated, with five lost to follow-up. Thirty-two (30.5%) parent dyads completed the tool. Primary respondents were generally mothers (88%) and Caucasian (72%). Test-retest of the primary respondents showed their responses to be almost perfect; average 0.82 (SD = 0.13, range 0.49-1.00). Seventeen questions had almost perfect test-retest reliability and 11 had substantial reliability. However, inter-rater agreement between household members for 12 objective questions showed little agreement between responses; inter-rater agreement averaged 0.35 (SD = 0.34, range -0.19-1.00). One question had almost perfect inter-rater agreement and two had substantial inter-rater agreement. The IP screening tool used by a single individual had excellent test-retest reliability for nearly all questions. However, when a reporter changes from pre- to postintervention, differences may reflect poor reliability or different subjective experiences rather than true change.
The development of two postnatal health instruments: one for mothers (M-PHI) and one for fathers (F-PHI) to measure health during the first year of parenting.

PubMed

Jones, G L; Morrell, C J; Cooke, J M; Speier, D; Anumba, D; Stewart-Brown, S

2011-09-01

To develop and psychometrically evaluate two questionnaires measuring both positive and negative postnatal health of mothers (M-PHI) and fathers (F-PHI) during the first year of parenting. The M-PHI and the F-PHI were developed in four stages. Stage 1: Postnatal women's focus group (M-PHI) and postnatal fathers' postal questionnaire (F-PHI); Stage 2: Qualitative interviews; Stage 3: Pilot postal survey and main postal survey; and Stage 4: Test-retest postal survey. The M-PHI consisted of a 29-item core questionnaire with six main scales and five conditional scales. The F-PHI consisted of a 27-item questionnaire with six main scales. All scales achieved good internal reliability (Cronbach's α 0.66-0.87 for M-PHI, 0.72-0.90 for F-PHI). Intraclass correlation coefficients demonstrated high test-retest reliability (0.60-0.88). Correlation coefficients supported the criterion validity of the M-PHI and the F-PHI when tested against the Short-Form-12 (SF-12), Edinburgh Postnatal Depression Scale (EPDS) and the Warwick and Edinburgh Mental Well-Being Scale (WEMWBS). The M-PHI and F-PHI are valid, reliable, parent-generated instruments. These unique instruments will be invaluable for practitioners wishing to promote family-centred care and for trialists and other researchers requiring a validated instrument to measure both positive and negative health during the first postnatal year, as to date no such measurement has existed.
Construction and validation of the fatigue impact and severity self-assessment for youth and young adults with cerebral palsy.

PubMed

Brunton, Laura K; Bartlett, Doreen J

2017-07-01

The Fatigue Impact and Severity Self-Assessment (FISSA) was created to assess the impact, severity, and self-management of fatigue for individuals with cerebral palsy (CP) aged 14-31 years. Items were generated from a review of measures and interviews with individuals with CP. Focus groups with health-care professionals were used for item reduction. A mailed survey was conducted (n=163/367) to assess the factor structure, known-groups validity, and test-retest reliability. The final measure contained 31 items in two factors and discriminated between individuals expected to have different levels of fatigue. Individuals with more functional abilities reported less fatigue (p < 0.002) and those with higher pain reported higher fatigue (p < 0.001). The FISSA was shown to have adequate test-retest reliability, intraclass correlation coefficient (ICC)(3,1)=0.74 (95% confidence interval [CI] 0.53-0.87). The FISSA valid and reliable for individuals with CP. It allows for identification of the activities that may be compromised by fatigue to enhance collaborative goal setting and intervention planning.
The Bahasa Melayu version of the Nursing Stress Scale among nurses: a reliability study in Malaysia.

PubMed

Rosnawati, Muhamad Robat; Moe, Htay; Masilamani, Retneswari; Darus, A

2010-10-01

The Nursing Stress Scale (NSS) has been shown to be a valid and reliable instrument to assess occupational stressors among nurses. The NSS, which was previously used in the English version, was translated and back-translated into Bahasa Melayu. This study was conducted to assess the reliability of the Bahasa Melayu version of the NSS among nurses for future studies in this country. The reliability of the NSS was assessed after its readministration to 30 nurses with a 2-week interval. The Spearman coefficient was calculated to assess its stability. The internal consistency was measured through 4 measures: Cronbach's α, Spearman-Brown, Guttman split-half, and standardized item α coefficients. The total response rate was 70%. Test-retest reliability showed remarkable stability (Spearman's ρ exceeded .70). All 4 measures of internal consistency among items indicated a satisfactory level (coefficients in the range of .68 to .87). In conclusion, the Bahasa Melayu version of the NSS is a reliable and useful instrument for measuring the possible stressors at the workplace among nurses.
The Self-Evaluation Scale-Self-Report (SES-S) Version: Studies of Reliability and Validity

ERIC Educational Resources Information Center

Erford, Bradley T.; Bardhoshi, Gerta; Duncan, Kelly; Voucas, Stephanie; Dewlin, Emily

2017-01-01

The Self-Evaluation Scale-Self-Report version was designed to assess self-concept in students aged 10 to 17 years. Coefficient a was 0.94, and test-retest was 0.87. A unidimensional construct emerged with strong convergent validity with scores on the Piers-Harris 2 (r = 0.77) and Self-Efficacy Self-Report Scale (r = 0.70).
Reliability and Validity of the Turkish Version of the Voice-Related Quality of Life Measure.

PubMed

Tezcaner, Zahide Çiler; Aksoy, Songül

2017-03-01

This study aims to test the validity and reliability of the Turkish version of the Voice-Related Quality of Life (V-RQOL) questionnaire. This is a nonrandomized, prospective study with control group. The questionnaire was administered to 249 individuals-130 with vocal complaint and 119 without-with a mean age of 37.8 ± 12.3 years. The Turkish version of the Voice Handicap Index (VHI) and perceptual voice evaluation measures were also administered at 2-14 days for retest reliability. The instrument was submitted to validity and reliability evaluation. The V-RQOL measure showed a strong internal consistency and test-retest reliability; the Cronbach's alpha coefficient for the overall V-RQOL was 0.969, the physical functioning domain was 0.949, and the social-emotional domain was 0.940. In the test-retest reliability test, the overall V-RQOL was found to be 0.989. The construct validity of the V-RQOL was determined based on the strength and direction of its relation to the VHI and the perceptual voice evaluation measure. The higher the VHI level, the lower the physical functioning, social-emotional, and overall score levels of the V-RQOL (r = -0.927, r = -0.912, r = -0.944, respectively; P < 0.001). Following the perceptual voice self-assessment, a statistically significant difference was found between the V-RQOL scores of individuals who defined their voices as good, very good, and perfect, and those who defined their voices as bad and very bad (P < 0.001). The results suggest that the Turkish version of the V-RQOL measure has reliability and validity and may play a crucial role in evaluating Turkish-speaking patients with voice disorders. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Reliability and Validity of the Italian Version of the Protocol of Orofacial Myofunctional Evaluation with Scores (I-OMES).

PubMed

Scarponi, Letizia; de Felicio, Claudia Maria; Sforza, Chiarella; Pimenta Ferreira, Claudia Lucia; Ginocchio, Daniela; Pizzorni, Nicole; Barozzi, Stefania; Mozzanica, Francesco; Schindler, Antonio

2018-05-30

To evaluate the reliability, validity, and responsiveness of the Italian OMES (I-OMES). The study consisted of 3 phases: (1) internal consistency and reliability, (2) validity, and (3) responsiveness analysis. The recruited population included 27 patients with orofacial myofunctional disorders (OMD) and 174 healthy volunteers. Forty-seven subjects, 18 healthy and all recruited patients with OMD were assessed for inter-rater and test-retest reliability analysis. I-OMES and Nordic Orofacial Test - Screening (NOT-S) scores of the patients were correlated for concurrent validity analysis. I-OMES scores from 27 patients with OMD and 27 age- and gender-matched healthy subjects were compared to investigate construct validity. I-OMES scores before and after successful swallowing rehabilitation in patients were compared for responsiveness analysis. Adequate internal consistency (Cronbach α = 0.71) and strong inter-rater and test-retest reliability (intraclass coefficient correlation = 0.97 and 0.98, respectively) were found. I-OMES and NOT-S scores significantly and inversely correlated (r = -0.38). A statistical significance (p < 0.001) was found between the pathological group and the control group for the total I-OMES score. The mean I-OMES score improved from 90 (78-102) to 99 (89-103) after myofunctional rehabilitation (p < 0.001). The I-OMES is a reliable and valid tool to evaluate OMD. © 2018 S. Karger AG, Basel.
Stability of person ability measures in people with acquired brain injury in the use of everyday technology: the test-retest reliability of the Management of Everyday Technology Assessment (META).

PubMed

Malinowsky, Camilla; Kassberg, Ann-Charlotte; Larsson-Lund, Maria; Kottorp, Anders

2016-01-01

To evaluate the test-retest reliability of the Management of Everyday Technology Assessment (META) in a sample of people with acquired brain injury (ABI). The META was administered twice within a two-week period to 25 people with ABI. A Rasch measurement model was used to convert the META ordinal raw scores into equal-interval linear measures of each participant's ability to manage everyday technology (ET). Test-retest reliability of the stability of the person ability measures in the META was examined by a standardized difference Z-test and an intra-class correlations analysis (ICC 1). The results showed that the paired person ability measures generated from the META were stable over the test-retest period for 22 of the 25 subjects. The ICC 1 correlation was 0.63, which indicates good overall reliability. The META demonstrated acceptable test-retest reliability in a sample of people with ABI. The results illustrate the importance of using sufficiently challenging ETs (relative to a person's abilities) to generate stable META measurements over time. Implications for Rehabilitation The findings add evidence regarding the test-retest reliability of the person ability measures generated from the observation assessment META in a sample of people with ABI. The META might support professionals in the evaluation of interventions that are designed to improve clients' performance of activities including the ability to manage ET.
Test-Retest Reliability of Measures Commonly Used to Measure Striatal Dysfunction across Multiple Testing Sessions: A Longitudinal Study.

PubMed

Palmer, Clare E; Langbehn, Douglas; Tabrizi, Sarah J; Papoutsi, Marina

2017-01-01

Cognitive impairment is common amongst many neurodegenerative movement disorders such as Huntington's disease (HD) and Parkinson's disease (PD) across multiple domains. There are many tasks available to assess different aspects of this dysfunction, however, it is imperative that these show high test-retest reliability if they are to be used to track disease progression or response to treatment in patient populations. Moreover, in order to ensure effects of practice across testing sessions are not misconstrued as clinical improvement in clinical trials, tasks which are particularly vulnerable to practice effects need to be highlighted. In this study we evaluated test-retest reliability in mean performance across three testing sessions of four tasks that are commonly used to measure cognitive dysfunction associated with striatal impairment: a combined Simon Stop-Signal Task; a modified emotion recognition task; a circle tracing task; and the trail making task. Practice effects were seen between sessions 1 and 2 across all tasks for the majority of dependent variables, particularly reaction time variables; some, but not all, diminished in the third session. Good test-retest reliability across all sessions was seen for the emotion recognition, circle tracing, and trail making test. The Simon interference effect and stop-signal reaction time (SSRT) from the combined-Simon-Stop-Signal task showed moderate test-retest reliability, however, the combined SSRT interference effect showed poor test-retest reliability. Our results emphasize the need to use control groups when tracking clinical progression or use pre-baseline training on tasks susceptible to practice effects.
Reliability, precision, and gender differences in knee internal/external rotation proprioception measurements.

PubMed

Nagai, Takashi; Sell, Timothy C; Abt, John P; Lephart, Scott M

2012-11-01

To develop and assess the reliability and precision of knee internal/external rotation (IR/ER) threshold to detect passive motion (TTDPM) and determine if gender differences exist. Test-retest for the reliability/precision and cross-sectional for gender comparisons. University neuromuscular and human performance research laboratory. Ten subjects for the reliability and precision aim. Twenty subjects (10 males and 10 females) for gender comparisons. All TTDPM tests were performed using a multi-mode dynamometer. Subjects performed TTDPM at two knee positions (near IR or ER end-range). Intraclass correlation coefficient (ICC (3,k)) and standard error of measurement (SEM) were used to evaluate the reliability and precision. Independent t-tests were used to compare genders. TTDPM toward IR and ER at two knee positions. Intrasession and intersession reliability and precision were good (ICC=0.68-0.86; SEM=0.22°-0.37°). Females had significantly diminished TTDPM toward IR at IR-test position (males: 0.77°±0.14°, females: 1.18°±0.46°, p=0.021) and TTDPM toward IR at the ER-test position (males: 0.87°±0.13°, females: 1.36°±0.58°, p=0.026). No other significant gender differences were found (p>0.05). The current IR/ER TTDPM methods are reliable and accurate for the test-retest or cross-section research design. Gender differences were found toward IR where the ACL acts as the secondary restraint. Copyright © 2011 Elsevier Ltd. All rights reserved.
Translation of the Neck Disability Index and validation of the Greek version in a sample of neck pain patients.

PubMed

Trouli, Marianna N; Vernon, Howard T; Kakavelakis, Kyriakos N; Antonopoulou, Maria D; Paganas, Aristofanis N; Lionis, Christos D

2008-07-22

Neck pain is a highly prevalent condition resulting in major disability. Standard scales for measuring disability in patients with neck pain have a pivotal role in research and clinical settings. The Neck Disability Index (NDI) is a valid and reliable tool, designed to measure disability in activities of daily living due to neck pain. The purpose of our study was the translation and validation of the NDI in a Greek primary care population with neck complaints. The original version of the questionnaire was used. Based on international standards, the translation strategy comprised forward translations, reconciliation, backward translation and pre-testing steps. The validation procedure concerned the exploration of internal consistency (Cronbach alpha), test-retest reliability (Intraclass Correlation Coefficient, Bland and Altman method), construct validity (exploratory factor analysis) and responsiveness (Spearman correlation coefficient, Standard Error of Measurement and Minimal Detectable Change) of the questionnaire. Data quality was also assessed through completeness of data and floor/ceiling effects. The translation procedure resulted in the Greek modified version of the NDI. The latter was culturally adapted through the pre-testing phase. The validation procedure raised a large amount of missing data due to low applicability, which were assessed with two methods. Floor or ceiling effects were not observed. Cronbach alpha was calculated as 0.85, which was interpreted as good internal consistency. Intraclass correlation coefficient was found to be 0.93 (95% CI 0.84-0.97), which was considered as very good test-retest reliability. Factor analysis yielded one factor with Eigenvalue 4.48 explaining 44.77% of variance. The Spearman correlation coefficient (0.3; P = 0.02) revealed some relation between the change score in the NDI and Global Rating of Change (GROC). The SEM and MDC were calculated as 0.64 and 1.78 respectively. The Greek version of the NDI measures disability in patients with neck pain in a reliable, valid and responsive manner. It is considered a useful tool for research and clinical settings in Greek Primary Health Care.
Translation of the Neck Disability Index and validation of the Greek version in a sample of neck pain patients

PubMed Central

Trouli, Marianna N; Vernon, Howard T; Kakavelakis, Kyriakos N; Antonopoulou, Maria D; Paganas, Aristofanis N; Lionis, Christos D

2008-01-01

Background Neck pain is a highly prevalent condition resulting in major disability. Standard scales for measuring disability in patients with neck pain have a pivotal role in research and clinical settings. The Neck Disability Index (NDI) is a valid and reliable tool, designed to measure disability in activities of daily living due to neck pain. The purpose of our study was the translation and validation of the NDI in a Greek primary care population with neck complaints. Methods The original version of the questionnaire was used. Based on international standards, the translation strategy comprised forward translations, reconciliation, backward translation and pre-testing steps. The validation procedure concerned the exploration of internal consistency (Cronbach alpha), test-retest reliability (Intraclass Correlation Coefficient, Bland and Altman method), construct validity (exploratory factor analysis) and responsiveness (Spearman correlation coefficient, Standard Error of Measurement and Minimal Detectable Change) of the questionnaire. Data quality was also assessed through completeness of data and floor/ceiling effects. Results The translation procedure resulted in the Greek modified version of the NDI. The latter was culturally adapted through the pre-testing phase. The validation procedure raised a large amount of missing data due to low applicability, which were assessed with two methods. Floor or ceiling effects were not observed. Cronbach alpha was calculated as 0.85, which was interpreted as good internal consistency. Intraclass correlation coefficient was found to be 0.93 (95% CI 0.84–0.97), which was considered as very good test-retest reliability. Factor analysis yielded one factor with Eigenvalue 4.48 explaining 44.77% of variance. The Spearman correlation coefficient (0.3; P = 0.02) revealed some relation between the change score in the NDI and Global Rating of Change (GROC). The SEM and MDC were calculated as 0.64 and 1.78 respectively. Conclusion The Greek version of the NDI measures disability in patients with neck pain in a reliable, valid and responsive manner. It is considered a useful tool for research and clinical settings in Greek Primary Health Care. PMID:18647393
The Pareidolia Test: A Simple Neuropsychological Test Measuring Visual Hallucination-Like Illusions

PubMed Central

Mamiya, Yasuyuki; Nishio, Yoshiyuki; Watanabe, Hiroyuki; Yokoi, Kayoko; Uchiyama, Makoto; Baba, Toru; Iizuka, Osamu; Kanno, Shigenori; Kamimura, Naoto; Kazui, Hiroaki; Hashimoto, Mamoru; Ikeda, Manabu; Takeshita, Chieko; Shimomura, Tatsuo; Mori, Etsuro

2016-01-01

Background Visual hallucinations are a core clinical feature of dementia with Lewy bodies (DLB), and this symptom is important in the differential diagnosis and prediction of treatment response. The pareidolia test is a tool that evokes visual hallucination-like illusions, and these illusions may be a surrogate marker of visual hallucinations in DLB. We created a simplified version of the pareidolia test and examined its validity and reliability to establish the clinical utility of this test. Methods The pareidolia test was administered to 52 patients with DLB, 52 patients with Alzheimer’s disease (AD) and 20 healthy controls (HCs). We assessed the test-retest/inter-rater reliability using the intra-class correlation coefficient (ICC) and the concurrent validity using the Neuropsychiatric Inventory (NPI) hallucinations score as a reference. A receiver operating characteristic (ROC) analysis was used to evaluate the sensitivity and specificity of the pareidolia test to differentiate DLB from AD and HCs. Results The pareidolia test required approximately 15 minutes to administer, exhibited good test-retest/inter-rater reliability (ICC of 0.82), and moderately correlated with the NPI hallucinations score (rs = 0.42). Using an optimal cut-off score set according to the ROC analysis, and the pareidolia test differentiated DLB from AD with a sensitivity of 81% and a specificity of 92%. Conclusions Our study suggests that the simplified version of the pareidolia test is a valid and reliable surrogate marker of visual hallucinations in DLB. PMID:27171377
Utility of computer-assisted approaches for population surveillance of physical activity.

PubMed

Creamer, MeLisa; Bowles, Heather R; von Hofe, Belinda; Pettee Gabriel, Kelley; Kohl, Harold W; Bauman, Adrian

2014-08-01

Computer-assisted techniques may be a useful way to enhance physical activity surveillance and increase accuracy of reported behaviors. Evaluate the reliability and validity of a physical activity (PA) self-report instrument administered by telephone and internet. The telephone-administered Active Australia Survey was adapted into 2 forms for internet self-administration: survey questions only (internet-text) and with videos demonstrating intensity (internet-video). Data were collected from 158 adults (20-69 years, 61% female) assigned to telephone (telephone-interview) (n = 56), internet-text (n = 51), or internet-video (n = 51). Participants wore an accelerometer and completed a logbook for 7 days. Test-retest reliability was assessed using intraclass correlation coefficients (ICC). Convergent validity was assessed using Spearman correlations. Strong test-retest reliability was observed for PA variables in the internet-text (ICC = 0.69 to 0.88), internet-video (ICC = 0.66 to 0.79), and telephone-interview (ICC = 0.69 to 0.92) groups (P-values < 0.001). For total PA, correlations (ρ) between the survey and Actigraph+logbook were ρ = 0.47 for the internet-text group, ρ = 0.57 for the internet-video group, and ρ = 0.65 for the telephone-interview group. For vigorous-intensity activity, the correlations between the survey and Actigraph+logbook were 0.52 for internet-text, 0.57 for internet-video, and 0.65 for telephone-interview (P < .05). Internet-video of the survey had similar test-retest reliability and convergent validity when compared with the telephone-interview, and should continue to be developed.
Construct validity and reliability of the Music Attentiveness Screening Assessment (MASA).

PubMed

Waldon, Eric G; Broadhurst, Emily

2014-01-01

Music as alternate engagement (MAE) can be used effectively to distract children during painful or anxiety-provoking medical procedures. For such interventions to be successful, it would seem important to assess the degree to which a child can attend to musical stimuli. The purposes of this study were as follows: (a) To establish construct validity by determining the extent to which the Music Attentiveness Screening Assessment (MASA) measures auditory attention; and (b) to gather evidence regarding MASA test-retest and inter-observer reliability. The Auditory Attention (AA) subtest from the NEPSY-II (NEPSY, Second Edition) and the two items from MASA were administered to a nonclinical sample of children (N = 50) aged 5 to 9 years. There was a statistically significant proportion of AA score variance shared with MASA (both items), R (2) = .21, F(2, 47) = 6.34, p = .004. Test-retest reliability on the first MASA item was moderately high (Pearson r = .84) while on the second item it was lower (r = .63). Similarly, interobserver agreement was high for Item I (intraclass correlation coefficient [ICC] = .95) and lower for Item II (ICC = .71). Evidence suggests that MASA measures, at least in part, auditory attention. Despite this finding, a large proportion of unexplained variance remains. Furthermore, reliability estimates (test-retest and interobserver agreement) differ between both items. These findings are discussed with particular attention paid to the ways in which MASA should be revised and further study conducted. © the American Music Therapy Association 2014. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Test-retest reliability of cognitive EEG

NASA Technical Reports Server (NTRS)

McEvoy, L. K.; Smith, M. E.; Gevins, A.

2000-01-01

OBJECTIVE: Task-related EEG is sensitive to changes in cognitive state produced by increased task difficulty and by transient impairment. If task-related EEG has high test-retest reliability, it could be used as part of a clinical test to assess changes in cognitive function. The aim of this study was to determine the reliability of the EEG recorded during the performance of a working memory (WM) task and a psychomotor vigilance task (PVT). METHODS: EEG was recorded while subjects rested quietly and while they performed the tasks. Within session (test-retest interval of approximately 1 h) and between session (test-retest interval of approximately 7 days) reliability was calculated for four EEG components: frontal midline theta at Fz, posterior theta at Pz, and slow and fast alpha at Pz. RESULTS: Task-related EEG was highly reliable within and between sessions (r0.9 for all components in WM task, and r0.8 for all components in the PVT). Resting EEG also showed high reliability, although the magnitude of the correlation was somewhat smaller than that of the task-related EEG (r0.7 for all 4 components). CONCLUSIONS: These results suggest that under appropriate conditions, task-related EEG has sufficient retest reliability for use in assessing clinical changes in cognitive status.
Assessment of deep tissue hyperalgesia in the groin - a method comparison of electrical vs. pressure stimulation.

PubMed

Aasvang, E K; Werner, M U; Kehlet, H

2014-09-01

Deep pain complaints are more frequent than cutaneous in post-surgical patients, and a prevalent finding in quantitative sensory testing studies. However, the preferred assessment method - pressure algometry - is indirect and tissue unspecific, hindering advances in treatment and preventive strategies. Thus, there is a need for development of methods with direct stimulation of suspected hyperalgesic tissues to identify the peripheral origin of nociceptive input. We compared the reliability of an ultrasound-guided needle stimulation protocol of electrical detection and pain thresholds to pressure algometry, by performing identical test-retest sequences 10 days apart, in deep tissues in the groin region. Electrical stimulation was performed by five up-and-down staircase series of single impulses of 0.04 ms duration, starting from 0 mA in increments of 0.2 mA until a threshold was reached and descending until sensation was lost. Method reliability was assessed by Bland-Altman plots, descriptive statistics, coefficients of variance and intraclass correlation coefficients. The electrical stimulation method was comparable to pressure algometry regarding 10 days test-retest repeatability, but with superior same-day reliability for electrical stimulation (P < 0.05). Between-subject variance rather than within-subject variance was the main source for test variation. There were no systematic differences in electrical thresholds across tissues and locations (P > 0.05). The presented tissue-specific direct deep tissue electrical stimulation technique has equal or superior reliability compared with the indirect tissue-unspecific stimulation by pressure algometry. This method may facilitate advances in mechanism based preventive and treatment strategies in acute and chronic post-surgical pain states. © 2014 The Acta Anaesthesiologica Scandinavica Foundation. Published by John Wiley & Sons Ltd.
Cross-Cultural Adaptation and Validation of the Back Beliefs Questionnaire to the Arabic Language.

PubMed

Alamrani, Samia; Alsobayel, Hana; Alnahdi, Ali H; Moloney, Niamh; Mackey, Martin

2016-06-01

Translation, cross-cultural adaptation, and psychometric testing. To translate the Back Beliefs Questionnaire (BBQ) into Arabic and investigate its psychometric properties in an Arabic-speaking sample of individuals with low back pain (LBP). Back pain beliefs are associated with pain chronicity and disability in people with LBP. The BBQ is a recognized and frequently used tool for measuring these beliefs. To date the BBQ has not been translated into Arabic. The English version of the BBQ was translated and culturally adapted into Arabic (BBQ-Ar) according to published guidelines. The BBQ-Ar was then tested in a sample of 115 Arabic-speaking individuals with LBP. Reliability was evaluated through internal consistency (Cronbach α) and test-retest reliability (intraclass correlation coefficient), the latter in a subgroup of 25. Construct validity was assessed using exploratory factor analysis and by examining the correlation between the BBQ-Ar, the Oswestry Disability Index and a Numerical Pain Rating Scale. Internal consistency of the BBQ-Ar was good (Cronbach α = 0.77). Test-retest reliability was good (intraclass correlation coefficient [2,1] = 0.88). Exploratory factor analysis revealed a three-factor structure, explaining 46% of total variance, with the first factor alone explaining 24%. Eight of the nine scoring items were loaded on the first factor thus forming a unidimensional scale. A significant negative correlation was found between Oswestry Disability Index and BBQ-Ar scores (r = -0.307; P < 0.01), whereas no significant correlation was found between BBQ-Ar and Pain Rating Scale scores. No floor or celling effects were observed. The BBQ-Ar is a valid and reliable tool that can be used to assess back pain beliefs in Arabic-speaking individuals. N/A.

Family Impact Scale (FIS): Cross-cultural Adaptation and Psychometric Properties for the Peruvian Spanish Language.

PubMed

Abanto, Jenny; Albites, Ursula; Bönecker, Marcelo; Paiva, Saul M; Castillo, Jorge L; Aguilar-Gálvez, Denisse

2015-12-01

The lack of a Family Impact Scale (FIS) in Spanish language limits its use as an indicator in Spanish-speaking countries and precludes comparisons with data from other cultural and ethnic groups. The purpose of this study was therefore to adapt the FIS cross-culturally to the Peruvian Spanish language and assess its reliability and validity. In order to translate and adapt the FIS cross-culturally, it was answered by 60 parents in two pilot tests, after which it was tested on 200 parents of children aged 11 to 14 years who were clinically examined for dental caries experience and malocclusions. Internal consistency was assessed by Cronbach's alpha coefficient while repeat administration of the FIS on the same 200 parents enabled the test-retest reliability to be assessed via intraclass correlation coefficient (ICC). Construct and discriminant validity were based on associations of the FIS with global ratings of oral health and clinical groups, respectively. Mean (standard deviation) FIS total score was 5.20 (5.86). Internal consistency was confirmed by Cronbach's alpha 0.84. Test-retest reliability revealed excellent reproducibility (ICC = 0.96). Construct validity was good, demonstrating statistically significant associations between total FIS score and global ratings of oral health (p=0.007) and overall wellbeing (p=0.002), as well as for the subscale scores (p<0.05) with exception of the financial burden subscale. The FIS was also able to discriminate between children with and without dental caries experience and malocclusions (p<0.05). Satisfactory psychometric results for the Peruvian Spanish FIS confirm it as a reliable, valid instrument for assessing the impact on the family caused by children's oral conditions. Sociedad Argentina de Investigación Odontológica.
The Healthy Brain Network Serial Scanning Initiative: a resource for evaluating inter-individual differences and their reliabilities across scan conditions and sessions.

PubMed

O'Connor, David; Potler, Natan Vega; Kovacs, Meagan; Xu, Ting; Ai, Lei; Pellman, John; Vanderwal, Tamara; Parra, Lucas C; Cohen, Samantha; Ghosh, Satrajit; Escalera, Jasmine; Grant-Villegas, Natalie; Osman, Yael; Bui, Anastasia; Craddock, R Cameron; Milham, Michael P

2017-02-01

Although typically measured during the resting state, a growing literature is illustrating the ability to map intrinsic connectivity with functional MRI during task and naturalistic viewing conditions. These paradigms are drawing excitement due to their greater tolerability in clinical and developing populations and because they enable a wider range of analyses (e.g., inter-subject correlations). To be clinically useful, the test-retest reliability of connectivity measured during these paradigms needs to be established. This resource provides data for evaluating test-retest reliability for full-brain connectivity patterns detected during each of four scan conditions that differ with respect to level of engagement (rest, abstract animations, movie clips, flanker task). Data are provided for 13 participants, each scanned in 12 sessions with 10 minutes for each scan of the four conditions. Diffusion kurtosis imaging data was also obtained at each session. Technical validation and demonstrative reliability analyses were carried out at the connection-level using the Intraclass Correlation Coefficient and at network-level representations of the data using the Image Intraclass Correlation Coefficient. Variation in intrinsic functional connectivity across sessions was generally found to be greater than that attributable to scan condition. Between-condition reliability was generally high, particularly for the frontoparietal and default networks. Between-session reliabilities obtained separately for the different scan conditions were comparable, though notably lower than between-condition reliabilities. This resource provides a test-bed for quantifying the reliability of connectivity indices across subjects, conditions and time. The resource can be used to compare and optimize different frameworks for measuring connectivity and data collection parameters such as scan length. Additionally, investigators can explore the unique perspectives of the brain's functional architecture offered by each of the scan conditions. © The Author 2017. Published by Oxford University Press.
Intensity response function of the photopic negative response (PhNR): effect of age and test-retest reliability.

PubMed

Joshi, Nabin R; Ly, Emma; Viswanathan, Suresh

2017-08-01

To assess the effect of age and test-retest reliability of the intensity response function of the full-field photopic negative response (PhNR) in normal healthy human subjects. Full-field electroretinograms (ERGs) were recorded from one eye of 45 subjects, and 39 of these subjects were tested on two separate days with a Diagnosys Espion System (Lowell, MA, USA). The visual stimuli consisted of brief (<5 ms) red flashes ranging from 0.00625 to 6.4 phot cd.s/m 2 , delivered on a constant 7 cd/m 2 blue background. PhNR amplitudes were measured at its trough from baseline (BT) and from the preceding b-wave peak (PT), and b-wave amplitude was measured at its peak from the preceding a-wave trough or baseline if the a-wave was not present. The intensity response data of all three ERG measures were fitted with a generalized Naka-Rushton function to derive the saturated amplitude (V max ), semisaturation constant (K) and slope (n) parameters. Effect of age on the fit parameters was assessed with linear regression, and test-retest reliability was assessed with the Wilcoxon signed-rank test and Bland-Altman analysis. Holm's correction was applied to account for multiple comparisons. V max of BT was significantly smaller than that of PT and b-wave, and the V max of PT and b-wave was not significantly different from each other. The slope parameter n was smallest for BT and the largest for b-wave and the difference between the slopes of all three measures were statistically significant. Small differences observed in the mean values of K for the different measures did not reach statistical significance. The Wilcoxon signed-rank test indicated no significant differences between the two test visits for any of the Naka-Rushton parameters for the three ERG measures, and the Bland-Altman plots indicated that the mean difference between test and retest measurements of the different fit parameters was close to zero and within 6% of the average of the test and retest values of the respective parameters for all three ERG measurements, indicating minimal bias. While the coefficient of reliability (COR, defined as 1.96 times the standard deviation of the test and retest difference) of each fit parameter was more or less comparable across the three ERG measurements, the %COR (COR normalized to the mean test and retest measures) was generally larger for BT compared to both PT and b-wave for each fit parameter. The Naka-Rushton fit parameters did not show statistically significant changes with age for any of the ERG measures when corrections were applied for multiple comparisons. However, the V max of BT demonstrated a weak correlation with age prior to correction for multiple comparisons, and the effect of age on this parameter showed greater significance when the measure was expressed as a ratio of the V max of b-wave from the same subject. V max of the BT amplitude measure of PhNR at the best was weakly correlated with age. None of the other parameters of the Naka-Rushton fit to the intensity response data of either the PhNR or the b-wave showed any systematic changes with age. The test-retest reliability of the fit parameters for PhNR BT amplitude measurements appears to be lower than those of the PhNR PT and b-wave amplitude measurements.
The Italian version of the Mouth Handicap in Systemic Sclerosis scale (MHISS) is valid, reliable and useful in assessing oral health-related quality of life (OHRQoL) in systemic sclerosis (SSc) patients.

PubMed

Maddali Bongi, S; Del Rosso, A; Miniati, I; Galluccio, F; Landi, G; Tai, G; Matucci-Cerinic, M

2012-09-01

In systemic sclerosis (SSc), mouth and face involvement leads to problems in oral health-related quality of life (OHRQoL). Mouth Handicap in Systemic Sclerosis scale (MHISS) is a 12-item questionnaire specifically quantifying mouth disability in SSc, organized in 3 subscales. Our aim was to validate Italian version of MHISS, by assessing its test-retest reliability and internal and external consistency in Italian SSc patients. Forty SSc patients (7 dSSc, 33 lSSc; age and disease duration: 57.27 ± 11.41, 9.4 ± 4.4 years; 22 with sicca syndrome) were evaluated with MHISS. MHISS was translated following a forward-backward translation procedure, with independent translations and counter-translation. Test-retest reliability was evaluated, comparing the results of two administrations, with intraclass correlation coefficient (ICC). Internal consistency was assessed by Cronbach's α and external consistency by comparison with mouth opening. MHISS has a good test-retest reliability (ICC: 0.93) and internal consistency (Cronbach's α:0.99). A good external consistency was confirmed by correlation with mouth opening (rho: -0,3869, p: 0.0137). Total MHISS score was 17.65 ± 5.20, with scores of subscale 1 (reduced mouth opening) of 6.60 ± 2.85 and scores of subscales 2 (sicca syndrome) and 3 (aesthetic concerns) of 7.82 ± 2.59 and 3.22 ± 1.14. Total and subscale 2 scores are higher in dSSc than in lSSc. This result may be due to the higher presence of sicca syndrome in dSSc than in lSSc (p = 0.0109). Our results support validity and reliability in Italian SSc patients of MHISS, specifically measuring SSc OHRQoL.
Validity and Reliability of the Persian Version of the Dysphagia Handicap Index (DHI).

PubMed

Asadollahpour, Faezeh; Baghban, Kowsar; Asadi, Mozhgan

2015-05-01

The Dysphagia Handicap Index (DHI) is one of the instruments used for measuring a dysphagic patient's self-assessment. In some ways, it reflects the patient's quality of life. Although it has been recognized and widely applied in English speaking populations, it has not been used in its present forms in Persian speaking countries. The purpose of this study was to adapt a Persian version of the DHI and to evaluate its validity, consistency, and reliability in the Persian population with oropharyngeal dysphagia. Some stages for cross-cultural adaptation were performed, which consisted in translation, synthesis, back translation, review by an expert committee, and final proof reading. The generated Persian DHI was administered to 85 patients with oropharyngeal dysphagia and 89 control subjects at Zahedan city between May 2013 and August 2013. The patients and control subjects answered the same questionnaire 2 weeks later to verify the test-retest reliability. Internal consistency and test-retest reliability were evaluated. The results of the patients and the control group were compared. The Persian DHI showed good internal consistency (Cronbach's alpha coefficients range from 0.82 to 0.94). Also, good test-retest reliability was found for the total scores of the Persian DHI (r=0.89). There was a significant difference between the DHI scores of the control group and those of the oropharyngeal dysphagia group (P‹0.001). The Persian version of the DHI achieved Face and translation validity. This study demonstrated that the Persian DHI is a valid tool for self-assessment of the handicapping effects of dysphagia on the physical, functional, and emotional aspects of patient life and can be a useful tool for screening and treatment planning for the Persian-speaking dysphagic patients, regardless of the cause or the severity of the dysphagia.
Assessing the Psychometric Properties of Two Food Addiction Scales

PubMed Central

Lemeshow, Adina; Gearhardt, Ashley; Genkinger, Jeanine; Corbin, William R.

2016-01-01

Background While food addiction is well accepted in popular culture and mainstream media, its scientific validity as an addictive behavior is still under investigation. This study evaluated the reliability and validity of the Yale Food Addiction Scale and Modified Yale Food Addiction Scale using data from two community-based convenience samples. Methods We assessed the internal and test-retest reliability of the Yale Food Addiction Scale and Modified Yale Food Addiction Scale, and estimated the sensitivity and negative predictive value of the Modified Yale Food Addiction Scale using the Yale Food Addiction Scale as the benchmark. We calculated Cronbach’s alphas and 95% confidence intervals (CIs) for internal reliability and Cohen’s Kappa coefficients and 95% CIs for test-retest reliability. Results Internal consistency (n=232) was marginal to good, ranging from α=0.63 to 0.84. The test-retest reliability (n=45) for food addiction diagnosis was substantial, with Kappa=0.73 (95% CI, 0.48–0.88) (Yale Food Addiction Scale) and 0.79 (95% CI, 0.66–1.00) (Modified Yale Food Addiction Scale). Sensitivity and negative predictive value for classifying food addiction status were excellent: compared to the Yale Food Addiction Scale, the Modified Yale Food Addiction Scale’s sensitivity was 92.3% (95% CI, 64%–99.8%), and the negative predictive value was 99.5% (95% CI, 97.5%–100%). Conclusions Our analyses suggest that the Modified Yale Food Addiction Scale may be an appropriate substitute for the Yale Food Addiction Scale when a brief measure is needed, and support the continued use of both scales to investigate food addiction. PMID:27623221
Repeatability of a cold stress test to assess cold sensitization.

PubMed

House, C M; Taylor, R J; Oakley, E H N

2015-10-01

Non-freezing cold injury (NFCI) is a syndrome in which damage to peripheral tissues occurs without the tissues freezing following exposure to low ambient temperatures. To assess the test-retest reliability of a cold stress test (CST) used to assess cold sensitization. Volunteers with no self-reported history of NFCI undertook the CST on three occasions. Thermal images were taken of the foot and hand before, immediately after and 5min after immersion of the limb in cold water for 2min. Cold sensitization was graded by the two clinicians and the lead author. Spot temperatures from the toe and finger pads were recorded. There were 30 white and 19 black male participants. The ratings indicated substantial agreement [a Cohen's kappa (κ) value of 0.61-0.8] to within ± one grading category for the hands and feet of the white volunteers and the hands of the black volunteers. Limits of agreement (LoA) analysis for toe and finger pad temperatures indicated high agreement (absolute 95% LoA < 5.5°C). Test-retest reliability for the feet of the black volunteers was not supported by the gradings (κ = 0.38) and toe pad temperatures (absolute 95% LoA = 9.5°C and coefficient of variation = 11%). The test-retest reliability of the CST is considered adequate for the assessment of the cold sensitization of the hands and feet of white and the hands of black healthy non-patients. The study should be repeated with patients who have suffered a NFCI. © Crown copyright 2015.
The Physical Activity Scale for Individuals with Physical Disabilities: test-retest reliability and comparison with an accelerometer.

PubMed

van der Ploeg, Hidde P; Streppel, Kitty R M; van der Beek, Allard J; van der Woude, Luc H V; Vollenbroek-Hutten, Miriam; van Mechelen, Willem

2007-01-01

The objective was to determine the test-retest reliability and criterion validity of the Physical Activity Scale for Individuals with Physical Disabilities (PASIPD). Forty-five non-wheelchair dependent subjects were recruited from three Dutch rehabilitation centers. Subjects' diagnoses were: stroke, spinal cord injury, whiplash, and neurological-, orthopedic- or back disorders. The PASIPD is a 7-d recall physical activity questionnaire that was completed twice, 1 wk apart. During this week, physical activity was also measured with an Actigraph accelerometer. The test-retest reliability Spearman correlation of the PASIPD was 0.77. The criterion validity Spearman correlation was 0.30 when compared to the accelerometer. The PASIPD had test-retest reliability and criterion validity that is comparable to well established self-report physical activity questionnaires from the general population.
Measuring participation as defined by the World Health Organization in the International Classification of Functioning, Disability and Health. Psychometric properties of the Ghent Participation Scale.

PubMed

Van de Velde, Dominique; Coorevits, Pascal; Sabbe, Lode; De Baets, Stijn; Bracke, Piet; Van Hove, Geert; Josephsson, Staffan; Ilsbroukx, Stephan; Vanderstraeten, Guy

2017-03-01

To examine the internal consistency, test-retest reliability, construct validity, discriminant validity and responsiveness of the Ghent Participation Scale. Cross-sectional study with a test-retest sample. Six outpatient rehabilitation centres in Belgium. A total of 365 outpatients from eight diagnostic groups. The Ghent Participation Scale, the Impact on Participation and Autonomy, the Utrecht Scale for Evaluation of Rehabilitation-Participation and the Medical outcome study Short Form SF-36. The Ghent Participation Scale was found to have good internal consistency (Cronbach's α between 0.75 and 0.83). At item level, the test-retest reliability was good; weighted kappas ranged between 0.57 and 0.88. On the dimension level intraclass correlation coefficients ranged between 0.80 and 0.90. Evidence for construct validity came from high correlations between the subscales of the Ghent Participation Scale and four subscales of the Impact on Participation and Autonomy (range, r = -0.71 to -0.87) and two subscales of the Utrecht Scale for Evaluation of Rehabilitation-Participation (range, r = 0.54 to 0.72). Standardized response mean ranged between 0.23 and 0.68 and the area under the curve ranged between 68% and 88%. The Ghent Participation Scale appears to be a valid and reliable method of assessing participation irrespective of the respondent's health condition. The Ghent Participation Scale is responsive and is able to detect changes over time.
Translation and Validation of the Arabic Version of the Fear-Avoidance Beliefs Questionnaire in Patients With Low Back Pain.

PubMed

Alanazi, Fahad; Gleeson, Peggy; Olson, Sharon; Roddey, Toni

2017-04-01

Prospective cohort study of a cross-cultural low back pain (LBP) questionnaire OBJECTIVE.: The objectives of the present study were to translate and cross-culturally adapt the Fear-Avoidance Beliefs Questionnaire (FABQ) to create a version in Arabic and to test its psychometric properties. The FABQ measures the effects that fear and avoidance beliefs have on work and on physical activity. An FABQ cross-culturally adapted for Arabic readers and speakers was created by forward translation, translation synthesis, and backward translation. Forty patients in Riyadh, Saudi Arabia, with LBP evaluated use of the questionnaire, and 70 patients from the same hospital participated in reliability, validity, and sensitivity studies. To determine test-retest reliability of the Arabic FABQ, patients completed it twice within 48 hours without receiving any active treatment between these two sessions. Patients completed the Arabic FABQ (and three other scales) at baseline and 14 days later to determine its validity and sensitivity. Test-retest reliability was good (FABQ-work: intraclass coefficient [ICC] = 0.74; FABQ-physical activity: ICC = 0.90; FABQ overall: ICC = 0.76). Correlations between the FABQ and three other instruments for measuring pain and disability were weak. The strongest correlation was found at the follow-up session with the Arabic Oswestry Questionnaire (r = 0.283; P ≤ 0.05). Sensitivity to change was low. The translation and adaptation of the Arabic version of the FABQ was successful. Overall, the Arabic FABQ had good test-retest reliability, acceptable construct validity, and low sensitivity to change. The Arabic version of the FABQ shows promise in the assessment of fear-avoidance beliefs among patients with LBP who speak and read Arabic. 3.
Validity and reliability of the South African health promoting schools monitoring questionnaire

PubMed Central

Struthers, Patricia; de Koker, Petra; Lerebo, Wondwossen; Blignaut, Renette J.

2017-01-01

Summary Health promoting schools, as conceptualised by the World Health Organisation, have been developed in many countries to facilitate the health-education link. In 1994, the concept of health promoting schools was introduced in South Africa. In the process of becoming a health promoting school, it is important for schools to monitor and evaluate changes and developments taking place. The Health Promoting Schools (HPS) Monitoring Questionnaire was developed to obtain opinions of students about their school as a health promoting school. It comprises 138 questions in seven sections: socio-demographic information; General health promotion programmes; health related Skills and knowledge; Policies; Environment; Community-school links; and support Services. This paper reports on the reliability and face validity of the HPS Monitoring Questionnaire. Seven experts reviewed the questionnaire and agreed that it has satisfactory face validity. A test-retest reliability study was conducted with 83 students in three high schools in Cape Town, South Africa. The kappa-coefficients demonstrate mostly fair (κ-scores between 0.21 and 0.4) to moderate (κ-scores between 0.41 and 0.6) agreement between test-retest General and Environment items; poor (κ-scores up to 0.2) agreement between Skills and Community test-retest items, fair agreement between Policies items, and for most of the questions focussing on Services a fair agreement was found. The study is a first effort at providing a tool that may be used to monitor and evaluate students’ opinions about changes in health promoting schools. Although the HPS Monitoring Questionnaire has face validity, the results of the reliability testing were inconclusive. Further research is warranted. PMID:27694227
Assessment of the psychometric properties of the Spanish language version of questionnaire ICIQ-Male Lower Urinary Tract Symptoms (ICIQ-MLUTS).

PubMed

Castro-Díaz, D M; Esteban-Fuertes, M; Salinas-Casado, J; Bustamante-Alarma, S; Gago-Ramos, J L; Galacho-Bech, A; García-Matres, M J; Rodríguez-Toves, L A; Zubiaur-Líbano, C; Collado-Serra, A; Batista-Miranda, J E; Ortiz-Gámiz, A

2014-03-01

To evaluate the psychometric properties of the Spanish version of the ICIQ-Male Lower Urinary Tract Symptoms Questionnaire (ICIQ-MLUTS): Feasibility (% of completion and ceiling/ground effects), reliability (Test-retest), convergent validity (vs Bladder Control Self-Assessment Questionnaire [BSAQ] and vs International Prostate Symptom Score [I-PSS]) and criterion validity (according to presence or absence of symptoms). This was an observational, non-interventionist and multicenter study. 223 male patients with lower urinary tract symptoms (LUTS), predominantly storage symptoms and aged 18-65, took part in the study. Patients completed the ICIQ-MLUTS (test), I-PSS and BSAQ questionnaires and referred their urinary symptoms in a single visit, with the exception of a subgroup composed by 49 patients that completed the questionnaire again 15 days after initial visit to evaluate test-retest reliability. The questionnaire includes 13 items divided in 2 sub-scales: Voiding symptoms (V) from 0-20 and Incontinence symptoms (I) from 0-24. Percentage of patients that completed all items: 98.84%. Ground effect is 0 and ceiling effect was under 6% in both sub-scales. Test-retest reliability: Intraclass correlation coefficient (ICC) ranged from 0.68 to 0.88, except on Delay. Kappa shows a good agreement, between 0.60 and 0.81, except for Nocturia. Convergent validity: Correlation (Spearman) between the questionnaire sub-scales scores and the rest of measures is statistically significant (P < .01 and P < .05). Criterion validity: Statistically significant differences (P < .05) between scores on ICIQ-MLUTS, from patients that refer experiencing symptoms and those who do not. The Spanish version of the ICIQ-MLUTS questionnaire shows adequate feasibility, reliability and validity. Copyright © 2013 AEU. Published by Elsevier Espana. All rights reserved.
Validity and reliability of the South African health promoting schools monitoring questionnaire.

PubMed

Struthers, Patricia; Wegner, Lisa; de Koker, Petra; Lerebo, Wondwossen; Blignaut, Renette J

2017-04-01

Health promoting schools, as conceptualised by the World Health Organisation, have been developed in many countries to facilitate the health-education link. In 1994, the concept of health promoting schools was introduced in South Africa. In the process of becoming a health promoting school, it is important for schools to monitor and evaluate changes and developments taking place. The Health Promoting Schools (HPS) Monitoring Questionnaire was developed to obtain opinions of students about their school as a health promoting school. It comprises 138 questions in seven sections: socio-demographic information; General health promotion programmes; health related Skills and knowledge; Policies; Environment; Community-school links; and support Services. This paper reports on the reliability and face validity of the HPS Monitoring Questionnaire. Seven experts reviewed the questionnaire and agreed that it has satisfactory face validity. A test-retest reliability study was conducted with 83 students in three high schools in Cape Town, South Africa. The kappa-coefficients demonstrate mostly fair (κ-scores between 0.21 and 0.4) to moderate (κ-scores between 0.41 and 0.6) agreement between test-retest General and Environment items; poor (κ-scores up to 0.2) agreement between Skills and Community test-retest items, fair agreement between Policies items, and for most of the questions focussing on Services a fair agreement was found. The study is a first effort at providing a tool that may be used to monitor and evaluate students' opinions about changes in health promoting schools. Although the HPS Monitoring Questionnaire has face validity, the results of the reliability testing were inconclusive. Further research is warranted. © The Author 2016. Published by Oxford University Press.
Validation of the Persian version of the Daily Spiritual Experiences Scale (DSES) in Pregnant Women: A Proper Tool to Assess Spirituality Related to Mental Health.

PubMed

Saffari, Mohsen; Amini, Hossein; Sheykh-Oliya, Zarindokht; Pakpour, Amir H; Koenig, Harold G

2017-12-01

Assessing spirituality in healthy pregnant women may lead to supportive interventions that will improve their care. A psychometrically valid measure such as the Daily Spiritual Experiences Scale (DSES) may be helpful in this regard. The current study sought to adapt a Persian version of DSES for use in pregnancy. A total of 377 pregnant women were recruited from three general hospitals located in Tehran, Iran. Administered scales were the DSES, Duke University Religion Index, Santa Clara Strength of Religious Faith scale, and Depression Anxiety Stress Scale, as well as demographic measures. Reliability of the DSES was tested using Cronbach's alpha for internal consistency and the intraclass correlation coefficient (ICC) for test-retest stability. Scale validity was assessed by criterion-related tests, known-groups comparison, and exploratory factor analysis. Participant's mean age was 27.7 (4.1), and most were nulliparous (70%). The correlation coefficient between individual items on the scale and the total score was greater than 0.30 in most cases. Cronbach's alpha for the scale was 0.90. The ICC for 2-week test-retest reliability was high (0.86). Relationships between similar and dissimilar scales indicated acceptable convergent and divergent validity. The factor structure of the scale indicated a single factor that explained 59% of the variance. The DSES was found to be a reliable and valid measure of spirituality in pregnant Iranian women. This scale may be used to examine the relationship between spirituality and health outcomes, research that may lead to supportive interventions in this population.
[Interpreting change scores of the Behavioural Rating Scale for Geriatric Inpatients (GIP)].

PubMed

Diesfeldt, H F A

2013-09-01

The Behavioural Rating Scale for Geriatric Inpatients (GIP) consists of fourteen, Rasch modelled subscales, each measuring different aspects of behavioural, cognitive and affective disturbances in elderly patients. Four additional measures are derived from the GIP: care dependency, apathy, cognition and affect. The objective of the study was to determine the reproducibility of the 18 measures. A convenience sample of 56 patients in psychogeriatric day care was assessed twice by the same observer (a professional caregiver). The median time interval between rating occasions was 45 days (interquartile range 34-58 days). Reproducibility was determined by calculating intraclass correlation coefficients (ICC agreement) for test-retest reliability. The minimal detectable difference (MDD) was calculated based on the standard error of measurement (SEM agreement). Test-retest reliability expressed by the ICCs varied from 0.57 (incoherent behaviour) to 0.93 (anxious behaviour). Standard errors of measurement varied from 0.28 (anxious behaviour) to 1.63 (care dependency). The results show how the GIP can be applied when interpreting individual change in psychogeriatric day care participants.
Chinese version of the Perceived Stress Scale-10: A psychometric study in Chinese university students.

PubMed

Lu, Wei; Bian, Qian; Wang, Wenzheng; Wu, Xiaoling; Wang, Zhen; Zhao, Min

2017-01-01

Chinese university students often suffer from acute stress, which can affect their mental health. We measured and evaluated perceived stress in this population using the Simplified Chinese version of the 10-item Perceived Stress Scale (SCPSS-10). The SCPSS-10, Patient Health Questionnaire (PHQ), and Generalized Anxiety Disorder 7-item scale (GAD-7) were conducted in 1096 university students. Two weeks later, 129 participants were re-tested using the SCPSS-10. Exploratory factor analysis yielded two factors with Eigen values of 4.76 and 1.48, accounting for 62.41% of the variance. Confirmatory factor analysis demonstrated good fit of this two-factor model. The internal consistency reliability, as measured by Cronbach's α, was 0.85. The test-retest reliability coefficient was 0.7. The SCPSS-10 exhibited high correlation with the PHQ-9 and GAD-7, indicating an acceptable concurrent validity. The SCPSS-10 exhibited satisfactory psychometric properties in Chinese university students.
[Translation and Development of the Chinese-Version Patient Privacy Scale].

PubMed

Chen, Li; Feng, Xian-Qiong; Yang, Xiao-Li; Li, Luo-Hong

2017-06-01

The unauthorized releasing of confidential patient information is a serious problem worldwide. Nurses, the healthcare professionals who are in most frequent contact with patients, have access to a significant amount of confidential patient information and play a key role in protecting patient privacy. However, currently, there is no proper tool to measure the level to which clinical nurses protect the privacy of their patients in China. To translate the patient privacy scale (PPS) into Chinese and to test the reliability and validity of this Chinese version. The original scale was developed by Özturk, Bahcecik, and Özçelik (2014) to identify whether nurses protect or violate patient privacy in the workplace. This study used the "back translation" method to translate the scale. A total of 616 nurses in two tertiary hospitals in the Western region of China were enrolled to test the internal consistency, test-retest reliability, and construct validity of the translated scale. The Cronbach's coefficients of the total scale and its 5 factors ranged from .84 to .94; the split half reliability was .91; the test-retest reliability was .82; and the content validity index was .95. Explanatory factor analysis revealed that the 5 factors explained 64.98% of the total variance. The Chinese version of the PPS is reliable and valid, and may be used to reliably assess the behaviors of nurses with regard to protecting the privacy of their patients. The scale may also be used to evaluate the effects of training on patient privacy protection.
[Cultural adaptation and validation of the Medical Outcomes Study Social Support Survey questionnaire (MOS-SSS)].

PubMed

Alonso Fachado, A; Montes Martinez, A; Menendez Villalva, C; Pereira, M Graça

2007-01-01

The aim of this study was the assesment of psychometric properties of the Portuguese version of the instrument "Medical Outcomes Study - Social Support Survey (MOSSSS)". This questionnaire has been translated and adapted in a Portuguese sample of 101 patients with chronic illness of a rural health centre in Portugal. The average age of patients was 63.4 years, 56.4% female. 29% were illiterate and 2% had completed high school. 78% had arterial hypertension and the 56.4% had diabetes mellitus type 2. The internal consistency was evaluated using Cronbach's alpha. Exploratory and Confirmatory factor analysis were performed in order to confirm reliability and validity of the scale and its multidimensional characteristics. The 2-week test-retest reliability was estimated using weighted kappa for the ordinals variables and intraclass coefficient correlation for the quantitative variables. Cronbach's alphas for the subscales ranged from 0.873 to 0.967 at test, and 0.862 to 0.972 at retest. Exploratory factor analysis revealed the existence of four factors (emotional, tangible, positive interaction and affection support) that explain the 72.71% of the variance. Confirmatory factor analysis supported the existence of four factors that allowed the application of the scale with original items. The goodness-of-fit measures corroborate the initial structure, with chi2/ df=2.01, GFI=0.998, CFI=0.999, AGFI=0.998, TLI=0.999, NFI=0.998, SRMR=0.332, RMSEA=0.76. The 2-weeks test-retest reliability of the Portuguese MOS-SSS as measured by the intraclass correlation coefficient was ranged from 0.941 to 0.966 for the four dimensions and the overall support index. The weighted kappa was ranged from 0.67 to 0.87 for all the items. The MOS-SSS Portuguese version demonstrates good psychometric properties and seems to be useful to measure multidimensional aspects of social support in the Portuguese population.
Measuring leprosy-related stigma - a pilot study to validate a toolkit of instruments.

PubMed

Rensen, Carin; Bandyopadhyay, Sudhakar; Gopal, Pala K; Van Brakel, Wim H

2011-01-01

Stigma negatively affects the quality of life of leprosy-affected people. Instruments are needed to assess levels of stigma and to monitor and evaluate stigma reduction interventions. We conducted a validation study of such instruments in Tamil Nadu and West Bengal, India. Four instruments were tested in a 'Community Based Rehabilitation' (CBR) setting, the Participation Scale, Internalised Scale of Mental Illness (ISMI) adapted for leprosy-affected persons, Explanatory Model Interview Catalogue (EMIC) for leprosy-affected and non-affected persons and the General Self-Efficacy (GSE) Scale. We evaluated the following components of validity, construct validity, internal consistency, test-retest reproducibility and reliability to distinguish between groups. Construct validity was tested by correlating instrument scores and by triangulating quantitative and qualitative findings. Reliability was evaluated by comparing levels of stigma among people affected by leprosy and community controls, and among affected people living in CBR project areas and those in non-CBR areas. For the Participation, ISMI and EMIC scores significant differences were observed between those affected by leprosy and those not affected (p = 0.0001), and between affected persons in the CBR and Control group (p < 0.05). The internal consistency of the instruments measured with Cronbach's α ranged from 0.83 to 0.96 and was very good for all instruments. Test-retest reproducibility coefficients were 0.80 for the Participation score, 0.70 for the EMIC score, 0.62 for the ISMI score and 0.50 for the GSE score. The construct validity of all instruments was confirmed. The Participation and EMIC Scales met all validity criteria, but test-retest reproducibility of the ISMI and GSE Scales needs further evaluation with a shorter test-retest interval and longer training and additional adaptations for the latter.
A critical analysis of test-retest reliability in instrument validation studies of cancer patients under palliative care: a systematic review

PubMed Central

2014-01-01

Background Patient-reported outcome validation needs to achieve validity and reliability standards. Among reliability analysis parameters, test-retest reliability is an important psychometric property. Retested patients must be in a clinically stable condition. This is particularly problematic in palliative care (PC) settings because advanced cancer patients are prone to a faster rate of clinical deterioration. The aim of this study was to evaluate the methods by which multi-symptom and health-related qualities of life (HRQoL) based on patient-reported outcomes (PROs) have been validated in oncological PC settings with regards to test-retest reliability. Methods A systematic search of PubMed (1966 to June 2013), EMBASE (1980 to June 2013), PsychInfo (1806 to June 2013), CINAHL (1980 to June 2013), and SCIELO (1998 to June 2013), and specific PRO databases was performed. Studies were included if they described a set of validation studies. Studies were included if they described a set of validation studies for an instrument developed to measure multi-symptom or multidimensional HRQoL in advanced cancer patients under PC. The COSMIN checklist was used to rate the methodological quality of the study designs. Results We identified 89 validation studies from 746 potentially relevant articles. From those 89 articles, 31 measured test-retest reliability and were included in this review. Upon critical analysis of the overall quality of the criteria used to determine the test-retest reliability, 6 (19.4%), 17 (54.8%), and 8 (25.8%) of these articles were rated as good, fair, or poor, respectively, and no article was classified as excellent. Multi-symptom instruments were retested over a shortened interval when compared to the HRQoL instruments (median values 24 hours and 168 hours, respectively; p = 0.001). Validation studies that included objective confirmation of clinical stability in their design yielded better results for the test-retest analysis with regard to both pain and global HRQoL scores (p < 0.05). The quality of the statistical analysis and its description were of great concern. Conclusion Test-retest reliability has been infrequently and poorly evaluated. The confirmation of clinical stability was an important factor in our analysis, and we suggest that special attention be focused on clinical stability when designing a PRO validation study that includes advanced cancer patients under PC. PMID:24447633

Home Lighting Assessment for Clients With Low Vision

PubMed Central

Bhorade, Anjali; Gordon, Mae; Hollingsworth, Holly; Engsberg, Jack E.; Baum, M. Carolyn

2013-01-01

OBJECTIVE. The goal was to develop an objective, comprehensive, near-task home lighting assessment for older adults with low vision. METHOD. A home lighting assessment was developed and tested with older adults with low vision. Interrater and test–retest reliability studies were conducted. Clinical utility was assessed by occupational therapists with expertise in low vision rehabilitation. RESULTS. Interrater reliability was high (intraclass correlation coefficient [ICC] = .83–1.0). Test–retest reliability was moderate (ICC = .67). Responses to a Clinical Utility Feedback Form developed for this study indicated that the Home Environment Lighting Assessment (HELA) has strong clinical utility. CONCLUSION. The HELA provides a structured tool to describe the quantitative and qualitative aspects of home lighting environments where near tasks are performed and can be used to plan lighting interventions. The HELA has the potential to affect assessment and intervention practices of rehabilitation professionals in the area of low vision and improve near-task performance of people with low vision. PMID:24195901
Reliability, Validity, and Ability to Identify Fall Status of the Balance Evaluation Systems Test, Mini-Balance Evaluation Systems Test, and Brief-Balance Evaluation Systems Test in Older People Living in the Community.

PubMed

Marques, Alda; Almeida, Sara; Carvalho, Joana; Cruz, Joana; Oliveira, Ana; Jácome, Cristina

2016-12-01

To assess the reliability, validity, and ability to identify fall status of the Balance Evaluation Systems Test (BESTest), Mini-BESTest, and Brief-BESTest, compared with the Berg Balance Scale (BBS), in older people living in the community. Cross-sectional. Community centers. Older adults (N=122; mean age ± SD, 76±9y). Not applicable. Participants reported on falls history in the preceding year and completed the Activities-Specific Balance Confidence (ABC) Scale. The BBS, BESTest, and the Five Times Sit-To-Stand Test were administered. Interrater (2 physiotherapists) and test-retest relative (48-72h) and absolute reliabilities were explored with the intraclass correlation coefficient (ICC) equation (2,1) and the Bland and Altman method. Minimal detectable changes at the 95% confidence level (MDC 95 ) were established. Validity was assessed by correlating the balance tests with each other and with the ABC Scale (Spearman correlation coefficients-ρ). Receiver operating characteristics assessed the ability of each balance test to differentiate between people with and without a history of falls. All balance tests presented good to excellent interrater (ICC=.71-.93) and test-retest (ICC=.50-.82) relative reliability, with no evidence of bias. MDC 95 values were 4.6, 9, 3.8, and 4.1 points for the BBS, BESTest, Mini-BESTest, and Brief-BESTest, respectively. All tests were significantly correlated with each other (ρ=.83-.96) and with the ABC Scale (ρ=.46-.61). Acceptable ability to identify fall status (areas under the curve, .71-.78) was found for all tests. Cutoff points were 48.5, 82, 19.5, and 12.5 points for the BBS, BESTest, Mini-BESTest, and Brief-BESTest, respectively. All balance tests are reliable, valid, and able to identify fall status in older people living in the community. Therefore, the choice of which test to use will depend on the level of balance impairment, purpose, and time availability. Copyright Â© 2016. Published by Elsevier Inc.
A reliable and valid questionnaire was developed to measure computer vision syndrome at the workplace.

PubMed

Seguí, María del Mar; Cabrero-García, Julio; Crespo, Ana; Verdú, José; Ronda, Elena

2015-06-01

To design and validate a questionnaire to measure visual symptoms related to exposure to computers in the workplace. Our computer vision syndrome questionnaire (CVS-Q) was based on a literature review and validated through discussion with experts and performance of a pretest, pilot test, and retest. Content validity was evaluated by occupational health, optometry, and ophthalmology experts. Rasch analysis was used in the psychometric evaluation of the questionnaire. Criterion validity was determined by calculating the sensitivity and specificity, receiver operator characteristic curve, and cutoff point. Test-retest repeatability was tested using the intraclass correlation coefficient (ICC) and concordance by Cohen's kappa (κ). The CVS-Q was developed with wide consensus among experts and was well accepted by the target group. It assesses the frequency and intensity of 16 symptoms using a single rating scale (symptom severity) that fits the Rasch rating scale model well. The questionnaire has sensitivity and specificity over 70% and achieved good test-retest repeatability both for the scores obtained [ICC = 0.802; 95% confidence interval (CI): 0.673, 0.884] and CVS classification (κ = 0.612; 95% CI: 0.384, 0.839). The CVS-Q has acceptable psychometric properties, making it a valid and reliable tool to control the visual health of computer workers, and can potentially be used in clinical trials and outcome research. Copyright © 2015 Elsevier Inc. All rights reserved.
Inter-Rater and Test-Retest Reliability of the Beery VMI in Schoolchildren

PubMed Central

Harvey, Erin M.; Leonard-Green, Tina K.; Mohan, Kathleen M.; Kulp, Marjean Taylor; Davis, Amy L.; Miller, Joseph M.; Twelker, J. Daniel; Campus, Irene; Dennis, Leslie K.

2017-01-01

Purpose To assess inter-rater and test-retest reliability of the 6th Edition Beery-Buktenica Developmental Test of Visual-Motor Integration (VMI) and test-retest reliability of the VMI Visual Perception Supplemental Test (VMIp) in school-age children. Methods Subjects were 163 Native American 3rd – 8th grade students with no significant refractive error (astigmatism < 1.00 D, myopia: < 0.75 D, hyperopia: < 2.50 D, anisometropia < 1.50 D) or ocular abnormalities. The VMI and VMIp were administered twice, on separate days. All VMI tests were scored by two trained scorers and a subset of 50 tests were also scored by an experienced scorer. Scorers strictly applied objective scoring criteria. Analyses included inter-rater and test-retest assessments of bias, 95% limits of agreement, and intraclass correlation analysis. Results Trained scorers had no significant scoring bias compared to the experienced scorer. One of the two trained scorers tended to provide higher scores than the other (mean difference in standardized scores = 1.54). Inter-rater correlations were strong (0.75 to 0.88). VMI and VMIp test-retest comparisons indicated no significant bias (subjects did not tend to score better on retest). Test-retest correlations were moderate (0.54 to 0.58). The 95% LOAs for the VMI were −24.14 to 24.67 (scorer 1) and −26.06 to 26.58 (scorer 2) and the 95% LOAs for the VMIp were −27.11 to 27.34. Conclusions The 95% LOA for test-retest differences will be useful for determining if the VMI and VMIp have sufficient sensitivity for detecting change with treatment in both clinical and research settings. Further research on test-retest reliability reporting 95% LOAs for children across different age ranges are recommended, particularly if the test is to be used to detect changes due to intervention or treatment. PMID:28422801
Cross-cultural adaptation and psychometric evaluations of the Turkish version of Parkinson Fatigue Scale.

PubMed

Ozturk, Erhan Arif; Kocer, Bilge Gonenli; Umay, Ebru; Cakci, Aytul

2018-06-07

The objectives of the present study were to translate and cross-culturally adapt the English version of the Parkinson Fatigue Scale into Turkish, to evaluate its psychometric properties, and to compare them with that of other language versions. A total of 144 patients with idiopathic Parkinson disease were included in the study. The Turkish version of Parkinson Fatigue Scale was evaluated for data quality, scaling assumptions, acceptability, reliability, and validity. The questionnaire response rate was 100% for both test and retest. The percentage of missing data was zero for items, and the percentage of computable scores was full. Floor and ceiling effects were absent. The Parkinson Fatigue Scale provides an acceptable internal consistency (Cronbach's alpha was 0.974 for 1st test and 0.964 for a retest, and corrected item-to-total correlations were ranged from 0.715 to 0.906) and test-retest reliability (Cohen's kappa coefficients were ranged from 0.632 to 0.786 for individuals items, and intraclass correlation coefficient was 0.887 for the overall Parkinson Fatigue Scale Score). An exploratory factor analysis of the items revealed a single factor explaining 71.7% of variance. The goodness-of-fit statistics for the one-factorial confirmatory factor analysis were Tucker Lewis index = 0.961, comparative fit index = 0.971 and root mean square error of approximation = 0.077 for a single factor. The average Parkinson Fatigue Scale Score was correlated significantly with sociodemographic data, clinical characteristics and scores of rating scales. The Turkish version of the Parkinson Fatigue Scale seems to be culturally well adapted and have good psychometric properties. The scale can be used in further studies to assess the fatigue in patients with Parkinson's disease.
Translation, Cross-Cultural Adaptation, and Validation of the Activity Rating Scale for Disorders of the Knee

PubMed Central

Flosadottir, Vala; Roos, Ewa M.; Ageberg, Eva

2017-01-01

Background: The Activity Rating Scale (ARS) for disorders of the knee evaluates the level of activity by the frequency of participation in 4 separate activities with high demands on knee function, with a score ranging from 0 (none) to 16 (pivoting activities 4 times/wk). Purpose: To translate and cross-culturally adapt the ARS into Swedish and to assess measurement properties of the Swedish version of the ARS. Study Design: Cohort study (diagnosis); Level of evidence, 2. Methods: The COSMIN guidelines were followed. Participants (N = 100 [55 women]; mean age, 27 years) who were undergoing rehabilitation for a knee injury completed the ARS twice for test-retest reliability. The Knee injury and Osteoarthritis Outcome Score (KOOS), Tegner Activity Scale (TAS), and modernized Saltin-Grimby Physical Activity Level Scale (SGPALS) were administered at baseline to validate the ARS. Construct validity and responsiveness of the ARS were evaluated by testing predefined hypotheses regarding correlations between the ARS, KOOS, TAS, and SGPALS. The Cronbach alpha, intraclass correlation coefficients, absolute reliability, standard error of measurement, smallest detectable change, and Spearman rank-order correlation coefficients were calculated. Results: The ARS showed good internal consistency (α ≈ 0.96), good test-retest reliability (intraclass correlation coefficient >0.9), and no systematic bias between measurements. The standard error of measurement was less than 2 points, and the smallest detectable change was less than 1 point at the group level and less than 5 points at the individual level. More than 75% of the hypotheses were confirmed, indicating good construct validity and good responsiveness of the ARS. Conclusion: The Swedish version of the ARS is valid, reliable, and responsive for evaluating the level of activity based on the frequency of participation in high-demand knee sports activities in young adults with a knee injury. PMID:28979920
Evaluating the use of in-store measures in retail food stores and restaurants in Brazil.

PubMed

Duran, Ana Clara; Lock, Karen; Latorre, Maria do Rosario D O; Jaime, Patricia Constante

2015-01-01

To assess inter-rater reliability, test-retest reliability, and construct validity of retail food store, open-air food market, and restaurant observation tools adapted to the Brazilian urban context. This study is part of a cross-sectional observation survey conducted in 13 districts across the city of Sao Paulo, Brazil in 2010-2011. Food store and restaurant observational tools were developed based on previously available tools, and then tested it. They included measures on the availability, variety, quality, pricing, and promotion of fruits and vegetables and ultra-processed foods. We used Kappa statistics and intra-class correlation coefficients to assess inter-rater and test-retest reliabilities in samples of 142 restaurants, 97 retail food stores (including open-air food markets), and of 62 restaurants and 45 retail food stores (including open-air food markets), respectively. Construct validity as the tool's abilities to discriminate based on store types and different income contexts were assessed in the entire sample: 305 retail food stores, 8 fruits and vegetable markets, and 472 restaurants. Inter-rater and test-retest reliability were generally high, with most Kappa values greater than 0.70 (range 0.49-1.00). Both tools discriminated between store types and neighborhoods with different median income. Fruits and vegetables were more likely to be found in middle to higher-income neighborhoods, while soda, fruit-flavored drink mixes, cookies, and chips were cheaper and more likely to be found in lower-income neighborhoods. The measures were reliable and able to reveal significant differences across store types and different contexts. Although some items may require revision, results suggest that the tools may be used to reliably measure the food stores and restaurant food environment in urban settings of middle-income countries. Such studies can help .inform health promotion interventions and policies in these contexts.
Evaluating the use of in-store measures in retail food stores and restaurants in Brazil

PubMed Central

Duran, Ana Clara; Lock, Karen; Latorre, Maria do Rosario D O; Jaime, Patricia Constante

2015-01-01

ABSTRACT OBJECTIVE To assess inter-rater reliability, test-retest reliability, and construct validity of retail food store, open-air food market, and restaurant observation tools adapted to the Brazilian urban context. METHODS This study is part of a cross-sectional observation survey conducted in 13 districts across the city of Sao Paulo, Brazil in 2010-2011. Food store and restaurant observational tools were developed based on previously available tools, and then tested it. They included measures on the availability, variety, quality, pricing, and promotion of fruits and vegetables and ultra-processed foods. We used Kappa statistics and intra-class correlation coefficients to assess inter-rater and test-retest reliabilities in samples of 142 restaurants, 97 retail food stores (including open-air food markets), and of 62 restaurants and 45 retail food stores (including open-air food markets), respectively. Construct validity as the tool’s abilities to discriminate based on store types and different income contexts were assessed in the entire sample: 305 retail food stores, 8 fruits and vegetable markets, and 472 restaurants. RESULTS Inter-rater and test-retest reliability were generally high, with most Kappa values greater than 0.70 (range 0.49-1.00). Both tools discriminated between store types and neighborhoods with different median income. Fruits and vegetables were more likely to be found in middle to higher-income neighborhoods, while soda, fruit-flavored drink mixes, cookies, and chips were cheaper and more likely to be found in lower-income neighborhoods. CONCLUSIONS The measures were reliable and able to reveal significant differences across store types and different contexts. Although some items may require revision, results suggest that the tools may be used to reliably measure the food stores and restaurant food environment in urban settings of middle-income countries. Such studies can help .inform health promotion interventions and policies in these contexts. PMID:26538101
Validity and reliability of an instrumented leg-extension machine for measuring isometric muscle strength of the knee extensors.

PubMed

Ruschel, Caroline; Haupenthal, Alessandro; Jacomel, Gabriel Fernandes; Fontana, Heiliane de Brito; Santos, Daniela Pacheco dos; Scoz, Robson Dias; Roesler, Helio

2015-05-20

Isometric muscle strength of knee extensors has been assessed for estimating performance, evaluating progress during physical training, and investigating the relationship between isometric and dynamic/functional performance. To assess the validity and reliability of an adapted leg-extension machine for measuring isometric knee extensor force. Validity (concurrent approach) and reliability (test and test-retest approach) study. University laboratory. 70 healthy men and women aged between 20 and 30 y (39 in the validity study and 31 in the reliability study). Intraclass correlation coefficient (ICC) values calculated for the maximum voluntary isometric torque of knee extensors at 30°, 60°, and 90°, measured with the prototype and with an isokinetic dynamometer (ICC2,1, validity study) and measured with the prototype in test and retest sessions, scheduled from 48 h to 72 h apart (ICC1,1, reliability study). In the validity analysis, the prototype showed good agreement for measurements at 30° (ICC2,1 = .75, SEM = 18.2 Nm) and excellent agreement for measurements at 60° (ICC2,1 = .93, SEM = 9.6 Nm) and at 90° (ICC2,1 = .94, SEM = 8.9 Nm). Regarding the reliability analysis, between-days' ICC1,1 were good to excellent, ranging from .88 to .93. Standard error of measurement and minimal detectable difference based on test-retest ranged from 11.7 Nm to 18.1 Nm and 32.5 Nm to 50.1 Nm, respectively, for the 3 analyzed knee angles. The analysis of validity and repeatability of the prototype for measuring isometric muscle strength has shown to be good or excellent, depending on the knee joint angle analyzed. The new instrument, which presents a relative low cost and easiness of transportation when compared with an isokinetic dynamometer, is valid and provides consistent data concerning isometric strength of knee extensors and, for this reason, can be used for practical, clinical, and research purposes.
Reliability and Validity of a New Method for Isometric Back Extensor Strength Evaluation Using A Hand-Held Dynamometer.

PubMed

Park, Hee-Won; Baek, Sora; Kim, Hong Young; Park, Jung-Gyoo; Kang, Eun Kyoung

2017-10-01

To investigate the reliability and validity of a new method for isometric back extensor strength measurement using a portable dynamometer. A chair equipped with a small portable dynamometer was designed (Power Track II Commander Muscle Tester). A total of 15 men (mean age, 34.8±7.5 years) and 15 women (mean age, 33.1±5.5 years) with no current back problems or previous history of back surgery were recruited. Subjects were asked to push the back of the chair while seated, and their isometric back extensor strength was measured by the portable dynamometer. Test-retest reliability was assessed with intraclass correlation coefficient (ICC). For the validity assessment, isometric back extensor strength of all subjects was measured by a widely used physical performance evaluation instrument, BTE PrimusRS system. The limit of agreement (LoA) from the Bland-Altman plot was evaluated between two methods. The test-retest reliability was excellent (ICC=0.82; 95% confidence interval, 0.65-0.91). The Bland-Altman plots demonstrated acceptable agreement between the two methods: the lower 95% LoA was -63.1 N and the upper 95% LoA was 61.1 N. This study shows that isometric back extensor strength measurement using a portable dynamometer has good reliability and validity.
Test-Retest Reproducibility of the Microperimeter MP3 With Fundus Image Tracking in Healthy Subjects and Patients With Macular Disease.

PubMed

Palkovits, Stefan; Hirnschall, Nino; Georgiev, Stefan; Leisser, Christoph; Findl, Oliver

2018-02-01

To evaluate the test-retest reproducibility of a novel microperimeter with fundus image tracking (MP3, Nidek Co, Japan) in healthy subjects and patients with macular disease. Ten healthy subjects and 20 patients suffering from range of macular diseases were included. After training measurements, two additional microperimetry measurements were scheduled. Test-retest reproducibility was assessed for mean retinal sensitivity, pointwise sensitivity, and deep scotoma size using the coefficient of repeatability and Bland-Altman diagrams. In addition, in a subgroup of patients microperimetry was compared with conventional perimetry. Average differences in mean retinal sensitivity between the two study measurements were 0.26 ± 1.7 dB (median 0 dB; interquartile range [IQR] -1 to 1) for the healthy and 0.36 ± 2.5 dB (median 0 dB; IQR -1 to 2) for the macular patient group. Coefficients of repeatability for mean retinal sensitivity and pointwise retinal sensitivity were 1.2 and 3.3 dB for the healthy subjects and 1.6 and 5.0 dB for the macular disease patients, respectively. Absolute agreement in deep scotoma size between both study days was found in 79.9% of the test loci. The microperimeter MP3 shows an adequate test-retest reproducibility for mean retinal sensitivity, pointwise retinal sensitivity, and deep scotoma size in healthy subjects and patients suffering from macular disease. Furthermore, reproducibility of microperimetry is higher than conventional perimetry. Reproducibility is an important measure for each diagnostic device. Especially in a clinical setting high reproducibility set the basis to achieve reliable results using the specific device. Therefore, assessment of the reproducibility is of eminent importance to interpret the findings of future studies.
Test-Retest Reliability of Computerized, Everyday Memory Measures and Traditional Memory Tests.

ERIC Educational Resources Information Center

Youngjohn, James R.; And Others

Test-retest reliabilities and practice effect magnitudes were considered for nine computer-simulated tasks of everyday cognition and five traditional neuropsychological tests. The nine simulated everyday memory tests were from the Memory Assessment Clinic battery as follows: (1) simple reaction time while driving; (2) divided attention (driving…
Measuring Nurses' Value, Implementation, and Knowledge of Evidence-Based Practice: Further Psychometric Testing of the Quick-EBP-VIK Survey.

PubMed

Connor, Linda; Paul, Fiona; McCabe, Margaret; Ziniel, Sonja

2017-02-01

The Quick-EBP-VIK is a new instrument for measuring nurses' value, implementation, and knowledge of EBP. Psychometric testing was conducted in two parts. Part 1 describes the tool development and validity testing which resulted in the development of a 25-item survey after receiving ≥0.80 Item-Level Content Validity Index for both clarity and relevance. Part 2 describes psychometric testing was necessary to assess additional types of validity and reliability. The purpose of this paper is to further describe the psychometric testing of the Quick-EBP-VIK survey instrument. This descriptive study was designed to assess test-retest reliability, internal consistency and construct validity via a web-based survey. The survey instrument was e-mailed to all nurses at the study hospital. Nurses who responded to the first survey (Wave 1) received another e-mail invitation to complete the survey instrument again (Wave 2) for the purpose of assessing the test-retest reliability of the instrument. A total of 1,177 deliverable e-mails were sent to all nursing staff at one free standing pediatric hospital with Magnet ® designation in the northeast. A total of 382 nurses returned completed surveys, indicating a 32.5% response rate for Wave 1. A total of 131 nurses responded to Wave 2 indicating a response rate of 34.3%. The intraclass correlation coefficients for the items included in the final instrument ranged from 0.43 to 0.80 and were deemed sufficient. These represent a sufficient intraclass correlation coefficient. The Cronbach's Alpha values for each of the three domains are all higher than 0.7 indicating that the items of each of the measurement dimension are internally consistent. However, the composite reliability of the third domain was slightly lower than 0.7 when using Raykov's Rho. The Quick-EBP-VIK instrument has gone through rigorous comprehensive testing and has demonstrated good psychometric properties. © 2016 Sigma Theta Tau International.
The Nutrition Literacy Assessment Instrument is a Valid and Reliable Measure of Nutrition Literacy in Adults with Chronic Disease.

PubMed

Gibbs, Heather D; Ellerbeck, Edward F; Gajewski, Byron; Zhang, Chuanwu; Sullivan, Debra K

2018-03-01

To test the reliability and validity of the Nutrition Literacy Assessment Instrument (NLit) in adult primary care and identify the relationship between nutrition literacy and diet quality. This instrument validation study included a cross-sectional sample participating in up to 2 visits 1 month apart. A total of 429 adults with nutrition-related chronic disease were recruited from clinics and a patient registry affiliated with a Midwestern university medical center. Nutrition literacy was measured by the NLit, which was composed of 6 subscales: nutrition and health, energy sources in food, food label and numeracy, household food measurement, food groups, and consumer skills. Diet quality was measured by Healthy Eating Index-2010 with nutrient data from Diet History Questionnaire II surveys. The researchers measured factor validity and reliability by using binary confirmatory factor analysis; test-retest reliability was measured by Pearson r and the intraclass correlation coefficient, and relationships between nutrition literacy and diet quality were analyzed by linear regression. The NLit demonstrated substantial factor validity and reliability (0.97; confidence interval, 0.96-0.98) and test-retest reliability (0.88; confidence interval, 0.85-0.90). Nutrition literacy was the most significant predictor of diet quality (β = .17; multivariate coefficient = 0.10; P < .001). The NLit is a valid and reliable tool for measuring nutrition literacy in adult primary care patients. Copyright © 2017 Society for Nutrition Education and Behavior. Published by Elsevier Inc. All rights reserved.
Validation of a Persian version of the Fibromyalgia Impact Questionnaire (FIQ-P).

PubMed

Bidari, Ali; Hassanzadeh, Morteza; Mohabat, Mohamad-Farzam; Talachian, Elham; Khoei, Effat Merghati

2014-02-01

The aim of this study is to translate, adapt, and validate a Persian version of the Fibromyalgia (FM) Impact Questionnaire (FIQ-P). The FIQ-P was adapted following the translation and back-translation approach; then, it was administered to thirty females with FM. Participants also completed two other validated questionnaires, the Medical Outcome Survey Short Form-36 (SF-36) and the Beck Depression Inventory (BDI). Internal consistency within the FIQ-P items and its test-retest reliability were assessed with Cronbach's alpha and Spearman's correlation coefficient, respectively. Construct validity was analyzed by Spearman's r when correlating the FIQ-P to other questionnaires. The translated version was concordant. Adaptation affected two sub-items of physical function. Participants' mean age ± standard deviation was 40.4 ± 9.0 years. Internal consistency proved good with α = 0.80. Test-retest coefficient ranged from 0.50 for the item "work days missed" to 0.79 for all FIQ-P items. Fair and statistically significant (P < 0.01) correlations were found between the FIQ-P items and two other questionnaires, SF-36 (r = -0.57) and BDI (r = 0.53). We concluded that the FIQ-P is a valid and reliable instrument for measuring health status of Persian-speaking FM patients.
Comparison of three instruments for measuring patient anxiety in a coronary care unit.

PubMed

Elliott, D

1993-09-01

This paper compares the State-Trait Anxiety Inventory (STAI), Hospital Anxiety and Depression Scale (HAD Scale) and a Linear Analogue Anxiety Scale (LAAS) for evaluating anxiety in patients with acute ischaemic heart disease. The instruments were examined for correlation, reliability and internal consistency. Strong associations were demonstrated at pre-test between the STAI and the other scales. Moderate coefficients between HAD-A and HAD-D/LAAS were also apparent. Lower correlations were found at post-test than at pre-test. At post-test, strong inter-correlations occurred for STAI/LAAS. The HAD Scale demonstrated high test-retest reliability, while the STAI and LAAS were moderate in their reliability in this sample. The adequate correlation between the instruments suggest that each is a valid and appropriate measure of anxiety in this clinical sample.
Measuring limitations in activities of daily living: a population-based validation of a short questionnaire.

PubMed

Elfering, Achim; Cronenberg, Sonja; Grebner, Simone; Tamcan, Oezguer; Müller, Urs

2017-12-01

A newly developed questionnaire assessing limitations in activity of daily living (LADL-Q) that should improve assessment of LADL is tested in a large population-based validation study. This survey was paper-based. Overall, 16,634 individuals who were representative of the working population in the German-speaking part of Switzerland participated in the study. Item analysis was used the final version of the LADL-Q to four items per subscale that correspond to potential problems in three body regions (back and neck, upper extremities, lower extremities). Analysis included tests for reliability, internal consistency, dimensionality and convergent validity. Test-retest reliability coefficients after 2 weeks ranged from 0.82 to 0.99 (Mdn = 0.87), with no item having a coefficient below 0.60. The median item-total coefficients ranged between moderate and good. Correlation coefficients between LADL-Q subscales and three validated clinical instruments (Western Ontario and McMaster Universities osteoarthritis index, shoulder pain disability index, Oswestry) ranged from 0.63 to 0.81. In structural equation modeling the three subscales were significantly related with two important outcomes in occupational rehabilitation: self-reported general health and daily task performance. The new LADL-Q is a brief, reliable and valid tool for assessment of LADL in studies on musculoskeletal health.
Translation and cross-cultural adaptation of the lower extremity functional scale into a Brazilian Portuguese version and validation on patients with knee injuries.

PubMed

Metsavaht, Leonardo; Leporace, Gustavo; Riberto, Marcelo; Sposito, Maria Matilde M; Del Castillo, Letícia N C; Oliveira, Liszt P; Batista, Luiz Alberto

2012-11-01

Clinical measurement. To translate and culturally adapt the Lower Extremity Functional Scale (LEFS) into a Brazilian Portuguese version, and to test the construct and content validity and reliability of this version in patients with knee injuries. There is no Brazilian Portuguese version of an instrument to assess the function of the lower extremity after orthopaedic injury. The translation of the original English version of the LEFS into a Brazilian Portuguese version was accomplished using standard guidelines and tested in 31 patients with knee injuries. Subsequently, 87 patients with a variety of knee disorders completed the Brazilian Portuguese LEFS, the Medical Outcomes Study 36-Item Short-Form Health Survey, the Western Ontario and McMaster Universities Osteoarthritis Index, and the International Knee Documentation Committee Subjective Knee Evaluation Form and a visual analog scale for pain. All patients were retested within 2 days to determine reliability of these measures. Validation was assessed by determining the level of association between the Brazilian Portuguese LEFS and the other outcome measures. Reliability was documented by calculating internal consistency, test-retest reliability, and standard error of measurement. The Brazilian Portuguese LEFS had a high level of association with the physical component of the Medical Outcomes Study 36-Item Short-Form Health Survey (r = 0.82), the Western Ontario and McMaster Universities Osteoarthritis Index (r = 0.87), the International Knee Documentation Committee Subjective Knee Evaluation Form (r = 0.82), and the pain visual analog scale (r = -0.60) (all, P<.05). The Brazilian Portuguese LEFS had a low level of association with the mental component of the Medical Outcomes Study 36-Item Short-Form Health Survey (r = 0.38, P<.05). The internal consistency (Cronbach α = .952) and test-retest reliability (intraclass correlation coefficient = 0.957) of the Brazilian Portuguese version of the LEFS were high. The standard error of measurement was low (3.6) and the agreement was considered high, demonstrated by the small differences between test and retest and the narrow limit of agreement, as observed in Bland-Altman and survival-agreement plots. The translation of the LEFS into a Brazilian Portuguese version was successful in preserving the semantic and measurement properties of the original version and was shown to be valid and reliable in a Brazilian population with knee injuries.
Development and Validation of a Web-Based Survey on the Use of Personal Communication Devices by Hospital Registered Nurses: Pilot Study

PubMed Central

LeVasseur, Sandra A; Li, Dongmei

2013-01-01

Background The use of personal communication devices (such as basic cell phones, enhanced cell phones or smartphones, and tablet computers) in hospital units has risen dramatically in recent years. The use of these devices for personal and professional activities can be beneficial, but also has the potential to negatively affect patient care, as clinicians may become distracted by these devices. Objective No validated questionnaire examining the impact of the use of these devices on patient care exists; thus, we aim to develop and validate an online questionnaire for surveying the views of registered nurses with experience of working in hospitals regarding the impact of the use of personal communication devices on hospital units. Methods A 50-item, four-domain questionnaire on the views of registered nursing staff regarding the impact of personal communication devices on hospital units was developed based on a literature review and interviews with such nurses. A repeated measures pilot study was conducted to examine the psychometrics of a survey questionnaire and the feasibility of conducting a larger study. Psychometric testing of the questionnaire included examining internal consistency reliability and test-retest reliability in a sample of 50 registered nurses. Results The response rate for the repeated measures was 30%. Cronbach coefficient alpha was used to examine the internal consistency and reliability, and in three of the four question groups (utilization, impact, and opinions), the correlation was observed to be very high. This suggests that the questions were measuring a single underlying theme. The Cronbach alpha value for the questions in the performance group, describing the use of personal communication devices while working, was lower than those for the other question groups. These values may be an indication that the assumptions underlying the Cronbach alpha calculation may have been violated for this group of questions. A Spearman rho correlation was used to determine the test-retest reliability. There was a strong test-retest reliability between the two tests for the majority of the questions. The average test-retest percent of agreement for the Likert scale responses was 74% (range 43-100%). Accounting for responses within the 1 SD range on the Likert scale increased the agreement to 96% (range 87-100%). Missing data were in the range of 0 to 7%. Conclusions The psychometrics of the questionnaire showed good to fair levels of internal consistency and test-retest reliability. The pilot study demonstrated that our questionnaire may be useful in exploring registered nurses’ perceptions of the impact of personal electronic devices on hospital units in a larger study. PMID:24280660
Cross-cultural adaptation and validation of the Patient-Rated Tennis Elbow Evaluation Questionnaire on lateral elbow tendinopathy for French-speaking patients.

PubMed

Kaux, Jean-François; Delvaux, François; Schaus, Jean; Demoulin, Christophe; Locquet, Médéa; Buckinx, Fanny; Beaudart, Charlotte; Dardenne, Nadia; Van Beveren, Julien; Croisier, Jean-Louis; Forthomme, Bénédicte; Bruyère, Olivier

Translation and validation of algo-functional questionnaire. The lateral elbow tendinopathy is a common injury in tennis players and physical workers. The Patient-Rated Tennis Elbow Evaluation (PRTEE) Questionnaire was specifically designed to measure pain and functional limitations in patients with lateral epicondylitis (tennis elbow). First developed in English, this questionnaire has since been translated into several languages. The aims of the study were to translate and cross-culturally adapt the PRTEE questionnaire into French and to evaluate the reliability and validity of this translated version of the questionnaire (PRTEE-F). The PRTEE was translated and cross-culturally adapted into French according to international guidelines. To assess the reliability and validity of the PRTEE-F, 115 participants were asked twice to fill in the PRTEE-F, and once the Disabilities of Arm, Shoulder and Hand Questionnaire (DASH) and the Short Form Health Survey (SF-36). Internal consistency (using Cronbach's alpha), test-retest reliability (using intraclass correlation coefficient (ICC), standard error of measurement and minimal detectable change), and convergent and divergent validity (using the Spearman's correlation coefficients respectively with the DASH and with some subscales of the SF-36) were assessed. The PRTEE was translated into French without any problems. PRTEE-F showed a good test-retest reliability for the overall score (ICC 0.86) and for each item (ICC 0.8-0.96) and a high internal consistency (Cronbach's alpha = 0.98). The correlation analyses revealed high correlation coefficients between PRTEE-F and DASH (convergent validity) and, as expected, a low or moderate correlation with the divergent subscales of the SF-36 (discriminant validity). There was no floor or ceiling effect. The PRTEE questionnaire was successfully cross-culturally adapted into French. The PRTEE-F is reliable and valid for evaluating French-speaking patients with lateral elbow tendinopathy. Copyright Â© 2016 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.