reliability intra-class correlation: Topics by Science.gov

Sample records for reliability intra-class correlation

The inter and intra rater reliability of the Netball Movement Screening Tool.

PubMed

Reid, Duncan A; Vanweerd, Rebecca J; Larmer, Peter J; Kingstone, Rachel

2015-05-01

To establish the inter- and intra-rater reliability of the Netball Movement Screening Tool, for screening adolescent female netball players. Inter- and intra-rater reliability study. Forty secondary school netball players were recruited to take part in the study. Twenty subjects were screened simultaneously and independently by two raters to ascertain inter-rater agreement. Twenty subjects were scored by rater one on two occasions, separated by a week, to ascertain intra-rater agreement. Inter and intra-rater agreement was assessed utilising the two-way mixed inter class correlation coefficient and weighted kappa statistics. No significant demographic differences were found between the inter and intra-rater groups of subjects. Inter class correlation coefficients' demonstrated excellent inter-rater (two-way mixed inter class correlation coefficients 0.84, standard error of measurement 0.25) and intra-rater (two-way mixed inter class correlation coefficients 0.96, standard error of measurement 0.13) reliability for the overall Netball Movement Screening Tool score and substantial-excellent (two-way mixed inter class correlation coefficients 1.0-0.65) inter-rater and substantial-excellent intra-rater (two-way mixed inter class correlation coefficients 0.96-0.79) reliability for the component scores of the Netball Movement Screening Tool. Kappa statistic showed substantial to poor inter-rater (k=0.75-0.32) and intra-rater (k=0.77-0.27) agreement for individual tests of the NMST. The Netball Movement Screening Tool may be a reliable screening tool for adolescent netball players; however the individual test scores have low reliability. The screening tool can be administered reliably by raters with similar levels of training in the tool but variable clinical experience. On-going research needs to be undertaken to ascertain whether the Netball Movement Screening Tool is a valid tool in ascertaining increased injury risk for netball players. Copyright © 2014 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Intra- and inter-tester reliability and validity of normal finger size measurement using the Japanese ring gauge system.

PubMed

Suzuki, T; Sato, Y; Sotome, S; Arai, H; Arai, A; Yoshida, H

2017-06-01

This study was designed to investigate the reliability and validity of measurements of finger diameters with a ring gauge. A reliability study enrolled two independent samples (50 participants and seven examiners in Study I; 26 participants and 26 examiners in Study II). The sizes of each participant's little fingers were measured twice with a ring gauge by each examiner. To investigate the validity of the measurements, five hand therapists compared the finger size and hand volume of 30 participants with the ring gauge and with a figure-of-eight technique (Study III). The intra-class correlation coefficient for intra-observer reliability ranged from 0.97 to 0.99 in Study I, and 0.90 to 0.97 in Study II. The intra-class correlation coefficient for inter-observer reliability was 0.95 in Study I and 0.94 in Study II. The validity study showed a Pearson product moment correlation coefficient of 0.75. The ring gauge showed high reliability and validity for measurement of finger size. III, diagnostic.
Face validity and reliability of a pictorial instrument for assessing fundamental movement skill perceived competence in young children.

PubMed

Barnett, Lisa M; Ridgers, Nicola D; Zask, Avigdor; Salmon, Jo

2015-01-01

To determine reliability and face validity of an instrument to assess young children's perceived fundamental movement skill competence. Validation and reliability study. A pictorial instrument based on the Test Gross Motor Development-2 assessed perceived locomotor (six skills) and object control (six skills) competence using the format and item structure from the physical competence subscale of the Pictorial Scale of Perceived Competence and Acceptance for Young Children. Sample 1 completed object control items in May (n=32) and locomotor items in October 2012 (n=23) at two time points seven days apart. Children were asked at the end of the test-retest their understanding of what was happening in each picture to determine face validity. Sample 2 (n=58) completed 12 items in November 2012 on a single occasion to test internal reliability only. Sample 1 children were aged 5-7 years (M=6.0, SD=0.8) at object control assessment and 5-8 years at locomotor assessment (M=6.5, SD=0.9). Sample 2 children were aged 6-8 years (M=7.2, SD=0.73). Intra-class correlations assessed in Sample 1 children were excellent for object control (intra-class correlation=0.78), locomotor (intra-class correlation=0.82) and all 12 skills (intra-class correlations=0.83). Face validity was acceptable. Internal consistency was adequate in both samples for each subscale and all 12 skills (alpha range 0.60-0.81). This study has provided preliminary evidence for instrument reliability and face validity. This enables future alignment between the measurement of perceived and actual fundamental movement skill competence in young children. Crown Copyright © 2014. Published by Elsevier Ltd. All rights reserved.
Reliability of externally fixed dynamometry hamstring strength testing in elite youth football players.

PubMed

Wollin, Martin; Purdam, Craig; Drew, Michael K

2016-01-01

To investigate inter and intra-tester reliability of an externally fixed dynamometry unilateral hamstring strength test, in the elite sports setting. Reliability study. Sixteen, injury-free, elite male youth football players (age=16.81±0.54 years, height=180.22±5.29cm, weight 73.88±6.54kg, BMI=22.57±1.42) gave written informed consent. Unilateral maximum isometric peak hamstring force was evaluated by externally fixed dynamometry for inter-tester, intra-day and intra-tester, inter-week reliability. The test position was standardised to correlate with the terminal swing phase of the gait running cycle. Inter and intra-tester values demonstrated good to high levels of reliability. The intra-class coefficient (ICC) for inter-tester, intra-day reliability was 0.87 (95% CI=0.75-0.93) with standard error of measure percentage (SEM%) 4.7 and minimal detectable change percentage (MDC%) 12.9. Intra-tester, inter-week reliability results were ICC 0.86 (95% CI, 0.74-0.93), SEM% 5.0 and MDC% 14.0. This study demonstrates good to high inter and intra-tester reliability of isometric externally fixed dynamometry unilateral hamstring strength testing in the regular elite sport setting involving elite male youth football players. The intra-class coefficient in association with the low standard error of measure and minimal detectable change percentages suggest that this procedure is appropriate for clinical and academic use as well as monitoring hamstring strength in the elite sport setting. Crown Copyright © 2015. Published by Elsevier Ltd. All rights reserved.
The validation of the visual analogue scale for patient satisfaction after total hip arthroplasty.

PubMed

Brokelman, Roy B G; Haverkamp, Daniel; van Loon, Corné; Hol, Annemiek; van Kampen, Albert; Veth, Rene

2012-06-01

INTRODUCTION: Patient satisfaction becomes more important in our modern health care system. The assessment of satisfaction is difficult because it is a multifactorial item for which no golden standard exists. One of the potential methods of measuring satisfaction is by using the well-known visual analogue scale (VAS). In this study, we validated VAS for satisfaction. PATIENT AND METHODS: In this prospective study, we studied 147 patients (153 hips). The construct validity was measured using the Spearman correlation test that compares the satisfaction VAS with the Harris hip score, pain VAS at rest and during activity, Oxford hip score, Short Form 36 and Western Ontario McMaster Universities Osteoarthritis Index. The reliability was tested using the intra-class coefficient. RESULTS: The Pearson correlation test showed correlations in the range of 0.40-0.80. The satisfaction VAS had a high correlation between the pain VAS and Oxford hip score, which could mean that pain is one of the most important factors in patient satisfaction. The intra-class coefficient was 0.95. CONCLUSIONS: There is a moderate to mark degree of correlation between the satisfaction VAS and the currently available subjective and objective scoring systems. The intra-class coefficient of 0.95 indicates an excellent test-retest reliability. The VAS satisfaction is a simple instrument to quantify the satisfaction of a patient after total hip arthroplasty. In this study, we showed that the satisfaction VAS has a good validity and reliability.
[The reliability of a questionnaire regarding Colombian children's physical activity].

PubMed

Herazo-Beltrán, Aliz Y; Domínguez-Anaya, Regina

2012-10-01

Reporting the Physical Activity Questionnaire for school children's (PAQ-C) test-retest reliability and internal consistency. This was a descriptive study of 100 school-aged children aged 9 to 11 years old attending a school in Cartagena, Colombia. The sample was randomly selected. The PAQ-C was given twice, one week apart, after the informed consent forms had been signing by the children's parents and school officials. Cronbach's alpha coefficient of reliability was used for assessing internal consistency and an intra-class correlation coefficient for test-retest reliability SPSS (version 17.0) was used for statistical analysis. The questionnaire scored 0.73 internal consistencies during the first measurement and 0.78 on the second; intra-class correlation coefficient was 0.60. There were differences between boys and girls regarding both measurements. The PAQ-C had acceptable internal consistency and test-retest reliability, thereby making it useful for measuring children's self-reported physical activity and a valuable tool for population studies in Colombia.
The sizing of hamstring grafts for anterior cruciate reconstruction: intra- and inter-observer reliability.

PubMed

Dwyer, Tim; Whelan, Daniel B; Khoshbin, Amir; Wasserstein, David; Dold, Andrew; Chahal, Jaskarndip; Nauth, Aaron; Murnaghan, M Lucas; Ogilvie-Harris, Darrell J; Theodoropoulos, John S

2015-04-01

The objective of this study was to establish the intra- and inter-observer reliability of hamstring graft measurement using cylindrical sizing tubes. Hamstring tendons (gracilis and semitendinosus) were harvested from ten cadavers by a single surgeon and whip stitched together to create ten 4-strand hamstring grafts. Ten sports medicine surgeons and fellows sized each graft independently using either hollow cylindrical sizers or block sizers in 0.5-mm increments—the sizing technique used was applied consistently to each graft. Surgeons moved sequentially from graft to graft and measured each hamstring graft twice. Surgeons were asked to state the measured proximal (femoral) and distal (tibial) diameter of each graft, as well as the diameter of the tibial and femoral tunnels that they would drill if performing an anterior cruciate ligament (ACL) reconstruction using that graft. Reliability was established using intra-class correlation coefficients. Overall, both the inter-observer and intra-observer agreement were >0.9, demonstrating excellent reliability. The inter-observer reliability for drill sizes was also excellent (>0.9). Excellent correlation was seen between cylindrical sizing, and drill sizes (>0.9). Sizing of hamstring grafts by multiple surgeons demonstrated excellent intra-observer and intra-observer reliability, potentially validating clinical studies exploring ACL reconstruction outcomes by hamstring graft diameter when standard techniques are used. III.
The Comprehensive Snack Parenting Questionnaire (CSPQ): Development and Test-Retest Reliability.

PubMed

Gevers, Dorus W M; Kremers, Stef P J; de Vries, Nanne K; van Assema, Patricia

2018-04-26

The narrow focus of existing food parenting instruments led us to develop a food parenting practices instrument measuring the full range of food practices constructs with a focus on snacking behavior. We present the development of the questionnaire and our research on the test-retest reliability. The developed Comprehensive Snack Parenting Questionnaire (CSPQ) covers 21 constructs. Test-retest reliability was assessed by calculating intra class correlation coefficients and percentage agreement after two administrations of the CSPQ among a sample of 66 Dutch parents. Test-retest reliability analysis revealed acceptable intra class correlation coefficients (≥0.41) or agreement scores (≥0.60) for all items. These results, together with earlier work, suggest sufficient psychometric characteristics. The comprehensive, but brief CSPQ opens up chances for highly essential but unstudied research questions to understand and predict children’s snack intake. Example applications include studying the interactional nature of food parenting practices or interactions of food parenting with general parenting or child characteristics.
Intra-rater Reliability of Arm and Hand Muscle Strength Measurements in Persons With Late Effects of Polio.

PubMed

Brogårdh, Christina; Flansbjer, Ulla-Britt; Carlsson, Håkan; Lexell, Jan

2015-10-01

Muscle weakness in the upper limb is common in persons with late effects of polio. To be able to measure muscle strength and follow changes over time, reliable measurements are needed. To evaluate the intra-rater reliability of isometric and isokinetic arm and hand muscle strength measurements in persons with late effects of polio. A test-retest design. A university hospital outpatient clinic. Twenty-eight persons (mean age 68 years, SD 11 years) with late effects of polio in their upper limbs. Isometric shoulder abduction, isokinetic concentric elbow flexion and extension, isometric elbow flexion, and isometric grip strength were measured twice, 14 days apart. Reliability was evaluated with the intra-class correlation coefficient, the mean difference between the test sessions (d¯), together with the 95% confidence intervals for d¯ , the standard error of measurement (SEM and SEM%), the smallest real difference (SRD and SRD%), and Bland-Altman graphs. A fixed dynamometer (Biodex) was used to measure arm strength and an electronic dynamometer (GRIP-it) was used to measure grip strength. Intra-rater reliability was high, with intra-class correlation coefficients between 0.87 and 0.98. The SEM%, representing the smallest change for a group of persons, ranged from 7%-24% for all strength measurements, and the SRD%, representing the smallest change for an individual person, ranged from 20%-67%. Muscle strength in the upper limbs can be reliably measured in persons with late effects of polio. However, the measurement errors indicate that the method is more suitable to detect changes in muscle strength for a group of persons than for an individual person. Copyright © 2015 American Academy of Physical Medicine and Rehabilitation. Published by Elsevier Inc. All rights reserved.
Test-retest reliability of the Progressive Isoinertial Lifting Evaluation (PILE).

PubMed

Lygren, Hildegunn; Dragesund, Tove; Joensen, Jón; Ask, Tove; Moe-Nilssen, Rolf

2005-05-01

A repeated measures single group design. To investigate test-retest reliability of Progressive Isoinertial Lifting Evaluation on patients with long lasting musculoskeletal problems related to the lumbar spine. Test-retest reliability has been satisfactory in healthy men. Test-retest reliability for clinical populations has not been reported. A total of 31 patients (17 women and 14 men) with long lasting low back pain participated in the study. The patients were tested twice at an interval of 2 days and at the same time of the day. The heaviest load that the patient could lift 4 times was used as outcome measure. The error of measurement indicates that the true result in 95% of cases will be within +/-4.5 kg from the measured value, while the difference between 2 measurements in 95% of cases will be less than 6.4 kg. Intra-class correlation (1,1) was 0.91. Relative test-retest reliability was high assessed by intra-class correlation, but absolute measurement variability reported as the smallest detectable difference has relevance for the interpretation of clinical test results and should also be considered.
Intra-rater and inter-rater reliability of ultrasonographic measurements of acromion-greater tuberosity distance in patients with post-stroke hemiplegia.

PubMed

Kumar, Praveen; Cruziah, Reynold; Bradley, Michael; Gray, Selena; Swinkels, Annette

2016-06-01

Glenohumeral subluxation (GHS) is reported in up to 81% of patients with stroke. Ultrasonographic measurements of GHS by measuring the acromion-greater tuberosity (AGT) have been found to be reliable for experienced raters. The primary aim was to assess the intra-rater reliability of measurements of AGT distance in people with stroke following a short course of rater training. A secondary aim was to compare the inter-rater reliability of these measurements between novice and experienced raters. Patients with stroke (n = 16; 5 men, 11 women; 74 ± 10 years) with 1-sided weakness who gave informed consent were recruited. Ultrasonographic measurements were recorded at the bedside by two physiotherapists with patients seated upright in a hospital chair. Reliability was assessed by intra-class correlation coefficients (ICCs) and the standard error of measurements (SEM). Minimum detectable change (MDC90) scores were used to estimate the magnitude of change that is likely to exceed measurement error. Mean ± SD AGT distances on the affected and unaffected sides for rater 1 were 2.2 ± 0.7 and 1.7 ± 0.4 cm, respectively. Corresponding values for rater 2 were 2.5 ± 0.6 and 2.0 ± 0.4 cm. Intra-class correlation coefficient values for the affected and unaffected shoulders for rater 1 were 0.96 and 0.91, respectively. Corresponding values for rater 2 were 0.95 and 0.90.SEM and MDC90 for both affected and unaffected shoulders were ≤ 0.2 cm. Inter-rater reliability coefficients were 0.86 (affected) and 0.76 (unaffected) shoulders. Ultrasonographic measurement of AGT distance demonstrates excellent intra-rater reliability for a novice rater. Inter-rater reliability of ultrasonographic measurement of AGT also demonstrates good reliability between novice and experienced raters.
Consistency of clinical biomechanical measures between three different institutions: implications for multi-center biomechanical and epidemiological research.

PubMed

Myer, Gregory D; Wordeman, Samuel C; Sugimoto, Dai; Bates, Nathaniel A; Roewer, Benjamin D; Medina McKeon, Jennifer M; DiCesare, Christopher A; Di Stasi, Stephanie L; Barber Foss, Kim D; Thomas, Staci M; Hewett, Timothy E

2014-05-01

Multi-center collaborations provide a powerful alternative to overcome the inherent limitations to single-center investigations. Specifically, multi-center projects can support large-scale prospective, longitudinal studies that investigate relatively uncommon outcomes, such as anterior cruciate ligament injury. This project was conceived to assess within- and between-center reliability of an affordable, clinical nomogram utilizing two-dimensional video methods to screen for risk of knee injury. The authors hypothesized that the two-dimensional screening methods would provide good-to-excellent reliability within and between institutions for assessment of frontal and sagittal plane biomechanics. Nineteen female, high school athletes participated. Two-dimensional video kinematics of the lower extremity during a drop vertical jump task were collected on all 19 study participants at each of the three facilities. Within-center and between-center reliability were assessed with intra- and inter-class correlation coefficients. Within-center reliability of the clinical nomogram variables was consistently excellent, but between-center reliability was fair-to-good. Within-center intra-class correlation coefficient for all nomogram variables combined was 0.98, while combined between-center inter-class correlation coefficient was 0.63. Injury risk screening protocols were reliable within and repeatable between centers. These results demonstrate the feasibility of multi-site biomechanical studies and establish a framework for further dissemination of injury risk screening algorithms. Specifically, multi-center studies may allow for further validation and optimization of two-dimensional video screening tools. 2b.
The reliability of four widely used patellar height ratios.

PubMed

van Duijvenbode, Dennis; Stavenuiter, Michel; Burger, Bart; van Dijke, Cees; Spermon, Jacco; Hoozemans, Marco

2016-03-01

The objective of this study was to evaluate the inter-observer reliability and the intra-observer reliability of four patellar height ratios: Insall-Salvati (IS), modified Insall-Salvati (MIS), Blackburne-Peel (BP) and Caton-Deschamps (CD). The patellar height ratios were assessed by four independent examiners using weight-bearing lateral knee radiographs in 30° flexion. Intra-class correlation coefficients and Fleiss' kappa's were determined. The inter-observer reliability was excellent for the IS and moderate for the other ratios. When the ratio values were categorized, the inter-observer reliability was strong for the IS, moderate for the MIS and BP, and poor for the CD. The intra-observer reliability was excellent for the IS, MIS and CD, and strong for the BP. When the ratio values were categorized, the intra-observer reliability was strong for the IS and MIS, and moderate for the other ratios. Although the IS showed best reliability, we advise to use the MIS as it showed the second best reliability but is, according to the literature, associated with better validity.
Validity of a smartphone protractor to measure sagittal parameters in adult spinal deformity.

PubMed

Kunkle, William Aaron; Madden, Michael; Potts, Shannon; Fogelson, Jeremy; Hershman, Stuart

2017-10-01

Smartphones have become an integral tool in the daily life of health-care professionals (Franko 2011). Their ease of use and wide availability often make smartphones the first tool surgeons use to perform measurements. This technique has been validated for certain orthopedic pathologies (Shaw 2012; Quek 2014; Milanese 2014; Milani 2014), but never to assess sagittal parameters in adult spinal deformity (ASD). This study was designed to assess the validity, reproducibility, precision, and efficiency of using a smartphone protractor application to measure sagittal parameters commonly measured in ASD assessment and surgical planning. This study aimed to (1) determine the validity of smartphone protractor applications, (2) determine the intra- and interobserver reliability of smartphone protractor applications when used to measure sagittal parameters in ASD, (3) determine the efficiency of using a smartphone protractor application to measure sagittal parameters, and (4) elucidate whether a physician's level of experience impacts the reliability or validity of using a smartphone protractor application to measure sagittal parameters in ASD. An experimental validation study was carried out. Thirty standard 36″ standing lateral radiographs were examined. Three separate measurements were performed using a marker and protractor; then at a separate time point, three separate measurements were performed using a smartphone protractor application for all 30 radiographs. The first 10 radiographs were then re-measured two more times, for a total of three measurements from both the smartphone protractor and marker and protractor. The parameters included lumbar lordosis, pelvic incidence, and pelvic tilt. Three raters performed all measurements-a junior level orthopedic resident, a senior level orthopedic resident, and a fellowship-trained spinal deformity surgeon. All data, including the time to perform the measurements, were recorded, and statistical analysis was performed to determine intra- and interobserver reliability, as well as accuracy, efficiency, and precision. Statistical analysis using the intra- and interclass correlation coefficient was calculated using R (version 3.3.2, 2016) to determine the degree of intra- and interobserver reliability. High rates of intra- and interobserver reliability were observed between the junior resident, senior resident, and attending surgeon when using the smartphone protractor application as demonstrated by high inter- and intra-class correlation coefficients greater than 0.909 and 0.874 respectively. High rates of inter- and intraobserver reliability were also seen between the junior resident, senior resident, and attending surgeon when a marker and protractor were used as demonstrated by high inter- and intra-class correlation coefficients greater than 0.909 and 0.807 respectively. The lumbar lordosis, pelvic incidence, and pelvic tilt values were accurately measured by all three raters, with excellent inter- and intra-class correlation coefficient values. When the first 10 radiographs were re-measured at different time points, a high degree of precision was noted. Measurements performed using the smartphone application were consistently faster than using a marker and protractor-this difference reached statistical significance of p<.05. Adult spinal deformity radiographic parameters can be measured accurately, precisely, reliably, and more efficiently using a smartphone protractor application than with a standard protractor and wax pencil. A high degree of intra- and interobserver reliability was seen between the residents and attending surgeon, indicating measurements made with a smartphone protractor are unaffected by an observer's level of experience. As a result, smartphone protractors may be used when planning ASD surgery. Copyright © 2017 Elsevier Inc. All rights reserved.
Evaluating Written Patient Information for Eczema in German: Comparing the Reliability of Two Instruments, DISCERN and EQIP

PubMed Central

McCool, Megan E.; Wahl, Josepha; Schlecht, Inga; Apfelbacher, Christian

2015-01-01

Patients actively seek information about how to cope with their health problems, but the quality of the information available varies. A number of instruments have been developed to assess the quality of patient information, primarily though in English. Little is known about the reliability of these instruments when applied to patient information in German. The objective of our study was to investigate and compare the reliability of two validated instruments, DISCERN and EQIP, in order to determine which of these instruments is better suited for a further study pertaining to the quality of information available to German patients with eczema. Two independent raters evaluated a random sample of 20 informational brochures in German. All the brochures addressed eczema as a disorder and/or therapy options and care. Intra-rater and inter-rater reliability were assessed by calculating intra-class correlation coefficients, agreement was tested with weighted kappas, and the correlation of the raters’ scores for each instrument was measured with Pearson’s correlation coefficient. DISCERN demonstrated substantial intra- and inter-rater reliability. It also showed slightly better agreement than EQIP. There was a strong correlation of the raters’ scores for both instruments. The findings of this study support the reliability of both DISCERN and EQIP. However, based on the results of the inter-rater reliability, agreement and correlation analyses, we consider DISCERN to be the more precise tool for our project on patient information concerning the treatment and care of eczema. PMID:26440612
Evaluating Written Patient Information for Eczema in German: Comparing the Reliability of Two Instruments, DISCERN and EQIP.

PubMed

McCool, Megan E; Wahl, Josepha; Schlecht, Inga; Apfelbacher, Christian

2015-01-01

Patients actively seek information about how to cope with their health problems, but the quality of the information available varies. A number of instruments have been developed to assess the quality of patient information, primarily though in English. Little is known about the reliability of these instruments when applied to patient information in German. The objective of our study was to investigate and compare the reliability of two validated instruments, DISCERN and EQIP, in order to determine which of these instruments is better suited for a further study pertaining to the quality of information available to German patients with eczema. Two independent raters evaluated a random sample of 20 informational brochures in German. All the brochures addressed eczema as a disorder and/or therapy options and care. Intra-rater and inter-rater reliability were assessed by calculating intra-class correlation coefficients, agreement was tested with weighted kappas, and the correlation of the raters' scores for each instrument was measured with Pearson's correlation coefficient. DISCERN demonstrated substantial intra- and inter-rater reliability. It also showed slightly better agreement than EQIP. There was a strong correlation of the raters' scores for both instruments. The findings of this study support the reliability of both DISCERN and EQIP. However, based on the results of the inter-rater reliability, agreement and correlation analyses, we consider DISCERN to be the more precise tool for our project on patient information concerning the treatment and care of eczema.
Can we have an overall osteoarthritis severity score for the patellofemoral joint using magnetic resonance imaging? Reliability and validity.

PubMed

Kobayashi, Sarah; Peduto, Anthony; Simic, Milena; Fransen, Marlene; Refshauge, Kathryn; Mah, Jean; Pappas, Evangelos

2018-04-01

This work aimed to assess inter-rater reliability and agreement of a magnetic resonance imaging (MRI)-based Kellgren and Lawrence (K&L) grading for patellofemoral joint osteoarthritis (OA) and to validate it against the MRI Osteoarthritis Knee Score (MOAKS). MRI scans from people aged 45 to 75 years with chronic knee pain participating in a randomised clinical trial evaluating dietary supplements were utilised. Fifty participants were randomly selected and scored using the MRI-based K&L grading using axial and sagittal MRI scans. Raters conducted inter-rater reliability, blinded to clinical information, radiology reports and other rater results. Intra- and inter-rater reliability and agreement were evaluated using the intra-class correlation coefficient (ICC) and Cohen's weighted kappa. There was a 2-week interval between the first and second readings for intra-rater reliability. Validity was assessed using the MOAKS and evaluated using Spearman's correlation coefficient. Intra-rater reliability of the K&L system was excellent: ICC 0.91 (95% CI 0.82-0.95); weighted kappa (ĸ = 0.69). Inter-rater reliability was high (ICC 0.88; 95% CI 0.79-0.93), while agreement between raters was moderate (ĸ = 0.49-0.57). Validity analysis demonstrated a strong correlation between the total MOAKS features score and the K&L grading system (ρ = 0.62-0.67) but weak correlations when compared with individual MOAKS features (ρ = 0.19-0.61). The high reliability and good agreement show consistency in grading the severity of patellofemoral OA with the MRI-based K&L score. Our validity results suggest that the scale may be useful, particularly in the clinical environment. Future research should validate this method against clinical findings.
Assessment of lumbosacral kyphosis in spondylolisthesis: a computer-assisted reliability study of six measurement techniques

PubMed Central

Glavas, Panagiotis; Mac-Thiong, Jean-Marc; Parent, Stefan; de Guise, Jacques A.

2008-01-01

Although recognized as an important aspect in the management of spondylolisthesis, there is no consensus on the most reliable and optimal measure of lumbosacral kyphosis (LSK). Using a custom computer software, four raters evaluated 60 standing lateral radiographs of the lumbosacral spine during two sessions at a 1-week interval. The sample size consisted of 20 normal, 20 low and 20 high grade spondylolisthetic subjects. Six parameters were included for analysis: Boxall’s slip angle, Dubousset’s lumbosacral angle (LSA), the Spinal Deformity Study Group’s (SDSG) LSA, dysplastic SDSG LSA, sagittal rotation (SR), kyphotic Cobb angle (k-Cobb). Intra- and inter-rater reliability for all parameters was assessed using intra-class correlation coefficients (ICC). Correlations between parameters and slip percentage were evaluated with Pearson coefficients. The intra-rater ICC’s for all the parameters ranged between 0.81 and 0.97 and the inter-rater ICC’s were between 0.74 and 0.98. All parameters except sagittal rotation showed a medium to large correlation with slip percentage. Dubousset’s LSA and the k-Cobb showed the largest correlations (r = −0.78 and r = −0.50, respectively). SR was associated with the weakest correlation (r = −0.10). All other parameters had medium correlations with percent slip (r = 0.31–0.43). All measurement techniques provided excellent inter- and intra-rater reliability. Dubousset’s LSA showed the strongest correlation with slip grade. This parameter can be used in the clinical setting with PACS software capabilities to assess LSK. A computer-assisted technique is recommended in order to increase the reliability of the measurement of LSK in spondylolisthesis. PMID:19015898
Rater reliability and concurrent validity of the Keyboard Personal Computer Style instrument (K-PeCS).

PubMed

Baker, Nancy A; Cook, James R; Redfern, Mark S

2009-01-01

This paper describes the inter-rater and intra-rater reliability, and the concurrent validity of an observational instrument, the Keyboard Personal Computer Style instrument (K-PeCS), which assesses stereotypical postures and movements associated with computer keyboard use. Three trained raters independently rated the video clips of 45 computer keyboard users to ascertain inter-rater reliability, and then re-rated a sub-sample of 15 video clips to ascertain intra-rater reliability. Concurrent validity was assessed by comparing the ratings obtained using the K-PeCS to scores developed from a 3D motion analysis system. The overall K-PeCS had excellent reliability [inter-rater: intra-class correlation coefficients (ICC)=.90; intra-rater: ICC=.92]. Most individual items on the K-PeCS had from good to excellent reliability, although six items fell below ICC=.75. Those K-PeCS items that were assessed for concurrent validity compared favorably to the motion analysis data for all but two items. These results suggest that most items on the K-PeCS can be used to reliably document computer keyboarding style.
A systematic review of statistical methods used to test for reliability of medical instruments measuring continuous variables.

PubMed

Zaki, Rafdzah; Bulgiba, Awang; Nordin, Noorhaire; Azina Ismail, Noor

2013-06-01

Reliability measures precision or the extent to which test results can be replicated. This is the first ever systematic review to identify statistical methods used to measure reliability of equipment measuring continuous variables. This studyalso aims to highlight the inappropriate statistical method used in the reliability analysis and its implication in the medical practice. In 2010, five electronic databases were searched between 2007 and 2009 to look for reliability studies. A total of 5,795 titles were initially identified. Only 282 titles were potentially related, and finally 42 fitted the inclusion criteria. The Intra-class Correlation Coefficient (ICC) is the most popular method with 25 (60%) studies having used this method followed by the comparing means (8 or 19%). Out of 25 studies using the ICC, only 7 (28%) reported the confidence intervals and types of ICC used. Most studies (71%) also tested the agreement of instruments. This study finds that the Intra-class Correlation Coefficient is the most popular method used to assess the reliability of medical instruments measuring continuous outcomes. There are also inappropriate applications and interpretations of statistical methods in some studies. It is important for medical researchers to be aware of this issue, and be able to correctly perform analysis in reliability studies.

The reliability of three psoriasis assessment tools: Psoriasis area and severity index, body surface area and physician global assessment.

PubMed

Bożek, Agnieszka; Reich, Adam

2017-08-01

A wide variety of psoriasis assessment tools have been proposed to evaluate the severity of psoriasis in clinical trials and daily practice. The most frequently used clinical instrument is the psoriasis area and severity index (PASI); however, none of the currently published severity scores used for psoriasis meets all the validation criteria required for an ideal score. The aim of this study was to compare and assess the reliability of 3 commonly used assessment instruments for psoriasis severity: the psoriasis area and severity index (PASI), body surface area (BSA) and physician global assessment (PGA). On the scoring day, 10 trained dermatologists evaluated 9 adult patients with plaque-type psoriasis using the PASI, BSA and PGA. All the subjects were assessed twice by each physician. Correlations between the assessments were analyzed using the Pearson correlation coefficient. Intra-class correlation coefficient (ICC) was calculated to analyze intra-rater reliability, and the coefficient of variation (CV) was used to assess inter-rater variability. Significant correlations were observed among the 3 scales in both assessments. In all 3 scales the ICCs were > 0.75, indicating high intra-rater reliability. The highest ICC was for the BSA (0.96) and the lowest one for the PGA (0.87). The CV for the PGA and PASI were 29.3 and 36.9, respectively, indicating moderate inter-rater variability. The CV for the BSA was 57.1, indicating high inter-rater variability. Comparing the PASI, PGA and BSA, it was shown that the PGA had the highest inter-rater reliability, whereas the BSA had the highest intra-rater reliability. The PASI showed intermediate values in terms of interand intra-rater reliability. None of the 3 assessment instruments showed a significant advantage over the other. A reliable assessment of psoriasis severity requires the use of several independent evaluations simultaneously.
Repeatability and reproducibility of corneal thickness using SOCT Copernicus HR.

PubMed

Vidal, Silvia; Viqueira, Valentín; Mas, David; Domenech, Begoña

2013-05-01

The aim of this study is to determine the reliability of corneal thickness measurements derived from SOCT Copernicus HR (Fourier domain OCT). Thirty healthy eyes of 30 subjects were evaluated. One eye of each patient was chosen randomly. Images were obtained of the central (up to 2.0 mm from the corneal apex) and paracentral (2.0 to 4.0 mm) cornea. We assessed corneal thickness (central and paracentral) and epithelium thickness. The intra-observer repeatability data were analysed using the intra-class correlation coefficient (ICC) for a range of 95 per cent within-subject standard deviation (S(W)) and the within-subject coefficient of variation (C(W)). The level of agreement by Bland-Altman analysis was also represented for the study of the reproducibility between observers and agreement between methods of measurement (automatic versus manual). The mean value of the central corneal thickness (CCT) was 542.4 ± 30.1 μm (SD). There was a high intra-observer agreement, finding the best result in the central sector with an intra-class correlation coefficient of 0.99, 95 per cent CI (0.989 to 0.997) and the worst, in the minimum corneal thickness, with an intra-class correlation coefficient of 0.672, 95 per cent CI (0.417 to 0.829). Reproducibility between observers was very high. The best result was found in the central sector thickness obtained both manually and automatically with an intra-class correlation coefficient of 0.990 in both cases and the worst result in the maximum corneal thickness with an intra-class correlation coefficient of 0.827. The agreement between measurement methods was also very high with intra-class correlation coefficient greater than 0.91. On the other hand the repeatability and reproducibility for epithelial measurements was poor. Pachymetric mapping with SOCT Copernicus HR was found to be highly repeatable and reproducible. We found that the device lacks an appropriate ergonomic design as proper focusing of the laser beam onto the cornea for anterior segment scanning required that patients were positioned slightly farther away from the machine head-rest than in the setup for retinal imaging. © 2013 The Authors. Clinical and Experimental Optometry © 2013 Optometrists Association Australia.
Open source posturography.

PubMed

Rey-Martinez, Jorge; Pérez-Fernández, Nicolás

2016-12-01

The proposed validation goal of 0.9 in intra-class correlation coefficient was reached with the results of this study. With the obtained results we consider that the developed software (RombergLab) is a validated balance assessment software. The reliability of this software is dependent of the used force platform technical specifications. Develop and validate a posturography software and share its source code in open source terms. Prospective non-randomized validation study: 20 consecutive adults underwent two balance assessment tests, six condition posturography was performed using a clinical approved software and force platform and the same conditions were measured using the new developed open source software using a low cost force platform. Intra-class correlation index of the sway area obtained from the center of pressure variations in both devices for the six conditions was the main variable used for validation. Excellent concordance between RombergLab and clinical approved force platform was obtained (intra-class correlation coefficient =0.94). A Bland and Altman graphic concordance plot was also obtained. The source code used to develop RombergLab was published in open source terms.
Reliability of intra-oral quantitative sensory testing (QST) in patients with atypical odontalgia and healthy controls - a multicentre study.

PubMed

Baad-Hansen, L; Pigg, M; Yang, G; List, T; Svensson, P; Drangsholt, M

2015-02-01

The reliability of comprehensive intra-oral quantitative sensory testing (QST) protocol has not been examined systematically in patients with chronic oro-facial pain. The aim of the present multicentre study was to examine test-retest and interexaminer reliability of intra-oral QST measures in terms of absolute values and z-scores as well as within-session coefficients of variation (CV) values in patients with atypical odontalgia (AO) and healthy pain-free controls. Forty-five patients with AO and 68 healthy controls were subjected to bilateral intra-oral gingival QST and unilateral extratrigeminal QST (thenar) on three occasions (twice on 1 day by two different examiners and once approximately 1 week later by one of the examiners). Intra-class correlation coefficients and kappa values for interexaminer and test-retest reliability were computed. Most of the standardised intra-oral QST measures showed fair to excellent interexaminer (9-12 of 13 measures) and test-retest (7-11 of 13 measures) reliability. Furthermore, no robust differences in reliability measures or within-session variability (CV) were detected between patients with AO and the healthy reference group. These reliability results in chronic orofacial pain patients support earlier suggestions based on data from healthy subjects that intra-oral QST is sufficiently reliable for use as a part of a comprehensive evaluation of patients with somatosensory disturbances or neuropathic pain in the trigeminal region. © 2014 John Wiley & Sons Ltd.
Improving Teacher Selection: The Effect of Inter-Rater Reliability in the Screening Process. CEDR Working Paper. WP #2015-7

ERIC Educational Resources Information Center

Martinkova, Patricia; Goldhaber, Dan

2015-01-01

Inter-rater reliability, commonly assessed by intra-class correlation coefficient ICC, is an important index for describing the extent to which there is consistency amongst two or more raters in assigned measures. In organizational research, the data structure is often hierarchical and designs deviate substantially from the ideal of a balanced…
Improved estimation of subject-level functional connectivity using full and partial correlation with empirical Bayes shrinkage.

PubMed

Mejia, Amanda F; Nebel, Mary Beth; Barber, Anita D; Choe, Ann S; Pekar, James J; Caffo, Brian S; Lindquist, Martin A

2018-05-15

Reliability of subject-level resting-state functional connectivity (FC) is determined in part by the statistical techniques employed in its estimation. Methods that pool information across subjects to inform estimation of subject-level effects (e.g., Bayesian approaches) have been shown to enhance reliability of subject-level FC. However, fully Bayesian approaches are computationally demanding, while empirical Bayesian approaches typically rely on using repeated measures to estimate the variance components in the model. Here, we avoid the need for repeated measures by proposing a novel measurement error model for FC describing the different sources of variance and error, which we use to perform empirical Bayes shrinkage of subject-level FC towards the group average. In addition, since the traditional intra-class correlation coefficient (ICC) is inappropriate for biased estimates, we propose a new reliability measure denoted the mean squared error intra-class correlation coefficient (ICC MSE ) to properly assess the reliability of the resulting (biased) estimates. We apply the proposed techniques to test-retest resting-state fMRI data on 461 subjects from the Human Connectome Project to estimate connectivity between 100 regions identified through independent components analysis (ICA). We consider both correlation and partial correlation as the measure of FC and assess the benefit of shrinkage for each measure, as well as the effects of scan duration. We find that shrinkage estimates of subject-level FC exhibit substantially greater reliability than traditional estimates across various scan durations, even for the most reliable connections and regardless of connectivity measure. Additionally, we find partial correlation reliability to be highly sensitive to the choice of penalty term, and to be generally worse than that of full correlations except for certain connections and a narrow range of penalty values. This suggests that the penalty needs to be chosen carefully when using partial correlations. Copyright © 2018. Published by Elsevier Inc.
Development and reliability testing of a food store observation form.

PubMed

Rimkus, Leah; Powell, Lisa M; Zenk, Shannon N; Han, Euna; Ohri-Vachaspati, Punam; Pugach, Oksana; Barker, Dianne C; Resnick, Elissa A; Quinn, Christopher M; Myllyluoma, Jaana; Chaloupka, Frank J

2013-01-01

To develop a reliable food store observational data collection instrument to be used for measuring product availability, pricing, and promotion. Observational data collection. A total of 120 food stores (26 supermarkets, 34 grocery stores, 54 gas/convenience stores, and 6 mass merchandise stores) in the Chicago metropolitan statistical area. Inter-rater reliability for product availability, pricing, and promotion measures on a food store observational data collection instrument. Cohen's kappa coefficient and proportion of overall agreement for dichotomous variables and intra-class correlation coefficient for continuous variables. Inter-rater reliability, as measured by average kappa coefficient, was 0.84 for food and beverage product availability measures, 0.80 for interior store characteristics, and 0.70 for exterior store characteristics. For continuous measures, average intra-class correlation coefficient was 0.82 for product pricing measures; 0.90 for counts of fresh, frozen, and canned fruit and vegetable options; and 0.85 for counts of advertisements on the store exterior and property. The vast majority of measures demonstrated substantial or almost perfect agreement. Although some items may require revision, results suggest that the instrument may be used to reliably measure the food store environment. Copyright © 2013 Society for Nutrition Education and Behavior. Published by Elsevier Inc. All rights reserved.
Bimanual Capacity of Children With Cerebral Palsy: Intra- and Interrater Reliability of a Revised Edition of the Bimanual Fine Motor Function Classification.

PubMed

Elvrum, Ann-Kristin G; Beckung, Eva; Sæther, Rannei; Lydersen, Stian; Vik, Torstein; Himmelmann, Kate

2017-08-01

To develop a revised edition of the Bimanual Fine Motor Function (BFMF 2), as a classification of fine motor capacity in children with cerebral palsy (CP), and establish intra- and interrater reliability of this edition. The content of the original BFMF was discussed by an expert panel, resulting in a revised edition comprising the original description of the classification levels, but in addition including figures with specific explanatory text. Four professionals classified fine motor function of 79 children (3-17 years; 45 boys) who represented all subtypes of CP and Manual Ability Classification levels (I-V). Intra- and inter-rater reliability was assessed using overall intra-class correlation coefficient (ICC), and Cohen's quadratic weighted kappa. The overall ICC was 0.86. Cohen's weighted kappa indicated high intra-rater (к w : >0.90) and inter-rater (к w : >0.85) reliability. The revised BFMF 2 had high intra- and interrater reliability. The classification levels could be determined from short video recordings (<5 minutes), using the figures and precise descriptions of the fine motor function levels included in the BFMF 2. Thus, the BFMF 2 may be a feasible and useful classification of fine motor capacity both in research and in clinical practice.
The Scarbase Duo(®): Intra-rater and inter-rater reliability and validity of a compact dual scar assessment tool.

PubMed

Fell, Matthew; Meirte, Jill; Anthonissen, Mieke; Maertens, Koen; Pleat, Jonathon; Moortgat, Peter

2016-03-01

Objective scar assessment tools were designed to help identify problematic scars and direct clinical management. Their use has been restricted by their measurement of a single scar property and the bulky size of equipment. The Scarbase Duo(®) was designed to assess both trans-epidermal water loss (TEWL) and colour of a burn scar whilst being compact and easy to use. Twenty patients with a burn scar were recruited and measurements taken using the Scarbase Duo(®) by two observers. The Scarbase Duo(®) measures TEWL via an open-chamber system and undertakes colorimetry via narrow-band spectrophotometry, producing values for relative erythema and melanin pigmentation. Validity was assessed by comparing the Scarbase Duo(®) against the Dermalab(®) and the Minolta Chromameter(®) respectively for TEWL and colorimetry measurements. The intra-class correlation coefficient (ICC) was used to assess reliability with standard error of measurement (SEM) used to assess reproducibility of measurements. The Pearson correlation coefficient (r) was used to assess the convergent validity. The Scarbase Duo(®) TEWL mode had excellent reliability when used on scars for both intra- (ICC=0.95) and inter-rater (ICC=0.96) measurements with moderate SEM values. The erythema component of the colorimetry mode showed good reliability for use on scars for both intra-(ICC=0.81) and inter-rater (ICC=0.83) measurements with low SEM values. Pigmentation values showed excellent reliability on scar tissue for both intra- (ICC=0.97) and inter-rater (ICC=0.97) with moderate SEM values. The Scarbase Duo(®) TEWL function had excellent correlation with the Dermalab(®) (r=0.93) whilst the colorimetry erythema value had moderate correlation with the Minolta Chromameter (r=0.72). The Scarbase Duo(®) is a reliable and objective scar assessment tool, which is specifically designed for burn scars. However, for clinical use, standardised measurement conditions are recommended. Copyright © 2015 Elsevier Ltd and ISBI. All rights reserved.
Judging in Rhythmic Gymnastics at Different Levels of Performance.

PubMed

Leandro, Catarina; Ávila-Carvalho, Lurdes; Sierra-Palmeiro, Elena; Bobo-Arce, Marta

2017-12-01

This study aimed to analyse the quality of difficulty judging in rhythmic gymnastics, at different levels of performance. The sample consisted of 1152 difficulty scores concerning 288 individual routines, performed in the World Championships in 2013. The data were analysed using the mean absolute judge deviation from the final difficulty score, a Cronbach's alpha coefficient and intra-class correlations, for consistency and reliability assessment. For validity assessment, mean deviations of judges' difficulty scores, the Kendall's coefficient of concordance W and ANOVA eta-squared values were calculated. Overall, the results in terms of consistency (Cronbach's alpha mostly above 0.90) and reliability (intra-class correlations for single and average measures above 0.70 and 0.90, respectively) were satisfactory, in the first and third parts of the ranking on all apparatus. The medium level gymnasts, those in the second part of the ranking, had inferior reliability indices and highest score dispersion. In this part, the minimum of corrected item-total correlation of individual judges was 0.55, with most values well below, and the matrix for between-judge correlations identified remarkable inferior correlations. These findings suggest that the quality of difficulty judging in rhythmic gymnastics may be compromised at certain levels of performance. In future, special attention should be paid to the judging analysis of the medium level gymnasts, as well as the Code of Points applicability at this level.
Judging in Rhythmic Gymnastics at Different Levels of Performance

PubMed Central

Ávila-Carvalho, Lurdes; Sierra-Palmeiro, Elena; Bobo-Arce, Marta

2017-01-01

Abstract This study aimed to analyse the quality of difficulty judging in rhythmic gymnastics, at different levels of performance. The sample consisted of 1152 difficulty scores concerning 288 individual routines, performed in the World Championships in 2013. The data were analysed using the mean absolute judge deviation from the final difficulty score, a Cronbach’s alpha coefficient and intra-class correlations, for consistency and reliability assessment. For validity assessment, mean deviations of judges’ difficulty scores, the Kendall’s coefficient of concordance W and ANOVA eta-squared values were calculated. Overall, the results in terms of consistency (Cronbach’s alpha mostly above 0.90) and reliability (intra-class correlations for single and average measures above 0.70 and 0.90, respectively) were satisfactory, in the first and third parts of the ranking on all apparatus. The medium level gymnasts, those in the second part of the ranking, had inferior reliability indices and highest score dispersion. In this part, the minimum of corrected item-total correlation of individual judges was 0.55, with most values well below, and the matrix for between-judge correlations identified remarkable inferior correlations. These findings suggest that the quality of difficulty judging in rhythmic gymnastics may be compromised at certain levels of performance. In future, special attention should be paid to the judging analysis of the medium level gymnasts, as well as the Code of Points applicability at this level. PMID:29339996
Validity and reliability of a low-cost digital dynamometer for measuring isometric strength of lower limb.

PubMed

Romero-Franco, Natalia; Jiménez-Reyes, Pedro; Montaño-Munuera, Juan A

2017-11-01

Lower limb isometric strength is a key parameter to monitor the training process or recognise muscle weakness and injury risk. However, valid and reliable methods to evaluate it often require high-cost tools. The aim of this study was to analyse the concurrent validity and reliability of a low-cost digital dynamometer for measuring isometric strength in lower limb. Eleven physically active and healthy participants performed maximal isometric strength for: flexion and extension of ankle, flexion and extension of knee, flexion, extension, adduction, abduction, internal and external rotation of hip. Data obtained by the digital dynamometer were compared with the isokinetic dynamometer to examine its concurrent validity. Data obtained by the digital dynamometer from 2 different evaluators and 2 different sessions were compared to examine its inter-rater and intra-rater reliability. Intra-class correlation (ICC) for validity was excellent in every movement (ICC > 0.9). Intra and inter-tester reliability was excellent for all the movements assessed (ICC > 0.75). The low-cost digital dynamometer demonstrated strong concurrent validity and excellent intra and inter-tester reliability for assessing isometric strength in the main lower limb movements.
Comparison of in vivo 3D cone-beam computed tomography tooth volume measurement protocols.

PubMed

Forst, Darren; Nijjar, Simrit; Flores-Mir, Carlos; Carey, Jason; Secanell, Marc; Lagravere, Manuel

2014-12-23

The objective of this study is to analyze a set of previously developed and proposed image segmentation protocols for precision in both intra- and inter-rater reliability for in vivo tooth volume measurements using cone-beam computed tomography (CBCT) images. Six 3D volume segmentation procedures were proposed and tested for intra- and inter-rater reliability to quantify maxillary first molar volumes. Ten randomly selected maxillary first molars were measured in vivo in random order three times with 10 days separation between measurements. Intra- and inter-rater agreement for all segmentation procedures was attained using intra-class correlation coefficient (ICC). The highest precision was for automated thresholding with manual refinements. A tooth volume measurement protocol for CBCT images employing automated segmentation with manual human refinement on a 2D slice-by-slice basis in all three planes of space possessed excellent intra- and inter-rater reliability. Three-dimensional volume measurements of the entire tooth structure are more precise than 3D volume measurements of only the dental roots apical to the cemento-enamel junction (CEJ).
Quantitative comparison and evaluation of software packages for assessment of abdominal adipose tissue distribution by magnetic resonance imaging.

PubMed

Bonekamp, S; Ghosh, P; Crawford, S; Solga, S F; Horska, A; Brancati, F L; Diehl, A M; Smith, S; Clark, J M

2008-01-01

To examine five available software packages for the assessment of abdominal adipose tissue with magnetic resonance imaging, compare their features and assess the reliability of measurement results. Feature evaluation and test-retest reliability of softwares (NIHImage, SliceOmatic, Analyze, HippoFat and EasyVision) used in manual, semi-automated or automated segmentation of abdominal adipose tissue. A random sample of 15 obese adults with type 2 diabetes. Axial T1-weighted spin echo images centered at vertebral bodies of L2-L3 were acquired at 1.5 T. Five software packages were evaluated (NIHImage, SliceOmatic, Analyze, HippoFat and EasyVision), comparing manual, semi-automated and automated segmentation approaches. Images were segmented into cross-sectional area (CSA), and the areas of visceral (VAT) and subcutaneous adipose tissue (SAT). Ease of learning and use and the design of the graphical user interface (GUI) were rated. Intra-observer accuracy and agreement between the software packages were calculated using intra-class correlation. Intra-class correlation coefficient was used to obtain test-retest reliability. Three of the five evaluated programs offered a semi-automated technique to segment the images based on histogram values or a user-defined threshold. One software package allowed manual delineation only. One fully automated program demonstrated the drawbacks of uncritical automated processing. The semi-automated approaches reduced variability and measurement error, and improved reproducibility. There was no significant difference in the intra-observer agreement in SAT and CSA. The VAT measurements showed significantly lower test-retest reliability. There were some differences between the software packages in qualitative aspects, such as user friendliness. Four out of five packages provided essentially the same results with respect to the inter- and intra-rater reproducibility. Our results using SliceOmatic, Analyze or NIHImage were comparable and could be used interchangeably. Newly developed fully automated approaches should be compared to one of the examined software packages.
Quantitative comparison and evaluation of software packages for assessment of abdominal adipose tissue distribution by magnetic resonance imaging

PubMed Central

Bonekamp, S; Ghosh, P; Crawford, S; Solga, SF; Horska, A; Brancati, FL; Diehl, AM; Smith, S; Clark, JM

2009-01-01

Objective To examine five available software packages for the assessment of abdominal adipose tissue with magnetic resonance imaging, compare their features and assess the reliability of measurement results. Design Feature evaluation and test–retest reliability of softwares (NIHImage, SliceOmatic, Analyze, HippoFat and EasyVision) used in manual, semi-automated or automated segmentation of abdominal adipose tissue. Subjects A random sample of 15 obese adults with type 2 diabetes. Measurements Axial T1-weighted spin echo images centered at vertebral bodies of L2–L3 were acquired at 1.5 T. Five software packages were evaluated (NIHImage, SliceOmatic, Analyze, HippoFat and EasyVision), comparing manual, semi-automated and automated segmentation approaches. Images were segmented into cross-sectional area (CSA), and the areas of visceral (VAT) and subcutaneous adipose tissue (SAT). Ease of learning and use and the design of the graphical user interface (GUI) were rated. Intra-observer accuracy and agreement between the software packages were calculated using intra-class correlation. Intra-class correlation coefficient was used to obtain test–retest reliability. Results Three of the five evaluated programs offered a semi-automated technique to segment the images based on histogram values or a user-defined threshold. One software package allowed manual delineation only. One fully automated program demonstrated the drawbacks of uncritical automated processing. The semi-automated approaches reduced variability and measurement error, and improved reproducibility. There was no significant difference in the intra-observer agreement in SAT and CSA. The VAT measurements showed significantly lower test–retest reliability. There were some differences between the software packages in qualitative aspects, such as user friendliness. Conclusion Four out of five packages provided essentially the same results with respect to the inter- and intra-rater reproducibility. Our results using SliceOmatic, Analyze or NIHImage were comparable and could be used interchangeably. Newly developed fully automated approaches should be compared to one of the examined software packages. PMID:17700582
Reliability of the Wii Balance Board in kayak.

PubMed

Vando, Stefano; Laffaye, Guillaume; Masala, Daniele; Falese, Lavinia; Padulo, Johnny

2015-01-01

the seat of the kayaker represent the principal contact point to express mechanical Energy. therefore we investigated the reliability of the Wii Balance Board measures in the kayak vs. on the ground. Bland-Altman test showed a low systematic bias on the ground (2.85%) and in kayak (-2.13%) respectively; while 0.996 for Intra-class correlation coefficient. the Wii Balance Board is useful to assess postural sway in kayak.
Inter- and intra-operator reliability and repeatability of shear wave elastography in the liver: a study in healthy volunteers.

PubMed

Hudson, John M; Milot, Laurent; Parry, Craig; Williams, Ross; Burns, Peter N

2013-06-01

This study assessed the reproducibility of shear wave elastography (SWE) in the liver of healthy volunteers. Intra- and inter-operator reliability and repeatability were quantified in three different liver segments in a sample of 15 subjects, scanned during four independent sessions (two scans on day 1, two scans 1 wk later) by two operators. A total of 1440 measurements were made. Reproducibility was assessed using the intra-class correlation coefficient (ICC) and a repeated measures analysis of variance. The shear wave speed was measured and used to estimate Young's modulus using the Supersonics Imagine Aixplorer. The median Young's modulus measured through the inter-costal space was 5.55 ± 0.74 kPa. The intra-operator reliability was better for same-day evaluations (ICC = 0.91) than the inter-operator reliability (ICC = 0.78). Intra-observer agreement decreased when scans were repeated on a different day. Inter-session repeatability was between 3.3% and 9.9% for intra-day repeated scans, compared with to 6.5%-12% for inter-day repeated scans. No significant difference was observed in subjects with a body mass index greater or less than 25 kg/m(2). Copyright © 2013 World Federation for Ultrasound in Medicine & Biology. Published by Elsevier Inc. All rights reserved.
Inter- and intra-rater reliability and agreement in determining subcutaneous tumour margins in dogs.

PubMed

Ranganathan, B; Milovancev, M; Leeper, H; Townsend, K L; Bracha, S; Curran, K

2018-03-01

The objective of this prospective study was to evaluate agreement and reliability of calliper-based measurements of locally invasive subcutaneous malignant tumours in dogs. Four raters measured the longest diameter of 12 subcutaneous tumours (7 soft tissue sarcomas and 5 mast cell tumours) from 11 client-owned dogs during 3 randomized, blinded measurement trials, both pre- and post-sedation. Inter- and intra-rater reliability was evaluated using intra-class correlation coefficient (ICC) and agreement was evaluated using Bland-Altman plots. Inter- and intra-rater reliability was good (ICC range of 0.8694-0.89520) and excellent (ICC range of 0.9720-0.9966), respectively. For agreement calculations, an a priori clinically relevant limit of agreement of 10 mm was set. Inter- and intra-rater agreement was unacceptable with inter-rater limits of agreement ranging from 15.9 to 55.6 mm and intra-rater limit of agreement ranging from 11.9 to 28.1 mm. Review of the measurement trial photographs revealed that calliper orientation changes were frequent, occurring in 9/12 (75%) and 8/12 (67%) pre- and post-sedation cases. No significant correlation was found between inter-rater measurement standard deviations and calliper orientation changes or dog body condition score. These findings suggest veterinarians may have poor agreement in determining the gross edge of tumours, which is expected to introduce bias and inconsistency in tumour staging, assessing response to therapy, and surgical margin planning. Due to the potential consequences for veterinary cancer patients, future studies are needed to validate the present findings. © 2018 John Wiley & Sons Ltd.
The Reliability and Validity of the Computerized Double Inclinometer in Measuring Lumbar Mobility

PubMed Central

MacDermid, Joy Christine; Arumugam, Vanitha; Vincent, Joshua Israel; Carroll, Krista L

2014-01-01

Study Design : Repeated measures reliability/validity study. Objectives : To determine the concurrent validity, test-retest, inter-rater and intra-rater reliability of lumbar flexion and extension measurements using the Tracker M.E. computerized dual inclinometer (CDI) in comparison to the modified-modified Schober (MMS) Summary of Background : Numerous studies have evaluated the reliability and validity of the various methods of measuring spinal motion, but the results are inconsistent. Differences in equipment and techniques make it difficult to correlate results. Methods : Twenty subjects with back pain and twenty without back pain were selected through convenience sampling. Two examiners measured sagittal plane lumbar range of motion for each subject. Two separate tests with the CDI and one test with the MMS were conducted. Each test consisted of three trials. Instrument and examiner order was randomly assigned. Intra-class correlations (ICCs 2, 2 and 2, 2) and Pearson correlation coefficients (r) were used to calculate reliability and concurrent validity respectively. Results : Intra-trial reliability was high to very high for both the CDI (ICCs 0.85 - 0.96) and MMS (ICCs 0.84 - 0.98). However, the reliability was poor to moderate, when the CDI unit had to be repositioned either by the same rate (ICCs 0.16 - 0.59) or a different rater (ICCs 0.45 - 0.52). Inter-rater reliability for the MMS was moderate to high (ICCs 0.75 - 0.82) which bettered the moderate correlation obtained for the CDI (ICCs 0.45 - 0.52). Correlations between the CDI and MMS were poor for flexion (0.32; p<0.05) and poor to moderate (-0.42 - -0.51; p<0.05) for extension measurements. Conclusion : When using the CDI, an average of subsequent tests is required to obtain moderate reliability. The MMS was highly reliable than the CDI. The MMS and the CDI measure lumbar movement on a different metric that are not highly related to each other. PMID:25352928
Validation of the French version of the Burn Specific Health Scale-Brief (BSHS-B) questionnaire.

PubMed

Gandolfi, S; Auquit-Auckbur, I; Panunzi, S; Mici, E; Grolleau, J-L; Chaput, B

2016-11-01

The Burn Specific Health Scale-Brief questionnaire is a widely validated tool for estimating the health related quality of life and for assessing the best multidisciplinary management of burn patients. The aim of this study was to translate the BSHS-B into French and to investigate its reliability and validity. According to the procedure proposed by the Scientific Advisory Committee of the Medical Outcomes Trust, the Burn Specific Health Scale-Brief (BSHS-B) was translated from the English version into French. In order to test the reliability of the French version of the BSHS-B, 53 burn patients French speakers completed the BSHS-B and SF-36 questionnaires from two to four years after burn. Ten of them have been re-tested at 6 months after the first evaluation. To evaluate clinical utility of the BSHS-F, internal consistency, construct validity (using SF-36) and stability in time were assessed using Cronbach's alpha statistic, Spearman rank test, and intra-class correlation coefficient respectively. The French version of the BSHS-B Cronbach's alpha coefficient was 0.93 and was >0.80 for all the sub-domains. French version of the BSHS-B and the SF-36 were positively correlated, all the associations were statistically significant (p<0.01). Intra-class correlation coefficients for test-retest ranged between 0.95 and 0.99 for the sub-domains. The intra-class correlation coefficient (ICC) for the total score was 0.98. The French version of the BSHS-B shows a robust rate of internal consistency, construct validity and stability in time, supporting its application in routine clinical practice as well as in international studies. Copyright © 2016 Elsevier Ltd and ISBI. All rights reserved.

Inter-Rater Reliability and Intra-Rater Reliability of Assessing the 2-Minute Push-Up Test.

PubMed

Fielitz, Lynn; Coelho, Jeffrey; Horne, Thomas; Brechue, William

2016-02-01

The purpose of this study was to assess inter-rater reliability and intra-rater reliability of the 2-minute, 90° push-up test as utilized in the Army Physical Fitness Test. Analysis of rater assessment reliability included both total score agreement and agreement across individual push-up repetitions. This study utilized 8 Raters who assessed 15 different videotaped push-up performances over 4 iterations separated by a minimum of 1 week. The 15 push-up participants were videotaped during the semiannual Army Physical Fitness Test. Each Rater randomly viewed the 15 push-up and verbally responded with a "yes" or "no" to each push-up repetition. The data generated were analyzed using the Pearson product-moment correlation as well as the kappa, modified kappa and the intra-class correlation coefficient (3,1). An attribute agreement analysis was conducted to determine the percent of inter-rater and intra-rater agreement across individual push-ups.The results indicated that Raters varied a great deal in assessing push-ups. Over the 4 trials of 15 participants, the overall scores of the Raters varied between 3.0 and 35.7 push-ups. Post hoc comparisons found that there was significant increase in the grand mean of push-ups from trials 1-3 to trial 4 (p < 0.05). Also, there was a significant difference among raters over the 4 trials (p < 0.05). Pearson correlation coefficients for inter-rater and intra-rater reliability identified inter-rater reliability coefficients were between 0.10 and 0.97. Intra-rater coefficients were between 0.48 and 0.99. Intra-rater agreement for individual push-up repetitions ranged from 41.8% to 84.8%. The results indicated that the raters failed to assess the same push-up repetition with the same score (below 70% agreement) as well as failed to agree when viewed between raters (29%). Interestingly, as previously mentioned, scores on trial 4 increased significantly which might have been caused by rater drift or that the Raters did not maintain the push-up standard over the trials. It does appear that the final push-up scores received by each participant was a close approximation of actual performance (within 65%) but when assessing physical performance for retention in the Army, a more reliable test might be considered. Reprint & Copyright © 2016 Association of Military Surgeons of the U.S.
Concurrent validity of the Alberta Infant Motor Scale to detect delayed gross motor development in preterm infants: A comparative study with the Bayley III.

PubMed

Albuquerque, Plínio Luna de; Guerra, Miriam Queiroz de Farias; Lima, Marília de Carvalho; Eickmann, Sophie Helena

2017-05-24

To investigate the concurrent validity of AIMS in relation to the gross motor subtest of the Bayley Scale III/GM in preterm infants. A total of 159 gross motor development assessments were performed with the AIMS and Bayley-III/GM. Linear regression was used to assess the correlation between AIMS and Bayley-III/GM scores. The intra-class correlation coefficient (ICC) and the Bland-Altman plot were used to analyze intra- and inter-rater reliability. There was a prevalence of delayed gross motor development of 20.8% according to the Bayley-III/GM, and 11.9% for the 5th percentile and 21.4% for the 10th percentile of AIMS. A good correlation of AIMS with Bayley-III/GM scores and intra- and inter-rater reliability was encountered in this study. AIMS proved very capable of detecting delayed gross motor development in preterm infants when compared with the Bayley-III/GM. The 10th percentile of AIMS provided the best combination of indicators, with greater specificity.
Manual muscle testing and hand-held dynamometry in people with inflammatory myopathy: An intra- and interrater reliability and validity study

PubMed Central

Baschung Pfister, Pierrette; Sterkele, Iris; Maurer, Britta; de Bie, Rob A.; Knols, Ruud H.

2018-01-01

Manual muscle testing (MMT) and hand-held dynamometry (HHD) are commonly used in people with inflammatory myopathy (IM), but their clinimetric properties have not yet been sufficiently studied. To evaluate the reliability and validity of MMT and HHD, maximum isometric strength was measured in eight muscle groups across three measurement events. To evaluate reliability of HHD, intra-class correlation coefficients (ICC), the standard error of measurements (SEM) and smallest detectable changes (SDC) were calculated. To measure reliability of MMT linear Cohen`s Kappa was computed for single muscle groups and ICC for total score. Additionally, correlations between MMT8 and HHD were evaluated with Spearman Correlation Coefficients. Fifty people with myositis (56±14 years, 76% female) were included in the study. Intra-and interrater reliability of HHD yielded excellent ICCs (0.75–0.97) for all muscle groups, except for interrater reliability of ankle extension (0.61). The corresponding SEMs% ranged from 8 to 28% and the SDCs% from 23 to 65%. MMT8 total score revealed excellent intra-and interrater reliability (ICC>0.9). Intrarater reliability of single muscle groups was substantial for shoulder and hip abduction, elbow and neck flexion, and hip extension (0.64–0.69); moderate for wrist (0.53) and knee extension (0.49) and fair for ankle extension (0.35). Interrater reliability was moderate for neck flexion (0.54) and hip abduction (0.44); fair for shoulder abduction, elbow flexion, wrist and ankle extension (0.20–0.33); and slight for knee extension (0.08). Correlations between the two tests were low for wrist, knee, ankle, and hip extension; moderate for elbow flexion, neck flexion and hip abduction; and good for shoulder abduction. In conclusion, the MMT8 total score is a reliable assessment to consider general muscle weakness in people with myositis but not for single muscle groups. In contrast, our results confirm that HHD can be recommended to evaluate strength of single muscle groups. PMID:29596450
Reliability and criterion validity of measurements using a smart phone-based measurement tool for the transverse rotation angle of the pelvis during single-leg lifting.

PubMed

Jung, Sung-Hoon; Kwon, Oh-Yun; Jeon, In-Cheol; Hwang, Ui-Jae; Weon, Jong-Hyuck

2018-01-01

The purposes of this study were to determine the intra-rater test-retest reliability of a smart phone-based measurement tool (SBMT) and a three-dimensional (3D) motion analysis system for measuring the transverse rotation angle of the pelvis during single-leg lifting (SLL) and the criterion validity of the transverse rotation angle of the pelvis measurement using SBMT compared with a 3D motion analysis system (3DMAS). Seventeen healthy volunteers performed SLL with their dominant leg without bending the knee until they reached a target placed 20 cm above the table. This study used a 3DMAS, considered the gold standard, to measure the transverse rotation angle of the pelvis to assess the criterion validity of the SBMT measurement. Intra-rater test-retest reliability was determined using the SBMT and 3DMAS using intra-class correlation coefficient (ICC) [3,1] values. The criterion validity of the SBMT was assessed with ICC [3,1] values. Both the 3DMAS (ICC = 0.77) and SBMT (ICC = 0.83) showed excellent intra-rater test-retest reliability in the measurement of the transverse rotation angle of the pelvis during SLL in a supine position. Moreover, the SBMT showed an excellent correlation with the 3DMAS (ICC = 0.99). Measurement of the transverse rotation angle of the pelvis using the SBMT showed excellent reliability and criterion validity compared with the 3DMAS.
The reliability of the Adelaide in-shoe foot model.

PubMed

Bishop, Chris; Hillier, Susan; Thewlis, Dominic

2017-07-01

Understanding the biomechanics of the foot is essential for many areas of research and clinical practice such as orthotic interventions and footwear development. Despite the widespread attention paid to the biomechanics of the foot during gait, what largely remains unknown is how the foot moves inside the shoe. This study investigated the reliability of the Adelaide In-Shoe Foot Model, which was designed to quantify in-shoe foot kinematics and kinetics during walking. Intra-rater reliability was assessed in 30 participants over five walking trials whilst wearing shoes during two data collection sessions, separated by one week. Sufficient reliability for use was interpreted as a coefficient of multiple correlation and intra-class correlation coefficient of >0.61. Inter-rater reliability was investigated separately in a second sample of 10 adults by two researchers with experience in applying markers for the purpose of motion analysis. The results indicated good consistency in waveform estimation for most kinematic and kinetic data, as well as good inter-and intra-rater reliability. The exception is the peak medial ground reaction force, the minimum abduction angle and the peak abduction/adduction external hindfoot joint moments which resulted in less than acceptable repeatability. Based on our results, the Adelaide in-shoe foot model can be used with confidence for 24 commonly measured biomechanical variables during shod walking. Copyright © 2017 Elsevier B.V. All rights reserved.
Cross-Cultural and Psychometric Properties Assessment of the Exercise Self-Efficacy Scale in Individuals with Spinal Cord Injury.

PubMed

Pisconti, Fernando; Mahmoud Smaili Santos, Suhaila; Lopes, Josiane; Rosa Cardoso, Jefferson; Lopes Lavado, Edson

2017-11-29

The Exercise Self-Efficacy scale (ESES) is a reliable measure, in the English language, of exercise self-efficacy in individuals with spinal cord injury. The aim of this study was to culturally adjust and validate the Exercise Self-Efficacy scale in the Portuguese language. The Exercise Self-Efficacy scale was applied to 76 subjects, with three-month intervals (three applications in total). The reliability was appraised using the intra-class correlation coefficient and Bland-Altman methods, and the internal consistency was evaluated using Cronbach´s alpha. The Exercise Self-Efficacy scale was correlated with the domains of the Quality of life Questionnaire SF-36 and Functional Independence Measure and tested using the Spearman rho coefficient. The Exercise Self-Efficacy scale-Brazil presented good internal consistency (alpha 1 = 0.856; alpha 2 = 0.855; alpha 3 = 0.822) and high reliability in the test-retest (intra-class correlation coefficient = 0.97). There was a strong correlation between the Exercise Self-Efficacy scale-Brazil and the SF-36 only in the functional capacity domain (rho = 0.708). There were no changes in Exercise Self-Efficacy scale-Brazil scores between the three applications (p = 0.796). The validation of the Exercise Self-Efficacy scale questionnaire permits the assessor to use it reliably in Portuguese speaking countries, since it is the first instrument measuring self-efficacy specifically during exercises in individuals with spinal cord injury. Furthermore, the questionnaire can be used as an instrument to verify the effectiveness of interventions that use exercise as an outcome. The results of the Brazilian version of the Exercise Self-Efficacy scale support its use as a reliable and valid measurement of exercise self-efficacy for this population.
Reliability of the Test of Integrated Language and Literacy Skills (TILLS).

PubMed

Mailend, Marja-Liisa; Plante, Elena; Anderson, Michele A; Applegate, E Brooks; Nelson, Nickola W

2016-07-01

As new standardized tests become commercially available, it is critical that clinicians have access to the information about a test's psychometric properties, including aspects of reliability. The purpose of the three studies reported in this article was to investigate the reliability of a new test, the Test of Integrated Language and Literacy Skills (TILLS), with consideration of both internal and external sources of measurement error. The TILLS was administered to children aged 6;0-18;11 years. The participants varied in terms of their language and literacy skills and included children with typical language development as well as those diagnosed with language or learning disability. The sample of children also varied in terms of their racial and socioeconomic backgrounds. Study 1 (N = 1056) assessed the internal consistency of TILLS calculating the coefficient omega for each subtest. Study 2 (N = 103) and Study 3 (N = 39) used the intra-class correlation coefficients to report on test-retest and inter-rater reliability respectively. The results indicate strong internal consistency and inter-rater reliability for all subtests of TILLS. The test-retest reliability was strong for all but one subtest, for which the intra-class correlation coefficient was in the acceptable range. This article provides clinicians with essential scientific information that supports the internal and external reliability of a new test of oral and written language skills, the TILLS. Information about reliability is critical for guiding the selection of an appropriate diagnostic tool amongst a number of options. © 2016 Royal College of Speech and Language Therapists.
Intra-rater reliability of electromyographic recordings and subjective evaluation of neck muscle fatigue among helicopter pilots.

PubMed

Thuresson, Marcus; Ang, Björn; Linder, Jan; Harms-Ringdahl, Karin

2005-06-01

The aim was to evaluate the reliability of a method of measuring neck muscle fatigue among helicopter pilots. Surface EMG from three areas in the neck region, bilaterally, was recorded among 10 male helicopter pilots while they were performing isometric contractions in flexion and extension for 45 s, sustaining a force representing 75% of maximum strength in a seated position. Perceived fatigue was rated using the Borg CR-10 scale. The test was repeated twice the first day and then two additional times with one-week intervals. Variables analyzed were the slope of the median frequency change, the normalized slope, and the ratings after 15, 30 and 45 s; and also the initial median frequency (IMDF). The intra-class correlation (ICC) and the measurement error (S(w)), intra- and inter-day were calculated statistically. The best reliability for the slope was found for the 45 s intra-day analysis taking all measurements into account (ICC 0.65-0.83). The reliability after 30 s was poorer but still acceptable (ICC 0.52-0.71). For the subjective ratings, the highest reliability was found after 30 s inter-day (ICC 0.86-0.88). IMDF showed generally high reliability for the intra-day analyses (ICC 0.63-0.80). The method is reliable for use in further research. Since performing a contraction of 75% of maximum was quite strenuous, we recommend that the protocol be shortened to 30 s.
The reliability of dual-energy X-ray absorptiometry measurements of bone mineral density in the metatarsals.

PubMed

Fuller, Joel T; Archer, Jane; Buckley, Jonathan D; Tsiros, Margarita D; Thewlis, Dominic

2016-01-01

To investigate the reliability of a simple, efficient technique for measuring bone mineral density (BMD) in the metatarsals using dual-energy X-ray absorptiometry (DXA). BMD of the right foot of 32 trained male distance runners was measured using a DXA scanner with the foot in the plantar position. Separate regions of interest (ROI) were used to assess the BMD of each metatarsal shaft (1st-5th) for each participant. ROI analysis was repeated by the same investigator to determine within-scan intra-rater reliability and by a different investigator to determine within-scan inter-rater reliability. Repeat DXA scans were undertaken for ten participants to assess between-scan intra-rater reliability. Assessment of BMD was consistently most reliable for the first metatarsal across all domains of reliability assessed (intra-class correlation coefficient [ICC] ≥0.97; coefficient of variation [CV] ≤1.5%; limits of agreement [LOA] ≤4.2%). Reasonable levels of intra-rater reliability were also achieved for the second and fifth metatarsals (ICC ≥0.90; CV ≤4.2%; LOA ≤11.9%). Poorer levels of reliability were demonstrated for the third (ICC ≥0.64; CV ≤8.2%; LOA ≤23.6%) and fourth metatarsals (ICC ≥0.67; CV ≤9.6%; LOA ≤27.5%). BMD was greatest in the first and second metatarsals (P < 0.01). Reliable measurements of BMD were achieved for the first, second and fifth metatarsals.
Reliability of the Wii Balance Board in kayak

PubMed Central

Vando, Stefano; Laffaye, Guillaume; Masala, Daniele; Falese, Lavinia; Padulo, Johnny

2015-01-01

Summary Background: the seat of the kayaker represent the principal contact point to express mechanical Energy. Methods: therefore we investigated the reliability of the Wii Balance Board measures in the kayak vs. on the ground. Results: Bland-Altman test showed a low systematic bias on the ground (2.85%) and in kayak (−2.13%) respectively; while 0.996 for Intra-class correlation coefficient. Conclusion: the Wii Balance Board is useful to assess postural sway in kayak. PMID:25878987
Towards an Operational Definition of Clinical Competency in Pharmacy

PubMed Central

2015-01-01

Objective. To estimate the inter-rater reliability and accuracy of ratings of competence in student pharmacist/patient clinical interactions as depicted in videotaped simulations and to compare expert panelist and typical preceptor ratings of those interactions. Methods. This study used a multifactorial experimental design to estimate inter-rater reliability and accuracy of preceptors’ assessment of student performance in clinical simulations. The study protocol used nine 5-10 minute video vignettes portraying different levels of competency in student performance in simulated clinical interactions. Intra-Class Correlation (ICC) was used to calculate inter-rater reliability and Fisher exact test was used to compare differences in distribution of scores between expert and nonexpert assessments. Results. Preceptors (n=42) across 5 states assessed the simulated performances. Intra-Class Correlation estimates were higher for 3 nonrandomized video simulations compared to the 6 randomized simulations. Preceptors more readily identified high and low student performances compared to satisfactory performances. In nearly two-thirds of the rating opportunities, a higher proportion of expert panelists than preceptors rated the student performance correctly (18 of 27 scenarios). Conclusion. Valid and reliable assessments are critically important because they affect student grades and formative student feedback. Study results indicate the need for pharmacy preceptor training in performance assessment. The process demonstrated in this study can be used to establish minimum preceptor benchmarks for future national training programs. PMID:26089563
Trunk-acceleration based assessment of gait parameters in older persons: a comparison of reliability and validity of four inverted pendulum based estimations.

PubMed

Zijlstra, Agnes; Zijlstra, Wiebren

2013-09-01

Inverted pendulum (IP) models of human walking allow for wearable motion-sensor based estimations of spatio-temporal gait parameters during unconstrained walking in daily-life conditions. At present it is unclear to what extent different IP based estimations yield different results, and reliability and validity have not been investigated in older persons without a specific medical condition. The aim of this study was to compare reliability and validity of four different IP based estimations of mean step length in independent-living older persons. Participants were assessed twice and walked at different speeds while wearing a tri-axial accelerometer at the lower back. For all step-length estimators, test-retest intra-class correlations approached or were above 0.90. Intra-class correlations with reference step length were above 0.92 with a mean error of 0.0 cm when (1) multiplying the estimated center-of-mass displacement during a step by an individual correction factor in a simple IP model, or (2) adding an individual constant for bipedal stance displacement to the estimated displacement during single stance in a 2-phase IP model. When applying generic corrections or constants in all subjects (i.e. multiplication by 1.25, or adding 75% of foot length), correlations were above 0.75 with a mean error of respectively 2.0 and 1.2 cm. Although the results indicate that an individual adjustment of the IP models provides better estimations of mean step length, the ease of a generic adjustment can be favored when merely evaluating intra-individual differences. Further studies should determine the validity of these IP based estimations for assessing gait in daily life. Copyright © 2013 Elsevier B.V. All rights reserved.
Excellent Intra and Inter-Observer Reproducibility of Wrist Circumference Measurements in Obese Children and Adolescents

PubMed Central

Campagna, Giuseppe; Zampetti, Simona; Gallozzi, Alessia; Giansanti, Sara; Chiesa, Claudio; Pacifico, Lucia; Buzzetti, Raffaella

2016-01-01

In a previous study, we found that wrist circumference, in particular its bone component, was associated with insulin resistance in a population of overweight/obese children. The aim of the present study was to evaluate the intra- and inter-operator variability in wrist circumference measurement in a population of obese children and adolescents. One hundred and two (54 male and 48 female) obese children and adolescents were consecutively enrolled. In all subjects wrist circumferences were measured by two different operators two times to assess intra- and inter-operator variability. Statistical analysis was performed using SAS v.9.4 and JMP v.12. Measurements of wrist circumference showed excellent inter-operator reliability with Intra class Correlation Coefficients (ICC) of 0.96 and ICC of 0.97 for the first and the second measurement, respectively. The intra-operator reliability was, also, very strong with a Concordance Correlation Coefficient (CCC) of 0.98 for both operators. The high reproducibility demonstrated in our results suggests that wrist circumference measurement, being safe, non-invasive and repeatable can be easily used in out-patient settings to identify youths with increased risk of insulin-resistance. This can avoid testing the entire population of overweight/obese children for insulin resistance parameters. PMID:27294398
Inter-arch digital model vs. manual cast measurements: Accuracy and reliability.

PubMed

Kiviahde, Heikki; Bukovac, Lea; Jussila, Päivi; Pesonen, Paula; Sipilä, Kirsi; Raustia, Aune; Pirttiniemi, Pertti

2017-06-28

The purpose of this study was to evaluate the accuracy and reliability of inter-arch measurements using digital dental models and conventional dental casts. Thirty sets of dental casts with permanent dentition were examined. Manual measurements were done with a digital caliper directly on the dental casts, and digital measurements were made on 3D models by two independent examiners. Intra-class correlation coefficients (ICC), a paired sample t-test or Wilcoxon signed-rank test, and Bland-Altman plots were used to evaluate intra- and inter-examiner error and to determine the accuracy and reliability of the measurements. The ICC values were generally good for manual and excellent for digital measurements. The Bland-Altman plots of all the measurements showed good agreement between the manual and digital methods and excellent inter-examiner agreement using the digital method. Inter-arch occlusal measurements on digital models are accurate and reliable and are superior to manual measurements.
Cross-cultural Adaptation of the "Functional Activities Questionnaire - FAQ" for use in Brazil

PubMed Central

Sanchez, Maria Angélica dos Santos; Correa, Pricila Cristina Ribeiro; Lourenço, Roberto Alves

2011-01-01

Objective The aim of this paper was to present the results of the first stage of cross-cultural adaptation of the Functional Activities Questionnaire (FAQ). Methods The tool was subjected to translation and re-translation, and the test-retest reliability of a proposed version for use in Brazil was analyzed. Results Of the 548 questionnaire respondents, a convenience sample of 68 informants was selected for retesting. Internal consistency was measured by Cronbach's alpha (0.95) while test-retest reliability was assessed using intra-class correlation (0.97). The findings have shown that FAQ is brief - averaging seven minutes to apply, easily understood and has good intra-rater test-retest reliability. Conclusion Our results suggest this adapted version of the FAQ is a reliable and stable tool which may be useful for assessing function in Brazilian elderly. Notwithstanding, the version should be subjected to further analysis with the aim of reaching functional equivalence. PMID:29213759
Arm cranking versus wheelchair propulsion for testing aerobic fitness in children with spina bifida who are wheelchair dependent.

PubMed

Bloemen, Manon A T; de Groot, Janke F; Backx, Frank J G; Westerveld, Rosalyne A; Takken, Tim

2015-05-01

To determine the best test performance and feasibility using a Graded Arm Cranking Test vs a Graded Wheelchair Propulsion Test in young people with spina bifida who use a wheelchair, and to determine the reliability of the best test. Validity and reliability study. Young people with spina bifida who use a wheelchair. Physiological responses were measured during a Graded Arm Cranking Test and a Graded Wheelchair Propulsion Test using a heart rate monitor and calibrated mobile gas analysis system (Cortex Metamax). For validity, peak oxygen uptake (VO2peak) and peak heart rate (HRpeak) were compared using paired t-tests. For reliability, the intra-class correlation coefficients, standard error of measurement, and standard detectable change were calculated. VO2peak and HRpeak were higher during wheelchair propulsion compared with arm cranking (23.1 vs 19.5 ml/kg/min, p = 0.11; 165 vs 150 beats/min, p < 0.05). Reliability of wheelchair propulsion showed high intra-class correlation coefficients (ICCs) for both VO2peak (ICC = 0.93) and HRpeak (ICC = 0.90). This pilot study shows higher HRpeak and a tendency to higher VO2peak in young people with spina bifida who are using a wheelchair when tested during wheelchair propulsion compared with arm cranking. Wheelchair propulsion showed good reliability. We recommend performing a wheelchair propulsion test for aerobic fitness testing in this population.
Is laser speckle contrast analysis (LASCA) the new kid on the block in systemic sclerosis? A systematic literature review and pilot study to evaluate reliability of LASCA to measure peripheral blood perfusion in scleroderma patients.

PubMed

Cutolo, Maurizio; Vanhaecke, Amber; Ruaro, Barbara; Deschepper, Ellen; Ickinger, Claudia; Melsens, Karin; Piette, Yves; Trombetta, Amelia Chiara; De Keyser, Filip; Smith, Vanessa

2018-06-06

A reliable tool to evaluate flow is paramount in systemic sclerosis (SSc). We describe herein on the one hand a systematic literature review on the reliability of laser speckle contrast analysis (LASCA) to measure the peripheral blood perfusion (PBP) in SSc and perform an additional pilot study, investigating the intra- and inter-rater reliability of LASCA. A systematic search was performed in 3 electronic databases, according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. In the pilot study, 30 SSc patients and 30 healthy subjects (HS) underwent LASCA assessment. Intra-rater reliability was assessed by having a first anchor rater performing the measurements at 2 time-points and inter-rater reliability by having the anchor rater and a team of second raters performing the measurements in 15 SSc and 30 HS. The measurements were repeated with a second anchor rater in the other 15 SSc patients, as external validation. Only 1 of the 14 records of interest identified through the systematic search was included in the final analysis. In the additional pilot study: intra-class correlation coefficient (ICC) for intra-rater reliability of the first anchor rater was 0.95 in SSc and 0.93 in HS, the ICC for inter-rater reliability was 0.97 in SSc and 0.93 in HS. Intra- and inter-rater reliability of the second anchor rater was 0.78 and 0.87. The identified literature regarding the reliability of LASCA measurements reports good to excellent inter-rater agreement. This very pilot study could confirm the reliability of LASCA measurements with good to excellent inter-rater agreement and found additionally good to excellent intra-rater reliability. Furthermore, similar results were found in the external validation. Copyright © 2018. Published by Elsevier B.V.
Radiologic analysis of hindfoot alignment: Comparison of Méary, long axial, and hindfoot alignment views.

PubMed

Neri, T; Barthelemy, R; Tourné, Y

2017-12-01

Among radiographic views available for assessing hindfoot alignment, the antero-posterior weight-bearing view with metal cerclage of the hindfoot (Méary view) is the most widely used in France. Internationally, the long axial view (LAV) and hindfoot alignment view (HAV) are used also. The objective of this study was to compare the reliability of these three views. The Méary view with cerclage of the hindfoot is as reliable as the LAV and HAV for assessing hindfoot alignment. All three views were obtained in each of 22 prospectively included patients. Intra-observer and inter-observer reliabilities were assessed by having two observers collect the radiographic measurements then computing the intra-class correlation coefficients (ICCs). The intra-observer and inter-observer ICCs were 0.956 and 0.988 with the Méary view, 0.990 and 0.765 with the HAV, and 0.997 and 0.991 with the LAV, respectively. Correlations were far stronger between the LAV and HAV than between each of these and the Méary view. Compared to the LAV and HAV, the Méary view indicated a greater degree of hindfoot valgus. Intra-observer reliability was excellent with both the LAV and HAV, whereas inter-observer reliability was better with the LAV. Excellent reliability was also obtained with the Méary view. Combining the Méary view to obtain a radiographic image of the clinical deformity with the LAV to measure the angular deviation of the hindfoot axis may be useful when assessing hindfoot malalignment. A comparison of the three views in a larger population is needed before clinical recommendations can be made. II, prospective study. Copyright © 2017 Elsevier Masson SAS. All rights reserved.
Reliability and Validity of a New Test of Agility and Skill for Female Amateur Soccer Players

PubMed Central

Kutlu, Mehmet; Yapici, Hakan; Yilmaz, Abdullah

2017-01-01

Abstract The aim of this study was to evaluate the Agility and Skill Test, which had been recently developed to assess agility and skill in female athletes. Following a 10 min warm-up, two trials to test the reliability and validity of the test were conducted one week apart. Measurements were collected to compare soccer players’ physical performance in a 20 m sprint, a T-Drill test, the Illinois Agility Run Test, change-of-direction and acceleration, as well as agility and skill. All tests were completed following the same order. Thirty-four amateur female soccer players were recruited (age = 20.8 ± 1.9 years; body height = 166 ± 6.9 cm; body mass = 55.5 ± 5.8 kg). To determine the reliability and usefulness of these tests, paired sample t-tests, intra-class correlation coefficients, typical error, coefficient of variation, and differences between the typical error and smallest worthwhile change statistics were computed. Test results showed no significant differences between the two sessions (p > 0.01). There were higher intra-class correlations between the test and retest values (r = 0.94–0.99) for all tests. Typical error values were below the smallest worthwhile change, indicating ‘good’ usefulness for these tests. A near perfect Pearson correlation between the Agility and Skill Test (r = 0.98) was found, and there were moderate-to-large levels of correlation between the Agility and Skill Test and other measures (r = 0.37 to r = 0.56). The results of this study suggest that the Agility and Skill Test is a reliable and valid test for female soccer players and has significant value for assessing the integrative agility and skill capability of soccer players. PMID:28469760
Post-traumatic subtalar osteoarthritis: which grading system should we use?

PubMed

de Muinck Keizer, Robert-Jan O; Backes, Manouk; Dingemans, Siem A; Goslings, J Carel; Schepers, Tim

2016-09-01

To assess and compare post-traumatic osteoarthritis following intra-articular calcaneal fractures, one must have a reliable grading system that consistently grades the post-traumatic changes of the joint. A reliable grading system aids in the communication between treating physicians and improves the interpretation of research. To date, there is no consensus on what grading system to use in the evaluation of post-traumatic subtalar osteoarthritis. The objective of this study was to determine and compare the inter- and intra-rater reliability of two grading systems for post-traumatic subtalar osteoarthritis. Four observers evaluated 50 calcaneal fractures at least one year after trauma on conventional oblique lateral, internally and externally rotated views, and graded post-traumatic subtalar osteoarthritis using the Kellgren and Lawrence Grading Scale (KLGS) and the Paley Grading System (PGS). Inter- and intra-rater reliability were calculated and compared. The inter-rater reliability showed an intra-class correlation (ICC) of 0.54 (95 % CI 0.40-0.67) for the KLGS and an ICC of 0.41 (95 % CI 0.26 - 0.57) for the PGS. This difference was not statistically significant. The intra-rater reliability showed a mean weighted kappa of 0.62 for both the KLGS and the PGS. There is no statistically significant difference in reliability between the Kellgren and Lawrence Grading System (KLGS) and the Paley Grading System (PGS). The PGS allows for an easy two-step approach making it easy for everyday clinical purposes. For research purposes however, the more detailed and widely used KLGS seems preferable.

Reliability of pulse waveform separation analysis: effects of posture and fasting.

PubMed

Stoner, Lee; Credeur, Daniel; Fryer, Simon; Faulkner, James; Lambrick, Danielle; Gibbs, Bethany Barone

2017-03-01

Oscillometric pulse wave analysis devices enable, with relative simplicity and objectivity, the measurement of central hemodynamic parameters. The important parameters are central blood pressures and indices of arterial wave reflection, including wave separation analysis (backward pressure component Pb and reflection magnitude). This study sought to determine whether the measurement precision (between-day reliability) of Pb and reflection magnitude: exceeds the criterion for acceptable reliability; and is affected by posture (supine, seated) and fasting state. Twenty healthy adults (50% female, 27.9 years, 24.2 kg/m) were tested on six different mornings: 3 days fasted, 3 days nonfasted condition. On each occasion, participants were tested in supine and seated postures. Oscillometric pressure waveforms were recorded on the left upper arm. The criterion intra-class correlation coefficient value of 0.75 was exceeded for Pb (0.76) and reflection magnitude (0.77) when participants were assessed under the combined supine-fasted condition. The intra-class correlation coefficient was lowest for Pb in seated-nonfasted condition (0.57), and lowest for reflection magnitude in the seated-fasted condition (0.56). For Pb, the smallest detectible change that must be exceeded in order for a significant change to occur in an individual was 2.5 mmHg, and for reflection magnitude, the smallest detectable change was 8.5%. Assessments of Pb and reflection magnitude are as follows: exceed the criterion for acceptable reliability; and are most reliable when participants are fasted in a supine position. The demonstrated reliability suggests sufficient precision to detect clinically meaningful changes in reflection magnitude and Pb.
Patient Assessment of Constipation Quality of Life Questionnaire: Translation, Cultural Adaptation, Reliability, and Validity of the Persian Version.

PubMed

Nikjooy, Afsaneh; Jafari, Hassan; Saba, Maryam A; Ebrahimi, Naghmeh; Mirzaei, Rezvan

2018-05-01

The Patient Assessment of Constipation Quality of Life (PAC-QOL) questionnaire is the most validated and the most specific tool for measuring the quality of life of patients with constipation. Over 120 million people live in countries whose official language is Persian. There is no reported Persian version of the PAC-QOL questionnaire yet. The aim of this study was to translate and culturally adapt the PAC-QOL questionnaire and to assess its reliability and validity among Persian patients with chronic constipation. Following the translation and cultural adaptation of the PAC-QOL questionnaire to Persian, 100 patients (mean±SD age=40.51±13.67) with constipation were recruited for validity measurement and 20 patients were re-examined for reliability. Content validity was assessed based on the opinions of an expert committee and the floor/ceiling effect. Construct validity was evaluated according to the hypothesis test. The SF-36 questionnaire was used for concurrent criterion validity, intra-class correlation coefficient for reliability, and Cronbach's alpha for internal consistency. The content validity of the PAC-QOL questionnaire was proven, and there was no floor/ceiling effect. Construct validity also was confirmed based on the hypothesis test. The overall Cronbach's alpha of the PAC-QOL questionnaire was 0.92 (range=0.72-0.92), and the overall intra-class correlation coefficient of the questionnaire was 0.88 (range=0.69-0.87). The correlation between the SF-36 and PAC-QOL questionnaires was moderate. The Persian version of the PAC-QOL questionnaire demonstrated good validity and reliability properties in chronic constipation. Accordingly, Persian researchers and clinicians can benefit from this questionnaire in further research and assessment of treatment outcomes.
Reliability of sagittal plane hip, knee, and ankle joint angles from a single frame of video data using the GAITRite camera system.

PubMed

Ross, Sandy A; Rice, Clinton; Von Behren, Kristyn; Meyer, April; Alexander, Rachel; Murfin, Scott

2015-01-01

The purpose of this study was to establish intra-rater, intra-session, and inter-rater, reliability of sagittal plane hip, knee, and ankle angles with and without reflective markers using the GAITRite walkway and single video camera between student physical therapists and an experienced physical therapist. This study included thirty-two healthy participants age 20-59, stratified by age and gender. Participants performed three successful walks with and without markers applied to anatomical landmarks. GAITRite software was used to digitize sagittal hip, knee, and ankle angles at two phases of gait: (1) initial contact; and (2) mid-stance. Intra-rater reliability was more consistent for the experienced physical therapist, regardless of joint or phase of gait. Intra-session reliability was variable, the experienced physical therapist showed moderate to high reliability (intra-class correlation coefficient (ICC) = 0.50-0.89) and the student physical therapist showed very poor to high reliability (ICC = 0.07-0.85). Inter-rater reliability was highest during mid-stance at the knee with markers (ICC = 0.86) and lowest during mid-stance at the hip without markers (ICC = 0.25). Reliability of a single camera system, especially at the knee joint shows promise. Depending on the specific type of reliability, error can be attributed to the testers (e.g. lack of digitization practice and marker placement), participants (e.g. loose fitting clothing) and camera systems (e.g. frame rate and resolution). However, until the camera technology can be upgraded to a higher frame rate and resolution, and the software can be linked to the GAITRite walkway, the clinical utility for pre/post measures is limited.
The reliability and validity of measurements of human dental casts made by an intra-oral 3D scanner, with conventional hand-held digital callipers as the comparison measure.

PubMed

Rajshekar, Mithun; Julian, Roberta; Williams, Anne-Marie; Tennant, Marc; Forrest, Alex; Walsh, Laurence J; Wilson, Gary; Blizzard, Leigh

2017-09-01

Intra-oral 3D scanning of dentitions has the potential to provide a fast, accurate and non-invasive method of recording dental information. The aim of this study was to assess the reliability of measurements of human dental casts made using a portable intra-oral 3D scanner appropriate for field use. Two examiners each measured 84 tooth and 26 arch features of 50 sets of upper and lower human dental casts using digital hand-held callipers, and secondly using the measuring tool provided with the Zfx IntraScan intraoral 3D scanner applied to the virtual dental casts. The measurements were repeated at least one week later. Reliability and validity were quantified concurrently by calculation of intra-class correlation coefficients (ICC) and standard errors of measurement (SEM). The measurements of the 110 landmark features of human dental casts made using the intra-oral 3D scanner were virtually indistinguishable from measurements of the same features made using conventional hand-held callipers. The difference of means as a percentage of the average of the measurements by each method ranged between 0.030% and 1.134%. The intermethod SEMs ranged between 0.037% and 0.535%, and the inter-method ICCs ranged between 0.904 and 0.999, for both the upper and the lower arches. The inter-rater SEMs were one-half and the intra-method/rater SEMs were one-third of the inter-method values. This study demonstrates that the Zfx IntraScan intra-oral 3D scanner with its virtual on-screen measuring tool is a reliable and valid method for measuring the key features of dental casts. Copyright © 2017 Elsevier B.V. All rights reserved.
Reliability of the AMA Guides to the Evaluation of Permanent Impairment.

PubMed

Forst, Linda; Friedman, Lee; Chukwu, Abraham

2010-12-01

AMA's Guides to the Evaluation of Permanent Impairment is used to rate loss of function and determine compensation and ability to work after injury or illness; however, there are few studies that evaluate reliability or construct validity. To evaluate the reliability of the fifth and sixth editions for back injury; to determine best methods for further study. Intra-class correlation coefficients within and between raters were relatively high. There was wider variability for individual cases. Impairment ratings were lower and correlated less well for the sixth edition, though confidence intervals overlapped. The sixth edition may not be an improvement over the fifth. A research agenda should include investigations of reliability and construct validity for different body sites and organ systems along the entire rating scale and among different categories of raters.
Ultrasonographic measurements of lower trapezius muscle thickness at rest and during isometric contraction: a reliability study.

PubMed

Talbott, Nancy R; Witt, Dexter W

2014-07-01

The purpose of this study was to determine the intra-rater reliability and inter-rater reliability of ultrasound imaging (USI) thickness measurements of the lower trapezius (LT) at rest and during active contractions when the transverse process and the lamina were used as reference sites for the measurement process. Twenty healthy individuals between the ages of 22 and 32 years volunteered. With the subject prone and the shoulder in 145° of abduction, images of the LT were taken bilaterally by one examiner as the subject: (1) rested; (2) actively held the test position; and (3) actively held the test position while holding a weight. Ten subjects returned and testing was repeated by the same examiner and by a second examiner. LT thickness measurements were recorded at the level of the transverse process and at the level of the lamina. Intra-class correlation coefficients (ICC) for within session intra-rater reliability (ICC3,3) ranged from 0.951 to 0.986 for both measurement sites while between session intra-rater reliability (ICC3,2) ranged from 0.935 to 0.962. Within session inter-rater reliability (ICC2,2) ranged from 0.934 to 0.973. USI can be used to reliably measure LT thickness at rest, during active contraction and during active contraction when holding a weight. The described protocol can be utilized during shoulder examinations to provide an additional assessment tool for monitoring changes in LT thickness.
A comparison of the reliability of the trochanteric prominence angle test and the alternative method in healthy subjects.

PubMed

Yoon, Tae-Lim; Park, Kyung-Mi; Choi, Sil-Ah; Lee, Ji-Hyun; Jeong, Hyo-Jung; Cynn, Heon-Seock

2014-04-01

A wide range of intra- and inter-rater reliabilities of the trochanteric prominence angle test (TPAT) has been reported. We introduced the transcondylar angle test (TCAT) as an alternative to the TPAT and using a smartphone as a reliable measurement tool for femoral neck anteversion (FNA) measurement. The reliabilities of the TPAT and the TCAT, the reliability of using a smartphone as a clinical measurement tool, and the correlation between the difference value of medial knee joint space (KJS) between rest and tested positions and the difference value between the TPAT and TCAT were assessed. Two physical therapists independently determined the reliabilities of the TPAT with a digital inclinometer, the TCAT with a digital inclinometer, and the TCAT with a smartphone in 19 hips of 10 healthy subjects (5 male and 5 female, 22.2 ± 1.69 years). The medial KJS in rest and the tested position were assessed using a sonography. The intra-class correlation coefficients (ICC) for the intra-rater reliabilities of TPAT with a digital inclinometer (ICC = 0.92), TCAT with a digital inclinometer (ICC = 0.94) and a smartphone (ICC = 0.95) in both testers were substantial. The inter-rater reliability of TPAT with a digital inclinometer was fair (ICC = 0.48) while TCAT with a digital inclinometer (ICC = 0.89) and a smartphone (ICC = 0.85) were substantial. The correlation between the difference value of medial KJS between rest and tested positions and the difference value between TPAT and TCAT was low and statistically non-significant (r = 0.114; p = 0.325). The TCAT would be more reliable than the TPAT in inter-rater test. Using a smartphone is a clinically comparable measuring tool to a digital inclinometer. Copyright © 2013 Elsevier Ltd. All rights reserved.
Reliability and convergent validity of the five-step test in people with chronic stroke.

PubMed

Ng, Shamay S M; Tse, Mimi M Y; Tam, Eric W C; Lai, Cynthia Y Y

2018-01-10

(i) To estimate the intra-rater, inter-rater and test-retest reliabilities of the Five-Step Test (FST), as well as the minimum detectable change in FST completion times in people with stroke. (ii) To estimate the convergent validity of the FST with other measures of stroke-specific impairments. (iii) To identify the best cut-off times for distinguishing FST performance in people with stroke from that of healthy older adults. A cross-sectional study. University-based rehabilitation centre. Forty-eight people with stroke and 39 healthy controls. None. The FST, along with (for the stroke survivors only) scores on the Fugl-Meyer Lower Extremity Assessment (FMA-LE), the Berg Balance Scale (BBS), Limits of Stability (LOS) tests, and Activities-specific Balance Confidence (ABC) scale were tested. The FST showed excellent intra-rater (intra-class correlation coefficient; ICC = 0.866-0.905), inter-rater (ICC = 0.998), and test-retest (ICC = 0.838-0.842) reliabilities. A minimum detectable change of 9.16 s was found for the FST in people with stroke. The FST correlated significantly with the FMA-LE, BBS, and LOS results in the forward and sideways directions (r = -0.411 to -0.716, p < 0.004). The FST completion time of 13.35 s was shown to discriminate reliably between people with stroke and healthy older adults. The FST is a reliable, easy-to-administer clinical test for assessing stroke survivors' ability to negotiate steps and stairs.
Reliability of different methodologies of infrared image analysis of myofascial trigger points in the upper trapezius muscle

PubMed Central

Dibai-Filho, Almir V.; Guirro, Elaine C. O.; Ferreira, Vânia T. K.; Brandino, Hugo E.; Vaz, Maíta M. O. L. L.; Guirro, Rinaldo R. J.

2015-01-01

BACKGROUND: Infrared thermography is recognized as a viable method for evaluation of subjects with myofascial pain. OBJECTIVE: The aim of the present study was to assess the intra- and inter-rater reliability of infrared image analysis of myofascial trigger points in the upper trapezius muscle. METHOD: A reliability study was conducted with 24 volunteers of both genders (23 females) between 18 and 30 years of age (22.12±2.54), all having cervical pain and presence of active myofascial trigger point in the upper trapezius muscle. Two trained examiners performed analysis of point, line, and area of the infrared images at two different periods with a 1-week interval. The intra-class correlation coefficient (ICC2,1) was used to assess the intra- and inter-rater reliability. RESULTS: With regard to the intra-rater reliability, ICC values were between 0.591 and 0.993, with temperatures between 0.13 and 1.57 °C for values of standard error of measurement (SEM) and between 0.36 and 4.35 °C for the minimal detectable change (MDC). For the inter-rater reliability, ICC ranged from 0.615 to 0.918, with temperatures between 0.43 and 1.22 °C for the SEM and between 1.19 and 3.38 °C for the MDC. CONCLUSION: The methods of infrared image analyses of myofascial trigger points in the upper trapezius muscle employed in the present study are suitable for clinical and research practices. PMID:25993626
Reliability of doming and toe flexion testing to quantify foot muscle strength.

PubMed

Ridge, Sarah Trager; Myrer, J William; Olsen, Mark T; Jurgensmeier, Kevin; Johnson, A Wayne

2017-01-01

Quantifying the strength of the intrinsic foot muscles has been a challenge for clinicians and researchers. The reliable measurement of this strength is important in order to assess weakness, which may contribute to a variety of functional issues in the foot and lower leg, including plantar fasciitis and hallux valgus. This study reports 3 novel methods for measuring foot strength - doming (previously unmeasured), hallux flexion, and flexion of the lesser toes. Twenty-one healthy volunteers performed the strength tests during two testing sessions which occurred one to five days apart. Each participant performed each series of strength tests (doming, hallux flexion, and lesser toe flexion) four times during the first testing session (twice with each of two raters) and two times during the second testing session (once with each rater). Intra-class correlation coefficients were calculated to test for reliability for the following comparisons: between raters during the same testing session on the same day (inter-rater, intra-day, intra-session), between raters on different days (inter-rater, inter-day, inter-session), between days for the same rater (intra-rater, inter-day, inter-session), and between sessions on the same day by the same rater (intra-rater, intra-day, inter-session). ICCs showed good to excellent reliability for all tests between days, raters, and sessions. Average doming strength was 99.96 ± 47.04 N. Average hallux flexion strength was 65.66 ± 24.5 N. Average lateral toe flexion was 50.96 ± 22.54 N. These simple tests using relatively low cost equipment can be used for research or clinical purposes. If repeated testing will be conducted on the same participant, it is suggested that the same researcher or clinician perform the testing each time for optimal reliability.
Reliability and validity of the range of motion scale (ROMS) in patients with abnormal postures.

PubMed

van Rooijen, Diana E; Lalli, Stefania; Marinus, Johan; Maihöfner, Christian; McCabe, Candida S; Munts, Alex G; van der Plas, Anton A; Tijssen, Marina A J; van de Warrenburg, Bart P; Albanese, Alberto; van Hilten, Jacobus J

2015-03-01

Sustained abnormal postures (i.e., fixed dystonia) are the most frequently reported motor abnormalities in complex regional pain syndrome (CRPS), but these symptoms may also develop after peripheral trauma without CRPS. Currently, there is no valid and reliable measurement instrument available to measure the severity and distribution of these postures. The range of motion scale (ROMS) was therefore developed to assess the severity based on the possible active range of motion of all joints (arms, legs, trunk, and neck), and the present study evaluates its reliability and validity. Inter- and intra-rater reliability of the ROMS was determined in 16 patients with abnormal sustained postures, who were videotaped following a standard video protocol in a university hospital. The recordings were rated by a panel of international experts. In addition, 30 patients were clinically tested with both the Burke-Fahn-Marsden (BFM) scale as well as the ROMS to assess construct validity. Inter-rater reliability for total ROMS scores showed an intra-class correlation coefficient (ICC) of 0.85. The majority of the scores for the separate joints (13 out of 18) demonstrated an almost perfect agreement with ICCs ranging from 0.81 to 0.94; of the other items, one showed fair, one moderate, and three substantial agreement. The ICCs for the intra-rater reliability ranged from moderate to almost perfect (0.68-0.98). Spearman's correlation coefficients between corresponding body areas as measured with the ROMS or BFM were all above 0.82. The ROMS is a reliable and valid instrument to evaluate the severity and distribution of sustained abnormal postures. Wiley Periodicals, Inc.
Reliability of mercury-in-silastic strain gauge plethysmography curve reading: influence of clinical clues and observer variation.

PubMed

Høyer, Christian; Pavar, Susanne; Pedersen, Begitte H; Biurrun Manresa, José A; Petersen, Lars J

2013-08-01

Mercury-in-silastic strain gauge pletysmography (SGP) is a well-established technique for blood flow and blood pressure measurements. The aim of this study was to examine (i) the possible influence of clinical clues, e.g. the presence of wounds and color changes during blood pressure measurements, and (ii) intra- and inter-observer variation of curve interpretation for segmental blood pressure measurements. A total of 204 patients with known or suspected peripheral arterial disease (PAD) were included in a diagnostic accuracy trial. Toe and ankle pressures were measured in both limbs, and primary observers analyzed a total of 804 pressure curve sets. The SGP curves were later reanalyzed separately by two observers blinded to clinical clues. Intra- and inter-observer agreement was quantified using Cohen's kappa and reliability was quantified using intra-class correlation coefficients, coefficients of variance, and Bland-Altman analysis. There was an overall agreement regarding patient diagnostic classification (PAD/not PAD) in 202/204 (99.0%) for intra-observer (κ = 0.969, p < 0.001), and 201/204 (98.5%) for inter-observer readings (κ = 0.953, p < 0.001). Reliability analysis showed excellent correlation between blinded versus non-blinded and inter-observer readings for determination of absolute segmental pressures (all intraclass correlation coefficients ≥ 0.984). The coefficient of variance for determination of absolute segmental blood pressure ranged from 2.9-3.4% for blinded/non-blinded data and from 3.8-5.0% for inter-observer data. This study shows a low inter-observer variation among experienced laboratory technicians for reading strain gauge curves. The low variation between blinded/non-blinded readings indicates that SGP measurements are minimally biased by clinical clues.
The development and validation of a custom built device for assessing frontal knee joint laxity.

PubMed

Ismail, Shiek Abdullah; Simic, Milena; Clarke, Jillian L; Lopes, Thiago Jambo Alves; Pappas, Evangelos

2017-12-01

This study reports the development and validation of a quantitative technique of assessing frontal knee joint laxity through a custom built device named KLICP. The objectives of this study were to determine: (i) the intra- and inter-rater reliability and (ii) the validity of the device when compared to real time ultrasound. Twenty-five participants had their frontal knee joint laxity assessed by the KLICP, by manual varus/valgus tests and by ultrasound. Two raters independently assessed laxity manually by three repeated measurements, repeated at least 48h later. Results were validated by comparing them to the medial and lateral joint space opening measured by the ultrasound. Intraclass correlation coefficients and standard error of measurement reliability were calculated. Pearson's correlation coefficients were calculated to determine the correlation between the KLICP and the joint space. Intra-rater reliability (intra-session) for each rater was good on both sessions (0.91-0.98), intra-rater reliability (inter-sessions) was moderate to good (0.62-0.87), and inter-rater reliability (intra-session) was good (0.75-0.80). There is low agreement for intra-rater (inter-session) and for inter-rater (intra-session) reliability. The KLICP measurement has a significant positive fair to moderate correlation to the ultrasound measurement at the left (r: 0.61, p: 0.01) and right (r: 0.48, p: 0.02) knee in the valgus direction and at the left (r: 0.51, p: 0.01) and right (r: 0.39, p: 0.05) knee in the varus direction. There is low agreement between the KLICP and the RTU. Reliability and agreement was good only when measured for intra-rater, within session. Copyright © 2017 Elsevier B.V. All rights reserved.
Does Dry Eye Affect Repeatability of Corneal Topography Measurements?

PubMed

Doğan, Aysun Şanal; Gürdal, Canan; Köylü, Mehmet Talay

2018-04-01

The purpose of this study was to assess the repeatability of corneal topography measurements in dry eye patients and healthy controls. Participants underwent consecutive corneal topography measurements (Sirius; Costruzione Strumenti Oftalmici, Florence, Italy). Two images with acquisition quality higher than 90% were accepted. The following parameters were evaluated: minimum and central corneal thickness, aqueous depth, apex curvature, anterior chamber volume, horizontal anterior chamber diameter, iridocorneal angle, cornea volume, and average simulated keratometry. Repeatability was assessed by calculating intra-class correlation coefficient. Thirty-three patients with dry eye syndrome and 40 healthy controls were enrolled to the study. The groups were similar in terms of age (39 [18-65] vs. 30.5 [18-65] years, p=0.198) and gender (M/F: 4/29 vs. 8/32, p=0.366). Intra-class correlation coefficients among all topography parameters within both groups showed excellent repeatability (>0.90). The anterior segment measurements provided by the Sirius corneal topography system were highly repeatable for dry eye patients and are sufficiently reliable for clinical practice and research.
Measuring the quality of life in mild to very severe dementia: testing the inter-rater and intra-rater reliability of the German version of the QUALIDEM.

PubMed

Dichter, Martin Nikolaus; Schwab, Christian G G; Meyer, Gabriele; Bartholomeyczik, Sabine; Dortmann, Olga; Halek, Margareta

2014-05-01

Quality of life (Qol) is an increasingly used outcome measure in dementia research. The QUALIDEM is a dementia-specific and proxy-rated Qol instrument. We aimed to determine the inter-rater and intra-rater reliability in residents with dementia in German nursing homes. The QUALIDEM consists of nine subscales that were applied to a sample of 108 people with mild to severe dementia and six consecutive subscales that were applied to a sample of 53 people with very severe dementia. The proxy raters were 49 registered nurses and nursing assistants. Inter-rater and intra-rater reliability scores were calculated on the subscale and item level. None of the QUALIDEM subscales showed strong inter-rater reliability based on the single-measure Intra-Class Correlation Coefficient (ICC) for absolute agreement ≥ 0.70. Based on the average-measure ICC for four raters, eight subscales for people with mild to severe dementia (care relationship, positive affect, negative affect, restless tense behavior, social relations, social isolation, feeling at home and having something to do) and five subscales for very severe dementia (care relationship, negative affect, restless tense behavior, social relations and social isolation) yielded a strong inter-rater agreement (ICC: 0.72-0.86). All of the QUALIDEM subscales, regardless of dementia severity, showed strong intra-rater agreement. The ICC values ranged between 0.70 and 0.79 for people with mild to severe dementia and between 0.75 and 0.87 for people with very severe dementia. This study demonstrated insufficient inter-rater reliability and sufficient intra-rater reliability for all subscales of both versions of the German QUALIDEM. The degree of inter-rater reliability can be improved by collaborative Qol rating by more than one nurse. The development of a measurement manual with accurate item definitions and a standardized education program for proxy raters is recommended.
Reliability of Various Measurement Stations for Determining Plantar Fascia Thickness and Echogenicity.

PubMed

Bisi-Balogun, Adebisi; Cassel, Michael; Mayer, Frank

2016-04-13

This study aimed to determine the relative and absolute reliability of ultrasound (US) measurements of the thickness and echogenicity of the plantar fascia (PF) at different measurement stations along its length using a standardized protocol. Twelve healthy subjects (24 feet) were enrolled. The PF was imaged in the longitudinal plane. Subjects were assessed twice to evaluate the intra-rater reliability. A quantitative evaluation of the thickness and echogenicity of the plantar fascia was performed using Image J, a digital image analysis and viewer software. A sonography evaluation of the thickness and echogenicity of the PF showed a high relative reliability with an Intra class correlation coefficient of ≥0.88 at all measurement stations. However, the measurement stations for both the PF thickness and echogenicity which showed the highest intraclass correlation coefficient (ICCs) did not have the highest absolute reliability. Compared to other measurement stations, measuring the PF thickness at 3 cm distal and the echogenicity at a region of interest 1 cm to 2 cm distal from its insertion at the medial calcaneal tubercle showed the highest absolute reliability with the least systematic bias and random error. Also, the reliability was higher using a mean of three measurements compared to one measurement. To reduce discrepancies in the interpretation of the thickness and echogenicity measurements of the PF, the absolute reliability of the different measurement stations should be considered in clinical practice and research rather than the relative reliability with the ICC.
Reliability of Various Measurement Stations for Determining Plantar Fascia Thickness and Echogenicity

PubMed Central

Bisi-Balogun, Adebisi; Cassel, Michael; Mayer, Frank

2016-01-01

This study aimed to determine the relative and absolute reliability of ultrasound (US) measurements of the thickness and echogenicity of the plantar fascia (PF) at different measurement stations along its length using a standardized protocol. Twelve healthy subjects (24 feet) were enrolled. The PF was imaged in the longitudinal plane. Subjects were assessed twice to evaluate the intra-rater reliability. A quantitative evaluation of the thickness and echogenicity of the plantar fascia was performed using Image J, a digital image analysis and viewer software. A sonography evaluation of the thickness and echogenicity of the PF showed a high relative reliability with an Intra class correlation coefficient of ≥0.88 at all measurement stations. However, the measurement stations for both the PF thickness and echogenicity which showed the highest intraclass correlation coefficient (ICCs) did not have the highest absolute reliability. Compared to other measurement stations, measuring the PF thickness at 3 cm distal and the echogenicity at a region of interest 1 cm to 2 cm distal from its insertion at the medial calcaneal tubercle showed the highest absolute reliability with the least systematic bias and random error. Also, the reliability was higher using a mean of three measurements compared to one measurement. To reduce discrepancies in the interpretation of the thickness and echogenicity measurements of the PF, the absolute reliability of the different measurement stations should be considered in clinical practice and research rather than the relative reliability with the ICC. PMID:27089369
Intra- and inter-rater reliability of digital image analysis for skin color measurement

PubMed Central

Sommers, Marilyn; Beacham, Barbara; Baker, Rachel; Fargo, Jamison

2013-01-01

Background We determined the intra- and inter-rater reliability of data from digital image color analysis between an expert and novice analyst. Methods Following training, the expert and novice independently analyzed 210 randomly ordered images. Both analysts used Adobe® Photoshop lasso or color sampler tools based on the type of image file. After color correction with Pictocolor® in camera software, they recorded L*a*b* (L*=light/dark; a*=red/green; b*=yellow/blue) color values for all skin sites. We computed intra-rater and inter-rater agreement within anatomical region, color value (L*, a*, b*), and technique (lasso, color sampler) using a series of one-way intra-class correlation coefficients (ICCs). Results Results of ICCs for intra-rater agreement showed high levels of internal consistency reliability within each rater for the lasso technique (ICC ≥ 0.99) and somewhat lower, yet acceptable, level of agreement for the color sampler technique (ICC = 0.91 for expert, ICC = 0.81 for novice). Skin L*, skin b*, and labia L* values reached the highest level of agreement (ICC ≥ 0.92) and skin a*, labia b*, and vaginal wall b* were the lowest (ICC ≥ 0.64). Conclusion Data from novice analysts can achieve high levels of agreement with data from expert analysts with training and the use of a detailed, standard protocol. PMID:23551208
Intra- and inter-rater reliability of digital image analysis for skin color measurement.

PubMed

Sommers, Marilyn; Beacham, Barbara; Baker, Rachel; Fargo, Jamison

2013-11-01

We determined the intra- and inter-rater reliability of data from digital image color analysis between an expert and novice analyst. Following training, the expert and novice independently analyzed 210 randomly ordered images. Both analysts used Adobe(®) Photoshop lasso or color sampler tools based on the type of image file. After color correction with Pictocolor(®) in camera software, they recorded L*a*b* (L*=light/dark; a*=red/green; b*=yellow/blue) color values for all skin sites. We computed intra-rater and inter-rater agreement within anatomical region, color value (L*, a*, b*), and technique (lasso, color sampler) using a series of one-way intra-class correlation coefficients (ICCs). Results of ICCs for intra-rater agreement showed high levels of internal consistency reliability within each rater for the lasso technique (ICC ≥ 0.99) and somewhat lower, yet acceptable, level of agreement for the color sampler technique (ICC = 0.91 for expert, ICC = 0.81 for novice). Skin L*, skin b*, and labia L* values reached the highest level of agreement (ICC ≥ 0.92) and skin a*, labia b*, and vaginal wall b* were the lowest (ICC ≥ 0.64). Data from novice analysts can achieve high levels of agreement with data from expert analysts with training and the use of a detailed, standard protocol. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Simplified Chinese version of the Forgotten Joint Score (FJS) for patients who underwent joint arthroplasty: cross-cultural adaptation and validation.

PubMed

Cao, Shiqi; Liu, Ning; Han, Wuxiang; Zi, Yunpeng; Peng, Fan; Li, Lexiang; Fu, Qiwei; Chen, Yi; Zheng, Weijie; Qian, Qirong

2017-01-14

The Forgotten Joint Score (FJS) is a newly developed health-related quality of life (HRQoL) questionnaire designed to evaluate the awareness after total knee arthroplasty (TKA). This study cross-culturally adapted and psychometrically validated a simplified Chinese version of the FJS (SC-FJS). Cross-cultural adaptation was performed according to the internationally recognized guidelines. One-hundred and fifty participants who underwent primary TKA were recruited in this study. Cronbach's α and intra-class correlations were used to determine reliability. Construct validity was analyzed by evaluating the correlations between SC-FJS and the Knee Injury and Osteoarthritis Outcome Score (KOOS) and the short form (36) health survey (SF-36). Each of the 12 items was properly responded and correlated with the total items. SC-FJS had excellent reliability [Cronbach's α = 0.907, intra-class correlation coefficient (ICC) = 0.970, 95% CI 0.959-0.978). Elimination of any one item in all did not result in a value of Cronbach's α of <0.80. SC-FJS had a high correlation with symptoms (0.67, p < 0.001) and pain (0.60, p < 0.001) domains of KOOS and social functioning (0.66, p < 0.001) domain of SF-36, and it also moderately correlated with function in daily living (0.53, p < 0.001) and function in sport and recreation (0.40, p < 0.001) domains of KOOS, and physical subscale of SF-36 (0.49-0.53, p < 0.001) but had a low (r = 0.20) or not significant (p > 0.05) correlation with mental subscale of SF-36. SC-FJS demonstrated excellent acceptability, internal consistency, reliability, and construct validity, which can be recommended for patients who underwent joint arthroplasty in Mainland China.

Reliability of laser Doppler flowmetry curve reading for measurement of toe and ankle pressures: intra- and inter-observer variation.

PubMed

Høyer, C; Paludan, J P D; Pavar, S; Biurrun Manresa, J A; Petersen, L J

2014-03-01

To assess the intra- and inter-observer variation in laser Doppler flowmetry curve reading for measurement of toe and ankle pressures. A prospective single blinded diagnostic accuracy study was conducted on 200 patients with known or suspected peripheral arterial disease (PAD), with a total of 760 curve sets produced. The first curve reading for this study was performed by laboratory technologists blinded to clinical clues and previous readings at least 3 months after the primary data sampling. The pressure curves were later reassessed following another period of at least 3 months. Observer agreement in diagnostic classification according to TASC-II criteria was quantified using Cohen's kappa. Reliability was quantified using intra-class correlation coefficients, coefficients of variance, and Bland-Altman analysis. The overall agreement in diagnostic classification (PAD/not PAD) was 173/200 (87%) for intra-observer (κ = .858) and 175/200 (88%) for inter-observer data (κ = .787). Reliability analysis confirmed excellent correlation for both intra- and inter-observer data (ICC all ≥.931). The coefficients of variance ranged from 2.27% to 6.44% for intra-observer and 2.39% to 8.42% for inter-observer data. Subgroup analysis showed lower observer-variation for reading of toe pressures in patients with diabetes and/or chronic kidney disease than patients not diagnosed with these conditions. Bland-Altman plots showed higher variation in toe pressure readings than ankle pressure readings. This study shows substantial intra- and inter-observer agreement in diagnostic classification and reading of absolute pressures when using laboratory technologists as observers. The study emphasises that observer variation for curve reading is an important factor concerning the overall reproducibility of the method. Our data suggest diabetes and chronic kidney disease have an influence on toe pressure reproducibility. Copyright © 2013 European Society for Vascular Surgery. Published by Elsevier Ltd. All rights reserved.
Reliability of assessment of upper trapezius morphology, its mechanical properties and blood flow in female patients with myofascial pain syndrome using ultrasonography.

PubMed

Adigozali, Hakimeh; Shadmehr, Azadeh; Ebrahimi, Esmail; Rezasoltani, Asghar; Naderi, Farrokh

2017-01-01

In the present study, the intra-rater reliability of upper trapezius morphology, its mechanical properties and intramuscular blood circulation in females with myofascial pain syndrome were assessed using ultrasonography. A total of 37 patients (31.05 ± 10 years old) participated in this study. Ultrasonography producer was set up in three stages: a) Gray-scale: to measure muscle thickness, size and area of trigger points; b) Ultrasound elastography: to measure muscle stiffness; and c) Doppler imaging: to assess blood flow indices. According to data analysis, all variables, except End Diastolic Velocity (EDV), had excellent reliability (>0.806). Intra-class Correlation Coefficient (ICC) for EDV was 0.738, which was considered a poor to good reliability. The results of this study introduced a reliable method for developing details of upper trapezius features using muscular ultrasonography in female patients. These variables could be used for objective examination and provide guidelines for treatment plans in clinical settings. Copyright © 2016 Elsevier Ltd. All rights reserved.
Comparison of 3D computer-aided with manual cerebral aneurysm measurements in different imaging modalities.

PubMed

Groth, M; Forkert, N D; Buhk, J H; Schoenfeld, M; Goebell, E; Fiehler, J

2013-02-01

To compare intra- and inter-observer reliability of aneurysm measurements obtained by a 3D computer-aided technique with standard manual aneurysm measurements in different imaging modalities. A total of 21 patients with 29 cerebral aneurysms were studied. All patients underwent digital subtraction angiography (DSA), contrast-enhanced (CE-MRA) and time-of-flight magnetic resonance angiography (TOF-MRA). Aneurysm neck and depth diameters were manually measured by two observers in each modality. Additionally, semi-automatic computer-aided diameter measurements were performed using 3D vessel surface models derived from CE- (CE-com) and TOF-MRA (TOF-com) datasets. Bland-Altman analysis (BA) and intra-class correlation coefficient (ICC) were used to evaluate intra- and inter-observer agreement. BA revealed the narrowest relative limits of intra- and inter-observer agreement for aneurysm neck and depth diameters obtained by TOF-com (ranging between ±5.3 % and ±28.3 %) and CE-com (ranging between ±23.3 % and ±38.1 %). Direct measurements in DSA, TOF-MRA and CE-MRA showed considerably wider limits of agreement. The highest ICCs were observed for TOF-com and CE-com (ICC values, 0.92 or higher for intra- as well as inter-observer reliability). Computer-aided aneurysm measurement in 3D offers improved intra- and inter-observer reliability and a reproducible parameter extraction, which may be used in clinical routine and as objective surrogate end-points in clinical trials.
Reliability and Validity Study of the Chamorro Assisted Gait Scale for People with Sprained Ankles, Walking with Forearm Crutches

PubMed Central

Ridao-Fernández, Carmen; Ojeda, Joaquín; Benítez-Lugo, Marisa; Sevillano, José Luis

2016-01-01

Objective The aim of this study was to design and validate a functional assessment scale for assisted gait with forearm crutches (Chamorro Assisted Gait Scale—CHAGS) and to assess its reliability in people with sprained ankles. Design Thirty subjects who suffered from sprained ankle (anterior talofibular ligament first and second degree) were included in the study. A modified Delphi technique was used to obtain the content validity. The selected items were: pelvic and scapular girdle dissociation(1), deviation of Center of Gravity(2), crutch inclination(3), steps rhythm(4), symmetry of step length(5), cross support(6), simultaneous support of foot and crutch(7), forearm off(8), facing forward(9) and fluency(10). Two raters twice visualized the gait of the sample subjects which were recorded. The criterion-related validity was determined by correlation between CHAGS and Coding of eight criteria of qualitative gait analysis (Viel Coding). Internal consistency and inter and intra-rater reliability were also tested. Results CHAGS obtained a high and negative correlation with Viel Coding. We obtained a good internal consistency and the intra-class correlation coefficients oscillated between 0.97 and 0.99, while the minimal detectable changes were acceptable. Conclusion CHAGS scale is a valid and reliable tool for assessing assisted gait with crutches in people with sprained ankles to perform partial relief of lower limbs. PMID:27168236
Three dimensional reliability analyses of currently used methods for assessment of sagittal jaw discrepancy

PubMed Central

Almaqrami, Bushra-Sufyan; Alhammadi, Maged-Sultan

2018-01-01

Background The objective of this study was to analyse three dimensionally the reliability and correlation of angular and linear measurements in assessment of anteroposterior skeletal discrepancy. Material and Methods In this retrospective cross sectional study, a sample of 213 subjects were three-dimensionally analysed from cone-beam computed tomography scans. The sample was divided according to three dimensional measurement of anteroposterior relation (ANB angle) into three groups (skeletal Class I, Class II and Class III). The anterior-posterior cephalometric indicators were measured on volumetric images using Anatomage software (InVivo5.2). These measurements included three angular and seven linear measurements. Cross tabulations were performed to correlate the ANB angle with each method. Intra-class Correlation Coefficient (ICC) test was applied for the difference between the two reliability measurements. P value of < 0.05 was considered significant. Results There was a statistically significant (P<0.05) agreement between all methods used with variability in assessment of different anteroposterior relations. The highest correlation was between ANB and DSOJ (0.913), strong correlation with AB/FH, AB/SN/, MM bisector, AB/PP, Wits appraisal (0.896, 0.890, 0.878, 0.867,and 0.858, respectively), moderate with AD/SN and Beta angle (0.787 and 0.760), and weak correlation with corrected ANB angle (0.550). Conclusions Conjunctive usage of ANB angle with DSOJ, AB/FH, AB/SN/, MM bisector, AB/PP and Wits appraisal in 3D cephalometric analysis provide a more reliable and valid indicator of the skeletal anteroposterior relationship. Clinical relevance: Most of orthodontic literature depends on single method (ANB) with its drawbacks in assessment of skeletal discrepancy which is a cardinal factors for proper treatment planning, this study assessed three dimensionally the degree of correlation between all available methods to make clinical judgement more accurate based on more than one method of assessment. Key words:Anteroposterior relationships, ANB angle, Three-dimension, CBCT. PMID:29750096
Reliability of ultrasound thickness measurement of the abdominal muscles during clinical isometric endurance tests.

PubMed

ShahAli, Shabnam; Arab, Amir Massoud; Talebian, Saeed; Ebrahimi, Esmaeil; Bahmani, Andia; Karimi, Noureddin; Nabavi, Hoda

2015-07-01

The study was designed to evaluate the intra-examiner reliability of ultrasound (US) thickness measurement of abdominal muscles activity when supine lying and during two isometric endurance tests in subjects with and without Low back pain (LBP). A total of 19 women (9 with LBP, 10 without LBP) participated in the study. Within-day reliability of the US thickness measurements at supine lying and the two isometric endurance tests were assessed in all subjects. The intra-class correlation coefficient (ICC) was used to assess the relative reliability of thickness measurement. The standard error of measurement (SEM), minimal detectable change (MDC) and the coefficient of variation (CV) were used to evaluate the absolute reliability. Results indicated high ICC scores (0.73-0.99) and also small SEM and MDC scores for within-day reliability assessment. The Bland-Altman plots of agreement in US measurement of the abdominal muscles during the two isometric endurance tests demonstrated that 95% of the observations fall between the limits of agreement for test and retest measurements. Together the results indicate high intra-tester reliability for the US measurement of the thickness of abdominal muscles in all the positions tested. According to the study's findings, US imaging can be used as a reliable method for assessment of abdominal muscles activity in supine lying and the two isometric endurance tests employed, in participants with and without LBP. Copyright © 2014 Elsevier Ltd. All rights reserved.
The Reliability of Pharyngeal High Resolution Manometry with Impedance for Derivation of Measures of Swallowing Function in Healthy Volunteers

PubMed Central

Omari, Taher I.; Savilampi, Johanna; Kokkinn, Karmen; Schar, Mistyka; Lamvik, Kristin; Doeltgen, Sebastian; Cock, Charles

2016-01-01

Purpose. We evaluated the intra- and interrater agreement and test-retest reliability of analyst derivation of swallow function variables based on repeated high resolution manometry with impedance measurements. Methods. Five subjects swallowed 10 × 10 mL saline on two occasions one week apart producing a database of 100 swallows. Swallows were repeat-analysed by six observers using software. Swallow variables were indicative of contractility, intrabolus pressure, and flow timing. Results. The average intraclass correlation coefficients (ICC) for intra- and interrater comparisons of all variable means showed substantial to excellent agreement (intrarater ICC 0.85–1.00; mean interrater ICC 0.77–1.00). Test-retest results were less reliable. ICC for test-retest comparisons ranged from slight to excellent depending on the class of variable. Contractility variables differed most in terms of test-retest reliability. Amongst contractility variables, UES basal pressure showed excellent test-retest agreement (mean ICC 0.94), measures of UES postrelaxation contractile pressure showed moderate to substantial test-retest agreement (mean Interrater ICC 0.47–0.67), and test-retest agreement of pharyngeal contractile pressure ranged from slight to substantial (mean Interrater ICC 0.15–0.61). Conclusions. Test-retest reliability of HRIM measures depends on the class of variable. Measures of bolus distension pressure and flow timing appear to be more test-retest reliable than measures of contractility. PMID:27190520
Estimating a graphical intra-class correlation coefficient (GICC) using multivariate probit-linear mixed models.

PubMed

Yue, Chen; Chen, Shaojie; Sair, Haris I; Airan, Raag; Caffo, Brian S

2015-09-01

Data reproducibility is a critical issue in all scientific experiments. In this manuscript, the problem of quantifying the reproducibility of graphical measurements is considered. The image intra-class correlation coefficient (I2C2) is generalized and the graphical intra-class correlation coefficient (GICC) is proposed for such purpose. The concept for GICC is based on multivariate probit-linear mixed effect models. A Markov Chain Monte Carlo EM (mcm-cEM) algorithm is used for estimating the GICC. Simulation results with varied settings are demonstrated and our method is applied to the KIRBY21 test-retest dataset.
Intra-and inter-observer reliability of nailfold videocapillaroscopy - A possible outcome measure for systemic sclerosis-related microangiopathy.

PubMed

Dinsdale, Graham; Moore, Tonia; O'Leary, Neil; Tresadern, Philip; Berks, Michael; Roberts, Christopher; Manning, Joanne; Allen, John; Anderson, Marina; Cutolo, Maurizio; Hesselstrand, Roger; Howell, Kevin; Pizzorni, Carmen; Smith, Vanessa; Sulli, Alberto; Wildt, Marie; Taylor, Christopher; Murray, Andrea; Herrick, Ariane L

2017-07-01

Our aim was to assess the reliability of nailfold capillary assessment in terms of image evaluability, image severity grade ('normal', 'early', 'active', 'late'), capillary density, capillary (apex) width, and presence of giant capillaries, and also to gain further insight into differences in these parameters between patients with systemic sclerosis (SSc), patients with primary Raynaud's phenomenon (PRP) and healthy control subjects. Videocapillaroscopy images (magnification 300×) were acquired from all 10 digits from 173 participants: 101 patients with SSc, 22 with PRP and 50 healthy controls. Ten capillaroscopy experts from 7 European centres evaluated the images. Custom image mark-up software allowed extraction of the following outcome measures: overall grade ('normal', 'early', 'active', 'late', 'non-specific', or 'ungradeable'), capillary density (vessels/mm), mean vessel apical width, and presence of giant capillaries. Observers analysed a median of 129 images each. Evaluability (i.e. the availability of measures) varied across outcome measures (e.g. 73.0% for density and 46.2% for overall grade in patients with SSc). Intra-observer reliability for evaluability was consistently higher than inter- (e.g. for density, intra-class correlation coefficient [ICC] was 0.71 within and 0.14 between observers). Conditional on evaluability, both intra- and inter-observer reliability were high for grade (ICC 0.93 and 0.78 respectively), density (0.91 and 0.64) and width (0.91 and 0.85). Evaluability is one of the major challenges in assessing nailfold capillaries. However, when images are evaluable, the high intra- and inter-reliabilities suggest that overall image grade, capillary density and apex width have potential as outcome measures in longitudinal studies. Copyright © 2017 Elsevier Inc. All rights reserved.
Quantitative outcome measures for systemic sclerosis-related Microangiopathy - Reliability of image acquisition in Nailfold Capillaroscopy.

PubMed

Dinsdale, Graham; Moore, Tonia; O'Leary, Neil; Berks, Michael; Roberts, Christopher; Manning, Joanne; Allen, John; Anderson, Marina; Cutolo, Maurizio; Hesselstrand, Roger; Howell, Kevin; Pizzorni, Carmen; Smith, Vanessa; Sulli, Alberto; Wildt, Marie; Taylor, Christopher; Murray, Andrea; Herrick, Ariane L

2017-09-01

Nailfold capillaroscopic parameters hold increasing promise as outcome measures for clinical trials in systemic sclerosis (SSc). Their inclusion as outcomes would often naturally require capillaroscopy images to be captured at several time points during any one study. Our objective was to assess repeatability of image acquisition (which has been little studied), as well as of measurement. 41 patients (26 with SSc, 15 with primary Raynaud's phenomenon) and 10 healthy controls returned for repeat high-magnification (300×) videocapillaroscopy mosaic imaging of 10 digits one week after initial imaging (as part of a larger study of reliability). Images were assessed in a random order by an expert blinded observer and 4 outcome measures extracted: (1) overall image grade and then (where possible) distal vessel locations were marked, allowing (2) vessel density (across the whole nailfold) to be calculated (3) apex width measurement and (4) giant vessel count. Intra-rater, intra-visit and intra-rater inter-visit (baseline vs. 1week) reliability were examined in 475 and 392 images respectively. A linear, mixed-effects model was used to estimate variance components, from which intra-class correlation coefficients (ICCs) were determined. Intra-visit and inter-visit reliability estimates (ICCs) were (respectively): overall image grade, 0.97 and 0.90; vessel density, 0.92 and 0.65; mean vessel width, 0.91 and 0.79; presence of giant capillary, 0.68 and 0.56. These estimates were conditional on each parameter being measurable. Within-operator image analysis and acquisition are reproducible. Quantitative nailfold capillaroscopy, at least with a single observer, provides reliable outcome measures for clinical studies including randomised controlled trials. Copyright © 2017 Elsevier Inc. All rights reserved.
Validity and reliability of a new ankle dorsiflexion measurement device.

PubMed

Gatt, Alfred; Chockalingam, Nachiappan

2013-08-01

The assessment of the maximum ankle dorsiflexion angle is an important clinical examination procedure. Evidence shows that the traditional goniometer is highly unreliable, and various designs of goniometers to measure the maximum ankle dorsiflexion angle rely on the application of a known force to obtain reliable results. Hence, an innovative ankle dorsiflexion measurement device was designed to make this measurement more reliable by holding the foot in a selected posture without the application of a known moment. To report on the comprehensive validity and reliability testing carried out on the new device. Following validity testing, four different trials to test reliability of the ankle dorsiflexion measurement device were performed. These trials included inter-rater and intra-rater testings with a controlled moment, intra-rater reliability testing with knees flexed and extended without a controlled moment, intra-rater testing with a patient population, and inter-rater reliability testing between four raters of varying experience without controlling moment. All raters were blinded. A series of trials to test intra-rater and inter-rater reliabilities. Intra-rater reliability intraclass correlation coefficient was 0.98 and inter-rater reliability intraclass correlation coefficient (2,1) was 0.953 with a controlled moment. With uncontrolled moment, very high reliability for intra-tester was also achieved (intraclass correlation coefficient = 0.94 with knees extended and intraclass correlation coefficient = 0.95 with knees flexed). For the trial investigating test-retest reliability with actual patients, intraclass correlation coefficient of 0.99 was obtained. In the trial investigating four different raters with uncontrolled moment, intraclass correlation coefficient of 0.91 was achieved. The new ankle dorsiflexion measurement device is a valid and reliable device for measuring ankle dorsiflexion in both healthy subjects and patients, with both controlled and uncontrolled moments, even by multiple raters of varying experience when the foot is dorsiflexed to its end of range of motion. An ankle dorsiflexion measuring device has been designed to increase the reliability of ankle dorsiflexion measurement and replace the traditional goniometer. While the majority of similar devices rely on application of a known moment to perform this measurement, it has been shown that this is not required with the new ankle dorsiflexion measurement device and, rather, foot posture should be taken into consideration as this affects the maximum ankle dorsiflexion angle.
Phases of match-play in professional Australian Football: Descriptive analysis and reliability assessment.

PubMed

Rennie, Michael J; Watsford, Mark L; Spurrs, Robert W; Kelly, Stephen J; Pine, Matthew J

2018-06-01

To examine the frequency and time spent in the phases of Australian Football (AF) match-play and to assess the intra-assessor reliability of coding these phases of match-play. Observational, intra-reliability assessment. Video footage of 10 random quarters of AF match-play were coded by a single researcher. Phases of offence, defence, contested play, umpire stoppage, set shot and goal reset were coded using a set of operational definitions. Descriptive statistics were provided for all phases of match-play. Following a 6-month washout period, intra-coder reliability was assessed using typical error of measurement (TEM) and intra-class correlation coefficients (ICC). A quarter of AF match-play involved 128±20 different phases of match-play. The highest proportion of match-play involved contested play (25%), followed by offence (18%), defence (18%) and umpire stoppages (18%). The mean duration of offence, defence, contested play, umpire stoppage, set shot and goal reset were 14, 14, 10, 11, 28 and 47s, respectively. No differences were found between the two coding assessments (p>0.05). ICCs for coding the phases of play demonstrated very high reliability (r=0.902-0.992). TEM of the total time spent in each phase of play represented moderate to good reliability (TEM=1.8-9.3%). Coding of offence, defence and contested play tended to display slightly poorer TEMs than umpire stoppages, set shots and goal resets (TEM=8.1 vs 4.5%). Researchers can reliably code the phases of AF match-play which may permit the analysis of specific elements of competition. Copyright © 2017 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Psychometric viability of measures of functional performance commonly used for people with dementia: a systematic review of measurement properties.

PubMed

Fox, Benjamin; Henwood, Timothy; Keogh, Justin; Neville, Christine

2016-08-01

Confidence in findings can only be drawn from measurement tools that have sound psychometric properties for the population with which they are used. Within a dementia specific population, measures of physical function have been poorly justified in exercise intervention studies, with justification of measures based on validity or reliability studies from dissimilar clinical populations, such as people with bronchitis or healthy older adults without dementia. To review the reliability and validity of quantitative measures of pre-identified physical function, as commonly used within exercise intervention literature for adults with dementia. Participants were adults, aged 65 years and older, with a confirmed medical diagnosis of dementia. n/a Desired studies were observational and cross-sectional and that assessed measures from a pre-identified list of measures of physical function. Studies that assessed the psychometric constructs of reliability and validity were targeted. COSMIN taxology was used to define reliability and validity. This included, but were not limited to, Intra-Class Correlations, Kappa, Cronbach's Alpha, Chi Squared, Standard Error of Measurement, Minimal Detectable Change and Limits of Agreement. Published material was sourced from the following four databases: MEDLINE, EMBASE, CINAHL and ISI Web of Science. Grey literature was searched for using ALOIS, Google Scholar and ProQuest. The COSMIN checklist was used to assess methodological quality of included studies. Assessment was completed by two reviewers independently. Reliability and validity data was extracted from included studies using standardized Joanna Briggs Institute data collection forms. Extraction was completed by two reviewers. A narrative synthesis of measurement properties of the tools used to measure physical function was performed. Quantitative meta-analysis was conducted for Intra-Class Correlation Coefficients only. With respect to relative reliability, studies reporting assessed measures had intraclass correlation coefficients greater than 0.71, indicating their suitability for use at a group level. However, a consistent finding among studies that included assessment of absolute reliability was that intra individual variation was too large for meaningful measurement of individuals. This was indicated by large Minimal Detectable Change (MDC) scores. Walk Speed has the smallest reported Mimimal Detectable Change score at 0.11m/s. This represented a change of 35% before statistical variation could be eliminated as the cause for this change. All measures had large MDC values. Walk Speed had the smallest MDC values at 0.11m/s, which represented a necessary change of 35%. Only a limited number of studies assessed the validity of measures. This supports the use of these measures in a very narrow selection of circumstances (see Summary of Findings). In summary, measures have shown appropriate levels of relative reliability. This supports their use at the group level. However, large levels of intra-individual variation undermine their applicability at the individual level. Limited studies of validity were available to this review, which limits a conclusion on whether measures are valid for people with dementia.
Spanish version of the Kidney Disease Knowledge Survey (KiKS) in Peru: cross-cultural adaptation and validation.

PubMed

Mota-Anaya, Evelin; Yumpo-Cárdenas, Daniel; Alva-Bravo, Edmundo; Wright-Nunes, Julie; Mayta-Tristán, Percy

2016-08-08

Chronic kidney disease (CKD) affects 50 million people globally. Several studies show the importance of implementing interventions that enhance patients knowledge about their disease. In 2011 the Kidney Disease Knowledge Survey (KiKS) was developed: a questionnaire that assesses the specific knowledge about chronic kidney disease in pre-dialysis patients. To translate to Spanish, culturally adapt and validate the Kidney Disease Knowledge Survey questionnaire in a population of patients with pre-dialysis chronic kidney disease. We carried out a Spanish translation and cross-cultural adaptation of the Kidney Disease Knowledge Survey questionnaire. Subsequently, we determined its validity and reliability. We determined the validity through construct validity; and reliability by evaluating its internal consistency and its intra-observer reliability (test-retest). We found a good internal consistency (Kuder-Richardson = 0.85). The intra-observer reliability was measured by the intra-class correlation coefficient that yielded a value of 0.78 (95% CI: 0.5-1.0). This value indicated a good reproducibility; also, the mean difference of -1.1 test-retest SD 6.0 (p = 0.369) confirms this finding. The translated Spanish version of the Kidney Disease Knowledge Survey is acceptable and equivalent to the original version; it also has a good reliability, validity and reproducibility. Therefore, it can be used in a population of patients with pre-dialysis chronic kidney disease.
Validation of the Hindi version of National Institute of Health Stroke Scale.

PubMed

Prasad, Kameshwar; Dash, Deepa; Kumar, Amit

2012-01-01

To determine the reliability and validity of the National Institute of Health Stroke Scale (NIHSS) with the Hindi and Indian adaptation of items 9 and 10. NIHSS items 9 and 10 were modified and culturally adapted at All India Institute of Medical Sciences (AIIMS) and the resulting version was termed as Hindi version (HV-NIHSS). HV-NIHSS was applied by two independent investigators on 107 patients with stroke. Inter-observer agreement and intra-class correlation coefficients were calculated. The predictive validity of the HV-NIHSS was calculated using functional outcome after three months in the form of modified Rankin Scale (mRS) and Barthel Index (BI). The study included 107 patients of stroke recruited from a tertiary referral hospital at Delhi between November 1, 2009, and October 1, 2010; the mean age of these patients was 56.26±13.84 years and 65.4% of them had suffered ischemic stroke. Inter-rater reliability was high between the two examiners, with Pearson's r ranging from 0.72 to 0.99 for the 15 items on the Scale. Intra-class correlation coefficient for the total score was 0.995 (95% CI-0.993-0.997). Concurrent construct validity was established between HV-NIHSS and baseline Glasgow Coma Scale, with a high correlation (Spearman coefficient = -0.863, P<.001). Predictive validity was also established with BI at three months (Spearman's rho: -0.829, P<.001) and with mRS at three months (Spearman's rho: 0.851, P<0.001). This study shows that a Hindi language version of the NIHSS developed at AIIMS appears reliable and valid when applied to a Hindi-speaking population.
The Reliability and Validity of the Perceived Dietary Adherence Questionnaire for People with Type 2 Diabetes

PubMed Central

Asaad, Ghada; Sadegian, Maryam; Lau, Rita; Xu, Yunke; Soria-Contreras, Diana C.; Bell, Rhonda C.; Chan, Catherine B.

2015-01-01

Nutrition therapy is essential for diabetes treatment, and assessment of dietary intake can be time consuming. The purpose of this study was to develop a reliable and valid instrument to measure diabetic patients’ adherence to Canadian diabetes nutrition recommendations. Specific information derived from three, repeated 24-h dietary recalls of 64 type 2 diabetic patients, aged 59.2 ± 9.7 years, was correlated with a total score and individual items of the Perceived Dietary Adherence Questionnaire (PDAQ). Test-retest reliability was completed by 27 type 2 diabetic patients, aged 62.8 ± 8.4 years. The correlation coefficients for PDAQ items versus 24-h recalls ranged from 0.46 to 0.11. The intra-class correlation (0.78) was acceptable, indicating good reliability. The results suggest that PDAQ is a valid and reliable measure of diabetes nutrition recommendations. Because it is quick to administer and score, it may be useful as a screening tool in research and as a clinical tool to monitor dietary adherence. PMID:26198247
The Arthroscopic Surgical Skill Evaluation Tool (ASSET)

PubMed Central

Koehler, Ryan J.; Amsdell, Simon; Arendt, Elizabeth A; Bisson, Leslie J; Braman, Jonathan P; Butler, Aaron; Cosgarea, Andrew J; Harner, Christopher D; Garrett, William E; Olson, Tyson; Warme, Winston J.; Nicandri, Gregg T.

2014-01-01

Background Surgeries employing arthroscopic techniques are among the most commonly performed in orthopaedic clinical practice however, valid and reliable methods of assessing the arthroscopic skill of orthopaedic surgeons are lacking. Hypothesis The Arthroscopic Surgery Skill Evaluation Tool (ASSET) will demonstrate content validity, concurrent criterion-oriented validity, and reliability, when used to assess the technical ability of surgeons performing diagnostic knee arthroscopy on cadaveric specimens. Study Design Cross-sectional study; Level of evidence, 3 Methods Content validity was determined by a group of seven experts using a Delphi process. Intra-articular performance of a right and left diagnostic knee arthroscopy was recorded for twenty-eight residents and two sports medicine fellowship trained attending surgeons. Subject performance was assessed by two blinded raters using the ASSET. Concurrent criterion-oriented validity, inter-rater reliability, and test-retest reliability were evaluated. Results Content validity: The content development group identified 8 arthroscopic skill domains to evaluate using the ASSET. Concurrent criterion-oriented validity: Significant differences in total ASSET score (p<0.05) between novice, intermediate, and advanced experience groups were identified. Inter-rater reliability: The ASSET scores assigned by each rater were strongly correlated (r=0.91, p <0.01) and the intra-class correlation coefficient between raters for the total ASSET score was 0.90. Test-retest reliability: there was a significant correlation between ASSET scores for both procedures attempted by each individual (r = 0.79, p<0.01). Conclusion The ASSET appears to be a useful, valid, and reliable method for assessing surgeon performance of diagnostic knee arthroscopy in cadaveric specimens. Studies are ongoing to determine its generalizability to other procedures as well as to the live OR and other simulated environments. PMID:23548808
The Reliability of Anthropometric Measurements Used Preoperatively in Aesthetic Breast Surgery.

PubMed

Isaac, Kathryn V; Murphy, Blake D; Beber, Brett; Brown, Mitchell

2016-04-01

Patient outcomes in aesthetic breast surgery are highly dependent on breast measurements used in preoperative planning. The purpose of this study is to determine the reliability of anthropometric breast measurements. Four raters measured 28 women using 7 measurements: sternal notch to nipple distance (Sn-N), nipple to midline (N-M), nipple to inframammary-fold distance under maximal stretch (N-IMF), breast base width (BW), soft tissue pinch thickness of the upper pole (STPT:UP), STPT at the inframammary fold (STPT:IMF), and anterior pull skin stretch (APSS). Reliability was assessed using intra-class correlation coefficients (ICCs). Inter-rater reliability was excellent for Sn-N, N-M, and BW (ICC = 0.94, 0.90, and 0.76, respectively) and was good for N-IMF (ICC = 0.70). The STPT:UP, STPT:IMF, and APSS measurements were not reliable between raters (ICC < 0.2). Intra-rater reliability was excellent for Sn-N, N-M, and BW for all raters (all ICC > 0.75). The N-IMF intra-rater reliability was excellent in senior raters (ICC > 0.75) and good in junior raters (ICC > 0.6). The STPT:UP, STPT:IMF, and APSS measurements showed fair or poor reliability for most raters (ICC < 0.6). The Sn-N, N-M, and BW measurements are very reliable. Dynamic measurements including APSS, STPT:UP, and STUP:IMF are unreliable. N-IMF is the only reliable dynamic measurement, and its reliability improves with increasing clinical experience. The variable reliability of preoperative measurements must be considered in the planning of aesthetic breast surgery. 4 Diagnostic. © 2015 The American Society for Aesthetic Plastic Surgery, Inc. Reprints and permission: journals.permissions@oup.com.
Measuring family-centred practices of professionals in early intervention services in Taiwan.

PubMed

Kang, L-J; Palisano, R J; Simeonsson, R J; Hwang, A-W

2017-09-01

Family-centred practices emphasize professional supports for forming partnerships with families in early intervention. The Measure of Processes of Care for Service Providers (MPOC-SP) measures the perceptions of paediatric service providers in supporting children and families. This study aimed to establish reliability of the Chinese version of the MPOC-SP (C-MPOC-SP) and to examine professional perceptions of family-centred practices in relation to professional discipline and years of experience. A convenience sample of 94 physical therapists, occupational therapists, speech-language pathologists, social workers and early childhood educators completed the C-MPOC-SP. Thirty-seven professionals completed the measure a second time within 2-4 weeks for test-retest reliability. Internal consistency and test-retest reliability were examined by Cronbach's α and intra-class correlation coefficient. Comparisons were made across professional disciplines by multivariate analyses of variance followed by analyses of variance. Relationships between years of experience and ratings of family-centred practices were examined by Pearson's correlation coefficients (r). Cronbach's α for items on each of the four scales of the C-MPOC-SP ranged from 0.80 to 0.92, indicating adequate internal consistency. Intra-class correlation coefficient between the initial and repeat completion of the C-MPOC-SP for each scale ranged from 0.56 to 0.77, indicating adequate to excellent test-retest reliability. Mean ratings for the Communicating Specific Information were significantly higher for physical therapists, occupational therapists and speech-language pathologists than for social workers (P = 0.001). The C-MPOC-SP scores were positively correlated with years of experience for all four scales (r = 0.23-0.38; P < 0.05). This study established adequate internal consistency and adequate to excellent test-retest reliability of the C-MPOC-SP in measuring perceptions of family centeredness of early intervention service providers. Cross-discipline differences were found in communicating specific information about the child. Higher perceptions of family centeredness were associated with more years of experience. The results support the utility of the C-MPOC-SP in professional education and programme evaluation of early intervention services in Taiwan. © 2017 John Wiley & Sons Ltd.
Reliability of the School Food Checklist for in-school audits and photograph analysis of children's packed lunches.

PubMed

Mitchell, S A; Miles, C L; Brennan, L; Matthews, J

2010-02-01

Assessment of children's diets is problematic, typically relying on error-prone parent or child recall or reporting, or resource intensive direct observation. The School Food Checklist (SFC) is an objective instrument comprising of 20 food and beverage categories designed to measure the foods contained in children's packed lunches. The present study aimed to assess intra-rater and inter-rater reliability of each of the food and beverage categories of the SFC for both in-school audits and photograph analysis of children's school lunches. Participants comprised 176 children aged 5-8 years from five primary schools in Northern Metropolitan Melbourne. The SFC was used to measure the foods contained in children's packed lunches in the school setting and using photographs. Photograph analysis was conducted by the auditors 2-3 months after completion of in-school audits. Both intra-rater [intra-class correlation coefficient (ICC) = 0.78-1] and inter-rater (ICC = 0.50-0.95) reliability analysis indicated strong agreement for in-school auditing. With the exception of the food category titled 'leftovers', there was strong intra-rater reliability for auditors' live audits and their analysis of photographs [ICC = 0.57-0.98 (Auditor 1); ICC = 0.72-0.90 (Auditor 2)], and strong inter-rater reliability for photograph analysis (ICC = 0.68-0.92). The SFC is a reliable method of measuring the foods and beverages contained in children's packed lunches when used in the school setting or for photograph analysis. This finding has broad implications, particularly for the use of photograph analysis, because this approach offers a convenient and cost effective method of measuring what food and beverages children bring to school in home packed lunches.

Effect of image resolution manipulation in rearfoot angle measurements obtained with photogrammetry

PubMed Central

Sacco, I.C.N.; Picon, A.P.; Ribeiro, A.P.; Sartor, C.D.; Camargo-Junior, F.; Macedo, D.O.; Mori, E.T.T.; Monte, F.; Yamate, G.Y.; Neves, J.G.; Kondo, V.E.; Aliberti, S.

2012-01-01

The aim of this study was to investigate the influence of image resolution manipulation on the photogrammetric measurement of the rearfoot static angle. The study design was that of a reliability study. We evaluated 19 healthy young adults (11 females and 8 males). The photographs were taken at 1536 pixels in the greatest dimension, resized into four different resolutions (1200, 768, 600, 384 pixels) and analyzed by three equally trained examiners on a 96-pixels per inch (ppi) screen. An experienced physiotherapist marked the anatomic landmarks of rearfoot static angles on two occasions within a 1-week interval. Three different examiners had marked angles on digital pictures. The systematic error and the smallest detectable difference were calculated from the angle values between the image resolutions and times of evaluation. Different resolutions were compared by analysis of variance. Inter- and intra-examiner reliability was calculated by intra-class correlation coefficients (ICC). The rearfoot static angles obtained by the examiners in each resolution were not different (P > 0.05); however, the higher the image resolution the better the inter-examiner reliability. The intra-examiner reliability (within a 1-week interval) was considered to be unacceptable for all image resolutions (ICC range: 0.08-0.52). The whole body image of an adult with a minimum size of 768 pixels analyzed on a 96-ppi screen can provide very good inter-examiner reliability for photogrammetric measurements of rearfoot static angles (ICC range: 0.85-0.92), although the intra-examiner reliability within each resolution was not acceptable. Therefore, this method is not a proper tool for follow-up evaluations of patients within a therapeutic protocol. PMID:22911379
Effect of image resolution manipulation in rearfoot angle measurements obtained with photogrammetry.

PubMed

Sacco, I C N; Picon, A P; Ribeiro, A P; Sartor, C D; Camargo-Junior, F; Macedo, D O; Mori, E T T; Monte, F; Yamate, G Y; Neves, J G; Kondo, V E; Aliberti, S

2012-09-01

The aim of this study was to investigate the influence of image resolution manipulation on the photogrammetric measurement of the rearfoot static angle. The study design was that of a reliability study. We evaluated 19 healthy young adults (11 females and 8 males). The photographs were taken at 1536 pixels in the greatest dimension, resized into four different resolutions (1200, 768, 600, 384 pixels) and analyzed by three equally trained examiners on a 96-pixels per inch (ppi) screen. An experienced physiotherapist marked the anatomic landmarks of rearfoot static angles on two occasions within a 1-week interval. Three different examiners had marked angles on digital pictures. The systematic error and the smallest detectable difference were calculated from the angle values between the image resolutions and times of evaluation. Different resolutions were compared by analysis of variance. Inter- and intra-examiner reliability was calculated by intra-class correlation coefficients (ICC). The rearfoot static angles obtained by the examiners in each resolution were not different (P > 0.05); however, the higher the image resolution the better the inter-examiner reliability. The intra-examiner reliability (within a 1-week interval) was considered to be unacceptable for all image resolutions (ICC range: 0.08-0.52). The whole body image of an adult with a minimum size of 768 pixels analyzed on a 96-ppi screen can provide very good inter-examiner reliability for photogrammetric measurements of rearfoot static angles (ICC range: 0.85-0.92), although the intra-examiner reliability within each resolution was not acceptable. Therefore, this method is not a proper tool for follow-up evaluations of patients within a therapeutic protocol.
Inter-rater reliability of kinesthetic measurements with the KINARM robotic exoskeleton.

PubMed

Semrau, Jennifer A; Herter, Troy M; Scott, Stephen H; Dukelow, Sean P

2017-05-22

Kinesthesia (sense of limb movement) has been extremely difficult to measure objectively, especially in individuals who have survived a stroke. The development of valid and reliable measurements for proprioception is important to developing a better understanding of proprioceptive impairments after stroke and their impact on the ability to perform daily activities. We recently developed a robotic task to evaluate kinesthetic deficits after stroke and found that the majority (~60%) of stroke survivors exhibit significant deficits in kinesthesia within the first 10 days post-stroke. Here we aim to determine the inter-rater reliability of this robotic kinesthetic matching task. Twenty-five neurologically intact control subjects and 15 individuals with first-time stroke were evaluated on a robotic kinesthetic matching task (KIN). Subjects sat in a robotic exoskeleton with their arms supported against gravity. In the KIN task, the robot moved the subjects' stroke-affected arm at a preset speed, direction and distance. As soon as subjects felt the robot begin to move their affected arm, they matched the robot movement with the unaffected arm. Subjects were tested in two sessions on the KIN task: initial session and then a second session (within an average of 18.2 ± 13.8 h of the initial session for stroke subjects), which were supervised by different technicians. The task was performed both with and without the use of vision in both sessions. We evaluated intra-class correlations of spatial and temporal parameters derived from the KIN task to determine the reliability of the robotic task. We evaluated 8 spatial and temporal parameters that quantify kinesthetic behavior. We found that the parameters exhibited moderate to high intra-class correlations between the initial and retest conditions (Range, r-value = [0.53-0.97]). The robotic KIN task exhibited good inter-rater reliability. This validates the KIN task as a reliable, objective method for quantifying kinesthesia after stroke.
Validation of the FASH (Functional Assessment Scale for Acute Hamstring Injuries) questionnaire for German-speaking football players.

PubMed

Lohrer, Heinz; Nauck, Tanja; Korakakis, Vasileios; Malliaropoulos, Nikos

2016-10-24

The FASH (Functional Assessment Scale for Acute Hamstring Injuries) questionnaire has been recently developed as a disease-specific self-administered questionnaire for use in Greek, English, and German languages. Its psychometric qualities (validity and reliability) were tested only in Greek-speaking patients mainly representing track and field athletes. As hamstring injuries represent the most common football injury, we tested the validity and reliability of the FASH-G (G = German version) questionnaire in German-speaking footballers suffering from acute hamstring injuries. The FASH-G questionnaire was tested for reliability and validity, in 16 footballers with hamstring injuries (patients' group), 77 asymptomatic footballers (healthy group), and 19 field hockey players (at-risk group). Known-group validity was tested by comparing the total FASH-G scores of the injured and non-injured groups. Reliability of the FASH-G questionnaire was analysed in 18 asymptomatic footballers using the intra-class coefficient. Known-group validity was demonstrated by significant differences between injured and non-injured participants (p < 0.001). The FASH-G exhibited very good test-retest reliability (intra-class correlation coefficient = 0.982, p < 0.001). Internal consistency was excellent (α = 0.938). Compared with the results presented in the original publication, no statistical differences were found between healthy athletes (p = 0.257), but patients' groups and at-risk groups presented scoring differences (p = 0.040 and <0.001, respectively). The FASH-G is a valid and reliable instrument to assess and determine the severity of hamstring injuries in German footballers.
Development of a Peer Teaching-Assessment Program and a Peer Observation and Evaluation Tool

PubMed Central

Trujillo, Jennifer M.; Barr, Judith; Gonyeau, Michael; Van Amburgh, Jenny A.; Matthews, S. James; Qualters, Donna

2008-01-01

Objectives To develop a formalized, comprehensive, peer-driven teaching assessment program and a valid and reliable assessment tool. Methods A volunteer taskforce was formed and a peer-assessment program was developed using a multistep, sequential approach and the Peer Observation and Evaluation Tool (POET). A pilot study was conducted to evaluate the efficiency and practicality of the process and to establish interrater reliability of the tool. Intra-class correlation coefficients (ICC) were calculated. Results ICCs for 8 separate lectures evaluated by 2-3 observers ranged from 0.66 to 0.97, indicating good interrater reliability of the tool. Conclusion Our peer assessment program for large classroom teaching, which includes a valid and reliable evaluation tool, is comprehensive, feasible, and can be adopted by other schools of pharmacy. PMID:19325963
Hand assessment in older adults with musculoskeletal hand problems: a reliability study.

PubMed

Myers, Helen L; Thomas, Elaine; Hay, Elaine M; Dziedzic, Krysia S

2011-01-07

Musculoskeletal hand pain is common in the general population. This study aims to investigate the inter- and intra-observer reliability of two trained observers conducting a simple clinical interview and physical examination for hand problems in older adults. The reliability of applying the American College of Rheumatology (ACR) criteria for hand osteoarthritis to community-dwelling older adults will also be investigated. Fifty-five participants aged 50 years and over with a current self-reported hand problem and registered with one general practice were recruited from a previous health questionnaire study. Participants underwent a standardised, structured clinical interview and physical examination by two independent trained observers and again by one of these observers a month later. Agreement beyond chance was summarised using Kappa statistics and intra-class correlation coefficients. Median values for inter- and intra-observer reliability for clinical interview questions were found to be "substantial" and "moderate" respectively [median agreement beyond chance (Kappa) was 0.75 (range: -0.03, 0.93) for inter-observer ratings and 0.57 (range: -0.02, 1.00) for intra-observer ratings]. Inter- and intra-observer reliability for physical examination items was variable, with good reliability observed for some items, such as grip and pinch strength, and poor reliability observed for others, notably assessment of altered sensation, pain on resisted movement and judgements based on observation and palpation of individual features at single joints, such as bony enlargement, nodes and swelling. Moderate agreement was observed both between and within observers when applying the ACR criteria for hand osteoarthritis. Standardised, structured clinical interview is reliable for taking a history in community-dwelling older adults with self reported hand problems. Agreement between and within observers for physical examination items is variable. Low Kappa values may have resulted, in part, from a low prevalence of clinical signs and symptoms in the study participants. The decision to use clinical interview and hand assessment variables in clinical practice or further research in primary care should include consideration of clinical applicability and training alongside reliability. Further investigation is required to determine the relationship between these clinical questions and assessments and the clinical course of hand pain and hand problems in community-dwelling older adults.
Quality of Radiomic Features in Glioblastoma Multiforme: Impact of Semi-Automated Tumor Segmentation Software

PubMed Central

Lee, Myungeun; Woo, Boyeong; Kuo, Michael D.; Jamshidi, Neema

2017-01-01

Objective The purpose of this study was to evaluate the reliability and quality of radiomic features in glioblastoma multiforme (GBM) derived from tumor volumes obtained with semi-automated tumor segmentation software. Materials and Methods MR images of 45 GBM patients (29 males, 16 females) were downloaded from The Cancer Imaging Archive, in which post-contrast T1-weighted imaging and fluid-attenuated inversion recovery MR sequences were used. Two raters independently segmented the tumors using two semi-automated segmentation tools (TumorPrism3D and 3D Slicer). Regions of interest corresponding to contrast-enhancing lesion, necrotic portions, and non-enhancing T2 high signal intensity component were segmented for each tumor. A total of 180 imaging features were extracted, and their quality was evaluated in terms of stability, normalized dynamic range (NDR), and redundancy, using intra-class correlation coefficients, cluster consensus, and Rand Statistic. Results Our study results showed that most of the radiomic features in GBM were highly stable. Over 90% of 180 features showed good stability (intra-class correlation coefficient [ICC] ≥ 0.8), whereas only 7 features were of poor stability (ICC < 0.5). Most first order statistics and morphometric features showed moderate-to-high NDR (4 > NDR ≥1), while above 35% of the texture features showed poor NDR (< 1). Features were shown to cluster into only 5 groups, indicating that they were highly redundant. Conclusion The use of semi-automated software tools provided sufficiently reliable tumor segmentation and feature stability; thus helping to overcome the inherent inter-rater and intra-rater variability of user intervention. However, certain aspects of feature quality, including NDR and redundancy, need to be assessed for determination of representative signature features before further development of radiomics. PMID:28458602
Quality of Radiomic Features in Glioblastoma Multiforme: Impact of Semi-Automated Tumor Segmentation Software.

PubMed

Lee, Myungeun; Woo, Boyeong; Kuo, Michael D; Jamshidi, Neema; Kim, Jong Hyo

2017-01-01

The purpose of this study was to evaluate the reliability and quality of radiomic features in glioblastoma multiforme (GBM) derived from tumor volumes obtained with semi-automated tumor segmentation software. MR images of 45 GBM patients (29 males, 16 females) were downloaded from The Cancer Imaging Archive, in which post-contrast T1-weighted imaging and fluid-attenuated inversion recovery MR sequences were used. Two raters independently segmented the tumors using two semi-automated segmentation tools (TumorPrism3D and 3D Slicer). Regions of interest corresponding to contrast-enhancing lesion, necrotic portions, and non-enhancing T2 high signal intensity component were segmented for each tumor. A total of 180 imaging features were extracted, and their quality was evaluated in terms of stability, normalized dynamic range (NDR), and redundancy, using intra-class correlation coefficients, cluster consensus, and Rand Statistic. Our study results showed that most of the radiomic features in GBM were highly stable. Over 90% of 180 features showed good stability (intra-class correlation coefficient [ICC] ≥ 0.8), whereas only 7 features were of poor stability (ICC < 0.5). Most first order statistics and morphometric features showed moderate-to-high NDR (4 > NDR ≥1), while above 35% of the texture features showed poor NDR (< 1). Features were shown to cluster into only 5 groups, indicating that they were highly redundant. The use of semi-automated software tools provided sufficiently reliable tumor segmentation and feature stability; thus helping to overcome the inherent inter-rater and intra-rater variability of user intervention. However, certain aspects of feature quality, including NDR and redundancy, need to be assessed for determination of representative signature features before further development of radiomics.
Reliability of the measures of weight-bearing distribution obtained during quiet stance by digital scales in subjects with and without hemiparesis.

PubMed

de Araujo-Barbosa, Paulo Henrique Ferreira; de Menezes, Lidiane Teles; Costa, Abraão Souza; Couto Paz, Clarissa Cardoso Dos Santos; Fachin-Martins, Emerson

2015-05-01

Described as an alternative way of assessing weight-bearing asymmetries, the measures obtained from digital scales have been used as an index to classify weight-bearing distribution. This study aimed to describe the intra-test and the test/retest reliability of measures in subjects with and without hemiparesis during quiet stance. The percentage of body weight borne by one limb was calculated for a sample of subjects with hemiparesis and for a control group that was matched by gender and age. A two-way analysis of variance was used to verify the intra-test reliability. This analysis was calculated using the differences between the averages of the measures obtained during single, double or triple trials. The intra-class correlation coefficient (ICC) was utilized and data plotted using the Bland-Altman method. The intra-test analysis showed significant differences, only observed in the hemiparesis group, between the measures obtained by single and triple trials. Excellent and moderate ICC values (0.69-0.84) between test and retest were observed in the hemiparesis group, while for control groups ICC values (0.41-0.74) were classified as moderate, progressing from almost poor for measures obtained by a single trial to almost excellent for those obtained by triple trials. In conclusion, good reliability ranging from moderate to excellent classifications was found for participants with and without hemiparesis. Moreover, an improvement of the repeatability was observed with fewer trials for participants with hemiparesis, and with more trials for participants without hemiparesis.
CROSS-CULTURAL ADAPTATION AND VALIDATION OF THE KOREAN VERSION OF THE CUMBERLAND ANKLE INSTABILITY TOOL.

PubMed

Ko, Jupil; Rosen, Adam B; Brown, Cathleen N

2015-12-01

The Cumberland Ankle Instability Tool (CAIT) is a valid and reliable patient reported outcome used to assess the presence and severity of chronic ankle instability (CAI). The CAIT has been cross-culturally adapted into other languages for use in non-English speaking populations. However, there are no valid questionnaires to assess CAI in individuals who speak Korean. The purpose of this study was to translate, cross-culturally adapt, and validate the CAIT, for use in a Korean-speaking population with CAI. Cross-cultural reliability study. The CAIT was cross-culturally adapted into Korean according to accepted guidelines and renamed the Cumberland Ankle Instability Tool-Korean (CAIT-K). Twenty-three participants (12 males, 11 females) who were bilingual in English and Korean were recruited and completed the original and adapted versions to assess agreement between versions. An additional 168 national level Korean athletes (106 male, 62 females; age = 20.3 ± 1.1 yrs), who participated in ≥ 90 minutes of physical activity per week, completed the final version of the CAIT-K twice within 14 days. Their completed questionnaires were assessed for internal consistency, test-retest reliability, criterion validity, and construct validity. For bilingual participants, intra-class correlation coefficients (ICC2,1) between the CAIT and the CAIT-K for test-retest reliability were 0.95 (SEM=1.83) and 0.96 (SEM=1.50) in right and left limbs, respectively. The Cronbach's alpha coefficients were 0.92 and 0.90 for the CAIT-K in right and left limbs, respectively. For native Korean speakers, the CAIT-K had high internal consistency (Cronbach's α=0.89) and intra-class correlation coefficient (ICC2,1 = 0.94, SEM=1.72), correlation with the physical component score (rho=0.70, p = 0.001) of the Short-Form Health Survey (SF-36), and the Kaiser-Meyer-Olkin score was 0.87. The original CAIT was translated, cross-culturally adapted, and validated from English to Korean. The CAIT-K appears to be valid and reliable and could be useful in assessing the Korean speaking population with CAI.
Reliability of Semi-Automated Segmentations in Glioblastoma.

PubMed

Huber, T; Alber, G; Bette, S; Boeckh-Behrens, T; Gempt, J; Ringel, F; Alberts, E; Zimmer, C; Bauer, J S

2017-06-01

In glioblastoma, quantitative volumetric measurements of contrast-enhancing or fluid-attenuated inversion recovery (FLAIR) hyperintense tumor compartments are needed for an objective assessment of therapy response. The aim of this study was to evaluate the reliability of a semi-automated, region-growing segmentation tool for determining tumor volume in patients with glioblastoma among different users of the software. A total of 320 segmentations of tumor-associated FLAIR changes and contrast-enhancing tumor tissue were performed by different raters (neuroradiologists, medical students, and volunteers). All patients underwent high-resolution magnetic resonance imaging including a 3D-FLAIR and a 3D-MPRage sequence. Segmentations were done using a semi-automated, region-growing segmentation tool. Intra- and inter-rater-reliability were addressed by intra-class-correlation (ICC). Root-mean-square error (RMSE) was used to determine the precision error. Dice score was calculated to measure the overlap between segmentations. Semi-automated segmentation showed a high ICC (> 0.985) for all groups indicating an excellent intra- and inter-rater-reliability. Significant smaller precision errors and higher Dice scores were observed for FLAIR segmentations compared with segmentations of contrast-enhancement. Single rater segmentations showed the lowest RMSE for FLAIR of 3.3 % (MPRage: 8.2 %). Both, single raters and neuroradiologists had the lowest precision error for longitudinal evaluation of FLAIR changes. Semi-automated volumetry of glioblastoma was reliably performed by all groups of raters, even without neuroradiologic expertise. Interestingly, segmentations of tumor-associated FLAIR changes were more reliable than segmentations of contrast enhancement. In longitudinal evaluations, an experienced rater can detect progressive FLAIR changes of less than 15 % reliably in a quantitative way which could help to detect progressive disease earlier.
Computing Inter-Rater Reliability for Observational Data: An Overview and Tutorial

PubMed Central

Hallgren, Kevin A.

2012-01-01

Many research designs require the assessment of inter-rater reliability (IRR) to demonstrate consistency among observational ratings provided by multiple coders. However, many studies use incorrect statistical procedures, fail to fully report the information necessary to interpret their results, or do not address how IRR affects the power of their subsequent analyses for hypothesis testing. This paper provides an overview of methodological issues related to the assessment of IRR with a focus on study design, selection of appropriate statistics, and the computation, interpretation, and reporting of some commonly-used IRR statistics. Computational examples include SPSS and R syntax for computing Cohen’s kappa and intra-class correlations to assess IRR. PMID:22833776
Validity and reliability of a Nigerian-Yoruba version of the stroke-specific quality of life scale 2.0.

PubMed

Odetunde, Marufat Oluyemisi; Akinpelu, Aderonke Omobonike; Odole, Adesola Christiana

2017-10-19

Psychometric evidence is necessary to establish scientific integrity and clinical usefulness of translations and cultural adaptations of the Stroke-Specific Quality of Life (SS-QoL) scale. However, the limited evidence on psychometrics of Yoruba version of SS-QoL 2.0 (SS-QoL(Y)) is a significant shortcoming. This study assessed the test-retest reliability, internal consistency, convergent, divergent, discriminant and known-group validity of the SS-QoL(Y). Yoruba version of the WHOQoL-BREF was used to test the convergent and divergent validity of the SS-QoL(Y) among 100 consenting stroke survivors. The WHOQoL-BREF and SS-QoL(Y) was administered randomly in order to eliminate bias. The test-retest reliability of the SS-QoL(Y) was carried out among 68 of the respondents within an interval of 7 days. All respondents were purposively recruited from selected secondary and tertiary health facilities in South-west Nigeria. Data were analysed using descriptive statistics of mean and standard deviation, and inferential statistics of Spearman correlation, Cronbach's alpha, Intra-class Correlation Coefficient (ICC), Independent t-test and One-way ANOVA. Alpha level was set at p < 0.05. The physical health, psychological health, social relationship and environment domains on WHOQoL-BREF with correlation coefficient that ranged from 0.214 to 0.360 showed significant correlation with similar domains on SS-QoL(Y). Dissimilar domains between the two scales had r values from 0.035 to 0.366. Discriminant validity of SS-QoL(Y) showed that items' r value ranged from 0.711 to 0.920 with their hypothesized domains. The scale demonstrated moderate to strong test-retest reliability with Intra-class correlation coefficient (ICC) for the domains and overall scores (r = 0.47 to 0.81) and moderate to high internal consistency (Cronbach's alpha =0.61 to 0.82) for domains scores. These correlations were also significant for the domains and overall scores (p < 0.05). There were no significant differences across different age groups or gender for the domains or overall scores of SS-QoL(Y). Discriminant and known-group validity, test-retest reliability and internal consistency of the Yoruba version of the Stroke Specific Quality of Life 2.0 are adequate while the convergent and divergent validity are low but acceptable. The SS-QoL(Y) is recommended for assessing health-related quality of life among Yoruba stroke survivors.
Reliability and validity of tongue color analysis in the prediction of symptom patterns in terms of East Asian Medicine.

PubMed

Park, Young-Jae; Lee, Jin-Moo; Yoo, Seung-Yeon; Park, Young-Bae

2016-04-01

To examine whether color parameters of tongue inspection (TI) using a digital camera was reliable and valid, and to examine which color parameters serve as predictors of symptom patterns in terms of East Asian medicine (EAM). Two hundred female subjects' tongue substances were photographed by a mega-pixel digital camera. Together with the photographs, the subjects were asked to complete Yin deficiency, Phlegm pattern, and Cold-Heat pattern questionnaires. Using three sets of digital imaging software, each digital image was exposure- and white balance-corrected, and finally L* (luminance), a* (red-green balance), and b* (yellow-blue balance) values of the tongues were calculated. To examine intra- and inter-rater reliabilities and criterion validity of the color analysis method, three raters were asked to calculate color parameters for 20 digital image samples. Finally, four hierarchical regression models were formed. Color parameters showed good or excellent reliability (0.627-0.887 for intra-class correlation coefficients) and significant criterion validity (0.523-0.718 for Spearman's correlation). In the hierarchical regression models, age was a significant predictor of Yin deficiency (β = 0.192), and b* value of the tip of the tongue was a determinant predictor of Yin deficiency, Phlegm, and Heat patterns (β = - 0.212, - 0.172, and - 0.163). Luminance (L*) was predictive of Yin deficiency (β = -0.172) and Cold (β = 0.173) pattern. Our results suggest that color analysis of the tongue using the L*a*b* system is reliable and valid, and that color parameters partially serve as symptom pattern predictors in EAM practice.
Translation, Cultural Adaptation and Validation of the Simple Shoulder Test to Spanish

PubMed Central

Arcuri, Francisco; Barclay, Fernando; Nacul, Ivan

2015-01-01

Background: The validation of widely used scales facilitates the comparison across international patient samples. Objective: The objective was to translate, culturally adapt and validate the Simple Shoulder Test into Argentinian Spanish. Methods: The Simple Shoulder Test was translated from English into Argentinian Spanish by two independent translators, translated back into English and evaluated for accuracy by an expert committee to correct the possible discrepancies. It was then administered to 50 patients with different shoulder conditions.Psycometric properties were analyzed including internal consistency, measured with Cronbach´s Alpha, test-retest reliability at 15 days with the interclass correlation coefficient. Results: The internal consistency, validation, was an Alpha of 0,808, evaluated as good. The test-retest reliability index as measured by intra-class correlation coefficient (ICC) was 0.835, evaluated as excellent. Conclusion: The Simple Shoulder Test translation and it´s cultural adaptation to Argentinian-Spanish demonstrated adequate internal reliability and validity, ultimately allowing for its use in the comparison with international patient samples.
Reliability and Validity of the Multidimensional Scale of Perceived Social Support (MSPSS): Thai Version.

PubMed

Wongpakaran, Tinakon; Wongpakaran, Nahathai; Ruktrakul, Ruk

2011-01-01

This study examines the Thai version of the Multidimensional Scale of Perceived Social Support (MSPSS) for its psychometric properties. In total 462 participants were recruited - 310 medical students from Chiang Mai University and 152 psychiatric patients, and they completed the Thai version of the MSPSS, the State Trait Anxiety Inventory (STAI), the Rosenberg Self-Esteem Scale (RSES) and the Thai Depression Inventory (TDI). Test-retest reliability was conducted over a four week period. Factor analysis produced three-factor solutions for both patient (PG) and student groups (SG), and overall the model demonstrated adequate fit indices. The mean total score and the sub-scale score for the SG were statistically higher than those in the PG, except for 'Significant Others'. The internal consistency of the scale was good, with a Cronbach's alpha of 0.91 for the SG and 0.87 for the PG. After a four week retest for reliability exercise, the intra-class correlation coefficient (ICC) was found to be 0.84. The Thai-MSPSS was found to have a negative correlation with the STAI and the TDI, but was positively correlated with the RSES. The Thai MSPSS is a reliable and valid instrument to use.
Validity and reliability of isometric muscle strength measurements of hip abduction and abduction with external hip rotation in a bent-hip position using a handheld dynamometer with a belt.

PubMed

Aramaki, Hidefumi; Katoh, Munenori; Hiiragi, Yukinobu; Kawasaki, Tsubasa; Kurihara, Tomohisa; Ohmi, Yorikatsu

2016-07-01

[Purpose] This study aimed to investigate the relatedness, reliability, and validity of isometric muscle strength measurements of hip abduction and abduction with an external hip rotation in a bent-hip position using a handheld dynamometer with a belt. [Subjects and Methods] Twenty healthy young adults, with a mean age of 21.5 ± 0.6 years were included. Isometric hip muscle strength in the subjects' right legs was measured under two posture positions using two devices: a handheld dynamometer with a belt and an isokinetic dynamometer. Reliability was evaluated using an intra-class correlation coefficient (ICC); relatedness and validity were evaluated using Pearson's product moment correlation coefficient. Differences in measurements of devices were assessed by two-way ANOVA. [Results] ICC (1, 1) was ≥0.9; significant positive correlations in measurements were found between the two devices under both conditions. No main effect was found between the measurement values. [Conclusion] Our findings revealed that there was relatedness, reliability, and validity of this method for isometric muscle strength measurements using a handheld dynamometer with a belt.
How to assess and compare inter-rater reliability, agreement and correlation of ratings: an exemplary analysis of mother-father and parent-teacher expressive vocabulary rating pairs

PubMed Central

Stolarova, Margarita; Wolf, Corinna; Rinker, Tanja; Brielmann, Aenne

2014-01-01

This report has two main purposes. First, we combine well-known analytical approaches to conduct a comprehensive assessment of agreement and correlation of rating-pairs and to dis-entangle these often confused concepts, providing a best-practice example on concrete data and a tutorial for future reference. Second, we explore whether a screening questionnaire developed for use with parents can be reliably employed with daycare teachers when assessing early expressive vocabulary. A total of 53 vocabulary rating pairs (34 parent–teacher and 19 mother–father pairs) collected for two-year-old children (12 bilingual) are evaluated. First, inter-rater reliability both within and across subgroups is assessed using the intra-class correlation coefficient (ICC). Next, based on this analysis of reliability and on the test-retest reliability of the employed tool, inter-rater agreement is analyzed, magnitude and direction of rating differences are considered. Finally, Pearson correlation coefficients of standardized vocabulary scores are calculated and compared across subgroups. The results underline the necessity to distinguish between reliability measures, agreement and correlation. They also demonstrate the impact of the employed reliability on agreement evaluations. This study provides evidence that parent–teacher ratings of children's early vocabulary can achieve agreement and correlation comparable to those of mother–father ratings on the assessed vocabulary scale. Bilingualism of the evaluated child decreased the likelihood of raters' agreement. We conclude that future reports of agreement, correlation and reliability of ratings will benefit from better definition of terms and stricter methodological approaches. The methodological tutorial provided here holds the potential to increase comparability across empirical reports and can help improve research practices and knowledge transfer to educational and therapeutic settings. PMID:24994985
Development and validation of a Malawian version of the primary care assessment tool.

PubMed

Dullie, Luckson; Meland, Eivind; Hetlevik, Øystein; Mildestvedt, Thomas; Gjesdal, Sturla

2018-05-16

Malawi does not have validated tools for assessing primary care performance from patients' experience. The aim of this study was to develop a Malawian version of Primary Care Assessment Tool (PCAT-Mw) and to evaluate its reliability and validity in the assessment of the core primary care dimensions from adult patients' perspective in Malawi. A team of experts assessed the South African version of the primary care assessment tool (ZA-PCAT) for face and content validity. The adapted questionnaire underwent forward and backward translation and a pilot study. The tool was then used in an interviewer administered cross-sectional survey in Neno district, Malawi, to test validity and reliability. Exploratory factor analysis was performed on a random half of the sample to evaluate internal consistency, reliability and construct validity of items and scales. The identified constructs were then tested with confirmatory factor analysis. Likert scale assumption testing and descriptive statistics were done on the final factor structure. The PCAT-Mw was further tested for intra-rater and inter-rater reliability. From the responses of 631 patients, a 29-item PCAT-Mw was constructed comprising seven multi-item scales, representing five primary care dimensions (first contact, continuity, comprehensiveness, coordination and community orientation). All the seven scales achieved good internal consistency, item-total correlations and construct validity. Cronbach's alpha coefficient ranged from 0.66 to 0.91. A satisfactory goodness of fit model was achieved (GFI = 0.90, CFI = 0.91, RMSEA = 0.05, PCLOSE = 0.65). The full range of possible scores was observed for all scales. Scaling assumptions tests were achieved for all except the two comprehensiveness scales. Intra-class correlation coefficient (ICC) was 0.90 (n = 44, 95% CI 0.81-0.94, p < 0.001) for intra-rater reliability and 0.84 (n = 42, 95% CI 0.71-0.96, p < 0.001) for inter-rater reliability. Comprehensive metric analyses supported the reliability and validity of PCAT-Mw in assessing the core concepts of primary care from adult patients' experience. This tool could be used for health service research in primary care in Malawi.
Estimating the reliability of repeatedly measured endpoints based on linear mixed-effects models. A tutorial.

PubMed

Van der Elst, Wim; Molenberghs, Geert; Hilgers, Ralf-Dieter; Verbeke, Geert; Heussen, Nicole

2016-11-01

There are various settings in which researchers are interested in the assessment of the correlation between repeated measurements that are taken within the same subject (i.e., reliability). For example, the same rating scale may be used to assess the symptom severity of the same patients by multiple physicians, or the same outcome may be measured repeatedly over time in the same patients. Reliability can be estimated in various ways, for example, using the classical Pearson correlation or the intra-class correlation in clustered data. However, contemporary data often have a complex structure that goes well beyond the restrictive assumptions that are needed with the more conventional methods to estimate reliability. In the current paper, we propose a general and flexible modeling approach that allows for the derivation of reliability estimates, standard errors, and confidence intervals - appropriately taking hierarchies and covariates in the data into account. Our methodology is developed for continuous outcomes together with covariates of an arbitrary type. The methodology is illustrated in a case study, and a Web Appendix is provided which details the computations using the R package CorrMixed and the SAS software. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.

Reliability of concentrations of organophosphate pesticide metabolites in serial urine specimens from pregnancy in the Generation R Study.

PubMed

Spaan, Suzanne; Pronk, Anjoeka; Koch, Holger M; Jusko, Todd A; Jaddoe, Vincent W V; Shaw, Pamela A; Tiemeier, Henning M; Hofman, Albert; Pierik, Frank H; Longnecker, Matthew P

2015-05-01

The widespread use of organophosphate (OP) pesticides has resulted in ubiquitous exposure in humans, primarily through their diet. Exposure to OP pesticides may have adverse health effects, including neurobehavioral deficits in children. The optimal design of new studies requires data on the reliability of urinary measures of exposure. In the present study, urinary concentrations of six dialkyl phosphate (DAP) metabolites, the main urinary metabolites of OP pesticides, were determined in 120 pregnant women participating in the Generation R Study in Rotterdam. Intra-class correlation coefficients (ICCs) across serial urine specimens taken at <18, 18-25, and >25 weeks of pregnancy were determined to assess reliability. Geometric mean total DAP metabolite concentrations were 229 (GSD 2.2), 240 (GSD 2.1), and 224 (GSD 2.2) nmol/g creatinine across the three periods of gestation. Metabolite concentrations from the serial urine specimens in general correlated moderately. The ICCs for the six DAP metabolites ranged from 0.14 to 0.38 (0.30 for total DAPs), indicating weak to moderate reliability. Although the DAP metabolite levels observed in this study are slightly higher and slightly more correlated than in previous studies, the low to moderate reliability indicates a high degree of within-person variability, which presents challenges for designing well-powered epidemiological studies.
Responsiveness to change and reliability of measurement of radiographic joint space width in osteoarthritis of the knee: a systematic review.

PubMed

Reichmann, W M; Maillefert, J F; Hunter, D J; Katz, J N; Conaghan, P G; Losina, E

2011-05-01

The goal of this systematic review was to report the responsiveness to change and reliability of conventional radiographic joint space width (JSW) measurement. We searched the PubMed and Embase databases using the following search criteria: [osteoarthritis (OA) (MeSH)] AND (knee) AND (X-ray OR radiography OR diagnostic imaging OR radiology OR disease progression) AND (joint space OR JSW or disease progression). We assessed responsiveness by calculating the standardized response mean (SRM). We assessed reliability using intra- and inter-reader intra-class correlation (ICC) and coefficient of variation (CV). Random-effects models were used to pool results from multiple studies. Results were stratified by study duration, design, techniques of obtaining radiographs, and measurement method. We identified 998 articles using the search terms. Of these, 32 articles (43 estimates) reported data on responsiveness of JSW measurement and 24 (50 estimates) articles reported data on measures of reliability. The overall pooled SRM was 0.33 [95% confidence interval (CI): 0.26, 0.41]. Responsiveness of change in JSW measurement was improved substantially in studies of greater than 2 years duration (0.57). Further stratifying this result in studies of greater than 2 years duration, radiographs obtained with the knee in a flexed position yielded an SRM of 0.71. Pooled intra-reader ICC was estimated at 0.97 (95% CI: 0.92, 1.00) and the intra-reader CV estimated at 3.0 (95% CI: 2.0, 4.0). Pooled inter-reader ICC was estimated at 0.93 (95% CI: 0.86, 0.99) and the inter-reader CV estimated at 3.4% (95% CI: 1.3%, 5.5%). Measurement of JSW obtained from radiographs in persons with knee is reliable. These data will be useful to clinicians who are planning RCTs where the change in minimum JSW is the outcome of interest. Copyright © 2011 Osteoarthritis Research Society International. Published by Elsevier Ltd. All rights reserved.
Analysis of the psychometric properties of the American Orthopaedic Foot and Ankle Society Score (AOFAS) in rheumatoid arthritis patients: application of the Rasch model.

PubMed

Conceição, Cristiano Sena da; Neto, Mansueto Gomes; Neto, Anolino Costa; Mendes, Selena M D; Baptista, Abrahão Fontes; Sá, Kátia Nunes

2016-01-01

To tested the reliability and validity of Aofas in a sample of rheumatoid arthritis patients. The scale was applicable to rheumatoid arthritis patients, twice by the interviewer 1 and once by the interviewer 2. The Aofas was subjected to test-retest reliability analysis (with 20 Rheumatoid arthritis subjects). The psychometric properties were investigated using Rasch analysis on 33 Rheumatoid arthritis patients. Intra-Class Correlation Coefficient (ICC) were (0.90
Reliability and validity of a talent identification test battery for seated and standing Paralympic throws.

PubMed

Spathis, Jemima Grace; Connick, Mark James; Beckman, Emma Maree; Newcombe, Peter Anthony; Tweedy, Sean Michael

2015-01-01

Paralympic throwing events for athletes with physical impairments comprise seated and standing javelin, shot put, discus and seated club throwing. Identification of talented throwers would enable prediction of future success and promote participation; however, a valid and reliable talent identification battery for Paralympic throwing has not been reported. This study evaluates the reliability and validity of a talent identification battery for Paralympic throws. Participants were non-disabled so that impairment would not confound analyses, and results would provide an indication of normative performance. Twenty-eight non-disabled participants (13 M; 15 F) aged 23.6 years (±5.44) performed five kinematically distinct criterion throws (three seated, two standing) and nine talent identification tests (three anthropometric, six motor); 23 were tested a second time to evaluate test-retest reliability. Talent identification test-retest reliability was evaluated using Intra-class Correlation Coefficient (ICC) and Bland-Altman plots (Limits of Agreement). Spearman's correlation assessed strength of association between criterion throws and talent identification tests. Reliability was generally acceptable (mean ICC = 0.89), but two seated talent identification tests require more extensive familiarisation. Correlation strength (mean rs = 0.76) indicated that the talent identification tests can be used to validly identify individuals with competitively advantageous attributes for each of the five kinematically distinct throwing activities. Results facilitate further research in this understudied area.
Region of Interest Correction Factors Improve Reliability of Diffusion Imaging Measures Within and Across Scanners and Field Strengths

PubMed Central

Venkatraman, Vijay K; Gonzalez, Christopher E.; Landman, Bennett; Goh, Joshua; Reiter, David A.; An, Yang; Resnick, Susan M.

2017-01-01

Diffusion tensor imaging (DTI) measures are commonly used as imaging markers to investigate individual differences in relation to behavioral and health-related characteristics. However, the ability to detect reliable associations in cross-sectional or longitudinal studies is limited by the reliability of the diffusion measures. Several studies have examined reliability of diffusion measures within (i.e. intra-site) and across (i.e. inter-site) scanners with mixed results. Our study compares the test-retest reliability of diffusion measures within and across scanners and field strengths in cognitively normal older adults with a follow-up interval less than 2.25 years. Intra-class correlation (ICC) and coefficient of variation (CoV) of fractional anisotropy (FA) and mean diffusivity (MD) were evaluated in sixteen white matter and twenty-six gray matter bilateral regions. The ICC for intra-site reliability (0.32 to 0.96 for FA and 0.18 to 0.95 for MD in white matter regions; 0.27 to 0.89 for MD and 0.03 to 0.79 for FA in gray matter regions) and inter-site reliability (0.28 to 0.95 for FA in white matter regions, 0.02 to 0.86 for MD in gray matter regions) with longer follow-up intervals were similar to earlier studies using shorter follow-up intervals. The reliability of across field strengths comparisons was lower than intra- and inter-site reliability. Within and across scanner comparisons showed that diffusion measures were more stable in larger white matter regions (> 1500 mm3). For gray matter regions, the MD measure showed stability in specific regions and was not dependent on region size. Linear correction factor estimated from cross-sectional or longitudinal data improved the reliability across field strengths. Our findings indicate that investigations relating diffusion measures to external variables must consider variable reliability across the distinct regions of interest and that correction factors can be used to improve consistency of measurement across field strengths. An important result of this work is that inter-scanner and field strength effects can be partially mitigated with linear correction factors specific to regions of interest. These data-driven linear correction techniques can be applied in cross-sectional or longitudinal studies. PMID:26146196
Reliability of Autism-Tics, AD/HD, and other Comorbidities (A-TAC) inventory in a test-retest design.

PubMed

Larson, Tomas; Kerekes, Nóra; Selinus, Eva Norén; Lichtenstein, Paul; Gumpert, Clara Hellner; Anckarsäter, Henrik; Nilsson, Thomas; Lundström, Sebastian

2014-02-01

The Autism-Tics, AD/HD, and other Comorbidities (A-TAC) inventory is used in epidemiological research to assess neurodevelopmental problems and coexisting conditions. Although the A-TAC has been applied in various populations, data on retest reliability are limited. The objective of the present study was to present additional reliability data. The A-TAC was administered by lay assessors and was completed on two occasions by parents of 400 individual twins, with an average interval of 70 days between test sessions. Intra- and inter-rater reliability were analysed with intraclass correlations and Cohen's kappa. A-TAC showed excellent test-retest intraclass correlations for both autism spectrum disorder and attention deficit hyperactivity disorder (each at .84). Most modules in the A-TAC had intra- and inter-rater reliability intraclass correlation coefficients of > or = .60. Cohen's kappa indi- cated acceptable reliability. The current study provides statistical evidence that the A-TAC yields good test-retest reliability in a population-based cohort of children.
Validity and Reliability of 10-Hz Global Positioning System to Assess In-line Movement and Change of Direction.

PubMed

Nikolaidis, Pantelis T; Clemente, Filipe M; van der Linden, Cornelis M I; Rosemann, Thomas; Knechtle, Beat

2018-01-01

The objectives of the present study were to examine the validity and reliability of the 10 Hz Johan GPS unit in assessing in-line movement and change of direction. The validity was tested against the criterion measure of 200 m track-and-field (track-and-field athletes, n = 8) and 20 m shuttle run endurance test (female soccer players, n = 20). Intra-unit and inter-unit reliability was tested by intra-class correlation coefficient (ICC) and coefficient of variation (CV), respectively. An analysis of variance examined differences between the GPS measurement and five laps of 200 m at 15 km/h, and t -test examined differences between the GPS measurement and 20 m shuttle run endurance test. The difference between the GPS measurement and 200 m distance ranged from -0.13 ± 3.94 m (95% CI -3.42; 3.17) in the first lap to 2.13 ± 2.64 m (95% CI -0.08; 4.33) in the fifth lap. A good intra-unit reliability was observed in 200 m (ICC = 0.833, 95% CI 0.535; 0.962). Inter-unit CV ranged from 1.31% (fifth lap) to 2.20% (third lap). The difference between the GPS measurement and 20 m shuttle run endurance test ranged from 0.33 ± 4.16 m (95% CI -10.01; 10.68) in 11.5 km/h to 9.00 ± 5.30 m (95% CI 6.44; 11.56) in 8.0 km/h. A moderate intra-unit reliability was shown in the second and third stage of the 20 m shuttle run endurance test (ICC = 0.718, 95% CI 0.222;0.898) and good reliability in the fifth, sixth, seventh and eighth (ICC = 0.831, 95% CI -0.229;0.996). Inter-unit CV ranged from 2.08% (11.5 km/h) to 3.92% (8.5 km/h). Based on these findings, it was concluded that the 10 Hz Johan system offers an affordable valid and reliable tool for coaches and fitness trainers to monitor training and performance.
Three-dimensional facial analysis of Chinese children with repaired unilateral cleft lip and palate

NASA Astrophysics Data System (ADS)

Othman, Siti Adibah; Aidil Koay, Noor Airin

2016-08-01

We analyzed the facial features of Chinese children with repaired unilateral cleft lip and palate (UCLP) and compared them with a normal control group using a three-dimensional (3D) stereophotogrammetry camera. This cross-sectional study examined 3D measurements of the facial surfaces of 20 Chinese children with repaired UCLP and 40 unaffected Chinese children aged 7 to 12 years old, which were captured using the VECTRA 3D five-pod photosystem and analyzed using Mirror software. Twenty-five variables and two ratios were compared between both groups using independent t-test. Intra- and inter-observer reliability was determined using ten randomly selected images and analyzed using intra-class correlation coefficient test (ICC). The level of significance was set at p < 0.0018. Intra- and inter-observers’ reliability was considered fair to excellent with an ICC value ranging from 0.54 to 0.99. Statistically significant differences (p < 0.0018) were found mainly in the nasolabial region. The cleft group exhibited wider alar base root width, flattened nose and broader nostril floor width on the cleft side. They tended to have shorter upper lip length and thinner upper vermillion thickness. Faces of Chinese children with repaired UCLP displayed meaningful differences when compared to the normal group especially in the nasolabial regions.
Validity and Reliability of Thai Version of the Foot and Ankle Ability Measure (FAAM) Subjective Form.

PubMed

Arunakul, Marut; Arunakul, Preeyaphan; Suesiritumrong, Chakhrist; Angthong, Chayanin; Chernchujit, Bancha

2015-06-01

Self-administered questionnaires have become an important aspect for clinical outcome assessment of foot and ankle-related problems. The Foot and Ankle Ability Measure (FAAM) subjective form is a region-specific questionnaire that is widely used and has sufficient validity and reliability from previous studies. Translate the original English version of FAAM into a Thai version and evaluate the validity and reliability of Thai FAAM in patients with foot and ankle-related problems. The FAAM subjective form was translated into Thai using forward-backward translation protocol. Afterward, reliability and validity were tested. Following responses from 60 consecutive patients on two questionnaires, the Thai FAAM subjective form and the short form (SF)-36, were used. The validity was tested by correlating the scores from both questionnaires. The reliability was adopted by measuring the test-retest reliability and internal consistency. Thai FAAM score including activity of daily life (ADL) and Sport subscale demonstrated the sufficient correlations with physical functioning (PF) and physical composite score (PCS) domains of the SF-36 (statistically significant with p < 0.001 level and ≥ 0.5 values). The result of reliability revealed highly intra-class correlation coefficient as 0.8 and 0.77, respectively from test-retest study. The internal consistency was strong (Cronbach alpha = 0.94 and 0.88, respectively). The Thai version of FAAM subjective form retained the characteristics of the original version and has proved a reliable evaluation instrument for patients with foot and ankle-related problems.
Disease Severity and Progression in Progressive Supranuclear Palsy and Multiple System Atrophy: Validation of the NNIPPS – PARKINSON PLUS SCALE

PubMed Central

Payan, Christine A. M.; Viallet, François; Landwehrmeyer, Bernhard G.; Bonnet, Anne-Marie; Borg, Michel; Durif, Franck; Lacomblez, Lucette; Bloch, Frédéric; Verny, Marc; Fermanian, Jacques; Agid, Yves; Ludolph, Albert C.

2011-01-01

Background The Natural History and Neuroprotection in Parkinson Plus Syndromes (NNIPPS) study was a large phase III randomized placebo-controlled trial of riluzole in Progressive Supranuclear Palsy (PSP, n = 362) and Multiple System Atrophy (MSA, n = 398). To assess disease severity and progression, we constructed and validated a new clinical rating scale as an ancillary study. Methods and Findings Patients were assessed at entry and 6-montly for up to 3 years. Evaluation of the scale's psychometric properties included reliability (n = 116), validity (n = 760), and responsiveness (n = 642). Among the 85 items of the initial scale, factor analysis revealed 83 items contributing to 15 clinically relevant dimensions, including Activity of daily Living/Mobility, Axial bradykinesia, Limb bradykinesia, Rigidity, Oculomotor, Cerebellar, Bulbar/Pseudo-bulbar, Mental, Orthostatic, Urinary, Limb dystonia, Axial dystonia, Pyramidal, Myoclonus and Tremor. All but the Pyramidal dimension demonstrated good internal consistency (Cronbach α≥0.70). Inter-rater reliability was high for the total score (Intra-class coefficient = 0.94) and 9 dimensions (Intra-class coefficient = 0.80–0.93), and moderate (Intra-class coefficient = 0.54–0.77) for 6. Correlations of the total score with other clinical measures of severity were good (rho≥0.70). The total score was significantly and linearly related to survival (p<0.0001). Responsiveness expressed as the Standardized Response Mean was high for the total score slope of change (SRM = 1.10), though higher in PSP (SRM = 1.25) than in MSA (SRM = 1.0), indicating a more rapid progression of PSP. The slope of change was constant with increasing disease severity demonstrating good linearity of the scale throughout disease stages. Although MSA and PSP differed quantitatively on the total score at entry and on rate of progression, the relative contribution of clinical dimensions to overall severity and progression was similar. Conclusions The NNIPPS-PPS has suitable validity, is reliable and sensitive, and therefore is appropriate for use in clinical studies with PSP or MSA. Trial Registration ClinicalTrials.gov NCT00211224 PMID:21829612
Measuring competence in endoscopic sinus surgery.

PubMed

Syme-Grant, J; White, P S; McAleer, J P G

2008-02-01

Competence based education is currently being introduced into higher surgical training in the UK. Valid and reliable performance assessment tools are essential to ensure competencies are achieved. No such tools have yet been reported in the UK literature. We sought to develop and pilot test an Endoscopic Sinus Surgery Competence Assessment Tool (ESSCAT). The ESSCAT was designed for in-theatre assessment of higher surgical trainees in the UK. The ESSCAT rating matrix was developed through task analysis of ESS procedures. All otolaryngology consultants and specialist registrars in Scotland were given the opportunity to contribute to its refinement. Two cycles of in-theatre testing were used to ensure utility and gather quantitative data on validity and reliability. Videos of trainees performing surgery were used in establishing inter-rater reliability. National consultation, the consensus derived minimum standard of performance, Cronbach's alpha = 0.89 and demonstration of trainee learning (p = 0.027) during the in vivo application of the ESSCAT suggest a high level of validity. Inter-rater reliability was moderate for competence decisions (Cohen's Kappa = 0.5) and good for total scores (Intra-Class Correlation Co-efficient = 0.63). Intra-rater reliability was good for both competence decisions (Kappa = 0.67) and total scores (Kendall's Tau-b = 0.73). The ESSCAT generates a valid and reliable assessment of trainees' in-theatre performance of endoscopic sinus surgery. In conjunction with ongoing evaluation of the instrument we recommend the use of the ESSCAT in higher specialist training in otolaryngology in the UK.
Intra- and interrater reliability of the 'lumbar-locked thoracic rotation test' in competitive swimmers ages 10 through 18 years.

PubMed

Feijen, Stef; Kuppens, Kevin; Tate, Angela; Baert, Isabel; Struyf, Thomas; Struyf, Filip

2018-04-17

Measuring thoracic spine mobility can be of interest to competitive swimmers as it has been associated with shoulder girdle function and scapular position in subjects with and without shoulder pain. At present, no reliability data of thoracic spine mobility measurements are available in the swimming population. This study aims to evaluate the within-session intra- and interrater reliability of the "lumbar-locked rotation test" for thoracic spine rotation in competitive swimmers aged 10 to 18 years. This reliability study is part of a larger prospective cohort study investigating potential risk factors for the development of shoulder pain in competitive swimmers. Within-session, intra- and inter-rater reliability. Competitive swimming clubs in Belgium. 21 competitive swimmers. Intra- and inter-rater reliability of the lumbar-locked thoracic rotation test. Intraclass correlation coefficients (ICCs) ranged from 0.91 (95% CI 0.78 to 0.96) to 0.96 (0.89-0.98) for intra-rater reliability. Results for inter-rater reliability ranged from 0.89 (0.72-0.95) to 0.86 (0.65-0.94) respectively for right and left thoracic rotation. Results suggest good to excellent reliability of the lumbar-locked thoracic rotation test, indicating this test can be used reliably in clinical practice. Copyright © 2018 Elsevier Ltd. All rights reserved.
The city of hope-quality of life-ostomy questionnaire: persian translation and validation.

PubMed

Anaraki, F; Vafaie, M; Behboo, R; Esmaeilpour, S; Maghsoodi, N; Safaee, A; Grant, M

2014-07-01

Since there is no disease-specific instrument for measuring quality-of-life (QOL) in Ostomy patients in Persian language. This study was designed to translate and evaluate the validity and reliability of City of Hope-quality of life-Ostomy questionnaire (COH-QOL-Ostomy questionnaire). This study was designed as cross-sectional study. Reliability of the subscales and the summary scores were demonstrated by intra-class correlation coefficients. Pearson's correlations of an item with its own scale and other scales were calculated to evaluated convergent and discriminant validity. Clinical validity was also evaluated by known-group comparisons. Cronbach's alpha coefficient for all subscales was about 0.70 or higher. Results of interscale correlation were satisfactory and each subscale only measured a single and specified trait. All subscales met the standards of convergent and discriminant validity. Known group comparison analysis showed significant differences in social and spiritual well-being. The findings confirmed the reliability and validity of Persian version of COH-QOL-Ostomy questionnaire. The instrument was also well received by the Iranian patients. It can be considered as a valuable instrument to assess the different aspects of health related quality-of-life in Ostomy patients and used in clinical research in the future.
The Validity and Reliability Test of the Indonesian Version of Gastroesophageal Reflux Disease Quality of Life (GERD-QOL) Questionnaire.

PubMed

Siahaan, Laura A; Syam, Ari F; Simadibrata, Marcellus; Setiati, Siti

2017-01-01

to obtain a valid and reliable GERD-QOL questionnaire for Indonesian application. at the initial stage, the GERD-QOL questionnaire was first translated into Indonesian language and the translated questionnaire was subsequently translated back into the original language (back-to-back translation). The results were evaluated by the researcher team and therefore, an Indonesian version of GERD-QOL questionnaire was developed. Ninety-one patients who had been clinically diagnosed with GERD based on the Montreal criteria were interviewed using the Indonesian version of GERD-QOL questionnaire and the SF 36 questionnaire. The validity was evaluated using a method of construct validity and external validity, and reliability can be tested by the method of internal consistency and test retest. the Indonesian version of GERD-QOL questionnaire had a good internal consistency reliability with a Cronbach Alpha of 0.687-0.842 and a good test retest reliability with an intra-class correlation coefficient of 0.756-0.936; p<0.05). The questionnaire had also been demonstrated to have a good validity with a proven high correlation to each question of SF-36 (p<0.05). the Indonesian version of GERD-QOL questionnaire has been proven valid and reliable to evaluate the quality of life of GERD patients.
Relationship between the alpha and beta angles in diagnosing CAM-type femoroacetabular impingement on frog-leg lateral radiographs.

PubMed

Khan, Moin; Ranawat, Anil; Williams, Dale; Gandhi, Rajiv; Choudur, Hema; Parasu, Naveen; Simunovic, Nicole; Ayeni, Olufemi R

2015-09-01

Alpha and beta angles are commonly used radiographic measures to assess the sphericity of the proximal femur and distance between the pathologic head-neck junction and the acetabular rim, respectively. The aim of this study was to explore the relationship between these two measurements on frog-leg lateral hip radiographs. Fifty frog-leg lateral hip radiographs were evaluated by two orthopaedic surgeons and two radiologists. Each reviewer measured the alpha and beta angles on two separate occasions to determine the relationship between positive alpha and beta angles and the inter- and intra-observer reliability of these measurements. There was no significant association between positive alpha and beta angles, [kappa range -0.043 (95 % CI -0.17 to 0.086) to 0.54 (95 % CI 0.33-0.75)]. Intra-observer reliability was high [alpha angle intra-class correlation coefficient (ICC) range 0.74 (95 % CI 0.58-0.84) to 0.99 (95 % CI 0.98-0.99) and beta angle ICC range 0.86 (95 % CI 0.76-0.92) to 0.97 (95 % CI 0.95-0.98)]. There is no statistical or functional relationship between readings of positive alpha and beta angles. The radiographic measurements resulted in high intra-observer and fair-to-moderate inter-observer reliability. Results of this study suggest that the presence of a CAM lesion on lateral radiographs as suggested by a positive alpha angle does not necessitate a decrease in clearance between the femoral head and acetabular rim as measured by the beta angle and thus may not be the best measure of functional impingement. Understanding the relationship between these two aspects of femoroacetabular impingement improves a surgeon's ability to anticipate potential operative management.
Inter-Observer, Intra-Observer and Intra-Individual Reliability of Uroflowmetry Tests in Aged Men: A Generalizability Theory Approach.

PubMed

Liu, Ying-Buh; Yang, Stephen S; Hsieh, Cheng-Hsing; Lin, Chia-Da; Chang, Shang-Jen

2014-05-01

To evaluate the inter-observer, intra-observer and intra-individual reliability of uroflowmetry and post-void residual urine (PVR) tests in adult men. Healthy volunteers aged over 40 years were enrolled. Every participant underwent two sets of uroflowmetry and PVR tests with a 2-week interval between the tests. The uroflowmetry tests were interpreted by four urologists independently. Uroflowmetry curves were classified as bell-shaped, bell-shaped with tail, obstructive, restrictive, staccato, interrupted and tower-shaped and scored from 1 (highly abnormal) to 5 (absolutely normal). The agreements between the observers, interpretations and tests within individuals were analyzed using kappa statistics and intraclass correlation coefficients. Generalizability theory with decision analysis was used to determine how many observers, tests, and interpretations were needed to obtain an acceptable reliability (> 0.80). Of 108 volunteers, we randomly selected the uroflowmetry results from 25 participants for the evaluation of reliability. The mean age of the studied adults was 55.3 years. The intra-individual and intra-observer reliability on uroflowmetry tests ranged from good to very good. However, the inter-observer reliability on normalcy and specific type of flow pattern were relatively lower. In generalizability theory, three observers were needed to obtain an acceptable reliability on normalcy of uroflow pattern if the patient underwent uroflowmetry tests twice with one observation. The intra-individual and intra-observer reliability on uroflowmetry tests were good while the inter-observer reliability was relatively lower. To improve inter-observer reliability, the definition of uroflowmetry should be clarified by the International Continence Society. © 2013 Wiley Publishing Asia Pty Ltd.
Effect of knee angle on neuromuscular assessment of plantar flexor muscles: A reliability study

PubMed Central

Cornu, Christophe; Jubeau, Marc

2018-01-01

Introduction This study aimed to determine the intra- and inter-session reliability of neuromuscular assessment of plantar flexor (PF) muscles at three knee angles. Methods Twelve young adults were tested for three knee angles (90°, 30° and 0°) and at three time points separated by 1 hour (intra-session) and 7 days (inter-session). Electrical (H reflex, M wave) and mechanical (evoked and maximal voluntary torque, activation level) parameters were measured on the PF muscles. Intraclass correlation coefficients (ICC) and coefficients of variation were calculated to determine intra- and inter-session reliability. Results The mechanical measurements presented excellent (ICC>0.75) intra- and inter-session reliabilities regardless of the knee angle considered. The reliability of electrical measurements was better for the 90° knee angle compared to the 0° and 30° angles. Conclusions Changes in the knee angle may influence the reliability of neuromuscular assessments, which indicates the importance of considering the knee angle to collect consistent outcomes on the PF muscles. PMID:29596480
Reliability of concentrations of organophosphate pesticide metabolites in serial urine specimens from pregnancy in the Generation R study

PubMed Central

Spaan, Suzanne; Pronk, Anjoeka; Koch, Holger M.; Jusko, Todd A.; Jaddoe, Vincent W.V.; Shaw, Pamela A.; Tiemeier, Henning M.; Hofman, Albert; Pierik, Frank H.; Longnecker, Matthew P.

2014-01-01

The widespread use of organophosphate (OP) pesticides has resulted in ubiquitous exposure in humans, primarily through their diet. Exposure to OP pesticides may have adverse health effects, including neurobehavioral deficits in children. The optimal design of new studies requires data on the reliability of urinary measures of exposure. In the present study, urinary concentrations of six dialkyl phosphate (DAP) metabolites, the main urinary metabolites of OP pesticides, were determined in 120 pregnant women participating in the Generation R Study in Rotterdam. Intra-class correlation coefficients (ICCs) across serial urine specimens taken at <18, 18–25, and >25 weeks of pregnancy were determined to assess reliability. Geometric mean total DAP metabolite concentrations were 229 (GSD 2.2), 240 (GSD 2.1), and 224 (GSD 2.2) nmol/g creatinine across the three periods of gestation. Metabolite concentrations from the serial urine specimens in general correlated moderately. The ICCs for the six DAP metabolites ranged from 0.14 to 0.38 (0.30 for total DAPs), indicating weak to moderate reliability. Although the DAP metabolite levels observed in this study are slightly higher and slightly more correlated than in previous studies, the low to moderate reliability indicates a high degree of within-person variability, which presents challenges for designing well-powered epidemiologic studies. PMID:25515376
Translation and cultural adaptation of the Manchester-Oxford Foot Questionnaire (MOXFQ) into Persian language.

PubMed

Mousavian, Alireza; Ebrahimzadeh, Mohammad H; Birjandinejad, Ali; Omidi-Kashani, Farzad; Kachooei, Amir Reza

2015-12-01

In this study, we aimed to translate and test the validity and reliablity of the Persian version of the Manchester-Oxford Foot Questionnaire in foot and ankle patients. We translated the Manchester-Oxford Foot Questionnaire to Persian language according to the accepted guidelines, then assessed the psychometric properties including the validity and reliability on 308 patients with long-standing foot and ankle problems. To test the reliability, we calculated the intra-class correlation coefficient (ICC) for test-retest reliability and measured Cronbach's alpha to test the internal consistency. To test the construct validity of the Manchester-Oxford Foot Questionnaire we also administered the Short-Form 36 to patients. Construct validity was supported by significant correlation with SF36 subscales except for pain subscale of the persian MOXFQ with mental health of the SF36 (r=0.207). Intraclass correlation coefficient was 0.79 for the total MOXFQ and ranged from 0.83 to 0.89 for the three subscales. Cronbach's alpha for pain, walking/standing, and social interaction was 0.86, 0.88, and 0.89, respectively, and was 0.79 for the total MOXFQ showing good internal consistency in each domain. The Persian Manchester-Oxford Foot Questionnaire health scoring system is a valid and reliable patient-reported instrument for foot and ankle problems. Copyright © 2015. Published by Elsevier Ltd.
Reliability, repeatability, and reproducibility of pulmonary transit time assessment by contrast enhanced echocardiography.

PubMed

Herold, Ingeborg H F; Saporito, Salvatore; Bouwman, R Arthur; Houthuizen, Patrick; van Assen, Hans C; Mischi, Massimo; Korsten, Hendrikus H M

2016-01-05

The aim of this study is to investigate the inter and intra-rater reliability, repeatability, and reproducibility of pulmonary transit time (PTT) measurement in patients using contrast enhanced ultrasound (CEUS), as an indirect measure of preload and left ventricular function. Mean transit times (MTT) were measured by drawing a region of interest (ROI) in right and left cardiac ventricle in the CEUS loops. Acoustic intensity dilution curves were obtained from the ROIs. MTTs were calculated by applying model-based fitting on the dilution curves. PTT was calculated as the difference of the MTTs. Eight raters with different levels of experience measured the PTT (time moment 1) and repeated the measurement within a week (time moment 2). Reliability and agreement were assessed using intra-class correlations (ICC) and Bland-Altman analysis. Repeatability was tested by estimating the variance of means (ANOVA) of three injections in each patient at different doses. Reproducibility was tested by the ICC of the two time moments. Fifteen patients with heart failure were included. The mean PTT was 11.8 ± 3.1 s at time moment 1 and 11.7 ± 2.9 s at time moment 2. The inter-rater reliability for PTT was excellent (ICC = 0.94). The intra-rater reliability per rater was between 0.81-0.99. Bland-Altman analysis revealed a bias of 0.10 s within the rater groups. Reproducibility for PTT showed an ICC = 0.94 between the two time moments. ANOVA showed no significant difference between the means of the three different doses F = 0.048 (P = 0.95). The mean and standard deviation for PTT estimates at three different doses was 11.6 ± 3.3 s. PTT estimation using CEUS shows a high inter- and intra-rater reliability, repeatability at three different doses, and reproducibility by ROI drawing. This makes the minimally invasive PTT measurement using contrast echocardiography ready for clinical evaluation in patients with heart failure and for preload estimation.

A semi-automated algorithm for hypothalamus volumetry in 3 Tesla magnetic resonance images.

PubMed

Wolff, Julia; Schindler, Stephanie; Lucas, Christian; Binninger, Anne-Sophie; Weinrich, Luise; Schreiber, Jan; Hegerl, Ulrich; Möller, Harald E; Leitzke, Marco; Geyer, Stefan; Schönknecht, Peter

2018-07-30

The hypothalamus, a small diencephalic gray matter structure, is part of the limbic system. Volumetric changes of this structure occur in psychiatric diseases, therefore there is increasing interest in precise volumetry. Based on our detailed volumetry algorithm for 7 Tesla magnetic resonance imaging (MRI), we developed a method for 3 Tesla MRI, adopting anatomical landmarks and work in triplanar view. We overlaid T1-weighted MR images with gray matter-tissue probability maps to combine anatomical information with tissue class segmentation. Then, we outlined regions of interest (ROIs) that covered potential hypothalamus voxels. Within these ROIs, seed growing technique helped define the hypothalamic volume using gray matter probabilities from the tissue probability maps. This yielded a semi-automated method with short processing times of 20-40 min per hypothalamus. In the MRIs of ten subjects, reliabilities were determined as intraclass correlations (ICC) and volume overlaps in percent. Three raters achieved very good intra-rater reliabilities (ICC 0.82-0.97) and good inter-rater reliabilities (ICC 0.78 and 0.82). Overlaps of intra- and inter-rater runs were very good (≥ 89.7%). We present a fast, semi-automated method for in vivo hypothalamus volumetry in 3 Tesla MRI. Copyright © 2018 Elsevier B.V. All rights reserved.
The validity and reliability of the Thai version of the Kujala score for patients with patellofemoral pain syndrome.

PubMed

Apivatgaroon, Adinun; Angthong, Chayanin; Sanguanjit, Prakasit; Chernchujit, Bancha

2016-10-01

To develop a Thai version of the Kujala score and show the evaluation of the validity and reliability of the score. The Thai version of the Kujala score was developed using the forward-backward translation protocol. The 49 PFPS patients answered the Thai version of questionnaires including the Kujala score, Short Form-36 (SF-36) and International Knee Documentation Committee (IKDC) Subjective Knee Form. The validity between the scores has been tested. The reliability was assessed using test-retest reliability and internal consistency. The Thai version of the Kujala score showed a good correlation with Thai IKDC Subjective Knee Form (Pearson's correlation coefficient; r = 0.74: p < 0.01) and moderate correlation with the Thai SF-36 subscales of physical component summary, total score and role physical (r = 0.586, 0.571 and 0.524, respectively: p < 0.01). The test-retest reliability was excellent with an intra-class correlation coefficient of 0.908 (p < 0.001; 95% CI [0.842-0.947]). The internal consistency was strong with Cronbach's alpha of 0.952 (p < 0.001). No floor and ceiling effects were observed. The Thai version of the Kujala score has shown good validity and reliability. This score can be effectively used for evaluating Thai patients with patellofemoral pain syndrome. Implications for Rehabilitation The Kujala score is a self-administered questionnaire for patients with patellofemoral pain syndrome (PFPS). The validity and reliability of the Thai version of Kujala are compatible with other versions (Turkish, Chinese and Persian version). The Thai version of Kujala has been shown to have validity and reliability in Thai PFPS patients and can be used for clinical evaluation and also in the research work.
Reliability and smallest real difference of the ankle lunge test post ankle fracture.

PubMed

Simondson, David; Brock, Kim; Cotton, Susan

2012-02-01

This study aimed to determine the reliability and the smallest real difference of the Ankle Lunge test in an ankle fracture patient population. In the post immobilisation stage of ankle fracture, ankle dorsiflexion is an important measure of progress and outcome. The Ankle Lunge test measures weight bearing dorsiflexion, resulting in negative scores (knee to wall distance) and positive scores (toe to wall distance), for which the latter has proven reliability in normal subjects only. A consecutive sample of ankle fracture patients with permission to commence weight bearing, were recruited to the study. Three measurements of the Ankle Lunge Test were performed each by two raters, one senior and one junior physiotherapist. These occurred prior to therapy sessions in the second week after plaster removal. A standardised testing station was utilised and allowed for both knee to wall distance and toe to wall distance measurement. Data was collected from 10 individuals with ankle fracture, with an average age of 36 years (SD 14.8). Seventy seven percent of observations were negative. Intra and inter-rater reliability yielded intra class correlations at or above 0.97, p < .001. There was a significant systematic bias towards improved scores during repeated measurement for one rater (p = .01). The smallest real difference was calculated as 13.8mm. The Ankle Lunge test is a practical and reliable tool for measuring weightbearing dorsiflexion post ankle fracture. Copyright © 2011 Elsevier Ltd. All rights reserved.
2D phase sensitive inversion recovery imaging to measure in-vivo spinal cord gray and white matter areas in clinically feasible acquisition times

PubMed Central

Papinutto, N.; Schlaeger, R.; Panara, V.; Caverzasi, E.; Ahn, S.; Johnson, K.J.; Zhu, A.H.; Stern, W.A.; Laub, G.; Hauser, S.L.; Henry, R.G.

2018-01-01

PURPOSE In-vivo assessment of spinal cord gray matter (GM) and white matter (WM) could become pivotal to study various neurological diseases, but it is challenging because of insufficient GM/WM contrast provided by conventional MRI. Here we present and assess a procedure for measurement of spinal cord total cross-sectional area (TCA) and GM areas based on phase sensitive inversion recovery imaging (PSIR). MATERIALS AND METHODS We acquired 2D PSIR images at 3T at each disc level of the spinal axis on 10 healthy subjects and measured TCA, cord diameters, WM and GM area, and GM area/TCA ratio. We secondly investigated 32 healthy subjects at 4 selected levels (C2–C3, C3–C4, T8–T9, T9–T10, total acquisition time <8 minutes) and generated normative reference values of TCA and GM areas. We assessed test-retest, intra- and inter-operator reliability of the acquisition strategy and measurement steps. RESULTS The measurement procedure based on 2D PSIR imaging allowed TCA and GM area assessments along the entire spinal cord axis. The tests we performed revealed high test-retest/intra-operator reliability (mean coefficient of variation (COV) at C2–C3: TCA=0.41%, GM area=2.75%) and inter-operator reliability of the measurements (mean COV on the 4 levels: TCA=0.44%, GM area= 4.20%; mean intra-class correlation coefficient: TCA=0.998, GM area=0.906). CONCLUSION 2D PSIR allows reliable in-vivo assessment of spinal cord TCA, GM and WM areas in clinically feasible acquisition times. The area measurements presented here are in agreement with previous MRI and post-mortem studies. PMID:25483607
Reliability of Untrained and Experienced Raters on FEES: Rating Overall Residue is a Simple Task.

PubMed

Pisegna, Jessica M; Borders, James C; Kaneoka, Asako; Coster, Wendy J; Leonard, Rebecca; Langmore, Susan E

2018-03-07

The purpose of this study was to investigate the reliability of residue ratings on Fiberoptic Endoscopic Evaluation of Swallowing (FEES). We also examined rating differences based on experience to determine if years of experience influenced residue ratings. A group of 44 raters watched 81 FEES videos representing a wide range of residue severities for thin liquid, applesauce, and cracker boluses. Raters were untrained on the rating scales and simply rated their overall impression of residue amount on a visual analog scale (VAS) and a five-point ordinal scale in a randomized fashion across two sessions. Intra-class correlation coefficients, kappa coefficients, and ANOVAs were used to analyze agreement and differences in ratings. Residue ratings on both the VAS and ordinal scales had acceptable inter- and intra-rater reliability. Inter-rater agreement was acceptable (ICC > 0.7) for all comparisons. Intra-rater agreement was excellent on the VAS scale (r c = 0.9) and good on the ordinal scale (k = 0.78). There was no significant difference between expert ratings and other raters based on years of experience for cracker ratings (p = 0.2119) and applesauce ratings (p = 0.2899), but there was a significant difference between clinicians on thin liquid ratings (p = 0.0005). Without any specific training, raters demonstrated high reliability when rating the overall amount of residue on FEES. Years of experience with FEES did not influence residue ratings, suggesting that expert ratings of overall residue amount are not unique or specialized. Rating the overall amount of residue on FEES appears to be a simple visual-perceptual task for puree and cracker boluses.
Reproducibility of African giant pouched rats detecting Mycobacterium tuberculosis.

PubMed

Ellis, Haylee; Mulder, Christiaan; Valverde, Emilio; Poling, Alan; Edwards, Timothy

2017-04-24

African pouched rats sniffing sputum samples provided by local clinics have significantly increased tuberculosis case findings in Tanzania and Mozambique. The objective of this study was to determine the reproducibility of rat results. Over an 18-month period 11,869 samples were examined by the rats. Intra-rater reliability was assessed through Yule's Q. Inter-rater reliability was assessed with Krippendorff's alpha. Intra-rater reliability was high, with a mean Yule's Q of 0.9. Inter-rater agreement was fair, with Krippendorf's alpha ranging from 0.15 to 0.45. Both Intra- and Inter-rater reliability was independent of the sex of the animals, but they were positively correlated with age. Both intra- and inter-rater agreement was lowest for samples designated as smear-negative by the clinics. Overall, the reproducibility of tuberculosis detection rat results was fair and diagnostic results were therefore independent of the rats used.
Three-dimensional assessment of the asymptomatic and post-stroke shoulder: intra-rater test-retest reliability and within-subject repeatability of the palpation and digitization approach.

PubMed

Pain, Liza A M; Baker, Ross; Sohail, Qazi Zain; Richardson, Denyse; Zabjek, Karl; Mogk, Jeremy P M; Agur, Anne M R

2018-03-23

Altered three-dimensional (3D) joint kinematics can contribute to shoulder pathology, including post-stroke shoulder pain. Reliable assessment methods enable comparative studies between asymptomatic shoulders of healthy subjects and painful shoulders of post-stroke subjects, and could inform treatment planning for post-stroke shoulder pain. The study purpose was to establish intra-rater test-retest reliability and within-subject repeatability of a palpation/digitization protocol, which assesses 3D clavicular/scapular/humeral rotations, in asymptomatic and painful post-stroke shoulders. Repeated measurements of 3D clavicular/scapular/humeral joint/segment rotations were obtained using palpation/digitization in 32 asymptomatic and six painful post-stroke shoulders during four reaching postures (rest/flexion/abduction/external rotation). Intra-class correlation coefficients (ICCs), standard error of the measurement and 95% confidence intervals were calculated. All ICC values indicated high to very high test-retest reliability (≥0.70), with lower reliability for scapular anterior/posterior tilt during external rotation in asymptomatic subjects, and scapular medial/lateral rotation, humeral horizontal abduction/adduction and axial rotation during abduction in post-stroke subjects. All standard error of measurement values demonstrated within-subject repeatability error ≤5° for all clavicular/scapular/humeral joint/segment rotations (asymptomatic ≤3.75°; post-stroke ≤5.0°), except for humeral axial rotation (asymptomatic ≤5°; post-stroke ≤15°). This noninvasive, clinically feasible palpation/digitization protocol was reliable and repeatable in asymptomatic shoulders, and in a smaller sample of painful post-stroke shoulders. Implications for Rehabilitation In the clinical setting, a reliable and repeatable noninvasive method for assessment of three-dimensional (3D) clavicular/scapular/humeral joint orientation and range of motion (ROM) is currently required. The established reliability and repeatability of this proposed palpation/digitization protocol will enable comparative 3D ROM studies between asymptomatic and post-stroke shoulders, which will further inform treatment planning. Intra-rater test-retest repeatability, which is measured by the standard error of the measure, indicates the range of error associated with a single test measure. Therefore, clinicians can use the standard error of the measure to determine the "true" differences between pre-treatment and post-treatment test scores.
Validation of the Walking Impairment Questionnaire for Spanish patients.

PubMed

Lozano, Francisco S; March, José R; González-Porras, José R; Carrasco, Eduardo; Lobos, José M; Areitio-Aurtena, Alix

2013-09-01

The Walking Impairment Questionnaire (WIQ) is a short, easy to complete, disease-specific questionnaire to assess intermittent claudication. A Spanish version of the WIQ for Hispanic Americans has recently been validated in Texas, but it needs to be validated for European Spanish people. After translation and cultural adaptation of the WIQ, 920 patients with intermittent claudication (ankle brachial index < 0.9) completed two questionnaires (Spanish version of the WIQ and European Quality of Life 5 Dimension [EQ-5D]). The validity of the WIQ was determined by correlating WIQ and EQ-5D. Test-retest reliability and internal consistency were determined using the intra-class correlation coefficient (ICC) and Cronbach's alpha, respectively. The three domains of the WIQ were moderately correlated with the EQ-5D health outcome (r = 0.54 to 0.60; p < 0.001). Test-retest reliabilities ranged from ICC = 0.89 to 0.91 and internal consistency (Cronbach's alpha = 0.92) was high. The Spanish version of the WIQ for European Spanish patients was valid and reproducible, suggesting that it could be used in Spanish patients with intermittent claudication.
The reliability and validity of the Complex Task Performance Assessment: A performance-based assessment of executive function.

PubMed

Wolf, Timothy J; Dahl, Abigail; Auen, Colleen; Doherty, Meghan

2017-07-01

The objective of this study was to evaluate the inter-rater reliability, test-retest reliability, concurrent validity, and discriminant validity of the Complex Task Performance Assessment (CTPA): an ecologically valid performance-based assessment of executive function. Community control participants (n = 20) and individuals with mild stroke (n = 14) participated in this study. All participants completed the CTPA and a battery of cognitive assessments at initial testing. The control participants completed the CTPA at two different times one week apart. The intra-class correlation coefficient (ICC) for inter-rater reliability for the total score on the CTPA was .991. The ICCs for all of the sub-scores of the CTPA were also high (.889-.977). The CTPA total score was significantly correlated to Condition 4 of the DKEFS Color-Word Interference Test (p = -.425), and the Wechsler Test of Adult Reading (p = -.493). Finally, there were significant differences between control subjects and individuals with mild stroke on the total score of the CTPA (p = .007) and all sub-scores except interpretation failures and total items incorrect. These results are also consistent with other current executive function performance-based assessments and indicate that the CTPA is a reliable and valid performance-based measure of executive function.
Cross-cultural adaptation and validation to Brazil of the Obesity-related Problems Scale.

PubMed

Brasil, Andreia Mara Brolezzi; Brasil, Fábio; Maurício, Angélica Aparecida; Vilela, Regina Maria

2017-01-01

To validate a reliable version of the Obesity-related Problems Scale in Portuguese to use it in Brazil. The Obesity-related Problems Scale was translated and transculturally adapted. Later it was simultaneously self-applied with a 12-item version of the World Health Organization Disability Assessment Schedule 2.0 (WHODAS 2.0), to 50 obese patients and 50 non-obese individuals, and applied again to half of them after 14 days. The Obesity-related Problems scale was able to differentiate obese from non-obese individuals with higher accuracy than WHODAS 2.0, correlating with this scale and with body mass index. The factor analysis determined a two-dimensional structure, which was confirmed with χ2/df=1.81, SRMR=0.05, and CFI=0.97. The general a coefficient was 0.90 and the inter-item intra-class correlation, in the reapplication, ranged from 0.75 to 0.87. The scale proved to be valid and reliable for use in the Brazilian population, without the need to exclude items.
Predictors and long-term reproducibility of urinary phthalate metabolites in middle-aged men and women living in urban Shanghai

PubMed Central

Starling, Anne P.; Engel, Lawrence S.; Calafat, Antonia M.; Koutros, Stella; Satagopan, Jaya M.; Yang, Gong; Matthews, Charles E.; Cai, Qiuyin; Buckley, Jessie P.; Ji, Bu-Tian; Cai, Hui; Chow, Wong-Ho; Zheng, Wei; Gao, Yu-Tang; Rothman, Nathaniel; Xiang, Yong-Bing; Shu, Xiao-Ou

2015-01-01

Phthalate esters are man-made chemicals commonly used as plasticizers and solvents, and humans may be exposed through ingestion, inhalation, and dermal absorption. Little is known about predictors of phthalate exposure, particularly in Asian countries. Because phthalates are rapidly metabolized and excreted from the body following exposure, it is important to evaluate whether phthalate metabolites measured at a single point in time can reliably rank exposures to phthalates over a period of time. We examined the concentrations and predictors of phthalate metabolite concentrations among 50 middle-aged women and 50 men from two Shanghai cohorts, enrolled in 1997-2000 and 2002-2006, respectively. We assessed the reproducibility of urinary concentrations of phthalate metabolites in three spot samples per participant taken several years apart (mean interval between first and third sample was 7.5 years [women] or 2.9 years [men]), using Spearman's rank correlation coefficients and intra-class correlation coefficients. We detected ten phthalate metabolites in at least 50% of individuals for two or more samples. Participant sex, age, menopausal status, education, income, body mass index, consumption of bottled water, recent intake of medication, and time of day of collection of the urine sample were associated with concentrations of certain phthalate metabolites. The reproducibility of an individual's urinary concentration of phthalate metabolites across several years was low, with all intra-class correlation coefficients and most Spearman rank correlation coefficients ≤ 0.3. Only mono(2-ethylhexyl) phthalate, a metabolite of di(2-ethylhexyl)phthalate, had a Spearman rank correlation coefficient ≥ 0.4 among men, suggesting moderate reproducibility. These findings suggest that a single spot urine sample is not sufficient to rank exposures to phthalates over several years in an adult urban Chinese population. PMID:26255822
Response variability of different anodal transcranial direct current stimulation intensities across multiple sessions.

PubMed

Ammann, Claudia; Lindquist, Martin A; Celnik, Pablo A

It is well known that transcranial direct current stimulation (tDCS) is capable of modulating corticomotor excitability. However, a source of growing concern has been the observed inter- and intra-individual variability of tDCS-responses. Recent studies have assessed whether individuals respond in a predictable manner across repeated sessions of anodal tDCS (atDCS). The findings of these investigations have been inconsistent, and their methods have some limitations (i.e. lack of sham condition or testing only one tDCS intensity). To study inter- and intra-individual variability of atDCS effects at two different intensities on primary motor cortex (M1) excitability. Twelve subjects participated in a crossover study testing 7-min atDCS over M1 in three separate conditions (2 mA, 1 mA, sham) each repeated three times separated by 48 h. Motor evoked potentials were recorded before and after stimulation (up to 30min). Time of testing was maintained consistent within participants. To estimate the reliability of tDCS effects across sessions, we calculated the Intra-class Correlation Coefficient (ICC). AtDCS at 2 mA, but not 1 mA, significantly increased cortical excitability at the group level in all sessions. The overall ICC revealed fair to high reliability of tDCS effects for multiple sessions. Given that the distribution of responses showed important variability in the sham condition, we established a Sham Variability-Based Threshold to classify responses and to track individual changes across sessions. Using this threshold an intra-individual consistent response pattern was then observed only for the 2 mA condition. 2 mA anodal tDCS results in consistent intra- and inter-individual increases of M1 excitability. Copyright © 2017 Elsevier Inc. All rights reserved.
Preliminary appraisal of the reliability and validity of the Colorado State University Feline Acute Pain Scale.

PubMed

Shipley, Hilary; Guedes, Alonso; Graham, Lynelle; Goudie-DeAngelis, Elizabeth; Wendt-Hornickle, Erin

2018-05-01

Objectives The objective of this study was to determine the inter-rater reliability and convergent validity of the Colorado State University Feline Acute Pain Scale (CSU-FAPS) in a preliminary appraisal of its performance in a clinical teaching setting. Methods Sixty-eight female cats were assessed for pain after ovariohysterectomy. A cohort of 21 cats was examined independently by four raters (two board-certified anesthesiologists and two anesthesia residents) with the CSU-FAPS, and intra-class correlation coefficient (ICC) was used to determine inter-rater reliability. Weighted Cohen's kappa was used to determine inter-rater reliability centered on the 'need to reassess analgesic plan' (dichotomous scale). A separate cohort of 47 cats was evaluated independently by two raters (one board-certified anesthesiologist and one veterinary small animal rotating intern) using the CSU-FAPS and the Glasgow Composite Measure Pain Scale (CMPS-Feline), and Spearman rank-order correlation was determined to assess convergent validity. Reliability was interpreted using Altman's classification as very good, good, moderate, fair and poor. Validity was considered adequate if correlation coefficients were between 0.4 and 0.8. Results The ICC was 0.61 for anesthesiologists and 0.67 for residents, indicating good reliability. Weighted Cohen's kappa was 0.79 for anesthesiologists and 0.44 for residents, indicating moderate to good reliability. The Spearman rank correlation indicated a statistically significant ( P = 0.0003) positive correlation (0.31; 95% confidence interval 0.14-0.46) between the CSU-FAPS and the CMPS-Feline. Conclusions and relevance The CSU-FAPS showed moderate-to-good inter-rater reliability when used by veterinarians to assess pain level or need to reassess analgesic plan after ovariohysterectomy in cats. The validity fell short of current guidelines for correlation coefficients and further refinement and testing are warranted to improve its performance.
The development and reliability of a simple field based screening tool to assess core stability in athletes.

PubMed

O'Connor, S; McCaffrey, N; Whyte, E; Moran, K

2016-07-01

To adapt the trunk stability test to facilitate further sub-classification of higher levels of core stability in athletes for use as a screening tool. To establish the inter-tester and intra-tester reliability of this adapted core stability test. Reliability study. Collegiate athletic therapy facilities. Fifteen physically active male subjects (19.46 ± 0.63) free from any orthopaedic or neurological disorders were recruited from a convenience sample of collegiate students. The intraclass correlation coefficients (ICC) and 95% Confidence Intervals (CI) were computed to establish inter-tester and intra-tester reliability. Excellent ICC values were observed in the adapted core stability test for inter-tester reliability (0.97) and good to excellent intra-tester reliability (0.73-0.90). While the 95% CI were narrow for inter-tester reliability, Tester A and C 95% CI's were widely distributed compared to Tester B. The adapted core stability test developed in this study is a quick and simple field based test to administer that can further subdivide athletes with high levels of core stability. The test demonstrated high inter-tester and intra-tester reliability. Copyright © 2015 Elsevier Ltd. All rights reserved.
Psychometric properties of the medical outcomes study: social support survey among methadone maintenance patients in Ho Chi Minh City, Vietnam: a validation study.

PubMed

Khuong, Long Quynh; Vu, Tuong-Vi Thi; Huynh, Van-Anh Ngoc; Thai, Truc Thanh

2018-02-14

Social support plays a crucial role in the treatment and recovery process of patients engaging in methadone maintenance treatment (MMT). However, there is a paucity of research about social support among MMT patients, possibly due to a lack of appropriate measuring tools. This study aimed to evaluate the psychometric properties of the Vietnamese version of the Medical Outcomes Study: Social Support Survey (MOS-SSS) among MMT patients. A cross-sectional survey of 300 patients was conducted in a methadone clinic in Ho Chi Minh City, Vietnam. MMT patients who agreed to participate in the study completed a face-to-face interview in a private room. The MOS-SSS was translated into Vietnamese using standard forward-backward process. Internal consistency was measured by Cronbach's alpha. The intra-class correlation coefficient was used to determine the test-retest reliability of the MOS-SSS in 75 participants two weeks after the first survey. Concurrent validity of the MOS-SSS was evaluated by correlations with the Multidimensional Scale of Perceived Social Support (MSPSS) and the Perceived Stigma of Addiction Scale (PSAS). Construct validity was investigated by confirmatory factor analysis. The MOS-SSS had good internal consistency with Cronbach's alpha from 0.95 to 0.97 for the four subscales and 0.97 for the overall scale. The two-week test-retest reliability was at moderate level with intra-class correlation coefficients of 0.61-0.73 for the four subscales and 0.76 for the overall scale. Strong significant correlations between the MOS-SSS and the MSPSS (r = 0.77; p < 0.001) and the PSAS (r = - 0.76; p < 0.001) indicated good concurrent validity. Construct validity of the MOS-SSS was established since a final four-factor model fitted the data well with Comparative Fit Index (0.97), Tucker-Lewis Index (0.97), Standardized Root Mean Square Residual (0.03) and Root Mean Square Error of Approximation (0.068; 90% CI = 0.059-0.077). The MOS-SSS is a reliable and valid tool for measuring social support in Vietnamese MMT patients. Further studies among methadone patients at different stages of their treatment and among those from different areas of Vietnam are needed.
Measurement and Reliability of Response Inhibition

PubMed Central

Congdon, Eliza; Mumford, Jeanette A.; Cohen, Jessica R.; Galvan, Adriana; Canli, Turhan; Poldrack, Russell A.

2012-01-01

Response inhibition plays a critical role in adaptive functioning and can be assessed with the Stop-signal task, which requires participants to suppress prepotent motor responses. Evidence suggests that this ability to inhibit a prepotent motor response (reflected as Stop-signal reaction time (SSRT)) is a quantitative and heritable measure of interindividual variation in brain function. Although attention has been given to the optimal method of SSRT estimation, and initial evidence exists in support of its reliability, there is still variability in how Stop-signal task data are treated across samples. In order to examine this issue, we pooled data across three separate studies and examined the influence of multiple SSRT calculation methods and outlier calling on reliability (using Intra-class correlation). Our results suggest that an approach which uses the average of all available sessions, all trials of each session, and excludes outliers based on predetermined lenient criteria yields reliable SSRT estimates, while not excluding too many participants. Our findings further support the reliability of SSRT, which is commonly used as an index of inhibitory control, and provide support for its continued use as a neurocognitive phenotype. PMID:22363308
The quadrant method measuring four points is as a reliable and accurate as the quadrant method in the evaluation after anatomical double-bundle ACL reconstruction.

PubMed

Mochizuki, Yuta; Kaneko, Takao; Kawahara, Keisuke; Toyoda, Shinya; Kono, Norihiko; Hada, Masaru; Ikegami, Hiroyasu; Musha, Yoshiro

2017-11-20

The quadrant method was described by Bernard et al. and it has been widely used for postoperative evaluation of anterior cruciate ligament (ACL) reconstruction. The purpose of this research is to further develop the quadrant method measuring four points, which we named four-point quadrant method, and to compare with the quadrant method. Three-dimensional computed tomography (3D-CT) analyses were performed in 25 patients who underwent double-bundle ACL reconstruction using the outside-in technique. The four points in this study's quadrant method were defined as point1-highest, point2-deepest, point3-lowest, and point4-shallowest, in femoral tunnel position. Value of depth and height in each point was measured. Antero-medial (AM) tunnel is (depth1, height2) and postero-lateral (PL) tunnel is (depth3, height4) in this four-point quadrant method. The 3D-CT images were evaluated independently by 2 orthopaedic surgeons. A second measurement was performed by both observers after a 4-week interval. Intra- and inter-observer reliability was calculated by means of intra-class correlation coefficient (ICC). Also, the accuracy of the method was evaluated against the quadrant method. Intra-observer reliability was almost perfect for both AM and PL tunnel (ICC > 0.81). Inter-observer reliability of AM tunnel was substantial (ICC > 0.61) and that of PL tunnel was almost perfect (ICC > 0.81). The AM tunnel position was 0.13% deep, 0.58% high and PL tunnel position was 0.01% shallow, 0.13% low compared to quadrant method. The four-point quadrant method was found to have high intra- and inter-observer reliability and accuracy. This method can evaluate the tunnel position regardless of the shape and morphology of the bone tunnel aperture for use of comparison and can provide measurement that can be compared with various reconstruction methods. The four-point quadrant method of this study is considered to have clinical relevance in that it is a detailed and accurate tool for evaluating femoral tunnel position after ACL reconstruction. Case series, Level IV.
Inter-Observer and Intra-Observer Reliability of Clinical Assessments in Knee Osteoarthritis

PubMed Central

Maricar, Nasimah; Callaghan, Michael J; Parkes, Matthew J; Felson, David T; O’Neill, Terence W

2016-01-01

Background Clinical examination of the knee is subject to measurement error. The aim of this analysis was to determine inter- and intra-observer reliability of commonly used clinical tests in patients with knee osteoarthritis(OA). Methods We studied subjects with symptomatic knee OA who were participants in an open-label clinical trial of intra-articular steroid therapy. Following standardisation of the clinical test procedures, two clinicians assessed 25 subjects independently at the same visit, and the same clinician assessed 88 subjects over an interval period of 2–10 weeks; in both cases prior to the steroid intervention. Clinical examination included assessment of bony enlargement, crepitus, quadriceps wasting, knee effusion, joint-line and anserine tenderness and knee range of movement(ROM). Intra-class correlation coefficients(ICC), estimated kappa(κ), weighted kappa(κω) and Bland and Altman plots were used to determine inter- and intra-observer levels of agreement. Results Using Landis and Koch criteria, inter-observer kappa scores were moderate for patellofemoral joint(κ=0.53) and anserine tenderness(κ=0.48); good for bony enlargement(κ=0.66), quadriceps wasting(κ=0.78), crepitus(κ=0.78), medial tibiofemoral joint tenderness(κ=0.76), and effusion assessed by ballottement(κ=0.73) and bulge sign(κω =0.78); and excellent for lateral tibiofemoral joint tenderness(κ=1.00), flexion(ICC=0.97) and extension(ICC=0.87) ROM. Intra-observer kappa scores were moderate for lateral tibiofemoral joint tenderness(κ=0.60), good for crepitus(κ=0.78), effusion assessed by ballottement test(κ=0.77), patellofemoral joint(κ=0.66), medial tibiofemoral joint(κ=0.64) and anserine(κ=0.73) tenderness and excellent for effusion assessed by bulge sign(κω =0.83), bony enlargement(κ=0.98), quadriceps wasting(κ=0.83), flexion(ICC=0.99) and extension(ICC=0.96) ROM. Conclusion Among individuals with symptomatic knee OA, the reliability of clinical examination of the knee was at least good for the majority of clinical signs of knee OA. PMID:27909143
Inter-vender and test-retest reliabilities of resting-state functional magnetic resonance imaging: Implications for multi-center imaging studies.

PubMed

An, Hyeong Su; Moon, Won-Jin; Ryu, Jae-Kyun; Park, Ju Yeon; Yun, Won Sung; Choi, Jin Woo; Jahng, Geon-Ho; Park, Jang-Yeon

2017-12-01

This prospective multi-center study aimed to evaluate the inter-vendor and test-retest reliabilities of resting-state functional magnetic resonance imaging (RS-fMRI) by assessing the temporal signal-to-noise ratio (tSNR) and functional connectivity. Study included 10 healthy subjects and each subject was scanned using three 3T MR scanners (GE Signa HDxt, Siemens Skyra, and Philips Achieva) in two sessions. The tSNR was calculated from the time course data. Inter-vendor and test-retest reliabilities were assessed with intra-class correlation coefficients (ICCs) derived from variant component analysis. Independent component analysis was performed to identify the connectivity of the default-mode network (DMN). In result, the tSNR for the DMN was not significantly different among the GE, Philips, and Siemens scanners (P=0.638). In terms of vendor differences, the inter-vendor reliability was good (ICC=0.774). Regarding the test-retest reliability, the GE scanner showed excellent correlation (ICC=0.961), while the Philips (ICC=0.671) and Siemens (ICC=0.726) scanners showed relatively good correlation. The DMN pattern of the subjects between the two sessions for each scanner and between three scanners showed the identical patterns of functional connectivity. The inter-vendor and test-retest reliabilities of RS-fMRI using different 3T MR scanners are good. Thus, we suggest that RS-fMRI could be used in multicenter imaging studies as a reliable imaging marker. Copyright © 2017 Elsevier Inc. All rights reserved.
Is intra-bladder pressure measurement a reliable indicator for raised intra-abdominal pressure? A prospective comparative study.

PubMed

Al-Abassi, Abdulla Ahmed; Al Saadi, Azan Saleh; Ahmed, Faisal

2018-06-19

Intra-abdominal pressure (IAP) can be measured by several indirect methods; however, the urinary bladder is largely preferred. The aim of this study was to compare intra-bladder pressure (IBP) at different levels of IAPs and assess its reliability as an indirect method for IAP measurement. We compared IBP with IAP in twenty-one patients undergoing laparoscopic cholecystectomy under general anesthesia. Measurements were recorded at increasing levels of insufflation pressures to approximately 22 mmHg. Pearson's correlation coefficient was calculated to establish the relationship between the two pressure measurements and Bland-Altman analysis was used to assess the limits of agreement between the two methods of measurements. The urinary bladder pressures reflected well the pressures in the abdominal cavity. Pearson correlation coefficient showed a good correlation between the two measurement techniques (r = 0.966, p < 0.0001) and Bland-Altman analysis indicated that the 95% limits of agreement between the two methods ranged from - 2.83 to 2.64. This range is accepted both clinically and according to the recommendations of the World Society of Abdominal Compartment Syndrome (WSACS). Our study showed that IBP measurement is a simple, minimally invasive method that may reliably estimates IAP in patients placed in supine position. Measurements for pressures higher than 12 mmHg may be less reliable. When applied clinically, this should alert the clinician to take safety measures to avoid abdominal compartment syndrome (ACS).

Assessing self-reported disability in a low-literate population with chronic low back pain: cross-cultural adaptation and psychometric testing of Igbo Roland Morris disability questionnaire.

PubMed

Igwesi-Chidobe, Chinonso N; Obiekwe, Chinwe; Sorinola, Isaac O; Godfrey, Emma L

2017-12-14

Cross-culturally adapt and validate the Igbo Roland Morris Disability Questionnaire. Cross-cultural adaptation, test-retest, and cross-sectional psychometric testing. Roland Morris Disability Questionnaire was forward and back translated by clinical/non-clinical translators. An expert committee appraised the translations. Twelve participants with chronic low back pain pre-tested the measure in a rural Nigerian community. Internal consistency using Cronbach's alpha; test-retest reliability using intra-class correlation coefficient and Bland-Altman plot; and minimal detectable change were investigated in a convenient sample of 50 people with chronic low back pain in rural and urban Nigeria. Pearson's correlation analyses using the eleven-point box scale and back performance scale, and exploratory factor analysis were used to examine construct validity in a random sample of 200 adults with chronic low back pain in rural Nigeria. Ceiling and floor effects were investigated in the two samples. Modifications gave the option of interviewer-administration and reflected Nigerian social context. The measure had excellent internal consistency (α = 0.91) and intraclass correlation coefficient (ICC =0.84), moderately high correlations (r > 0.6) with performance-based disability and pain intensity, and a predominant uni-dimensional structure, with no ceiling or floor effects. Igbo Roland Morris Disability Questionnaire is a valid and reliable measure of pain-related disability. Implications for rehabilitation Low back pain is the leading cause of years lived with disability worldwide, and is particularly prevalent in rural Nigeria, but there are no self-report measures to assess its impact due to low literacy rates. This study describes the cross-cultural adaptation and validation of a core self-report back pain specific disability measure in a low-literate Nigerian population. The Igbo Roland Morris Disability Questionnaire is a reliable and valid measure of self-reported disability in Igbo populations as indicated by excellent internal consistency (α = 0.91) and intra-class correlation coefficient (ICC =0.84), moderately high correlations (r > 0.6) with performance-based disability and pain intensity that supports a pain-related disability construct, a predominant one factor structure with no ceiling or floor effects. The measure will be useful for researchers and clinicians examining the factors associated with low back pain disability or the effects of interventions on low back pain disability in this culture. This measure will support global health initiatives concurrently involving people from several cultures or countries, and may inform cross-cultural disability research in other populations.
Reliability and cross-cultural validation of the Turkish version of Manual Ability Classification System (MACS) for children with cerebral palsy.

PubMed

Akpinar, Pinar; Tezel, Canan G; Eliasson, Ann-Christin; Icagasioglu, Afitap

2010-01-01

To determine the reliability and cross-cultural validation of the Turkish translation of the Manual Ability Classification System (MACS) for children with cerebral palsy (CP) and to investigate the relation to gross motor function and other comorbidities. After the forward and backward translation procedures, inter-rater and test-retest reliability was assessed between parents, physiotherapists and physicians using the intra-class correlation coefficient (ICC). Children (N = 118, 4 to 18 years, mean age 9 years 4 months; 68 boys, 50 girls) with various types of CP were classified. Additional data on the Gross Motor Function Classification System (GMFCS), intellectual delay, visual acuity, and epilepsy were collected. The inter-rater reliability was high; the ICC ranged from 0.89 to 0.96 among different professionals and parents. Between two persons of the same profession it ranged from 0.97 to 0.98. For the test-retest reliability it ranged from 0.91 to 0.98. Total agreement between the GMFCS and the MACS occurred in only 45% of the children. The level of the MACS was found to correlate with the accompanying comorbidities, namely intellectual delay and epilepsy. The Turkish version of the MACS is found to be valid and reliable, and is suggested to be appropriate for the assessment of manual ability within the Turkish population.
Reliability and criterion-related validity of a new repeated agility test

PubMed Central

Makni, E; Jemni, M; Elloumi, M; Chamari, K; Nabli, MA; Padulo, J; Moalla, W

2016-01-01

The study aimed to assess the reliability and the criterion-related validity of a new repeated sprint T-test (RSTT) that includes intense multidirectional intermittent efforts. The RSTT consisted of 7 maximal repeated executions of the agility T-test with 25 s of passive recovery rest in between. Forty-five team sports players performed two RSTTs separated by 3 days to assess the reliability of best time (BT) and total time (TT) of the RSTT. The intra-class correlation coefficient analysis revealed a high relative reliability between test and retest for BT and TT (>0.90). The standard error of measurement (<0.50) showed that the RSTT has a good absolute reliability. The minimal detectable change values for BT and TT related to the RSTT were 0.09 s and 0.58 s, respectively. To check the criterion-related validity of the RSTT, players performed a repeated linear sprint (RLS) and a repeated sprint with changes of direction (RSCD). Significant correlations between the BT and TT of the RLS, RSCD and RSTT were observed (p<0.001). The RSTT is, therefore, a reliable and valid measure of the intermittent repeated sprint agility performance. As this ability is required in all team sports, it is suggested that team sports coaches, fitness coaches and sports scientists consider this test in their training follow-up. PMID:27274109
High-resolution dental magnetic resonance imaging for planning palatal graft surgery-a clinical pilot study.

PubMed

Hilgenfeld, Tim; Kästel, Thorsten; Heil, Alexander; Rammelsberg, Peter; Heiland, Sabine; Bendszus, Martin; Schwindling, Franz Sebastian

2018-04-01

To evaluate whether high-resolution, non-contrast-enhanced dental magnetic resonance imaging (MRI) can be used for accurate determination of palatal masticatory mucosa thickness (PMMT) and to locate the greater palatal artery (GPA). In five volunteers (four males, one female; mean age 30.2 ± 0.4 years), two independent raters measured PMMT by use of dental MRI in 180 positions. For comparison, clinical bone sounding was performed. The GPA was identified in time-of-flight (TOF) angiography and MSVAT-SPACE-prototype sequence. Intra- and inter-observer agreement for MRI measurements, agreement between MRI and bone sounding were analysed by intra-class correlation coefficient (ICC) and Cohen's kappa (κ). Reliability of dental MRI measurements was high (intra-observer-ICC 0.962; inter-observer ICC 0.959). Agreement of MRI measurements with bone sounding was moderate (ICC 0.744), and the GPA could be identified in 60% of measurement points using the TOF-angiography alone and in 85% with additional information of the MSVAT-SPACE. Good intra-observer agreement was observed for GPA identification (κ: 0.778). Palatal masticatory mucosa thickness measured by high-resolution, non-contrast enhanced dental MRI is comparable with that obtained by bone sounding. Dental MRI enables reliable, non-invasive and radiation-free planning of palatal tissue harvesting and can also be used for location of the GPA at 85% of measurement points, which might help reduce complications during surgery. © 2018 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Reliability and Concurrent Validity of Dynamic Rotator Stability Test-A Cross Sectional study.

PubMed

Binoy Mathew, K V; Eapen, Charu; Kumar, P Senthil

2012-01-01

To find intra rater and inter rater reliability of Dynamic Rotator Stability Test (DRST) and to find concurrent validity of Dynamic Rotator Stability Test (DRST) with University of Pennsylvania Shoulder Score (PENN) Scale. 40 subjects of either gender between the age group of 18-70 with painful shoulder conditions of musculoskeletal origin was selected through convenient sampling. Tester 1 and tester 2 administered DRST and PENN scale randomly. In a subgroup of 20 subjects DRST was administered by both the testers to find the inter rater reliability. 180° Standard Universal Goniometer was used to take measurements. For intra-rater reliability, all the test variables were showing highly significant correlation (p=.94 - 1). For inter -rater, with tester 2, test variables like position, ROM, force, direction of abnormal translation, pain during the test, compensatory movement during test were found to be significant (p=.71-1).only some variables of DRST showed significant correlation with PENN scale (P=.320-.450). Dynamic Rotator Stability Test has good intra rater and moderate inter rater reliability. Concurrent validity of Dynamic Rotator Stability Test was found to be poor when compared to PENN Shoulder Score.
Chinese translation and validation of the Walking Impairment Questionnaire in patients with peripheral artery disease.

PubMed

Yan, Bryan P; Lau, James Y; Yu, Check-Man; Au, Kim; Chan, Ka-Wai; Yu, Doris S; Ma, Ronald C; Lam, Yat-Yin; Hiatt, William R

2011-06-01

The Walking Impairment Questionnaire (WIQ) is a frequently used questionnaire to evaluate patients with intermittent claudication on four subscales: pain severity, walking distance, walking speed and the ability to climb stairs. The aim of this study is to translate and validate the WIQ in Chinese. After translation and cultural adaptation of the WIQ, 134 patients with intermittent claudication completed the Chinese WIQ and European Quality of Life 5 Dimension (EQ-5D). Walking distances were determined by the 6-minute walk test (6MWT). Correlations between the WIQ, quality of life questionnaire and walking distances were calculated to determine validity. Reliability and internal consistency were determined using the intra-class correlation coefficient (ICC) and Cronbach's alpha (α), respectively. Significant correlations were found between the WIQ score, initial claudication distance (ICD), absolute claudication distance (ACD) and all domains of the EQ-5D (all p ≤ 0.01). Test-retest reliability (ICC = 0.74) and the overall internal consistency determined (α = 0.90) showed good agreement. A lower WIQ score corresponded to shorter walking distances. In conclusion, this study showed that the Chinese version of the WIQ is a valid, reliable and clinically relevant instrument for assessing walking impairment in patients with intermittent claudication.
The City of Hope-Quality of Life-Ostomy Questionnaire: Persian Translation and Validation

PubMed Central

Anaraki, F; Vafaie, M; Behboo, R; Esmaeilpour, S; Maghsoodi, N; Safaee, A; Grant, M

2014-01-01

Background: Since there is no disease-specific instrument for measuring quality-of-life (QOL) in Ostomy patients in Persian language. Aim: This study was designed to translate and evaluate the validity and reliability of City of Hope-quality of life-Ostomy questionnaire (COH-QOL-Ostomy questionnaire). Subjects and Methods: This study was designed as cross-sectional study. Reliability of the subscales and the summary scores were demonstrated by intra-class correlation coefficients. Pearson's correlations of an item with its own scale and other scales were calculated to evaluated convergent and discriminant validity. Clinical validity was also evaluated by known-group comparisons. Results: Cronbach's alpha coefficient for all subscales was about 0.70 or higher. Results of interscale correlation were satisfactory and each subscale only measured a single and specified trait. All subscales met the standards of convergent and discriminant validity. Known group comparison analysis showed significant differences in social and spiritual well-being. Conclusion: The findings confirmed the reliability and validity of Persian version of COH-QOL-Ostomy questionnaire. The instrument was also well received by the Iranian patients. It can be considered as a valuable instrument to assess the different aspects of health related quality-of-life in Ostomy patients and used in clinical research in the future. PMID:25221719
Validity and reliability of a novel immunosuppressive adverse effects scoring system in renal transplant recipients.

PubMed

Meaney, Calvin J; Arabi, Ziad; Venuto, Rocco C; Consiglio, Joseph D; Wilding, Gregory E; Tornatore, Kathleen M

2014-06-12

After renal transplantation, many patients experience adverse effects from maintenance immunosuppressive drugs. When these adverse effects occur, patient adherence with immunosuppression may be reduced and impact allograft survival. If these adverse effects could be prospectively monitored in an objective manner and possibly prevented, adherence to immunosuppressive regimens could be optimized and allograft survival improved. Prospective, standardized clinical approaches to assess immunosuppressive adverse effects by health care providers are limited. Therefore, we developed and evaluated the application, reliability and validity of a novel adverse effects scoring system in renal transplant recipients receiving calcineurin inhibitor (cyclosporine or tacrolimus) and mycophenolic acid based immunosuppressive therapy. The scoring system included 18 non-renal adverse effects organized into gastrointestinal, central nervous system and aesthetic domains developed by a multidisciplinary physician group. Nephrologists employed this standardized adverse effect evaluation in stable renal transplant patients using physical exam, review of systems, recent laboratory results, and medication adherence assessment during a clinic visit. Stable renal transplant recipients in two clinical studies were evaluated and received immunosuppressive regimens comprised of either cyclosporine or tacrolimus with mycophenolic acid. Face, content, and construct validity were assessed to document these adverse effect evaluations. Inter-rater reliability was determined using the Kappa statistic and intra-class correlation. A total of 58 renal transplant recipients were assessed using the adverse effects scoring system confirming face validity. Nephrologists (subject matter experts) rated the 18 adverse effects as: 3.1 ± 0.75 out of 4 (maximum) regarding clinical importance to verify content validity. The adverse effects scoring system distinguished 1.75-fold increased gastrointestinal adverse effects (p=0.008) in renal transplant recipients receiving tacrolimus and mycophenolic acid compared to the cyclosporine regimen. This finding demonstrated construct validity. Intra-class correlation was 0.81 (95% confidence interval: 0.65-0.90) and Kappa statistic of 0.68 ± 0.25 for all 18 adverse effects and verified substantial inter-rater reliability. This immunosuppressive adverse effects scoring system in stable renal transplant recipients was evaluated and substantiated face, content and construct validity with inter-rater reliability. The scoring system may facilitate prospective, standardized clinical monitoring of immunosuppressive adverse drug effects in stable renal transplant recipients and improve medication adherence.
A reliability assessment of constrained spherical deconvolution-based diffusion-weighted magnetic resonance imaging in individuals with chronic stroke.

PubMed

Snow, Nicholas J; Peters, Sue; Borich, Michael R; Shirzad, Navid; Auriat, Angela M; Hayward, Kathryn S; Boyd, Lara A

2016-01-15

Diffusion-weighted magnetic resonance imaging (DW-MRI) is commonly used to assess white matter properties after stroke. Novel work is utilizing constrained spherical deconvolution (CSD) to estimate complex intra-voxel fiber architecture unaccounted for with tensor-based fiber tractography. However, the reliability of CSD-based tractography has not been established in people with chronic stroke. Establishing the reliability of CSD-based DW-MRI in chronic stroke. High-resolution DW-MRI was performed in ten adults with chronic stroke during two separate sessions. Deterministic region of interest-based fiber tractography using CSD was performed by two raters. Mean fractional anisotropy (FA), apparent diffusion coefficient (ADC), tract number, and tract volume were extracted from reconstructed fiber pathways in the corticospinal tract (CST) and superior longitudinal fasciculus (SLF). Callosal fiber pathways connecting the primary motor cortices were also evaluated. Inter-rater and test-retest reliability were determined by intra-class correlation coefficients (ICCs). ICCs revealed excellent reliability for FA and ADC in ipsilesional (0.86-1.00; p<0.05) and contralesional hemispheres (0.94-1.00; p<0.0001), for CST and SLF fibers; and excellent reliability for all metrics in callosal fibers (0.85-1.00; p<0.05). ICC ranged from poor to excellent for tract number and tract volume in ipsilesional (-0.11 to 0.92; p≤0.57) and contralesional hemispheres (-0.27 to 0.93; p≤0.64), for CST and SLF fibers. Like other select DW-MRI approaches, CSD-based tractography is a reliable approach to evaluate FA and ADC in major white matter pathways, in chronic stroke. Future work should address the reproducibility and utility of CSD-based metrics of tract number and tract volume. Copyright © 2015 Elsevier B.V. All rights reserved.
INTRA-RATER RELIABILITY OF THE MULTIPLE SINGLE-LEG HOP-STABILIZATION TEST AND RELATIONSHIPS WITH AGE, LEG DOMINANCE AND TRAINING.

PubMed

Sawle, Leanne; Freeman, Jennifer; Marsden, Jonathan

2017-04-01

Balance is a complex construct, affected by multiple components such as strength and co-ordination. However, whilst assessing an athlete's dynamic balance is an important part of clinical examination, there is no gold standard measure. The multiple single-leg hop-stabilization test is a functional test which may offer a method of evaluating the dynamic attributes of balance, but it needs to show adequate intra-tester reliability. The purpose of this study was to assess the intra-rater reliability of a dynamic balance test, the multiple single-leg hop-stabilization test on the dominant and non-dominant legs. Intra-rater reliability study. Fifteen active participants were tested twice with a 10-minute break between tests. The outcome measure was the multiple single-leg hop-stabilization test score, based on a clinically assessed numerical scoring system. Results were analysed using an Intraclass Correlations Coefficient (ICC 2,1 ) and Bland-Altman plots. Regression analyses explored relationships between test scores, leg dominance, age and training (an alpha level of p = 0.05 was selected). ICCs for intra-rater reliability were 0.85 for the dominant and non-dominant legs (confidence intervals = 0.62-0.95 and 0.61-0.95 respectively). Bland-Altman plots showed scores within two standard deviations. A significant correlation was observed between the dominant and non-dominant leg on balance scores (R 2 =0.49, p<0.05), and better balance was associated with younger participants in their non-dominant leg (R 2 =0.28, p<0.05) and their dominant leg (R 2 =0.39, p<0.05), and a higher number of hours spent training for the non-dominant leg R 2 =0.37, p<0.05). The multiple single-leg hop-stabilisation test demonstrated strong intra-tester reliability with active participants. Younger participants who trained more, have better balance scores. This test may be a useful measure for evaluating the dynamic attributes of balance. 3.
The Queensland high risk foot form (QHRFF) – is it a reliable and valid clinical research tool for foot disease?

PubMed Central

2014-01-01

Background Foot disease complications, such as foot ulcers and infection, contribute to considerable morbidity and mortality. These complications are typically precipitated by “high-risk factors”, such as peripheral neuropathy and peripheral arterial disease. High-risk factors are more prevalent in specific “at risk” populations such as diabetes, kidney disease and cardiovascular disease. To the best of the authors’ knowledge a tool capturing multiple high-risk factors and foot disease complications in multiple at risk populations has yet to be tested. This study aimed to develop and test the validity and reliability of a Queensland High Risk Foot Form (QHRFF) tool. Methods The study was conducted in two phases. Phase one developed a QHRFF using an existing diabetes foot disease tool, literature searches, stakeholder groups and expert panel. Phase two tested the QHRFF for validity and reliability. Four clinicians, representing different levels of expertise, were recruited to test validity and reliability. Three cohorts of patients were recruited; one tested criterion measure reliability (n = 32), another tested criterion validity and inter-rater reliability (n = 43), and another tested intra-rater reliability (n = 19). Validity was determined using sensitivity, specificity and positive predictive values (PPV). Reliability was determined using Kappa, weighted Kappa and intra-class correlation (ICC) statistics. Results A QHRFF tool containing 46 items across seven domains was developed. Criterion measure reliability of at least moderate categories of agreement (Kappa > 0.4; ICC > 0.75) was seen in 91% (29 of 32) tested items. Criterion validity of at least moderate categories (PPV > 0.7) was seen in 83% (60 of 72) tested items. Inter- and intra-rater reliability of at least moderate categories (Kappa > 0.4; ICC > 0.75) was seen in 88% (84 of 96) and 87% (20 of 23) tested items respectively. Conclusions The QHRFF had acceptable validity and reliability across the majority of items; particularly items identifying relevant co-morbidities, high-risk factors and foot disease complications. Recommendations have been made to improve or remove identified weaker items for future QHRFF versions. Overall, the QHRFF possesses suitable practicality, validity and reliability to assess and capture relevant foot disease items across multiple at risk populations. PMID:24468080
Reliability and Validity of a New Test of Change-of-Direction Speed for Field-Based Sports: the Change-of-Direction and Acceleration Test (CODAT).

PubMed

Lockie, Robert G; Schultz, Adrian B; Callaghan, Samuel J; Jeffriess, Matthew D; Berry, Simon P

2013-01-01

Field sport coaches must use reliable and valid tests to assess change-of-direction speed in their athletes. Few tests feature linear sprinting with acute change- of-direction maneuvers. The Change-of-Direction and Acceleration Test (CODAT) was designed to assess field sport change-of-direction speed, and includes a linear 5-meter (m) sprint, 45° and 90° cuts, 3- m sprints to the left and right, and a linear 10-m sprint. This study analyzed the reliability and validity of this test, through comparisons to 20-m sprint (0-5, 0-10, 0-20 m intervals) and Illinois agility run (IAR) performance. Eighteen Australian footballers (age = 23.83 ± 7.04 yrs; height = 1.79 ± 0.06 m; mass = 85.36 ± 13.21 kg) were recruited. Following familiarization, subjects completed the 20-m sprint, CODAT, and IAR in 2 sessions, 48 hours apart. Intra-class correlation coefficients (ICC) assessed relative reliability. Absolute reliability was analyzed through paired samples t-tests (p ≤ 0.05) determining between-session differences. Typical error (TE), coefficient of variation (CV), and differences between the TE and smallest worthwhile change (SWC), also assessed absolute reliability and test usefulness. For the validity analysis, Pearson's correlations (p ≤ 0.05) analyzed between-test relationships. Results showed no between-session differences for any test (p = 0.19-0.86). CODAT time averaged ~6 s, and the ICC and CV equaled 0.84 and 3.0%, respectively. The homogeneous sample of Australian footballers meant that the CODAT's TE (0.19 s) exceeded the usual 0.2 x standard deviation (SD) SWC (0.10 s). However, the CODAT is capable of detecting moderate performance changes (SWC calculated as 0.5 x SD = 0.25 s). There was a near perfect correlation between the CODAT and IAR (r = 0.92), and very large correlations with the 20-m sprint (r = 0.75-0.76), suggesting that the CODAT was a valid change-of-direction speed test. Due to movement specificity, the CODAT has value for field sport assessment. Key pointsThe change-of-direction and acceleration test (CODAT) was designed specifically for field sport athletes from specific speed research, and data derived from time-motion analyses of sports such as rugby union, soccer, and Australian football. The CODAT features a linear 5-meter (m) sprint, 45° and 90° cuts and 3-m sprints to the left and right, and a linear 10-m sprint.The CODAT was found to be a reliable change-of-direction speed assessment when considering intra-class correlations between two testing sessions, and the coefficient of variation between trials. A homogeneous sample of Australian footballers resulted in absolute reliability limitations when considering differences between the typical error and smallest worthwhile change. However, the CODAT will detect moderate (0.5 times the test's standard deviation) changes in performance.The CODAT correlated with the Illinois agility run, highlighting that it does assess change-of-direction speed. There were also significant relationships with short sprint performance (i.e. 0-5 m and 0-10 m), demonstrating that linear acceleration is assessed within the CODAT, without the extended duration and therefore metabolic limitations of the IAR. Indeed, the average duration of the test (~6 seconds) is field sport-specific. Therefore, the CODAT could be used as an assessment of change-of-direction speed in field sport athletes.
Kinematic predictors of single-leg squat performance: a comparison of experienced physiotherapists and student physiotherapists.

PubMed

Weeks, Benjamin K; Carty, Christopher P; Horan, Sean A

2012-10-25

The single-leg squat (SLS) is a common test used by clinicians for the musculoskeletal assessment of the lower limb. The aim of the current study was to reveal the kinematic parameters used by experienced and inexperienced clinicians to determine SLS performance and establish reliability of such assessment. Twenty-two healthy, young adults (23.8 ± 3.1 years) performed three SLSs on each leg whilst being videoed. Three-dimensional data for the hip and knee was recorded using a 10-camera optical motion analysis system (Vicon, Oxford, UK). SLS performance was rated from video data using a 10-point ordinal scale by experienced musculoskeletal physiotherapists and student physiotherapists. All ratings were undertaken a second time at least two weeks after the first by the same raters. Stepwise multiple regression analysis was performed to determine kinematic predictors of SLS performance scores and inter- and intra-rater reliability were determined using a two-way mixed model to generate intra-class correlation coefficients (ICC3,1) of consistency. One SLS per leg for each participant was used for analysis, providing 44 SLSs in total. Eight experienced physiotherapists and eight physiotherapy students agreed to rate each SLS. Variance in physiotherapist scores was predicted by peak knee flexion, knee medio-lateral displacement, and peak hip adduction (R2 = 0.64, p = 0.01), while variance in student scores was predicted only by peak knee flexion, and knee medio-lateral displacement (R2 = 0.57, p = 0.01). Inter-rater reliability was good for physiotherapists (ICC3,1 = 0.71) and students (ICC3,1 = 0.60), whilst intra-rater reliability was excellent for physiotherapists (ICC3,1 = 0.81) and good for students (ICC3,1 = 0.71). Physiotherapists and students are both capable of reliable assessment of SLS performance. Physiotherapist assessments, however, bear stronger relationships to lower limb kinematics and are more sensitive to hip joint motion than student assessments.
Test-retest reliability of a handheld dynamometer for measurement of isometric cervical muscle strength.

PubMed

Vannebo, Katrine Tranaas; Iversen, Vegard Moe; Fimland, Marius Steiro; Mork, Paul Jarle

2018-03-02

There is a lack of test-retest reliability studies of measurements of cervical muscle strength, taking into account gender and possible learning effects. To investigate test-retest reliability of measurement of maximal isometric cervical muscle strength by handheld dynamometry. Thirty women (age 20-58 years) and 28 men (age 20-60 years) participated in the study. Maximal isometric strength (neck flexion, neck extension, and right/left lateral flexion) was measured on three separate days at least five days apart by one evaluator. Intra-rater consistency tended to improve from day 1-2 measurements to day 2-3 measurements in both women and men. In women, the intra-class correlation coefficients (ICC) for day 2 to day 3 measurements were 0.91 (95% confidence interval [CI], 0.82-0.95) for neck flexion, 0.88 (95% CI, 0.76-0.94) for neck extension, 0.84 (95% CI, 0.68-0.92) for right lateral flexion, and 0.89 (95% CI, 0.78-0.95) for left lateral flexion. The corresponding ICCs among men were 0.86 (95% CI, 0.72-0.93) for neck flexion, 0.93 (95% CI, 0.85-0.97) for neck extension, 0.82 (95% CI, 0.65-0.91) for right lateral flexion and 0.73 (95% CI, 0.50-0.87) for left lateral flexion. This study describes a reliable and easy-to-administer test for assessing maximal isometric cervical muscle strength.
Two-colour chewing gum mixing ability test for evaluating masticatory performance in children with mixed dentition: validity and reliability study.

PubMed

Kaya, M S; Güçlü, B; Schimmel, M; Akyüz, S

2017-11-01

The unappealing taste of the chewing material and the time-consuming repetitive task in masticatory performance tests using artificial foodstuff may discourage children from performing natural chewing movements. Therefore, the aim was to determine the validity and reliability of a two-colour chewing gum mixing ability test for masticatory performance (MP) assessment in mixed dentition children. Masticatory performance was tested in two groups: systemically healthy fully dentate young adults and children in mixed dentition. Median particle size was assessed using a comminution test, and a two-colour chewing gum mixing ability test was applied for MP analysis. Validity was tested with Pearson correlation, and reliability was tested with intra-class correlation coefficient, Pearson correlation and Bland-Altman plots. Both comminution and two-colour chewing gum mixing ability tests revealed statistically significant MP differences between children (n = 25) and adults (n = 27, both P < 0·01). Pearson correlation between comminution and two-colour chewing gum mixing ability tests was positive and significant (r = 0·418, P = 0·002). Correlations for interobserver reliability and test-retest values were significant (r = 0·990, P = 0·0001 and r = 0·995, P = 0·0001). Although both methods could discriminate MP differences, the comminution test detected these differences generally in a wider range compared to two-colour chewing gum mixing ability test. However, considering the high reliability of the results, the two-colour chewing gum mixing ability test can be used to assess masticatory performance in children, especially at non-clinical settings. © 2017 John Wiley & Sons Ltd.
Mechanical Player Load™ using trunk-mounted accelerometry in football: Is it a reliable, task- and player-specific observation?

PubMed

Barreira, Paulo; Robinson, Mark A; Drust, Barry; Nedergaard, Niels; Raja Azidin, Raja Mohammed Firhad; Vanrenterghem, Jos

2017-09-01

The aim of the present study was to examine reliability and construct convergent validity of Player Load™ (PL) from trunk-mounted accelerometry, expressed as a cumulative measure and an intensity measure (PL · min - 1 ). Fifteen male participants twice performed an overground football match simulation that included four different multidirectional football actions (jog, side cut, stride and sprint) whilst wearing a trunk-mounted accelerometer inbuilt in a global positioning system unit. Results showed a moderate-to-high reliability as indicated by the intra-class correlation coefficient (0.806-0.949) and limits of agreement. Convergent validity analysis showed considerable between-participant variation (coefficient of variation range 14.5-24.5%), which was not explained from participant demographics despite a negative association with body height for the stride task. Between-task variations generally showed a moderate correlation between ranking of participants for PL (0.593-0.764) and PL · min - 1 (0.282-0.736). It was concluded that monitoring PL ® in football multidirectional actions presents moderate-to-high reliability, that between-participant variability most likely relies on the individual's locomotive skills and not their anthropometrics, and that the intensity of a task expressed by PL · min - 1 is largely related to the running velocity of the task.
Test-retest reliability of the irrational performance beliefs inventory.

PubMed

Turner, M J; Slater, M J; Dixon, J; Miller, A

2018-02-01

The irrational performance beliefs inventory (iPBI) was developed to measure irrational beliefs within performance domains such as sport, academia, business, and the military. Past research indicates that the iPBI has good construct, concurrent, and predictive validity, but the test-retest reliability of the iPBI has not yet been examined. Therefore, in the present study the iPBI was administered to university sport and exercise students (n = 160) and academy soccer athletes (n = 75) at three-time points. Time point two occurred 7 days after time point one, and time point three occurred 21 days after time point two. In addition, social desirability was also measured. Repeated-measures MANCOVAs, intra-class coefficients, and Pearson's (r) correlations demonstrate that the iPBI has good test-retest reliability, with iPBI scores remaining stable across the three-time points. Pearson's correlation coefficients revealed no relationships between the iPBI and social desirability, indicating that the iPBI is not highly susceptible to response bias. The results are discussed with reference to the continued usage and development of the iPBI, and future research recommendations relating to the investigation of irrational performance beliefs are proposed.
Comparison of caries detection methods using varying numbers of intra-oral digital photographs with visual examination for epidemiology in children

PubMed Central

2013-01-01

Background This was a method comparison study. The aim of study was to compare caries information obtained from a full mouth visual examination using the method developed by the British Association for the Study of Community Dentistry (BASCD) for epidemiological surveys with caries data obtained from eight, six and four intra-oral digital photographs of index teeth in two groups of children aged 5 years and 10/11 years. Methods Five trained and calibrated examiners visually examined the whole mouth of 240 5-year-olds and 250 10-/11-year-olds using the BASCD method. The children also had intra-oral digital photographs taken of index teeth. The same 5 examiners assessed the intra-oral digital photographs (in groups of 8, 6 and 4 intra-oral photographs) for caries using the BASCD criteria; dmft/DMFT were used to compute Weighted Kappa Statistic as a measure of intra-examiner reliability and intra-class correlation coefficients as a measure of inter-examiner reliability for each method. A method comparison analysis was performed to determine the 95% limits of agreement for all five examiners, comparing the visual examination method with the photographic assessment method using 8, 6 and 4 intra-oral photographs. Results The intra-rater reliability for the visual examinations ranged from 0.81 to 0.94 in the 5-year-olds and 0.90 to 0.97 in the 10-/11-year-olds. Those for the photographic assessments in the 5-year-olds were for 8 intra-oral photographs, 0.86 to 0.94, for 6 intra-oral photographs, 0.85 to 0.98 and for 4 intra-oral photographs, 0.80 to 0.96; for the 10-/11-year-olds were for 8 intra-oral photographs 0.84 to 1.00, for 6 intra-oral photographs 0.82 to 1.00 and for 4 intra-oral photographs 0.72 to 0.98. The 95% limits of agreement were −1.997 to 1.967, -2.375 to 2.735 and −2.250 to 2.921 respectively for the 5-year-olds and −2.614 to 2.027, -2.179 to 3.887 and −2.594 to 2.163 respectively for the 10-/11-year-olds. Conclusions The photographic assessment method, particularly assessment of 8 intra-oral digital photographs is comparable to the visual examination method in the primary dentition. With the additional benefits of archiving, remote scoring, allowing multiple scorers to score images and enabling longitudinal analysis, the photographic assessment method may be used as an alternative caries detection method in the primary dentition in situations where the visual examination method may not be applicable such as when examiner blinding is required and in practice based randomised controlled trials (RCTs). PMID:23312001
Inter- and intra-rater reliability of calliper-based lymph node measurement in dogs with peripheral nodal lymphomas.

PubMed

Childress, M O; Fulkerson, C M; Lahrman, S A; Weng, H-Y

2016-08-01

The purpose of this study was to assess reliability of lymph node measurements between and within raters in dogs with nodal lymphomas. Three raters measured lymph nodes from 20 dogs twice prior to and once after administering chemotherapy. Sum tumour volume (TV) and sum longest diameter (LD) of all lymph nodes at each time point, and the percent change in measurements following chemotherapy, were calculated for each dog. Inter- and intra-rater reliability were assessed with the intraclass correlation coefficient (ICC). ICC for inter-rater sum TV and sum LD prior to chemotherapy were 0.86 and 0.80, respectively. ICC for inter-rater sum TV and sum LD after chemotherapy were 0.95 and 0.91, respectively. ICC for percent change in sum TV and sum LD were 0.96 and 0.94, respectively. ICC for intra-rater reliability ranged from 0.90 to 0.98 for each rater. Inter- and intra-rater reliability in measurements among the three raters was good to excellent. © 2014 John Wiley & Sons Ltd.
Test-retest reliability of jump execution variables using mechanography: a comparison of jump protocols.

PubMed

Fitzgerald, John S; Johnson, LuAnn; Tomkinson, Grant; Stein, Jesse; Roemmich, James N

2018-05-01

Mechanography during the vertical jump may enhance screening and determining mechanistic causes underlying physical performance changes. Utility of jump mechanography for evaluation is limited by scant test-retest reliability data on force-time variables. This study examined the test-retest reliability of eight jump execution variables assessed from mechanography. Thirty-two women (mean±SD: age 20.8 ± 1.3 yr) and 16 men (age 22.1 ± 1.9 yr) attended a familiarization session and two testing sessions, all one week apart. Participants performed two variations of the squat jump with squat depth self-selected and controlled using a goniometer to 80º knee flexion. Test-retest reliability was quantified as the systematic error (using effect size between jumps), random error (using coefficients of variation), and test-retest correlations (using intra-class correlation coefficients). Overall, jump execution variables demonstrated acceptable reliability, evidenced by small systematic errors (mean±95%CI: 0.2 ± 0.07), moderate random errors (mean±95%CI: 17.8 ± 3.7%), and very strong test-retest correlations (range: 0.73-0.97). Differences in random errors between controlled and self-selected protocols were negligible (mean±95%CI: 1.3 ± 2.3%). Jump execution variables demonstrated acceptable reliability, with no meaningful differences between the controlled and self-selected jump protocols. To simplify testing, a self-selected jump protocol can be used to assess force-time variables with negligible impact on measurement error.

The Factor Structure of the Spiritual Well-Being Scale in Veterans Experienced Chemical Weapon Exposure.

PubMed

Sharif Nia, Hamid; Pahlevan Sharif, Saeed; Boyle, Christopher; Yaghoobzadeh, Ameneh; Tahmasbi, Bahram; Rassool, G Hussein; Taebei, Mozhgan; Soleimani, Mohammad Ali

2018-04-01

This study aimed to determine the factor structure of the spiritual well-being among a sample of the Iranian veterans. In this methodological research, 211 male veterans of Iran-Iraq warfare completed the Paloutzian and Ellison spiritual well-being scale. Maximum likelihood (ML) with oblique rotation was used to assess domain structure of the spiritual well-being. The construct validity of the scale was assessed using confirmatory factor analysis (CFA), convergent validity, and discriminant validity. Reliability was evaluated with Cronbach's alpha, Theta (θ), and McDonald Omega (Ω) coefficients, intra-class correlation coefficient (ICC), and construct reliability (CR). Results of ML and CFA suggested three factors which were labeled "relationship with God," "belief in fate and destiny," and "life optimism." The ICC, coefficients of the internal consistency, and CR were >.7 for the factors of the scale. Convergent validity and discriminant validity did not fulfill the requirements. The Persian version of spiritual well-being scale demonstrated suitable validity and reliability among the veterans of Iran-Iraq warfare.
A two-factor theory for concussion assessment using ImPACT: memory and speed.

PubMed

Schatz, Philip; Maerlender, Arthur

2013-12-01

We present the initial validation of a two-factor structure of Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) using ImPACT composite scores and document the reliability and validity of this factor structure. Factor analyses were conducted for baseline (N = 21,537) and post-concussion (N = 560) data, yielding "Memory" (Verbal and Visual) and "Speed" (Visual Motor Speed and Reaction Time) Factors; inclusion of Total Symptom Scores resulted in a third discrete factor. Speed and Memory z-scores were calculated, and test-retest reliability (using intra-class correlation coefficients) at 1 month (0.88/0.81), 1 year (0.85/0.75), and 2 years (0.76/0.74) were higher than published data using Composite scores. Speed and Memory scores yielded 89% sensitivity and 70% specificity, which was higher than composites (80%/62%) and comparable with subscales (91%/69%). This emergent two-factor structure has improved test-retest reliability with no loss of sensitivity/specificity and may improve understanding and interpretability of ImPACT test results.
Reliability and validity of a Chinese version of the Diagnostic Interview for Borderlines-Revised.

PubMed

Wang, Lanlan; Yuan, Chenmei; Qiu, Jianying; Gunderson, John; Zhang, Min; Jiang, Kaida; Leung, Freedom; Zhong, Jie; Xiao, Zeping

2014-09-01

Borderline personality disorder (BPD) is the most studied of the axis II disorders. One of the most widely used diagnostic instruments is the Diagnostic Interview for Borderline Patients-Revised (DIB-R). The aim of this study was to test the reliability and validity of DIB-R for use in the Chinese culture. The reliability and validity of the DIB-R Chinese version were assessed in a sample of 236 outpatients with a probable BPD diagnosis. The Structured Clinical Interview for DSM-IV Personality Disorders (SCID-II) was used as a standard. Test-retest reliability was tested six months later with 20 patients, and inter-rater reliability was tested on 32 patients. The Chinese version of the DIB-R showed good internal global consistency (Cronbach's α of 0.916), good test-retest reliability (Pearson correlation of 0.704), good inter-rater reliability (intra-class correlation coefficient of 0.892 and kappa of 0.861). When compared with the DSM-IV diagnosis as measured by the SCID-II, the DIB-R showed relatively good sensitivity (0.768) and specificity (0.891) at the cutoff of 7, moderate diagnostic convergence (kappa of 0.631), as well as good discriminating validity. The Chinese version of the DIB-R has good psychometric properties, which renders it a valuable method for examining the presence, the severity, and component phenotypes of BPD in Chinese samples. © 2013 Wiley Publishing Asia Pty Ltd.
Trunk Muscle Size and Composition Assessment in Older Adults with Chronic Low Back Pain: An Intra-Examiner and Inter-Examiner Reliability Study.

PubMed

Sions, Jaclyn Megan; Smith, Andrew Craig; Hicks, Gregory Evan; Elliott, James Matthew

2016-08-01

To evaluate intra- and inter-examiner reliability for the assessment of relative cross-sectional area, muscle-to-fat infiltration indices, and relative muscle cross-sectional area, i.e., total cross-sectional area minus intramuscular fat, from T1-weighted magnetic resonance images obtained in older adults with chronic low back pain. Reliability study. n = 13 (69.3 ± 8.2 years old) After lumbar magnetic resonance imaging, two examiners produced relative cross-sectional area measurements of multifidi, erector spinae, psoas, and quadratus lumborum by tracing regions of interest just inside fascial borders. Pixel-intensity summaries were used to determine muscle-to-fat infiltration indices; relative muscle cross-sectional area was calculated. Intraclass correlation coefficients were used to estimate intra- and inter-examiner reliability; standard error of measurement was calculated. Intra-examiner intraclass correlation coefficient point estimates for relative cross-sectional area, muscle-to-fat infiltration indices, and relative muscle cross-sectional area were excellent for multifidi and erector spinae across levels L2-L5 (ICC = 0.77-0.99). At L3, intra-examiner reliability was excellent for relative cross-sectional area, muscle-to-fat infiltration indices, and relative muscle cross-sectional area for both psoas and quadratus lumborum (ICC = 0.81-0.99). Inter-examiner intraclass correlation coefficients ranged from poor to excellent for relative cross-sectional area, muscle-to-fat infiltration indices, and relative muscle cross-sectional area. Assessment of relative cross-sectional area, muscle-to-fat infiltration indices, and relative muscle cross-sectional area in older adults with chronic low back pain can be reliably determined by one examiner from T1-weighted images. Such assessments provide valuable information, as muscle-to-fat infiltration indices and relative muscle cross-sectional area indicate that a substantial amount of relative cross-sectional area may be magnetic resonance-visible intramuscular fat in older adults with chronic low back pain. © 2015 American Academy of Pain Medicine. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Inter- and intra-observer reliability of measurement of pedicle screw breach assessed by postoperative CT scans.

PubMed

Lavelle, William F; Ranade, Ashish; Samdani, Amer F; Gaughan, John P; D'Andrea, Linda P; Betz, Randal R

2014-01-01

Pedicle screws are used increasingly in spine surgery. Concerns of complications associated with screw breach necessitates accurate pedicle screw placement. Postoperative CT imaging helps to detect screw malposition and assess its severity. However, accuracy is dependent on the reading of the CT scans. Inter- and intra-observer variability could affect the reliability of CT scans to assess multiple screw types and sites. The purpose of this study was to assess the reliability of multi-observer analysis of CT scans for determining pedicle screw breach for various screw types and sites in patients with spinal deformity or degenerative pathologies. Axial CT scan images of 23 patients (286 screws) were read by four experienced spine surgeons. Pedicle screw placement was considered 'In' when the screw was fully contained and/or the pedicle wall breach was ≤2 mm. 'Out' was defined as a breach in the medial or lateral pedicle wall >2 mm. Intra-class coefficients (ICC) were calculated to assess the inter- and intra-observer reliability. Marked inter- and intra-observer variability was noticed. The overall inter-observer ICC was 0.45 (95% confidence limits 0.25 to 0.65). The intra-observer ICC was 0.49 (95% confidence limits 0.29 to 0.69). Underlying spinal pathology, screw type, and patient age did not seem to impact the reliability of our CT assessments. Our results indicate the evaluation of pedicle screw breach on CT by a single surgeon is highly variable, and care should be taken when using individual CT evaluations of millimeters of breach as a basis for screw removal. This was a Level III study.
Movement-related beta oscillations show high intra-individual reliability.

PubMed

Espenhahn, Svenja; de Berker, Archy O; van Wijk, Bernadette C M; Rossiter, Holly E; Ward, Nick S

2017-02-15

Oscillatory activity in the beta frequency range (15-30Hz) recorded from human sensorimotor cortex is of increasing interest as a putative biomarker of motor system function and dysfunction. Despite its increasing use in basic and clinical research, surprisingly little is known about the test-retest reliability of spectral power and peak frequency measures of beta oscillatory signals from sensorimotor cortex. Establishing that these beta measures are stable over time in healthy populations is a necessary precursor to their use in the clinic. Here, we used scalp electroencephalography (EEG) to evaluate intra-individual reliability of beta-band oscillations over six sessions, focusing on changes in beta activity during movement (Movement-Related Beta Desynchronization, MRBD) and after movement termination (Post-Movement Beta Rebound, PMBR). Subjects performed visually-cued unimanual wrist flexion and extension. We assessed Intraclass Correlation Coefficients (ICC) and between-session correlations for spectral power and peak frequency measures of movement-related and resting beta activity. Movement-related and resting beta power from both sensorimotor cortices was highly reliable across sessions. Resting beta power yielded highest reliability (average ICC=0.903), followed by MRBD (average ICC=0.886) and PMBR (average ICC=0.663). Notably, peak frequency measures yielded lower ICC values compared to the assessment of spectral power, particularly for movement-related beta activity (ICC=0.386-0.402). Our data highlight that power measures of movement-related beta oscillations are highly reliable, while corresponding peak frequency measures show greater intra-individual variability across sessions. Importantly, our finding that beta power estimates show high intra-individual reliability over time serves to validate the notion that these measures reflect meaningful individual differences that can be utilised in basic research and clinical studies. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
Intra-rater reliability of hallux flexor strength measures using the Nintendo Wii Balance Board.

PubMed

Quek, June; Treleaven, Julia; Brauer, Sandra G; O'Leary, Shaun; Clark, Ross A

2015-01-01

The purpose of this study was to investigate the intra-rater reliability of a new method in combination with the Nintendo Wii Balance Board (NWBB) to measure the strength of hallux flexor muscle. Thirty healthy individuals (age: 34.9 ± 12.9 years, height: 170.4 ± 10.5 cm, weight: 69.3 ± 15.3 kg, female = 15) participated. Repeated testing was completed within 7 days. Participants performed strength testing in sitting using a wooden platform in combination with the NWBB. This new method was set up to selectively recruit an intrinsic muscle of the foot, specifically the flexor hallucis brevis muscle. Statistical analysis was performed using intra-class coefficients and ordinary least product analysis. To estimate measurement error, standard error of measurement (SEM), minimal detectable change (MDC) and percentage error were calculated. Results indicate excellent intra-rater reliability (ICC = 0.982, CI = 0.96-0.99) with an absence of systematic bias. SEM, MDC and percentage error value were 0.5, 1.4 and 12 % respectively. This study demonstrates that a new method in combination with the NWBB application is reliable to measure hallux flexor strength and has potential to be used for future research and clinical application.
Translation and validation of chronic liver disease questionnaire (CLDQ) in Tamil language.

PubMed

Goel, Amit; Arivazhagan, Karunanithi; Sasi, Avani; Shanmugam, Vanathy; Koshi, Seleena; Pottakkat, Biju; Lakshmi, C P; Awasthi, Ashish

2017-05-01

Chronic liver disease questionnaire (CLDQ), a self-administered quality-of-life (QOL) instrument for chronic liver disease (CLD) patients, was originally developed in English language. We aimed to translate and validate CLDQ in Tamil language (CLDQ-T). CLDQ-T, prepared by two forward and two backward independent translations by four bilingual (Tamil and English) persons, and repeated iterative modifications, was validated in adult, native-Tamil patients with CLD. CLDQ-T was re-tested in some patients 2 weeks later. Convergent validity was assessed using Spearman's correlation, and discriminant validity by comparison with World Health Organization's brief QOL tool (WHOQOL-BREF). Reliability was assessed through internal consistency (Cronbach's alpha) and test-retest reliability (intra-class correlation). Cutoff used for statistical significance was p<0.05. The study included 126 patients (age: mean [SD] 46 years [12.5]; male 104; cause: alcohol 42%, HBV 25%, HCV 4%, cryptogenic 29%; CTP class A 47%, B 37%, and C 16%). In convergent validity, all domains except the "abdominal domain" showed significant correlation between CLDQ-T and WHOQOL-BREF. Patients with severe disease had lower scores for all domains of CLDQ-T except the "abdominal" domain, but not for any of the domains for WHOQOL-BREF. Overall Cronbach's alpha was 0.942, and more than 0.7 for all the individual domains except the "activity" domain. On retesting in 44 (35%) patients, intraclass correlation coefficient was 0.879 for the overall CLDQ-T score and >0.700 for individual domains. CLDQ-T was easily understood and showed good performance characteristics in assessing QOL in Tamil-speaking patients with CLD.
Psychometric properties of sleep and coping numeric rating scales in rheumatoid arthritis: a subanalysis of an etanercept trial.

PubMed

Avila-Ribeiro, Pedro; Brault, Yves; Dougados, Maxime; Gossec, Laure

2017-01-01

In rheumatoid arthritis, quality of sleep and ability to cope are important for patients; however their usefulness as outcome measures is not well established. Post-hoc analysis of an open-label 12-week trial of etanercept in biologic-naïve rheumatoid arthritis patients with visits at screening, baseline and over 12 weeks. Outcomes measured included Disease Activity Score 28 erythrocyte sedimentation rate (DAS28), numeric rating scales for sleep, coping, patient and physician-global assessment, pain and fatigue, and modified-HAQ. Reliability between screening and baseline visits by intra-class correlation, and responsiveness between baseline and 12 weeks by standardised response means were assessed for each outcome. In 108 patients, mean age 54 (standard deviation (SD) 13) years, mean disease duration 8 (SD 7) years, 75% women; disease activity was high at baseline: mean DAS28 5.5 (SD 0.8). Reliability intra-class correlation was 0.83[95% confidence interval: 0.77;0.88] for sleep, 0.81[0.74;0.87] for modified-HAQ, 0.80[0.71;0.86] for fatigue, 0.72[0.62;0.80] for physician-global assessment, 0.66[0.54;076] for coping, 0.65[0.53;0.75] for pain and 0.63[0.50;0.73] for patient-global assessment. Responsiveness standardised response means was 1.65[1.32;2.10] for physician-global assessment, 1.37[1.09;1.73] for pain, 1.36[1.08;1.73] for patient-global assessment, 1.15[0.95;1.41] for fatigue, 0.96[0.70;1.28] for coping, 0.92[0.73;1.15] for sleep and 0.86[0.69;1.07] for modified-HAQ. Numeric rating scales assessing sleep and coping were found to be generally as reliable as 'usual' outcomes in rheumatoid arthritis. Responsiveness was less high, indicating these domains of health may be less accessible to biologic treatment. When assessing the patient's perspective on treatment, it is feasible and valid to measure sleep and coping by numeric rating scales.
Cross-cultural adaptation and validation to Brazil of the Obesity-related Problems Scale

PubMed Central

Brasil, Andreia Mara Brolezzi; Brasil, Fábio; Maurício, Angélica Aparecida; Vilela, Regina Maria

2017-01-01

ABSTRACT Objective To validate a reliable version of the Obesity-related Problems Scale in Portuguese to use it in Brazil. Methods The Obesity-related Problems Scale was translated and transculturally adapted. Later it was simultaneously self-applied with a 12-item version of the World Health Organization Disability Assessment Schedule 2.0 (WHODAS 2.0), to 50 obese patients and 50 non-obese individuals, and applied again to half of them after 14 days. Results The Obesity-related Problems scale was able to differentiate obese from non-obese individuals with higher accuracy than WHODAS 2.0, correlating with this scale and with body mass index. The factor analysis determined a two-dimensional structure, which was confirmed with χ2/df=1.81, SRMR=0.05, and CFI=0.97. The general a coefficient was 0.90 and the inter-item intra-class correlation, in the reapplication, ranged from 0.75 to 0.87. Conclusion The scale proved to be valid and reliable for use in the Brazilian population, without the need to exclude items. PMID:29091155
Diagnostic accuracy of sleep bruxism scoring in absence of audio-video recording: a pilot study.

PubMed

Carra, Maria Clotilde; Huynh, Nelly; Lavigne, Gilles J

2015-03-01

Based on the most recent polysomnographic (PSG) research diagnostic criteria, sleep bruxism is diagnosed when >2 rhythmic masticatory muscle activity (RMMA)/h of sleep are scored on the masseter and/or temporalis muscles. These criteria have not yet been validated for portable PSG systems. This pilot study aimed to assess the diagnostic accuracy of scoring sleep bruxism in absence of audio-video recordings. Ten subjects (mean age 24.7 ± 2.2) with a clinical diagnosis of sleep bruxism spent one night in the sleep laboratory. PSG were performed with a portable system (type 2) while audio-video was recorded. Sleep studies were scored by the same examiner three times: (1) without, (2) with, and (3) without audio-video in order to test the intra-scoring and intra-examiner reliability for RMMA scoring. The RMMA event-by-event concordance rate between scoring without audio-video and with audio-video was 68.3 %. Overall, the RMMA index was overestimated by 23.8 % without audio-video. However, the intra-class correlation coefficient (ICC) between scorings with and without audio-video was good (ICC = 0.91; p < 0.001); the intra-examiner reliability was high (ICC = 0.97; p < 0.001). The clinical diagnosis of sleep bruxism was confirmed in 8/10 subjects based on scoring without audio-video and in 6/10 subjects with audio-video. Although the absence of audio-video recording, the diagnostic accuracy of assessing RMMA with portable PSG systems appeared to remain good, supporting their use for both research and clinical purposes. However, the risk of moderate overestimation in absence of audio-video must be taken into account.
Cross-cultural adaptation and validation of the Peripheral Artery Questionnaire: Korean version for patients with peripheral vascular diseases.

PubMed

Lee, Ji Hyun; Cho, Kyoung Im; Spertus, John; Kim, Seong Man

2012-08-01

The Peripheral Artery Questionnaire (PAQ), as developed in US English, is a validated scale to evaluate the health status of patients with peripheral artery disease (PAD). The aim of this study was to translate the PAQ into Korean and to evaluate its reliability and validity. A multi-step process of forward-translation, reconciliation, consultation with the developer, back-translation and proofreading was conducted. The test-retest reliability was evaluated at a 2-week interval using the intra-class correlation coefficient (ICC). The validity was assessed by identifying associations between Korean PAQ (KPAQ) scores and Korean Health Assessment Questionnaire (KHAQ) scores. A total of 100 PAD patients were enrolled: 63 without and 37 with severe claudication. The reliability of the KPAQ was adequate, with an ICC of 0.71. There were strong correlations between KPAQ's subscales. Cronbach's alpha for the summary score was 0.94, indicating good internal consistency and congruence with the original US version. The validity was supported by a significant correlation between the total KHAQ score and KPAQ physical function, stability, symptom, social limitation and quality of life scores (r = -0.24 to -0.90; p < 0.001) as well as between the KHAQ walking subscale and the KPAQ physical function score (r = -0.55, p < 0.001). Our results indicate that the KPAQ is a reliable, valid instrument to evaluate the health status of Korean patients with PAD.
Psychometric properties of the Iranian interview-administered version of the World Health Organization's Quality of Life Questionnaire (WHOQOL-BREF): a population-based study.

PubMed

Nedjat, Saharnaz; Montazeri, Ali; Holakouie, Kourosh; Mohammad, Kazem; Majdzadeh, Reza

2008-03-21

The objective of the current study was to translate and validate the Iranian version of the WHOQOL-BREF. A forward-backward translation procedure was followed to develop the Iranian version of the questionnaire. A stratified random sample of individuals aged 18 and over completed the questionnaire in Tehran, Iran. Psychometric properties of the instrument including reliability (internal consistency, and test-retest analysis), validity (known groups' comparison and convergent validity), and items' correlation with their hypothesized domains were assessed. In all 1164 individuals entered into the study. The mean age of the participants was 36.6 (SD = 13.2) years, and the mean years of their formal education was 10.7 (SD = 4.4). In general the questionnaire received well and all domains met the minimum reliability standards (Cronbach's alpha and intra-class correlation > 0.7), except for social relationships (alpha = 0.55). Performing known groups' comparison analysis, the results indicated that the questionnaire discriminated well between subgroups of the study samples differing in their health status. Since the WHOQOL-BREF demonstrated statistically significant correlation with the Iranian version of the SF-36 as expected, the convergent validity of the questionnaire was found to be desirable. Correlation matrix also showed satisfactory results in all domains except for social relationships. This study has provided some preliminary evidence of the reliability and validity of the WHOQOL-BREF to be used in Iran, though further research is required to challenge the problems of reliability in one of the dimensions and the instrument's factor structure.
Stability of person ability measures in people with acquired brain injury in the use of everyday technology: the test-retest reliability of the Management of Everyday Technology Assessment (META).

PubMed

Malinowsky, Camilla; Kassberg, Ann-Charlotte; Larsson-Lund, Maria; Kottorp, Anders

2016-01-01

To evaluate the test-retest reliability of the Management of Everyday Technology Assessment (META) in a sample of people with acquired brain injury (ABI). The META was administered twice within a two-week period to 25 people with ABI. A Rasch measurement model was used to convert the META ordinal raw scores into equal-interval linear measures of each participant's ability to manage everyday technology (ET). Test-retest reliability of the stability of the person ability measures in the META was examined by a standardized difference Z-test and an intra-class correlations analysis (ICC 1). The results showed that the paired person ability measures generated from the META were stable over the test-retest period for 22 of the 25 subjects. The ICC 1 correlation was 0.63, which indicates good overall reliability. The META demonstrated acceptable test-retest reliability in a sample of people with ABI. The results illustrate the importance of using sufficiently challenging ETs (relative to a person's abilities) to generate stable META measurements over time. Implications for Rehabilitation The findings add evidence regarding the test-retest reliability of the person ability measures generated from the observation assessment META in a sample of people with ABI. The META might support professionals in the evaluation of interventions that are designed to improve clients' performance of activities including the ability to manage ET.
COMFORT scale: a reliable and valid method to measure the amount of stress of ventilated preterm infants.

PubMed

Wielenga, J M; De Vos, R; de Leeuw, R; De Haan, R J

2004-01-01

Assessment of clinimetric properties and diagnostic quality of a stress measurement scale (COMFORT scale). Sample of an open population. Neonatology department (Neonatal Intensive Care Unit), Academic Medical Centre/Emma Children's Hospital, Amsterdam, The Netherlands. One clinical expert and 9 observers observed ventilated premature born babies simultaneously. Criterion validity was assessed by correlating the COMFORT scale with the clinical judgment regarding the amount of stress. Interobserver reliability was assessed on the clinical judgment as well as on the COMFORT scale. Diagnostic qualities were evaluated with a ROC curve. On 19 ventilated prematurely born babies (mean gestational age 30 weeks, mean birth weight 1385 gm), one clinical expert and 9 observers made 30 paired observations. The criterion validity of the COMFORT scale was good (Pearson's r of 0.84). The interobserver reliability of the clinical judgment was very good (weighted Kappa 0.84). The interobserver reliability of each item varied from good to almost perfect (weighted Kappa of 0.64 for muscle tone to 1.00 on heart rate). The reliability of the total COMFORT scale score was satisfying (intra-class correlation coefficient of 0.94). The diagnostic quality of the COMFORT scale was excellent, at a cut-off point of 20 the sensitivity was 100 percent, the specificity was 77 percent, and the area under the curve (AUC) of 0.95. In this first evaluation, the COMFORT scale appears to be a valid and reliable measurement tool to assess the stress of ventilated prematurely born babies.
Reliability and validity of the multimedia activity recall in children and adults (MARCA) in people with chronic obstructive pulmonary disease.

PubMed

Hunt, Toby; Williams, Marie T; Olds, Tim S

2013-01-01

To determine the reliability and validity of the Multimedia Activity Recall for Children and Adults (MARCA) in people with chronic obstructive pulmonary disease (COPD). People with COPD and their carers completed the Multimedia Activity Recall for Children and Adults (MARCA) for four, 24-hour periods (including test-retest of 2 days) while wearing a triaxial accelerometer (Actigraph GT3X+®), a multi-sensor armband (Sensewear Pro3®) and a pedometer (New Lifestyles 1000®). Self reported activity recalls (MARCA) and objective activity monitoring (Accelerometry) were recorded under free-living conditions. 24 couples were included in the analysis (COPD; age 74.4 ± 7.9 yrs, FEV1 54 ± 13% Carer; age 69.6 ± 10.9 yrs, FEV1 99 ± 24%). Not applicable. Test-retest reliability was compared for MARCA activity domains and different energy expenditure zones. Validity was assessed between MARCA-derived physical activity level (in metabolic equivalent of task (MET) per minute), duration of moderate to vigorous physical activity (min) and related data from the objective measurement devices. Analysis included intra-class correlation coefficients (ICC), Bland-Altman analyses, paired t-tests (p) and Spearman's rank correlation coefficients (rs). Reliability between occasions of recall for all activity domains was uniformly high, with test-retest correlations consistently >0.9. Validity correlations were moderate to strong (rs = 0.43-0.80) across all comparisons. The MARCA yields comparable PAL estimates and slightly higher moderate to vigorous physical activity (MVPA) estimates. In older adults with chronic illness, the MARCA is a valid and reliable tool for capturing not only the time and energy expenditure associated with physical and sedentary activities but also information on the types of activities.
Validity and Reliability of Persian Version of HIV/AIDS Related Stigma Scale for People Living With HIV/AIDS in Iran.

PubMed

Pourmarzi, Davoud; Khoramirad, Ashraf; Ahmari Tehran, Hoda; Abedini, Zahra

2015-11-01

To assess the perceived HIV/AIDS related stigma a comprehensive and well developed stigma instrument is necessary. This study aimed to assess validity and reliability of the Persian version of HIV/AIDS related stigma scale which was developed by Kang et al for people living with HIV/AIDS in Iran. Thescale was forward translatedby two bilingual academic members then both translations were discussed by expert team. Back-translation was done by two other bilingual translators then we carried out discussion with both of them. To evaluate understandability the scale was administered to 10 Persons Living with HIV/AIDS (PLWHA). Final Persian version was administered to 80 PLWHA in Qom, Iran in 2014. Test-retest reliability was assessed in a sample of 20 PLWHA after a week by intra-class correlation coefficient (ICC). Cronbach's alpha coefficient for overall scale was 0.85. Also Cronbach's alpha coefficients for the five subscales were as follows: social rejection (9 items, α = 0.84), negative self-worth (4 items, α = 0.70), perceived interpersonal insecurity (2 items, α = 0.57), financial insecurity (3 items, α = 0.70), discretionary disclosure (2 items, α = 0.83). Test-retest reliability was also approved with ICC = 0.78. Correlation between items and their hypothesized subscale is greater than 0.5. Correlation between an item and its own subscale was significantly higher than its correlation with other subscales. This study demonstrate that the Persian version of HIV/AIDS related stigma scale is valid and reliable to assess HIV/AIDS related stigma perceived by people living whit HIV/AIDS in Iran.
Validity and Reliability of Persian Version of HIV/AIDS Related Stigma Scale for People Living With HIV/AIDS in Iran

PubMed Central

Pourmarzi, Davoud; Khoramirad, Ashraf; Ahmari Tehran, Hoda; Abedini, Zahra

2015-01-01

Objective: To assess the perceived HIV/AIDS related stigma a comprehensive and well developed stigma instrument is necessary. This study aimed to assess validity and reliability of the Persian version of HIV/AIDS related stigma scale which was developed by Kang et al for people living with HIV/AIDS in Iran. Materials and methods: Thescale was forward translatedby two bilingual academic members then both translations were discussed by expert team. Back-translation was done by two other bilingual translators then we carried out discussion with both of them. To evaluate understandability the scale was administered to 10 Persons Living with HIV/AIDS (PLWHA). Final Persian version was administered to 80 PLWHA in Qom, Iran in 2014. Test–retest reliability was assessed in a sample of 20 PLWHA after a week by intra-class correlation coefficient (ICC). Results: Cronbach’s alpha coefficient for overall scale was 0.85. Also Cronbach’s alpha coefficients for the five subscales were as follows: social rejection (9 items, α = 0.84), negative self-worth (4 items, α = 0.70), perceived interpersonal insecurity (2 items, α = 0.57), financial insecurity (3 items, α = 0.70), discretionary disclosure (2 items, α = 0.83). Test–retest reliability was also approved with ICC = 0.78. Correlation between items and their hypothesized subscale is greater than 0.5. Correlation between an item and its own subscale was significantly higher than its correlation with other subscales. Conclusion: This study demonstrate that the Persian version of HIV/AIDS related stigma scale is valid and reliable to assess HIV/AIDS related stigma perceived by people living whit HIV/AIDS in Iran. PMID:27047562
Translation, cross-cultural adaptation, and validation of the Turkish version of the Harris Hip Score.

PubMed

Çelik, Derya; Can, Canan; Aslan, Yasemin; Ceylan, Hasan Huseyin; Bilsel, Kerem; Ozdincler, Arzu Razak

2014-01-01

The Harris Hip Score (HHS) developed to assess function and pain from the perspective of patients hip pathologies. The purpose of this study was to translate and culturally adapt the HHS into Turkish, and thereby determine the reliability and validity of the translated version. The HHS was translated into Turkish in accordance with the stages recommended by Beaton. The measurement properties of the HHS were tested in 80 patients; 52 males, mean age 51 years (range 21-75 years) suffering from different hip pathologies. The test-retest reliability was tested in 58 patients; 28 males mean age, 52 years (range 30-73 years) after an interval of seven days. The Cronbach's Alpha was used to assess internal consistency and the intra-class correlation coefficient (ICC) was used to estimate the test-retest reliability. Patients were asked to answer the Oxford Hip Score (OHS), the Western Ontario and McMaster Universities Arthritis Index (WOMAC), the VAS and the Short Form-36 (SF-36) for the validity of the estimation. The Turkish version of the HHS showed sufficient internal consistency (Cronbach's alpha,0.70) and test-retest reliability (ICC = 0.91). The correlation coefficients between the HHS, the WOMAC and the OHS were 0.64 and 0.89 respectively. The highest correlations between the HHS and SF-36 were with the physical function scale (r = 0.72), and the lowest correlations were with the mental function scale (r = 0.10). We observed no floor or ceiling effects. The Turkish version of the HHS has sufficient reliability and validity to measure patient-reported outcome for Turkish-speaking individuals with a variety of hip disorders.
The psychometric properties of the cervical nonorganic signs in patients with neck pain: an assessment of pain expression.

PubMed

Lue, Yi-Jing; Chang, Jyh-Jong; Wu, Yuh-Yih; Lin, Rong-Fong; Lu, Yen-Mou

2018-04-01

Neck pain is a common cause of disability. This study investigated the psychometric properties of the cervical nonorganic signs (CNOS), a tool for assessing abnormal illness behaviors in patients with neck pain. The CNOS was administered on patients with neck pain. Reliability and validity analyses were used to evaluate the psychometric properties. Exploratory factor analysis was used to investigate the dimensionality. Correlations with the Short Form-36 were used to investigate the convergent validity. The results supported the reliability (inter-rater reliability intra-class correlation: 0.920), validity (correlated with body pain (|ρ|=0.31) and vitality (|ρ| =0.30), and two-factor dimensionality (χ 2 = 5.904, p= 0.66; χ 2 /df = 0.738; RMSEA< 0.001; CFI = 1.000; TLI = 1.024; SRMR = 0.047) of the scale. The two factors were pain (severe pain) and vitality (poor vitality) expressed by the patients. The CNOS is a reliable and valid instrument for assessing pain and vitality problems. It helps patients to express severe pain and lack of vitality. The rehabilitation discipline could use the scale to understand pain expression and to design proper rehabilitation programs. Implications for Rehabilitation The cervical nonorganic signs has two domains (pain and vitality). The scale is reliable and valid for patients with neck pain. Patients with high scores on the pain domain have severe body pain that may interfere with normal social activities. Clinicians should understand their suffering and try to help them to alleviate the pain.

The Pooling-score (P-score): inter- and intra-rater reliability in endoscopic assessment of the severity of dysphagia.

PubMed

Farneti, D; Fattori, B; Nacci, A; Mancini, V; Simonelli, M; Ruoppolo, G; Genovese, E

2014-04-01

This study evaluated the intra- and inter-rater reliability of the Pooling score (P-score) in clinical endoscopic evaluation of severity of swallowing disorder, considering excess residue in the pharynx and larynx. The score (minimum 4 - maximum 11) is obtained by the sum of the scores given to the site of the bolus, the amount and ability to control residue/bolus pooling, the latter assessed on the basis of cough, raclage, number of dry voluntary or reflex swallowing acts (< 2, 2-5, > 5). Four judges evaluated 30 short films of pharyngeal transit of 10 solid (1/4 of a cracker), 11 creamy (1 tablespoon of jam) and 9 liquid (1 tablespoon of 5 cc of water coloured with methlyene blue, 1 ml in 100 ml) boluses in 23 subjects (10 M/13 F, age from 31 to 76 yrs, mean age 58.56±11.76 years) with different pathologies. The films were randomly distributed on two CDs, which differed in terms of the sequence of the films, and were given to judges (after an explanatory session) at time 0, 24 hours later (time 1) and after 7 days (time 2). The inter- and intra-rater reliability of the P-score was calculated using the intra-class correlation coefficient (ICC; 3,k). The possibility that consistency of boluses could affect the scoring of the films was considered. The ICC for site, amount, management and the P-score total was found to be, respectively, 0.999, 0.997, 1.00 and 0.999. Clinical evaluation of a criterion of severity of a swallowing disorder remains a crucial point in the management of patients with pathologies that predispose to complications. The P-score, derived from static and dynamic parameters, yielded a very high correlation among the scores attributed by the four judges during observations carried out at different times. Bolus consistencies did not affect the outcome of the test: the analysis of variance, performed to verify if the scores attributed by the four judges to the parameters selected, might be influenced by the different consistencies of the boluses, was not significant. These initial data validate the clinical use of the P-score in the management of patients with deglutition disorders by a multidisciplinary team.
Validity and reliability of Persian version of Listening Styles Profile-Revised (LSP- R) in Iranian students.

PubMed

Fatehi, Zahra; Baradaran, Hamid Reza; Asadpour, Mohamad; Rezaeian, Mohsen

2017-01-01

Background: Individuals' listening styles differs based on their characters, professions and situations. This study aimed to assess the validity and reliability of Listening Styles Profile- Revised (LSP- R) in Iranian students. Methods: After translating into Persian, LSP-R was employed in a sample of 240 medical and nursing Persian speaking students in Iran. Statistical analysis was performed to test the reliability and validity of the LSP-R. Results: The study revealed high internal consistency and good test-retest reliability for the Persian version of the questionnaire. The Cronbach's alpha coefficient was 0.72 and intra-class correlation coefficient 0.87. The means for the content validity index and the content validity ratio (CVR) were 0.90 and 0.83, respectively. Exploratory factor analysis (EFA) yielded a four-factor solution accounted for 60.8% of the observed variance. Majority of medical students (73%) as well as majority of nursing students (70%) stated that their listening styles were task-oriented. Conclusion: In general, the study finding suggests that the Persian version of LSP-R is a valid and reliable instrument for assessing listening styles profile in the studied sample.
Reliability and validity of a tool to measure the severity of tongue thrust in children: the Tongue Thrust Rating Scale.

PubMed

Serel Arslan, S; Demir, N; Karaduman, A A

2017-02-01

This study aimed to develop a scale called Tongue Thrust Rating Scale (TTRS), which categorised tongue thrust in children in terms of its severity during swallowing, and to investigate its validity and reliability. The study describes the developmental phase of the TTRS and presented its content and criterion-based validity and interobserver and intra-observer reliability. For content validation, seven experts assessed the steps in the scale over two Delphi rounds. Two physical therapists evaluated videos of 50 children with cerebral palsy (mean age, 57·9 ± 16·8 months), using the TTRS to test criterion-based validity, interobserver and intra-observer reliability. The Karaduman Chewing Performance Scale (KCPS) and Drooling Severity and Frequency Scale (DSFS) were used for criterion-based validity. All the TTRS steps were deemed necessary. The content validity index was 0·857. A very strong positive correlation was found between two examinations by one physical therapist, which indicated intra-observer reliability (r = 0·938, P < 0·001). A very strong positive correlation was also found between the TTRS scores of two physical therapists, indicating interobserver reliability (r = 0·892, P < 0·001). There was also a strong positive correlation between the TTRS and KCPS (r = 0·724, P < 0·001) and a very strong positive correlation between the TTRS scores and DSFS (r = 0·822 and r = 0·755; P < 0·001). These results demonstrated the criterion-based validity of the TTRS. The TTRS is a valid, reliable and clinically easy-to-use functional instrument to document the severity of tongue thrust in children. © 2016 John Wiley & Sons Ltd.
Absolute Reliability and Concurrent Validity of Hand Held Dynamometry and Isokinetic Dynamometry in the Hip, Knee and Ankle Joint: Systematic Review and Meta-analysis

PubMed Central

Chamorro, Claudio; Armijo-Olivo, Susan; De la Fuente, Carlos; Fuentes, Javiera; Javier Chirosa, Luis

2017-01-01

Abstract The purpose of the study is to establish absolute reliability and concurrent validity between hand-held dynamometers (HHDs) and isokinetic dynamometers (IDs) in lower extremity peak torque assessment. Medline, Embase, CINAHL databases were searched for studies related to psychometric properties in muscle dynamometry. Studies considering standard error of measurement SEM (%) or limit of agreement LOA (%) expressed as percentage of the mean, were considered to establish absolute reliability while studies using intra-class correlation coefficient (ICC) were considered to establish concurrent validity between dynamometers. In total, 17 studies were included in the meta-analysis. The COSMIN checklist classified them between fair and poor. Using HHDs, knee extension LOA (%) was 33.59%, 95% confidence interval (CI) 23.91 to 43.26 and ankle plantar flexion LOA (%) was 48.87%, CI 35.19 to 62.56. Using IDs, hip adduction and extension; knee flexion and extension; and ankle dorsiflexion showed LOA (%) under 15%. Lower hip, knee, and ankle LOA (%) were obtained using an ID compared to HHD. ICC between devices ranged between 0.62, CI (0.37 to 0.87) for ankle dorsiflexion to 0.94, IC (0.91to 0.98) for hip adduction. Very high correlation were found for hip adductors and hip flexors and moderate correlations for knee flexors/extensors and ankle plantar/dorsiflexors. PMID:29071305
Testing the reliability of the Fall Risk Screening Tool in an elderly ambulatory population.

PubMed

Fielding, Susan J; McKay, Michael; Hyrkas, Kristiina

2013-11-01

To identify and test the reliability of a fall risk screening tool in an ambulatory outpatient clinic. The Fall Risk Screening Tool (Albert Lea Medical Center, MN, USA) was scripted for an interview format. Two interviewers separately screened a convenience sample of 111 patients (age ≥ 65 years) in an ambulatory outpatient clinic in a northeastern US city. The interviewers' scoring of fall risk categories was similar. There was good internal consistency (Cronbach's α = 0.834-0.889) and inter-rater reliability [intra-class correlation coefficients (ICC) = 0.824-0.881] for total, Risk Factor and Client's Health Status subscales. The Physical Environment scores indicated acceptable internal consistency (Cronbach's α = 0.742) and adequate reliability (ICC = 0.688). Two Physical Environment items (furniture and medical equipment condition) had low reliabilities [Kappa (K) = 0.323, P = 0.08; K = -0.078, P = 0.648), respectively. The scripted Fall Risk Screening Tool demonstrated good reliability in this sample. Rewording two Physical Environment items will be considered. A reliable instrument such as the scripted Fall Risk Screening Tool provides a standardised assessment for identifying high fall risk patients. This tool is especially useful because it assesses personal, behavioural and environmental factors specific to community-dwelling patients; the interview format also facilitates patient-provider interaction. © 2013 John Wiley & Sons Ltd.
Reliability of a computer and Internet survey (Computer User Profile) used by adults with and without traumatic brain injury (TBI).

PubMed

Kilov, Andrea M; Togher, Leanne; Power, Emma

2015-01-01

To determine test-re-test reliability of the 'Computer User Profile' (CUP) in people with and without TBI. The CUP was administered on two occasions to people with and without TBI. The CUP investigated the nature and frequency of participants' computer and Internet use. Intra-class correlation coefficients and kappa coefficients were conducted to measure reliability of individual CUP items. Descriptive statistics were used to summarize content of responses. Sixteen adults with TBI and 40 adults without TBI were included in the study. All participants were reliable in reporting demographic information, frequency of social communication and leisure activities and computer/Internet habits and usage. Adults with TBI were reliable in 77% of their responses to survey items. Adults without TBI were reliable in 88% of their responses to survey items. The CUP was practical and valuable in capturing information about social, leisure, communication and computer/Internet habits of people with and without TBI. Adults without TBI scored more items with satisfactory reliability overall in their surveys. Future studies may include larger samples and could also include an exploration of how people with/without TBI use other digital communication technologies. This may provide further information on determining technology readiness for people with TBI in therapy programmes.
Observed intra-cluster correlation coefficients in a cluster survey sample of patient encounters in general practice in Australia

PubMed Central

Knox, Stephanie A; Chondros, Patty

2004-01-01

Background Cluster sample study designs are cost effective, however cluster samples violate the simple random sample assumption of independence of observations. Failure to account for the intra-cluster correlation of observations when sampling through clusters may lead to an under-powered study. Researchers therefore need estimates of intra-cluster correlation for a range of outcomes to calculate sample size. We report intra-cluster correlation coefficients observed within a large-scale cross-sectional study of general practice in Australia, where the general practitioner (GP) was the primary sampling unit and the patient encounter was the unit of inference. Methods Each year the Bettering the Evaluation and Care of Health (BEACH) study recruits a random sample of approximately 1,000 GPs across Australia. Each GP completes details of 100 consecutive patient encounters. Intra-cluster correlation coefficients were estimated for patient demographics, morbidity managed and treatments received. Intra-cluster correlation coefficients were estimated for descriptive outcomes and for associations between outcomes and predictors and were compared across two independent samples of GPs drawn three years apart. Results Between April 1999 and March 2000, a random sample of 1,047 Australian general practitioners recorded details of 104,700 patient encounters. Intra-cluster correlation coefficients for patient demographics ranged from 0.055 for patient sex to 0.451 for language spoken at home. Intra-cluster correlations for morbidity variables ranged from 0.005 for the management of eye problems to 0.059 for management of psychological problems. Intra-cluster correlation for the association between two variables was smaller than the descriptive intra-cluster correlation of each variable. When compared with the April 2002 to March 2003 sample (1,008 GPs) the estimated intra-cluster correlation coefficients were found to be consistent across samples. Conclusions The demonstrated precision and reliability of the estimated intra-cluster correlations indicate that these coefficients will be useful for calculating sample sizes in future general practice surveys that use the GP as the primary sampling unit. PMID:15613248
Inter- and intra- observer reliability of risk assessment of repetitive work without an explicit method.

PubMed

Eliasson, Kristina; Palm, Peter; Nyman, Teresia; Forsman, Mikael

2017-07-01

A common way to conduct practical risk assessments is to observe a job and report the observed long term risks for musculoskeletal disorders. The aim of this study was to evaluate the inter- and intra-observer reliability of ergonomists' risk assessments without the support of an explicit risk assessment method. Twenty-one experienced ergonomists assessed the risk level (low, moderate, high risk) of eight upper body regions, as well as the global risk of 10 video recorded work tasks. Intra-observer reliability was assessed by having nine of the ergonomists repeat the procedure at least three weeks after the first assessment. The ergonomists made their risk assessment based on his/her experience and knowledge. The statistical parameters of reliability included agreement in %, kappa, linearly weighted kappa, intraclass correlation and Kendall's coefficient of concordance. The average inter-observer agreement of the global risk was 53% and the corresponding weighted kappa (K w ) was 0.32, indicating fair reliability. The intra-observer agreement was 61% and 0.41 (K w ). This study indicates that risk assessments of the upper body, without the use of an explicit observational method, have non-acceptable reliability. It is therefore recommended to use systematic risk assessment methods to a higher degree. Copyright © 2017 The Authors. Published by Elsevier Ltd.. All rights reserved.
Intra and interrater reliability of spinal sagittal curves and mobility using pocket goniometer IncliMed® in healthy subjects.

PubMed

Alderighi, Marzia; Ferrari, Raffaello; Maghini, Irene; Del Felice, Alessandra; Masiero, Stefano

2016-11-21

Radiographic examination is the gold standard to evaluate spine curves, but ionising radiations limit routine use. Non-invasive methods, such as skin-surface goniometer (IncliMed®) should be used instead. To evaluate intra- and interrater reliability to assess sagittal curves and mobility of the spine with IncliMed®. a reliability study on agonistic football players. Thoracic kyphosis, lumbar lordosis and mobility of the spine were assessed by IncliMed®. Measurements were repeated twice by each examiner during the same session with between-rater blinding. Intrarater and interrater reliability were measured by Intraclass Correlation Coefficient (ICC), 95% Confidence Interval (CI 95%) and Standard Error of Measurement (SEM). Thirty-four healthy female football players (19.17 ± 4.52 years) were enrolled. Statistical results showed high intrarater (0.805-0.923) and interrater (0.701-0.886) reliability (ICC > 0.8). The obtained intra- and interrater SEM were low, with overall absolute intrarater values between 1.39° and 2.76° and overall interrater values between 1.71° and 4.25°. IncliMed® provides high intra- and interrater reliability in healthy subjects, with limited Standard Error of Measurement. These results encourage its use in clinical practice and scientific research.
Indices of Paraspinal Muscles Degeneration: Reliability and Association With Facet Joint Osteoarthritis: Feasibility Study.

PubMed

Kalichman, Leonid; Klindukhov, Alexander; Li, Ling; Linov, Lina

2016-11-01

A reliability and cross-sectional observational study. To introduce a scoring system for visible fat infiltration in paraspinal muscles; to evaluate intertester and intratester reliability of this system and its relationship with indices of muscle density; to evaluate the association between indices of paraspinal muscle degeneration and facet joint osteoarthritis. Current evidence suggests that the paraspinal muscles degeneration is associated with low back pain, facet joint osteoarthritis, spondylolisthesis, and degenerative disc disease. However, the evaluation of paraspinal muscles on computed tomography is not radiological routine, probably because of absence of simple and reliable indices of paraspinal degeneration. One hundred fifty consecutive computed tomography scans of the lower back (N=75) or abdomen (N=75) were evaluated. Mean radiographic density (in Hounsfield units) and SD of the density of multifidus and erector spinae were evaluated at the L4-L5 spinal level. A new index of muscle degeneration, radiographic density ratio=muscle density/SD of density, was calculated. To evaluate the visible fat infiltration in paraspinal muscles, we proposed a 3-graded scoring system. The prevalence of facet joint osteoarthritis was also evaluated. Intraclass correlation and κ statistics were used to evaluate inter-rater and intra-rater reliability. Logistic regression examined the association between paraspinal muscle indices and facet joint osteoarthritis. Intra-rater reliability for fat infiltration score (κ) ranged between 0.87 and 0.92; inter-rater reliability between 0.70 and 0.81. Intra-rater reliability (intraclass correlation) for mean density of paraspinal muscles ranged between 0.96 and 0.99, inter-rater reliability between 0.95 and 0.99; SD intra-rater reliability ranged between 0.82 and 0.91, inter-rater reliability between 0.80 and 0.89. Significant associations (P<0.01) were found between facet joint osteoarthritis, fat infiltration score, and radiographic density ratio. Two suggested indices of paraspinal muscle degeneration showed excellent reliability and were significantly associated with facet joint osteoarthritis. Additional studies are needed to evaluate the associations with other spinal degeneration features and low back pain.
Reliability of a Malay-translated questionnaire for use in a hand-arm vibration syndrome study in Malaysia.

PubMed

Su, T A; Hoe, V C W

2008-12-01

Validity and reliability of the information relating to hand-transmitted vibration exposure and vibration-related health outcome are very important for case finding in hand-arm vibration syndrome (HAVS) studies. In a local HAVS study among a group of construction workers in Kuala Lumpur, Malaysia, a questionnaire translated into Malay was created based on the Hand-transmitted Vibration Health Surveillance--Initial Questionnaire and Clinical Assessment, from Vibration Injury Network. This study was conducted to determine the reliability of standardised questions in the questionnaire used in the study. 15 subjects were selected randomly from the sampling frame of the HAVS study. Test-retest reliability was conducted on all items contained in parts 1-6 of the questionnaire and clinical assessment form, with an interval of 13-14 days between the first and second administration. Kappa coefficient and percentage agreement were calculated for all standardised questions. The kappa coefficient and percentage agreement for all standardised questions varied from -0.174 to 1.000 and 66.7 to 100.0 percent, respectively. The kappa coefficient for important questions related to current vibratory tool usage, tingling, numbness and hand grip weakness were 0.714, 0.432, -0.077 and -0.120, respectively, while the percentage agreement for current vibratory tool usage, finger colour change, tingling, numbness and hand grip weakness were 85.7 percent, 92.8 percent, 79.5 percent, 85.7 percent and 71.4 percent, respectively. Intra-rater reliability on the extent of vibration exposure was good, with the intra-class correlation coefficient (95 percent confidence interval) ranging from 0.786 (0.334-0.931) to 0.975 (0.923-0.992). Critical questions on vascular, neurological and musculoskeletal symptoms of HAVS were found to be reliable. The history on the extent of vibration exposure revealed good reliability when explored by the investigator alone. This questionnaire is considered reliable to be used in the study of HAVS among construction workers working in a construction site.
Braden scale (ALB) for assessing pressure ulcer risk in hospital patients: A validity and reliability study.

PubMed

Chen, Hong-Lin; Cao, Ying-Juan; Zhang, Wei; Wang, Jing; Huai, Bao-Sha

2017-02-01

The inter-rater reliability of Braden Scale is not so good. We modified the Braden(ALB) scale by defining nutrition subscale based on serum albumin, then assessed it's the validity and reliability in hospital patients. We designed a retrospective study for validity analysis, and a prospective study for reliability analysis. Receiver operating curve (ROC) and area under the curve (AUC) were used to evaluate the predictive validity. Intra-class correlation coefficient (ICC) was used to investigate the inter-rater reliability. Two thousand five hundred twenty-five patients were included for validity analysis, 76 patients (3.0%) developed pressure ulcer. Positive correlation was found between serum albumin and nutrition score in Braden scale (Spearman's coefficient 0.2203, P<0.0001). The AUCs for Braden scale and Braden(ALB) scale predicting pressure ulcer risk were 0.813 (95% CI 0.797-0.828; P<0.0001), and 0.859 (95% CI 0.845-0.872; P<0.0001), respectively. The Braden(ALB) scale was even more valid than the Braden scale (z=1.860, P=0.0628). In different age subgroups, the Braden(ALB) scale seems also more valid than the original Braden scale, but no statistically significant differences were found (P>0.05). The inter-rater reliability study showed the ICC-value for nutrition increased 45.9%, and increased 4.3% for total score. The Braden(ALB) scale has similar validity compared with the original Braden scale for in hospital patients. However, the inter-rater reliability was significantly increased. Copyright © 2016 Elsevier Inc. All rights reserved.
Exercise self-efficacy in persons with spinal cord injury: psychometric properties of the Dutch translation of the Exercise Self-Efficacy Scale.

PubMed

Nooijen, Carla F J; Post, Marcel W M; Spijkerman, Dorien C M; Bergen, Michael P; Stam, Henk J; van den Berg-Emons, Rita J G

2013-04-01

To assess the reliability and validity of the Dutch version of the exercise self-efficacy scale (ESES) in persons with spinal cord injury. This is the first independent study of ESES psychometric properties, and the first report on ESES test-retest reliability. A total of 53 Dutch persons with spinal cord injury. Subjects completed the Dutch ESES twice, with 2 weeks between (ESES_1 and ESES_2). Subjects also completed the General self-efficacy scale (GSE), and a questionnaire regarding demographic characteristics and lesion characteristics. Psychometric properties of the Dutch translation of the ESES were assessed and compared with those of the original English-language version. The Dutch ESES was found to have good internal consistency (Cronbach's α for ESES_1 = 0.90, ESES_2 = 0.88). Test-retest reliability was adequate (intra-class correlation coefficient = 0.81, 95% confidence interval 0.70-0.89). For validity, a moderate, statistically significant correlation was found between ESES and the GSE (Spearman's ρ ESES_1 = 0.52, ESES_2 = 0.66, p < 0.01). Furthermore, the psychometric properties of the Dutch ESES were found to be similar to those of the original English version. The results of this study support the use of the ESES as a reliable and valid measure of exercise self-efficacy.
The Pareidolia Test: A Simple Neuropsychological Test Measuring Visual Hallucination-Like Illusions.

PubMed

Mamiya, Yasuyuki; Nishio, Yoshiyuki; Watanabe, Hiroyuki; Yokoi, Kayoko; Uchiyama, Makoto; Baba, Toru; Iizuka, Osamu; Kanno, Shigenori; Kamimura, Naoto; Kazui, Hiroaki; Hashimoto, Mamoru; Ikeda, Manabu; Takeshita, Chieko; Shimomura, Tatsuo; Mori, Etsuro

2016-01-01

Visual hallucinations are a core clinical feature of dementia with Lewy bodies (DLB), and this symptom is important in the differential diagnosis and prediction of treatment response. The pareidolia test is a tool that evokes visual hallucination-like illusions, and these illusions may be a surrogate marker of visual hallucinations in DLB. We created a simplified version of the pareidolia test and examined its validity and reliability to establish the clinical utility of this test. The pareidolia test was administered to 52 patients with DLB, 52 patients with Alzheimer's disease (AD) and 20 healthy controls (HCs). We assessed the test-retest/inter-rater reliability using the intra-class correlation coefficient (ICC) and the concurrent validity using the Neuropsychiatric Inventory (NPI) hallucinations score as a reference. A receiver operating characteristic (ROC) analysis was used to evaluate the sensitivity and specificity of the pareidolia test to differentiate DLB from AD and HCs. The pareidolia test required approximately 15 minutes to administer, exhibited good test-retest/inter-rater reliability (ICC of 0.82), and moderately correlated with the NPI hallucinations score (rs = 0.42). Using an optimal cut-off score set according to the ROC analysis, and the pareidolia test differentiated DLB from AD with a sensitivity of 81% and a specificity of 92%. Our study suggests that the simplified version of the pareidolia test is a valid and reliable surrogate marker of visual hallucinations in DLB.
Methodology for Developing a New EFNEP Food and Physical Activity Behaviors Questionnaire.

PubMed

Murray, Erin K; Auld, Garry; Baker, Susan S; Barale, Karen; Franck, Karen; Khan, Tarana; Palmer-Keenan, Debra; Walsh, Jennifer

2017-10-01

Research methods are described for developing a food and physical activity behaviors questionnaire for the Expanded Food and Nutrition Education Program (EFNEP), a US Department of Agriculture nutrition education program serving low-income families. Mixed-methods observational study. The questionnaire will include 5 domains: (1) diet quality, (2) physical activity, (3) food safety, (4) food security, and (5) food resource management. A 5-stage process will be used to assess the questionnaire's test-retest reliability and content, face, and construct validity. Research teams across the US will coordinate questionnaire development and testing nationally. Convenience samples of low-income EFNEP, or EFNEP-eligible, adult participants across the US. A 5-stage process: (1) prioritize domain concepts to evaluate (2) question generation and content analysis panel, (3) question pretesting using cognitive interviews, (4) test-retest reliability assessment, and (5) construct validity testing. A nationally tested valid and reliable food and physical activity behaviors questionnaire for low-income adults to evaluate EFNEP's effectiveness. Cognitive interviews will be summarized to identify themes and dominant trends. Paired t tests (P ≤ .05) and Spearman and intra-class correlation coefficients (r > .5) will be conducted to assess reliability. Construct validity will be assessed using Wilcoxon t test (P ≤ .05), Spearman correlations, and Bland-Altman plots. Copyright © 2017 Society for Nutrition Education and Behavior. Published by Elsevier Inc. All rights reserved.
Health related quality of life in disorders of defecation: the Defecation Disorder List

PubMed Central

Voskuijl, W; van der Zaag-Loon..., H J; Ketel, I; Grootenhuis, M; Derkx, B; Benninga, M

2004-01-01

Background: Constipation and encopresis frequently cause problems with respect to emotional wellbeing, and social and family life. Instruments to measure Health Related Quality of Life (HRQoL) in these disorders are not available. Methods: A disease specific HRQoL instrument, the "Defecation Disorder List" (DDL) for children with constipation or functional non-retentive faecal soiling (FNRFS) was developed using accepted guidelines. For each phase of the process, different samples of patients were used. The final phase of development included 27 children. Reliability was assessed in two ways: internal consistency of domains with Cronbach's alpha, and test-retest reliability with intra-class correlation coefficients (ICC). To assess validity, comparable items and domains were correlated with Tacqol, a generic HRQoL instrument for children (TNO-AZL). Results: In the final phase of the development, 27 children completed the instrument. It consisted of 37 items in four domains. The response rate was 96%. Reliability was good for all domains, with Cronbach's alpha values ranging from 0.61 to 0.76. Measures of test-retest stability were good for all four domains with ICCs ranging from 0.82 to 0.92. Validity based on comparison with the Tacqol instrument was moderate. Conclusion: The DDL is promising as a measure of HRQoL in childhood defecation disorders. PMID:15557046
Reliability of the Pictorial Scale of Perceived Movement Skill Competence in 2 Diverse Samples of Young Children.

PubMed

Barnett, Lisa M; Robinson, Leah E; Webster, E Kipling; Ridgers, Nicola D

2015-08-01

The purpose was to determine the reliability of an instrument designed to assess young children's perceived movement skill competence in 2 diverse samples. A pictorial instrument assessed 12 perceived Fundamental Movement Skills (FMS) based on the Test of Gross Motor Development 2nd edition. Intra-Class Correlations (ICC) and internal consistency analyses were conducted. Paired sample t tests assessed change in mean perceived skill scores. Bivariate correlations between the intertrial difference and the mean of the trials explored proportional bias. Sample 1 (S1) were culturally diverse Australian children (n = 111; 52% boys) aged 5 to 8 years (mean = 6.4, SD = 1.0) with educated parents. Sample 2 (S2) were racially diverse and socioeconomically disadvantaged American children (n = 110; 57% boys) aged 5 to 10 years (mean = 6.8, SD = 1.1). For all children, the internal consistency for 12 FMS was acceptable (S1 = 0.72, 0.75, S2 = 0.66, 0.67). ICCs were higher in S1 (0.73) than S2 (0.50). Mean changes between trials were small. There was little evidence of proportional bias. Lower values in S2 may be due to differences in study demographic and execution. While the instrument demonstrated reliability/internal consistency, further work is recommended in diverse samples.
Development and psychometric testing of a trans-professional evidence-based practice profile questionnaire.

PubMed

McEvoy, Maureen Patricia; Williams, Marie T; Olds, Timothy Stephen

2010-01-01

Previous survey tools operationalising knowledge, attitudes or beliefs about evidence-based practice (EBP) have shortcomings in content, psychometric properties and target audience. This study developed and psychometrically assessed a self-report trans-professional questionnaire to describe an EBP profile. Sixty-six items were collated from existing EBP questionnaires and administered to 526 academics and students from health and non-health backgrounds. Principal component factor analysis revealed the presence of five factors (Relevance, Terminology, Confidence, Practice and Sympathy). Following expert panel review and pilot testing, the 58-item final questionnaire was disseminated to 105 subjects on two occasions. Test-retest and internal reliability were quantified using intra-class correlation coefficients (ICCs) and Cronbach's alpha, convergent validity against a commonly used EBP questionnaire by Pearson's correlation coefficient and discriminative validity via analysis of variance (ANOVA) based on exposure to EBP training. The final questionnaire demonstrated acceptable internal consistency (Cronbach's alpha 0.96), test-retest reliability (ICCs range 0.77-0.94) and convergent validity (Practice 0.66, Confidence 0.80 and Sympathy 0.54). Three factors (Relevance, Terminology and Confidence) distinguished EBP exposure groups (ANOVA p < 0.001-0.004). The evidence-based practice profile (EBP(2)) questionnaire is a reliable instrument with the ability to discriminate for three factors, between respondents with differing EBP exposures.
The Computerized Perceptual Motor Skills Assessment: A new visual perceptual motor skills evaluation tool for children in early elementary grades.

PubMed

Howe, Tsu-Hsin; Chen, Hao-Ling; Lee, Candy Chieh; Chen, Ying-Dar; Wang, Tien-Ni

2017-10-01

Visual perceptual motor skills have been proposed as underlying courses of handwriting difficulties. However, there is no evaluation tool currently available to assess these skills comprehensively and to serve as a sensitive measure. The purpose of this study was to validate the Computerized Perceptual Motor Skills Assessment (CPMSA), a newly developed evaluation tool for children in early elementary grades. Its test-retest reliability, concurrent validity, discriminant validity, and responsiveness were examined in 43 typically developing children and 26 children with handwriting difficulty. The CPMSA demonstrated excellent reliability across all subtests with intra-class correlation coefficients (ICCs)≥0.80. Significant moderate correlations between the domains of the CPMSA and corresponding gold standards including Beery VMI, the TVPS-3, and the eye-hand coordination subtest of the DTVP-2 demonstrated good concurrent validity. In addition, the CPMSA showed evidence of discriminant validity in samples of children with and without handwriting difficulty. This article provides evidence in support of the CPMSA. The CPMSA is a reliable, valid, and promising measure of visual perceptual motor skills for children in early elementary grades. Directions for future study and improvements to the assessment are discussed. Copyright © 2017. Published by Elsevier Ltd.
Cross-parent reliability in rating ASD markers in infants.

PubMed

Ben-Sasson, Ayelet; Amit-Ben-Simhon, Hemda; Meyer, Sonya

2015-06-01

To investigate the congruence and discrepancies between mother and father reports of early autism spectrum disorders (ASD) markers. Mothers (n = 80) and fathers (n = 78) of 12-month-old infants (55% boys) completed the first year inventory (FYI), an ASD norm-referenced screening questionnaire. Mothers also completed the Infant Toddler Social Emotional Assessment (ITSEA). There were significant and moderate intra-class correlations between mother and father reports for most FYI factors. Fathers' median FYI social-communication domain score was almost twice that of mothers. Mann-Whitney tests indicated that fathers rated their child significantly higher than mothers on the four FYI social-communication factors and on the sensory processing factor. Linear weighted kappa analyses indicated poor agreement on gaze-related and reactivity FYI items. FYI social-communication and sensory-regulatory factors showed significant correlations with corresponding ITSEA scores. Social-communication markers pose a greater challenge for consistent report across parents than sensory-regulatory markers.

Reliability of primary caregivers reports on lifestyle behaviours of European pre-school children: the ToyBox-study.

PubMed

González-Gil, E M; Mouratidou, T; Cardon, G; Androutsos, O; De Bourdeaudhuij, I; Góźdź, M; Usheva, N; Birnbaum, J; Manios, Y; Moreno, L A

2014-08-01

Reliable assessments of health-related behaviours are necessary for accurate evaluation on the efficiency of public health interventions. The aim of the current study was to examine the reliability of a self-administered primary caregivers questionnaire (PCQ) used in the ToyBox-intervention. The questionnaire consisted of six sections addressing sociodemographic and perinatal factors, water and beverages consumption, physical activity, snacking and sedentary behaviours. Parents/caregivers from six countries (Belgium, Bulgaria, Germany, Greece, Poland and Spain) were asked to complete the questionnaire twice within a 2-week interval. A total of 93 questionnaires were collected. Test-retest reliability was assessed using intra-class correlation coefficient (ICC). Reliability of the six questionnaire sections was assessed. A stronger agreement was observed in the questions addressing sociodemographic and perinatal factors as opposed to questions addressing behaviours. Findings showed that 92% of the ToyBox PCQ had a moderate-to-excellent test-retest reliability (defined as ICC values from 0.41 to 1) and less than 8% poor test-retest reliability (ICC < 0.40). Out of the total ICC values, 67% showed good-to-excellent reliability (ICC from 0.61 to 1). We conclude that the PCQ is a reliable tool to assess sociodemographic characteristics, perinatal factors and lifestyle behaviours of pre-school children and their families participating in the ToyBox-intervention. © 2014 World Obesity.
Intra and inter-rater reliability of infrared image analysis of masticatory and upper trapezius muscles in women with and without temporomandibular disorder.

PubMed

Costa, Ana C S; Dibai Filho, Almir V; Packer, Amanda C; Rodrigues-Bigaton, Delaine

2013-01-01

Infrared thermography is an aid tool that can be used to evaluate several pathologies given its efficiency in analyzing the distribution of skin surface temperature. To propose two forms of infrared image analysis of the masticatory and upper trapezius muscles, and to determine the intra and inter-rater reliability of both forms of analysis. Infrared images of masticatory and upper trapezius muscles of 64 female volunteers with and without temporomandibular disorder (TMD) were collected. Two raters performed the infrared image analysis, which occurred in two ways: temperature measurement of the muscle length and in central portion of the muscle. The Intraclass Correlation Coefficient (ICC) was used to determine the intra and inter-rater reliability. The ICC showed excellent intra and inter-rater values for both measurements: temperature measurement of the muscle length (TMD group, intra-rater, ICC ranged from 0.996 to 0.999, inter-rater, ICC ranged from 0.992 to 0.999; control group, intra-rater, ICC ranged from 0.993 to 0.998, inter-rater, ICC ranged from 0.990 to 0.998), and temperature measurement of the central portion of the muscle (TMD group, intra-rater, ICC ranged from 0.981 to 0.998, inter-rater, ICC ranged from 0.971 to 0.998; control group, intra-rater, ICC ranged from 0.887 to 0.996, inter-rater, ICC ranged from 0.852 to 0.996). The results indicated that temperature measurements of the masticatory and upper trapezius muscles carried out by the analysis of the muscle length and central portion yielded excellent intra and inter-rater reliability.
Performance of intraclass correlation coefficient (ICC) as a reliability index under various distributions in scale reliability studies.

PubMed

Mehta, Shraddha; Bastero-Caballero, Rowena F; Sun, Yijun; Zhu, Ray; Murphy, Diane K; Hardas, Bhushan; Koch, Gary

2018-04-29

Many published scale validation studies determine inter-rater reliability using the intra-class correlation coefficient (ICC). However, the use of this statistic must consider its advantages, limitations, and applicability. This paper evaluates how interaction of subject distribution, sample size, and levels of rater disagreement affects ICC and provides an approach for obtaining relevant ICC estimates under suboptimal conditions. Simulation results suggest that for a fixed number of subjects, ICC from the convex distribution is smaller than ICC for the uniform distribution, which in turn is smaller than ICC for the concave distribution. The variance component estimates also show that the dissimilarity of ICC among distributions is attributed to the study design (ie, distribution of subjects) component of subject variability and not the scale quality component of rater error variability. The dependency of ICC on the distribution of subjects makes it difficult to compare results across reliability studies. Hence, it is proposed that reliability studies should be designed using a uniform distribution of subjects because of the standardization it provides for representing objective disagreement. In the absence of uniform distribution, a sampling method is proposed to reduce the non-uniformity. In addition, as expected, high levels of disagreement result in low ICC, and when the type of distribution is fixed, any increase in the number of subjects beyond a moderately large specification such as n = 80 does not have a major impact on ICC. Copyright © 2018 John Wiley & Sons, Ltd.
Validation of EncephalApp, Smartphone-based Stroop Test, for the Diagnosis of Covert Hepatic Encephalopathy

PubMed Central

Bajaj, Jasmohan S; Heuman, Douglas M; Sterling, Richard K; Sanyal, Arun J; Siddiqui, Muhammad; Matherly, Scott; Luketic, Velimir; Stravitz, R Todd; Fuchs, Michael; Thacker, Leroy R; Gilles, HoChong; White, Melanie B; Unser, Ariel; Hovermale, James; Gavis, Edith; Noble, Nicole A; Wade, James B

2014-01-01

Background & Aims Detection of covert hepatic encephalopathy (CHE) is difficult but point of care testing could increase rates of diagnosis. We aimed to validate the ability of the smartphone app EncephalApp, a streamlined version of Stroop App, to detect CHE. We evaluated face validity, test–retest reliability, and external validity. Methods Patients with cirrhosis (n=167; 38% with overt HE [OHE]; mean age, 55 years; mean model for end-stage liver disease score, 12) and controls (n=114) were each given a paper and pencil cognitive battery (standard) along with EncephalApp. EncephalApp has Off and On states; results measured were: OffTime, OnTime, OffTime+OnTime, and number of runs required to complete 5 off and on runs. Thirty-six patients with cirrhosis underwent driving simulation tests, and EncephalApp results were correlated with results. Test–retest reliability was analyzed in a subgroup of patients. The test was performed before and after transjugular intra-hepatic portosystemic shunt placement, before and after correction for hyponatremia, to determine external validity. Results All patients with cirrhosis performed worse on paper and pencil and EncephalApp tests than controls. Patients with cirrhosis and OHE performed worse than those without OHE. Age-dependent EncephalApp cut-offs (younger or older than 45 years) were set. An OffTime+OnTime value of >190 seconds identified all patients with CHE with an area under the receiver operator characteristic (AUROC) value of 0.91; the AUROC value was 0.88 for diagnosis of CHE in those without OHE. EncephalApp times correlated with crashes and illegal turns in driving simulation tests. Test–retest reliability was high (intra-class coefficient, 0.83) among 30 patients retested 1–3 months apart. OffTime+OnTime increased significantly (206 vs 255, P=.007) among 10 patients retested 33±7 days after transjugular intra-hepatic portosystemic shunt placement. OffTime+OnTime decreased significantly (242 vs 225, P=.03) in 7 patients tested before and after correction for hyponatremia (126±3 to 132±4 meq/L, P=.01), 10±5 days apart. Conclusions A smartphone app called EncephalApp has good face validity, test–retest reliability, and external validity for the diagnosis of CHE. PMID:24846278
The root coverage esthetic score: Intra-examiner reliability among students and faculty at tufts university school of dental medicine.

PubMed

Isaia, Federica; Gyurko, Robert; Roomian, Tamar C; Hawley, Charles E

2018-04-06

The Root Coverage Esthetic Score (RES) was published in 2009 as an esthetic scoring system to measure visible final outcomes of root coverage procedures performed on Miller I and II recession defects. The aim of this study was to evaluate the intra-examiner, intra-group, and inter-examiner reliability of the (Root Coverage Esthetic Score) RES when used among periodontal faculty, post-graduate students in periodontology, and pre-doctoral DMD students when using the RES at Tufts University School of Dental Medicine (TUSDM). Thirty-three participants (12 second year DMD students, 11 periodontal residents, and 10 faculty members) were assembled to evaluate 25 baseline and 6-months post-treatment outcomes of mucogingival surgeries using the RES. Each projection was shown for 30 seconds during which the participants were asked to use the RES scoring system to evaluate the surgical outcomes. The results were then recorded on a standardized worksheet grid. To test intra-examiner reliability, 7 of the 25 projections were shown twice. Intra-examiner reliability and inter-examiner reliability were assessed using intraclass correlation coefficient using a two-way mixed effects model, and stratified by education level. PG residents had the highest tendency to agree with each other with an interclass correlation (ICC) of 0.53 (95%CI 0.36 - 0.74). DMD students had an ICC: 0.51 (95%CI: 0.33 - 0.75), and PG faculty members produced an ICC: 0.41 (95%CI: 0.24 - 0.64). There was no statistically significant difference in ICC among the three groups of participants (Kruskal-Wallis test, P = 0.2440). When the data for each RES element were then combined, the mean ICC for the total interrater agreement for RES was 0.48 (95% CI: 0.32-0.71). This corresponds to an overall moderate agreement among all participants using the RES to evaluate the 25 surgical outcomes. The intra-examiner reliability within each of the three groups was quite high. The highest mean ICC was produced by the PG Faculty (0.908). The mean ICCs for PG residents was 0.867, and the mean ICC for DMD students was 0.855. The Kruskal-Wallis test (p = 0.46) failed to find any statistical difference in intra-examiner reliability between the three groups of participants CONCLUSIONS: The RES is a "moderately" reliable scoring system for mucogingival treatments in a dental school setting and can be used even by operators with different level of periodontal experience. This scoring system can be repeated by the same examiner obtaining reliable results. This article is protected by copyright. All rights reserved. © 2018 American Academy of Periodontology.
Day-to-day reliability of gait characteristics in rats.

PubMed

Raffalt, Peter C; Nielsen, Louise R; Madsen, Stefan; Munk Højberg, Laurits; Pingel, Jessica; Nielsen, Jens Bo; Wienecke, Jacob; Alkjær, Tine

2018-04-27

The purpose of the present study was to determine the day-to-day reliability in stride characteristics in rats during treadmill walking obtained with two-dimensional (2D) motion capture. Kinematics were recorded from 26 adult rats during walking at 8 m/min, 12 m/min and 16 m/min on two separate days. Stride length, stride time, contact time, swing time and hip, knee and ankle joint range of motion were extracted from 15 strides. The relative reliability was assessed using intra-class correlation coefficients (ICC(1,1)) and (ICC(3,1)). The absolute reliability was determined using measurement error (ME). Across walking speeds, the relative reliability ranged from fair to good (ICCs between 0.4 and 0.75). The ME was below 91 mm for strides lengths, below 55 ms for the temporal stride variables and below 6.4° for the joint angle range of motion. In general, the results indicated an acceptable day-to-day reliability of the gait pattern parameters observed in rats during treadmill walking. The results of the present study may serve as a reference material that can help future intervention studies on rat gait characteristics both with respect to the selection of outcome measures and in the interpretation of the results. Copyright © 2018 Elsevier Ltd. All rights reserved.
Validity and reliability of the Japanese version of the FIM + FAM in patients with cerebrovascular accident.

PubMed

Miki, Emi; Yamane, Shingo; Yamaoka, Mai; Fujii, Hiroe; Ueno, Hiroka; Kawahara, Toshie; Tanaka, Keiko; Tamashiro, Hiroaki; Inoue, Eiji; Okamoto, Takatsugu; Kuriyama, Masaru

2016-09-01

The study aim was to investigate the validity and reliability of the Functional Independence Measure and Functional Assessment Measure (FIM + FAM), which is unfamiliar in Japan, by using its Japanese version (FIM + FAM-j) in patients with cerebrovascular accident (CVA). Forty-two CVA patients participated. Criterion validity was examined by correlating the full scale and subscales of FIM + FAM-j with several well-established measurements using Spearman's correlation coefficient. Reliability was evaluated by internal consistency (tested by Cronbach's alpha coefficient) and intra-rater reliability (tested by Kendall's tau correlation coefficient). Good-to-excellent criterion validity was found between the full scale and motor subscales of the FIM + FAM-j and the Barthel Index, National Institutes of Health Stroke Scale, modified Rankin Scale, and lower extremity Brunnstrom Recovery Stage. High internal consistency was observed within the full-scale FIM + FAM-j and the motor and cognitive subscales (Cronbach's alphas were 0.968, 0.954, and 0.948, respectively). Additionally, good intra-rater reliability was observed within the full scale and motor subscales, and excellent reliability for the cognitive subscales (taus were 0.83, 0.80, and 0.98, respectively). This study showed that the FIM + FAM-j demonstrated acceptable levels of validity and reliability when used for CVA as a measure of disability.
Inter- and intraobserver reliability of the vertebral, local and segmental kyphosis in 120 traumatic lumbar and thoracic burst fractures: evaluation in lateral X-rays and sagittal computed tomographies

PubMed Central

Brunner, Alexander; Gühring, Markus; Schmälzle, Traude; Weise, Kuno; Badke, Andreas

2009-01-01

Evaluation of the kyphosis angle in thoracic and lumbar burst fractures is often used to indicate surgical procedures. The kyphosis angle could be measured as vertebral, segmental and local kyphosis according to the method of Cobb. The vertebral, segmental and local kyphosis according to the method of Cobb were measured at 120 lateral X-rays and sagittal computed tomographies of 60 thoracic and 60 lumbar burst fractures by 3 independent observers on 2 separate occasions. Osteoporotic fractures were excluded. The intra- and interobserver reliability of these angles in X-ray and computed tomogram, using the intra class correlation coefficient (ICC) were evaluated. Highest reproducibility showed the segmental kyphosis followed by the vertebral kyphosis. For thoracic fractures segmental kyphosis shows in X-ray “excellent” inter- and intraobserver reliabilities (ICC 0.826, 0.802) and for lumbar fractures “good” to “excellent” inter- and intraobserver reliabilities (ICC = 0.790, 0.803). In computed tomography, the segmental kyphosis showed “excellent” inter- and intraobserver reliabilities (ICC = 0.824, 0.801) for thoracic and “excellent” inter- and intraobserver reliabilities (ICC = 0.874, 0.835) for the lumbar fractures. Regarding both diagnostic work ups (X-ray and computed tomography), significant differences were evaluated in interobserver reliabilities for vertebral kyphosis measured in lumbar fracture X-rays (p = 0.035) and interobserver reliabilities for local kyphosis, measured in thoracic fracture X-rays (p = 0.010). Regarding both fracture localizations (thoracic and lumbar fractures), significant differences could only be evaluated in interobserver reliabilities for the local kyphosis measured in computed tomographies (p = 0.045) and in intraobserver reliabilities for the vertebral kyphosis measured in X-rays (p = 0.024). “Good” to “excellent” inter- and intraobserver reliabilities for vertebral, segmental and local kyphosis in X-ray make these angles to a helpful tool, indicating surgical procedures. For the practical use in lateral X-ray, we emphasize the determination of the segmental kyphosis, because of the highest reproducibility of this angle. “Good” to “excellent” inter- and intraobserver reliabilities for these three angles could also be evaluated in computed tomographies. Therefore, also in computed tomography, the use of these three angles seems to be generally possible. For a direct correlation of the results in lateral X-ray and in computed tomography, further studies should be needed. PMID:19953277
Lumbar lordosis and sacral slope in lumbar spinal stenosis: standard values and measurement accuracy.

PubMed

Bredow, J; Oppermann, J; Scheyerer, M J; Gundlfinger, K; Neiss, W F; Budde, S; Floerkemeier, T; Eysel, P; Beyer, F

2015-05-01

Radiological study. To asses standard values, intra- and interobserver reliability and reproducibility of sacral slope (SS) and lumbar lordosis (LL) and the correlation of these parameters in patients with lumbar spinal stenosis (LSS). Anteroposterior and lateral X-rays of the lumbar spine of 102 patients with LSS were included in this retrospective, radiologic study. Measurements of SS and LL were carried out by five examiners. Intraobserver correlation and correlation between LL and SS were calculated with Pearson's r linear correlation coefficient and intraclass correlation coefficients (ICC) were calculated for inter- and intraobserver reliability. In addition, patients were examined in subgroups with respect to previous surgery and the current therapy. Lumbar lordosis averaged 45.6° (range 2.5°-74.9°; SD 14.2°), intraobserver correlation was between Pearson r = 0.93 and 0.98. The measurement of SS averaged 35.3° (range 13.8°-66.9°; SD 9.6°), intraobserver correlation was between Pearson r = 0.89 and 0.96. Intraobserver reliability ranged from 0.966 to 0.992 ICC in LL measurements and 0.944-0.983 ICC in SS measurements. There was an interobserver reliability ICC of 0.944 in LL and 0.990 in SS. Correlation between LL and SS averaged r = 0.79. No statistically significant differences were observed between the analyzed subgroups. Manual measurement of LL and SS in patients with LSS on lateral radiographs is easily performed with excellent intra- and interobserver reliability. Correlation between LL and SS is very high. Differences between patients with and without previous decompression were not statistically significant.
A marker placement laser device for improving repeatability in 3D-foot motion analysis.

PubMed

Kalkum, Eva; van Drongelen, Stefan; Mussler, Johannes; Wolf, Sebastian I; Kuni, Benita

2016-02-01

In 3D gait analysis, the repeated positioning of markers is associated with a high error rate, particularly when using a complex foot model with many markers. Therefore, a marker placement laser device was developed that ensures a reliable repositioning of markers. We report the development and reliability of this device for the foot at different tape conditions. In 38 subjects, markers were placed at the foot according to the Heidelberg foot measurement method. Subjects were tested barefoot and barefoot with three different tape conditions. For all conditions, a static standing trial was captured. We analyzed differences in distances between markers and the intra-class correlation coefficients (ICC). Small differences between the conditions (0.03-3.28 mm) and excellent ICCs (0.91-0.97 mm) were found for all parameters. The laser marker placement device appeared to be a reliable method to place markers on a tape at previously palpated positions and ensures an exact position. The device could find a wide application in different clinical research fields. Copyright © 2015 Elsevier B.V. All rights reserved.
Leg lengthening and femoral-offset reduction after total hip arthroplasty: where is the problem - stem or cup positioning?

PubMed

Al-Amiry, Bariq; Mahmood, Sarwar; Krupic, Ferid; Sayed-Noor, Arkan

2017-09-01

Background Restoration of femoral offset (FO) and leg length is an important goal in total hip arthroplasty (THA) as it improves functional outcome. Purpose To analyze whether the problem of postoperative leg lengthening and FO reduction is related to the femoral stem or acetabular cup positioning or both. Material and Methods Between September 2010 and April 2013, 172 patients with unilateral primary osteoarthritis treated with THA were included. Postoperative leg-length discrepancy (LLD) and global FO (summation of cup and FO) were measured by two observers using a standardized protocol for evaluation of antero-posterior plain hip radiographs. Patients with postoperative leg lengthening ≥10 mm (n = 41) or with reduced global FO >5 mm (n = 58) were further studied by comparing the stem and cup length of the operated side with the contralateral side in the lengthening group, and by comparing the stem and cup offset of the operated side with the contralateral side in the FO reduction group. We evaluated also the inter-observer and intra-observer reliability of the radiological measurements. Results Both observers found that leg lengthening was related to the stem positioning while FO reduction was related to the positioning of both the femoral stem and acetabular cup. Both inter-observer reliability and intra-observer reproducibility were moderate to excellent (intra-class correlation co-efficient, ICC ≥0.69). Conclusion Post THA leg lengthening was mainly caused by improper femoral stem positioning while global FO reduction resulted from improper positioning of both the femoral stem and the acetabular cup.
Malocclusion Class II division 1 skeletal and dental relationships measured by cone-beam computed tomography.

PubMed

Xu, Yiling; Oh, Heesoo; Lagravère, Manuel O

2017-09-01

The purpose of this study was to locate traditionally-used landmarks in two-dimensional (2D) images and newly-suggested ones in three-dimensional (3D) images (cone-beam computer tomographies [CBCTs]) and determine possible relationships between them to categorize patients with Class II-1 malocclusion. CBCTs from 30 patients diagnosed with Class II-1 malocclusion were obtained from the University of Alberta Graduate Orthodontic Program database. The reconstructed images were downloaded and visualized using the software platform AVIZO ® . Forty-two landmarks were chosen and the coordinates were then obtained and analyzed using linear and angular measurements. Ten images were analyzed three times to determine the reliability and measurement error of each landmark using Intra-Class Correlation coefficient (ICC). Descriptive statistics were done using the SPSS statistical package to determine any relationships. ICC values were excellent for all landmarks in all axes, with the highest measurement error of 2mm in the y-axis for the Gonion Left landmark. Linear and angular measurements were calculated using the coordinates of each landmark. Descriptive statistics showed that the linear and angular measurements used in the 2D images did not correlate well with the 3D images. The lowest standard deviation obtained was 0.6709 for S-GoR/N-Me, with a mean of 0.8016. The highest standard deviation was 20.20704 for ANS-InfraL, with a mean of 41.006. The traditional landmarks used for 2D malocclusion analysis show good reliability when transferred to 3D images. However, they did not reveal specific skeletal or dental patterns when trying to analyze 3D images for malocclusion. Thus, another technique should be considered when classifying 3D CBCT images for Class II-1malocclusion. Copyright © 2017 CEO. Published by Elsevier Masson SAS. All rights reserved.
Reliability of peripheral arterial tonometry in patients with heart failure, diabetic nephropathy and arterial hypertension.

PubMed

Weisrock, Fabian; Fritschka, Max; Beckmann, Sebastian; Litmeier, Simon; Wagner, Josephine; Tahirovic, Elvis; Radenovic, Sara; Zelenak, Christine; Hashemi, Djawid; Busjahn, Andreas; Krahn, Thomas; Pieske, Burkert; Dinh, Wilfried; Düngen, Hans-Dirk

2017-08-01

Endothelial dysfunction plays a major role in cardiovascular diseases and pulse amplitude tonometry (PAT) offers a non-invasive way to assess endothelial dysfunction. However, data about the reliability of PAT in cardiovascular patient populations are scarce. Thus, we evaluated the test-retest reliability of PAT using the natural logarithmic transformed reactive hyperaemia index (LnRHI). Our cohort consisted of 91 patients (mean age: 65±9.7 years, 32% female), who were divided into four groups: those with heart failure with preserved ejection fraction (HFpEF) ( n=25), heart failure with reduced ejection fraction (HFrEF) ( n=22), diabetic nephropathy ( n=21), and arterial hypertension ( n=23). All subjects underwent two separate PAT measurements at a median interval of 7 days (range 4-14 days). LnRHI derived by PAT showed good reliability in subjects with diabetic nephropathy (intra-class correlation (ICC) = 0.863) and satisfactory reliability in patients with both HFpEF (ICC = 0.557) and HFrEF (ICC = 0.576). However, in subjects with arterial hypertension, reliability was poor (ICC = 0.125). We demonstrated that PAT is a reliable technique to assess endothelial dysfunction in adults with diabetic nephropathy, HFpEF or HFrEF. However, in subjects with arterial hypertension, we did not find sufficient reliability, which can possibly be attributed to variations in heart rate and the respective time of the assessments. Clinical Trial Registration Identifier: NCT02299960.
Reliability of the Inverse Water Volumetry Method to Measure the Volume of the Upper Limb.

PubMed

Beek, Martinus A; te Slaa, Alexander; van der Laan, Lijckle; Mulder, Paul G H; Rutten, Harm J T; Voogd, Adri C; Luiten, Ernest J T; Gobardhan, Paul D

2015-06-01

Lymphedema of the upper extremity is a common side effect of lymph node dissection or irradiation of the axilla. Several techniques are being applied in order to examine the presence and severity of lymphedema. Measurement of circumference of the upper extremity is most frequently performed. An alternative is the water-displacement method. The aim of this study was to determine the reliability and the reproducibility of the "Inverse Water Volumetry apparatus" (IWV-apparatus) for the measurement of arm volumes. The IWV-apparatus is based on the water-displacement method. Measurements were performed by three breast cancer nurse practitioners on ten healthy volunteers in three weekly sessions. The intra-class correlation coefficient, defined as the ratio of the subject component to the total variance, equaled 0.99. The reliability index is calculated as 0.14 kg. This indicates that only changes in a patient's arm volume measurement of more than 0.14 kg would represent a true change in arm volume, which is about 6% of the mean arm volume of 2.3 kg. The IWV-apparatus proved to be a reliable and reproducible method to measure arm volume.
Cross-cultural adaptation, reliability, and validity of the Persian version of the Cumberland Ankle Instability Tool.

PubMed

Hadadi, Mohammad; Ebrahimi Takamjani, Ismail; Ebrahim Mosavi, Mohammad; Aminian, Gholamreza; Fardipour, Shima; Abbasi, Faeze

2017-08-01

The purpose of the present study was to translate and to cross-culturally adapt the Cumberland Ankle Instability Tool (CAIT) into Persian language and to evaluate its psychometric properties. The International Quality of Life Assessment process was pursued to translate CAIT into Persian. Two groups of Persian-speaking individuals, 105 participants with a history of ankle sprain and 30 participants with no history of ankle sprain, were asked to fill out Persian version of CAIT (CAIT-P), Foot and Ankle Ability Measure (FAAM), and Visual Analog Scale (VAS). Data obtained from the first administration of CAIT were used to evaluate floor and ceiling effects, internal consistency, dimensionality, and criterion validity. To determine the test-retest reliability, 45 individuals re-filled CAIT 5-7 days after the first session. Cronbach's alpha was over the cutoff point of 0.70 for both ankles and in both groups. The intra-class correlation coefficient was high for right (0.95) and left (0.91) ankles. There was a strong correlation between each item and the total score of the CAIT-P. Although the CAIT-P had strong correlation with VAS, its correlation with both subscales of FAAM was moderate. The CAIT-P has good validity and reliability and it can be used by clinicians and researchers for identification and investigation of functional ankle instability. Implications for Rehabilitation Chronic ankle instability is one of the most common consequences of acute ankle sprain. Cumberland Ankle Instability Tool is an acceptable measure to determine functional ankle instability and its severity. The Persian version of Cumberland Ankle Instability Tool is a valid and reliable tool for clinical and research purpose in Persian-speaking individuals.
The Reliability of Individualized Load-Velocity Profiles.

PubMed

Banyard, Harry G; Nosaka, K; Vernon, Alex D; Haff, G Gregory

2017-11-15

This study examined the reliability of peak velocity (PV), mean propulsive velocity (MPV), and mean velocity (MV) in the development of load-velocity profiles (LVP) in the full depth free-weight back squat performed with maximal concentric effort. Eighteen resistance-trained men performed a baseline one-repetition maximum (1RM) back squat trial and three subsequent 1RM trials used for reliability analyses, with 48-hours interval between trials. 1RM trials comprised lifts from six relative loads including 20, 40, 60, 80, 90, and 100% 1RM. Individualized LVPs for PV, MPV, or MV were derived from loads that were highly reliable based on the following criteria: intra-class correlation coefficient (ICC) >0.70, coefficient of variation (CV) ≤10%, and Cohen's d effect size (ES) <0.60. PV was highly reliable at all six loads. Importantly, MPV and MV were highly reliable at 20, 40, 60, 80 and 90% but not 100% 1RM (MPV: ICC=0.66, CV=18.0%, ES=0.10, standard error of the estimate [SEM]=0.04m·s -1 ; MV: ICC=0.55, CV=19.4%, ES=0.08, SEM=0.04m·s -1 ). When considering the reliable ranges, almost perfect correlations were observed for LVPs derived from PV 20-100% (r=0.91-0.93), MPV 20-90% (r=0.92-0.94) and MV 20-90% (r=0.94-0.95). Furthermore, the LVPs were not significantly different (p>0.05) between trials, movement velocities, or between linear regression versus second order polynomial fits. PV 20-100% , MPV 20-90% , and MV 20-90% are reliable and can be utilized to develop LVPs using linear regression. Conceptually, LVPs can be used to monitor changes in movement velocity and employed as a method for adjusting sessional training loads according to daily readiness.
The reliability and validity study of the Kinesthetic and Visual Imagery Questionnaire in individuals with Multiple Sclerosis

PubMed Central

Tabrizi, Yousef Moghadas; Zangiabadi, Nasser; Mazhari, Shahrzad; Zolala, Farzaneh

2013-01-01

Objective Motor imagery (MI) has been recently considered as an adjunct to physical rehabilitation in patients with multiple sclerosis (MS). It is necessary to assess MI abilities and benefits in patients with MS by using a reliable tool. The Kinesthetic and Visual Imagery Questionnaire (KVIQ) was recently developed to assess MI ability in patients with stroke and other disabilities. Considering the different underlying pathologies, the present study aimed to examine the validity and reliability of the KVIQ in MS patients. Method Fifteen MS patients were assessed using the KVIQ in 2 sessions (5-14days apart) by the same examiner. In the second session, the participants also completed a revised MI questionnaire (MIQ-R) as the gold standard. Intra-class correlation coefficients (ICCs) were measured to determine test-retest reliability. Spearman's correlation analysis was performed to assess concurrent validity with the MIQ-R. Furthermore, the internal consistency (Cronbach's alpha) and factorial structure of the KVIQ were studied. Results The test-retest reliability for the KVIQ was good (ICCs: total KVIQ=0.89, visual KVIQ=0.85, and kinesthetic KVIQ=0.93), and the concurrent validity between the KVIQ and MIQ-R was good (r=0.79). The KVIQ had good internal consistency, with high Cronbach's alpha (alpha=0.84). Factorial analysis showed the bi-factorial structure of the KVIQ, which was explained by visual=57.6% and kinesthetic=32.4%. Conclusions The results of the present study revealed that the KVIQ is a valid and reliable tool for assessing MI in MS patients. PMID:24271091
The reliability and validity study of the Kinesthetic and Visual Imagery Questionnaire in individuals with multiple sclerosis.

PubMed

Tabrizi, Yousef Moghadas; Zangiabadi, Nasser; Mazhari, Shahrzad; Zolala, Farzaneh

2013-01-01

Motor imagery (MI) has been recently considered as an adjunct to physical rehabilitation in patients with multiple sclerosis (MS). It is necessary to assess MI abilities and benefits in patients with MS by using a reliable tool. The Kinesthetic and Visual Imagery Questionnaire (KVIQ) was recently developed to assess MI ability in patients with stroke and other disabilities. Considering the different underlying pathologies, the present study aimed to examine the validity and reliability of the KVIQ in MS patients. Fifteen MS patients were assessed using the KVIQ in 2 sessions (5-14 days apart) by the same examiner. In the second session, the participants also completed a revised MI questionnaire (MIQ-R) as the gold standard. Intra-class correlation coefficients (ICCs) were measured to determine test-retest reliability. Spearman's correlation analysis was performed to assess concurrent validity with the MIQ-R. Furthermore, the internal consistency (Cronbach's alpha) and factorial structure of the KVIQ were studied. The test-retest reliability for the KVIQ was good (ICCs: total KVIQ=0.89, visual KVIQ=0.85, and kinesthetic KVIQ=0.93), and the concurrent validity between the KVIQ and MIQ-R was good (r=0.79). The KVIQ had good internal consistency, with high Cronbach's alpha (alpha=0.84). Factorial analysis showed the bi-factorial structure of the KVIQ, which was explained by visual=57.6% and kinesthetic=32.4%. The results of the present study revealed that the KVIQ is a valid and reliable tool for assessing MI in MS patients.
Reliability and relative validity of three physical activity questionnaires in Taizhou population of China: the Taizhou Longitudinal Study.

PubMed

Hu, B; Lin, L F; Zhuang, M Q; Yuan, Z Y; Li, S Y; Yang, Y J; Lu, M; Yu, S Z; Jin, L; Ye, W M; Wang, X F

2015-09-01

To examine the test-retest reliabilities and relative validities of the Chinese version of short International Physical Activity Questionnaire (IPAQ-S-C), the Global Physical Activity Questionnaire (GPAQ-C), and the Total Energy Expenditure Questionnaire (TEEQ-C) in a population-based prospective study, the Taizhou Longitudinal Study (TZLS). A longitudinal comparative study. A total of 205 participants (male: 38.54%) aged 30-70 years completed three questionnaires twice (day one and day nine) and physical activity log (PA-log) over seven consecutive days. The test-retest reliabilities were evaluated using intra-class correlation coefficients (ICCs) and the relative validities were estimated by comparing the data from physical activity questionnaires (PAQs) and PA-log. Good reliabilities were observed between the repeated PAQs. The ICCs ranged from 0.51 to 0.80 for IPAQ-C, 0.67 to 0.85 for GPAQ-C, and 0.74 to 0.94 for TEEQ-C, respectively. Energy expenditure of most PA domains estimated by the three PAQs correlated moderately with the results recorded by PA-log except the walking domain of IPAQ-S-C. The partial correlation coefficients between the PAQs and PA-log ranged from 0.44 to 0.58 for IPAQ-S-C, 0.26 to 0.52 for GPAQ-C, and 0.41 to 0.72 for TEEQ-C, respectively. Bland-Altman plots showed acceptable agreement between the three PAQs and PA-log. The three PAQs, especially TEEQ-C, were relatively reliable and valid for assessment of physical activity and could be used in TZLS. Copyright © 2015 The Royal Society for Public Health. Published by Elsevier Ltd. All rights reserved.
Asthma Symptom Utility Index: Reliability, validity, responsiveness and the minimal important difference in adult asthma patients

PubMed Central

Bime, Christian; Wei, Christine Y.; Holbrook, Janet T.; Sockrider, Marianna M.; Revicki, Dennis A.; Wise, Robert A.

2012-01-01

Background The evaluation of asthma symptoms is a core outcome measure in asthma clinical research. The Asthma Symptom Utility Index (ASUI) was developed to assess frequency and severity of asthma symptoms. The psychometric properties of the ASUI are not well characterized and a minimal important difference (MID) is not established. Objectives We assessed the reliability, validity, and responsiveness to change of the ASUI in a population of adult asthma patients. We also sought to determine the MID for the ASUI. Methods Adult asthma patients (n = 1648) from two previously completed multicenter randomized trials were included. Demographic information, spirometry, ASUI scores, and other asthma questionnaire scores were obtained at baseline and during follow-up visits. Participants also kept a daily asthma diary. Results Internal consistency reliability of the ASUI was 0.74 (Cronbach’s alpha). Test-retest reliability was 0.76 (intra-class correlation). Construct validity was demonstrated by significant correlations between ASUI scores and Asthma Control Questionnaire (ACQ) scores (Spearman correlation r = −0.79, 95% CI [−0.85, −0.75], P<0.001) and Mini Asthma Quality of Life Questionnaire (Mini AQLQ) scores (r = 0.59, 95% CI [0.51, 0.61], P<0.001). Responsiveness to change was demonstrated, with significant differences between mean changes in ASUI score across groups of participants differing by 10% in the percent predicted FEV1 (P<0.001), and by 0.5 points in ACQ score (P < 0.001). Anchor-based methods and statistical methods support an MID for the ASUI of 0.09 points. Conclusions The ASUI is reliable, valid, and responsive to changes in asthma control over time. The MID of the ASUI (range of scores 0–1) is 0.09. PMID:23026499

Validity and Reliability of the Chronic Respiratory Disease Questionnaire in Elderly Individuals with Mild to Moderate Non-Cystic Fibrosis Bronchiectasis.

PubMed

Vodanovich, Domagoj A; Bicknell, Thomas J; Holland, Anne E; Hill, Catherine J; Cecins, Nola; Jenkins, Sue; McDonald, Christine F; Burge, Angela T; Thompson, Philip; Stirling, Robert G; Lee, Annemarie L

2015-01-01

The chronic respiratory disease questionnaire (CRDQ) is designed to assess health-related quality of life (HRQOL) in chronic respiratory conditions, but its reliability, validity and responsiveness in individuals with mild to moderate non-cystic fibrosis (CF) bronchiectasis are unclear. This study aimed to determine measurement properties of the CRDQ in non-CF bronchiectasis. Participants with non-CF bronchiectasis involved in a randomised controlled trial of exercise training were recruited. Internal consistency was assessed using Cronbach's α. Over 8 weeks, reliability was evaluated using intra-class correlation coefficients and Bland-Altman analysis for measures of agreement. Convergent and divergent validity was assessed by correlations with the other HRQOL questionnaires and the Hospital Anxiety and Depression Scale (HADS). The responsiveness to exercise training was assessed using effect sizes and standardised response means. Eighty-five participants were included (mean age ± SD, 64 ± 13 years). Internal consistency was adequate (>0.7) for all CRDQ domains and the total score. Test-retest reliability ranged from 0.69 to 0.85 for each CRDQ domain and was 0.82 for the total score. Dyspnoea (CRDQ) was related to St George's respiratory questionnaire (SGRQ) symptoms only (r = 0.38), with no relationship to the Leicester cough questionnaire (LCQ) or HADS. Moderate correlations were found between the total score of the CRDQ, the SGRQ (rs = -0.49) and the LCQ score (rs = 0.51). Lower CRDQ scores were associated with higher anxiety and depression (rs = -0.46 to -0.56). The responsiveness of the CRDQ was small (effect size 0.1-0.24). The CRDQ is a valid and reliable measure of HRQOL in mild to moderate non-CF bronchiectasis, but responsiveness was limited. © 2015 S. Karger AG, Basel.
Validity of the Special Needs Education Assessment Tool (SNEAT), a Newly Developed Scale for Children with Disabilities.

PubMed

Kohara, Aiko; Han, ChangWan; Kwon, HaeJin; Kohzuki, Masahiro

2015-11-01

The improvement of the quality of life (QOL) of children with disabilities has been considered important. Therefore, the Special Needs Education Assessment Tool (SNEAT) was developed based on the concept of QOL to objectively evaluate the educational outcome of children with disabilities. SNEAT consists of 11 items in three domains: physical functioning, mental health, and social functioning. This study aimed to verify the reliability and construct validity of SNEAT using 93 children collected from the classes on independent activities of daily living for children with disabilities in Okinawa Prefecture between October and November 2014. Survey data were collected in a longitudinal prospective cohort study. The reliability of SNEAT was verified via the internal consistency method and the test-pretest method; both the coefficient of Cronbach's α and the intra-class correlation coefficient were over 0.7. The validity of SNEAT was also verified via one-way repeated-measures ANOVA and the latent growth curve model. The scores of all the items and domains and the total scores obtained from one-way repeated-measures ANOVA were the same as the predicted scores. SNEAT is valid based on its goodness-of-fit values obtained using the latent growth curve model, where the values of comparative fit index (0.983) and root mean square error of approximation (0.062) were within the goodness-of-fit range. These results indicate that SNEAT has high reliability and construct validity and may contribute to improve QOL of children with disabilities in the classes on independent activities of daily living for children with disabilities.
Measurement of glenohumeral joint translation using real-time ultrasound imaging: A physiotherapist and sonographer intra-rater and inter-rater reliability study.

PubMed

Rathi, Sangeeta; Taylor, Nicholas F; Gee, Jamie; Green, Rodney A

2016-12-01

Ultrasonography is an economical and non-invasive method for measuring real-time joint movements. Although physiotherapists are increasingly using ultrasound imaging for rotator cuff disorders, there is a lack of evidence on their reliability in using ultrasonography to measure glenohumeral translation. The aim of this study was to evaluate the reliability of a physiotherapist in measuring anterior and posterior glenohumeral joint translation with ultrasound. Study design: within day reliability. Anterior and posterior glenohumeral translations were measured at rest, in response to passive accessory motion testing force, and with isometric internal and external rotation in 12 young healthy adults. All the measurements were made in real time by a physiotherapist and an experienced sonographer in two positions (neutral and abducted) and in two views (anterior and posterior). Intra-rater and inter-rater reliability were expressed using intraclass correlation coefficients (ICC) and measurement error (mm). Intra-rater reliability was good for both raters (ICC P : 0.86-0.98; ICC S : 0.85-0.96). The inter-rater reliability between the physiotherapist and sonographer was moderate to good for posterior measurements (ICC 0.50-0.75) and poor to moderate for anterior measurements (ICC 0.31-0.53). For both intra-rater and inter-rater measurements, posterior translation was more reliable than the anterior translation with smaller measurement errors (posterior: 0.1-0.2 mm, anterior: 0.2-0.3 mm). A physiotherapist with minimal training was reliable in measuring glenohumeral joint translations. The ultrasound method was reliable for repeated measurement of both anterior and posterior glenohumeral translations with posterior measurements being more reliable than anterior. This method is recommended for future research to investigate the stabilising role of rotator cuff muscles. Copyright © 2016 Elsevier Ltd. All rights reserved.
Test-retest reliability of pulse amplitude tonometry measures of vascular endothelial function: implications for clinical trial design.

PubMed

McCrea, Cindy E; Skulas-Ray, Ann C; Chow, Mosuk; West, Sheila G

2012-02-01

Endothelial dysfunction is an important outcome for assessing vascular health in intervention studies. However, reliability of the standard non-invasive method (flow-mediated dilation) is a significant challenge for clinical applications and multicenter trials. We evaluated the repeatability of pulse amplitude tonometry (PAT) to measure change in pulse wave amplitude during reactive hyperemia (Itamar Medical Ltd, Caesarea, Israel). Twenty healthy adults completed two PAT tests (mean interval = 19.5 days) under standardized conditions. PAT-derived measures of endothelial function (reactive hyperemia index, RHI) and arterial stiffness (augmentation index, AI) showed strong repeatability (intra-class correlations = 0.74 and 0.83, respectively). To guide future research, we also analyzed sample size requirements for a range of effect sizes. A crossover design powered at 0.90 requires 28 participants to detect a 15% change in RHI. Our study is the first to show that PAT measurements are repeatable in adults over an interval greater than 1 week.
Comparison of different passive knee extension torque-angle assessments.

PubMed

Freitas, Sandro R; Vaz, João R; Bruno, Paula M; Valamatos, Maria J; Mil-Homens, Pedro

2013-11-01

Previous studies have used isokinetic dynamometry to assess joint torques and angles during passive extension of the knee, often without reporting upon methodological errors and reliability outcomes. In addition, the reliability of the techniques used to measure passive knee extension torque-angle and the extent to which reliability may be affected by the position of the subjects is also unclear. Therefore, we conducted an analysis of the intra- and inter-session reliability of two methods of assessing passive knee extension: (A) a 2D kinematic analysis coupled to a custom-made device that enabled the direct measurement of resistance to stretch and (B) an isokinetic dynamometer used in two testing positions (with the non-tested thigh either flexed at 45° or in the neutral position). The intra-class correlation coefficients (ICCs) of torque, the slope of the torque-angle curve, and the parameters of the mathematical model that were fit to the torque-angle data for the above conditions were measured in sixteen healthy male subjects (age: 21.4 ± 2.1 yr; BMI: 22.6 ± 3.3 kg m(-2); tibial length: 37.4 ± 3.4 cm). The results found were: (1) methods A and B led to distinctly different torque-angle responses; (2) passive torque-angle relationship and stretch tolerance were influenced by the position of the non-tested thigh; and (3) ICCs obtained for torque were higher than for the slope and for the mathematical parameters that were fit to the torque-angle curve. In conclusion, the measurement method that is used and the positioning of subjects can influence the passive knee extension torque-angle outcome.
Analysis of linear measurements on 3D surface models using CBCT data segmentation obtained by automatic standard pre-set thresholds in two segmentation software programs: an in vitro study.

PubMed

Poleti, Marcelo Lupion; Fernandes, Thais Maria Freire; Pagin, Otávio; Moretti, Marcela Rodrigues; Rubira-Bullen, Izabel Regina Fischer

2016-01-01

The aim of this in vitro study was to evaluate the reliability and accuracy of linear measurements on three-dimensional (3D) surface models obtained by standard pre-set thresholds in two segmentation software programs. Ten mandibles with 17 silica markers were scanned for 0.3-mm voxels in the i-CAT Classic (Imaging Sciences International, Hatfield, PA, USA). Twenty linear measurements were carried out by two observers two times on the 3D surface models: the Dolphin Imaging 11.5 (Dolphin Imaging & Management Solutions, Chatsworth, CA, USA), using two filters(Translucent and Solid-1), and in the InVesalius 3.0.0 (Centre for Information Technology Renato Archer, Campinas, SP, Brazil). The physical measurements were made by another observer two times using a digital caliper on the dry mandibles. Excellent intra- and inter-observer reliability for the markers, physical measurements, and 3D surface models were found (intra-class correlation coefficient (ICC) and Pearson's r ≥ 0.91). The linear measurements on 3D surface models by Dolphin and InVesalius software programs were accurate (Dolphin Solid-1 > InVesalius > Dolphin Translucent). The highest absolute and percentage errors were obtained for the variable R1-R1 (1.37 mm) and MF-AC (2.53 %) in the Dolphin Translucent and InVesalius software, respectively. Linear measurements on 3D surface models obtained by standard pre-set thresholds in the Dolphin and InVesalius software programs are reliable and accurate compared with physical measurements. Studies that evaluate the reliability and accuracy of the 3D models are necessary to ensure error predictability and to establish diagnosis, treatment plan, and prognosis in a more realistic way.
Cross-Cultural Adaptation, Reliability and Validity of the Danish Version of the Readiness for Return to Work Instrument.

PubMed

Stapelfeldt, Christina Malmose; Momsen, Anne-Mette Hedeager; Lund, Thomas; Grønborg, Therese Koops; Hogg-Johnson, Sheilah; Jensen, Chris; Skakon, Janne; Labriola, Merete

2018-06-06

The objective of the present study was to translate and validate the Canadian Readiness for Return To Work instrument (RRTW-CA) into a Danish version (RRTWDK) by testing its test-retest and internal consistency reliability and its structural and construct validity. Cross-cultural adaptation of the six-staged RRTW-CA instrument was performed in a standardised, systematic five-step-procedure; forward translation, panel synthesis of the translation, back translation, consolidation and revision by researchers, and finally pre-testing. This RRTW-DK beta-version was tested for its psychometric properties by intra-class correlation coefficient and standard error of measurement (n = 114), Cronbach's alpha (n = 471), confirmatory factor analyses (n = 373), and Spearman's rank correlation coefficient (n = 436) in sickness beneficiaries from a municipal employment agency and hospital wards. The original RRTW-CA stage structure could not be confirmed in the RRTWDK. The psychometric properties were thus inconclusive. The RRTW-DK cannot be recommended for use in the current version as the RRTW construct is questionable. The RRTW construct needs further exploration, preferably in a population that is homogeneous with regard to cause of sickness, disability duration and age.
Cigarette dependence questionnaire: development and psychometric testing with male smokers.

PubMed

Huang, Chih-Ling; Lin, Hsi-Hui; Wang, Hsiu-Hung

2010-10-01

This paper is a report of a study conducted to develop and test a theoretically derived Cigarette Dependence Questionnaire for adult male smokers. Fagerstrom questionnaires have been used worldwide to assess cigarette dependence. However, these assessments lack any theoretical perspective. A theory-based approach is needed to ensure valid assessment. In 2007, an initial pool of 103 Cigarette Dependence Questionnaire items was distributed to 109 adult smokers in Taiwan. Item analysis was conducted to select items for inclusion in the refined scale. The psychometric properties of the Cigarette Dependence Questionnaire were further evaluated 2007-08, when it was administered to 256 respondents and their saliva was collected and analysed for cotinine levels. Criterion validity was established through the Pearson correlation between the scale and saliva cotinine levels. Exploratory factor analysis was used to test construct validity. Reliability was determined with Cronbach's alpha coefficient and a 2-week test-retest coefficient. The selection of 30 items for seven perspectives was based on item analysis. One factor accounting for 44.9% of the variance emerged from the factor analysis. The factor was named as cigarette dependence. Cigarette Dependence Questionnaire scores were statistically significantly correlated with saliva cotinine levels (r = 0.21, P = 0.01). Cronbach's alpha was 0.95 and test-retest reliability using an intra-class correlation was 0.92. The Cigarette Dependence Questionnaire showed sound reliability and validity and could be used by nurses to set up smoking cessation interventions based on assessment of cigarette dependence. © 2010 Blackwell Publishing Ltd.
Day to Day Variability and Reliability of Blood Oxidative Stress Markers within a Four-Week Period in Healthy Young Men.

PubMed

Goldfarb, A H; Garten, R S; Waller, J; Labban, J D

2014-01-01

The present study aimed to determine the day to day variability and reliability of several blood oxidative stress markers at rest in a healthy young cohort over a four-week period. Twelve apparently healthy resistance trained males (24.6 ± 3.0 yrs) were tested over 7 visits within 4 weeks with at least 72 hrs between visits at the same time of day. Subjects rested 30 minutes prior to blood being obtained by vacutainer. Results. The highest IntraClass correlations (ICC's) were obtained for protein carbonyls (PC) and oxygen radical absorbance capacity (ORAC) (PC = 0.785 and ORAC = 0.780). Cronbach's α reliability score for PC was 0.967 and for ORAC was 0.961. The ICC's for GSH, GSSG, and the GSSG/TGH ratio ICC were 0.600, 0.573, and 0.570, respectively, with Cronbach's α being 0.913, 0.904, and 0.903, respectively. Xanthine oxidase ICC was 0.163 and Cronbach's α was 0.538. Conclusions. PC and ORAC demonstrated good to excellent reliability while glutathione factors had poor to excellent reliability. Xanthine oxidase showed poor reliability and high variability. These results suggest that the PC and ORAC markers were the most stable and reliable oxidative stress markers in blood and that daily changes across visits should be considered when interpreting resting blood oxidative stress markers.
Validation of the Chinese translation of the 6-item De Jong Gierveld Loneliness Scale in elderly Chinese.

PubMed

Leung, Grace Tak Yu; de Jong Gierveld, Jenny; Lam, Linda Chiu Wa

2008-12-01

This study aims to develop and validate a Chinese translation of the 6-item De Jong Gierveld Loneliness Scale - a widely used instrument to measure loneliness - specifically determining its psychometric properties in community-dwelling Chinese elders in Hong Kong. The relationships between loneliness and several clinical variables are also assessed. The English version was translated into Chinese. Content validity was established by group discussion and Delphi panel. The questionnaire was administered to 103 Hong Kong Chinese community-dwelling elders. Statistical analysis was performed to test the reliability and validity of the scale. The content validity was high as shown by the results of the Delphi panel. Cronbach's alpha of the 6-item scale was 0.76. For the inter-rater reliability of the six items, the intra-class correlation coefficients ranged from 0.98 to 1.00. The emotional loneliness subscale significantly correlated with the social loneliness subscale (rho = 0.37; p < 0.001). In using a direct question to measure loneliness, 21 participants (20%) reported that they were lonely. The overall loneliness scale score significantly correlated with the answers on the direct question of loneliness (rpb = 0.71; p < 0.001). The overall loneliness score showed significant correlation with Cornell Scale for Depression in Dementia (rho = 0.29; p = 0.003), current smoking status (rpb = 0.24; p = 0.014), and some objective social characteristics. The Chinese version of the 6-item De Jong Gierveld Loneliness Scale is a reliable and valid measure of loneliness in Chinese elders.
Validity and reliability of the Arabic version of the Household Food Insecurity Access Scale in rural Lebanon.

PubMed

Naja, Farah; Hwalla, Nahla; Fossian, Talar; Zebian, Dina; Nasreddine, Lara

2015-02-01

To assess the validity and reliability of the Arabic version of the Household Food Insecurity Access Scale (HFIAS) in rural Lebanon. A cross-sectional study on a sample of households with at least one child aged 0-2 years. In a one-to-one interview, participants completed an adapted Arabic version of the HFIAS. In order to evaluate the validity of the HFIAS, basic sociodemographic information, anthropometric measurements of the mother and child, and dietary intake data of the child were obtained. In order to examine reproducibility, the HFIAS was re-administered after 3 months. Rural Lebanon. Mother and child pairs (n 150). Factor analysis of HFIAS items revealed two factors: 'insufficient food quality' and 'insufficient food quantity'. Using Pearson's correlation, food insecurity was inversely associated with mother's and father's education levels, number of cars and electrical appliances in the household, income, weight-for-age and length-for-age of the child and the child's dietary adequacy. In contrast, mother's BMI and crowding index were positively associated with food insecurity scores (P < 0·05 for all correlations). Cronbach's α of the scale was 0·91. A moderate correlation was observed between the two administrations of the questionnaire (intra-class correlation = 0·58; P < 0·05). Our findings indicated that the adapted Arabic version of the HFIAS is a valid and reliable tool to assess food insecurity in rural Lebanon, lending further evidence to the utility of the HFIAS in assessing food insecurity in culturally diverse populations.
Reliability of the Berg Balance Scale as a Clinical Measure of Balance in Community-Dwelling Older Adults with Mild to Moderate Alzheimer Disease: A Pilot Study.

PubMed

Muir-Hunter, Susan W; Graham, Laura; Montero Odasso, Manuel

2015-08-01

To measure test-retest and interrater reliability of the Berg Balance Scale (BBS) in community-dwelling adults with mild to moderate Alzheimer disease (AD). Method : A sample of 15 adults (mean age 80.20 [SD 5.03] years) with AD performed three balance tests: the BBS, timed up-and-go test (TUG), and Functional Reach Test (FRT). Both relative reliability, using the intra-class correlation coefficient (ICC), and absolute reliability, using standard error of measurement (SEM) and minimal detectable change (MDC95) values, were calculated; Bland-Altman plots were constructed to evaluate inter-tester agreement. The test-retest interval was 1 week. Results : For the BBS, relative reliability values were 0.95 (95% CI, 0.85-0.98) for test-retest reliability and 0.72 (95% CI, 0.31-0.91) for interrater reliability; SEM was 6.01 points and MDC95 was 16.66 points; and interrater agreement was 16.62 points. The BBS performed better in test-retest reliability than the TUG and FRT, tests with established reliability in AD. Between 33% and 50% of participants required cueing beyond standardized instructions because they were unable to remember test instructions. Conclusions : The BBS achieved relative reliability values that support its clinical utility, but MDC95 and agreement values indicate the scale has performance limitations in AD. Further research to optimize balance assessment for people with AD is required.
Reproducibility of electronic tooth colour measurements.

PubMed

Ratzmann, Anja; Klinke, Thomas; Schwahn, Christian; Treichel, Anja; Gedrange, Tomasz

2008-10-01

Clinical methods of investigation, such as tooth colour determination, should be simple, quick and reproducible. The determination of tooth colours usually relies upon manual comparison of a patient's tooth colour with a colour ring. After some days, however, measurement results frequently lack unequivocal reproducibility. This study aimed to examine an electronic method for reliable colour measurement. The colours of the teeth 14 to 24 were determined by three different examiners in 10 subjects using the colour measuring device Shade Inspector. In total, 12 measurements per tooth were taken. Two measurement time points were scheduled to be taken, namely at study onset (T(1)) and after 6 months (T(2)). At either time point, two measurement series per subject were taken by the different examiners at 2-week intervals. The inter-examiner and intra-examiner agreement of the measurement results was assessed. The concordance for lightness and colour intensity (saturation) was represented by the intra-class correlation coefficient. The categorical variable colour shade (hue) was assessed using the kappa statistic. The study results show that tooth colour can be measured independently of the examiner. Good agreement was found between the examiners.
Intra- and Interobserver Reliability of Three Classification Systems for Hallux Rigidus.

PubMed

Dillard, Sarita; Schilero, Christina; Chiang, Sharon; Pham, Peter

2018-04-18

There are over ten classification systems currently used in the staging of hallux rigidus. This results in confusion and inconsistency with radiographic interpretation and treatment. The reliability of hallux rigidus classification systems has not yet been tested. The purpose of this study was to evaluate intra- and interobserver reliability using three commonly used classifications for hallux rigidus. Twenty-one plain radiograph sets were presented to ten ACFAS board-certified foot and ankle surgeons. Each physician classified each radiograph based on clinical experience and knowledge according to the Regnauld, Roukis, and Hattrup and Johnson classification systems. The two-way mixed single-measure consistency intraclass correlation was used to calculate intra- and interrater reliability. The intrarater reliability of individual sets for the Roukis and Hattrup and Johnson classification systems was "fair to good" (Roukis, 0.62±0.19; Hattrup and Johnson, 0.62±0.28), whereas the intrarater reliability of individual sets for the Regnauld system bordered between "fair to good" and "poor" (0.43±0.24). The interrater reliability of the mean classification was "excellent" for all three classification systems. Conclusions Reliable and reproducible classification systems are essential for treatment and prognostic implications in hallux rigidus. In our study, Roukis classification system had the best intrarater reliability. Although there are various classification systems for hallux rigidus, our results indicate that all three of these classification systems show reliability and reproducibility.
The reliability of a simplified water displacement instrument: a method for measuring arm volume.

PubMed

Sagen, Ase; Kåresen, Rolf; Risberg, May Arna

2005-01-01

To present a new water displacement measurement, the Simplified Water Displacement Instrument (SWDI), and to evaluate its intra- and intertester reliability. Reliability design. Hospital setting. Fifty-six healthy people were studied. Intratester reliability was evaluated once a week for 4 weeks in 20 women and 10 men. Intertester reliability was assessed by 2 physical therapists in 26 people. Not applicable. Coefficients of variation (CVs) and intraclass correlation coefficients (ICCs). The intratester reliability showed a CV range of 2.2% to 2.6% and an ICC range of .98 to .99. The intertester reliability showed a CV of 1.3% and an ICC of .99. There was a significant increase in arm volume in men compared with women. There were no significant differences in changes in volume over the 4 weeks. There was a significant greater right arm volume (3.3%) among the right-handed subjects (P<.001). Both intra- and intertester reliability were satisfactory for the SWDI.
Characterising the reproducibility and reliability of dietary patterns among Yup'ik Alaska Native people.

PubMed

Ryman, Tove K; Boyer, Bert B; Hopkins, Scarlett; Philip, Jacques; O'Brien, Diane; Thummel, Kenneth; Austin, Melissa A

2015-02-28

FFQ data can be used to characterise dietary patterns for diet-disease association studies. In the present study, we evaluated three previously defined dietary patterns--'subsistence foods', market-based 'processed foods' and 'fruits and vegetables'--among a sample of Yup'ik people from Southwest Alaska. We tested the reproducibility and reliability of the dietary patterns, as well as the associations of these patterns with dietary biomarkers and participant characteristics. We analysed data from adult study participants who completed at least one FFQ with the Center for Alaska Native Health Research 9/2009-5/2013. To test the reproducibility of the dietary patterns, we conducted a confirmatory factor analysis (CFA) of a hypothesised model using eighteen food items to measure the dietary patterns (n 272). To test the reliability of the dietary patterns, we used the CFA to measure composite reliability (n 272) and intra-class correlation coefficients for test-retest reliability (n 113). Finally, to test the associations, we used linear regression (n 637). All factor loadings, except one, in CFA indicated acceptable correlations between foods and dietary patterns (r>0·40), and model-fit criteria were >0·90. Composite and test-retest reliability of the dietary patterns were, respectively, 0·56 and 0·34 for 'subsistence foods', 0·73 and 0·66 for 'processed foods', and 0·72 and 0·54 for 'fruits and vegetables'. In the multi-predictor analysis, the dietary patterns were significantly associated with dietary biomarkers, community location, age, sex and self-reported lifestyle. This analysis confirmed the reproducibility and reliability of the dietary patterns in the present study population. These dietary patterns can be used for future research and development of dietary interventions in this underserved population.
Can a bronchoscopist reliably assess a patient's experience of bronchoscopy?

PubMed Central

Hadzri, HM; Azarisman, SMS; Fauzi, ARM; Roslan, H; Roslina, AM; Adina, ATN; Fauzi, MA

2010-01-01

Objectives Bronchoscopy is an essential investigative tool in many respiratory complaints. The procedure can be unpleasant for both bronchoscopists and patients. To the best of our knowledge, there are only a few studies that correlate the bronchoscopist's satisfaction with that of the patient's during bronchoscopy. The aim of our study is to assess whether or not a bronchoscopist could reliably assess a patient's satisfaction during bronchoscopy. Design Cross-sectional, observational study with convenience sampling. Setting Patients attending flexible fibreoptic bronchoscopy appointments at the bronchoscopy suite, Respiratory Unit, Universiti Kebangsaan Malaysia Medical Centre (UKMMC), Cheras, Kuala Lumpur, Malaysia between March and September 2006. Participants Sixty patients undergoing bronchoscopy over a 6-month period completed a questionnaire after the procedure. All patients received standard pre-medication with intravenous midazolam. Main outcome measures Bronchoscopists and patients rated the level of satisfaction of the procedure using a 10 cm visual analogue scale (VAS). Lower scores indicated better satisfaction or less discomfort. Patients and bronchoscopists also rated coughing, choking and vomiting perception using the same 10 cm VAS. Reliability analysis (intra-class correlation coefficient [ICC]) was used to analyse the correlation between patients' and bronchoscopists' VAS scores. Results All 60 patients answered the questionnaire. The median overall satisfaction scored by bronchoscopists was 2.2 (2.0) with a non-significant (p = 0.880) trend to a better median overall satisfaction of 1.9 (2.3) scored by patients. The VAS scores for cough sensation were 1.9 (2.7) and 1.5 (5.0), respectively. There was positive correlation between bronchoscopists' and patients' VAS scores for coughing sensation (p = 0.047, ICC = 0.233). No significant correlation for overall satisfaction, vomiting sensation and choking sensation was found. Conclusion Positive correlation for cough perception suggested that the bronchoscopist could reliably assess the degree of cough discomfort patients experience during bronchoscopy. PMID:21103127
The validity and reliability of the Persian version of the Revised Fibromyalgia Impact Questionnaire.

PubMed

Ghavidel Parsa, Banafsheh; Amir Maafi, Alireza; Haghdoost, Afrooz; Arabi, Yasaman; Khojamli, Monire; Chatrnour, Gelayol; Bidari, Ali

2014-02-01

The Revised Fibromyalgia Impact Questionnaire (FIQR), an updated version of the Fibromyalgia Impact Questionnaire (FIQ) achieved a better balance among different domains (i.e., function, overall impact, and symptom severity) and attempts to address the limitations of FIQ. As there is no Persian version of the FIQR available, we aimed to investigate the validity and reliability of a Persian translation of the FIQR in Iranian patients. After translating the FIQR into Persian, it was administered to 77 female patients with fibromyalgia syndrome. All of the patients filled out the questionnaire together with a Persian version of the FIQ, short form-12 (SF-12). The tender-point count was also calculated. One week later, FM patients filled out the Persian FIQR at their second visit. Reliability was analyzed by internal consistency and reproducibility including Cronbach's α coefficient and intra-class correlation coefficient. Construct validity was evaluated by Spearman's correlation coefficient and Pearson's correlation coefficient. Statistical analysis was performed using SPSS for Windows version 17.0. All patients included in this study were female, and the mean age was 38.23 ± 10.68 years. The total scores of the FIQR and FIQ were 49.77 ± 18.27 and 54.05 ± 14.00 that were closely correlated (r = 0.63, p < 0.01), and each of the three domains of the Persian FIQR was also correlated well with the three related FIQ domains (r = 0.36-0.63, p < 0.01). Also some significant inverse correlations of FIQR with quality-of-life (assessed by SF-12) domains and items were found. Cronbach's α was 0.87 for FIQR in the first visit. The Persian FIQR showed adequate reliability and validity. This instrument can be used in the clinical evaluation of Iranian patients with fibromyalgia.
The reliability of the Hendrich Fall Risk Model in a geriatric hospital.

PubMed

Heinze, Cornelia; Halfens, Ruud; Dassen, Theo

2008-12-01

Aims and objectives. The purpose of this study was to test the interrater reliability of the Hendrich Fall Risk Model, an instrument to identify patients in a hospital setting with a high risk of falling. Background. Falls are a serious problem in older patients. Valid and reliable fall risk assessment tools are required to identify high-risk patients and to take adequate preventive measures. Methods. Seventy older patients were independently and simultaneously assessed by six pairs of raters made up of nursing staff members. Consensus estimates were calculated using simple percentage agreement and consistency estimates using Spearman's rho and intra class coefficient. Results. Percentage agreement ranged from 0.70 to 0.92 between the six pairs of raters. Spearman's rho coefficients were between 0.54 and 0.80 and the intra class coefficients were between 0.46 and 0.92. Conclusions. Whereas some pairs of raters obtained considerable interobserver agreement and internal consistency, the others did not. Therefore, it is concluded that the Hendrich Fall Risk Model is not a reliable instrument. The use of more unambiguous operationalized items is preferred. Relevance to clinical practice. In practice, well operationalized fall risk assessment tools are necessary. Observer agreement should always be investigated after introducing a standardized measurement tool. © 2008 The Authors. Journal compilation © 2008 Blackwell Publishing Ltd.
Radiological findings for hip dysplasia at skeletal maturity. Validation of digital and manual measurement techniques.

PubMed

Engesæter, Ingvild Øvstebø; Laborie, Lene Bjerke; Lehmann, Trude Gundersen; Sera, Francesco; Fevang, Jonas; Pedersen, Douglas; Morcuende, José; Lie, Stein Atle; Engesæter, Lars Birger; Rosendahl, Karen

2012-07-01

To report on intra-observer, inter-observer, and inter-method reliability and agreement for radiological measurements used in the diagnosis of hip dysplasia at skeletal maturity, as obtained by a manual and a digital measurement technique. Pelvic radiographs from 95 participants (56 females) in a follow-up hip study of 18- to 19-year-old patients were included. Eleven radiological measurements relevant for hip dysplasia (Sharp's, Wiberg's, and Ogata's angles; acetabular roof angle of Tönnis; articulo-trochanteric distance; acetabular depth-width ratio; femoral head extrusion index; maximum teardrop width; and the joint space width in three different locations) were validated. Three observers measured the radiographs using both a digital measurement program and manually in AgfaWeb1000. Inter-method and inter- and intra-observer agreement were analyzed using the mean differences between the readings/readers, establishing the 95% limits of agreement. We also calculated the minimum detectable change and the intra-class correlation coefficient. Large variations among different radiological measurements were demonstrated. However, the variation was not related to the use of either the manual or digital measurement technique. For measurements with greater absolute values (Sharp's angle, femoral head extrusion index, and acetabular depth-width ratio) the inter- and intra-observer and inter-method agreements were better as compared to measurements with lower absolute values (acetabular roof angle, teardrop and joint space width). The inter- and intra-observer variation differs notably across different radiological measurements relevant for hip dysplasia at skeletal maturity, a fact that should be taken into account in clinical practice. The agreement between the manual and digital methods is good.

Reliability and criterion validity of two applications of the iPhone™ to measure cervical range of motion in healthy participants

PubMed Central

2013-01-01

Summary of background data Recent smartphones, such as the iPhone, are often equipped with an accelerometer and magnetometer, which, through software applications, can perform various inclinometric functions. Although these applications are intended for recreational use, they have the potential to measure and quantify range of motion. The purpose of this study was to estimate the intra and inter-rater reliability as well as the criterion validity of the clinometer and compass applications of the iPhone in the assessment cervical range of motion in healthy participants. Methods The sample consisted of 28 healthy participants. Two examiners measured cervical range of motion of each participant twice using the iPhone (for the estimation of intra and inter-reliability) and once with the CROM (for the estimation of criterion validity). Estimates of reliability and validity were then established using the intraclass correlation coefficient (ICC). Results We observed a moderate intra-rater reliability for each movement (ICC = 0.65-0.85) but a poor inter-rater reliability (ICC < 0.60). For the criterion validity, the ICCs are moderate (>0.50) to good (>0.65) for movements of flexion, extension, lateral flexions and right rotation, but poor (<0.50) for the movement left rotation. Conclusion We found good intra-rater reliability and lower inter-rater reliability. When compared to the gold standard, these applications showed moderate to good validity. However, before using the iPhone as an outcome measure in clinical settings, studies should be done on patients presenting with cervical problems. PMID:23829201
Accuracy of templating the acetabular cup size in Total Hip Replacement using conventional acetate templates on digital radiographs.

PubMed

Krishnamoorthy, Vignesh P; Perumal, Rajamani; Daniel, Alfred J; Poonnoose, Pradeep M

2015-12-01

Templating of the acetabular cup size in Total Hip Replacement (THR) is normally done using conventional radiographs. As these are being replaced by digital radiographs, it has become essential to create a technique of templating using digital films. We describe a technique that involves templating the digital films using the universally available acetate templates for THR without the use of special software. Preoperative digital radiographs of the pelvis were taken with a 30 mm diameter spherical metal ball strapped over the greater trochanter. Using standard acetate templates provided by the implant company on magnified digital radiographs, the size of the metal ball (X mm) and acetabular cup (Y mm) were determined. The size of the acetabular cup to be implanted was estimated using the formula 30*Y/X. The estimated size was compared with the actual size of the cup used at surgery. Using this technique, it was possible to accurately predict the acetabular cup size in 28/40 (70%) of the hips. When the accuracy to within one size was considered, templating was correct in 90% (36/40). When assessed by two independent observers, there was good intra-observer and inter-observer reliability with intra-class correlation coefficient values greater than 0.8. It was possible to accurately and reliably predict the size of the acetabular cup, using acetate templates on digital films, without any digital templates.
The Pareidolia Test: A Simple Neuropsychological Test Measuring Visual Hallucination-Like Illusions

PubMed Central

Mamiya, Yasuyuki; Nishio, Yoshiyuki; Watanabe, Hiroyuki; Yokoi, Kayoko; Uchiyama, Makoto; Baba, Toru; Iizuka, Osamu; Kanno, Shigenori; Kamimura, Naoto; Kazui, Hiroaki; Hashimoto, Mamoru; Ikeda, Manabu; Takeshita, Chieko; Shimomura, Tatsuo; Mori, Etsuro

2016-01-01

Background Visual hallucinations are a core clinical feature of dementia with Lewy bodies (DLB), and this symptom is important in the differential diagnosis and prediction of treatment response. The pareidolia test is a tool that evokes visual hallucination-like illusions, and these illusions may be a surrogate marker of visual hallucinations in DLB. We created a simplified version of the pareidolia test and examined its validity and reliability to establish the clinical utility of this test. Methods The pareidolia test was administered to 52 patients with DLB, 52 patients with Alzheimer’s disease (AD) and 20 healthy controls (HCs). We assessed the test-retest/inter-rater reliability using the intra-class correlation coefficient (ICC) and the concurrent validity using the Neuropsychiatric Inventory (NPI) hallucinations score as a reference. A receiver operating characteristic (ROC) analysis was used to evaluate the sensitivity and specificity of the pareidolia test to differentiate DLB from AD and HCs. Results The pareidolia test required approximately 15 minutes to administer, exhibited good test-retest/inter-rater reliability (ICC of 0.82), and moderately correlated with the NPI hallucinations score (rs = 0.42). Using an optimal cut-off score set according to the ROC analysis, and the pareidolia test differentiated DLB from AD with a sensitivity of 81% and a specificity of 92%. Conclusions Our study suggests that the simplified version of the pareidolia test is a valid and reliable surrogate marker of visual hallucinations in DLB. PMID:27171377
Validity and Reliability of Gait and Postural Control Analysis Using the Tri-axial Accelerometer of the iPod Touch.

PubMed

Kosse, Nienke M; Caljouw, Simone; Vervoort, Danique; Vuillerme, Nicolas; Lamoth, Claudine J C

2015-08-01

Accelerometer-based assessments can identify elderly with an increased fall risk and monitor interventions. Smart devices, like the iPod Touch, with built-in accelerometers are promising for clinical gait and posture assessments due to easy use and cost-effectiveness. The aim of the present study was to establish the validity and reliability of the iPod Touch for gait and posture assessment. Sixty healthy participants (aged 18-75 years) were measured with an iPod Touch and stand-alone accelerometer while they walked under single- and dual-task conditions, and while standing in parallel and semi-tandem stance with eyes open, eyes closed and when performing a dual task. Cross-correlation values (CCV) showed high correspondence of anterior-posterior and medio-lateral signal patterns (CCV's ≥ 0.88). Validity of gait parameters (foot contacts, index of harmonicity, and amplitude variability) and standing posture parameters [root mean square of accelerations, median power frequency (MPF) and sway area] as indicated by intra-class correlation (ICC) was high (ICC = 0.85-0.99) and test-retest reliability was good (ICC = 0.81-0.97), except for MPF (ICC = 0.59-0.87). Overall, the iPod Touch obtained valid and reliable measures of gait and postural control in healthy adults of all ages under different conditions. Additionally, smart devices have the potential to be used for clinical gait and posture assessments.
Team performance in resuscitation teams: Comparison and critique of two recently developed scoring tools☆

PubMed Central

McKay, Anthony; Walker, Susanna T.; Brett, Stephen J.; Vincent, Charles; Sevdalis, Nick

2012-01-01

Background and aim Following high profile errors resulting in patient harm and attracting negative publicity, the healthcare sector has begun to focus on training non-technical teamworking skills as one way of reducing the rate of adverse events. Within the area of resuscitation, two tools have been developed recently aiming to assess these skills – TEAM and OSCAR. The aims of the study reported here were:1.To determine the inter-rater reliability of the tools in assessing performance within the context of resuscitation.2.To correlate scores of the same resuscitation teams episodes using both tools, thereby determining their concurrent validity within the context of resuscitation.3.To carry out a critique of both tools and establish how best each one may be utilised. Methods The study consisted of two phases – reliability assessment; and content comparison, and correlation. Assessments were made by two resuscitation experts, who watched 24 pre-recorded resuscitation simulations, and independently rated team behaviours using both tools. The tools were critically appraised, and correlation between overall score surrogates was assessed. Results Both OSCAR and TEAM achieved high levels of inter-rater reliability (in the form of adequate intra-class coefficients) and minor significant differences between Wilcoxon tests. Comparison of the scores from both tools demonstrated a high degree of correlation (and hence concurrent validity). Finally, critique of each tool highlighted differences in length and complexity. Conclusion Both OSCAR and TEAM can be used to assess resuscitation teams in a simulated environment, with the tools correlating well with one another. We envisage a role for both tools – with TEAM giving a quick, global assessment of the team, but OSCAR enabling more detailed breakdown of the assessment, facilitating feedback, and identifying areas of weakness for future training. PMID:22561464
The Development and Validation of a Generic Instrument, QoDoS, for Assessing the Quality of Decision Making.

PubMed

Donelan, Ronan; Walker, Stuart; Salek, Sam

2016-01-01

The impact of decision-making during the development and the regulatory review of medicines greatly influences the delivery of new medicinal products. Currently, there is no generic instrument that can be used to assess the quality of decision-making. This study describes the development of the Quality of Decision-Making Orientation Scheme QoDoS(©) instrument for appraising the quality of decision-making. Semi-structured interviews about decision-making were carried out with 29 senior decision makers from the pharmaceutical industry (10), regulatory authorities (9) and contract research organizations (10). The interviews offered a qualified understanding of the subjective decision-making approach, influences, behaviors and other factors that impact such processes for individuals and organizations involved in the delivery of new medicines. Thematic analysis of the transcribed interviews was carried out using NVivo8® software. Content validity was carried out using qualitative and quantitative data by an expert panel, which led to the developmental version of the QoDoS. Further psychometric evaluations were performed, including factor analysis, item reduction, reliability testing and construct validation. The thematic analysis of the interviews yielded a 94-item initial version of the QoDoS(©) with a 5-point Likert scale. The instrument was tested for content validity using a panel of experts for language clarity, completeness, relevance and scaling, resulting in a favorable agreement by panel members with an intra-class correlation coefficient value of 0.89 (95% confidence interval = 0.56, 0.99). A 76-item QoDoS(©) (version 2) emerged from content validation. Factor analysis produced a 47-item measure with four domains. The 47-item QoDoS(©) (version 3) showed high internal consistency (n = 120, Cronbach's alpha = 0.89), high reproducibility (n = 20, intra-class correlation = 0.77) and a mean completion time of 10 min. Reliability testing and construct validation was successfully performed. The QoDoS(©) is both reliable and valid for use. It has the potential for extensive use in medicines development by both the pharmaceutical industry and regulatory authorities. The QoDoS(©) can be used to assess the quality of decision-making and to inform decision makers of the factors that influence decision-making.
Mammography image quality and evidence based practice: Analysis of the demonstration of the inframammary angle in the digital setting.

PubMed

Spuur, Kelly; Webb, Jodi; Poulos, Ann; Nielsen, Sharon; Robinson, Wayne

2018-03-01

The aim of this study is to determine the clinical rates of the demonstration of the inframammary angle (IMA) on the mediolateral oblique (MLO) view of the breast on digital mammograms and to compare the outcomes with current accreditation standards for compliance. Relationships between the IMA, age, the posterior nipple line (PNL) and compressed breast thickness will be identified and the study outcomes validated using appropriate analyses of inter-reader and inter-rater reliability and variability. Differences in left versus right data were also investigated. A quantitative retrospective study of 2270 randomly selected paired digital mammograms performed by BreastScreen NSW was undertaken. Data was collected by direct measurement and visual analysis. Intra-class correlation analyses were used to evaluate inter- and intra-rater reliability. The IMA was demonstrated on 52.4% of individual and 42.6% of paired mammograms. A linear relationship was found between the posterior nipple line (PNL) and age (p-value <0.001). The PNL was predicted to increase by 0.48 mm for every one year increment in age. The odds of demonstrating the IMA reduced by 2% for every one year increase in age (p-value = 0.001); are 0.4% higher for every 1 mm increase in PNL (p-value = 0.001) and 1.6% lower for every 1 mm increase in compressed breast thickness, (p-value<0.001). There was high inter- and intra-rater reliability for the PNL while there was 100% agreement for the demonstration of the IMA. Analysis of the demonstration of the IMA indicates clinically achievable rates (42.6%) well below that required for compliance (50%-75%) to known worldwide accreditation standards for screening mammography. These standards should be aligned to the reported evidence base. Visualisation of the IMA is impacted negatively by increasing age and compressed breast thickness but positively by breast size (PNL). Copyright © 2018 Elsevier B.V. All rights reserved.
The push-off test: development of a simple, reliable test of upper extremity weight-bearing capability.

PubMed

Vincent, Joshua I; MacDermid, Joy C; Michlovitz, Susan L; Rafuse, Richard; Wells-Rowsell, Christina; Wong, Owen; Bisbee, Leslie

2014-01-01

Longitudinal clinical measurement study. The push-off test (POT) is a novel and simple measure of upper extremity weight-bearing that can be measured with a grip dynamometer. There are no published studies on the validity and reliability of the POT. The relationship between upper extremity self-report activity/participation and impairment measures remain an unexplored realm. The primary purpose of this study is to estimate the intra and inter-rater reliability and construct validity of the POT. The secondary purpose is to estimate the relationship between upper extremity self-report activity/participation questionnaires and impairment measures. A convenience sample of 22 patients with wrist or elbow injuries were tested for POT, wrist/elbow range of motion (ROM), isometric wrist extension strength (WES) and grip strength; and completed two self-report activity/participation questionnaires: Disability of the Arm, Shoulder and the Hand (DASH) and Work Limitations Questionnaire (WLQ-26). POT's inter and intra-rater reliability and construct validity was tested. Pearson's correlations were run between the impairment measures and self-report questionnaires to look into the relationship amongst them. The POT demonstrated high inter-rater reliability (ICC affected = 0.97; 95% C.I. 0.93-0.99; ICC unaffected = 0.85; 95% C.I. 0.68-0.94) and intra-rater reliability (ICC affected = 0.96; 95% C.I. 0.92-0.97; ICC unaffected = 0.92; 95% C.I. 0.85-0.97). The POT was correlated moderately with the DASH (r = -0.47; p = 0.03). While examining the relationship between upper extremity self-reported activity/participation questionnaires and impairment measures the strongest correlation was between the DASH and the POT (r = -0.47; p = 0.03) and none of the correlations with the other physical impairment measures reached significance. At-work disability demonstrated insignificant correlations with physical impairments. The POT test provides a reliable and easily administered quantitative measure of ability to bear the load through an injured arm. Preliminary evidence supports a moderate relationship between loading bearing measured by the POT and upper extremity function measured by the DASH. 1b. Copyright © 2014 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
Reliability of joint count assessment in rheumatoid arthritis: a systematic literature review.

PubMed

Cheung, Peter P; Gossec, Laure; Mak, Anselm; March, Lyn

2014-06-01

Joint counts are central to the assessment of rheumatoid arthritis (RA) but reliability is an issue. To evaluate the reliability and agreement of joint counts (intra-observer and inter-observer) by health care professionals (physicians, nurses, and metrologists) and patients in RA, and the impact of training and standardization on joint count reliability through a systematic literature review. Articles reporting joint count reliability or agreement in RA in PubMed, EMBase, and the Cochrane library between 1960 and 2012 were selected. Data were extracted regarding tender joint counts (TJCs) and swollen joint counts (SJCs) derived by physicians, metrologists, or patients for intra-observer and inter-observer reliability. In addition, methods and effects of training or standardization were extracted. Statistics expressing reliability such as intraclass correlation coefficients (ICCs) were extracted. Data analysis was primarily descriptive due to high heterogeneity. Twenty-eight studies on health care professionals (HCP) and 20 studies on patients were included. Intra-observer reliability for TJCs and SJCs was good for HCPs and patients (range of ICC: 0.49-0.98). Inter-observer reliability between HCPs for TJCs was higher than for SJCs (range of ICC: 0.64-0.88 vs. 0.29-0.98). Patient inter-observer reliability with HCPs as comparators was better for TJCs (range of ICC: 0.31-0.91) compared to SJCs (0.16-0.64). Nine studies (7 with HCPs and 2 with patients) evaluated consensus or training, with improvement in reliability of TJCs but conflicting evidence for SJCs. Intra- and inter-observer reliability was high for TJCs for HCPs and patients: among all groups, reliability was better for TJCs than SJCs. Inter-observer reliability of SJCs was poorer for patients than HCPs. Data were inconclusive regarding the potential for training to improve SJC reliability. Overall, the results support further evaluation for patient-reported joint counts as an outcome measure. © 2013 Published by Elsevier Inc.
Reliability of handheld dynamometry in assessment of hip strength in adult male football players.

PubMed

Fulcher, Mark L; Hanna, Chris M; Raina Elley, C

2010-01-01

The aim of this study was to evaluate the intra- and interrater reliability of handheld dynamometry (HHD) for measuring hip muscle strength in a sample of 30 healthy semi-professional adult male football players. The reliability of HHD had not been assessed in athletes who were likely to be stronger than populations tested previously. Maximal isometric strength of resisted hip flexion and adduction were measured. Mean strength ranged from 51.5 kg for dominant hip flexion to 26.7 kg for hip adduction at 90 degrees of hip flexion. Intrarater reliability intraclass correlation coefficients (ICCs) ranged from 0.70 to 0.89. ICCs for interrater reliability ranged from 0.66 to 0.87. As expected, muscle strength in this group of athletes was significantly higher than that of populations in which HHD reliability has been assessed. Despite this, muscle strength testing of hip flexor and adductor muscles can be performed with good to excellent intra- and interrater reliability in this population. Copyright (c) 2009. Published by Elsevier Ltd.
Validation of personal digital photography to assess dietary quality among people with intellectual disabilities.

PubMed

Elinder, L S; Brunosson, A; Bergström, H; Hagströmer, M; Patterson, E

2012-02-01

Dietary assessment is a challenge in general, and specifically in individuals with intellectual disabilities (ID). This study aimed to evaluate personal digital photography as a method of assessing different aspects of dietary quality in this target group. Eighteen adults with ID were recruited from community residences and activity centres in Stockholm County. Participants were instructed to photograph all foods and beverages consumed during 1 day, while observed. Photographs were coded by two raters. Observations and photographs of meal frequency, intake occasions of four specific food and beverage items, meal quality and dietary diversity were compared. Evaluation of inter-rater reliability and validity of the method was performed by intra-class correlation analysis. With reminders from staff, 85% of all observed eating or drinking occasions were photographed. The inter-rater reliability was excellent for all assessed variables (ICC ≥ 0.88), except for meal quality where ICC was 0.66. The correlations between items assessed in photos and observations were strong to almost perfect with ICC values ranging from 0.71 to 0.92 and all were statistically significant. Personal digital photography appears to be a feasible, reliable and valid method for assessing dietary quality in people with mild to moderate ID, who have daily staff support. © 2011 The Authors. Journal of Intellectual Disability Research © 2011 Blackwell Publishing Ltd.
Translation, cultural adaption, and test-retest reliability of Chinese versions of the Edinburgh Handedness Inventory and Waterloo Footedness Questionnaire.

PubMed

Yang, Nan; Waddington, Gordon; Adams, Roger; Han, Jia

2018-05-01

Quantitative assessments of handedness and footedness are often required in studies of human cognition and behaviour, yet no reliable Chinese versions of commonly used handedness and footedness questionnaires are available. Accordingly, the objective of the present study was to translate the Edinburgh Handedness Inventory (EHI) and the Waterloo Footedness Questionnaire-Revised (WFQ-R) into Mandarin Chinese and to evaluate the reliability and validity of these translated versions in healthy Chinese people. In the first stage of the study, Chinese versions of the EHI and WFQ-R were produced from a process of translation, back translation and examination, with necessary cultural adaptations. The second stage involved determining the reliability and validity of the translated EHI and WFQ-R for the Chinese population. One hundred and ten Chinese participants were tested online, and the results showed that the Cronbach's alpha coefficient of internal consistency was 0.877 for the translated EHI and 0.855 for the translated WFQ-R. Another 170 Chinese participants were tested and re-tested after a 30-day interval. The intra-class correlation coefficients showed high reliability, 0.898 for the translated EHI and 0.869 for the translated WFQ-R. This preliminary validation study found the translated versions to be reliable and valid tools for assessing handedness and footedness in this population.
Validity and reliability of a Malay version of the Lawton instrumental activities of daily living scale among the Malay speaking elderly in Malaysia.

PubMed

Kadar, Masne; Ibrahim, Suhaili; Razaob, Nor Afifi; Chai, Siaw Chui; Harun, Dzalani

2018-02-01

The Lawton Instrumental Activities of Daily Living Scale is a tool often used to assess independence among elderly at home. Its suitability to be used with the elderly population in Malaysia has not been validated. This current study aimed to assess the validity and reliability of the Lawton Instrumental Activities of Daily Living Scale - Malay Version to Malay speaking elderly in Malaysia. This study was divided into three phases: (1) translation and linguistic validity involving both forward and backward translations; (2) establishment of face validity and content validity; and (3) establishment of reliability involving inter-rater, test-retest and internal consistency analyses. Data used for these analyses were obtained by interviewing 65 elderly respondents. Percentages of Content Validity Index for 4 criteria were from 88.89 to 100.0. The Cronbach α coefficient for internal consistency was 0.838. Intra-class Correlation Coefficient of inter-rater reliability and test-retest reliability was 0.957 and 0.950 respectively. The result shows that the Lawton Instrumental Activities of Daily Living Scale - Malay Version has excellent reliability and validity for use with the Malay speaking elderly people in Malaysia. This scale could be used by professionals to assess functional ability of elderly who live independently in community. © 2018 Occupational Therapy Australia.
The reliability of a modified Kalamazoo Consensus Statement Checklist for assessing the communication skills of multidisciplinary clinicians in the simulated environment.

PubMed

Peterson, Eleanor B; Calhoun, Aaron W; Rider, Elizabeth A

2014-09-01

With increased recognition of the importance of sound communication skills and communication skills education, reliable assessment tools are essential. This study reports on the psychometric properties of an assessment tool based on the Kalamazoo Consensus Statement Essential Elements Communication Checklist. The Gap-Kalamazoo Communication Skills Assessment Form (GKCSAF), a modified version of an existing communication skills assessment tool, the Kalamazoo Essential Elements Communication Checklist-Adapted, was used to assess learners in a multidisciplinary, simulation-based communication skills educational program using multiple raters. 118 simulated conversations were available for analysis. Internal consistency and inter-rater reliability were determined by calculating a Cronbach's alpha score and intra-class correlation coefficients (ICC), respectively. The GKCSAF demonstrated high internal consistency with a Cronbach's alpha score of 0.844 (faculty raters) and 0.880 (peer observer raters), and high inter-rater reliability with an ICC of 0.830 (faculty raters) and 0.89 (peer observer raters). The Gap-Kalamazoo Communication Skills Assessment Form is a reliable method of assessing the communication skills of multidisciplinary learners using multi-rater methods within the learning environment. The Gap-Kalamazoo Communication Skills Assessment Form can be used by educational programs that wish to implement a reliable assessment and feedback system for a variety of learners. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Reliability analysis for radiographic measures of lumbar lordosis in adult scoliosis: a case–control study comparing 6 methods

PubMed Central

Hong, Jae Young; Modi, Hitesh N.; Hur, Chang Yong; Song, Hae Ryong; Park, Jong Hoon

2010-01-01

Several methods are used to measure lumbar lordosis. In adult scoliosis patients, the measurement is difficult due to degenerative changes in the vertebral endplate as well as the coronal and sagittal deformity. We did the observational study with three examiners to determine the reliability of six methods for measuring the global lumbar lordosis in adult scoliosis patients. Ninety lateral lumbar radiographs were collected for the study. The radiographs were divided into normal (Cobb < 10°), low-grade (Cobb 10°–19°), high-grade (Cobb ≥ 20°) group to determine the reliability of Cobb L1–S1, Cobb L1–L5, centroid, posterior tangent L1–S1, posterior tangent L1–L5 and TRALL method in adult scoliosis. The 90 lateral radiographs were measured twice by each of the three examiners using the six measurement methods. The data was analyzed to determine the inter- and intra-observer reliability. In general, for the six radiographic methods, the inter- and intra-class correlation coefficients (ICCs) were all ≥0.82. A comparison of the ICCs and 95% CI for the inter- and intra-observer reliability between the groups with varying degrees of scoliosis showed that, the reliability of the lordosis measurement decreased with increasing severity of scoliosis. In Cobb L1–S1, centroid and posterior tangent L1–S1 methods, the ICCs were relatively lower in the high-grade scoliosis group (≥0.60). And, the mean absolute difference (MAD) in these methods was high in the high-grade scoliosis group (≤7.17°). However, in the Cobb L1–L5 and posterior tangent L1–L5 method, the ICCs were ≥0.86 in all groups. And, in the TRALL method, the ICCs were ≥0.76 in all groups. In addition, in the Cobb L1–L5 and posterior tangent L1–L5 method, the MAD was ≤3.63°. And, in the TRALL method, the MAD was ≤3.84° in all groups. We concluded that the Cobb L1–L5 and the posterior tangent L1–L5 methods are reliable methods for measuring the global lumbar lordosis in adult scoliosis. And the TRALL method is more reliable method than other methods which include the L5–S1 joint in lordosis measurement. PMID:20437183
Reliability and validity of the Turkish version of the Berg Balance Scale.

PubMed

Sahin, Fusun; Yilmaz, Figen; Ozmaden, Asli; Kotevolu, Nurdan; Sahin, Tulay; Kuran, Banu

2008-01-01

The purpose of this study was to develop a Turkish version of the Berg Balance Scale (BBS) and assess its reliability and validity. Sixty healthy volunteers older than 65 years were included in to the study. Subjects who had lower extremity amputation, or were armchair or bedridden were excluded. After translation process, the Turkish version of the scale was administered to each participant twice with an interval of 2 weeks. The intraclass correlation coefficient (ICC) was calculated to assess intra- and inter-observer reliability. Chronbach alpha was calculated to evaluate internal consistency of the total BBS score. Interclass correlation coefficient was calcuated to examine test-retest reliability. Convergent validity was assessed by correlating the scale with Modified Barthel Index (MBI) and Timed Up and Go Test (TUG). Construct validity was assessed with factor analysis. The mean age in years of the participants were 77.00+/-5.67 (range: 67-92 yrs). The ICC for intra- and inter- observer reliability was 0.98 (p<0.0001) and 0.97 (p<0.0001), respectively. Chronbach alpha of the Turkish version of the BBS was 0.98. The test-retest reliability (ICC) of the Turkish version of the BBS was determined as 0.98 for the total score, and ranged from 0.86-0.99 for individual items. In terms of validity, the Turkish version of the BBS was correlated with the MBI (in positive direction) and TUG (in negative direction) (r=0.67 p<0.0001; r=-0.75 p<0.0001, respectively). The Turkish version of the BBS is a reliable and valid scale to be used in balance assessment of Turkish older adults.
Is computed tomography an accurate and reliable method for measuring total knee arthroplasty component rotation?

PubMed

Figueroa, José; Guarachi, Juan Pablo; Matas, José; Arnander, Magnus; Orrego, Mario

2016-04-01

Computed tomography (CT) is widely used to assess component rotation in patients with poor results after total knee arthroplasty (TKA). The purpose of this study was to simultaneously determine the accuracy and reliability of CT in measuring TKA component rotation. TKA components were implanted in dry-bone models and assigned to two groups. The first group (n = 7) had variable femoral component rotations, and the second group (n = 6) had variable tibial tray rotations. CT images were then used to assess component rotation. Accuracy of CT rotational assessment was determined by mean difference, in degrees, between implanted component rotation and CT-measured rotation. Intraclass correlation coefficient (ICC) was applied to determine intra-observer and inter-observer reliability. Femoral component accuracy showed a mean difference of 2.5° and the tibial tray a mean difference of 3.2°. There was good intra- and inter-observer reliability for both components, with a femoral ICC of 0.8 and 0.76, and tibial ICC of 0.68 and 0.65, respectively. CT rotational assessment accuracy can differ from true component rotation by approximately 3° for each component. It does, however, have good inter- and intra-observer reliability.
[Validation of a questionnaire to assess the quality of health information in Argentinian newspapers].

PubMed

Biondo, Emiliano; Khoury, Marina Claudia

2005-09-01

The daily press is an important source of health information and may influence health care utilization. However, medical reports published in newspapers from developed countries have shown a poor quality. The reliability of the questionnaire Index of Scientific Quality was evaluated by using it to measure the scientific quality of health information published by Argentinian newspapers between 2000 and 2002. It assessed the readability of the texts in grade levels and explored the relationship between quality and other factors. The Spanish adaptation of the instrument consisted in translation, back-traslation and pilot study. The reliability was assessed by applying the instrument to newspaper articles with more than 300 words that discussed therapy, diagnosis, prevention, lifestyle effects, and hazardous exposure. Two physicians independently graded 129 articles. Inter-observer and intra-observer concordance was quantified for each item with the intra-class correlation coefficient (CI95%). To measure scientific quality, a randomized sample of 210 articles was assessed. Each received a mark that ranged from 0 to 100. Readability was determined by the FRY graph method. The relationship between quality and other variables was explored with multiple linear regression analysis. The inter-rater concordance varied between 0.48 (0.34-0.61) and 0.67 (0.56-0.75). Intra-rater concordance varied from 0.51 (0.37-0.63) to 0.95 (0.93-0.96). The internal consistency (Cronbach's alpha) was 0.88. The quality-of-health-information was rated at 25 points (16.7, 33.3) [median (intercuartil range)]. The reading level was assessed to be at the 10.4 grade (10.2-10.6) [mean (CI 95%)]. Quality of the health information was greatly deficient; however, no specific factors were related with quality. Readability was probably a key barrier for access to the health information.
Anthropometric Study of Three-Dimensional Facial Morphology in Malay Adults

PubMed Central

Majawit, Lynnora Patrick; Mohd Razi, Roziana

2016-01-01

Objectives To establish the three-dimensional (3D) facial soft tissue morphology of adult Malaysian subjects of the Malay ethnic group; and to determine the morphological differences between the genders, using a non-invasive stereo-photogrammetry 3D camera. Material and Methods One hundred and nine subjects participated in this research, 54 Malay men and 55 Malay women, aged 20–30 years old with healthy BMI and with no adverse skeletal deviation. Twenty-three facial landmarks were identified on 3D facial images captured using a VECTRA M5-360 Head System (Canfield Scientific Inc, USA). Two angular, 3 ratio and 17 linear measurements were identified using Canfield Mirror imaging software. Intra- and inter-examiner reliability tests were carried out using 10 randomly selected images, analyzed using the intra-class correlation coefficient (ICC). Multivariate analysis of variance (MANOVA) was carried out to investigate morphologic differences between genders. Results ICC scores were generally good for both intra-examiner (range 0.827–0.987) and inter-examiner reliability (range 0.700–0.983) tests. Generally, all facial measurements were larger in men than women, except the facial profile angle which was larger in women. Clinically significant gender dimorphisms existed in biocular width, nose height, nasal bridge length, face height and lower face height values (mean difference > 3mm). Clinical significance was set at 3mm. Conclusion Facial soft tissue morphological values can be gathered efficiently and measured effectively from images captured by a non-invasive stereo-photogrammetry 3D camera. Adult men in Malaysia when compared to women had a wider distance between the eyes, a longer and more prominent nose and a longer face. PMID:27706220
Reliability of capturing foot parameters using digital scanning and the neutral suspension casting technique

PubMed Central

2011-01-01

Background A clinical study was conducted to determine the intra and inter-rater reliability of digital scanning and the neutral suspension casting technique to measure six foot parameters. The neutral suspension casting technique is a commonly utilised method for obtaining a negative impression of the foot prior to orthotic fabrication. Digital scanning offers an alternative to the traditional plaster of Paris techniques. Methods Twenty one healthy participants volunteered to take part in the study. Six casts and six digital scans were obtained from each participant by two raters of differing clinical experience. The foot parameters chosen for investigation were cast length (mm), forefoot width (mm), rearfoot width (mm), medial arch height (mm), lateral arch height (mm) and forefoot to rearfoot alignment (degrees). Intraclass correlation coefficients (ICC) with 95% confidence intervals (CI) were calculated to determine the intra and inter-rater reliability. Measurement error was assessed through the calculation of the standard error of the measurement (SEM) and smallest real difference (SRD). Results ICC values for all foot parameters using digital scanning ranged between 0.81-0.99 for both intra and inter-rater reliability. For neutral suspension casting technique inter-rater reliability values ranged from 0.57-0.99 and intra-rater reliability values ranging from 0.36-0.99 for rater 1 and 0.49-0.99 for rater 2. Conclusions The findings of this study indicate that digital scanning is a reliable technique, irrespective of clinical experience, with reduced measurement variability in all foot parameters investigated when compared to neutral suspension casting. PMID:21375757

Measurement errors when estimating the vertical jump height with flight time using photocell devices: the example of Optojump.

PubMed

Attia, A; Dhahbi, W; Chaouachi, A; Padulo, J; Wong, D P; Chamari, K

2017-03-01

Common methods to estimate vertical jump height (VJH) are based on the measurements of flight time (FT) or vertical reaction force. This study aimed to assess the measurement errors when estimating the VJH with flight time using photocell devices in comparison with the gold standard jump height measured by a force plate (FP). The second purpose was to determine the intrinsic reliability of the Optojump photoelectric cells in estimating VJH. For this aim, 20 subjects (age: 22.50±1.24 years) performed maximal vertical jumps in three modalities in randomized order: the squat jump (SJ), counter-movement jump (CMJ), and CMJ with arm swing (CMJarm). Each trial was simultaneously recorded by the FP and Optojump devices. High intra-class correlation coefficients (ICCs) for validity (0.98-0.99) and low limits of agreement (less than 1.4 cm) were found; even a systematic difference in jump height was consistently observed between FT and double integration of force methods (-31% to -27%; p<0.001) and a large effect size (Cohen's d >1.2). Intra-session reliability of Optojump was excellent, with ICCs ranging from 0.98 to 0.99, low coefficients of variation (3.98%), and low standard errors of measurement (0.8 cm). It was concluded that there was a high correlation between the two methods to estimate the vertical jump height, but the FT method cannot replace the gold standard, due to the large systematic bias. According to our results, the equations of each of the three jump modalities were presented in order to obtain a better estimation of the jump height.
Measurement errors when estimating the vertical jump height with flight time using photocell devices: the example of Optojump

PubMed Central

Attia, A; Chaouachi, A; Padulo, J; Wong, DP; Chamari, K

2016-01-01

Common methods to estimate vertical jump height (VJH) are based on the measurements of flight time (FT) or vertical reaction force. This study aimed to assess the measurement errors when estimating the VJH with flight time using photocell devices in comparison with the gold standard jump height measured by a force plate (FP). The second purpose was to determine the intrinsic reliability of the Optojump photoelectric cells in estimating VJH. For this aim, 20 subjects (age: 22.50±1.24 years) performed maximal vertical jumps in three modalities in randomized order: the squat jump (SJ), counter-movement jump (CMJ), and CMJ with arm swing (CMJarm). Each trial was simultaneously recorded by the FP and Optojump devices. High intra-class correlation coefficients (ICCs) for validity (0.98-0.99) and low limits of agreement (less than 1.4 cm) were found; even a systematic difference in jump height was consistently observed between FT and double integration of force methods (-31% to -27%; p<0.001) and a large effect size (Cohen’s d>1.2). Intra-session reliability of Optojump was excellent, with ICCs ranging from 0.98 to 0.99, low coefficients of variation (3.98%), and low standard errors of measurement (0.8 cm). It was concluded that there was a high correlation between the two methods to estimate the vertical jump height, but the FT method cannot replace the gold standard, due to the large systematic bias. According to our results, the equations of each of the three jump modalities were presented in order to obtain a better estimation of the jump height. PMID:28416900
Development, reliability, and validity testing of Toddler NutriSTEP: a nutrition risk screening questionnaire for children 18-35 months of age.

PubMed

Randall Simpson, Janis; Gumbley, Jillian; Whyte, Kylie; Lac, Jane; Morra, Crystal; Rysdale, Lee; Turfryer, Mary; McGibbon, Kim; Beyers, Joanne; Keller, Heather

2015-09-01

Nutrition is vital for optimal growth and development of young children. Nutrition risk screening can facilitate early intervention when followed by nutritional assessment and treatment. NutriSTEP (Nutrition Screening Tool for Every Preschooler) is a valid and reliable nutrition risk screening questionnaire for preschoolers (aged 3-5 years). A need was identified for a similar questionnaire for toddlers (aged 18-35 months). The purpose was to develop a reliable and valid Toddler NutriSTEP. Toddler NutriSTEP was developed in 4 phases. Content and face validity were determined with a literature review, parent focus groups (n = 6; 48 participants), and experts (n = 13) (phase A). A draft questionnaire was refined with key intercept interviews of 107 parents/caregivers (phase B). Test-retest reliability (phase C), based on intra-class correlations (ICC), Kappa (κ) statistics, and Wilcoxon tests was assessed with 133 parents/caregivers. Criterion validity (phase D) was assessed using Receiver Operating Characteristic (ROC) curves by comparing scores on the Toddler NutriSTEP to a comprehensive nutritional assessment of 200 toddlers with a registered dietitian (RD). The Toddler NutriSTEP was reliable between 2 administrations (ICC = 0.951, F = 20.53, p < 0.001); most questions had moderate (κ ≥ 0.6) or excellent (κ ≥ 0.8) agreement. Scores on the RD nutrition risk rating and the Toddler NutriSTEP were correlated (r = 0.67, p < 0.000). The area under the ROC curve for moderate and high RD risk ratings were 84.6% and 82.7%, respectively. Cut-points of ≥21 (sensitivity 86%; specificity 61%) (moderate risk) and ≥26 (sensitivity 95%; specificity 63%) (high risk) were determined. The Toddler NutriSTEP questionnaire is both reliable and valid for screening for nutritional risk in toddlers.
Reliability and validity of the Korean version of the community balance and mobility scale in patients with hemiplegia after stroke

PubMed Central

Lee, Kyoung-bo; Lee, Paul; Yoo, Sang-won; Kim, Young-dong

2016-01-01

[Purpose] The aim of this study was to translate and adapt the Community Balance and Mobility Scale (CB&M) into Korean (K-CB&M) and to verify the reliability and validity of scores obtained with Korean patients. [Subjects and Methods] A total of 16 subjects were recruited from St. Vincent’s Hospital in South Korea. At each testing session, subjects completed the K-CB&M, Berg balance scale (BBS), timed up and go test (TUG), and functional reaching test. All tests were administered by a physical therapist, and subjects completed the tests in an identical standardized order during all testing sessions. [Results] The inter- and intra-rater reliability coefficients were high for most subscores, while moderate inter-rater reliability was observed for the items “walking and looking” and “walk, look, and carry”, and moderate intra-rater reliability was observed for “forward to backward walking”. There was a positive correlation between the K-CB&M and BBS and a negative correlation between the K-CB&M and TUG in the convergent validity assessments. [Conclusion] The reliability and validity of the K-CB&M was high, suggesting that clinical practitioners treating Korean patients with hemiplegia can use this material for assessing static and dynamic balance. PMID:27630420
Translation and validation of Moroccan Western Ontario and McMaster Universities (WOMAC) osteoarthritis index in knee osteoarthritis.

PubMed

Faik, A; Benbouazza, K; Amine, B; Maaroufi, H; Bahiri, R; Lazrak, N; Aboukal, R; Hajjaj-Hassouni, N

2008-05-01

The aim of this study is to assess the reliability and validity of the Western Ontario and McMaster University Osteoarthritis Index (WOMAC) in Moroccan patients with knee osteoarthritis. The WOMAC was translated and back translated to and from dialectal Arabic, pre-tested and reviewed by a committee following the Guillemin criteria. The Moroccan version of the WOMAC was administered twice during a 24-48 h interval to 71 Moroccan patients with symptomatic knee osteoarthritis, fulfilling the revised criteria of the American College of Rheumatology. The test-retest reliability was assessed using intra-class correlation coefficient, and the Bland and Altman method. Internal consistency was assessed by Cronbach's alpha coefficient. Construct validity was tested by correlating the WOMAC subscales with visual analogic scale (VAS) of pain, VAS of handicap, maximum distance walked and clinical characteristics. The Moroccan version of the WOMAC showed good reliability, with ICC values of the three dimensions: pain, stiffness and physical function being 0.80, 0.77 and 0.89, respectively. Bland and Altman analysis showed that means of differences did not differ significantly from 0 and that no systematic trend was observed. Internal consistency with Cronbach's alpha for pain was found to be 0.76, and its equivalents for stiffness and physical function subscales were evaluated at 0.76, 0.90, respectively. Construct validity showed statistically significant correlation with all WOMAC subscales and VAS of pain (rho=0.38, 0.42, 0.63 respectively, P<0.01). Correlation between VAS handicap (rho=0.38 P<0.001) and maximum distance walked (rho=-0.40, P<0.01) was observed with physical function subscale. There was no correlation between age, duration of disease, BMI and severity of pain and physical function in knee OA. The Moroccan version of the WOMAC is a comprehensible, reliable, and valid instrument to measure outcome in patients with knee OA.
Evaluation of the numeric rating scale for perception of effort during isometric elbow flexion exercise.

PubMed

Lampropoulou, Sofia; Nowicky, Alexander V

2012-03-01

The aim of the study was to examine the reliability and validity of the numerical rating scale (0-10 NRS) for rating perception of effort during isometric elbow flexion in healthy people. 33 individuals (32 ± 8 years) participated in the study. Three re-test measurements within one session and three weekly sessions were undertaken to determine the reliability of the scale. The sensitivity of the scale following 10 min isometric fatiguing exercise of the elbow flexors as well as the correlation of the effort with the electromyographic (EMG) activity of the flexor muscles were tested. Perception of effort was tested during isometric elbow flexion at 10, 30, 50, 70, 90, and 100% MVC. The 0-10 NRS demonstrated an excellent test-retest reliability [intra class correlation (ICC) = 0.99 between measurements taken within a session and 0.96 between 3 consecutive weekly sessions]. Exploratory curve fitting for the relationship between effort ratings and voluntary force, and underlying EMG showed that both are best described by power functions (y = ax ( b )). There were also strong correlations (range 0.89-0.95) between effort ratings and EMG recordings of all flexor muscles supporting the concurrent criterion validity of the measure. The 0-10 NRS was sensitive enough to detect changes in the perceived effort following fatigue and significantly increased at the level of voluntary contraction used in its assessment (p < 0.001). These findings suggest the 0-10 NRS is a valid and reliable scale for rating perception of effort in healthy individuals. Future research should seek to establish the validity of the 0-10 NRS in clinical settings.
Reliability and validity of the adapted Resistance Training Skills Battery for Children.

PubMed

Furzer, Bonnie J; Bebich-Philip, Marc D; Wright, Kemi E; Reid, Siobhan L; Thornton, Ashleigh L

2017-12-29

Resistance training (RT) is emerging as a training modality to improve motor function and facilitate physical activity participation in children across the motor proficiency spectrum. Although RT competency assessments have been established and validated among adolescent cohorts, the extent to which these methods are suitable for assessing children's RT skills is unknown. This project aimed to assess the psychometric properties of the adapted Resistance Training Skills Battery for Children (RTSBc), in children with varying motor proficiency. Repeated measures design with 40 participants (M age=8.2±1.7years) displaying varying levels of motor proficiency. Participants performed the adapted RTSBc on two occasions, receiving a score for their execution of each component, in addition to an overall RT skill quotient child (RTSQc). Cronbach's alpha, intra-class correlation (ICC), Bland-Altman analysis, and typical error were used to assess test-retest reliability. To examine construct validity, exploratory factor analysis was performed alongside computing correlations between participants' muscle strength, motor proficiency, age, lean muscle mass, and RTSQc. The RTSBc displayed an acceptable level of internal consistency (alpha=0.86) and test-retest reliability (ICC range=0.86-0.99). Exploratory factor analysis supported internal test structure, with all six RT skills loading strongly on a single factor (range 0.56-0.89). Analyses of structural validity revealed positive correlations for RTSQc in relation to motor proficiency (r=0.52, p<0.001) and strength scores (r=0.61, p<0.001). Analyses revealed support for the construct validity and test-retest reliability of the RTSBc, providing preliminary evidence that the RTSBc is appropriate for use in the assessment of children's RT competency. Copyright © 2018 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Development and validation of the irritable bowel syndrome scale under the system of quality of life instruments for chronic diseases QLICD-IBS: combinations of classical test theory and generalizability theory.

PubMed

Lei, Pingguang; Lei, Guanghe; Tian, Jianjun; Zhou, Zengfen; Zhao, Miao; Wan, Chonghua

2014-10-01

This paper is aimed to develop the irritable bowel syndrome (IBS) scale of the system of Quality of Life Instruments for Chronic Diseases (QLICD-IBS) by the modular approach and validate it by both classical test theory and generalizability theory. The QLICD-IBS was developed based on programmed decision procedures with multiple nominal and focus group discussions, in-depth interview, and quantitative statistical procedures. One hundred twelve inpatients with IBS were used to provide the data measuring QOL three times before and after treatments. The psychometric properties of the scale were evaluated with respect to validity, reliability, and responsiveness employing correlation analysis, factor analyses, multi-trait scaling analysis, t tests and also G studies and D studies of generalizability theory analysis. Multi-trait scaling analysis, correlation, and factor analyses confirmed good construct validity and criterion-related validity when using SF-36 as a criterion. Test-retest reliability coefficients (Pearson r and intra-class correlation (ICC)) for the overall score and all domains were higher than 0.80; the internal consistency α for all domains at two measurements were higher than 0.70 except for the social domain (0.55 and 0.67, respectively). The overall score and scores for all domains/facets had statistically significant changes after treatments with moderate or higher effect size standardized response mean (SRM) ranging from 0.72 to 1.02 at domain levels. G coefficients and index of dependability (Ф coefficients) confirmed the reliability of the scale further with more exact variance components. The QLICD-IBS has good validity, reliability, responsiveness, and some highlights and can be used as the quality of life instrument for patients with IBS.
Caregiver-proxy reliability of the Igbo-culture adapted Maleka Stroke Community Reintegration Measure: a validation study.

PubMed

Okoye, Emmanuel Chiebuka; Awhen, Peter Agba; Akosile, Christopher Olusanjo; Maruf, Fatai Adesina; Iheukwumere, Ngozi; Egwuonwu, Afamefuna Victor

2017-09-01

This study was designed to determine the caregiver-proxy reliability of the Igbo-culture adapted urban version of the Maleka Stroke Community Reintegration Measure (I-MSCRIM). This was a validation study involving 74 consenting stroke survivors and their 74 primary informal caregivers consecutively recruited from selected tertiary hospitals in South-East Nigeria (Igboland). The I-MSCRIM was researcher-administered to the participants. Obtained data was analyzed using frequency counts, percentages, range, mean, standard deviation, Spearman rank order correlation, Mann-Whitney U test, Kruskal-Wallis test and Intra-class Correlation Coefficient. Alpha level was set at 0.05. The mean ages of the stroke survivors (55.4% males) and their primary informal caregivers (41.9% males) were 50.14 ± 12.24 and 31.93 ± 10.9 years respectively. There was no significant difference in the community reintegration (CR) scores as rated by stroke survivors and their primary informal caregivers (p > 0.05). The correlations between stroke survivors' and primary informal caregivers' rated CR scores were all adequate and acceptable (ICC = 0.602-0.917). The discrepancy in the total CR scores between the two ratings was significantly influenced by primary informal caregivers' educational attainment (k = 13.15; p < 0.01). The I-MSCRIM has acceptable caregiver-proxy reliability among Igbo stroke survivors in South-East Nigeria. This suggests that primary informal caregivers of stroke survivors can reliably estimate the CR of their care recipients when I-MSCRIM is administered to them. This will be useful when a stroke survivor cannot respond to I-MSCRIM.
Validating the Danish adaptation of the World Health Organization's International Classification for Patient Safety classification of patient safety incident types

PubMed Central

Mikkelsen, Kim Lyngby; Thommesen, Jacob; Andersen, Henning Boje

2013-01-01

Objectives Validation of a Danish patient safety incident classification adapted from the World Health Organizaton's International Classification for Patient Safety (ICPS-WHO). Design Thirty-three hospital safety management experts classified 58 safety incident cases selected to represent all types and subtypes of the Danish adaptation of the ICPS (ICPS-DK). Outcome Measures Two measures of inter-rater agreement: kappa and intra-class correlation (ICC). Results An average number of incident types used per case per rater was 2.5. The mean ICC was 0.521 (range: 0.199–0.809) and the mean kappa was 0.513 (range: 0.193–0.804). Kappa and ICC showed high correlation (r = 0.99). An inverse correlation was found between the prevalence of type and inter-rater reliability. Results are discussed according to four factors known to determine the inter-rater agreement: skill and motivation of raters; clarity of case descriptions; clarity of the operational definitions of the types and the instructions guiding the coding process; adequacy of the underlying classification scheme. Conclusions The incident types of the ICPS-DK are adequate, exhaustive and well suited for classifying and structuring incident reports. With a mean kappa a little above 0.5 the inter-rater agreement of the classification system is considered ‘fair’ to ‘good’. The wide variation in the inter-rater reliability and low reliability and poor discrimination among the highly prevalent incident types suggest that for these types, precisely defined incident sub-types may be preferred. This evaluation of the reliability and usability of WHO's ICPS should be useful for healthcare administrations that consider or are in the process of adapting the ICPS. PMID:23287641
Development of a Telephone Interview Version of the Chedoke-McMaster Stroke Assessment Activity Inventory

PubMed Central

Miller, Patricia A.; Pooyania, Sepideh; Stratford, Paul

2016-01-01

Purpose: To develop a telephone version of the Chedoke-McMaster Stroke Assessment Activity Inventory (CMSA–AI) and estimate the test–retest reliability, interrater reliability (between participant and proxy), and construct validity of the scores for individuals with stroke. Methods: Adults with stroke and their caregivers or proxies were included. Participants were assessed with the CMSA–AI at discharge from a stroke rehabilitation unit and interviewed using the telephone version (TCMSA–AI). Two months after discharge, participants were evaluated with the CMSA–AI and interviewed over the phone using the TCMSA–AI on two occasions 2–3 days apart. Proxies were interviewed with the TCMSA–AI within another 2–3 days. Results: The mean age of the 53 participants with stroke was 62 years; 59% were male; 43% had right-side hemiparesis; 42 completed follow-up interviews; and 18 had proxies who also participated. Test–retest reliability showed an intra-class correlation coefficient of 0.98 (95% CI: 0.96, 0.99) for the total score, 0.96 (95% CI: 0.91, 0.98) for the Gross Motor Function Index, and 0.96 (95% CI: 0.91, 0.98) for the Walking Index, and an interrater reliability (between participant and proxy) of 0.75 (95% CI: 0.28, 0.90) for total score. Spearman's rho correlation between CMSA–AI and TCMSA–AI total scores was 0.62 (lower-sided 95% CI: 0.42) at discharge and 0.90 (lower-sided 95% CI: 0.82) at 2 months after discharge. Correlations between the change scores of the CMSA–AI and TCMSA–AI were 0.50 or lower. Conclusion: There is potential for remote evaluation of the functional mobility of individuals with stroke in research and clinical settings. PMID:27909370
Development of a Telephone Interview Version of the Chedoke-McMaster Stroke Assessment Activity Inventory.

PubMed

Barclay, Ruth; Miller, Patricia A; Pooyania, Sepideh; Stratford, Paul

Purpose: To develop a telephone version of the Chedoke-McMaster Stroke Assessment Activity Inventory (CMSA-AI) and estimate the test-retest reliability, interrater reliability (between participant and proxy), and construct validity of the scores for individuals with stroke. Methods: Adults with stroke and their caregivers or proxies were included. Participants were assessed with the CMSA-AI at discharge from a stroke rehabilitation unit and interviewed using the telephone version (TCMSA-AI). Two months after discharge, participants were evaluated with the CMSA-AI and interviewed over the phone using the TCMSA-AI on two occasions 2-3 days apart. Proxies were interviewed with the TCMSA-AI within another 2-3 days. Results: The mean age of the 53 participants with stroke was 62 years; 59% were male; 43% had right-side hemiparesis; 42 completed follow-up interviews; and 18 had proxies who also participated. Test-retest reliability showed an intra-class correlation coefficient of 0.98 (95% CI: 0.96, 0.99) for the total score, 0.96 (95% CI: 0.91, 0.98) for the Gross Motor Function Index, and 0.96 (95% CI: 0.91, 0.98) for the Walking Index, and an interrater reliability (between participant and proxy) of 0.75 (95% CI: 0.28, 0.90) for total score. Spearman's rho correlation between CMSA-AI and TCMSA-AI total scores was 0.62 (lower-sided 95% CI: 0.42) at discharge and 0.90 (lower-sided 95% CI: 0.82) at 2 months after discharge. Correlations between the change scores of the CMSA-AI and TCMSA-AI were 0.50 or lower. Conclusion: There is potential for remote evaluation of the functional mobility of individuals with stroke in research and clinical settings.
Reliability, validity, and responsiveness of the Persian version of Shoulder Activity Scale in a group of patients with shoulder disorders.

PubMed

Negahban, Hossein; Mohtasebi, Elham; Goharpey, Shahin

2015-01-01

The aim of this methodological study was to cross-culturally translate the Shoulder Activity Scale (SAS) into the Persian and determine its clinimetric properties including reliability, validity, and responsiveness in patients with shoulder disorders. Persian version of the SAS was obtained after standard forward-backward translation. Three questionnaires were completed by the respondents: SAS, shoulder pain and disability index (SPADI), and Short-Form 36 Health Survey (SF-36). The patients completed the SAS, 1 week after the first visit to evaluate the test-retest reliability. Construct validity was evaluated by examining the associations between the scores on the SAS and the scores obtained from the SPADI, SF-36, and age of the patients. To assess responsiveness, data were collected in the first visit and then again after 4 weeks physiotherapy intervention. Test-retest reliability and internal consistency were assessed using Intra-class Correlation Coefficient (ICC) and Cronbach's alpha, respectively. To evaluate construct validity, Spearman's rank correlation was used. The ability of the SAS to detect changes was evaluated by the receiver-operating characteristics method. No problem or language difficulties were reported during translation process. Test-retest reliability of the SAS was excellent with an ICC of 0.98. Also, the marginal Cronbach's alpha level of 0.64 was obtained. The correlation between the SAS and the SPADI was low, proving divergent validity, whereas the correlations between the SAS and the SF-36/age were moderate proving convergent validity. A marginally acceptable responsiveness was achieved for the Persian SAS. The study provides some evidences to support the test-retest reliability, internal consistency, construct validity, and responsiveness of the Persian version of the SAS in patients with shoulder disorders. Therefore, it seems that this instrument is a useful measure of shoulder activity level in research setting and clinical practice. The shoulder activity scale (SAS) is a reliable, valid, and responsive measure of shoulder activity level in Persian-speaking patients with different shoulder disorders. The results on clinimetric properties of the Persian SAS are comparable with its original, English version. Persian version of the SAS can be used in "clinical" and "research" settings of patients with shoulder disorders.
THE INTRA- AND INTER-RATER RELIABILITY OF THE SOCCER INJURY MOVEMENT SCREEN (SIMS).

PubMed

McCunn, Robert; Aus der Fünten, Karen; Govus, Andrew; Julian, Ross; Schimpchen, Jan; Meyer, Tim

2017-02-01

The growing volume of movement screening research reveals a belief among practitioners and researchers alike that movement quality may have an association with injury risk. However, existing movement screening tools have not considered the sport-specific movement and injury patterns relevant to soccer. The present study introduces the Soccer Injury Movement Screen (SIMS), which has been designed specifically for use within soccer. Furthermore, the purpose of the present study was to assess the intra- and inter-rater reliability of the SIMS and determine its suitability for use in further research. The study utilized a test-retest design to discern reliablility. Twenty-five (11 males, 14 females) healthy, recreationally active university students (age 25.5 ± 4.0 years, height 171 ± 9 cm, weight 64.7 ± 12.6 kg) agreed to participate. The SIMS contains five sub-tests: the anterior reach, single-leg deadlift, in-line lunge, single-leg hop for distance and tuck jump. Each movement was scored out of 10 points and summed to produce a composite score out of 50. The anterior reach and single-leg hop for distance were scored in real-time while the remaining tests were filmed and scored retrospectively. Three raters conducted the SIMS with each participant on three occasions separated by an average of three and a half days (minimum one day, maximum seven days). Rater 1 re-scored the filmed movements for all participants on all occasions six months later to establish the 'pure' intra-rater (intra-occasion) reliability for those movements. Intraclass correlation coefficient (ICC) values for intra- and inter-rater composite score reliability ranged from 0.66-0.72 and 0.79-0.86 respectively. Weighted kappa values representing the intra- and inter-rater reliability of the individual sub-tests ranged from 0.35-0.91 indicating fair to almost perfect agreement. Establishing the reliability of the SIMS is a prerequisite for further research seeking to investigate the relationship between test score and subsequent injury. The present results indicate acceptable reliability for this purpose; however, room for further development of the intra-rater reliability exists for some of the individual sub-tests. 2b.
THE INTRA- AND INTER-RATER RELIABILITY OF THE SOCCER INJURY MOVEMENT SCREEN (SIMS)

PubMed Central

aus der Fünten, Karen; Govus, Andrew; Julian, Ross; Schimpchen, Jan; Meyer, Tim

2017-01-01

Background/purpose The growing volume of movement screening research reveals a belief among practitioners and researchers alike that movement quality may have an association with injury risk. However, existing movement screening tools have not considered the sport-specific movement and injury patterns relevant to soccer. The present study introduces the Soccer Injury Movement Screen (SIMS), which has been designed specifically for use within soccer. Furthermore, the purpose of the present study was to assess the intra- and inter-rater reliability of the SIMS and determine its suitability for use in further research. Methods The study utilized a test-retest design to discern reliablility. Twenty-five (11 males, 14 females) healthy, recreationally active university students (age 25.5 ± 4.0 years, height 171 ± 9 cm, weight 64.7 ± 12.6 kg) agreed to participate. The SIMS contains five sub-tests: the anterior reach, single-leg deadlift, in-line lunge, single-leg hop for distance and tuck jump. Each movement was scored out of 10 points and summed to produce a composite score out of 50. The anterior reach and single-leg hop for distance were scored in real-time while the remaining tests were filmed and scored retrospectively. Three raters conducted the SIMS with each participant on three occasions separated by an average of three and a half days (minimum one day, maximum seven days). Rater 1 re-scored the filmed movements for all participants on all occasions six months later to establish the ‘pure’ intra-rater (intra-occasion) reliability for those movements. Results Intraclass correlation coefficient (ICC) values for intra- and inter-rater composite score reliability ranged from 0.66-0.72 and 0.79-0.86 respectively. Weighted kappa values representing the intra- and inter-rater reliability of the individual sub-tests ranged from 0.35-0.91 indicating fair to almost perfect agreement. Conclusions Establishing the reliability of the SIMS is a prerequisite for further research seeking to investigate the relationship between test score and subsequent injury. The present results indicate acceptable reliability for this purpose; however, room for further development of the intra-rater reliability exists for some of the individual sub-tests. Level of evidence 2b PMID:28217416
Reliability and validity of the Microsoft Kinect for evaluating static foot posture

PubMed Central

2013-01-01

Background The evaluation of foot posture in a clinical setting is useful to screen for potential injury, however disagreement remains as to which method has the greatest clinical utility. An inexpensive and widely available imaging system, the Microsoft Kinect™, may possess the characteristics to objectively evaluate static foot posture in a clinical setting with high accuracy. The aim of this study was to assess the intra-rater reliability and validity of this system for assessing static foot posture. Methods Three measures were used to assess static foot posture; traditional visual observation using the Foot Posture Index (FPI), a 3D motion analysis (3DMA) system and software designed to collect and analyse image and depth data from the Kinect. Spearman’s rho was used to assess intra-rater reliability and concurrent validity of the Kinect to evaluate foot posture, and a linear regression was used to examine the ability of the Kinect to predict total visual FPI score. Results The Kinect demonstrated moderate to good intra-rater reliability for four FPI items of foot posture (ρ = 0.62 to 0.78) and moderate to good correlations with the 3DMA system for four items of foot posture (ρ = 0.51 to 0.85). In contrast, intra-rater reliability of visual FPI items was poor to moderate (ρ = 0.17 to 0.63), and correlations with the Kinect and 3DMA systems were poor (absolute ρ = 0.01 to 0.44). Kinect FPI items with moderate to good reliability predicted 61% of the variance in total visual FPI score. Conclusions The majority of the foot posture items derived using the Kinect were more reliable than the traditional visual assessment of FPI, and were valid when compared to a 3DMA system. Individual foot posture items recorded using the Kinect were also shown to predict a moderate degree of variance in the total visual FPI score. Combined, these results support the future potential of the Kinect to accurately evaluate static foot posture in a clinical setting. PMID:23566934
Reliability of real-time ultrasound measurement of transversus abdominis thickness in healthy trained subjects.

PubMed

Gnat, Rafael; Saulicz, Edward; Miądowicz, Barbara

2012-08-01

To investigate intra- and inter-rater reliability of the ultrasound measurement of transversus abdominis (TrA) thickness and thickness change (difference between thickness at rest and during contraction) in asymptomatic, trained subjects. To define the number of repeated measurements that provide acceptable level of reliability. To investigate variability of the measurements over time of 5 days and the reliability of duplicate analysis of images. A single-group repeated-measures design was used to assess reliability. Healthy volunteers (n = 10) were subjected to 1-week training in voluntary activation of TrA. Real-time ultrasound imaging and subsequent measurement of the TrA thickness at rest and during voluntary contraction were repeated on Monday, Wednesday and Friday of the next week. Using a single repeated measurement, intraclass correlation coefficients (ICCs) for TrA thickness were: 0.86-0.95 (intra-rater), 0.86-0.92 (inter-rater); and for TrA thickness change: 0.34-0.56 (intra-rater), 0.47-0.61 (inter-rater). Using the mean of three repeated measurements respective values were: 0.97, 0.96-0.98; and 0.81-0.84, 0.80-0.90. No significant differences were found between mean values of TrA thickness as well as thickness change obtained on three consecutive measurement days. Duplicate analysis of the images was highly reliable with ICCs of 0.89-0.99. Two repeated measurements for TrA thickness and at least three measurements for TrA thickness change are needed to achieve acceptable levels of intra- and inter-rater reliability. In healthy trained volunteers TrA thickness and thickness change are relatively stable parameters over a 5-day period. Duplicate analysis of the same images by two blinded observers is reliable.
The reliability and validity of the Turkish version of Fullerton Advanced Balance (FAB-T) scale.

PubMed

Iyigun, Gozde; Kirmizigil, Berkiye; Angin, Ender; Oksuz, Sevim; Can, Filiz; Eker, Levent; Rose, Debra J

2018-06-04

The aim of this study was to evaluate the reliability and validity of the Turkish version of the FAB(FAB-T) scale in the older Turkish adults. The reliability and validity of the scale was tested on 200 community-dwelling older adults. FAB-T scale was scored by different physiotherapists on different days to evaluate inter-rater and intrarater reliability. The Berg Balance Scale (BBS) was used for the evaluation of convergent validity, and the content validity of the FAB-T scale was investigated. The FAB-T scale showed very high inter- and intra-rater reliability. For inter-rater agreement, on the individual test items and total score ICC values were 0.92 (95 %CI; 0.90-0.94) and 0.96 (95% CI; 0.95-0.97) respectively. The intra-rater agreement, on the individual test items and total score ICC values were 0.93 (95 %CI; 0.91- 0.95) and 0.96 (95% CI; 0.95- 0.97) respectively. There was a good agreement between the FAB-T and BBS scales. A high correlation was found between the BBS and FAB-T scales [rho = 0.70 (%95 CI; 0.62-0.76)] indicating good convergent validity. Considering the content validity of the FAB-T scale, no floor (floor score: 0%) or ceiling (ceiling score: 6.5%) effect was detected. The FAB-T scale was successfully translated from the original English version (FAB) and demonstrated strong psychometric features. It was found that the FAB-T scale has very high inter-rater and intra-rater reliability. Considering the convergent validity, the scale has high correlation with the BBS. The FAB-T has no floor and ceiling effect. Copyright © 2018 Elsevier B.V. All rights reserved.
Development and reliability of a Motivational Interviewing Scenarios Tool for Eating Disorders (MIST-ED) using a skills-based intervention among caregivers.

PubMed

Sepulveda, Ana R; Wise, Caroline; Zabala, Maria; Todd, Gill; Treasure, Janet

2013-12-01

The aims of this study were to develop an eating disorder scenarios tool to assess the motivational interviewing (MI) skills of caregivers and evaluate the coding reliability of the instrument, and to test the sensitivity to change through a pre/post/follow-up design. The resulting Motivational Interview Scenarios Tool for Eating Disorders (MIST-ED) was administered to caregivers (n = 66) who were asked to provide oral and written responses before and after a skills-based intervention, and at a 3-month follow-up. Raters achieved excellent inter-rater reliability (intra-class correlations of 91.8% on MI adherent and 86.1% for MI non-adherent statements for written scenarios and 89.2%, and 85.3% for oral scenarios). Following the intervention, MI adherent statements increased (baseline = 9.4%, post = 61.5% and follow-up 47.2%) and non-MI adherent statements decreased (baseline = 90.6%, post = 38.5% and follow-up = 52.8%). This instrument can be used as a simple method to measure the acquisition of MI skills to improve coping and both response methods are adequate. The tool shows good sensitivity to improved skills. © 2013.
Inter-rater reliability of Hamilton depression rating scale using video-recorded interviews — Focus on rater-blinding

PubMed Central

Prasad, M. Krishna; Udupa, K.; Kishore, K. R.; Thirthalli, J.; Sathyaprabha, T. N.; Gangadhar, B. N.

2009-01-01

Background: Hamilton depression rating scale (Ham-D) is the most widely used clinician rating scale for depression. There has been no Indian study that has examined the inter-rater reliability (IRR) of video-recorded interviews of the 21-item Ham-D. Aim: To study the IRR of scoring video-recorded interviews for 21-item Ham-D. Materials and Methods: Eighteen subjects with major depressive disorder involved in a larger study were interviewed using the semi-structured clinical interview of the 21-item Ham-D by a primary rater after informed consent. These interviews were video-recorded and portions edited to ensure rater blinding. Subsequently, the video-recorded interviews were rated by a “blind” rater. Both rated the different sub-domains of Ham-D according to Rhoades and Overall (1983). IRR was evaluated using intra-class correlation coefficient. Results: Excellent IRR was observed (0.9891) between the two raters. This was true for each of the primary factors and super-factors. Conclusion: Video recorded 21-item Ham-D has excellentIRR. Video-recorded interviews of Ham-D can be reliably used to blind raters in research. PMID:19881046

Validity, Reliability and Feasibility of the Eating Behavior Pattern Questionnaire (EBPQ) among Iranian Female Students

PubMed Central

Dehghan, Parvin; Asghari-Jafarabadi, Mohammad; Salekzamani, Shabnam

2015-01-01

Background: The aim of this study was to assess the validity, reliability and feasibility of eating behavior pattern questionnaire (EBPQ) in female university students. Methods: In this study, after forward-backward translation, the questionnaire was reviewed by a panel of nutritionists and a psychologist and further thirty participants for the content validity measurement. The translated and modified questionnaire was completed by 225 female students of Tabriz University in 2013. Principle axis factoring, confirmatory factor analysis and known group analysis were conducted for construct, convergent and discriminant validity. Internal consistency and test–retest reliability were assessed by Cronbach’s α coefficient and intra-class correlation coefficient (ICC). Ceiling and floor effects were also performed for evaluating the feasibility of the instrument. Results: By using exploratory factor analysis, nine factors were extracted. Confirmatory factor analysis confirmed the convergent validity. Cronbach ’s αand ICC were ranged between 0.55 to 0.78 and 0.67 to 0.89, respectively. The significant difference for some three subscales between diabetes and healthy subjects determined the discriminant validity. No ceiling and floor effects were found. Conclusion: Our findings demonstrate the initial validity, reliability and feasibility of the Iranian version of EBPQ as a useful tool for eating behavior studies in young females. PMID:26290828
Reliability of panoramic ultrasound imaging in simultaneously examining muscle size and quality of the hamstring muscles in young, healthy males and females.

PubMed

Palmer, Ty B; Akehi, Kazuma; Thiele, Ryan M; Smith, Doug B; Thompson, Brennan J

2015-03-01

The purpose of this study was to examine the reliability of ultrasound (US) measures of cross-sectional area (CSA), muscle thickness (MT) and echo intensity (EI) of the hamstrings, with comparisons between males and females. In 20 healthy participants (10 males, 10 females), CSA, MT and EI were measured from panoramic US scans of the hamstrings on 2 separate days. The intra-class correlation coefficients and standard errors of measurement as a percentage of the mean for CSA, MT and EI ranged from 0.715 to 0.984 and from 3.145 to 12.541% in the males and from 0.724 to 0.977 and from 4.571 to 17.890% in the females, respectively. The males had greater CSAs and MTs and lower EIs than the females (p = 0.002-0.049), and significant relationships were observed between CSA and MT (r = 0.714-0.938, p ≤ 0.001-0.023). From an overall reliability standpoint, these findings suggest that panoramic US may be a reliable technique for examining muscle size and quality of the hamstrings in both males and females. Copyright © 2015 World Federation for Ultrasound in Medicine & Biology. Published by Elsevier Inc. All rights reserved.
Reliability of anthropometric measurements in young male and female artistic gymnasts.

PubMed

Siatras, Theophanis; Skaperda, Malamati; Mameletzi, Dimitra

2010-12-01

Body dimensions and body composition of children participating in artistic activities, such as gymnastics and many types of dancing, are important factors in performance improvement. The present study aimed to determine the reliability of a series of selected anthropometric measurements in young male and female gymnasts. Segment lengths, body breadths, circumferences, and skinfold thickness were measured in 20 young gymnasts by the same experienced examiner, using portable and easy-to-use instruments. All parameters were measured twice (test-retest) under the same conditions within a week's period. The high intra-class correlation coefficient (ICC) values ranging from 0.87 to 0.99, as well as the low coefficient of variation (CV) values (<5.3%), affirmed that the selected measurements were highly reliable. The technical error of measurement (TEM) values for lengths and breadths were 0.15 to 0.80 cm, for circumferences 0.22 to 1 cm, and for skinfold thickness 0.33 to 0.58 mm. The high test-retest ICC and the low CV and TEM values confirmed the reliability of all anthropometric measurements in young artistic gymnasts. Therefore, these measurements could contribute to further research in this field of investigation, helping to monitor young artistic gymnasts' growth status and identify specific characteristics for increased performance in this sport.
Validity of radiographic assessment of the knee joint space using automatic image analysis.

PubMed

Komatsu, Daigo; Hasegawa, Yukiharu; Kojima, Toshihisa; Seki, Taisuke; Ikeuchi, Kazuma; Takegami, Yasuhiko; Amano, Takafumi; Higuchi, Yoshitoshi; Kasai, Takehiro; Ishiguro, Naoki

2016-09-01

The present study investigated whether there were differences between automatic and manual measurements of the minimum joint space width (mJSW) on knee radiographs. Knee radiographs of 324 participants in a systematic health screening were analyzed using the following three methods: manual measurement of film-based radiographs (Manual), manual measurement of digitized radiographs (Digital), and automatic measurement of digitized radiographs (Auto). The mean mJSWs on the medial and lateral sides of the knees were determined using each method, and measurement reliability was evaluated using intra-class correlation coefficients. Measurement errors were compared between normal knees and knees with radiographic osteoarthritis. All three methods demonstrated good reliability, although the reliability was slightly lower with the Manual method than with the other methods. On the medial and lateral sides of the knees, the mJSWs were the largest in the Manual method and the smallest in the Auto method. The measurement errors of each method were significantly larger for normal knees than for radiographic osteoarthritis knees. The mJSW measurements are more accurate and reliable with the Auto method than with the Manual or Digital method, especially for normal knees. Therefore, the Auto method is ideal for the assessment of the knee joint space.
Reproducibility of abdominal fat assessment by ultrasound and computed tomography

PubMed Central

Mauad, Fernando Marum; Chagas-Neto, Francisco Abaeté; Benedeti, Augusto César Garcia Saab; Nogueira-Barbosa, Marcello Henrique; Muglia, Valdair Francisco; Carneiro, Antonio Adilton Oliveira; Muller, Enrico Mattana; Elias Junior, Jorge

2017-01-01

Objective: To test the accuracy and reproducibility of ultrasound and computed tomography (CT) for the quantification of abdominal fat in correlation with the anthropometric, clinical, and biochemical assessments. Materials and Methods: Using ultrasound and CT, we determined the thickness of subcutaneous and intra-abdominal fat in 101 subjects-of whom 39 (38.6%) were men and 62 (61.4%) were women-with a mean age of 66.3 years (60-80 years). The ultrasound data were correlated with the anthropometric, clinical, and biochemical parameters, as well as with the areas measured by abdominal CT. Results: Intra-abdominal thickness was the variable for which the correlation with the areas of abdominal fat was strongest (i.e., the correlation coefficient was highest). We also tested the reproducibility of ultrasound and CT for the assessment of abdominal fat and found that CT measurements of abdominal fat showed greater reproducibility, having higher intraobserver and interobserver reliability than had the ultrasound measurements. There was a significant correlation between ultrasound and CT, with a correlation coefficient of 0.71. Conclusion: In the assessment of abdominal fat, the intraobserver and interobserver reliability were greater for CT than for ultrasound, although both methods showed high accuracy and good reproducibility. PMID:28670024
Reproducibility of abdominal fat assessment by ultrasound and computed tomography.

PubMed

Mauad, Fernando Marum; Chagas-Neto, Francisco Abaeté; Benedeti, Augusto César Garcia Saab; Nogueira-Barbosa, Marcello Henrique; Muglia, Valdair Francisco; Carneiro, Antonio Adilton Oliveira; Muller, Enrico Mattana; Elias Junior, Jorge

2017-01-01

To test the accuracy and reproducibility of ultrasound and computed tomography (CT) for the quantification of abdominal fat in correlation with the anthropometric, clinical, and biochemical assessments. Using ultrasound and CT, we determined the thickness of subcutaneous and intra-abdominal fat in 101 subjects-of whom 39 (38.6%) were men and 62 (61.4%) were women-with a mean age of 66.3 years (60-80 years). The ultrasound data were correlated with the anthropometric, clinical, and biochemical parameters, as well as with the areas measured by abdominal CT. Intra-abdominal thickness was the variable for which the correlation with the areas of abdominal fat was strongest (i.e., the correlation coefficient was highest). We also tested the reproducibility of ultrasound and CT for the assessment of abdominal fat and found that CT measurements of abdominal fat showed greater reproducibility, having higher intraobserver and interobserver reliability than had the ultrasound measurements. There was a significant correlation between ultrasound and CT, with a correlation coefficient of 0.71. In the assessment of abdominal fat, the intraobserver and interobserver reliability were greater for CT than for ultrasound, although both methods showed high accuracy and good reproducibility.
The Maristán stigma scale: a standardized international measure of the stigma of schizophrenia and other psychoses.

PubMed

Saldivia, Sandra; Runte-Geidel, Ariadne; Grandón, Pamela; Torres-González, Francisco; Xavier, Miguel; Antonioli, Claudio; Ballester, Dinarte A; Melipillán, Roberto; Galende, Emiliano; Vicente, Benjamín; Caldas, José Miguel; Killaspy, Helen; Gibbons, Rachel; King, Michael

2014-06-18

People with schizophrenia face prejudice and discrimination from a number of sources including professionals and families. The degree of stigma perceived and experienced varies across cultures and communities. We aimed to develop a cross-cultural measure of the stigma perceived by people with schizophrenia. Items for the scale were developed from qualitative group interviews with people with schizophrenia in six countries. The scale was then applied in face-to-face interviews with 164 participants, 103 of which were repeated after 30 days. Principal Axis Factoring and Promax rotation evaluated the structure of the scale; Horn's parallel combined with bootstrapping determined the number of factors; and intra-class correlation assessed test-retest reliability. The final scale has 31 items and four factors: informal social networks, socio-institutional, health professionals and self-stigma. Cronbach's alpha was 0.84 for the Factor 1; 0.81 for Factor 2; 0.74 for Factor 3, and 0.75 for Factor 4. Correlation matrix among factors revealed that most were in the moderate range [0.31-0.49], with the strongest occurring between perception of stigma in the informal network and self-stigma and there was also a weaker correlation between stigma from health professionals and self-stigma. Test-retest reliability was highest for informal networks [ICC 0.76 [0.67 -0.83
The Maristán stigma scale: a standardized international measure of the stigma of schizophrenia and other psychoses

PubMed Central

2014-01-01

Background People with schizophrenia face prejudice and discrimination from a number of sources including professionals and families. The degree of stigma perceived and experienced varies across cultures and communities. We aimed to develop a cross-cultural measure of the stigma perceived by people with schizophrenia. Method Items for the scale were developed from qualitative group interviews with people with schizophrenia in six countries. The scale was then applied in face-to-face interviews with 164 participants, 103 of which were repeated after 30 days. Principal Axis Factoring and Promax rotation evaluated the structure of the scale; Horn’s parallel combined with bootstrapping determined the number of factors; and intra-class correlation assessed test-retest reliability. Results The final scale has 31 items and four factors: informal social networks, socio-institutional, health professionals and self-stigma. Cronbach’s alpha was 0.84 for the Factor 1; 0.81 for Factor 2; 0.74 for Factor 3, and 0.75 for Factor 4. Correlation matrix among factors revealed that most were in the moderate range [0.31-0.49], with the strongest occurring between perception of stigma in the informal network and self-stigma and there was also a weaker correlation between stigma from health professionals and self-stigma. Test-retest reliability was highest for informal networks [ICC 0.76 [0.67 -0.83
Reliable and fast volumetry of the lumbar spinal cord using cord image analyser (Cordial).

PubMed

Tsagkas, Charidimos; Altermatt, Anna; Bonati, Ulrike; Pezold, Simon; Reinhard, Julia; Amann, Michael; Cattin, Philippe; Wuerfel, Jens; Fischer, Dirk; Parmar, Katrin; Fischmann, Arne

2018-04-30

To validate the precision and accuracy of the semi-automated cord image analyser (Cordial) for lumbar spinal cord (SC) volumetry in 3D T1w MRI data of healthy controls (HC). 40 3D T1w images of 10 HC (w/m: 6/4; age range: 18-41 years) were acquired at one 3T-scanner in two MRI sessions (time interval 14.9±6.1 days). Each subject was scanned twice per session, allowing determination of test-retest reliability both in back-to-back (intra-session) and scan-rescan images (inter-session). Cordial was applied for lumbar cord segmentation twice per image by two raters, allowing for assessment of intra- and inter-rater reliability, and compared to a manual gold standard. While manually segmented volumes were larger (mean: 2028±245 mm 3 vs. Cordial: 1636±300 mm 3 , p<0.001), accuracy assessments between manually and semi-automatically segmented images showed a mean Dice-coefficient of 0.88±0.05. Calculation of within-subject coefficients of variation (COV) demonstrated high intra-session (1.22-1.86%), inter-session (1.26-1.84%), as well as intra-rater (1.73-1.83%) reproducibility. No significant difference was shown between intra- and inter-session reproducibility or between intra-rater reliabilities. Although inter-rater reproducibility (COV: 2.87%) was slightly lower compared to all other reproducibility measures, between rater consistency was very strong (intraclass correlation coefficient: 0.974). While under-estimating the lumbar SCV, Cordial still provides excellent inter- and intra-session reproducibility showing high potential for application in longitudinal trials. • Lumbar spinal cord segmentation using the semi-automated cord image analyser (Cordial) is feasible. • Lumbar spinal cord is 40-mm cord segment 60 mm above conus medullaris. • Cordial provides excellent inter- and intra-session reproducibility in lumbar spinal cord region. • Cordial shows high potential for application in longitudinal trials.
Valid statistical approaches for analyzing sholl data: Mixed effects versus simple linear models.

PubMed

Wilson, Machelle D; Sethi, Sunjay; Lein, Pamela J; Keil, Kimberly P

2017-03-01

The Sholl technique is widely used to quantify dendritic morphology. Data from such studies, which typically sample multiple neurons per animal, are often analyzed using simple linear models. However, simple linear models fail to account for intra-class correlation that occurs with clustered data, which can lead to faulty inferences. Mixed effects models account for intra-class correlation that occurs with clustered data; thus, these models more accurately estimate the standard deviation of the parameter estimate, which produces more accurate p-values. While mixed models are not new, their use in neuroscience has lagged behind their use in other disciplines. A review of the published literature illustrates common mistakes in analyses of Sholl data. Analysis of Sholl data collected from Golgi-stained pyramidal neurons in the hippocampus of male and female mice using both simple linear and mixed effects models demonstrates that the p-values and standard deviations obtained using the simple linear models are biased downwards and lead to erroneous rejection of the null hypothesis in some analyses. The mixed effects approach more accurately models the true variability in the data set, which leads to correct inference. Mixed effects models avoid faulty inference in Sholl analysis of data sampled from multiple neurons per animal by accounting for intra-class correlation. Given the widespread practice in neuroscience of obtaining multiple measurements per subject, there is a critical need to apply mixed effects models more widely. Copyright © 2017 Elsevier B.V. All rights reserved.
T2* mapping of hip joint cartilage in various histological grades of degeneration.

PubMed

Bittersohl, B; Miese, F R; Hosalkar, H S; Herten, M; Antoch, G; Krauspe, R; Zilkens, C

2012-07-01

To evaluate T2* values in various histological severities of osteoarthritis (OA). Magnetic resonance imaging (MRI) and T2* mapping including a three-dimensional (3D) double-echo steady-state (DESS) sequence for morphological cartilage assessment and a 3D multiecho data image combination (MEDIC) sequence for T2* mapping were conducted in 21 human femoral head specimens with varying severities of OA. Subsequently, histological assessment was undertaken in all specimens to correlate the observations of T2* mapping with histological analyses. According to the Mankin score, four grades of histological changes were determined: grade 0 (Mankin scores of 0-4), grade I (scores of 5-8), grade II (scores of 9-10), and grade III (scores of 11-14). For reliability assessment, cartilage T2* measurements were repeated after 4 weeks in 10 randomly selected femoral head specimens. T2* values decreased significantly with increasing cartilage degeneration (total P-values <0.001) ranging from 36.3 ± 4.3 ms in grade 0 regions to 22.8 ± 4.3 ms in regions with grade III changes. Pearson correlation analysis proved a fair correlation between T2* values and Mankin score (correlation coefficient = -0.362) that was statistically significant (P-value <0.001). Intra-class correlation (ICC) analysis demonstrated high intra-observer reproducibility for the T2* measurement (ICC: 0.949, P < 0.001). Given the advantages of the T2* mapping technique with no need for contrast medium, high image resolution and ability to perform 3D biochemically sensitive imaging, T2* mapping may be a strong addition to the currently evolving era of cartilage biochemical imaging. Copyright © 2012 Osteoarthritis Research Society International. Published by Elsevier Ltd. All rights reserved.
Cross-Cultural adaption, validity and reliability of a Hindi version of the Corah’s Dental Anxiety Scale

PubMed Central

Jain, Meena; Tandon, Shourya; Sharma, Ankur; Jain, Vishal; Rani Yadav, Nisha

2018-01-01

Background: An appropriate scale to assess the dental anxiety of Hindi speaking population is lacking. This study, therefore, aims to evaluate the psychometric properties of Hindi version of one of the oldest dental anxiety scale, Corah’s Dental Anxiety Scale (CDAS) in Hindi speaking Indian adults. Methods: A total of 348 subjects from the outpatient department of a dental hospital in India participated in this cross-sectional study. The scale was cross-culturally adapted by forward and backward translation, committee review and pretesting method. The construct validity of the translated scale was explored with exploratory factor analysis. The correlation of the Hindi version of CDAS with visual analogue scale (VAS) was used to measure the convergent validity. Reliability was assessed through calculations of Cronbach’s alpha and intra class correlation 48 forms were completed for test-retest. Results: Prevalence of dental anxiety in the sample within the age range of 18-80 years was 85.63% [95% CI: 0.815-0.891]. The response rate was 100 %. Kaiser-Meyer-Olkin (KMO) test value was 0.776. After factor analysis, a single factor (dental anxiety) was obtained with 4 items.The single factor model explained 61% variance. Pearson correlation coefficient between CDASand VAS was 0.494. Test-retest showed the Cronbach’s alpha value of 0.814. The test-retest intraclass correlation coefficient of the total CDAS score was 0.881 [95% CI: 0.318-0.554]. Conclusion: Hindi version of CDAS is a valid and reliable scale to assess dental anxiety in Hindi speaking population. Convergent validity is well recognized but discriminant validity is limited and requires further study. PMID:29744307
Cross-Cultural adaption, validity and reliability of a Hindi version of the Corah's Dental Anxiety Scale.

PubMed

Jain, Meena; Tandon, Shourya; Sharma, Ankur; Jain, Vishal; Rani Yadav, Nisha

2018-01-01

Background: An appropriate scale to assess the dental anxiety of Hindi speaking population is lacking. This study, therefore, aims to evaluate the psychometric properties of Hindi version of one of the oldest dental anxiety scale, Corah's Dental Anxiety Scale (CDAS) in Hindi speaking Indian adults. Methods: A total of 348 subjects from the outpatient department of a dental hospital in India participated in this cross-sectional study. The scale was cross-culturally adapted by forward and backward translation, committee review and pretesting method. The construct validity of the translated scale was explored with exploratory factor analysis. The correlation of the Hindi version of CDAS with visual analogue scale (VAS) was used to measure the convergent validity. Reliability was assessed through calculations of Cronbach's alpha and intra class correlation 48 forms were completed for test-retest. Results: Prevalence of dental anxiety in the sample within the age range of 18-80 years was 85.63% [95% CI: 0.815-0.891]. The response rate was 100 %. Kaiser-Meyer-Olkin (KMO) test value was 0.776. After factor analysis, a single factor (dental anxiety) was obtained with 4 items.The single factor model explained 61% variance. Pearson correlation coefficient between CDASand VAS was 0.494. Test-retest showed the Cronbach's alpha value of 0.814. The test-retest intraclass correlation coefficient of the total CDAS score was 0.881 [95% CI: 0.318-0.554]. Conclusion: Hindi version of CDAS is a valid and reliable scale to assess dental anxiety in Hindi speaking population. Convergent validity is well recognized but discriminant validity is limited and requires further study.
Reliability and concurrent validity of the iPhone® Compass application to measure thoracic rotation range of motion (ROM) in healthy participants

PubMed Central

Schram, Ben; Cox, Alistair J.; Anderson, Sarah L.; Keogh, Justin

2018-01-01

Background Several water-based sports (swimming, surfing and stand up paddle boarding) require adequate thoracic mobility (specifically rotation) in order to perform the appropriate activity requirements. The measurement of thoracic spine rotation is problematic for clinicians due to a lack of convenient and reliable measurement techniques. More recently, smartphones have been used to quantify movement in various joints in the body; however, there appears to be a paucity of research using smartphones to assess thoracic spine movement. Therefore, the aim of this study is to determine the reliability (intra and inter rater) and validity of the iPhone® app (Compass) when assessing thoracic spine rotation ROM in healthy individuals. Methods A total of thirty participants were recruited for this study. Thoracic spine rotation ROM was measured using both the current clinical gold standard, a universal goniometer (UG) and the Smart Phone Compass app. Intra-rater and inter-rater reliability was determined with a Intraclass Correlation Coefficient (ICC) and associated 95% confidence intervals (CI). Validation of the Compass app in comparison to the UG was measured using Pearson’s correlation coefficient and levels of agreement were identified with Bland–Altman plots and 95% limits of agreement. Results Both the UG and Compass app measurements both had excellent reproducibility for intra-rater (ICC 0.94–0.98) and inter-rater reliability (ICC 0.72–0.89). However, the Compass app measurements had higher intra-rater reliability (ICC = 0.96 − 0.98; 95% CI [0.93–0.99]; vs. ICC = 0.94 − 0.98; 95% CI [0.88–0.99]) and inter-rater reliability (ICC = 0.87 − 0.89; 95% CI [0.74–0.95] vs. ICC = 0.72 − 0.82; 95% CI [0.21–0.94]). A strong and significant correlation was found between the UG and the Compass app, demonstrating good concurrent validity (r = 0.835, p < 0.001). Levels of agreement between the two devices were 24.8° (LoA –9.5°, +15.3°). The UG was found to consistently measure higher values than the compass app (mean difference 2.8°, P < 0.001). Conclusion This study reveals that the iPhone® app (Compass) is a reliable tool for measuring thoracic spine rotation which produces greater reproducibility of measurements both within and between raters than a UG. As a significant positive correlation exists between the Compass app and UG, this supports the use of either device in clinical practice as a reliable and valid tool to measure thoracic rotation. Considering the levels of agreement are clinically unacceptable, the devices should not be used interchangeably for initial and follow up measurements. PMID:29568701
Reliability and concurrent validity of the iPhone® Compass application to measure thoracic rotation range of motion (ROM) in healthy participants.

PubMed

Furness, James; Schram, Ben; Cox, Alistair J; Anderson, Sarah L; Keogh, Justin

2018-01-01

Several water-based sports (swimming, surfing and stand up paddle boarding) require adequate thoracic mobility (specifically rotation) in order to perform the appropriate activity requirements. The measurement of thoracic spine rotation is problematic for clinicians due to a lack of convenient and reliable measurement techniques. More recently, smartphones have been used to quantify movement in various joints in the body; however, there appears to be a paucity of research using smartphones to assess thoracic spine movement. Therefore, the aim of this study is to determine the reliability (intra and inter rater) and validity of the iPhone ® app (Compass) when assessing thoracic spine rotation ROM in healthy individuals. A total of thirty participants were recruited for this study. Thoracic spine rotation ROM was measured using both the current clinical gold standard, a universal goniometer (UG) and the Smart Phone Compass app. Intra-rater and inter-rater reliability was determined with a Intraclass Correlation Coefficient (ICC) and associated 95% confidence intervals (CI). Validation of the Compass app in comparison to the UG was measured using Pearson's correlation coefficient and levels of agreement were identified with Bland-Altman plots and 95% limits of agreement. Both the UG and Compass app measurements both had excellent reproducibility for intra-rater (ICC 0.94-0.98) and inter-rater reliability (ICC 0.72-0.89). However, the Compass app measurements had higher intra-rater reliability ( ICC = 0.96 - 0.98; 95% CI [0.93-0.99]; vs. ICC = 0.94 - 0.98; 95% CI [0.88-0.99]) and inter-rater reliability ( ICC = 0.87 - 0.89; 95% CI [0.74-0.95] vs. ICC = 0.72 - 0.82; 95% CI [0.21-0.94]). A strong and significant correlation was found between the UG and the Compass app, demonstrating good concurrent validity ( r = 0.835, p < 0.001). Levels of agreement between the two devices were 24.8° (LoA -9.5°, +15.3°). The UG was found to consistently measure higher values than the compass app (mean difference 2.8°, P < 0.001). This study reveals that the iPhone ® app (Compass) is a reliable tool for measuring thoracic spine rotation which produces greater reproducibility of measurements both within and between raters than a UG. As a significant positive correlation exists between the Compass app and UG, this supports the use of either device in clinical practice as a reliable and valid tool to measure thoracic rotation. Considering the levels of agreement are clinically unacceptable, the devices should not be used interchangeably for initial and follow up measurements.
Student Practice Evaluation Form-Revised Edition online comment bank: development and reliability analysis.

PubMed

Rodger, Sylvia; Turpin, Merrill; Copley, Jodie; Coleman, Allison; Chien, Chi-Wen; Caine, Anne-Maree; Brown, Ted

2014-08-01

The reliable evaluation of occupational therapy students completing practice education placements along with provision of appropriate feedback is critical for both students and for universities from a quality assurance perspective. This study describes the development of a comment bank for use with an online version of the Student Practice Evaluation Form-Revised Edition (SPEF-R Online) and investigates its reliability. A preliminary bank of 109 individual comments (based on previous students' placement performance) was developed via five stages. These comments reflected all 11 SPEF-R domains. A purpose-designed online survey was used to examine the reliability of the comment bank. A total of 37 practice educators returned surveys, 31 of which were fully completed. Participants were asked to rate each individual comment using the five-point SPEF-R rating scale. One hundred and two of 109 comments demonstrated satisfactory agreement with their respective default ratings that were determined by the development team. At each domain level, the intra-class correlation coefficients (ranging between 0.86 and 0.96) also demonstrated good to excellent inter-rater reliability. There were only seven items that required rewording prior to inclusion in the final SPEF-R Online comment bank. The development of the SPEF-R Online comment bank offers a source of reliable comments (consistent with the SPEF-R rating scale across different domains) and aims to assist practice educators in providing reliable and timely feedback to students in a user-friendly manner. © 2014 Occupational Therapy Australia.
Test-retest reliability of quantitative sensory testing for mechanical somatosensory and pain modulation assessment of masticatory structures.

PubMed

Costa, Y M; Morita-Neto, O; de Araújo-Júnior, E N S; Sampaio, F A; Conti, P C R; Bonjardim, L R

2017-03-01

Assessing the reliability of medical measurements is a crucial step towards the elaboration of an applicable clinical instrument. There are few studies that evaluate the reliability of somatosensory assessment and pain modulation of masticatory structures. This study estimated the test-retest reliability, that is over time, of the mechanical somatosensory assessment of anterior temporalis, masseter and temporomandibular joint (TMJ) and the conditioned pain modulation (CPM) using the anterior temporalis as the test site. Twenty healthy women were evaluated in two sessions (1 week apart) by the same examiner. Mechanical detection threshold (MDT), mechanical pain threshold (MPT), wind-up ratio (WUR) and pressure pain threshold (PPT) were assessed on the skin overlying the anterior temporalis, masseter and TMJ of the dominant side. CPM was tested by comparing PPT before and during the hand immersion in a hot water bath. anova and intra-class correlation coefficients (ICCs) were applied to the data (α = 5%). The overall ICCs showed acceptable values for the test-retest reliability of mechanical somatosensory assessment of masticatory structures. The ICC values of 75% of all quantitative sensory measurements were considered fair to excellent (fair = 8·4%, good = 33·3% and excellent = 33·3%). However, the CPM paradigm presented poor reliability (ICC = 0·25). The mechanical somatosensory assessment of the masticatory structures, but not the proposed CPM protocol, can be considered sufficiently reliable over time to evaluate the trigeminal sensory function. © 2016 John Wiley & Sons Ltd.
[Reliability and validity of Parkinson's disease sleep scale-Chinese version in the south west of China].

PubMed

Zhang, J H; Peng, R; Du, Y; Mou, Y; Li, N N; Cheng, L

2016-11-08

Objective: To evaluate the reliability and validity of Parkinson's disease sleep scale-Chinese version (CPDSS) through a study of a large PD population in southwest China, and to explore the prevalence and characteristics of sleep disorders in Parkinson's disease (PD) patients from southwest China. Methods: A total of 544 PD patients and 220 control subjects were enrolled in our study. Demographic data, CPDSS, ESS, PDQ39, HAMD and H-Y stage were assessed in all subjects. Statistical description, Cronbach's alpha coefficient, intra-class correlation coefficient ( ICC ), Spearman rank correlation coefficient and Mann-Whitney U test were used for statistical analyses. Result: The Cronbach's alpha coefficient for CPDSS was 0.79, ICC of the total scale was 0.94 and ICC of each item ranged from 0.73 to 0.97. The factor analysis yielded a five-factor solution, which explained 63.4% of the total variance. Total and each item scores of CPDSS in PD patients were lower than those in healthy controls. 69.3% of PD patients had sleep disorder, while prevalence in the control group was only 29.6%. Negative correlation was found between CPDSS and ESS. Daytime sleepiness was the most common factor (35.9%) leading to sleep disorders. The sleep disorders of PD patients in Southwest China were significantly related with the course of disease, the severity of disease, the quality of life, depression, cognitive level and motor symptoms. Conclusion: CPDSS has good feasibility, reliability and validity in PD population from southwest China. CPDSS is considered as an effective tool for the assessment of sleep disorder in PD patients.
Intra- and inter-observer reliability of quantitative analysis of the infra-patellar fat pad and comparison between fat- and non-fat-suppressed imaging--Data from the osteoarthritis initiative.

PubMed

Steidle-Kloc, E; Wirth, W; Ruhdorfer, A; Dannhauer, T; Eckstein, F

2016-03-01

The infra-patellar fat pad (IPFP), as intra-articular adipose tissue represents a potential source of pro-inflammatory cytokines and its size has been suggested to be associated with osteoarthritis (OA) of the knee. This study examines inter- and intra-observer reliability of fat-suppressed (fs) and non-fat-suppressed (nfs) MR imaging for determination of IPFP morphological measurements as novel biomarkers. The IPFP of nine right knees of healthy Osteoarthritis Initiative participants was segmented by five readers, using fs and nfs baseline sagittal MRIs. The intra-observer reliability was determined from baseline and 1-year follow-up images. All segmentations were quality controlled (QC) by an expert reader. Reliability was expressed as root mean square coefficient of variation (RMS CV%). After QC, the inter-observer reliability for fs (nfs) imaging was 2.0% (1.1%) for IPFP volume, 2.1%/2.5% (1.6%/1.8%) for anterior/posterior surface areas, 1.8% (1.8%) for depth, and 2.1% (2.4%) for maximum sagittal area. The intra-observer reliability was 3.1% (5.0%) for volume, 2.3%/2.8% (2.5%/2.9%) for anterior/posterior surfaces, 1.9% (3.5%) for depth, and 3.3% (4.5%) for maximum sagittal area. IPFP volume from nfs images was systematically greater (+7.3%) than from fs images, but highly correlated (r=0.98). The results suggest that quantitative measurements of IPFP morphology can be performed with satisfactory reliability when expert QC is implemented. The IPFP is more clearly depicted in nfs images, and there is a small systematic off-set versus analysis from fs images. However, the high linear relationship between fs and nfs imaging suggests that fs images can be used to analyze IPFP morphology, when nfs images are not available. Copyright © 2015 Elsevier GmbH. All rights reserved.
Intra- and inter-observer reliability of quantitative analysis of the infra-patellar fat pad and comparison between fat- and non-fat-suppressed imaging—Data from the osteoarthritis initiative

PubMed Central

Steidle-Kloc, E.; Wirth, W.; Ruhdorfer, A.; Dannhauer, T.; Eckstein, F.

2015-01-01

The infra-patellar fat pad (IPFP), as intra-articular adipose tissue represents a potential source of pro-inflammatory cytokines and its size has been suggested to be associated with osteoarthritis (OA) of the knee. This study examines inter- and intra-observer reliability of fat-suppressed (fs) and non-fat-suppressed (nfs) MR imaging for determination of IPFP morphological measurements as novel biomarkers. The IPFP of nine right knees of healthy Osteoarthritis Initiative participants was segmented by five readers, using fs and nfs baseline sagittal MRIs. The intra-observer reliability was determined from baseline and 1-year follow-up images. All segmentations were quality controlled (QC) by an expert reader. Reliability was expressed as root mean square coefficient of variation (RMS CV%). After QC, the inter-observer reliability for fs (nfs) imaging was 2.0% (1.1%) for IPFP volume, 2.1%/2.5% (1.6%/1.8%) for anterior/posterior surface areas, 1.8% (1.8%) for depth, and 2.1% (2.4%) for maximum sagittal area. The intra-observer reliability was 3.1% (5.0%) for volume, 2.3%/2.8% (2.5%/2.9%) for anterior/posterior surfaces, 1.9% (3.5%) for depth, and 3.3% (4.5%) for maximum sagittal area. IPFP volume from nfs images was systematically greater (+7.3%) than from fs images, but highly correlated (r = 0.98). The results suggest that quantitative measurements of IPFP morphology can be performed with satisfactory reliability when expert QC is implemented. The IPFP is more clearly depicted in nfs images, and there is a small systematic off-set versus analysis from fs images. However, the high linear relationship between fs and nfs imaging suggests that fs images can be used to analyze IPFP morphology, when nfs images are not available. PMID:26569532

Psychometric Properties of the Persian Version of Death Depression Scale-Revised in Iranian Patients with Acute Myocardial Infarction.

PubMed

Sharif Nia, Hamid; Pahlevan Sharif, Saeed; Lehto, Rebecca H; Allen, Kelly A; Goudarzian, Amir Hossein; Yaghoobzadeh, Ameneh; Soleimani, Mohammad Ali

2017-07-01

Objective: Limited research has examined the psychometric properties of death depression scales in Persian populations with cardiac disease despite the need for valid assessment tools for evaluating depressive symptoms in patients with life-limiting chronic conditions. The present study aimed at evaluating the reliability and validity of the Persian Version of Death Depression Scale - Revised (DDS-R) in Iranian patients who had recent acute myocardial infarction (AMI). Method: This psychometric study was conducted with a convenience sample of 407 patients with AMI diagnosis who completed the Persian version of the DDS-R. The face, content, and construct validity of the scale were ascertained. Internal consistency, test-retest, and construct reliability (CR) were used to assess reliability of the Persian Version of DDS-R. Results: Based on maximum likelihood exploratory factor analysis and consideration of conceptual meaning, a 4-factor solution was identified, explaining 75.89% of the total variance. Goodness-of-fit indices (GFI), Comparative Fit Index (CFI), Normed Fit Index (NFI), Incremental Fit Index (IFI), and Root Mean Square Error of Approximation (RMSEA) in the final DDS-R structure demonstrated the adequacy of the 4-domain structure. The internal consistency, construct reliability, and Intra-class Correlation Coefficients (ICC) were greater than .70. Conclusion: The DDS-R was found to be a valid and reliable assessment tool for evaluating death depression symptoms in Iranian patients with AMI.
Reliability of a retail food store survey and development of an accompanying retail scoring system to communicate survey findings and identify vendors for healthful food and marketing initiatives.

PubMed

Ghirardelli, Alyssa; Quinn, Valerie; Sugerman, Sharon

2011-01-01

To develop a retail grocery instrument with weighted scoring to be used as an indicator of the food environment. Twenty six retail food stores in low-income areas in California. Observational. Inter-rater reliability for grocery store survey instrument. Description of store scoring methodology weighted to emphasize availability of healthful food. Type A intra-class correlation coefficients (ICC) with absolute agreement definition or a κ test for measures using ranges as categories. Measures of availability and price of fruits and vegetables performed well in reliability testing (κ = 0.681-0.800). Items for vegetable quality were better than for fruit (ICC 0.708 vs 0.528). Kappa scores indicated low to moderate agreement (0.372-0.674) on external store marketing measures and higher scores for internal store marketing. "Next to" the checkout counter was more reliable than "within 6 feet." Health departments using the store scoring system reported it as the most useful communication of neighborhood findings. There was good reliability of the measures among the research pairs. The local store scores can show the need to bring in resources and to provide access to fruits and vegetables and other healthful food. Copyright © 2011 Society for Nutrition Education. Published by Elsevier Inc. All rights reserved.
Reliability of scores between stroke patients and significant others on the Reintegration to Normal Living (RNL) Index.

PubMed

Tooth, Leigh R; McKenna, Kryss T; Smith, Melinda; O'Rourke, Peter K

2003-05-06

This study measured reliability between stroke patients' and significant others' scores on items on the Reintegration to Normal Living (RNL) Index and whether there were any scoring biases. The 11-item RNL Index was administered to 57 pairs of patients and significants six months after stroke rehabilitation. The index was scored using a 10-point visual analogue scale. Patient and significant other demographic information and data on patients' clinical, functional and cognitive status were collected. Reliability was measured using the intra-class correlation coefficient (ICC) and percent agreement. Overall poor reliability was found for the RNL Index total score (ICC=.36, 95% CI .07 to .59) and the daily functioning subscale (ICC=.24, 95% Cl -.003 to .46) and moderate reliability was found for the perception of self subscale (ICC= .55, 95% Cl .28 to .73). There was a moderate bias for patients to rate themselves as achieving better reintegration than was indicated by significant others, although no demographic or clinical factors were associated with this bias. Exact match agreement was best for the subjective items and worse for items reflecting mobility around the community and participation in a work activity. Caution is needed when interpreting patient information reported by significant others on the RNL Index. The use of a shorter scale to rate the RNL Index requires investigation.
Translation into Brazilian Portuguese, cross-cultural adaptation and validation of the Stanford presenteeism scale-6 and work instability scale for ankylosing spondylitis.

PubMed

Frauendorf, Renata; de Medeiros Pinheiro, Marcelo; Ciconelli, Rozana Mesquita

2014-12-01

Loss of productivity at work, as a result of health problems, is becoming an issue of interest due to the high burden it represents in society. The measurement of such phenomenon can be made using generic and specific scales for certain diseases such as the Stanford Presenteeism Scale (SPS-6) and the Work Instability Scale for Ankylosing Spondylitis (AS-WIS), specific for patients with ankylosing spondylitis (AS). The aim of this study was to translate and perform a cross-cultural adaptation of SPS-6 and AS-WIS into Portuguese and check their psychometric properties. The study also aimed to evaluate the relationship between the general scores of the scales and the main sociodemographic and clinical data, lifestyles, and absenteeism in patients with AS and correlate these variables with SPS-6 and AS-WIS scales. A sample of 120 patients with AS and 80 workers at a university hospital was evaluated. The processes for the translation and cross-cultural adaptation of the instruments followed preestablished steps and rules presented in the literature. For the evaluation of measurement properties and correlations between scales, intra-class correlation coefficient (reproducibility analysis), Cronbach alpha (internal consistency), and Pearson correlation coefficient (validity) were employed. The inter-observer (0.986) and intra-observer (0.992) reproducibilities of the AS-WIS were shown to be high as well as the internal consistency (0.995). Similarly, the inter-observer reliability of SPS-6 was considered good (0.890), although it showed a poorer performance when considering the same observer (Pearson correlation coefficient = 0.675 and intra-class correlation = 0.656). Internal consistency, for the total number of items, as measured by Cronbach alpha, was 0.889. The validity of the scales was evaluated thru the comparison of the achieved scores with the results of the WLQ, SF-36, ASQoL, BASFI, BASDAI, HAQ-S, and SRQ-20 instruments. Correlations between loss of productivity at work, worse quality of life, presence of emotional disturbances, and worse health conditions were positive. The process of translation, cross-cultural adaptation, and validation of the SPS-6 as a generic measurement for the loss of productivity at work and of the AS-WIS as a specific measurement for patients with AS are valid, reproducible, and specific instruments to be used in Brazil. In both scales, productivity at work was associated to advanced age, higher rate of absenteeism in the last month and year, presence of peripheral arthritis, and a larger number of comorbidities in patients with AS. The AS-WIS and SPS-6 showed a good correlation among them although they are not mutually exclusive but supplementary.
Evaluation of patients with painful total hip arthroplasty using combined single photon emission tomography and conventional computerized tomography (SPECT/CT) - a comparison of semi-quantitative versus 3D volumetric quantitative measurements.

PubMed

Barthassat, Emilienne; Afifi, Faik; Konala, Praveen; Rasch, Helmut; Hirschmann, Michael T

2017-05-08

It was the primary purpose of our study to evaluate the inter- and intra-observer reliability of a standardized SPECT/CT algorithm for evaluating patients with painful primary total hip arthroplasty (THA). The secondary purpose was a comparison of semi-quantitative and 3D volumetric quantification method for assessment of bone tracer uptake (BTU) in those patients. A novel SPECT/CT localization scheme consisting of 14 femoral and 4 acetabular regions on standardized axial and coronal slices was introduced and evaluated in terms of inter- and intra-observer reliability in 37 consecutive patients with hip pain after THA. BTU for each anatomical region was assessed semi-quantitatively using a color-coded Likert type scale (0-10) and volumetrically quantified using a validated software. Two observers interpreted the SPECT/CT findings in all patients two times with six weeks interval between interpretations in random order. Semi-quantitative and quantitative measurements were compared in terms of reliability. In addition, the values were correlated using Pearson`s correlation. A factorial cluster analysis of BTU was performed to identify clinically relevant regions, which should be grouped and analysed together. The localization scheme showed high inter- and intra-observer reliabilities for all femoral and acetabular regions independent of the measurement method used (semiquantitative versus 3D volumetric quantitative measurements). A high to moderate correlation between both measurement methods was shown for the distal femur, the proximal femur and the acetabular cup. The factorial cluster analysis showed that the anatomical regions might be summarized into three distinct anatomical regions. These were the proximal femur, the distal femur and the acetabular cup region. The SPECT/CT algorithm for assessment of patients with pain after THA is highly reliable independent from the measurement method used. Three clinically relevant anatomical regions (proximal femoral, distal femoral, acetabular) were identified.
Scoring haemophilic arthropathy on X-rays: improving inter- and intra-observer reliability and agreement using a consensus atlas.

PubMed

Foppen, Wouter; van der Schaaf, Irene C; Beek, Frederik J A; Verkooijen, Helena M; Fischer, Kathelijn

2016-06-01

The radiological Pettersson score (PS) is widely applied for classification of arthropathy to evaluate costly haemophilia treatment. This study aims to assess and improve inter- and intra-observer reliability and agreement of the PS. Two series of X-rays (bilateral elbows, knees, and ankles) of 10 haemophilia patients (120 joints) with haemophilic arthropathy were scored by three observers according to the PS (maximum score 13/joint). Subsequently, (dis-)agreement in scoring was discussed until consensus. Example images were collected in an atlas. Thereafter, second series of 120 joints were scored using the atlas. One observer rescored the second series after three months. Reliability was assessed by intraclass correlation coefficients (ICC), agreement by limits of agreement (LoA). Median Pettersson score at joint level (PSjoint) of affected joints was 6 (interquartile range 3-9). Using the consensus atlas, inter-observer reliability of the PSjoint improved significantly from 0.94 (95 % confidence interval (CI) 0.91-0.96) to 0.97 (CI 0.96-0.98). LoA improved from ±1.7 to ±1.1 for the PSjoint. Therefore, true differences in arthropathy were differences in the PSjoint of >2 points. Intra-observer reliability of the PSjoint was 0.98 (CI 0.97-0.98), intra-observer LoA were ±0.9 points. Reliability and agreement of the PS improved by using a consensus atlas. • Reliability of the Pettersson score significantly improved using the consensus atlas. • The presented consensus atlas improved the agreement among observers. • The consensus atlas could be recommended to obtain a reproducible Pettersson score.
Reliability and clinical features associated with the IPSG MRI tibiotalar and subtalar joint scores in children, adolescents and young adults with haemophilia.

PubMed

Brunel, T; Lobet, S; Deschamps, K; Hermans, C; Peerlinck, K; Vandesande, J; Pialat, J-B

2018-01-01

To assess the reliability of the IPSG MRI scale for tibiotalar (TTJ) and subtalar joint (STJ) changes in young haemophilic patients, correlating MRI findings with functional scores and 3D-rearfoot kinematics. A total of 37 haemophilic patients underwent bilateral MRI of the footankle, clinical evaluation and quantitative assessment of their 3D-rearfoot kinematics during walking. TTJ and STJ soft tissues were assessed twice along with osteochondral changes by two radiologists using the IPSG MRI scale. Inter- and intra-observer reproducibility of MRI scoring were tested by means of kappa statistics. Correlational analyses were performed between MRI findings and the Haemophilia Joint Health Score 2.1 (HJHS) and 3D-rearfoot kinematic data. The intra-reader reliability of MRI scoring was good to excellent (Kappa: 0.62-1), whereas the inter-reader reliability was moderate to good (Kappa: 0.54-0.79). Weak yet significant correlations were found between the frontal plane rearfoot range of motion (ROM) during loading response of gait and STJ score, as well as between frontal plane rearfoot ROM during the terminal stance phase and the rearfoot osteochondral lesions. The IPSG score appears applicable to not only the TTJ but also the STJ. Contrary to TTJ lesions, those of the STJ do not correlate with the HJHS but do with 3D-rearfoot kinematic data. © 2017 John Wiley & Sons Ltd.
Ventilatory threshold may be a more specific measure of aerobic capacity than peak oxygen consumption rate in persons with stroke.

PubMed

Boyne, Pierce; Reisman, Darcy; Brian, Michael; Barney, Brian; Franke, Ava; Carl, Daniel; Khoury, Jane; Dunning, Kari

2017-03-01

After stroke, aerobic deconditioning can have a profound impact on daily activities. This is usually measured by the peak oxygen consumption rate achieved during exercise testing (VO2-peak). However, VO2-peak may be distorted by motor function. The oxygen uptake efficiency slope (OUES) and VO2 at the ventilatory threshold (VO2-VT) could more specifically assess aerobic capacity after stroke, but this has not been tested. To assess the differential influence of motor function on three measures of aerobic capacity (VO2-peak, OUES, and VO2-VT) and to evaluate the inter-rater reliability of VO2-VT determination post-stroke. Among 59 persons with chronic stroke, cross-sectional correlations with motor function (comfortable gait speed [CGS] and lower extremity Fugl-Meyer [LEFM]) were compared between the different aerobic capacity measures, after adjustment for covariates, in order to isolate any distorting effect of motor function. Reliability of VO2-VT determination between three raters was assessed with intra-class correlation (ICC). CGS was moderately correlated with VO2-peak (r = 0.52, p < 0.0001) and weakly correlated with OUES (r = 0.41, p = 0.002) and VO2-VT (r = 0.37, p = 0.01). LEFM was weakly correlated with VO2-peak (r = 0.26, p = 0.055) and very weakly correlated with OUES (r = 0.19, p = 0.17) and VO2-VT (r = 0.14, p = 0.31). Compared to VO2-peak, VO2-VT was significantly less correlated with CGS (r difference = -0.16, p = 0.02). Inter-rater reliability of VO2-VT determination was high (ICC: 0.93, 95% CI: 0.89-0.96). Motor dysfunction appears to artificially lower measured aerobic capacity. VO2-VT seemed to be less distorted than VO2-peak and had good inter-rater reliability, so it may provide more specific assessment of aerobic capacity post-stroke.
The reliability of WorkWell Systems Functional Capacity Evaluation: a systematic review

PubMed Central

2014-01-01

Background Functional capacity evaluation (FCE) determines a person’s ability to perform work-related tasks and is a major component of the rehabilitation process. The WorkWell Systems (WWS) FCE (formerly known as Isernhagen Work Systems FCE) is currently the most commonly used FCE tool in German rehabilitation centres. Our systematic review investigated the inter-rater, intra-rater and test-retest reliability of the WWS FCE. Methods We performed a systematic literature search of studies on the reliability of the WWS FCE and extracted item-specific measures of inter-rater, intra-rater and test-retest reliability from the identified studies. Intraclass correlation coefficients ≥ 0.75, percentages of agreement ≥ 80%, and kappa coefficients ≥ 0.60 were categorised as acceptable, otherwise they were considered non-acceptable. The extracted values were summarised for the five performance categories of the WWS FCE, and the results were classified as either consistent or inconsistent. Results From 11 identified studies, 150 item-specific reliability measures were extracted. 89% of the extracted inter-rater reliability measures, all of the intra-rater reliability measures and 96% of the test-retest reliability measures of the weight handling and strength tests had an acceptable level of reliability, compared to only 67% of the test-retest reliability measures of the posture/mobility tests and 56% of the test-retest reliability measures of the locomotion tests. Both of the extracted test-retest reliability measures of the balance test were acceptable. Conclusions Weight handling and strength tests were found to have consistently acceptable reliability. Further research is needed to explore the reliability of the other tests as inconsistent findings or a lack of data prevented definitive conclusions. PMID:24674029
Development and validation of the Japanese version of cognitive flexibility scale.

PubMed

Oshiro, Keiko; Nagaoka, Sawako; Shimizu, Eiji

2016-05-17

Various instruments have been developed to assess cognitive flexibility, which is an important construct in psychology. Among these, the self-report cognitive flexibility scale (CFS) is particularly popular for use with English speakers; however, there is not yet a Japanese version of this scale. This study reports on the development of a Japanese version of the cognitive flexibility scale (CFS-J), and the assessment of its internal consistency, test-retest reliability, and validities. We used the standard translation-back-translation process to develop the Japanese wording of the items and tested these using a sample of 335 eligible participants who did not have a mental illness, were aged 18 years or older, and lived in the suburbs of Tokyo. Participants included office workers, public servants, and college students; 71.6 % were women and 64.8 % were students. The translated scale's internal consistency reliability was assessed by calculating Cronbach's alpha and McDonald's omega, and test-retest reliability was assessed with 107 eligible participants via intra-class correlation coefficient (ICC) and Spearman's correlation of coefficient. Exploratory factory analysis (EFA) and correlations with other scales were used to examine the factor-based and concurrent validities of the CFS-J. Results indicated that the CFS-J has good internal consistency (Cronbach's alpha = 0.847, McDonald's omega = 0.871) and acceptable test-retest reliability (Spearman's = 0.687, ICC = 0.689). EFA provided evidence that the CFS-J has a one-factor structure and factor loadings were generally appropriate. The total CFS-J score was significantly and positively correlated with the cognitive flexibility inventory-Japanese version and its two subscales, along with the cognitive control scale and the positive subscale of the short Japanese version of the automatic thought questionnaire-revised (ATQ-R); further, it had a significantly negative correlation with the negative subscale of the ATQ-R (ps < 0.001). This study developed a Japanese version of the cognitive flexibility scale and confirmed its reliability and validity among a sample of people with no current mental illness, who were living in the suburbs of Tokyo.
Standardizing Foot-Type Classification Using Arch Index Values

PubMed Central

Weil, Rich; de Boer, Emily

2012-01-01

ABSTRACT Purpose: The lack of a reliable classification standard for foot type makes drawing conclusions from existing research and clinical decisions difficult, since different foot types may move and respond to treatment differently. The purpose of this study was to determine interrater agreement for foot-type classification based on photo-box-derived arch index values. Method: For this correlational study with two raters, a sample of 11 healthy volunteers with normal to obese body mass indices was recruited from both a community weight-loss programme and a programme in physical therapy. Arch index was calculated using AutoCAD software from footprint photographs obtained via mirrored photo-box. Classification as high-arched, normal, or low-arched foot type was based on arch index values. Reliability of the arch index was determined with intra-class correlations; agreement on foot-type classification was determined using quadratic weighted kappa (κw). Results: Average arch index was 0.215 for one tester and 0.219 for the second tester, with an overall range of 0.017 to 0.370. Both testers classified 6 feet as low-arched, 9 feet as normal, and 7 feet as high-arched. Interrater reliability for the arch index was ICC=0.90; interrater agreement for foot-type classification was κw=0.923. Conclusions: Classification of foot type based on arch index values derived from plantar footprint photographs obtained via mirrored photo-box showed excellent reliability in people with varying BMI. Foot-type classification may help clinicians and researchers subdivide sample populations to better differentiate mobility, gait, or treatment effects among foot types. PMID:23729964
Validation of the Pain Resilience Scale in Chinese-speaking patients with temporomandibular disorders pain.

PubMed

He, S L; Wang, J H; Ji, P

2018-03-01

To validate the Pain Resilience Scale (PRS) for use in Chinese patients with temporomandibular disorders (TMD) pain. According to international guidelines, the original PRS was first translated and cross-culturally adapted to formulate the Chinese version of PRS (PRS-C). A total of 152 patients with TMD pain were recruited to complete series of questionnaires. Reliability of the PRS-C was investigated using internal consistency and test-retest reliability. Validity of the PRS-C was calculated using cross-cultural validity and convergent validity. Cross-cultural validity was evaluated by examining the confirmatory factor analysis (CFA). And convergent validity was examined through correlating the PRS-C scores with scores of 2 commonly used pain-related measures (the Connor-Davidson Resilience Scale [CD-RISC] and the Tampa Scale for Kinesiophobia for Temporomandibular Disorders [TSK-TMD]). The PRS-C had a high internal consistency (Cronbach's alpha = 0.92) and good test-retest reliability (intra-class correlation coefficient [ICC] = 0.81). The CFA supported a 2-factor model for the PRS-C with acceptable fit to the data. The fit indices were chi-square/DF = 2.21, GFI = 0.91, TLI = 0.97, CFI = 0.98 and RMSEA = 0.08. As regards convergent validity, the PRS-C evidenced moderate-to-good relationships with the CD-RISC and the TSK-TMD. The PRS-C shows good psychometric properties and could be considered as a reliable and valid measure to evaluate pain-related resilience in patients with TMD pain. © 2017 John Wiley & Sons Ltd.
Psychometric testing of a Mandarin Chinese Version of the Clinically Useful Depression Outcome Scale for patients diagnosed with type 2 diabetes mellitus.

PubMed

Hsu, Lan-Fang; Kao, Ching-Chiu; Wang, Mei-Yeh; Chang, Chun-Jen; Tsai, Pei-Shan

2014-12-01

The Clinically Useful Depression Outcome Scale (CUDOS) is a self-report instrument that assesses symptoms and the severity of depression, but its psychometric properties in patients with type 2 diabetes mellitus in Chinese-Speaking populations are unknown. To examine the psychometric properties of the Mandarin Chinese version of the CUDOS (CUDOS-Chinese). A methodological research design. Endocrinology and metabolism outpatient clinics at 2 university-affiliated hospitals in northern Taiwan. Two-hundred and fourteen type 2 diabetic patients with the mean age of 62.6 years were enrolled, and two-hundred and twelve of them completed the study. Internal consistency, test-retest reliability, concurrent, and contrasted-groups validity were assessed. A receiver operating characteristic curve analysis was performed to assess sensitivity and specificity. Construct validity by means of confirmatory factor analysis was conducted. Internal consistency (Cronbach α of total scale and four subscales=0.93, 0.80, 0.66, 0.80, and 0.83, respectively), test-retest reliability (intra-class correlation coefficients of total scale and four subscales=0.92, 0.89, 0.94, 0.89, and 0.91, respectively), and strong correlations with the Beck Depression Inventory-II (r=0.87) suggested good reliability and validity. The confirmatory factor analysis supported a four-factor model. A cut-off score of 19/20 yielded 77.8% sensitivity and 75.6% specificity. The CUDOS-Chinese demonstrated satisfactory validity and reliability for detecting depression in type 2 diabetic patients in Taiwan. Copyright © 2014 Elsevier Ltd. All rights reserved.
Precision of lumbar intervertebral measurements: does a computer-assisted technique improve reliability?

PubMed

Pearson, Adam M; Spratt, Kevin F; Genuario, James; McGough, William; Kosman, Katherine; Lurie, Jon; Sengupta, Dilip K

2011-04-01

Comparison of intra- and interobserver reliability of digitized manual and computer-assisted intervertebral motion measurements and classification of "instability." To determine if computer-assisted measurement of lumbar intervertebral motion on flexion-extension radiographs improves reliability compared with digitized manual measurements. Many studies have questioned the reliability of manual intervertebral measurements, although few have compared the reliability of computer-assisted and manual measurements on lumbar flexion-extension radiographs. Intervertebral rotation, anterior-posterior (AP) translation, and change in anterior and posterior disc height were measured with a digitized manual technique by three physicians and by three other observers using computer-assisted quantitative motion analysis (QMA) software. Each observer measured 30 sets of digital flexion-extension radiographs (L1-S1) twice. Shrout-Fleiss intraclass correlation coefficients for intra- and interobserver reliabilities were computed. The stability of each level was also classified (instability defined as >4 mm AP translation or 10° rotation), and the intra- and interobserver reliabilities of the two methods were compared using adjusted percent agreement (APA). Intraobserver reliability intraclass correlation coefficients were substantially higher for the QMA technique THAN the digitized manual technique across all measurements: rotation 0.997 versus 0.870, AP translation 0.959 versus 0.557, change in anterior disc height 0.962 versus 0.770, and change in posterior disc height 0.951 versus 0.283. The same pattern was observed for interobserver reliability (rotation 0.962 vs. 0.693, AP translation 0.862 vs. 0.151, change in anterior disc height 0.862 vs. 0.373, and change in posterior disc height 0.730 vs. 0.300). The QMA technique was also more reliable for the classification of "instability." Intraobserver APAs ranged from 87 to 97% for QMA versus 60% to 73% for digitized manual measurements, while interobserver APAs ranged from 91% to 96% for QMA versus 57% to 63% for digitized manual measurements. The use of QMA software substantially improved the reliability of lumbar intervertebral measurements and the classification of instability based on flexion-extension radiographs.
A new scale for the assessment of performance and capacity of hand function in children with hemiplegic cerebral palsy: reliability and validity studies.

PubMed

Rosa-Rizzotto, M; Visonà Dalla Pozza, L; Corlatti, A; Luparia, A; Marchi, A; Molteni, F; Facchin, P; Pagliano, E; Fedrizzi, E

2014-10-01

In hemiplegic children, the recognition of the activity limitation pattern and the possibility of grading its severity are relevant for clinicians while planning interventions, monitoring results, predicting outcomes. Aim of the study is to examine the reliability and validity of Besta Scale, an instrument used to measure in hemiplegic children from 18 months to 12 years of age both grasp on request (capacity) and spontaneous use of upper limb (performance) in bimanual play activities and in ADL. Psychometric analysis of reliability and of validity of the Besta scale was performed. Outpatient study sample Reliability study: A sample of 39 patients was enrolled. The administration of Besta scale was video-recorded in a standardized manner. All videos were scored by 20 independent raters on subsequent viewing. 3 raters randomly selected from the 20-raters group rescored the same video two years later for intra-rater reliability. Intra and inter-rater reliability were calculated using Intraclass Correlation Coefficient (ICC) and Kendall's coefficient (K), respectively. Internal consistency reliability was assessed using Alpha's Chronbach coefficient. Validity study: a sample of 105 children was assessed 5 times (at t0 and 2, 3, 6 and 12 months later) by 20 independent raters. Each patient underwent at the same time to QUEST and Besta scale administration and assessment. Criterion validity was calculated using rho-Pearson coefficient. Reliability study: The inter-rater reliability calculated with Kendall's coefficient resulted moderate K=0.47. The intra-rater (or test-retest) reliability for 3 raters was excellent (ICC=0.927). The Cronbach's alpha for internal consistency was 0.972. Validity study: Besta scale showed a good criterion validity compared to QUEST increasing by age and severity of impairment. Rho Pearson's correlation coefficient r was 0.81 (P<0.0001). Limitations. Besta scales in infants finds hard to distinguish between mild to moderately impaired hand function. Besta scale scoring system is a valid and reliable tool, utilizable in a clinical setting to monitor evolution of unimanual and bimanual manipulation and to distinguish hand's capacity from performance.
Reliability of assessing postural control during seated balancing using a physical human-robot interaction.

PubMed

Ramadan, Ahmed; Cholewicki, Jacek; Radcliffe, Clark J; Popovich, John M; Reeves, N Peter; Choi, Jongeun

2017-11-07

This study evaluated the within- and between-visit reliability of a seated balance test for quantifying trunk motor control using input-output data. Thirty healthy subjects performed a seated balance test under three conditions: eyes open (EO), eyes closed (EC), and eyes closed with vibration to the lumbar muscles (VIB). Each subject performed three trials of each condition on three different visits. The seated balance test utilized a torque-controlled robotic seat, which together with a sitting subject resulted in a physical human-robot interaction (pHRI) (two degrees-of-freedom with upper and lower body rotations). Subjects balanced the pHRI by controlling trunk rotation in response to pseudorandom torque perturbations applied to the seat in the coronal plane. Performance error was expressed as the root mean square (RMSE) of deviations from the upright position in the time domain and as the mean bandpass signal energy (E mb ) in the frequency domain. Intra-class correlation coefficients (ICC) quantified the between-visit reliability of both RMSE and E mb . The empirical transfer function estimates (ETFE) from the perturbation input to each of the two rotational outputs were calculated. Coefficients of multiple correlation (CMC) quantified the within- and between-visit reliability of the averaged ETFE. ICCs of RMSE and E mb for all conditions were ≥0.84. The mean within- and between-visit CMCs were all ≥0.96 for the lower body rotation and ≥0.89 for the upper body rotation. Therefore, our seated balance test consisting of pHRI to assess coronal plane trunk motor control is reliable. Copyright © 2017 Elsevier Ltd. All rights reserved.
Measuring the quality of motivational interviewing in primary health care encounters: The development and validation of the motivational interviewing assessment scale (MIAS).

PubMed

Campiñez Navarro, Manuel; Pérula de Torres, Luis Ángel; Bosch Fontcuberta, Josep M; Barragán Brun, Nieves; Arbonies Ortiz, Juan Carlos; Novo Rodríguez, Jesús Manuel; Bóveda Fontán, Julia; Martín Alvarez, Remedios; Prados Castillejo, Jose Antonio; Rivas Doutreleau, Gabriela Renée; Domingo Peña, Carmen; Castro Moreno, Jaime Jesús; Romero Rodríguez, Esperanza María

2016-09-01

Motivational interviewing (MI) is a collaborative, goal-oriented method to help patients change behaviour. Tools that are often used to measure MI are the motivational interviewing skills code' (MISC), the 'motivational interviewing treatment integrity' (MITI) and the 'behaviour change counselling index' (BECCI). The first two instruments have not been designed to be used in primary healthcare (PHC) settings. The BECCI actually is time-consuming. The motivational interviewing assessment scale (MIAS, 'EVEM' in Spanish) was developed to measure MI in PHC encounters as an alternative to the previous instruments. To validate MIAS as an instrument to assess the quality of MI in PHC settings. (a) Sixteen experts in MI participated in the design, face and consensus validity, using a Delphi-type methodology. (b) 27 PHC centres located in Spain. four experts in MI tested its psychometric properties with 332 video recordings coming from the Dislip-EM study (consultations provided by 37 practitioners). dimensionality, internal consistency, reliability (intra-class correlation coefficient-ICC), sensitivity to change and convergent validity with the BECCI scale. A 14-item scale was obtained after the validation process. Factor analysis: two factors explained 76.6% of the total variance. Internal consistency, α = 0.99. Reliability: intra-rater ICC = 0.96; inter-rater ICC = 0.97. Sensitivity to change: means before and after training were 23.63 versus 38.57 (P < 0.001). Spearman's coefficient between the MIAS and the BECCI scale was 0.98 (P < 0.001). The MIAS is a consistent and reliable instrument to assess the use of MI in PHC settings. [Box: see text].
Development of the TeamOBS-PPH - targeting clinical performance in postpartum hemorrhage.

PubMed

Brogaard, Lise; Hvidman, Lone; Hinshaw, Kim; Kierkegaard, Ole; Manser, Tanja; Musaeus, Peter; Arafeh, Julie; Daniels, Kay I; Judy, Amy E; Uldbjerg, Niels

2018-06-01

This study aimed to develop a valid and reliable TeamOBS-PPH tool for assessing clinical performance in the management of postpartum hemorrhage (PPH). The tool was evaluated using video-recordings of teams managing PPH in both real-life and simulated settings. A Delphi panel consisting of 12 obstetricians from the UK, Norway, Sweden, Iceland, and Denmark achieved consensus on (i) the elements to include in the assessment tool, (ii) the weighting of each element, and (iii) the final tool. The validity and reliability were evaluated according to Cook and Beckman. (Level 1) Four raters scored four video-recordings of in situ simulations of PPH. (Level 2) Two raters scored 85 video-recordings of real-life teams managing patients with PPH ≥1000 mL in two Danish hospitals. (Level 3) Two raters scored 15 video-recordings of in situ simulations of PPH from a US hospital. The tool was designed with scores from 0 to 100. (Level 1) Teams of novices had a median score of 54 (95% CI 48-60), whereas experienced teams had a median score of 75 (95% CI 71-79; p < 0.001). (Level 2) The intra-rater [intra-class correlation (ICC) = 0.96] and inter-rater (ICC = 0.83) agreements for real-life PPH were strong. The tool was applicable in all cases: atony, retained placenta, and lacerations. (Level 3) The tool was easily adapted to in situ simulation settings in the USA (ICC = 0.86). The TeamOBS-PPH tool appears to be valid and reliable for assessing clinical performance in real-life and simulated settings. The tool will be shared as the free TeamOBS App. © 2018 Nordic Federation of Societies of Obstetrics and Gynecology.
PubMed Central

FARNETI, D.; FATTORI, B.; NACCI, A.; MANCINI, V.; SIMONELLI, M.; RUOPPOLO, G.; GENOVESE, E.

2014-01-01

SUMMARY This study evaluated the intra- and inter-rater reliability of the Pooling score (P-score) in clinical endoscopic evaluation of severity of swallowing disorder, considering excess residue in the pharynx and larynx. The score (minimum 4 - maximum 11) is obtained by the sum of the scores given to the site of the bolus, the amount and ability to control residue/bolus pooling, the latter assessed on the basis of cough, raclage, number of dry voluntary or reflex swallowing acts (< 2, 2-5, > 5). Four judges evaluated 30 short films of pharyngeal transit of 10 solid (1/4 of a cracker), 11 creamy (1 tablespoon of jam) and 9 liquid (1 tablespoon of 5 cc of water coloured with methlyene blue, 1 ml in 100 ml) boluses in 23 subjects (10 M/13 F, age from 31 to 76 yrs, mean age 58.56±11.76 years) with different pathologies. The films were randomly distributed on two CDs, which differed in terms of the sequence of the films, and were given to judges (after an explanatory session) at time 0, 24 hours later (time 1) and after 7 days (time 2). The inter- and intra-rater reliability of the P-score was calculated using the intra-class correlation coefficient (ICC; 3,k). The possibility that consistency of boluses could affect the scoring of the films was considered. The ICC for site, amount, management and the P-score total was found to be, respectively, 0.999, 0.997, 1.00 and 0.999. Clinical evaluation of a criterion of severity of a swallowing disorder remains a crucial point in the management of patients with pathologies that predispose to complications. The P-score, derived from static and dynamic parameters, yielded a very high correlation among the scores attributed by the four judges during observations carried out at different times. Bolus consistencies did not affect the outcome of the test: the analysis of variance, performed to verify if the scores attributed by the four judges to the parameters selected, might be influenced by the different consistencies of the boluses, was not significant. These initial data validate the clinical use of the P-score in the management of patients with deglutition disorders by a multidisciplinary team. PMID:24843220
The reliability and validity of a designed setup for the assessment of static back extensor force and endurance in older women with and without hyperkyphosis.

PubMed

Roghani, Taybeh; Khalkhali Zavieh, Minoo; Rahimi, Abbas; Talebian, Saeed; Manshadi, Farideh Dehghan; Akbarzadeh Baghban, Alireza; King, Nicole; Katzman, Wendy

2018-01-25

The purpose of this study was to investigate the intra-rater reliability and validity of a designed load cell setup for the measurement of back extensor muscle force and endurance. The study sample included 19 older women with hyperkyphosis, mean age 67.0 ± 5.0 years, and 14 older women without hyperkyphosis, mean age 63.0 ± 6.0 years. Maximum back extensor force and endurance were measured in a sitting position with a designed load cell setup. Tests were performed by the same examiner on two separate days within a 72-hour interval. The intra-rater reliability of the measurements was analyzed using intraclass correlation coefficient (ICC), standard errors of measurement (SEM), and minimal detectable change (MDC). The validity of the setup was determined using Pearson correlation analysis and independent t-test. Using our designed load cell, the values of ICC indicated very high reliability of force measurement (hyperkyphosis group: 0.96, normal group: 0.97) and high reliability of endurance measurement (hyperkyphosis group: 0.82, normal group: 0.89). For all tests, the values of SEM and MDC were low in both groups. A significant correlation between two documented forces (load cell force and target force) and significant differences in the muscle force and endurance among the two groups were found. The measurements of static back muscle force and endurance are reliable and valid with our designed setup in older women with and without hyperkyphosis.

Reliability testing of a portfolio assessment tool for postgraduate family medicine training in South Africa

PubMed Central

Mash, Bob; Derese, Anselme

2013-01-01

Abstract Background Competency-based education and the validity and reliability of workplace-based assessment of postgraduate trainees have received increasing attention worldwide. Family medicine was recognised as a speciality in South Africa six years ago and a satisfactory portfolio of learning is a prerequisite to sit the national exit exam. A massive scaling up of the number of family physicians is needed in order to meet the health needs of the country. Aim The aim of this study was to develop a reliable, robust and feasible portfolio assessment tool (PAT) for South Africa. Methods Six raters each rated nine portfolios from the Stellenbosch University programme, using the PAT, to test for inter-rater reliability. This rating was repeated three months later to determine test–retest reliability. Following initial analysis and feedback the PAT was modified and the inter-rater reliability again assessed on nine new portfolios. An acceptable intra-class correlation was considered to be > 0.80. Results The total score was found to be reliable, with a coefficient of 0.92. For test–retest reliability, the difference in mean total score was 1.7%, which was not statistically significant. Amongst the subsections, only assessment of the educational meetings and the logbook showed reliability coefficients > 0.80. Conclusion This was the first attempt to develop a reliable, robust and feasible national portfolio assessment tool to assess postgraduate family medicine training in the South African context. The tool was reliable for the total score, but the low reliability of several sections in the PAT helped us to develop 12 recommendations regarding the use of the portfolio, the design of the PAT and the training of raters.
Intra and Inter-Rater Reliability of Screening for Movement Impairments: Movement Control Tests from The Foundation Matrix

PubMed Central

Mischiati, Carolina R.; Comerford, Mark; Gosford, Emma; Swart, Jacqueline; Ewings, Sean; Botha, Nadine; Stokes, Maria; Mottram, Sarah L.

2015-01-01

Pre-season screening is well established within the sporting arena, and aims to enhance performance and reduce injury risk. With the increasing need to identify potential injury with greater accuracy, a new risk assessment process has been produced; The Performance Matrix (battery of movement control tests). As with any new method of objective testing, it is fundamental to establish whether the same results can be reproduced between examiners and by the same examiner on consecutive occasions. This study aimed to determine the intra-rater test re-test and inter-rater reliability of tests from a component of The Performance Matrix, The Foundation Matrix. Twenty participants were screened by two experienced musculoskeletal therapists using nine tests to assess the ability to control movement during specific tasks. Movement evaluation criteria for each test were rated as pass or fail. The therapists observed participants real-time and tests were recorded on video to enable repeated ratings four months later to examine intra-rater reliability (videos rated two weeks apart). Overall test percentage agreement was 87% for inter-rater reliability; 98% Rater 1, 94% Rater 2 for test re-test reliability; and 75% for real-time versus video. Intraclass-correlation coefficients (ICCs) were excellent between raters (0.81) and within raters (Rater 1, 0.96; Rater 2, 0.88) but poor for real-time versus video (0.23). Reliability for individual components of each test was more variable: inter-rater, 68-100%; intra-rater, 88-100% Rater 1, 75-100% Rater 2; and real-time versus video 31-100%. Cohen’s Kappa values for inter-rater reliability were 0.0-1.0; intra-rater 0.6-1.0 for Rater 1; -0.1-1.0 for Rater 2; and -0.1-1 for real-time versus video. It is concluded that both inter and intra-rater reliability of tests in The Foundation Matrix are acceptable when rated by experienced therapists. Recommendations are made for modifying some of the criteria to improve reliability where excellence was not reached. Key points The movement control tests of The Foundation Matrix had acceptable reliability between raters and within raters on different days Agreement between observations made on tests performed real-time and on video recordings was low, indicating poor validity of use of video recordings Some movement evaluation criteria related to specific tests that did not achieve excellent agreement could be modified to improve reliability PMID:25983594
Functional sensibility assessment. Part I: develop a reliable apparatus to assess momentary pinch force control.

PubMed

Chiu, Haw-Yen; Hsu, Hsiu-Yun; Kuo, Li-Chieh; Chang, Jer-Hao; Su, Fong-Chin

2009-08-01

A precise magnitude and timing control of pinch performance is based on accurate feed-forward and feedback control mechanisms. Ratio of peak pinch force and maximum load force during a functional performance is a sensitive parameter to reflect the ability to scale pinch force output according to actual loads. A pinch apparatus was constructed to detect momentary pinch force modulation of 20 subjects with normal hand sensation. The results indicated high intra-class correlation coefficient and small coefficient of variation of the detected force ratio among three repeated tests, which represented that the stability test of the measured response confirmed the feasibility of this apparatus. The force ratio for a 480 g object with a steel surface ranged between 1.77 and 1.98. Normal subjects were able to scale and contribute pinch force precisely to a pinch-holding-up test. This study may provide clinicians a reliable apparatus and method to analyze the recovery of functional sensibility in patients with nerve injuries. Copyright 2009 Orthopaedic Research Society. Published by Wiley Periodicals, Inc.
Validity and reliability of a video questionnaire to assess physical function in older adults.

PubMed

Balachandran, Anoop; N Verduin, Chelsea; Potiaumpai, Melanie; Ni, Meng; Signorile, Joseph F

2016-08-01

Self-report questionnaires are widely used to assess physical function in older adults. However, they often lack a clear frame of reference and hence interpreting and rating task difficulty levels can be problematic for the responder. Consequently, the usefulness of traditional self-report questionnaires for assessing higher-level functioning is limited. Video-based questionnaires can overcome some of these limitations by offering a clear and objective visual reference for the performance level against which the subject is to compare his or her perceived capacity. Hence the purpose of the study was to develop and validate a novel, video-based questionnaire to assess physical function in older adults independently living in the community. A total of 61 community-living adults, 60years or older, were recruited. To examine validity, 35 of the subjects completed the video questionnaire, two types of physical performance tests: a test of instrumental activity of daily living (IADL) included in the Short Physical Functional Performance battery (PFP-10), and a composite of 3 performance tests (30s chair stand, single-leg balance and usual gait speed). To ascertain reliability, two-week test-retest reliability was assessed in the remaining 26 subjects who did not participate in validity testing. The video questionnaire showed a moderate correlation with the IADLs (Spearman rho=0.64, p<0.001; 95% CI (0.4, 0.8)), and a lower correlation with the composite score of physical performance tests (Spearman rho=0.49, p<0.01; 95% CI (0.18, 0.7)). The test-retest assessment yielded an intra-class correlation (ICC) of 0.87 (p<0.001; 95% CI (0.70, 0.94)) and a Cronbach's alpha of 0.89 demonstrating good reliability and internal consistency. Our results show that the video questionnaire developed to evaluate physical function in community-living older adults is a valid and reliable assessment tool; however, further validation is needed for definitive conclusions. Copyright © 2016 Elsevier Inc. All rights reserved.
Development and validation of the coronary heart disease scale under the system of quality of life instruments for chronic diseases QLICD-CHD: combinations of classical test theory and Generalizability Theory.

PubMed

Wan, Chonghua; Li, Hezhan; Fan, Xuejin; Yang, Ruixue; Pan, Jiahua; Chen, Wenru; Zhao, Rong

2014-06-04

Quality of life (QOL) for patients with coronary heart disease (CHD) is now concerned worldwide with the specific instruments being seldom and no one developed by the modular approach. This paper is aimed to develop the CHD scale of the system of Quality of Life Instruments for Chronic Diseases (QLICD-CHD) by the modular approach and validate it by both classical test theory and Generalizability Theory. The QLICD-CHD was developed based on programmed decision procedures with multiple nominal and focus group discussions, in-depth interview, pre-testing and quantitative statistical procedures. 146 inpatients with CHD were used to provide the data measuring QOL three times before and after treatments. The psychometric properties of the scale were evaluated with respect to validity, reliability and responsiveness employing correlation analysis, factor analyses, multi-trait scaling analysis, t-tests and also G studies and D studies of Genralizability Theory analysis. Multi-trait scaling analysis, correlation and factor analyses confirmed good construct validity and criterion-related validity when using SF-36 as a criterion. The internal consistency α and test-retest reliability coefficients (Pearson r and Intra-class correlations ICC) for the overall instrument and all domains were higher than 0.70 and 0.80 respectively; The overall and all domains except for social domain had statistically significant changes after treatments with moderate effect size SRM (standardized response mea) ranging from 0.32 to 0.67. G-coefficients and index of dependability (Ф coefficients) confirmed the reliability of the scale further with more exact variance components. The QLICD-CHD has good validity, reliability, and moderate responsiveness and some highlights, and can be used as the quality of life instrument for patients with CHD. However, in order to obtain better reliability, the numbers of items for social domain should be increased or the items' quality, not quantity, should be improved.
Measuring reliable change in cognition using the Edinburgh Cognitive and Behavioural ALS Screen (ECAS).

PubMed

Crockford, Christopher; Newton, Judith; Lonergan, Katie; Madden, Caoifa; Mays, Iain; O'Sullivan, Meabhdh; Costello, Emmet; Pinto-Grau, Marta; Vajda, Alice; Heverin, Mark; Pender, Niall; Al-Chalabi, Ammar; Hardiman, Orla; Abrahams, Sharon

2018-02-01

Cognitive impairment affects approximately 50% of people with amyotrophic lateral sclerosis (ALS). Research has indicated that impairment may worsen with disease progression. The Edinburgh Cognitive and Behavioural ALS Screen (ECAS) was designed to measure neuropsychological functioning in ALS, with its alternate forms (ECAS-A, B, and C) allowing for serial assessment over time. The aim of the present study was to establish reliable change scores for the alternate forms of the ECAS, and to explore practice effects and test-retest reliability of the ECAS's alternate forms. Eighty healthy participants were recruited, with 57 completing two and 51 completing three assessments. Participants were administered alternate versions of the ECAS serially (A-B-C) at four-month intervals. Intra-class correlation analysis was employed to explore test-retest reliability, while analysis of variance was used to examine the presence of practice effects. Reliable change indices (RCI) and regression-based methods were utilized to establish change scores for the ECAS alternate forms. Test-retest reliability was excellent for ALS Specific, ALS Non-Specific, and ECAS Total scores of the combined ECAS A, B, and C (all > .90). No significant practice effects were observed over the three testing sessions. RCI and regression-based methods produced similar change scores. The alternate forms of the ECAS possess excellent test-retest reliability in a healthy control sample, with no significant practice effects. The use of conservative RCI scores is recommended. Therefore, a change of ≥8, ≥4, and ≥9 for ALS Specific, ALS Non-Specific, and ECAS Total score is required for reliable change.
An assessment of the intra- and inter-reliability of the lumbar paraspinal muscle parameters using CT scan and magnetic resonance imaging.

PubMed

Hu, Zhi-Jun; He, Jian; Zhao, Feng-Dong; Fang, Xiang-Qian; Zhou, Li-Na; Fan, Shun-Wu

2011-06-01

A reliability study was conducted. To estimate the intra- and intermeasurement errors in the measurements of functional cross-sectional area (FCSA), density, and T2 signal intensity of paraspinal muscles using computed tomography (CT) scan and magnetic resonance imaging (MRI). CT scan and MRI had been used widely to measure the cross-sectional area and degeneration of the back muscles in spine and muscle research. But there is still no systemic study to analyze the reliability of these measurements. This study measured the FCSA and fatty infiltration (density on CT scan and T2 signal intensity on MRI) of the paraspinal muscles at L3-L4, L4-L5, and L5-S1 in 29 patients with chronic low back pain. Two experienced musculoskeletal radiologists and one superior spine surgeon traced the region of interest twice within 3 weeks for measurement of the intra- and interobserver reliability. The intraclass correlation coefficients (ICCs) of the intra-reliability ranged from fair to excellent for FCSA, and good to excellent for fatty infiltration. The ICCs of the inter-reliability ranged from fair to excellent for FCSA, and good to excellent for fatty infiltration. There were no significant differences between CT scan and MRI in reliability results, except in the relative standard error of fatty infiltration measurement. The ICCs of the FCSA measurement between CT scan and MRI ranged from poor to good. The reliabilities of the CT scan and MRI for measuring the FCSA and fatty infiltration of the atrophied lumbar paraspinal muscles were acceptable. It was reliable for using uniform one image method for a single paraspinal muscle evaluation study. And the authors preferred to advise the MRI other than CT scan for paraspinal muscles measurements of FCSA and fatty infiltration.
Random spot urine protein to creatinine ratio is a reliable measure of proteinuria in lupus nephritis in Koreans.

PubMed

Choi, In Ah; Park, Jin Kyun; Lee, Eun Young; Song, Yeong Wook; Lee, Eun Bong

2013-01-01

The accurate assessment of proteinuria is critical for the management of lupus nephritis. Measuring the protein to creatinine (P/C) ratio in random spot urine (RSU) samples has been introduced as an alternative to the 24-hour (24h) urine collection method. However, it remains unclear as to whether the RSU P/C ratio is reliable for assessing lupus nephritis (LN) in routine clinical practice. In total, 275 pairs of 24h urine and RSU samples from 102 patients with biopsy-proven LN were analysed. The correlation and concordance between the P/C ratios in the two sample types were assessed by Pearson or Spearman correlation and intra-class correlation coefficient (ICC) using mixed models for repeated measurements, respectively. The mean 24h urine P/C ratio was 3.2 ± 4.9. Overall, RSU P/C ratio correlated strongly with the 24h urine P/C ratio (r=0.944, p<0.001) with an excellent agreement (ICC=0.949, 95% confidence interval [CI]: 0.69-1.00). Subgroup analyses revealed that the correlation remained high in class II, III, IV, and V LN (rho=0.868, p<0.001; rho=0.649, p=0.007; r=0.945, p<0.001; and rho=0.900, p=0.001, respectively). The correlation between the 24h urine and RSU P/C ratio in the range of 0.5 to 3 was good (r=0.720, p<0.001) with ICC of 0.659 (95%CI 0.554-0.812). RSU P/C ratio ≥0.5 could predict 24h PCR ≥0.5 with 91.7% sensitivity and 70.2% specificity, whereas RSU P/C ratio ≥1.0 increased specificity up to 94.7%. The RSU P/C ratio is an excellent alternative to the 24 hour P/C ratio for assessing the presence of clinically significant proteinuria in LN. RSU P/C ratio >1.0 may prompt directly to a renal biopsy, whereas RSU P/C ratio between 0.5-1.0 should be followed by a confirmatory 24h urine collection.
Validity and Reliability of Spine Rasterstereography in Patients With Adolescent Idiopathic Scoliosis.

PubMed

Tabard-Fougère, Anne; Bonnefoy-Mazure, Alice; Hanquinet, Sylviane; Lascombes, Pierre; Armand, Stéphane; Dayer, Romain

2017-01-15

Test-retest study. This study aimed to evaluate the validity and reliability of rasterstereography in patients with adolescent idiopathic scoliosis (AIS) with a major curve Cobb angle (CA) between 10° and 40° for frontal, sagittal, and transverse parameters. Previous studies evaluating the validity and reliability of rasterstereography concluded that this technique had good accuracy compared with radiographs and a high intra- and interday reliability in healthy volunteers. To the best of our knowledge, the validity and reliability have not been assessed in AIS patients. Thirty-five adolescents with AIS (male = 13) aged 13.1 ± 2.0 years were included. To evaluate the validity of the scoliosis angle (SA) provided by rasterstereography, a comparison (t test, Pearson correlation) was performed with the CA obtained using 2D EOS® radiography (XR). Three rasterstereographic repeated measurements were independently performed by two operators on the same day (interrater reliability) and again by the first operator 1 week later (intrarater reliability). The variables of interest were the SA, lumbar lordosis, and thoracic kyphosis angle, trunk length, pelvic obliquity, and maximum, root mean square and amplitude of vertebral rotations. The data analyses used intraclass correlation coefficients (ICCs). The CA and SA were strongly correlated (R = 0.70) and were nonsignificantly different (P = 0.60). The intrarater reliability (same day: ICC [1, 1], n = 35; 1 week later: ICC [1, 3], n = 28) and interrater reliability (ICC [3, 3], n = 16) were globally excellent (ICC > 0.75) except for the assessment of pelvic obliquity. This study showed that the rasterstereographic system allows for the evaluation of AIS patients with a good validity compared with XR with an overall excellent intra- and interrater reliability. Based on these results, this automatic, fast, and noninvasive system can be used for monitoring the evolution of AIS in growing patients instead of repetitive radiographs, thereby reducing radiation exposure and decreasing costs. 4.
Carotid and vertebral injury study (CAVIS) technique for characterization of blunt traumatic aneurysms with reliability assessment.

PubMed

Griessenauer, Christoph J; Foreman, Paul; Shoja, Mohammadali M; Kicielinski, Kimberly P; Deveikis, John P; Walters, Beverly C; Harrigan, Mark R

2015-04-01

Traumatic aneurysms occur in up to 20% of blunt traumatic extracranial carotid artery injuries. Currently there is no standardized method for characterization of traumatic aneurysms. For the carotid and vertebral injury study (CAVIS), a prospective study of traumatic cerebrovascular injury, we established a method for aneurysm characterization and tested its reliability. Saccular aneurysm size was defined as the greatest linear distance between the expected location of the normal artery wall and the outer edge of the aneurysm lumen ("depth"). Fusiform aneurysm size was defined as the "depth" and longitudinal distance ("length") paralleling the normal artery. The size of the aneurysm relative to the normal artery was also assessed. Reliability measurements were made using four raters who independently reviewed 15 computed tomographic angiograms (CTAs) and 13 digital subtraction angiograms (DSAs) demonstrating a traumatic aneurysm of the internal carotid artery. Raters categorized the aneurysms as either "saccular" or "fusiform" and made measurements. Five scans of each imaging modality were repeated to evaluate intra-rater reliability. Fleiss's free-marginal multi-rater kappa (κ), Cohen's kappa (κ), and interclass correlation coefficient (ICC) determined inter- and intra-rater reliability. Inter-rater agreement as to the aneurysm "shape" was almost perfect for CTA (κ = 0.82) and DSA (κ = 0.897). Agreements on aneurysm "depth," "length," "aneurysm plus parent artery," and "parent artery" for CTA and DSA were excellent (ICC > 0.75). Intra-rater agreement as to aneurysm "shape" was substantial to almost perfect (κ > 0.60). The CAVIS method of traumatic aneurysm characterization has remarkable inter- and intra-rater reliability and will facilitate further studies of the natural history and management of extracranial cerebrovascular traumatic aneurysms. © The Author(s) 2015 Reprints and permissions: sagepub.co.uk/journalsPermissions.nav.
Inter-clinician and intra-clinician reliability of force application during joint mobilization: a systematic review.

PubMed

Gorgos, Kara S; Wasylyk, Nicole T; Van Lunen, Bonnie L; Hoch, Matthew C

2014-04-01

Joint mobilizations are commonly used by clinicians to decrease pain and restore joint arthrokinematics following musculoskeletal injury. The force applied during a joint mobilization treatment is subjective to the individual clinician but may have an effect on patient outcomes. The purpose of this systematic review was to critically appraise and synthesize the studies which examined the reliability of clinicians' force application during joint mobilization. A systematic search of PubMed and EBSCO Host databases from inception to March 1, 2013 was conducted to identify studies assessing the reliability of force application during joint mobilizations. Two reviewers utilized the Quality Appraisal of Reliability Studies (QAREL) assessment tool to determine the quality of included studies. The relative reliability of the included studies was examined through intraclass correlation coefficients (ICC) to synthesize study findings. All results were collated qualitatively with a level of evidence approach. A total of seven studies met the eligibility and were included. Five studies were included that assessed inter-clinician reliability, and six studies were included that assessed intra-clinician reliability. The overall level of evidence for inter-clinician reliability was strong for poor-to-moderate reliability (ICC = -0.04 to 0.70). The overall level of evidence for intra-clinician reliability was strong for good reliability (ICC = 0.75-0.99). This systematic review indicates there is variability in force application between clinicians but individual clinicians apply forces consistently. The results of this systematic review suggest innovative instructional methods are needed to improve consistency and validate the forces applied during of joint mobilization treatments. This is particularly evident for improving the consistency of force application across clinicians. Copyright © 2014 Elsevier Ltd. All rights reserved.
Surgeon Reliability for the Assessment of Lumbar Spinal Stenosis on MRI: The Impact of Surgeon Experience.

PubMed

Marawar, Satyajit V; Madom, Ian A; Palumbo, Mark; Tallarico, Richard A; Ordway, Nathaniel R; Metkar, Umesh; Wang, Dongliang; Green, Adam; Lavelle, William F

2017-01-01

Treating surgeon's visual assessment of axial MRI images to ascertain the degree of stenosis has a critical impact on surgical decision-making. The purpose of this study was to prospectively analyze the impact of surgeon experience on inter-observer and intra-observer reliability of assessing severity of spinal stenosis on MRIs by spine surgeons directly involved in surgical decision-making. Seven fellowship trained spine surgeons reviewed MRI studies of 30 symptomatic patients with lumbar stenosis and graded the stenosis in the central canal, the lateral recess and the foramen at T12-L1 to L5-S1 as none, mild, moderate or severe. No specific instructions were provided to what constituted mild, moderate, or severe stenosis. Two surgeons were "senior" (>fifteen years of practice experience); two were "intermediate" (>four years of practice experience), and three "junior" (< one year of practice experience). The concordance correlation coefficient (CCC) was calculated to assess inter-observer reliability. Seven MRI studies were duplicated and randomly re-read to evaluate inter-observer reliability. Surgeon experience was found to be a strong predictor of inter-observer reliability. Senior inter-observer reliability was significantly higher assessing central(p<0.001), foraminal p=0.005 and lateral p=0.001 than "junior" group.Senior group also showed significantly higher inter-observer reliability that intermediate group assessing foraminal stenosis (p=0.036). In intra-observer reliability the results were contrary to that found in inter-observer reliability. Inter-observer reliability of assessing stenosis on MRIs increases with surgeon experience. Lower intra-observer reliability values among the senior group, although not clearly explained, may be due to the small number of MRIs evaluated and quality of MRI images.Level of evidence: Level 3.
Test Re-Test Reliability of Four Versions of the 3-Cone Test in Non-Athletic Men

PubMed Central

Langley, Jason G.; Chetlin, Robert D.

2017-01-01

Until recently, measurement and evaluation in sport science, especially agility testing, has not always included key elements of proper test construction. Often tests are published without reporting reliability and validity analysis for a specific population. The purpose of the present study was to examine the test re-test reliability of four versions of the 3-Cone Test (3CT), and provide guidance on proper test construction for testing agility in athletic populations. Forty male students enrolled in classes in the Department of Physical Education at a mid-Atlantic university participated. On each of test day participants performed 10 trials. In random order, they performed three trials to the right (3CTR, standard test), three to the left (3CTL), and two modified trials (3CTAR and 3CTAL), which included a reactive component in which a visual cue was given to indicate direction. Intra-class correlation coefficients (ICC) indicated a moderate to high reliability for the four tests, 3CTR 0.79 (0.64-0.88, 95%CI), 3CTL 0.73 (0.55-0.85), 3CTAR 0.85(0.74-0.92), and 3CTAL 0.79 (0.64-0.88). Small standard error of the measurement (SEM) was found; range 0.09 to 0.10. Pearson correlations between tests were high (0.82-0.92) on day one as well as day two (0.72-0.85). These results indicate each version of the 3-Cone Test is reliable; however, further tests are needed with specific athletic populations. Only the 3CTAR and 3CTAL are tests of agility due to the inclusion of a reactive component. Future studies examining agility testing and training should incorporate technological elements, including automated timing systems and motion capture analysis. Such instrumentation will allow for optimal design of tests that simulate sport-specific game conditions. Key points The commonly used 3-cone test (upside down “L” to the right”) is a reliable change of direction speed (CODS) test when evaluating collegiate males. A modification of the CODS 3-cone test (upside down “L” to the left instead of to the right) is also reliable for evaluating collegiate males. A modification of the 3-cone that includes reaction and a choice of a cut to the left or right remains reliable as now an agility test version in collegiate males. There are moderate to high correlation between the 4 versions of the tests. Reaction remains a critical to the design of testing and training agility protocols, and should be investigated similarly to various athletes including novice/expert, male/female, and nearly every sporting event. PMID:28344450
Effects of a common transcranial direct current stimulation (tDCS) protocol on motor evoked potentials found to be highly variable within individuals over 9 testing sessions.

PubMed

Horvath, Jared Cooney; Vogrin, Simon J; Carter, Olivia; Cook, Mark J; Forte, Jason D

2016-09-01

Transcranial direct current stimulation (tDCS) uses a weak electric current to modulate neuronal activity. A neurophysiologic outcome measure to demonstrate reliable tDCS modulation at the group level is transcranial magnetic stimulation engendered motor evoked potentials (MEPs). Here, we conduct a study testing the reliability of individual MEP response patterns following a common tDCS protocol. Fourteen participants (7m/7f) each underwent nine randomized sessions of 1 mA, 10 min tDCS (3 anode; 3 cathode; 3 sham) delivered using an M1/orbito-frontal electrode montage (sessions separated by an average of ~5.5 days). Fifteen MEPs were obtained prior to, immediately following and in 5 min intervals for 30 min following tDCS. TMS was delivered at 130 % resting motor threshold using neuronavigation to ensure consistent coil localization. A number of non-experimental variables were collected during each session. At the individual level, considerable variability was seen among different testing sessions. No participant demonstrated an excitatory response ≥20 % to all three anodal sessions, and no participant demonstrated an inhibitory response ≥20 % to all three cathodal sessions. Intra-class correlation revealed poor anodal and cathodal test-retest reliability [anode: ICC(2,1) = 0.062; cathode: ICC(2,1) = 0.055] and moderate sham test-retest reliability [ICC(2,1) = 0.433]. Results also revealed no significant effect of tDCS at the group level. Using this common protocol, we found the effects of tDCS on MEP amplitudes to be highly variable at the individual level. In addition, no significant effects of tDCS on MEP amplitude were found at the group level. Future studies should consider utilizing a more strict experimental protocol to potentially account for intra-individual response variations.
Reliability of 3D laser-based anthropometry and comparison with classical anthropometry.

PubMed

Kuehnapfel, Andreas; Ahnert, Peter; Loeffler, Markus; Broda, Anja; Scholz, Markus

2016-05-26

Anthropometric quantities are widely used in epidemiologic research as possible confounders, risk factors, or outcomes. 3D laser-based body scans (BS) allow evaluation of dozens of quantities in short time with minimal physical contact between observers and probands. The aim of this study was to compare BS with classical manual anthropometric (CA) assessments with respect to feasibility, reliability, and validity. We performed a study on 108 individuals with multiple measurements of BS and CA to estimate intra- and inter-rater reliabilities for both. We suggested BS equivalents of CA measurements and determined validity of BS considering CA the gold standard. Throughout the study, the overall concordance correlation coefficient (OCCC) was chosen as indicator of agreement. BS was slightly more time consuming but better accepted than CA. For CA, OCCCs for intra- and inter-rater reliability were greater than 0.8 for all nine quantities studied. For BS, 9 of 154 quantities showed reliabilities below 0.7. BS proxies for CA measurements showed good agreement (minimum OCCC > 0.77) after offset correction. Thigh length showed higher reliability in BS while upper arm length showed higher reliability in CA. Except for these issues, reliabilities of CA measurements and their BS equivalents were comparable.
Compartment elasticity measured by pressure-related ultrasound to determine patients "at risk" for compartment syndrome: an experimental in vitro study.

PubMed

Sellei, Richard Martin; Hingmann, Simon Johannes; Kobbe, Philipp; Weber, Christian; Grice, John Edward; Zimmerman, Frauke; Jeromin, Sabine; Hildebrand, Frank; Pape, Hans-Christoph

2015-01-01

Decision-making in treatment of an acute compartment syndrome is based on clinical assessment, supported by invasive monitoring. Thus, evolving compartment syndrome may require repeated pressure measurements. In suspected cases of potential compartment syndromes clinical assessment alone seems to be unreliable. The objective of this study was to investigate the feasibility of a non-invasive application estimating whole compartmental elasticity by ultrasound, which may improve accuracy of diagnostics. In an in vitro model, using an artificial container simulating dimensions of the human anterior tibial compartment, intra-compartmental pressures (p) were raised subsequently up to 80 mmHg by infusion of saline solution. The compartmental depth (mm) in the cross-section view was measured before and after manual probe compression (100 mmHg) upon the surface resulting in a linear compartmental displacement (∆d). This was repeated at rising compartmental pressures. The resulting displacements were related to the corresponding intra-compartmental pressures simulated in our model. A hypothesized relationship between pressures related compartmental displacement and the elasticity at elevated compartment pressures was investigated. With rising compartmental pressures, a non-linear, reciprocal proportional relation between the displacement (mm) and the intra-compartmental pressure (mmHg) occurred. The Pearson coefficient showed a high correlation (r(2) = -0.960). The intra-observer reliability value kappa resulted in a statistically high reliability (κ = 0.840). The inter-observer value indicated a fair reliability (κ = 0.640). Our model reveals that a strong correlation between compartmental strain displacements assessed by ultrasound and the intra-compartmental pressure changes occurs. Further studies are required to prove whether this assessment is transferable to human muscle tissue. Determining the complete compartmental elasticity by ultrasound enhancement, this application may improve detection of early signs of potential compartment syndrome.
Screening of the spine in adolescents: inter- and intra-rater reliability and measurement error of commonly used clinical tests.

PubMed

Aartun, Ellen; Degerfalk, Anna; Kentsdotter, Linn; Hestbaek, Lise

2014-02-10

Evidence on the reliability of clinical tests used for the spinal screening of children and adolescents is currently lacking. The aim of this study was to determine the inter- and intra-rater reliability and measurement error of clinical tests commonly used when screening young spines. Two experienced chiropractors independently assessed 111 adolescents aged 12-14 years who were recruited from a primary school in Denmark. A standardised examination protocol was used to test inter-rater reliability including tests for scoliosis, hypermobility, general mobility, inter-segmental mobility and end range pain in the spine. Seventy-five of the 111 subjects were re-examined after one to four hours to test intra-rater reliability. Percentage agreement and Cohen's Kappa were calculated for binary variables, and interclass correlation (ICC) and Bland-Altman plots with Limits of Agreement (LoA) were calculated for continuous measures. Inter-rater percentage agreement for binary data ranged from 59.5% to 100%. Kappa ranged from 0.06-1.00. Kappa ≥ 0.40 was seen for elbow, thumb, fifth finger and trunk/hip flexion hypermobility, pain response in inter-segmental mobility and end range pain in lumbar flexion and extension. For continuous data, ICCs ranged from 0.40-0.95. Only forward flexion as measured by finger-to-floor distance reached an acceptable ICC(≥ 0.75). Overall, results for intra-rater reliability were better than for inter-rater reliability but for both components, the LoA were quite wide compared with the range of assessments. Some clinical tests showed good, and some tests poor, reliability when applied in a spinal screening of adolescents. The results could probably be improved by additional training and further test standardization. This is the first step in evaluating the value of these tests for the spinal screening of adolescents. Future research should determine the association between these tests and current and/or future neck and back pain.
Intra- and interobserver reliability of the Eaton classification for trapeziometacarpal arthritis: a systematic review.

PubMed

Berger, Aaron J; Momeni, Arash; Ladd, Amy L

2014-04-01

Trapeziometacarpal, or thumb carpometacarpal (CMC), arthritis is a common problem with a variety of treatment options. Although widely used, the Eaton radiographic staging system for CMC arthritis is of questionable clinical utility, as disease severity does not predictably correlate with symptoms or treatment recommendations. A possible reason for this is that the classification itself may not be reliable, but the literature on this has not, to our knowledge, been systematically reviewed. We therefore performed a systematic review to determine the intra- and interobserver reliability of the Eaton staging system. We systematically reviewed English-language studies published between 1973 and 2013 to assess the degree of intra- and interobserver reliability of the Eaton classification for determining the stage of trapeziometacarpal joint arthritis and pantrapezial arthritis based on plain radiographic imaging. Search engines included: PubMed, Scopus(®), and CINAHL. Four studies, which included a total of 163 patients, met our inclusion criteria and were evaluated. The level of evidence of the studies included in this analysis was determined using the Oxford Centre for Evidence Based Medicine Levels of Evidence Classification by two independent observers. A limited number of studies have been performed to assess intra- and interobserver reliability of the Eaton classification system. The four studies included were determined to be Level 3b. These studies collectively indicate that the Eaton classification demonstrates poor to fair interobserver reliability (kappa values: 0.11-0.56) and fair to moderate intraobserver reliability (kappa values: 0.54-0.657). Review of the literature demonstrates that radiographs assist in the assessment of CMC joint disease, but there is not a reliable system for classification of disease severity. Currently, diagnosis and treatment of thumb CMC arthritis are based on the surgeon's qualitative assessment combining history, physical examination, and radiographic evaluation. Inconsistent agreement using the current common radiographic classification system suggests a need for better radiographic tools to quantify disease severity.
Validation of the Simple Shoulder Test in a Portuguese-Brazilian population. Is the latent variable structure and validation of the Simple Shoulder Test Stable across cultures?

PubMed

Neto, Jose Osni Bruggemann; Gesser, Rafael Lehmkuhl; Steglich, Valdir; Bonilauri Ferreira, Ana Paula; Gandhi, Mihir; Vissoci, João Ricardo Nickenig; Pietrobon, Ricardo

2013-01-01

The validation of widely used scales facilitates the comparison across international patient samples. The objective of this study was to translate, culturally adapt and validate the Simple Shoulder Test into Brazilian Portuguese. Also we test the stability of factor analysis across different cultures. The objective of this study was to translate, culturally adapt and validate the Simple Shoulder Test into Brazilian Portuguese. Also we test the stability of factor analysis across different cultures. The Simple Shoulder Test was translated from English into Brazilian Portuguese, translated back into English, and evaluated for accuracy by an expert committee. It was then administered to 100 patients with shoulder conditions. Psychometric properties were analyzed including factor analysis, internal reliability, test-retest reliability at seven days, and construct validity in relation to the Short Form 36 health survey (SF-36). Factor analysis demonstrated a three factor solution. Cronbach's alpha was 0.82. Test-retest reliability index as measured by intra-class correlation coefficient (ICC) was 0.84. Associations were observed in the hypothesized direction with all subscales of SF-36 questionnaire. The Simple Shoulder Test translation and cultural adaptation to Brazilian-Portuguese demonstrated adequate factor structure, internal reliability, and validity, ultimately allowing for its use in the comparison with international patient samples.
Agreement between the spatio-temporal gait parameters from treadmill-based photoelectric cell and the instrumented treadmill system in healthy young adults and stroke patients.

PubMed

Lee, Myungmo; Song, Changho; Lee, Kyoungjin; Shin, Doochul; Shin, Seungho

2014-07-14

Treadmill gait analysis was more advantageous than over-ground walking because it allowed continuous measurements of the gait parameters. The purpose of this study was to investigate the concurrent validity and the test-retest reliability of the OPTOGait photoelectric cell system against the treadmill-based gait analysis system by assessing spatio-temporal gait parameters. Twenty-six stroke patients and 18 healthy adults were asked to walk on the treadmill at their preferred speed. The concurrent validity was assessed by comparing data obtained from the 2 systems, and the test-retest reliability was determined by comparing data obtained from the 1st and the 2nd session of the OPTOGait system. The concurrent validity, identified by the intra-class correlation coefficients (ICC [2, 1]), coefficients of variation (CVME), and 95% limits of agreement (LOA) for the spatial-temporal gait parameters, were excellent but the temporal parameters expressed as a percentage of the gait cycle were poor. The test-retest reliability of the OPTOGait System, identified by ICC (3, 1), CVME, 95% LOA, standard error of measurement (SEM), and minimum detectable change (MDC95%) for the spatio-temporal gait parameters, was high. These findings indicated that the treadmill-based OPTOGait System had strong concurrent validity and test-retest reliability. This portable system could be useful for clinical assessments.

Validation of the Simple Shoulder Test in a Portuguese-Brazilian Population. Is the Latent Variable Structure and Validation of the Simple Shoulder Test Stable across Cultures?

PubMed Central

Neto, Jose Osni Bruggemann; Gesser, Rafael Lehmkuhl; Steglich, Valdir; Bonilauri Ferreira, Ana Paula; Gandhi, Mihir; Vissoci, João Ricardo Nickenig; Pietrobon, Ricardo

2013-01-01

Background The validation of widely used scales facilitates the comparison across international patient samples. The objective of this study was to translate, culturally adapt and validate the Simple Shoulder Test into Brazilian Portuguese. Also we test the stability of factor analysis across different cultures. Objective The objective of this study was to translate, culturally adapt and validate the Simple Shoulder Test into Brazilian Portuguese. Also we test the stability of factor analysis across different cultures. Methods The Simple Shoulder Test was translated from English into Brazilian Portuguese, translated back into English, and evaluated for accuracy by an expert committee. It was then administered to 100 patients with shoulder conditions. Psychometric properties were analyzed including factor analysis, internal reliability, test-retest reliability at seven days, and construct validity in relation to the Short Form 36 health survey (SF-36). Results Factor analysis demonstrated a three factor solution. Cronbach’s alpha was 0.82. Test-retest reliability index as measured by intra-class correlation coefficient (ICC) was 0.84. Associations were observed in the hypothesized direction with all subscales of SF-36 questionnaire. Conclusion The Simple Shoulder Test translation and cultural adaptation to Brazilian-Portuguese demonstrated adequate factor structure, internal reliability, and validity, ultimately allowing for its use in the comparison with international patient samples. PMID:23675436
Validation of the Middlesex Elderly Assessment of Mental State (MEAMS) as a cognitive screening test in patients with acquired brain injury in Turkey.

PubMed

Kutlay, Sehim; Kuçukdeveci, Ayse A; Elhan, Atilla H; Yavuzer, Gunes; Tennant, Alan

2007-02-28

Assessment of cognitive impairment with a valid cognitive screening tool is essential in neurorehabilitation. The aim of this study was to test the reliability and validity of the Turkish-adapted version of the Middlesex Elderly Assessment of Mental State (MEAMS) among acquired brain injury patients in Turkey. Some 155 patients with acquired brain injury admitted for rehabilitation were assessed by the adapted version of MEAMS at admission and discharge. Reliability was tested by internal consistency, intra-class correlation coefficient (ICC) and person separation index; internal construct validity by Rasch analysis; external construct validity by associations with physical and cognitive disability (FIM); and responsiveness by Effect Size. Reliability was found to be good with Cronbach's alpha of 0.82 at both admission and discharge; and likewise an ICC of 0.80. Person separation index was 0.813. Internal construct validity was good by fit of the data to the Rasch model (mean item fit -0.178; SD 1.019). Items were substantially free of differential item functioning. External construct validity was confirmed by expected associations with physical and cognitive disability. Effect size was 0.42 compared with 0.22 for cognitive FIM. The reliability and validity of the Turkish version of MEAMS as a cognitive impairment screening tool in acquired brain injury has been demonstrated.
Between-day reliability of a method for non-invasive estimation of muscle composition.

PubMed

Simunič, Boštjan

2012-08-01

Tensiomyography is a method for valid and non-invasive estimation of skeletal muscle fibre type composition. The validity of selected temporal tensiomyographic measures has been well established recently; there is, however, no evidence regarding the method's between-day reliability. Therefore it is the aim of this paper to establish the between-day repeatability of tensiomyographic measures in three skeletal muscles. For three consecutive days, 10 healthy male volunteers (mean±SD: age 24.6 ± 3.0 years; height 177.9 ± 3.9 cm; weight 72.4 ± 5.2 kg) were examined in a supine position. Four temporal measures (delay, contraction, sustain, and half-relaxation time) and maximal amplitude were extracted from the displacement-time tensiomyogram. A reliability analysis was performed with calculations of bias, random error, coefficient of variation (CV), standard error of measurement, and intra-class correlation coefficient (ICC) with a 95% confidence interval. An analysis of ICC demonstrated excellent agreement (ICC were over 0.94 in 14 out of 15 tested parameters). However, lower CV was observed in half-relaxation time, presumably because of the specifics of the parameter definition itself. These data indicate that for the three muscles tested, tensiomyographic measurements were reproducible across consecutive test days. Furthermore, we indicated the most possible origin of the lowest reliability detected in half-relaxation time. Copyright © 2012 Elsevier Ltd. All rights reserved.
Reliability and variability of day-to-day vault training measures in artistic gymnastics.

PubMed

Bradshaw, Elizabeth; Hume, Patria; Calton, Mark; Aisbett, Brad

2010-06-01

Inter-day training reliability and variability in artistic gymnastics vaulting was determined using a customised infra-red timing gate and contact mat timing system. Thirteen Australian high performance gymnasts (eight males and five females) aged 11-23 years were assessed during two consecutive days of normal training. Each gymnast completed a number of vault repetitions per daily session. Inter-day variability of vault run-up velocities (at -18 to -12 m, -12 to -6 m, -6 to -2 m, and -2 to 0 m from the nearest edge of the beat board), and board contact, pre-flight, and table contact times were determined using mixed modelling statistics to account for random (within-subject variability) and fixed effects (gender, number of subjects, number of trials). The difference in the mean (Mdiff) and Cohen's effect sizes for reliability assessment and intra-class correlation coefficients, and the coefficient of variation percentage (CV%) were calculated for variability assessment. Approach velocity (-18 to -2m, CV = 2.4-7.8%) and board contact time (CV = 3.5%) were less variable measures when accounting for day-to-day performance differences, than pre-flight time (CV = 17.7%) and table contact time (CV = 20.5%). While pre-flight and table contact times are relevant training measures, approach velocity and board contact time are more reliable when quantifying vaulting performance.
Ultrasound measures of tendon thickness: Intra-rater, Inter-rater and Inter-machine reliability.

PubMed

Del Baño-Aledo, María Elena; Martínez-Payá, Jacinto Javier; Ríos-Díaz, José; Mejías-Suárez, Silvia; Serrano-Carmona, Sergio; de Groot-Ferrando, Ana

2017-01-01

Ultrasound imaging is often used by physiotherapists and other healthcare professionals but the reliability of image acquisition with different ultrasound machines is unknown. The objective was to compare the intra-rater, inter-rater and intermachine reliability of thickness measurements of the plantar fascia (PF), Achilles tendon (AT), patellar tendon (PT) and elbow common extensor tendon (ECET) with musculoskeletal ultrasound imaging (MSUS). Tendon thickness was measured in four anatomical structures (14 participants, 28 images per tendon) by two sonographers and with two different ultrasound machines. Intraclass Correlation Coefficients (ICCs) and Bland-Altman plots were calculated. The standard error of measurement (SEM) and minimum detectable difference (MDD) were calculated. Inter-rater reliability was excellent for AT (ICC=0.98; 95% CI= 0.96-0.99) and very good for PT (ICC=0.85; 95% CI = 0.67-0.93) and ECET (ICC=0.81; 95% CI= 0.72-0.94). Reliability for PF was moderate, with an ICC of 0.63 (CI 95%= 0.20-0.83). Bland-Altman plot for inter-machine reliability showed a mean difference of 1 m for PF measurements and a mean difference of 4 m and 20 m for AT and PT. The relative SEMs were below 7% and the MDCs were below 0.7 mm. The MSUS reliability in measuring thickness of the four tendons is confirmed by the homogeneous readings intra sonographers, between operators and between different machines. Level of evidence: Tendon thickness can be measured reliably on different ultrasound devices, which is an important step forward in the use of this technique in daily clinical practice and research. III.
Reliability and validity of CODA motion analysis system for measuring cervical range of motion in patients with cervical spondylosis and anterior cervical fusion.

PubMed

Gao, Zhongyang; Song, Hui; Ren, Fenggang; Li, Yuhuan; Wang, Dong; He, Xijing

2017-12-01

The aim of the present study was to evaluate the reliability of the Cartesian Optoelectronic Dynamic Anthropometer (CODA) motion system in measuring the cervical range of motion (ROM) and verify the construct validity of the CODA motion system. A total of 26 patients with cervical spondylosis and 22 patients with anterior cervical fusion were enrolled and the CODA motion analysis system was used to measure the three-dimensional cervical ROM. Intra- and inter-rater reliability was assessed by interclass correlation coefficients (ICCs), standard error of measurement (SEm), Limits of Agreements (LOA) and minimal detectable change (MDC). Independent samples t-tests were performed to examine the differences of cervical ROM between cervical spondylosis and anterior cervical fusion patients. The results revealed that in the cervical spondylosis group, the reliability was almost perfect (intra-rater reliability: ICC, 0.87-0.95; LOA, -12.86-13.70; SEm, 2.97-4.58; inter-rater reliability: ICC, 0.84-0.95; LOA, -13.09-13.48; SEm, 3.13-4.32). In the anterior cervical fusion group, the reliability was high (intra-rater reliability: ICC, 0.88-0.97; LOA, -10.65-11.08; SEm, 2.10-3.77; inter-rater reliability: ICC, 0.86-0.96; LOA, -10.91-13.66; SEm, 2.20-4.45). The cervical ROM in the cervical spondylosis group was significantly higher than that in the anterior cervical fusion group in all directions except for left rotation. In conclusion, the CODA motion analysis system is highly reliable in measuring cervical ROM and the construct validity was verified, as the system was sufficiently sensitive to distinguish between the cervical spondylosis and anterior cervical fusion groups based on their ROM.
Development and psychometric evaluation of a clinical global impression for schizoaffective disorder scale.

PubMed

Allen, Michael H; Daniel, David G; Revicki, Dennis A; Canuso, Carla M; Turkoz, Ibrahim; Fu, Dong-Jing; Alphs, Larry; Ishak, K Jack; Bartko, John J; Lindenmayer, Jean-Pierre

2012-01-01

The Clinical Global Impression for Schizoaffective Disorder scale is a new rating scale adapted from the Clinical Global Impression scale for use in patients with schizoaffective disorder. The psychometric characteristics of the Clinical Global Impression for Schizoaffective Disorder are described. Content validity was assessed using an investigator questionnaire. Inter-rater reliability was determined with 12 sets of videotaped interviews rated independently by two trained individuals. Test-retest reliability was assessed using 30 randomly selected raters from clinical trials who evaluated the same videos on separate occasions two weeks apart. Convergent and divergent validity and effect size were evaluated by comparing scores between the Clinical Global Impression for Schizoaffective Disorder and the Positive and Negative Syndrome Scale, 21-item Hamilton Rating Scale for Depression, and Young Mania Rating Scale scales using pooled patient data from two clinical trials. Clinical Global Impression for Schizoaffective Disorder scores were then linked to corresponding Positive and Negative Syndrome Scale scores. Content validity was strong. Inter-rater agreement was good to excellent for most scales and subscales (intra-class correlation coefficient ≥ 0.50). Test-retest showed good reproducibility, with intraclass correlation coefficients ranging from 0.444 to 0.898. Spearman correlations between Clinical Global Impression for Schizoaffective Disorder domains and corresponding symptom scales were 0.60 or greater, and effect sizes for Clinical Global Impression for Schizoaffective Disorder overall and domain scores were similar to Positive and Negative Syndrome Scale Young Mania Rating Scale, and 21-item Hamilton Rating Scale for Depression scores. Raters anticipated that the scale might be less effective in distinguishing negative from depressive symptoms, and, in fact, the results here may reflect that clinical reality. Multiple lines of evidence support the reliability and validity of the Clinical Global Impression for Schizoaffective Disorder for studies in schizoaffective disorder.
Validation of the Chinese Version of the Quality of Nursing Work Life Scale

PubMed Central

Fu, Xia; Xu, Jiajia; Song, Li; Li, Hua; Wang, Jing; Wu, Xiaohua; Hu, Yani; Wei, Lijun; Gao, Lingling; Wang, Qiyi; Lin, Zhanyi; Huang, Huigen

2015-01-01

Quality of Nursing Work Life (QNWL) serves as a predictor of a nurse’s intent to leave and hospital nurse turnover. However, QNWL measurement tools that have been validated for use in China are lacking. The present study evaluated the construct validity of the QNWL scale in China. A cross-sectional study was conducted conveniently from June 2012 to January 2013 at five hospitals in Guangzhou, which employ 1938 nurses. The participants were asked to complete the QNWL scale and the World Health Organization Quality of Life abbreviated version (WHOQOL-BREF). A total of 1922 nurses provided the final data used for analyses. Sixty-five nurses from the first investigated division were re-measured two weeks later to assess the test-retest reliability of the scale. The internal consistency reliability of the QNWL scale was assessed using Cronbach’s α. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC). Criterion-relation validity was assessed using the correlation of the total scores of the QNWL and the WHOQOL-BREF. Construct validity was assessed with the following indices: χ2 statistics and degrees of freedom; relative mean square error of approximation (RMSEA); the Akaike information criterion (AIC); the consistent Akaike information criterion (CAIC); the goodness-of-fit index (GFI); the adjusted goodness of fit index; and the comparative fit index (CFI). The findings demonstrated high internal consistency (Cronbach’s α = 0.912) and test-retest reliability (interclass correlation coefficient = 0.74) for the QNWL scale. The chi-square test (χ2 = 13879.60, df [degree of freedom] = 813 P = 0.0001) was significant. The RMSEA value was 0.091, and AIC = 1806.00, CAIC = 7730.69, CFI = 0.93, and GFI = 0.74. The correlation coefficient between the QNWL total scores and the WHOQOL-BREF total scores was 0.605 (p<0.01). The QNWL scale was reliable and valid in Chinese-speaking nurses and could be used as a clinical and research instrument for measuring work-related factors among nurses in China. PMID:25950838
Validation of the Japanese version of the Pediatric Quality of Life Inventory (PedsQL) Cancer Module.

PubMed

Tsuji, Naoko; Kakee, Naoko; Ishida, Yasushi; Asami, Keiko; Tabuchi, Ken; Nakadate, Hisaya; Iwai, Tsuyako; Maeda, Miho; Okamura, Jun; Kazama, Takuro; Terao, Yoko; Ohyama, Wataru; Yuza, Yuki; Kaneko, Takashi; Manabe, Atsushi; Kobayashi, Kyoko; Kamibeppu, Kiyoko; Matsushima, Eisuke

2011-04-10

The PedsQL 3.0 Cancer Module is a widely used instrument to measure pediatric cancer specific health-related quality of life (HRQOL) for children aged 2 to 18 years. We developed the Japanese version of the PedsQL Cancer Module and investigated its reliability and validity among Japanese children and their parents. Participants were 212 children with cancer and 253 of their parents. Reliability was determined by internal consistency using Cronbach's coefficient alpha and test-retest reliability using intra-class correlation coefficient (ICC). Validity was assessed through factor validity, convergent and discriminant validity, concurrent validity, and clinical validity. Factor validity was examined by exploratory factor analysis. Convergent and discriminant validity were examined by multitrait scaling analysis. Concurrent validity was assessed using Spearman's correlation coefficients between the Cancer Module and Generic Core Scales, and the comparison of the scores of child self-reports with those of other self-rating depression scales for children. Clinical validity was assessed by comparing the on- and off- treatment scores using Kruskal-Wallis and Mann-Whitney U tests. Cronbach's coefficient alpha was over 0.70 for the total scale and over 0.60 for each subscale by age except for the 'pain and hurt' subscale for children aged 5 to 7 years. For test-retest reliability, the ICC exceeded 0.70 for the total scale for each age. Exploratory factor analysis demonstrated sufficient factorial validity. Multitrait scaling analysis showed high success rates. Strong correlations were found between the reports by children and their parents, and the scores of the Cancer Module and the Generic Core Scales except for 'treatment anxiety' subscales for child reports. The Depression Self-Rating Scale for Children (DSRS-C) scores were significantly correlated with emotional domains and the total score of the cancer module. Children who had been off treatment over 12 months demonstrated significantly higher scores than those on treatment. The results demonstrate the reliability and validity of the Japanese version of the PedsQL Cancer Module among Japanese children.
Health-related quality of life in children with dysphonia and validation of the French Pediatric Voice Handicap Index.

PubMed

Oddon, P A; Boucekine, M; Boyer, L; Triglia, J M; Nicollas, R

2018-01-01

voice disorders are common in the pediatric population and can negatively affect children's quality of life. The pediatric voice handicap Index (pVHI) is a valid instrument to assess parental perception of their children voice but it is not translated into French language. The aim of the present study was to adapt a French version of the pVHI and to evaluate its psychometric properties including construct validity, reliability, and some aspects of external validity. we performed a cross sectional study including 32 dysphonic children and 60 children with no history of voice problems between 3 and 12 years of age. The original pVHI was translated into French language according to forward-backward rules and then administered to parents or caregivers. Construct validity and internal consistency were explored using confirmatory factor analysis and Cronbach's alpha. The questionnaire was filled twice to assess test-retest reliability using the intra-class correlation coefficient. The external validity was explored by comparing the French pVHI total and subscales scores between dysphonic and asymptomatic children. Correlations between the French pVHI and both the perceptual GRBAS scale and the health-related quality of life (HRQOL) survey "Vécu et Santé Perçu de l'Adolescent et de l'Enfant" (VSP-Ap) were also performed. the structure of the French pVHI showed a good fit with excellent reliability (α = 0.929) and high test-retest reliability. Significant differences were found between the group of dysphonic children and the control group (p < 0.001). The French pVHI scores were positively correlated to all parameters of the GRBAS scale (p < 0.05). Significant negative correlations were found between the Functional domain of the pVHI and various domains of the VSP-Ap as Leisure Activities, Schooling and Sentimental Relationship (p < 0.05). the French pVHI is considered to be a valid and reliable instrument to assess voice-related quality of life in children with voice disorder. We recommend its use in the multidimensional protocols for assessing voice disorder in the pediatric population. Copyright © 2017. Published by Elsevier B.V.
Assessment of Lower Limb Muscle Strength and Power Using Hand-Held and Fixed Dynamometry: A Reliability and Validity Study

PubMed Central

Perraton, Luke G.; Bower, Kelly J.; Adair, Brooke; Pua, Yong-Hao; Williams, Gavin P.; McGaw, Rebekah

2015-01-01

Introduction Hand-held dynamometry (HHD) has never previously been used to examine isometric muscle power. Rate of force development (RFD) is often used for muscle power assessment, however no consensus currently exists on the most appropriate method of calculation. The aim of this study was to examine the reliability of different algorithms for RFD calculation and to examine the intra-rater, inter-rater, and inter-device reliability of HHD as well as the concurrent validity of HHD for the assessment of isometric lower limb muscle strength and power. Methods 30 healthy young adults (age: 23±5yrs, male: 15) were assessed on two sessions. Isometric muscle strength and power were measured using peak force and RFD respectively using two HHDs (Lafayette Model-01165 and Hoggan microFET2) and a criterion-reference KinCom dynamometer. Statistical analysis of reliability and validity comprised intraclass correlation coefficients (ICC), Pearson correlations, concordance correlations, standard error of measurement, and minimal detectable change. Results Comparison of RFD methods revealed that a peak 200ms moving window algorithm provided optimal reliability results. Intra-rater, inter-rater, and inter-device reliability analysis of peak force and RFD revealed mostly good to excellent reliability (coefficients ≥ 0.70) for all muscle groups. Concurrent validity analysis showed moderate to excellent relationships between HHD and fixed dynamometry for the hip and knee (ICCs ≥ 0.70) for both peak force and RFD, with mostly poor to good results shown for the ankle muscles (ICCs = 0.31–0.79). Conclusions Hand-held dynamometry has good to excellent reliability and validity for most measures of isometric lower limb strength and power in a healthy population, particularly for proximal muscle groups. To aid implementation we have created freely available software to extract these variables from data stored on the Lafayette device. Future research should examine the reliability and validity of these variables in clinical populations. PMID:26509265
Face biometrics with renewable templates

NASA Astrophysics Data System (ADS)

van der Veen, Michiel; Kevenaar, Tom; Schrijen, Geert-Jan; Akkermans, Ton H.; Zuo, Fei

2006-02-01

In recent literature, privacy protection technologies for biometric templates were proposed. Among these is the so-called helper-data system (HDS) based on reliable component selection. In this paper we integrate this approach with face biometrics such that we achieve a system in which the templates are privacy protected, and multiple templates can be derived from the same facial image for the purpose of template renewability. Extracting binary feature vectors forms an essential step in this process. Using the FERET and Caltech databases, we show that this quantization step does not significantly degrade the classification performance compared to, for example, traditional correlation-based classifiers. The binary feature vectors are integrated in the HDS leading to a privacy protected facial recognition algorithm with acceptable FAR and FRR, provided that the intra-class variation is sufficiently small. This suggests that a controlled enrollment procedure with a sufficient number of enrollment measurements is required.
Cross-cultural adaptation of the Innsbruck Health Dimensions Questionnaire for Neurosurgical Patients (IHD-NS).

PubMed

Santos, Camila Batista dos; Carvalho, Simone Carneiro Ahualli de; Silva, Maria Fernanda Gouveia da; Fuentes, Daniel; Santana, Pedro Augusto; Furlan, André Beer; Aguiar, Paulo Henrique Pires de

2008-09-01

The goal of this study was to accomplish the cross-cultural adaptation of a quality of life instrument, specific for neurosurgical patients, called Innsbruck Health Dimensions Questionnaire for Neurosurgical Patients (IHD-NS). Thirty patients participated in this study, male and female, all having been submitted to brain tumor surgery more than twelve months before, and whose ages ranged from 26 to 66. After the process of translation/back translation and the elaboration of the Brazilian version of the instrument, the patients were assessed and reassessed within a one-month period. Statistical analyses evinced the preservation of the internal consistency, high agreement levels and highly significant intra-class correlation, allowing for the belief in the quality and reliability of the Portuguese version, named Questionário de Dimensões de Saúde para Pacientes Neurocirúrgicos de Innsbruck--DSI (NC).
The minimal clinically important difference of the control of allergic rhinitis and asthma test (CARAT): cross-cultural validation and relation with pollen counts

PubMed Central

van der Leeuw, Sander; van der Molen, Thys; Dekhuijzen, PN Richard; Fonseca, Joao A; van Gemert, Frederik A; Gerth van Wijk, Roy; Kocks, Janwillem WH; Oosterom, Helma; Riemersma, Roland A; Tsiligianni, Ioanna G; de Weger, Letty A; Oude Elberink, Joanne NG; Flokstra-de Blok, Bertine MJ

2015-01-01

Background: The Control of Allergic Rhinitis and Asthma Test (CARAT) monitors control of asthma and allergic rhinitis. Aims: To determine the CARAT’s minimal clinically important difference (MCID) and to evaluate the psychometric properties of the Dutch CARAT. Methods: CARAT was applied in three measurements at 1-month intervals. Patients diagnosed with asthma and/or rhinitis were approached. MCID was evaluated using Global Rating of Change (GRC) and standard error of measurement (s.e.m.). Cronbach’s alpha was used to evaluate internal consistency. Spearman’s correlation coefficients were calculated between CARAT, the Asthma Control Questionnaire (ACQ5) and the Visual Analog Scale (VAS) on airway symptoms to determine construct and longitudinal validity. Test–retest reliability was evaluated with intra-class correlation coefficient (ICC). Changes in pollen counts were compared with delta CARAT and ACQ5 scores. Results: A total of 92 patients were included. The MCID of the CARAT was 3.50 based on GRC scores; the s.e.m. was 2.83. Cronbach’s alpha was 0.82. Correlation coefficients between CARAT and ACQ5 and VAS questions ranged from 0.64 to 0.76 (P<0.01). Longitudinally, correlation coefficients between delta CARAT scores and delta ACQ5 and VAS scores ranged from 0.41 to 0.67 (P<0.01). Test–retest reliability showed an ICC of 0.81 (P<0.01) and 0.80 (P<0.01). Correlations with pollen counts were higher for CARAT than for ACQ5. Conclusions: This is the first investigation of the MCID of the CARAT. The CARAT uses a whole-point scale, which suggests that the MCID is 4 points. The CARAT is a valid and reliable tool that is also applicable in the Dutch population. PMID:25569880
Cross-cultural adaptation and validation of the Saudi Arabic version of the Knee Injury and Osteoarthritis Outcome Score (KOOS).

PubMed

Alfadhel, Saud A; Vennu, Vishal; Alnahdi, Ali H; Omar, Mohammed T; Alasmari, Saeed H; AlJafri, Zahra; Bindawas, Saad M

2018-06-07

The Knee Injury Osteoarthritis Outcome Score (KOOS) is a widely used joint-specific measure employed to evaluate pain, symptoms, activities of daily living, recreational activities, and quality of life in patients with knee osteoarthritis (OA). Although the original KOOS has been translated into many languages, a Saudi Arabic version is not available. This study aimed to culturally adapt and evaluate the psychometric properties of the Saudi Arabic version of the KOOS in patients with knee OA. The original KOOS was translated and adapted into Saudi Arabic version over six stages according to the guidelines suggested by Beaton and recommended by the American Association of Orthopedic Surgeons Outcome Committee. Patients diagnosed with knee OA (n = 136) were recruited to examine the psychometric properties, such as internal consistency that was tested using Cronbach's alpha, test-retest reliability that was analyzed using the intra-class correlation coefficient (ICC 2,1 ), and construct validity that examined by testing the correlations between the new version subscales, Form 36 Health Survey subscales, and the Visual Analog Scale, Spearman's correlation coefficient (r s ) was used to measure the correlations. A total of 122 (89.7%) of the 136 participants with knee OA completed the second re-test of new Saudi Arabic version. Excellent internal consistency (Cronbach's alpha = 0.87-0.92) was detected in the subscales of the adapted version, as well as excellent test-retest reliability (ICC 2,1 = 0.92-0.94). The pattern of correlation between the subscales of the Saudi Arabic version of the KOOS, SF-36 domains and the Visual Analog Scale for pain supported the construct validity of the adapted version. The Saudi Arabic version of the KOOS was well accepted and exhibited excellent reliability, internal consistency, and construct validity in Saudi patients with knee OA.
The Health Informatics Trial Enhancement Project (HITE): Using routinely collected primary care data to identify potential participants for a depression trial

PubMed Central

2010-01-01

Background Recruitment to clinical trials can be challenging. We identified anonymous potential participants to an existing pragmatic randomised controlled depression trial to assess the feasibility of using routinely collected data to identify potential trial participants. We discuss the strengths and limitations of this approach, assess its potential value, report challenges and ethical issues encountered. Methods Swansea University's Health Information Research Unit's Secure Anonymised Information Linkage (SAIL) database of routinely collected health records was interrogated, using Structured Query Language (SQL). Read codes were used to create an algorithm of inclusion/exclusion criteria with which to identify suitable anonymous participants. Two independent clinicians rated the eligibility of the potential participants' identified. Inter-rater reliability was assessed using the kappa statistic and inter-class correlation. Results The study population (N = 37263) comprised all adults registered at five general practices in Swansea UK. Using the algorithm 867 anonymous potential participants were identified. The sensitivity and specificity results > 0.9 suggested a high degree of accuracy from the algorithm. The inter-rater reliability results indicated strong agreement between the confirming raters. The Intra Class Correlation Coefficient (Cronbach's Alpha) > 0.9, suggested excellent agreement and Kappa coefficient > 0.8; almost perfect agreement. Conclusions This proof of concept study showed that routinely collected primary care data can be used to identify potential participants for a pragmatic randomised controlled trial of folate augmentation of antidepressant therapy for the treatment of depression. Further work will be needed to assess generalisability to other conditions and settings and the inclusion of this approach to support Electronic Enhanced Recruitment (EER). PMID:20398303
Reliability of the Balance Evaluation Systems Test (BESTest) and BESTest sections for adults with hemiparesis

PubMed Central

Rodrigues, Letícia C.; Marques, Aline P.; Barros, Paula B.; Michaelsen, Stella M.

2014-01-01

BACKGROUND: The Balance Evaluation Systems Test (BESTest) was recently created to allow the development of treatments according to the specific balance system affected in each patient. The Brazilian version of the BESTest has not been specifically tested after stroke. OBJECTIVE: To evaluate the intra- and inter-rater reliability and concurrent and convergent validity of the total score of the BESTest and BESTest sections for adults with hemiparesis after stroke. METHOD: The study included 16 subjects (61.1±7.5 years) with chronic hemiparesis (54.5±43.5 months after stroke). The BESTest was administered by two raters in the same week and one of the raters repeated the test after a one-week interval. Intraclass correlation coefficient (ICC) was calculated to assess intra- and interrater reliability. Concurrent validity with the Berg Balance Scale (BBS) and convergent validity with the Activities-specific Balance Confidence scale (ABC-Brazil) were assessed using Pearson's correlation coefficient. RESULTS: Both the BESTest total score (ICC=0.98) and the BESTest sections (ICC between 0.85 and 0.96) have excellent intrarater reliability. Interrater reliability for the total score was excellent (ICC=0.93) and, for the sections, it ranged between 0.71 and 0.94. The correlation coefficient between the BESTest and the BBS and ABC-Brazil were 0.78 and 0.59, respectively. CONCLUSIONS: The Brazilian version of the BESTest demonstrated adequate reliability when measured by sections and could identify what balance system was affected in patients after stroke. Concurrent validity was excellent with the BBS total score and good to excellent with the sections. The total scores but not the sections present adequate convergent validity with the ABC-Brazil. However, other psychometric properties should be further investigated. PMID:25003281
Periorbital Biometric Measurements using ImageJ Software: Standardisation of Technique and Assessment Of Intra- and Interobserver Variability

PubMed Central

Rajyalakshmi, R.; Prakash, Winston D.; Ali, Mohammad Javed; Naik, Milind N.

2017-01-01

Purpose: To assess the reliability and repeatability of periorbital biometric measurements using ImageJ software and to assess if the horizontal visible iris diameter (HVID) serves as a reliable scale for facial measurements. Methods: This study was a prospective, single-blind, comparative study. Two clinicians performed 12 periorbital measurements on 100 standardised face photographs. Each individual’s HVID was determined by Orbscan IIz and used as a scale for measurements using ImageJ software. All measurements were repeated using the ‘average’ HVID of the study population as a measurement scale. Intraclass correlation coefficient (ICC) and Pearson product-moment coefficient were used as statistical tests to analyse the data. Results: The range of ICC for intra- and interobserver variability was 0.79–0.99 and 0.86–0.99, respectively. Test-retest reliability ranged from 0.66–1.0 to 0.77–0.98, respectively. When average HVID of the study population was used as scale, ICC ranged from 0.83 to 0.99, and the test-retest reliability ranged from 0.83 to 0.96 and the measurements correlated well with recordings done with individual Orbscan HVID measurements. Conclusion: Periorbital biometric measurements using ImageJ software are reproducible and repeatable. Average HVID of the population as measured by Orbscan is a reliable scale for facial measurements. PMID:29403183
[Cross-cultural adaptation and validation of the PROMIS Global Health scale in the Portuguese language].

PubMed

Zumpano, Camila Eugênia; Mendonça, Tânia Maria da Silva; Silva, Carlos Henrique Martins da; Correia, Helena; Arnold, Benjamin; Pinto, Rogério de Melo Costa

2017-01-23

This study aimed to perform the cross-cultural adaptation and validation of the Patient-Reported Outcomes Measurement Information System (PROMIS) Global Health scale in the Portuguese language. The ten Global Health items were cross-culturally adapted by the method proposed in the Functional Assessment of Chronic Illness Therapy (FACIT). The instrument's final version in Portuguese was self-administered by 1,010 participants in Brazil. The scale's precision was verified by floor and ceiling effects analysis, reliability of internal consistency, and test-retest reliability. Exploratory and confirmatory factor analyses were used to assess the construct's validity and instrument's dimensionality. Calibration of the items used the Gradual Response Model proposed by Samejima. Four global items required adjustments after the pretest. Analysis of the psychometric properties showed that the Global Health scale has good reliability, with Cronbach's alpha of 0.83 and intra-class correlation of 0.89. Exploratory and confirmatory factor analyses showed good fit in the previously established two-dimensional model. The Global Physical Health and Global Mental Health scale showed good latent trait coverage according to the Gradual Response Model. The PROMIS Global Health items showed equivalence in Portuguese compared to the original version and satisfactory psychometric properties for application in clinical practice and research in the Brazilian population.
Test-retest reliability and agreement of the Satisfaction with the Assistive Technology Services (SATS) instrument in two Nordic countries.

PubMed

Sund, Terje; Iwarsson, Susanne; Anttila, Heidi; Helle, Tina; Brandt, Ase

2014-07-01

The purpose of this study was to investigate test-retest reliability, agreement, internal consistency, and floor- and ceiling effects of the Danish and Finnish versions of the Satisfaction with the Assistive Technology Services (SATS) instrument among adult users of powered wheelchairs (PWCs) or powered scooters (scooters). Test-retest design, two telephone interviews 7-18 days apart of 40 informants, with mean age of 67.5 (SD 13.09) years in the Danish; and 54 informants with mean age of 55.6 (SD 12.09) years in the Finnish sample. The intra-class correlation coefficient varied between 0.57 and 0.93 for items in the Danish and between 0.41 and 0.93 in the Finnish sample. The percentage agreement varied between 54.2 and 79.5 for items in the Danish and between 69.2 and 81.1 in the Finnish sample, while the Cronbach's alpha values varied between 0.87 and 0.96 in the two samples. A ceiling effect was found in all items of both samples. This study indicates that the SATS may be reliably administered for telephone interviews among adult PWC and scooter users, and give information about aspects of the service delivery process for quality development improvement purposes. Further psychometric testing of the SATS is required.

Reliability of Achilles Tendon Moment Arm Measured In Vivo Using Freehand Three-Dimensional Ultrasound.

PubMed

Obst, Steven J; Barber, Lee; Miller, Ashton; Barrett, Rod S

2017-08-01

This study investigated reliability of freehand three-dimensional ultrasound (3DUS) measurement of in vivo human Achilles tendon (AT) moment arm. Sixteen healthy adults were scanned on 2 separate occasions by a single investigator. 3DUS scans were performed over the free AT, medial malleolus, and lateral malleolus with the ankle passively positioned in maximal dorsiflexion, mid dorsiflexion, neutral, mid plantar flexion and maximal plantar flexion. 3D reconstructions of the AT, medial malleolus, and lateral malleolus were created from manual segmentation of the ultrasound images and used to geometrically determine the AT moment arm using both a straight (straight AT MA ) and curved (curved AT MA ) tendon line-of-action. Both methods were reliable within- and between-session (intra-class correlation coefficients > 0.92; coefficient of variation < 2.5 %) and revealed that AT moment arm increased by ∼ 7 mm from maximal dorsiflexion (∼ 41mm) to maximal plantar flexion (∼ 48 mm). Failing to account for tendon curvature led to a small overestimation (< 2 mm) of AT moment arm that was most pronounced in ankle plantar flexion, but was less than the minimal detectable change of the method and could be disregarded.
Characterising smoking cessation smartphone applications in terms of behaviour change techniques, engagement and ease-of-use features.

PubMed

Ubhi, Harveen Kaur; Michie, Susan; Kotz, Daniel; van Schayck, Onno C P; Selladurai, Abiram; West, Robert

2016-09-01

The aim of this study was to assess whether or not behaviour change techniques (BCTs) as well as engagement and ease-of-use features used in smartphone applications (apps) to aid smoking cessation can be identified reliably. Apps were coded for presence of potentially effective BCTs, and engagement and ease-of-use features. Inter-rater reliability for this coding was assessed. Inter-rater agreement for identifying presence of potentially effective BCTs ranged from 66.8 to 95.1 % with 'prevalence and bias adjusted kappas' (PABAK) ranging from 0.35 to 0.90 (p < 0.001). The intra-class correlation coefficients between the two coders for scores denoting the proportions of (a) a set of engagement features and (b) a set of ease-of-use features, which were included, were 0.77 and 0.75, respectively (p < 0.001). Prevalence estimates for BCTs ranged from <10 % for medication advice to >50 % for rewarding abstinence. The average proportions of specified engagement and ease-of-use features included in the apps were 69 and 83 %, respectively. The study found that it is possible to identify potentially effective BCTs, and engagement and ease-of-use features in smoking cessation apps with fair to high inter-rater reliability.
Feasibility and reliability of a virtual reality oculus platform to measure sensory integration for postural control in young adults.

PubMed

Lubetzky, Anat V; Kary, Erinn E; Harel, Daphna; Hujsak, Bryan; Perlin, Ken

2018-01-24

Using Unity for the Oculus Development-Kit 2, we have developed an affordable, portable virtual reality platform that targets the visuomotor domain, a missing link in current clinical assessments of postural control. Here, we describe the design and technical development as well as report its feasibility with regards to cybersickness and test-retest reliability in healthy young adults. Our virtual reality paradigm includes two functional scenes ('City' and 'Park') and four moving dots scenes. Twenty-one healthy young adults were tested twice, one to two weeks apart. They completed a simulator sickness questionnaire several times per session. Their postural sway response was recorded from a forceplate underneath their feet while standing on the floor, stability trainers, or a Both Sides Up (BOSU) ball. Sample entropy, postural displacement, velocity, and excursion were calculated and compared between sessions given the visual and surface conditions. Participants reported slight-to-moderate transient side effects. Intra-Class Correlation values mostly ranged from 0.5 to 0.7 for displacement and velocity, were above 0.5 (stability trainer conditions) and above 0.4 (floor mediolateral conditions) for sample entropy, and minimal for excursion. Our novel portable VR platform was found to be feasible and reliable in healthy young adults.
Validity and reliability of the Turkish version of the pressure ulcer prevention knowledge assessment instrument.

PubMed

Tulek, Zeliha; Polat, Cansu; Ozkan, Ilknur; Theofanidis, Dimitris; Togrol, Rifat Erdem

2016-11-01

Sound knowledge of pressure ulcers is important to enable good prevention. There are limited instruments assessing pressure ulcer knowledge. The Pressure Ulcer Prevention Knowledge Assessment Instrument is among the scales of which psychometric properties have been studied rigorously and reflects the latest evidence. This study aimed to evaluate the validity and reliability of the Turkish version of the Pressure Ulcer Prevention Knowledge Assessment Instrument (PUPKAI-T), an instrument that assesses knowledge of pressure ulcer prevention by using multiple-choice questions. Linguistic validity was verified through front-to-back translation. Psychometric properties of the instrument were studied on a sample of 150 nurses working in a tertiary hospital in Istanbul, Turkey. The content validity index of the translated instrument was 0.94, intra-class correlation coefficients were between 0.37 and 0.80, item difficulty indices were between 0.21 and 0.88, discrimination indices were 0.20-0.78, and the Kuder Richardson for the internal consistency was 0.803. The PUPKAI-T was found to be a valid and reliable tool to evaluate nurses' knowledge on pressure ulcer prevention. The PUPKAI-T may be a useful tool for determining educational needs of nurses on pressure ulcer prevention. Copyright © 2016 Tissue Viability Society. Published by Elsevier Ltd. All rights reserved.
A new approach to determining net impulse and identification of its characteristics in countermovement jumping: reliability and validity.

PubMed

Mizuguchi, Satoshi; Sands, William A; Wassinger, Craig A; Lamont, Hugh S; Stone, Michael H

2015-06-01

Examining a countermovement jump (CMJ) force-time curve related to net impulse might be useful in monitoring athletes' performance. This study aimed to investigate the reliability of alternative net impulse calculation and net impulse characteristics (height, width, rate of force development, shape factor, and proportion) and validate against the traditional calculation in the CMJ. Twelve participants performed the CMJ in two sessions (48 hours apart) for test-retest reliability. Twenty participants were involved for the validity assessment. Results indicated intra-class correlation coefficient (ICC) of ≥ 0.89 and coefficient of variation (CV) of ≤ 5.1% for all of the variables except for rate of force development (ICC = 0.78 and CV = 22.3%). The relationship between the criterion and alternative calculations was r = 1.00. While the difference between them was statistically significant (245.96 ± 63.83 vs. 247.14 ± 64.08 N s, p < 0.0001), the effect size was trivial and deemed practically minimal (d = 0.02). In conclusion, variability of rate of force development will pose a greater challenge in detecting performance changes. Also, the alternative calculation can be used practically in place of the traditional calculation to identify net impulse characteristics and monitor and study athletes' performance in greater depth.
Assessment of the amount of tooth wear on dental casts and intra-oral photographs.

PubMed

Wetselaar, P; Wetselaar-Glas, M J M; Koutris, M; Visscher, C M; Lobbezoo, F

2016-08-01

Tooth wear is a multifactorial condition, leading to the loss of dental hard tissues. Many grading scales are available to assess the amount of tooth wear, one of which is the tooth wear evaluation system (TWES). A grading scale can be used chairside, on casts and on photographs. The aim was to test whether the grading scales of the TWES, used on casts and on photographs, resulted in comparable scores. In addition, it was tested whether these scales can be used to assess tooth wear reliably on photographs. Of 75 tooth wear patients, sets of casts and series of photographs were obtained and graded. Comparison of the grading on casts and on photographs revealed equal median values and percentiles for both occlusal/incisal grading and non-occlusal/non-incisal grading. The grading on casts and on photographs showed a high correlation for the occlusal/incisal grading and a low correlation for the non-occlusal/non-incisal grading (Spearman's rho = 0·74 and rho = 0·47; P < 0·001). Concerning the grading on photographs, the interexaminer reliability was fair-to-good (ICC = 0·41 to ICC = 0·55) while the intra-examiner reliability was fair-to-good to excellent (ICC = 0·68 to ICC = 0·86) for the occlusal/incisal grading. For the non-occlusal/non-incisal grading, the interexaminer reliability was poor to fair-to-good (ICC = 0·22 to ICC = 0·59), while the intra-examiner reliability was fair-to-good to excellent (ICC = 0·64 to ICC = 0·82). It was concluded that the scores obtained with the grading scales of the TWES on casts and on photographs are comparable. The grading scales can be used in a reliable way on photographs, which is especially the case for occlusal/incisal grading. © 2016 John Wiley & Sons Ltd.
Repeatability of self-report measures of physical activity, sedentary and travel behaviour in Hong Kong adolescents for the iHealt(H) and IPEN - Adolescent studies.

PubMed

Cerin, Ester; Sit, Cindy H P; Huang, Ya-Jun; Barnett, Anthony; Macfarlane, Duncan J; Wong, Stephen S H

2014-06-06

Physical activity and sedentary behaviour are important contributors to adolescents' health. These behaviours may be affected by the school and neighbourhood built environments. However, current evidence on such effects is mainly limited to Western countries. The International Physical Activity and the Environment Network (IPEN)-Adolescent study aims to examine associations of the built environment with adolescent physical activity and sedentary behaviour across five continents.We report on the repeatability of measures of in-school and out-of school physical activity, plus measures of out-of-school sedentary and travel behaviours adopted by the IPEN - Adolescent study and adapted for Chinese-speaking Hong Kong adolescents participating in the international Healthy environments and active living in teenagers-(Hong Kong) [iHealt(H)] study, which is part of IPEN-Adolescent. Items gauging in-school physical activity and out-of-school physical activity, and out-of-school sedentary and travel behaviours developed for the IPEN - Adolescent study were translated from English into Chinese, adapted, and pilot tested. Sixty-eight Chinese-speaking 12-17 year old secondary school students (36 boys; 32 girls) residing in areas of Hong Kong differing in transport-related walkability were recruited. They self-completed the survey items twice, 8-16 days apart. Test-retest reliability was assessed for the whole sample and by gender using one-way random effects intra-class correlation coefficients (ICC). Test-retest reliability of items with restricted variability was assessed using percentage agreement. Overall test-retest reliability of items and scales was moderate to excellent (ICC = 0.47-0.92). Items with restricted variability in responses had a high percentage agreement (92%-100%). Test-retest reliability was similar in girls and boys, with the exception of daily hours of homework (reliability higher in girls) and number of school-based sports teams or after-school physical activity classes (reliability higher in boys). The translated and adapted self-report measures of physical activity, sedentary and travel behaviours used in the iHealt(H) study are sufficiently reliable. Levels of reliability are comparable or slightly higher than those observed for the original measures.
Measuring the quality of Hospital Food Services: Development and reliability of a Meal Quality Audit Tool.

PubMed

Banks, Merrilyn; Hannan-Jones, Mary; Ross, Lynda; Buckley, Ann; Ellick, Jennifer; Young, Adrienne

2017-04-01

To develop and test the reliability of a Meal Quality Audit Tool (MQAT) to audit the quality of hospital meals to assist food service managers and dietitians in identifying areas for improvement. The MQAT was developed using expert opinion and was modified over time with extensive use and feedback. A phased approach was used to assess content validity and test reliability: (i) trial with 60 dietetic students, (ii) trial with 12 food service dietitians in practice and (iii) interrater reliability study. Phases 1 and 2 confirmed content validity and informed minor revision of scoring, language and formatting of the MQAT. To assess reliability of the final MQAT, eight separate meal quality audits of five identical meals were conducted over several weeks in the hospital setting. Each audit comprised an 'expert' team and four 'test' teams (dietitians, food services and ward staff). Interrater reliability was determined using intra-class correlation analysis. There was statistically significant interrater reliability for dimensions of Temperature and Accuracy (P < 0.001) but not for Appearance or Sensory. Composition of the 'test' team appeared to influence results for Appearance and Sensory, with food service-led teams scoring higher on these dimensions. 'Test' teams reported that MQAT was clear and easy to use. MQAT was found to be reliable for Temperature and Accuracy domains, with further work required to improve the reliability of the Appearance and Sensory dimensions. The systematic use of the tool, used in conjunction with patient satisfaction, could provide pertinent and useful information regarding the quality of food services and areas for improvement. © 2017 Dietitians Association of Australia.
Isometric and isokinetic muscle strength in the upper extremity can be reliably measured in persons with chronic stroke.

PubMed

Ekstrand, Elisabeth; Lexell, Jan; Brogårdh, Christina

2015-09-01

To evaluate the test-retest reliability of isometric and isokinetic muscle strength measurements in the upper extremity after stroke. A test-retest design. Forty-five persons with mild to moderate paresis in the upper extremity > 6 months post-stroke. Isometric arm strength (shoulder abduction, elbow flexion), isokinetic arm strength (elbow extension/flexion) and isometric grip strength were measured with electronic dynamometers. Reliability was evaluated with intra-class correlation coefficients (ICC), changes in the mean, standard error of measurements (SEM) and smallest real differences (SRD). Reliability was high (ICCs: 0.92-0.97). The absolute and relative (%) SEM ranged from 2.7 Nm (5.6%) to 3.0 Nm (9.4%) for isometric arm strength, 2.6 Nm (7.4%) to 2.9 Nm (12.6%) for isokinetic arm strength, and 22.3 N (7.6%) to 26.4 N (9.2%) for grip strength. The absolute and relative (%) SRD ranged from 7.5 Nm (15.5%) to 8.4 Nm (26.1%) for isometric arm strength, 7.1 Nm (20.6%) to 8.0 Nm (34.8%) for isokinetic arm strength, and 61.8 N (21.0%) to 73.3 N (25.6%) for grip strength. Muscle strength in the upper extremity can be reliably measured in persons with chronic stroke. Isometric measurements yield smaller measurement errors than isokinetic measurements and might be preferred, but the choice depends on the research question.
Reliable and valid assessment of Lichtenstein hernia repair skills.

PubMed

Carlsen, C G; Lindorff-Larsen, K; Funch-Jensen, P; Lund, L; Charles, P; Konge, L

2014-08-01

Lichtenstein hernia repair is a common surgical procedure and one of the first procedures performed by a surgical trainee. However, formal assessment tools developed for this procedure are few and sparsely validated. The aim of this study was to determine the reliability and validity of an assessment tool designed to measure surgical skills in Lichtenstein hernia repair. Key issues were identified through a focus group interview. On this basis, an assessment tool with eight items was designed. Ten surgeons and surgical trainees were video recorded while performing Lichtenstein hernia repair, (four experts, three intermediates, and three novices). The videos were blindly and individually assessed by three raters (surgical consultants) using the assessment tool. Based on these assessments, validity and reliability were explored. The internal consistency of the items was high (Cronbach's alpha = 0.97). The inter-rater reliability was very good with an intra-class correlation coefficient (ICC) = 0.93. Generalizability analysis showed a coefficient above 0.8 even with one rater. The coefficient improved to 0.92 if three raters were used. One-way analysis of variance found a significant difference between the three groups which indicates construct validity, p < 0.001. Lichtenstein hernia repair skills can be assessed blindly by a single rater in a reliable and valid fashion with the new procedure-specific assessment tool. We recommend this tool for future assessment of trainees performing Lichtenstein hernia repair to ensure that the objectives of competency-based surgical training are met.
Translation, cultural adaptation and validation of the Diabetes Attitudes Scale - third version into Brazilian Portuguese 1

PubMed Central

Vieira, Gisele de Lacerda Chaves; Pagano, Adriana Silvino; Reis, Ilka Afonso; Rodrigues, Júlia Santos Nunes; Torres, Heloísa de Carvalho

2018-01-01

ABSTRACT Objective: to perform the translation, adaptation and validation of the Diabetes Attitudes Scale - third version instrument into Brazilian Portuguese. Methods: methodological study carried out in six stages: initial translation, synthesis of the initial translation, back-translation, evaluation of the translated version by the Committee of Judges (27 Linguists and 29 health professionals), pre-test and validation. The pre-test and validation (test-retest) steps included 22 and 120 health professionals, respectively. The Content Validity Index, the analyses of internal consistency and reproducibility were performed using the R statistical program. Results: in the content validation, the instrument presented good acceptance among the Judges with a mean Content Validity Index of 0.94. The scale presented acceptable internal consistency (Cronbach’s alpha = 0.60), while the correlation of the total score at the test and retest moments was considered high (Polychoric Correlation Coefficient = 0.86). The Intra-class Correlation Coefficient, for the total score, presented a value of 0.65. Conclusion: the Brazilian version of the instrument (Escala de Atitudes dos Profissionais em relação ao Diabetes Mellitus) was considered valid and reliable for application by health professionals in Brazil. PMID:29319739
The accuracy of nurses' estimates of their absenteeism.

PubMed

Gaudine, Alice; Gregory, Connie

2010-07-01

The purpose of the present study was to determine the accuracy of nurses' self-reports of absence by examining: (1) the correlation, intra-class correlation, and Cronbach's alpha for self-reported absence and absence as reported in organizational records, (2) difference in central tendency for the two measures of absence and (3) the percentage of nurses who underestimate their absence. Research on nurses' absenteeism has often relied on self-reports of absence. However, nurses may not be aware of their actual absenteeism, or they may underestimate it. Self-reported absence from questionnaires completed by 215 Canadian nurses was compared with their absence from organizational records. There is a strong positive correlation, a strong intra-class correlation and Cronbach's alpha for the two measures of absence. However, there is a difference in central tendency that is related to the majority of nurses in this study (51.1%) underestimating their days absent from work. Research examining the predictors of absence may consider measuring absence with self-reports. Nevertheless, nurses demonstrated a bias to underestimate their absence. Feedback interventions to reduce absenteeism can be developed to include providing nurses with accurate information about their absence.
Development of a Brazilian Portuguese adapted version of the Gap-Kalamazoo communication skills assessment form.

PubMed

Amaral, Anna Beatriz C N; Rider, Elizabeth A; Lajolo, Paula P; Tone, Luiz G; Pinto, Rogerio M C; Lajolo, Marisa P; Calhoun, Aaron W

2016-12-11

The goal of this study was to translate, adapt and validate the items of the Gap-Kalamazoo Communication Skills Assessment Form for use in the Brazilian cultural setting. The Gap-Kalamazoo Communication Skills Assessment Form was translated into Portuguese by two independent bilingual Brazilian translators and was reconciled by a third bilingual healthcare professional. The translated text was then assessed for content using a modified Delphi technique and adjusted as needed to assure content validity. A total of nine phrases in the completed tool were adjusted. The final tool was then used to assess videotaped simulations as a means of validation. Response process was assessed using exploratory factor analysis and internal structure was assessed via Cronbach's Alpha (internal consistency) and Intraclass Correlation (test-retest reliability and inter-rater reliability). One hundred and four (104) videotaped communication skills simulations were assessed by 38 subjects (6 staff physicians, 4 faculty physicians, 8 resident physicians, 4 professional actors with experience in simulation, and 16 other allied healthcare professionals). Measures of Internal consistency (Cronbach's alpha = 0.818) and test-retest reliability (intra-class correlation coefficient = 0.942) were high. Exploratory factor analysis confirmed the uni-dimensionality of the instrument. Our results support the validity and reliability of the Brazilian Gap-Kalamazoo Communication Skills Assessment Form when used among Brazilian medical residents. The Brazilian version of Gap-Kalamazoo Communication Skills Assessment Form was found to be adequate both in the linguistic and technical aspects. The use of this instrument in Brazilian medical education can enhance the assessment of physician-patient-team relationships on an ongoing basis.
Validation of the Headache Impact Test (HIT-6™) across episodic and chronic migraine

PubMed Central

Yang, Min; Rendas-Baum, Regina; Varon, Sepideh F; Kosinski, Mark

2011-01-01

Objective: The purpose of this study was to assess psychometric properties of the six-item Headache Impact Text (HIT-6™) across episodic and chronic migraine. Methods: Using a migraine screener and number of headache days per month (HDPM), participants from the National Survey of Headache Impact (NSHI) study and the HIT-6 validation study (HIT6-V) were selected for this study. Eligible participants were categorized into three groups: chronic migraine (CM: ≥ 15 HDPM); episodic migraine (EM: < 15 HDPM); non-migraine headaches. Reliability and validity of the HIT-6 were evaluated. Results: A total of 2,049 survey participants met the inclusion/exclusion criteria for this study. Participants were identified as 6.4% CM; 42.1% EM; 51.5% non-migraine, with respective mean HIT-6 scores: 62.5 ± 7.8; 60.2 ± 6.8; and 49.1 ± 8.7. High reliability was demonstrated with internal consistency (time1/time2) of 0.83/0.87 in NSHI, and 0.82/0.92 in HIT6-V. Intra-class correlation for test-retest reliability was very good at 0.77. HIT-6 scores correlated significantly (p < .0001) with total Migraine Disability Assessment Scale scores (r = 0.56), headache pain severity (r = 0.46), and HDPM (r = 0.29). Discriminant validity analysis showed significantly different HIT-6 scores (F = 488.02, p < .0001) across the groups. Conclusion: Results from these analyses confirm that the HIT-6 is a reliable and valid tool for discriminating headache impact across episodic and chronic migraine. PMID:20819842
Translation and validation of the Dutch version of the Fear of Cancer Recurrence Inventory (FCRI-NL).

PubMed

van Helmondt, Sanne Jasperine; van der Lee, Marije Liesbeth; de Vries, Jolanda

2017-11-01

The study objectives are to translate the FCRI in Dutch, and to explore the factor structure and the psychometric qualities of the Dutch translation of the Fear of Cancer Recurrence Inventory (FCRI-NL). The original French-Canadian FCRI had been forward-backward translated into English by the developers, and this method was also used to translate the English version of the FCRI into Dutch. Patients were recruited via patient organizations between July 2011 and October 2013. To replicate the original 7-factor structure of the FCRI, confirmatory factor analysis (CFA) was performed. To examine the psychometric qualities, reliability (Cronbach's alpha), test-retest reliability (intra-class correlations; ICC), and convergent and divergent validity (Spearman's correlations) were calculated. From 290 cancer patients, 255 (88%) were eligible for analysis (aged 51.0±9.8years, 88.6% women). CFA showed a reasonable yet suboptimal fit of the hypothesized model to the data. The FCRI-NL has good reliability (Cronbach's α=0.93 for the total scale and α=0.75-0.92 for the subscales) and test-retest reliability (ICC=0.84 for the total scale and ICC=0.56-0.87 for the subscales). Convergent (r=0.53-0.66 for the FCRI-NL and r=0.48-0.57 for the FCRI-SF-NL) and divergent (r=-0.20--0.07 for the FCRI-NL and r=-0.28--0.17 for the FCRI-SF-NL) validity was demonstrated. The FCRI-NL seems to have sufficient psychometric properties. However, the FCRI-NL total score should be interpreted with caution. The Severity subscale (FCRI-SF-NL) may be a valuable screening tool for fear of cancer recurrence severity in clinical care. Copyright © 2017. Published by Elsevier Inc.
Translation, Validation and Cross-Cultural Adaptation of a Simplified-Chinese Version of the Tegner Activity Score in Chinese Patients with Anterior Cruciate Ligament Injury.

PubMed

Huang, Hongshi; Zhang, Dongxia; Jiang, Yanfang; Yang, Jie; Feng, Tao; Gong, Xi; Wang, Jianquan; Ao, Yingfang

2016-01-01

To translate the English version of Tegner Activity Score into a Simplified-Chinese version (Tegner-C) and evaluate its psychometric properties. Tegner-C was cross-culturally adapted according to established guidelines. The validity and reliability of Tegner-C were assessed in 78 participants, with 19-20 participants in each of the four groups: before anterior cruciate ligament reconstruction (pre-ACLR) group, 2-3 months after ACLR group, 3-12 months after ACLR group, and healthy control group. Each participant was asked to complete the Tegner-C and Chinese version of International Knee Documentation Committee Subjective Knee Form (IKDC-SKF-C) twice, with an interval of 5±2 days. Intra-class correlation coefficient (ICC2, 1) was used to assess the reliability and Spearman's rank correlation was used for construct validity. The ICC2,1 was higher than 0.90 for all groups except in the pre-ACLR group, for which the ICC2,1 was 0.71 (0.41, 0.87) (All with p<0.001). The absolute reliability as evaluated by the smallest detectable change was 0.43, 2.12, 0.89, and 0.44 for the healthy control group, pre-ACLR group, 2-3 months after ACLR group, and 3-12 months after ACLR group, respectively. Neither a ceiling effect nor a floor effect was observed for any group. Significant difference was observed for both Tegner-C and IKDC-SKF-C scores between the control and the other three groups (all with p<0.001), and between pre-ACLR and the 2-3 months after ACLR group (p<0.001). Tegner-C demonstrated comparable psychometric properties to the original English version and thus is reliable and valid for Chinese-speaking patients with ACL injury.
Reliability of Real-time Ultrasound Imaging for the Assessment of Trunk Stabilizer Muscles: A Systematic Review of the Literature.

PubMed

Taghipour, Morteza; Mohseni-Bandpei, Mohammad Ali; Behtash, Hamid; Abdollahi, Iraj; Rajabzadeh, Fatemeh; Pourahmadi, Mohammad Reza; Emami, Mahnaz

2018-04-24

Rehabilitative ultrasound (US) imaging is one of the popular methods for investigating muscle morphologic characteristics and dimensions in recent years. The reliability of this method has been investigated in different studies. As studies have been performed with different designs and quality, reported values of rehabilitative US have a wide range. The objective of this study was to systematically review the literature conducted on the reliability of rehabilitative US imaging for the assessment of deep abdominal and lumbar trunk muscle dimensions. The PubMed/MEDLINE, Scopus, Google Scholar, Science Direct, Embase, Physiotherapy Evidence, Ovid, and CINAHL databases were searched to identify original research articles conducted on the reliability of rehabilitative US imaging published from June 2007 to August 2017. The articles were qualitatively assessed; reliability data were extracted; and the methodological quality was evaluated by 2 independent reviewers. Of the 26 included studies, 16 were considered of high methodological quality. Except for 2 studies, all high-quality studies reported intraclass correlation coefficients (ICCs) for intra-rater reliability of 0.70 or greater. Also, ICCs reported for inter-rater reliability in high-quality studies were generally greater than 0.70. Among low-quality studies, reported ICCs ranged from 0.26 to 0.99 and 0.68 to 0.97 for intra- and inter-rater reliability, respectively. Also, the reported standard error of measurement and minimal detectable change for rehabilitative US were generally in an acceptable range. Generally, the results of the reviewed studies indicate that rehabilitative US imaging has good levels of both inter- and intra-rater reliability. © 2018 by the American Institute of Ultrasound in Medicine.
Reliability and Validity of the Early Years Physical Activity Questionnaire (EY-PAQ)

PubMed Central

Bingham, Daniel D.; Collings, Paul J.; Clemes, Stacy A.; Costa, Silvia; Santorelli, Gillian; Griffiths, Paula; Barber, Sally E.

2016-01-01

Measuring physical activity (PA) and sedentary time (ST) in young children (<5 years) is complex. Objective measures have high validity but require specialist expertise, are expensive, and can be burdensome for participants. A proxy-report instrument for young children that accurately measures PA and ST is needed. The aim of this study was to assess the reliability and validity of the Early Years Physical Activity Questionnaire (EY-PAQ). In a setting where English and Urdu are the predominant languages spoken by parents of young children, a sample of 196 parents and their young children (mean age 3.2 ± 0.8 years) from Bradford, UK took part in the study. A total of 156 (79.6%) questionnaires were completed in English and 40 (20.4%) were completed in transliterated Urdu. A total of 109 parents took part in the reliability aspect of the study, which involved completion of the EY-PAQ on two occasions (7.2 days apart; standard deviation (SD) = 1.1). All 196 participants took part in the validity aspect which involved comparison of EY-PAQ scores against accelerometry. Validty anaylsis used all data and data falling with specific MVPA and ST boundaries. Reliability was assessed using intra-class correlations (ICC) and validity by Bland–Altman plots and rank correlation coefficients. The test re-test reliability of the EY-PAQ was moderate for ST (ICC = 0.47) and fair for moderate-to-vigorous physical activity (MVPA)(ICC = 0.35). The EY-PAQ had poor agreement with accelerometer-determined ST (mean difference = −87.5 min·day−1) and good agreement for MVPA (mean difference = 7.1 min·day−1) limits of agreement were wide for all variables. The rank correlation coefficient was non-significant for ST (rho = 0.19) and significant for MVPA (rho = 0.30). The EY-PAQ has comparable validity and reliability to other PA self-report tools and is a promising population-based measure of young children’s habitual MVPA but not ST. In situations when objective methods are not possible for measurement of young children’s MVPA, the EY-PAQ may be a suitable alternative but only if boundaries are applied.
Validity and reliability of a modified english version of the physical activity questionnaire for adolescents.

PubMed

Aggio, Daniel; Fairclough, Stuart; Knowles, Zoe; Graves, Lee

2016-01-01

Adaptation of physical activity self-report questionnaires is sometimes required to reflect the activity behaviours of diverse populations. The processes used to modify self-report questionnaires though are typically underreported. This two-phased study used a formative approach to investigate the validity and reliability of the Physical Activity Questionnaire for Adolescents (PAQ-A) in English youth. Phase one examined test content and response process validity and subsequently informed a modified version of the PAQ-A. Phase two assessed the validity and reliability of the modified PAQ-A. In phase one, focus groups (n = 5) were conducted with adolescents (n = 20) to investigate test content and response processes of the original PAQ-A. Based on evidence gathered in phase one, a modified version of the questionnaire was administered to participants (n = 169, 14.5 ± 1.7 years) in phase two. Internal consistency and test-retest reliability were assessed using Cronbach's alpha and intra-class correlations, respectively. Spearman correlations were used to assess associations between modified PAQ-A scores and accelerometer-derived physical activity, self-reported fitness and physical activity self-efficacy. Phase one revealed that the original PAQ-A was unrepresentative for English youth and that item comprehension varied. Contextual and population/cultural-specific modifications were made to the PAQ-A for use in the subsequent phase. In phase two, modified PAQ-A scores had acceptable internal consistency (α = 0.72) and test-retest reliability (ICC = 0.78). Modified PAQ-A scores were significantly associated with objectively assessed moderate-to-vigorous physical activity (r = 0.39), total physical activity (r = 0.42), self-reported fitness (r = 0.35), and physical activity self-efficacy (r = 0.32) (p ≤ 0.01). The modified PAQ-A had acceptable internal consistency and test-retest reliability. Modified PAQ-A scores displayed weak-to-moderate correlations with objectively measured physical activity, self-reported fitness, and self-efficacy providing evidence of satisfactory criterion and construct validity, respectively. Further testing with more diverse English samples is recommended to provide a more complete assessment of the tool.
Measurement characteristics of the childhood Asthma-Control Test and a shortened, child-only version

PubMed Central

Bime, Christian; Gerald, Joe K; Wei, Christine Y; Holbrook, Janet T; Teague, William G; Wise, Robert A; Gerald, Lynn B

2016-01-01

The childhood Asthma-Control Test (C-ACT) is validated for assessing asthma control in paediatric asthma. Among children aged 4–11 years, the C-ACT requires the simultaneous presence of both parent and child. There is an unmet need for a tool that can be used to assess asthma control in children when parents or caregivers are not present such as in the school setting. We assessed the psychometric properties and estimated the minimally important difference (MID) of the C-ACT and a modified version, comprising only the child responses (C-ACTc). Asthma patients aged 6–11 years (n=161) from a previously completed multicenter randomised trial were included. Demographic information, spirometry and questionnaire scores were obtained at baseline and during follow-up. Participants or their guardians kept a daily asthma diary. Internal consistency reliabilities of the C-ACT and C-ACTc were 0.76 and 0.67 (Cronbach’s α), respectively. Test–retest reliabilities of the C-ACT and C-ACTc were 0.72 and 0.66 (intra-class correlation), respectively. Significant correlations were noted between C-ACT scores and ACQ scores (Spearman’s correlation r=−0.56, 95% CI (−0.66, −0.44), P<0.001). The strength of the correlation between C-ACTc scores and ACQ scores was weaker (Spearman’s correlation r=−0.46, 95% CI (−0.58, −0.33), P<0.001). We estimated the MID for the C-ACT and C-ACTc to be 2 points and 1 point, respectively. Among asthma patients aged 6–11 years, the C-ACT had good psychometric properties. The psychometric properties of a shortened child-only version (C-ACTc), although acceptable, are not as strong. PMID:27763622

THE NAVICULAR POSITION TEST – A RELIABLE MEASURE OF THE NAVICULAR BONE POSITION DURING REST AND LOADING

PubMed Central

Spörndly-Nees, Søren; Dåsberg, Brian; Nielsen, Rasmus Oestergaard; Boesen, Morten Ilum

2011-01-01

Background: Lower limb injuries are a large problem in athletes. However, there is a paucity of knowledge on the relationship between alignment of the medial longitudinal arch (MLA) of the foot and development of such injuries. A reliable and valid test to quantify foot type is needed to be able to investigate the relationship between arch type and injury likelihood. Feiss Line is a valid clinical measure of the MLA. However, no study has investigated the reliability of the test. Objectives: The purpose was to describe a modified version of the Feiss Line test and to determine the intra- and inter-tester reliability of this new foot alignment test. To emphasize the purpose of the modified test, the authors have named it The Navicular Position Test. Methods: Intra- and inter-tester reliability were evaluated of The Navicular Position Test with the use of ICC (interclass correlation coefficient) and Bland-Altman limits of agreement on 43 healthy, young, subjects. Results: Inter-tester mean difference -0.35 degrees [–1.32; 0.62] p = 0.47. Bland-Altman limits of agreement –6.55 to 5.85 degrees, ICC = 0.94. Intra-tester mean difference 0.47 degrees [–0.57; 1.50] p = 0.37. Bland-Altman limits of agreement –6.15 to 7.08 degrees, ICC = 0.91. Discussion: The present data support The Navicular Position Test as a reliable test of the navicular bone position during rest and loading measured in a simple test set-up. Conclusion: The Navicular Position Test was shown to have a high intraday-, intra- and inter-tester reliability. When cut off values to categorize the MLA into planus, rectus, or cavus feet, has been determined and presented, the test could be used in prospective observational studies investigating the role of the arch type on the development of various lower limb injuries. PMID:21904698
Evaluation of the iPhone with an acrylic sleeve versus the Scoliometer for rib hump measurement in scoliosis.

PubMed

Izatt, Maree T; Bateman, Gary R; Adam, Clayton J

2012-07-30

Vertebral rotation found in structural scoliosis contributes to trunkal asymmetry which is commonly measured with a simple Scoliometer device on a patient's thorax in the forward flexed position. The new generation of mobile 'smartphones' have an integrated accelerometer, making accurate angle measurement possible, which provides a potentially useful clinical tool for assessing rib hump deformity. This study aimed to compare rib hump angle measurements performed using a Smartphone and traditional Scoliometer on a set of plaster torsos representing the range of torsional deformities seen in clinical practice. Nine observers measured the rib hump found on eight plaster torsos moulded from scoliosis patients with both a Scoliometer and an Apple iPhone on separate occasions. Each observer repeated the measurements at least a week after the original measurements, and were blinded to previous results. Intra-observer reliability and inter-observer reliability were analysed using the method of Bland and Altman and 95% confidence intervals were calculated. The Intra-Class Correlation Coefficients (ICC) were calculated for repeated measurements of each of the eight plaster torso moulds by the nine observers. Mean absolute difference between pairs of iPhone/Scoliometer measurements was 2.1 degrees, with a small (1 degrees) bias toward higher rib hump angles with the iPhone. 95% confidence intervals for intra-observer variability were +/- 1.8 degrees (Scoliometer) and +/- 3.2 degrees (iPhone). 95% confidence intervals for inter-observer variability were +/- 4.9 degrees (iPhone) and +/- 3.8 degrees (Scoliometer). The measurement errors and confidence intervals found were similar to or better than the range of previously published thoracic rib hump measurement studies. The iPhone is a clinically equivalent rib hump measurement tool to the Scoliometer in spinal deformity patients. The novel use of plaster torsos as rib hump models avoids the variables of patient fatigue and discomfort, inconsistent positioning and deformity progression using human subjects in a single or multiple measurement sessions.
Evaluation of the iPhone with an acrylic sleeve versus the Scoliometer for rib hump measurement in scoliosis

PubMed Central

2012-01-01

Background Vertebral rotation found in structural scoliosis contributes to trunkal asymmetry which is commonly measured with a simple Scoliometer device on a patient's thorax in the forward flexed position. The new generation of mobile 'smartphones' have an integrated accelerometer, making accurate angle measurement possible, which provides a potentially useful clinical tool for assessing rib hump deformity. This study aimed to compare rib hump angle measurements performed using a Smartphone and traditional Scoliometer on a set of plaster torsos representing the range of torsional deformities seen in clinical practice. Methods Nine observers measured the rib hump found on eight plaster torsos moulded from scoliosis patients with both a Scoliometer and an Apple iPhone on separate occasions. Each observer repeated the measurements at least a week after the original measurements, and were blinded to previous results. Intra-observer reliability and inter-observer reliability were analysed using the method of Bland and Altman and 95% confidence intervals were calculated. The Intra-Class Correlation Coefficients (ICC) were calculated for repeated measurements of each of the eight plaster torso moulds by the nine observers. Results Mean absolute difference between pairs of iPhone/Scoliometer measurements was 2.1 degrees, with a small (1 degrees) bias toward higher rib hump angles with the iPhone. 95% confidence intervals for intra-observer variability were +/- 1.8 degrees (Scoliometer) and +/- 3.2 degrees (iPhone). 95% confidence intervals for inter-observer variability were +/- 4.9 degrees (iPhone) and +/- 3.8 degrees (Scoliometer). The measurement errors and confidence intervals found were similar to or better than the range of previously published thoracic rib hump measurement studies. Conclusions The iPhone is a clinically equivalent rib hump measurement tool to the Scoliometer in spinal deformity patients. The novel use of plaster torsos as rib hump models avoids the variables of patient fatigue and discomfort, inconsistent positioning and deformity progression using human subjects in a single or multiple measurement sessions. PMID:22846346
Intra- and interrater reliability of the Chicago Classification of achalasia subtypes in pediatric high-resolution esophageal manometry (HRM) recordings.

PubMed

Singendonk, M M J; Rosen, R; Oors, J; Rommel, N; van Wijk, M P; Benninga, M A; Nurko, S; Omari, T I

2017-11-01

Subtyping achalasia by high-resolution manometry (HRM) is clinically relevant as response to therapy and prognosis have shown to vary accordingly. The aim of this study was to assess inter- and intrarater reliability of diagnosing achalasia and achalasia subtyping in children using the Chicago Classification (CC) V3.0. Six observers analyzed 40 pediatric HRM recordings (22 achalasia and 18 non-achalasia) twice by using dedicated analysis software (ManoView 3.0, Given Imaging, Los Angeles, CA, USA). Integrated relaxation pressure (IRP4s), distal contractile integral (DCI), intrabolus pressurization pattern (IBP), and distal latency (DL) were extracted and analyzed hierarchically. Cohen's κ (2 raters) and Fleiss' κ (>2 raters) and the intraclass correlation coefficient (ICC) were used for categorical and ordinal data, respectively. Based on the results of dedicated analysis software only, intra- and interrater reliability was excellent and moderate (κ=0.89 and κ=0.52, respectively) for differentiating achalasia from non-achalasia. For subtyping achalasia, reliability decreased to substantial and fair (κ=0.72 and κ=0.28, respectively). When observers were allowed to change the software-driven diagnosis according to their own interpretation of the manometric patterns, intra- and interrater reliability increased for diagnosing achalasia (κ=0.98 and κ=0.92, respectively) and for subtyping achalasia (κ=0.79 and κ=0.58, respectively). Intra- and interrater agreement for diagnosing achalasia when using HRM and the CC was very good to excellent when results of automated analysis software were interpreted by experienced observers. More variability was seen when relying solely on the software-driven diagnosis and for subtyping achalasia. Therefore, diagnosing and subtyping achalasia should be performed in pediatric motility centers with significant expertise. © 2017 John Wiley & Sons Ltd.
Limited utility of tissue micro-arrays in detecting intra-tumoral heterogeneity in stem cell characteristics and tumor progression markers in breast cancer.

PubMed

Kündig, Pascale; Giesen, Charlotte; Jackson, Hartland; Bodenmiller, Bernd; Papassotirolopus, Bärbel; Freiberger, Sandra Nicole; Aquino, Catharine; Opitz, Lennart; Varga, Zsuzsanna

2018-05-08

Intra-tumoral heterogeneity has been recently addressed in different types of cancer, including breast cancer. A concept describing the origin of intra-tumoral heterogeneity is the cancer stem-cell hypothesis, proposing the existence of cancer stem cells that can self-renew limitlessly and therefore lead to tumor progression. Clonal evolution in accumulated single cell genomic alterations is a further possible explanation in carcinogenesis. In this study, we addressed the question whether intra-tumoral heterogeneity can be reliably detected in tissue-micro-arrays in breast cancer by comparing expression levels of conventional predictive/prognostic tumor markers, tumor progression markers and stem cell markers between central and peripheral tumor areas. We analyzed immunohistochemical expression and/or gene amplification status of conventional prognostic tumor markers (ER, PR, HER2, CK5/6), tumor progression markers (PTEN, PIK3CA, p53, Ki-67) and stem cell markers (mTOR, SOX2, SOX9, SOX10, SLUG, CD44, CD24, TWIST) in 372 tissue-micro-array samples from 72 breast cancer patients. Expression levels were compared between central and peripheral tumor tissue areas and were correlated to histopathological grading. 15 selected cases additionally underwent RNA sequencing for transcriptome analysis. No significant difference in any of the analyzed between central and peripheral tumor areas was seen with any of the analyzed methods/or results that showed difference. Except mTOR, PIK3CA and SOX9 (nuclear) protein expression, all markers correlated significantly (p < 0.05) with histopathological grading both in central and peripheral areas. Our results suggest that intra-tumoral heterogeneity of stem-cell and tumor-progression markers cannot be reliably addressed in tissue-micro-array samples in breast cancer. However, most markers correlated strongly with histopathological grading confirming prognostic information as expression profiles were independent on the site of the biopsy was taken.
Examiner Training and Reliability in Two Randomized Clinical Trials of Adult Dental Caries

PubMed Central

Banting, David W.; Amaechi, Bennett T.; Bader, James D.; Blanchard, Peter; Gilbert, Gregg H.; Gullion, Christina M.; Holland, Jan Carlton; Makhija, Sonia K.; Papas, Athena; Ritter, André V.; Singh, Mabi L.; Vollmer, William M.

2013-01-01

Objectives This report describes the training of dental examiners participating in two dental caries clinical trials and reports the inter- and intra- examiner reliability scores from the initial standardization sessions. Methods Study examiners were trained to use a modified ICDAS-II system to detect the visual signs of non-cavitated and cavitated dental caries in adult subjects. Dental caries was classified as no caries (S), non-cavitated caries (D1), enamel caries (D2) and dentine caries (D3). Three standardization sessions involving 60 subjects and 3604 tooth surface calls were used to calculate several measures of examiner reliability. Results The prevalence of dental caries observed in the standardization sessions ranged from 1.4% to 13.5% of the coronal tooth surfaces examined. Overall agreement between pairs of examiners ranged from 0.88 to 0.99. An intra-class coefficient threshold of 0.60 was surpassed for all but one examiner. Inter-examiner unweighted kappa values were low (0.23– 0.35) but weighted kappas and the ratio of observed to maximum kappas were more encouraging (0.42– 0.83). The highest kappa values occurred for the S/D1 vs. D2/D3 two-level classification of dental caries, for which seven of the eight examiners achieved observed to maximum kappa values over 0.90.Intra-examiner reliability was notably higher than inter-examiner reliability for all measures and dental caries classification systems employed. Conclusion The methods and results for the initial examiner training and standardization sessions for two large clinical trials are reported. Recommendations for others planning examiner training and standardization sessions are offered. PMID:22320292
Examiner training and reliability in two randomized clinical trials of adult dental caries.

PubMed

Banting, David W; Amaechi, Bennett T; Bader, James D; Blanchard, Peter; Gilbert, Gregg H; Gullion, Christina M; Holland, Jan Carlton; Makhija, Sonia K; Papas, Athena; Ritter, André V; Singh, Mabi L; Vollmer, William M

2011-01-01

This report describes the training of dental examiners participating in two dental caries clinical trials and reports the inter- and intra-examiner reliability scores from the initial standardization sessions. Study examiners were trained to use a modified International Caries Detection and Assessment System II system to detect the visual signs of non-cavitated and cavitated dental caries in adult subjects. Dental caries was classified as no caries (S), non-cavitated caries (D1), enamel caries (D2), and dentine caries (D3). Three standardization sessions involving 60 subjects and 3,604 tooth surface calls were used to calculate several measures of examiner reliability. The prevalence of dental caries observed in the standardization sessions ranged from 1.4 percent to 13.5 percent of the coronal tooth surfaces examined. Overall agreement between pairs of examiners ranged from 0.88 to 0.99. An intra-class coefficient threshold of 0.60 was surpassed for all but one examiner. Inter-examiner unweighted kappa values were low (0.23-0.35), but weighted kappas and the ratio of observed to maximum kappas were more encouraging (0.42-0.83). The highest kappa values occurred for the S/D1 versus D2/D3 two-level classification of dental caries, for which seven of the eight examiners achieved observed to maximum kappa values over 0.90. Intra-examiner reliability was notably higher than inter-examiner reliability for all measures and dental caries classifications employed. The methods and results for the initial examiner training and standardization sessions for two large clinical trials are reported. Recommendations for others planning examiner training and standardization sessions are offered. © 2011 American Association of Public Health Dentistry.
Cross-cultural adaptation of the Korean version of the Boston carpal tunnel questionnaire: its clinical evaluation in patients with carpal tunnel syndrome following local corticosteroid injection.

PubMed

Park, Dong-Jin; Kang, Ji-Hyoun; Lee, Jeong-Won; Lee, Kyung-Eun; Wen, Lihui; Kim, Tae-Jong; Park, Yong-Wook; Nam, Tai-Seung; Kim, Myung-Sun; Lee, Shin-Seok

2013-07-01

The aim of this study was to assess and validate the Korean version of the Boston Carpal Tunnel Questionnaire (K-BCTQ) in patients with carpal tunnel syndrome (CTS). After translation and cultural adaptation of the BCTQ to a Korean version, the K-BCTQ was administered to 54 patients with CTS; it was administered again after 2 weeks to assess reliability. Additionally, we administered K-DASH and EQ-5D to assess construct-validity. In a prospective study of responsiveness to clinical change, 29 of 54 patients were treated by ultrasonography-guided local corticosteroid injection therapy. The internal consistency of the K-BCTQ was high (Cronbach's alpha: 0.915) and the intra-class correlation coefficients were 0.931 for the symptom severity scale (P<0.001) and 0.844 for the functional severity scale (P<0.001). The construct-validity between the symptom severity scale and the K-DASH, and between the functional severity scale and the K-DASH were significantly correlated (both P<0.001). Clinical improvement was noted in 29 patients with injection therapy. The effect size of symptom severity was 0.67, and that of functional severity was 0.58. In conclusion, the K-BCTQ shows good reliability, construct-validity, and acceptable responsiveness after local corticosteroid injection therapy (Clinical trial number, KCT0000050).
Validity of faculty and resident global assessment of medical students' clinical knowledge during their pediatrics clerkship.

PubMed

Dudas, Robert A; Colbert, Jorie M; Goldstein, Seth; Barone, Michael A

2012-01-01

Medical knowledge is one of six core competencies in medicine. Medical student assessments should be valid and reliable. We assessed the relationship between faculty and resident global assessment of pediatric medical student knowledge and performance on a standardized test in medical knowledge. Retrospective cross-sectional study of medical students on a pediatric clerkship in academic year 2008-2009 at one academic health center. Faculty and residents rated students' clinical knowledge on a 5-point Likert scale. The inter-rater reliability of clinical knowledge ratings was assessed by calculating the intra-class correlation coefficient (ICC) for residents' ratings, faculty ratings, and both rating types combined. Convergent validity between clinical knowledge ratings and scores on the National Board of Medical Examiners (NBME) clinical subject examination in pediatrics was assessed with Pearson product moment correlation correction and the coefficient of the determination. There was moderate agreement for global clinical knowledge ratings by faculty and moderate agreement for ratings by residents. The agreement was also moderate when faculty and resident ratings were combined. Global ratings of clinical knowledge had high convergent validity with pediatric examination scores when students were rated by both residents and faculty. Our findings provide evidence for convergent validity of global assessment of medical students' clinical knowledge with NBME subject examination scores in pediatrics. Copyright Â© 2012 Academic Pediatric Association. Published by Elsevier Inc. All rights reserved.
[Validity and reliability of the screening questionnaire for geriatric depression used in the Mexican Health and Age Study].

PubMed

Aguilar-Navarro, Sara Gloria; Fuentes-Cantú, Alejandro; Avila-Funes, José Alberto; García-Mayo, Emilio José

2007-01-01

To assess the validity and reliability of a geriatric depression questionnaire used in the Mexican Health and Age Study (MHAS). The study was conducted at the Instituto Nacional de Ciencias Médicas y Nutrición Salvador Zubirán (INCMNSZ) clinic from May 2005 to March 2006. This depression screening nine-item questionnaire was validated using the Diagnostic and Statistical Manual of Mental Disorders (DSM-IV-TR) (fourth revised version) and Yesavage's 15-item Geriatric Depression Scale (GDS-15) criteria. The instrument belongs to the MHAS, a prospective panel study of health and aging in Mexico. A total of 199 subjects 65 years of age and older participated in the validation process (median age= 79.5 years). MHAS questionnaire result was significantly correlated to the clinical depression diagnosis (p<0.001) and to the GDS-15 score (p<0.001). Internal consistency was adequate (alpha coefficient: 0.74). The cutoff point > or = 5/9 points yielded an 80.7% and 68.7% sensitivity and specificity respectively. The fidelity for the test retest was excellent (intra-class correlation coefficient= 0.933). Finally, the Bland and Altman agreement points indicated a difference 0.22 percent points between test retest. The MHAS questionnaire is valid and trustworthy, and allows screening in the research field for the presence of depression in the elderly.
A new music therapy engagement scale for persons with dementia.

PubMed

Tan, Jane; Wee, Shiou-Liang; Yeo, Pei Shi; Choo, Juliet; Ritholz, Michele; Yap, Philip

2018-05-25

ABSTRACTObjectives:To develop and validate a new scale to assess music therapy engagement in persons with dementia (PWDs). A draft scale was derived from literature review and >2 years of qualitative recording of PWDs during music therapy. Content validity was attained through iterative consultations, trial sessions, and revisions. The final five-item Music Therapy Engagement scale for Dementia (MTED) assessed music and non-music related elements. Internal consistency and inter-rater reliability were assessed over 120 music therapy sessions. MTED was validated with the Greater Cincinnati Chapter Well-being Observation Tool, Holden Communication Scale, and Participant Engagement Observation Checklist - Music Sessions. A total of 62 PWDs (83.2 ± 7.7 years, modified version of the mini-mental state examination = 13.2/30 ± 4.1) in an acute hospital dementia unit were involved. The mean MTED score was 13.02/30 ± 4.27; internal consistency (Cronbach's α = 0.87) and inter-rater reliability (intra-class correlation = 0.96) were good. Principal component analysis revealed a one-factor structure with Eigen value > 1 (3.27), which explained 65.4% of the variance. MTED demonstrated good construct validity. The MTED total score correlated strongly with the combined items comprising Pleasure, Interest, Sadness, and Sustained attention of the Greater Cincinnati Chapter Well-being Observation Tool (rs = 0.88, p < 0.001). MTED is a clinically appropriate and psychometrically valid scale to evaluate music therapy engagement in PWDs.
Intra-observer reproducibility and diagnostic performance of breast shear-wave elastography in Asian women.

PubMed

Park, Hye Young; Han, Kyung Hwa; Yoon, Jung Hyun; Moon, Hee Jung; Kim, Min Jung; Kim, Eun-Kyung

2014-06-01

Our aim was to evaluate intra-observer reproducibility of shear-wave elastography (SWE) in Asian women. Sixty-four breast masses (24 malignant, 40 benign) were examined with SWE in 53 consecutive Asian women (mean age, 44.9 y old). Two SWE images were obtained for each of the lesions. The intra-observer reproducibility was assessed by intra-class correlation coefficients (ICC). We also evaluated various clinicoradiologic factors that can influence reproducibility in SWE. The ICC of intra-observer reproducibility was 0.789. In clinicoradiologic factor evaluation, masses surrounded by mixed fatty and glandular tissue (ICC: 0.619) showed lower intra-observer reproducibility compared with lesions that were surrounded by glandular tissue alone (ICC: 0.937; p < 0.05). Overall, the intra-observer reproducibility of breast SWE was excellent in Asian women. However, it may decrease when breast tissue is in a heterogeneous background. Therefore, SWE should be performed carefully in these cases. Copyright © 2014 World Federation for Ultrasound in Medicine & Biology. Published by Elsevier Inc. All rights reserved.
The modified patient enablement instrument: a Portuguese cross-cultural adaptation, validity and reliability study.

PubMed

Remelhe, Mafalda; Teixeira, Pedro M; Lopes, Irene; Silva, Luís; Correia de Sousa, Jaime

2017-01-12

Enabling patients with asthma to obtain the knowledge, confidence and skills they need in order to assume a major role in the management of their disease is cost effective. It should be an integral part of any plan for long-term control of asthma. The modified Patient Enablement Instrument (mPEI) is an easily administered questionnaire that was adapted in the United Kingdom to measure patient enablement in asthma, but its applicability in Portugal is not known. Validity and reliability of questionnaires should be tested before use in settings different from those of the original version. The purpose of this study was to test the applicability of the mPEI to Portuguese asthma patients after translation and cross-cultural adaptation, and to verify the structural validity, internal consistency and reproducibility of the instrument. The mPEI was translated to Portuguese and back translated to English. Its content validity was assessed by a debriefing interview with 10 asthma patients. The translated instrument was then administered to a random sample of 142 patients with persistent asthma. Structural validity and internal consistency were assessed. For reproducibility analysis, 86 patients completed the instrument again 7 days later. Item-scale correlations and exploratory factor analysis were used to assess structural validity. Cronbach's alpha was used to test internal consistency, and the intra-class correlation coefficient was used for the analysis of reproducibility. All items of the Portuguese version of the mPEI were found to be equivalent to the original English version. There were strong item-scale correlations that confirmed construct validity, with a one component structure and good internal consistency (Cronbach's alpha >0.8) as well as high test-retest reliability (ICC=0.85). The mPEI showed sound psychometric properties for the evaluation of enablement in patients with asthma making it a reliable instrument for use in research and clinical practice in Portugal. Further studies are needed to confirm its responsiveness.
Psychometric Properties of Translation of the Child Perception Questionnaire (CPQ11-14) in Telugu Speaking Indian Children.

PubMed

Kumar, Santhosh; Kroon, Jeroen; Lalloo, Ratilal; Johnson, Newell W

2016-01-01

Oral health related quality of life research among children in India is still nascent and no measures have been validated to date. Although CPQ11-14 has been previously used in studies from the Indian sub-continent, the instrument has never been tested for cross-cultural adaptability. This study aimed to assess the validity and reliability of CPQ11-14 in Telugu speaking Indian school children. Primary school children of Medak district, Telangana State, India, were recruited by a multi-stage probability sampling method. The translated questionnaire was initially pilot tested on a small subset of children (n = 40). Children with informed consent from parents (N = 1342) were then provided with questionnaires containing the Telugu translation of CPQ11-14, followed by a clinical examination conducted by a single examiner, using Basic WHO survey methods for dental caries, malocclusion, and Dean's Fluorosis index. Children (n = 161) in randomly chosen schools were re-administered the same questionnaire after a two week interval to test reliability of CPQ11-14 on repeated administrations. Internal consistency and test-retest reliability as determined by Cronbach's alpha and Intra-class correlation coefficient for overall CPQ11-14 scale were 0.925 and 0.923, respectively. CPQ11-14 discriminated between the categories of fluorosis and malocclusion while its discriminant validity with respect to dental caries was limited. CPQ11-14 also demonstrated good construct validity with both overall CPQ11-14 and its subscales having significant positive correlation with global ratings of oral health and overall wellbeing, even after adjusting for confounding variables. CPQ11-14 had a correlation of 0.405 with self-evaluated oral health and 0.407 with self-evaluated impact of oral health on overall wellbeing. In conclusion, Telugu translation of CPQ11-14 demonstrated good internal consistency and excellent reliability on repeated administrations after two weeks. It also exhibited good discriminant and construct validity.
The reliability of the quantitative timed up and go test (QTUG) measured over five consecutive days under single and dual-task conditions in community dwelling older adults.

PubMed

Smith, Erin; Walsh, Lorcan; Doyle, Julie; Greene, Barry; Blake, Catherine

2016-01-01

The timed up and go (TUG) test is a commonly used assessment in older people with variations including the addition of a motor or cognitive dual-task, however in high functioning older adults it is more difficult to assess change. The quantified TUG (QTUG) uses inertial sensors to detect test and gait parameters during the test. If it is to be used in the longitudinal assessment of older adults, it is important that we know which parameters are reliable and under which conditions. This study aims to examine the relative reliability of the QTUG over five consecutive days under single, motor and cognitive dual-task conditions. Twelve community dwelling older adults (10 females, mean age 74.17 (3.88)) performed the QTUG under three conditions for five consecutive days. The relative reliability of each of the gait parameters was assessed using intra-class correlation coefficient (ICC 3,1) and standard error of measurement (SEM). Five of the measures demonstrated excellent reliability (ICC>0.70) under all three conditions (time to complete test, walk time, number of gait cycles, number of steps and return from turn time). Measures of variability and turn derived parameters demonstrated weak reliability under all three conditions (ICC=0.05-0.49). For the most reliable parameters under single-task conditions, the addition of a cognitive task resulted in a reduction in reliability suggesting caution when interpreting results under these conditions. Certain sensor derived parameters during the QTUG test may provide an additional resource in the longitudinal assessment of older people and earlier identification of falls risk. Copyright © 2015 Elsevier B.V. All rights reserved.
The test-retest reliability of the latent construct of executive function depends on whether tasks are represented as formative or reflective indicators.

PubMed

Willoughby, Michael T; Kuhn, Laura J; Blair, Clancy B; Samek, Anya; List, John A

2017-10-01

This study investigates the test-retest reliability of a battery of executive function (EF) tasks with a specific interest in testing whether the method that is used to create a battery-wide score would result in differences in the apparent test-retest reliability of children's performance. A total of 188 4-year-olds completed a battery of computerized EF tasks twice across a period of approximately two weeks. Two different approaches were used to create a score that indexed children's overall performance on the battery-i.e., (1) the mean score of all completed tasks and (2) a factor score estimate which used confirmatory factor analysis (CFA). Pearson and intra-class correlations were used to investigate the test-retest reliability of individual EF tasks, as well as an overall battery score. Consistent with previous studies, the test-retest reliability of individual tasks was modest (rs ≈ .60). The test-retest reliability of the overall battery scores differed depending on the scoring approach (r mean = .72; r factor_ score = .99). It is concluded that the children's performance on individual EF tasks exhibit modest levels of test-retest reliability. This underscores the importance of administering multiple tasks and aggregating performance across these tasks in order to improve precision of measurement. However, the specific strategy that is used has a large impact on the apparent test-retest reliability of the overall score. These results replicate our earlier findings and provide additional cautionary evidence against the routine use of factor analytic approaches for representing individual performance across a battery of EF tasks.
Test-retest reliability and construct validity of the ENERGY-child questionnaire on energy balance-related behaviours and their potential determinants: the ENERGY-project.

PubMed

Singh, Amika S; Vik, Froydis N; Chinapaw, Mai J M; Uijtdewilligen, Léonie; Verloigne, Maïté; Fernández-Alvira, Juan M; Stomfai, Sarolta; Manios, Yannis; Martens, Marloes; Brug, Johannes

2011-12-09

Insight in children's energy balance-related behaviours (EBRBs) and their determinants is important to inform obesity prevention research. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. To examine the test-retest reliability and construct validity of the child questionnaire used in the ENERGY-project, measuring EBRBs and their potential determinants among 10-12 year old children. We collected data among 10-12 year old children (n = 730 in the test-retest reliability study; n = 96 in the construct validity study) in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent face-to-face interview was assessed using ICC and percentage agreement. Of the 150 questionnaire items, 115 (77%) showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Test-retest reliability was moderate for 34 items (23%) and poor for one item. Construct validity appeared to be good to excellent for 70 (47%) of the 150 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 80 items, construct validity was moderate for 39 (26%) and poor for 41 items (27%). Our results demonstrate that the ENERGY-child questionnaire, assessing EBRBs of the child as well as personal, family, and school-environmental determinants related to these EBRBs, has good test-retest reliability and moderate to good construct validity for the large majority of items.
Test-retest reliability and construct validity of the ENERGY-child questionnaire on energy balance-related behaviours and their potential determinants: the ENERGY-project

PubMed Central

2011-01-01

Background Insight in children's energy balance-related behaviours (EBRBs) and their determinants is important to inform obesity prevention research. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. Objective To examine the test-retest reliability and construct validity of the child questionnaire used in the ENERGY-project, measuring EBRBs and their potential determinants among 10-12 year old children. Methods We collected data among 10-12 year old children (n = 730 in the test-retest reliability study; n = 96 in the construct validity study) in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent face-to-face interview was assessed using ICC and percentage agreement. Results Of the 150 questionnaire items, 115 (77%) showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Test-retest reliability was moderate for 34 items (23%) and poor for one item. Construct validity appeared to be good to excellent for 70 (47%) of the 150 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 80 items, construct validity was moderate for 39 (26%) and poor for 41 items (27%). Conclusions Our results demonstrate that the ENERGY-child questionnaire, assessing EBRBs of the child as well as personal, family, and school-environmental determinants related to these EBRBs, has good test-retest reliability and moderate to good construct validity for the large majority of items. PMID:22152048
Test-retest reliability of sensor-based sit-to-stand measures in young and older adults.

PubMed

Regterschot, G Ruben H; Zhang, Wei; Baldus, Heribert; Stevens, Martin; Zijlstra, Wiebren

2014-01-01

This study investigated test-retest reliability of sensor-based sit-to-stand (STS) peak power and other STS measures in young and older adults. In addition, test-retest reliability of the sensor method was compared to test-retest reliability of the Timed Up and Go Test (TUGT) and Five-Times-Sit-to-Stand Test (FTSST) in older adults. Ten healthy young female adults (20-23 years) and 31 older adults (21 females; 73-94 years) participated in two assessment sessions separated by 3-8 days. Vertical peak power was assessed during three (young adults) and five (older adults) normal and fast STS trials with a hybrid motion sensor worn on the hip. Older adults also performed the FTSST and TUGT. The average sensor-based STS peak power of the normal STS trials and the average sensor-based STS peak power of the fast STS trials showed excellent test-retest reliability in young adults (intra-class correlation (ICC)≥0.90; zero in 95% confidence interval of mean difference between test and retest (95%CI of D); standard error of measurement (SEM)≤6.7% of mean peak power) and older adults (ICC≥0.91; zero in 95%CI of D; SEM≤9.9%). Test-retest reliability of sensor-based STS peak power and TUGT (ICC=0.98; zero in 95%CI of D; SEM=8.5%) was comparable in older adults, test-retest reliability of the FTSST was lower (ICC=0.73; zero outside 95%CI of D; SEM=14.4%). Sensor-based STS peak power demonstrated excellent test-retest reliability and may therefore be useful for clinical assessment of functional status and fall risk. Copyright © 2014 Elsevier B.V. All rights reserved.
Assessing Lower Limb Alignment: Comparison of Standard Knee Xray vs Long Leg View.

PubMed

Zampogna, Biagio; Vasta, Sebastiano; Amendola, Annunziato; Uribe-Echevarria Marbach, Bastian; Gao, Yubo; Papalia, Rocco; Denaro, Vincenzo

2015-01-01

High tibial osteotomy (HTO) is a well-established and commonly utilized technique in medial knee osteoarthritis secondary to varus malalignment. Accurate measurement of the preoperative limb alignment, and the amount of correction required are essential when planning limb realignment surgery. The hip-knee-ankle angle (HKA) measured on a full length weightbearing (FLWB) X-ray in the standing position is considered the gold standard, since it allows for reliable and accurate measurement of the mechanical axis of the whole lower extremity. In general practice, alignment is often evaluated on standard anteroposterior weightbearing (APWB) X-rays, as the angle between the femur and tibial anatomic axis (TFa). It is, therefore, of value to establish if measuring the anatomical axis from limited APWB is an effective measure of knee alignment especially in patients undergoing osteotomy about the knee. Three independent observers measured preoperative and postoperative FTa with standard method (FTa1) and with circles method (FTa2) on APWB X-ray and the HKA on FLWB X-ray at three different time-points separated by a two-week period. Intra-observer and inter-observer reliabilities and the comparison and relationship between anatomical and mechanical alignment were calculated. Intra- and interclass coefficients for all the three methods indicated excellent reliability, having all the values above 0.80. Using the mean of paired t-student test, the comparison of HKA versus TFa1 and TFa2 showed a statistically significant difference (p<.0001) both for the pre-operative and post-operative sets of values. The correlation between the HKA and FTal was found poor for the preoperative set (R=0.26) and fair for the postoperative one (R=0.53), while the new circles method showed a higher correlation in both the preoperative (R=0.71) and postoperative sets (R=0.79). Intra-observer reliability was high for HKA, FTal and FTa2 on APWB x-rays in the pre- and post-operative setting. Inter-rater reliability was higher for HKA and TFa2 compared to FTal. The femoro-tibial angle as measured on APWB with the traditional method (FTal) has a weak correlation with the HKA, and based on these findings, should not be used in everyday practice. The FTa2 showed better correlation with the HKA, although not excellent. Level III, Retrospective study.

Assessing Lower Limb Alignment: Comparison of Standard Knee Xray vs Long Leg View

PubMed Central

Zampogna, Biagio; Vasta, Sebastiano; Amendola, Annunziato; Uribe-Echevarria Marbach, Bastian; Gao, Yubo; Papalia, Rocco; Denaro, Vincenzo

2015-01-01

Background High tibial osteotomy (HTO) is a well-established and commonly utilized technique in medial knee osteoarthritis secondary to varus malalignment. Accurate measurement of the preoperative limb alignment, and the amount of correction required are essential when planning limb realignment surgery. The hip-knee-ankle angle (HKA) measured on a full length weightbearing (FLWB) X-ray in the standing position is considered the gold standard, since it allows for reliable and accurate measurement of the mechanical axis of the whole lower extremity. In general practice, alignment is often evaluated on standard anteroposterior weightbearing (APWB) X-rays, as the angle between the femur and tibial anatomic axis (TFa). It is, therefore, of value to establish if measuring the anatomical axis from limited APWB is an effective measure of knee alignment especially in patients undergoing osteotomy about the knee. Methods Three independent observers measured preoperative and postoperative FTa with standard method (FTa1) and with circles method (FTa2) on APWB X-ray and the HKA on FLWB X-ray at three different time-points separated by a two-week period. Intra-observer and inter-observer reliabilities and the comparison and relationship between anatomical and mechanical alignment were calculated. Results Intra- and interclass coefficients for all the three methods indicated excellent reliability, having all the values above 0.80. Using the mean of paired t-student test, the comparison of HKA versus TFa1 and TFa2 showed a statistically significant difference (p<.0001) both for the pre-operative and post-operative sets of values. The correlation between the HKA and FTal was found poor for the preoperative set (R=0.26) and fair for the postoperative one (R=0.53), while the new circles method showed a higher correlation in both the preoperative (R=0.71) and postoperative sets (R=0.79). Conclusions Intra-observer reliability was high for HKA, FTal and FTa2 on APWB x-rays in the pre- and post-operative setting. Inter-rater reliability was higher for HKA and TFa2 compared to FTal. The femoro-tibial angle as measured on APWB with the traditional method (FTal) has a weak correlation with the HKA, and based on these findings, should not be used in everyday practice. The FTa2 showed better correlation with the HKA, although not excellent Level of Evidence Level III, Retrospective study. PMID:26361444
Test-retest reliability of prefrontal transcranial Direct Current Stimulation (tDCS) effects on functional MRI connectivity in healthy subjects.

PubMed

Wörsching, Jana; Padberg, Frank; Helbich, Konstantin; Hasan, Alkomiet; Koch, Lena; Goerigk, Stephan; Stoecklein, Sophia; Ertl-Wagner, Birgit; Keeser, Daniel

2017-07-15

Transcranial Direct Current Stimulation (tDCS) of the prefrontal cortex (PFC) can be used for probing functional brain connectivity and meets general interest as novel therapeutic intervention in psychiatric and neurological disorders. Along with a more extensive use, it is important to understand the interplay between neural systems and stimulation protocols requiring basic methodological work. Here, we examined the test-retest (TRT) characteristics of tDCS-induced modulations in resting-state functional-connectivity MRI (RS fcMRI). Twenty healthy subjects received 20minutes of either active or sham tDCS of the dorsolateral PFC (2mA, anode over F3 and cathode over F4, international 10-20 system), preceded and ensued by a RS fcMRI (10minutes each). All subject underwent three tDCS sessions with one-week intervals in between. Effects of tDCS on RS fcMRI were determined at an individual as well as at a group level using both ROI-based and independent-component analyses (ICA). To evaluate the TRT reliability of individual active-tDCS and sham effects on RS fcMRI, voxel-wise intra-class correlation coefficients (ICC) of post-tDCS maps between testing sessions were calculated. For both approaches, results revealed low reliability of RS fcMRI after active tDCS (ICC (2,1) = -0.09 - 0.16). Reliability of RS fcMRI (baselines only) was low to moderate for ROI-derived (ICC (2,1) = 0.13 - 0.50) and low for ICA-derived connectivity (ICC (2,1) = 0.19 - 0.34). Thus, for ROI-based analyses, the distribution of voxel-wise ICC was shifted to lower TRT reliability after active, but not after sham tDCS, for which the distribution was similar to baseline. The intra-individual variation observed here resembles variability of tDCS effects in motor regions and may be one reason why in this study robust tDCS effects at a group level were missing. The data can be used for appropriately designing large scale studies investigating methodological issues such as sources of variability and localisation of tDCS effects. Copyright © 2017 Elsevier Inc. All rights reserved.
Study protocol of psychometric properties of the Spanish translation of a competence test in evidence based practice: the Fresno test.

PubMed

Argimon-Pallàs, Josep M; Flores-Mateo, Gemma; Jiménez-Villa, Josep; Pujol-Ribera, Enriqueta; Foz, Gonçal; Bundó-Vidiella, Magda; Juncosa, Sebastià; Fuentes-Bellido, Cruz M; Pérez-Rodríguez, Belén; Margalef-Pallarès, Francesc; Villafafila-Ferrero, Rosa; Forès-Garcia, Dolors; Roman-Martínez, Josep; Vilert-Garroga, Esther

2009-02-24

There are few high-quality instruments for evaluating the effectiveness of Evidence-Based Practice (EBP) curricula with objective outcomes measures. The Fresno test is an instrument that evaluates most of EBP steps with a high reliability and validity in the English original version. The present study has the aims to translate the Fresno questionnaire into Spanish and its subsequent validation to ensure the equivalence of the Spanish version against the English original. The questionnaire will be translated with the back translation technique and tested in Primary Care Teaching Units in Catalonia (PCTU). Participants will be: (a) tutors of Family Medicine residents (expert group); (b) Family Medicine residents in their second year of the Family Medicine training program (novice group), and (c) Family Medicine physicians (intermediate group). The questionnaire will be administered before and after an educational intervention. The educational intervention will be an interactive four half-day sessions designed to develop the knowledge and skills required to EBP. Responsiveness statistics used in the analysis will be the effect size, the standardised response mean and Guyatt's method. For internal consistency reliability, two measures will be used: corrected item-total correlations and Cronbach's alpha. Inter-rater reliability will be tested using Kappa coefficient for qualitative items and intra-class correlation coefficient for quantitative items and the overall score. Construct validity, item difficulty, item discrimination and feasibility will be determined. The validation of the Fresno questionnaire into different languages will enable the expansion of the questionnaire, as well as allowing comparison between countries and the evaluation of different teaching models.
Validation of the Spanish version of the Amsterdam Preoperative Anxiety and Information Scale (APAIS).

PubMed

Vergara-Romero, Manuel; Morales-Asencio, José Miguel; Morales-Fernández, Angelines; Canca-Sanchez, Jose Carlos; Rivas-Ruiz, Francisco; Reinaldo-Lapuerta, Jose Antonio

2017-06-07

Preoperative anxiety is a frequent and challenging problem with deleterious effects on the development of surgical procedures and postoperative outcomes. To prevent and treat preoperative anxiety effectively, the level of anxiety of patients needs to be assessed through valid and reliable measuring instruments. One such measurement tool is the Amsterdam Preoperative Anxiety and Information Scale (APAIS), of which a Spanish version has not been validated yet. To perform a Spanish cultural adaptation and empirical validation of the APAIS for assessing preoperative anxiety in the Spanish population. A two-step forward/back translation of the APAIS scale was performed to ensure a reliable Spanish cultural adaptation. The final Spanish version of the APAIS questionnaire was administered to 529 patients between the ages of 18 to 70 undergoing elective surgery at hospitals of the Agencia Sanitaria Costa del Sol (Spain). Cronbach's alpha, homogeneity index, intra-class correlation coefficient, and confirmatory factor analysis were calculated to assess internal consistency and criteria and construct validity. Confirmatory factor analysis showed that a one-factor model was better fitted than a two-factor model, with good fitting patterns (root mean square error of approximation: 0.05, normed-fit index: 0.99, goodness-of-fit statistic: 0.99). The questionnaire showed high internal consistency (Cronbach's alpha: 0.84) and a good correlation with the Goldberg Anxiety Scale (CCI: 0.62 (95% CI: 0.55 to 0.68). The Spanish version of the APAIS is a valid and reliable preoperative anxiety measurement tool and shows psychometric properties similar to those obtained by similar previous studies.
The Chinese version of the Child and Adolescent Scale of Environment (CASE-C): validity and reliability for children with disabilities in Taiwan.

PubMed

Kang, Lin-Ju; Yen, Chia-Feng; Bedell, Gary; Simeonsson, Rune J; Liou, Tsan-Hon; Chi, Wen-Chou; Liu, Shu-Wen; Liao, Hua-Fang; Hwang, Ai-Wen

2015-03-01

Measurement of children's participation and environmental factors is a key component of the assessment in the new Disability Evaluation System (DES) in Taiwan. The Child and Adolescent Scale of Environment (CASE) was translated into Traditional Chinese (CASE-C) and used for assessing environmental factors affecting the participation of children and youth with disabilities in the DES. The aim of this study was to validate the CASE-C. Participants were 614 children and youth aged 6.0-17.9 years with disabilities, with the largest condition group comprised of children with intellectual disability (61%). Internal structure, internal consistency, test-retest reliability, convergent validity, and discriminant (known group) validity were examined using exploratory factor analyses, Cronbach's α coefficient, intra-class correlation coefficients (ICC), correlation analyses, and univariate ANOVAs. A three-factor structure (Family/Community Resources, Assistance/Attitude Supports, and Physical Design Access) of the CASE-C was produced with 38% variance explained. The CASE-C had adequate internal consistency (Cronbach's α=.74-.86) and test-retest reliability (ICCs=.73-.90). Children and youth with disabilities who had higher levels of severity of impairment encountered more environmental barriers and those experiencing more environmental problems also had greater restrictions in participation. The CASE-C scores were found to distinguish children on the basis of disability condition and impairment severity, but not on the basis of age or sex. The CASE-C is valid for assessing environmental problems experienced by children and youth with disabilities in Taiwan. Copyright © 2014 Elsevier Ltd. All rights reserved.
Development and validation of a self-efficacy questionnaire (SE-12) measuring the clinical communication skills of health care professionals.

PubMed

Axboe, Mette K; Christensen, Kaj S; Kofoed, Poul-Erik; Ammentorp, Jette

2016-10-18

The outcome of communication training is widely measured by self-efficacy ratings, and different questionnaires have been used. Nevertheless, none of these questionnaires have been formally validated through systematic measurement of assessment properties. Consequently, we decided to further develop a self-efficacy questionnaire which has been used in previous studies. This study aims to examine the content, internal structure, and relations with other variables of the new version of the self-efficacy questionnaire (SE-12). The questionnaire was developed on the basis of the theoretical approach applied in the communication course, statements from former course participants, teachers, and experts in the field. The questionnaire was initially validated through face-to-face interviews with 9 staff members following a test-retest including 195 participants. After minor adjustments, the SE-12 questionnaire demonstrated evidence of content validity. An explorative factor analysis indicated unidimensionality with highly correlated items. A Cronbach's α of 0.95 and a Loevinger's H coefficient of 0.71 provided evidence of statistical reliability and scalability. The test-retest reliability had a value of 0.71 when evaluated using intra-class correlation. Expected relations with other variables were partially confirmed in two of three hypotheses, but a ceiling effect was present in 9 of 12 items. The SE-12 scale should be regarded a reliable and partially valid instrument. We consider the questionnaire useful for self-evaluation of clinical communication skills; the SE-12 is user-friendly and can be administered as an electronic questionnaire. However, future research should explore potential needs for adjustments to reduce the identified ceiling effect.
Measuring Food Brand Awareness in Australian Children: Development and Validation of a New Instrument.

PubMed

Turner, Laura; Kelly, Bridget; Boyland, Emma; Bauman, Adrian E

2015-01-01

Children's exposure to food marketing is one environmental determinant of childhood obesity. Measuring the extent to which children are aware of food brands may be one way to estimate relative prior exposures to food marketing. This study aimed to develop and validate an Australian Brand Awareness Instrument (ABAI) to estimate children's food brand awareness. The ABAI incorporated 30 flashcards depicting food/drink logos and their corresponding products. An abbreviated version was also created using 12 flashcards (ABAI-a). The ABAI was presented to 60 primary school aged children (7-11 yrs) attending two Australian after-school centres. A week later, the full-version was repeated on approximately half the sample (n=27) and the abbreviated-version was presented to the remaining half (n=30). The test-retest reliability of the ABAI was analysed using Intra-class correlation coefficients. The concordance of the ABAI-a and full-version was assessed using Bland-Altman plots. The 'nomological' validity of the full tool was investigated by comparing children's brand awareness with food marketing-related variables (e.g. television habits, intake of heavily promoted foods). Brand awareness increased with age (p<0.01) but was not significantly correlated with other variables. Bland-Altman analyses showed good agreement between the ABAI and ABAI-a. Reliability analyses revealed excellent agreement between the two administrations of the full-ABAI. The ABAI was able to differentiate children's varying levels of brand awareness. It was shown to be a valid and reliable tool and may allow quantification of brand awareness as a proxy measure for children's prior food marketing exposure.
Student performance of the general physical examination in internal medicine: an observational study.

PubMed

Haring, Catharina M; Cools, Bernadette M; van der Meer, Jos Wm; Postma, Cornelis T

2014-04-08

Many practicing physicians lack skills in physical examination. It is not known whether physical examination skills already show deficiencies after an early phase of clinical training. At the end of the internal medicine clerkship students are expected to be able to perform a general physical examination in every new patient encounter. In a previous study, the basic physical examination items that should standardly be performed were set by consensus. The aim of the current observational study was to assess whether medical students were able to correctly perform a general physical examination regarding completeness as well as technique at the end of the clerkship internal medicine. One hundred students who had just finished their clerkship internal medicine were asked to perform a general physical examination on a standardized patient as they had learned during the clerkship. They were recorded on camera. Frequency of performance of each component of the physical examination was counted. Adequacy of performance was determined as either correct or incorrect or not assessable using a checklist of short descriptions of each physical examination component. A reliability analysis was performed by calculation of the intra class correlation coefficient for total scores of five physical examinations rated by three trained physicians and for their agreement on performance of all items. Approximately 40% of the agreed standard physical examination items were not performed by the students. Students put the most emphasis on examination of general parameters, heart, lungs and abdomen. Many components of the physical examination were not performed as was taught during precourses. Intra-class correlation was high for total scores of the physical examinations 0.91 (p <0.001) and for agreement on performance of the five physical examinations (0.79-0.92 p <0.001). In conclusion, performance of the general physical examination was already below expectation at the end of the internal medicine clerkship. Possible causes and suggestions for improvement are discussed.
Student performance of the general physical examination in internal medicine: an observational study

PubMed Central

2014-01-01

Background Many practicing physicians lack skills in physical examination. It is not known whether physical examination skills already show deficiencies after an early phase of clinical training. At the end of the internal medicine clerkship students are expected to be able to perform a general physical examination in every new patient encounter. In a previous study, the basic physical examination items that should standardly be performed were set by consensus. The aim of the current observational study was to assess whether medical students were able to correctly perform a general physical examination regarding completeness as well as technique at the end of the clerkship internal medicine. Methods One hundred students who had just finished their clerkship internal medicine were asked to perform a general physical examination on a standardized patient as they had learned during the clerkship. They were recorded on camera. Frequency of performance of each component of the physical examination was counted. Adequacy of performance was determined as either correct or incorrect or not assessable using a checklist of short descriptions of each physical examination component. A reliability analysis was performed by calculation of the intra class correlation coefficient for total scores of five physical examinations rated by three trained physicians and for their agreement on performance of all items. Results Approximately 40% of the agreed standard physical examination items were not performed by the students. Students put the most emphasis on examination of general parameters, heart, lungs and abdomen. Many components of the physical examination were not performed as was taught during precourses. Intra-class correlation was high for total scores of the physical examinations 0.91 (p <0.001) and for agreement on performance of the five physical examinations (0.79-0.92 p <0.001). Conclusions In conclusion, performance of the general physical examination was already below expectation at the end of the internal medicine clerkship. Possible causes and suggestions for improvement are discussed. PMID:24712683
The intra-individual reproducibility of flash-evoked potentials in a sample of children.

PubMed

Schellberg, D; Gasser, T; Köhler, W

1987-07-01

Visual evoked potentials (VEPs) to flash stimuli were recorded twice from 26 children aged 10-13 years, with an intersession interval of about 10 months. Test-retest reliability was poor for recordings taken from scalp locations overlying non-specific cortex and somewhat better for specific cortex. The size of consistency coefficients (i.e. correlations within session) showed that noise and artefacts were not the decisive factors which lower reliability. A comparison with retest correlations of broad band parameters of the EEG at rest for the same sample showed, to our surprise, smaller retest reliability for VEP parameters. Variability of the VEP in children over time seems to be a substantial as its well-known inter-individual variability.
Reproducibility of repeated measurements with the Kikuhime pressure sensor under pressure garments in burn scar treatment.

PubMed

Van den Kerckhove, Eric; Fieuws, Steffen; Massagé, Patrick; Hierner, Robert; Boeckx, Willy; Deleuze, Jean-Paul; Laperre, Jan; Anthonissen, Mieke

2007-08-01

This study investigated the reproducibility of repeated measurements with the Kikuhime pressure sensor under two different types of pressure garments used in the treatment and prevention of scars after burns. Also efficiency of garments was assessed in clinical circumstances by assessing pressure loss and residual pressure after 1 month. Intra- and inter-observer reproducibility and repeated measurements with 1-month time lapse were examined on 55 sites in 26 subjects by means of intra-class correlation coefficients and standard error of measurements. Results showed good to excellent ICC and low SEMs in the two conditions. There was a significant difference in pressure after 1 month between elastic tricot and weft knit garments, although evolution of pressure loss after 1 month was similar. Concerning different locations, there was a significant difference in pressure loss after 1 month between gloves and sleeves with the largest pressure loss for sleeves. Considering these results we concluded that the Kikuhime pressure sensor provides valid and reliable information and can be used in comparative clinical trials to evaluate pressure garments used in burn scar treatment. Secondly, elastic tricot garments in our study tended to have higher clinical pressures but both types of garments had similar pressure loss over time.
Validity and Reliability of the 30-s Continuous Jump for Anaerobic Power and Capacity Assessment in Combat Sport

PubMed Central

Čular, Drazen; Ivančev, Vladimir; Zagatto, Alessandro M.; Milić, Mirjana; Beslija, Tea; Sellami, Maha; Padulo, Johnny

2018-01-01

Cycling test such Wingate anaerobic test (WAnT) is used to measure anaerobic power (AP), but not anaerobic capacity (AC, i.e., the metabolic energy demand). However, in sports that do not involve cycling movements (Karate), the continuous jump for 30 s (vertical jumps for 30 s) has been extensively used to measure anaerobic performance in all young athletes. Limited information’s are available concerning its validity and reliability especially in children. As such, the current study aimed to test validity and reliability of a continuous jumps test (the CJ30s), using WAnT as a reference. Thirteen female Karate kids (age: 11.07 ± 1.32 years; mass: 41.76 ± 15.32 kg; height: 152 ± 11.52 cm; training experience: 4.38 ± 2.14 years) were tested on three separate sessions. The first and second sessions were used to assess the reliability using Intra-class correlation coefficient (ICC) of CJ30s, whereas on the third session WAnT was administered. Following CJ30s and WAnT, we assessed AP (1/CJ30s, as jump height [JH], fatigue index [FI], and blood lactate [BL]; 2/WAnT, as mechanical power [P], FI, and BL) and AC as the excess post-exercise oxygen consumption (EPOC). Large/highly significant correlations were found between CJ30s and WAnT EPOCs (r = 0.730, P = 0.003), and BLs (r = 0.713, P = 0.009). Moderate/significant correlations were found between CJ30s and WAnT FIs (r = 0.640, P = 0.014), CJ30s first four jumps mean JH and WAnT peak P (r = 0.572, P = 0.032), and CJ30s mean JH and WAnT mean P (r = 0.589, P = 0.021). CJ30s showed excellent and moderate reliability (ICC) for AP (maximal JH 0.884, mean JH 0.742, FI 0.657, BL 0.653) and AC (EPOC 0.788), respectively. Correlations observed especially in terms of AC between CJ30s and WAnT provide evidence that former may adequately assess anaerobic performance for the young combat athlete. CJ30 is a reliable test and allow an easy assessment of AP and AC in karate children. PMID:29867580
Cross-cultural adaptation and validation of the Korean version of the EQ-5D in patients with rheumatic diseases.

PubMed

Kim, Myoung-Hee; Cho, Young-Shin; Uhm, Wan-Sik; Kim, Sehyun; Bae, Sang-Cheol

2005-06-01

This study aimed to determine the cross-cultural adaptation and validation of the Korean version of the EQ-5D in rheumatic conditions. Translation, back-translation and cognitive debriefing were performed according to the EuroQol group's guidelines. For validity, 508 patients were recruited and administered the EQ-5D, Short-Form 36 and condition-specific measures. Construct validity and sensitivity were evaluated by testing a-priori hypotheses. For reliability, another 57 patients repeated the EQ-5D at 1-week interval, and intra-class correlations (ICC) and kappa statistics were estimated. For responsiveness, another 60 patients repeated it at 12-week interval within the context of clinical trial, and standardized response mean(SRM) were calculated. The cross-cultural adaptation produced no major modifications in the scale. The associations of the EQ-5D with the generic- and condition-specific measures were observed as expected in hypotheses: the higher EQ-5Dindex and EQ-5D(VAS) scores, the better health status by generic- or condition-specific measures, and the better functional class. The ICCs were 0.751 and 0.767, respectively, and kappa ranged from 0.455 to 0.772. The SRM were 0.649 and 0.410, respectively. The Korean EQ-5D exhibits good validity and sensitivity in various rheumatic conditions. Although its reliability and responsiveness were not excellent, it seems acceptable if condition-specific measures are applied together.
Factors affecting unsafe behavior in construction projects: development and validation of a new questionnaire.

PubMed

Asilian-Mahabadi, Hassan; Khosravi, Yahya; Hassanzadeh-Rangi, Narmin; Hajizadeh, Ebrahim; Behzadan, Amir H

2018-02-05

Occupational safety in general, and construction safety in particular, is a complex phenomenon. This study was designed to develop a new valid measure to evaluate factors affecting unsafe behavior in the construction industry. A new questionnaire was generated from qualitative research according to the principles of grounded theory. Key measurement properties (face validity, content validity, construct validity, reliability and discriminative validity) were examined using qualitative and quantitative approaches. The receiver operating characteristic curve was used to estimate the discriminating power and the optimal cutoff score. Construct validity revealed an interpretable 12-factor structure which explained 61.87% of variance. Good internal consistency (Cronbach's α = 0.94) and stability (intra-class correlation coefficient = 0.93) were found for the new instrument. The area under the curve, sensitivity and specificity were 0.80, 0.80 and 0.75, respectively. The new instrument also discriminated safety performance among the construction sites with different workers' accident histories (F = 6.40, p < 0.05). The new instrument appears to be a valid, reliable and sensitive instrument that will contribute to investigating the root causes of workers' unsafe behaviors, thus promoting safety performance in the construction industry.
Focused physician-performed echocardiography in sports medicine: a potential screening tool for detecting aortic root dilatation in athletes.

PubMed

Yim, Eugene S; Kao, Daniel; Gillis, Edward F; Basilico, Frederick C; Corrado, Gianmichael D

2013-12-01

The purpose of this study was to investigate whether sports medicine physicians can obtain accurate measurements of the aortic root in young athletes. Twenty male collegiate athletes, aged 18 to 21 years, were prospectively enrolled. Focused echocardiography was performed by a board-certified sports medicine physician and a medical student, followed by comprehensive echocardiography within 2 weeks by a cardiac sonographer. A left parasternal long-axis view was acquired to measure the aortic root diameter at the sinuses of Valsalva. Intraclass correlation coefficients (ICCs) were used to assess inter-rater reliability compared to a reference standard and intra-rater reliability of repeated measurements obtained by the sports medicine physician and medical student. The ICCs between the sports medicine physician and cardiac sonographer and between the medical student and cardiac sonographer were strong: 0.80 and 0.76, respectively. Across all 3 readers, the ICC was 0.89, indicating strong inter-rater reliability and concordance. The ICC for the 2 measurements taken by the sports medicine physician for each athlete was 0.75, indicating strong intra-rater reliability. The medical student had moderate intra-rater reliability, with an ICC of 0.59. Sports medicine physicians are able to obtain measurements of the aortic root by focused echocardiography that are consistent with those obtained by a cardiac sonographer. Focused physician-performed echocardiography may serve as a promising technique for detecting aortic root dilatation and may contribute in this manner to preparticipation cardiovascular screening for athletes.
Salivary Cortisol Protocol Adherence and Reliability by Sociodemographic Features: the Multi-Ethnic Study of Atherosclerosis

PubMed Central

Golden, Sherita Hill; Sánchez, Brisa N.; DeSantis, Amy S.; Wu, Meihua; Castro, Cecilia; Seeman, Teresa E.; Tadros, Sameh; Shrager, Sandi; Diez Roux, Ana V.

2014-01-01

Collection of salivary cortisol has become increasingly popular in large population-based studies. However, the impact of protocol compliance on day-to-day reliabilities of measures, and the extent to which reliabilities differ systematically according to socio-demographic characteristics, has not been well characterized in large-scale population-based studies to date. Using data on 935 men and women from the Multi-ethnic Study of Atherosclerosis, we investigated whether sampling protocol compliance differs systematically according to socio-demographic factors and whether compliance was associated with cortisol estimates, as well as whether associations of cortisol with both compliance and socio-demographic characteristics were robust to adjustments for one another. We further assessed the day-to-day reliability for cortisol features and the extent to which reliabilities vary according to socio-demographic factors and sampling protocol compliance. Overall, we found higher compliance among persons with higher levels of income and education. Lower compliance was significantly associated with a less pronounced cortisol awakening response (CAR) but was not associated with any other cortisol features, and adjustment for compliance did not affect associations of socio-demographic characteristics with cortisol. Reliability was higher for area under the curve (AUC) and wake up values than for other features, but generally did not vary according to socio-demographic characteristics, with few exceptions. Our findings regarding intra-class correlation coefficients (ICCs) support prior research indicating that multiple day collection is preferable to single day collection, particularly for CAR and slopes, more so than wakeup and AUC. There were few differences in reliability by socio-demographic characteristics. Thus, it is unlikely that group-specific sampling protocols are warranted. PMID:24703168
Test-retest reliability and construct validity of the ENERGY-parent questionnaire on parenting practices, energy balance-related behaviours and their potential behavioural determinants: the ENERGY-project.

PubMed

Singh, Amika S; Chinapaw, Mai J M; Uijtdewilligen, Léonie; Vik, Froydis N; van Lippevelde, Wendy; Fernández-Alvira, Juan M; Stomfai, Sarolta; Manios, Yannis; van der Sluijs, Maria; Terwee, Caroline; Brug, Johannes

2012-08-13

Insight in parental energy balance-related behaviours, their determinants and parenting practices are important to inform childhood obesity prevention. Therefore, reliable and valid tools to measure these variables in large-scale population research are needed. The objective of the current study was to examine the test-retest reliability and construct validity of the parent questionnaire used in the ENERGY-project, assessing parental energy balance-related behaviours, their determinants, and parenting practices among parents of 10-12 year old children. We collected data among parents (n = 316 in the test-retest reliability study; n = 109 in the construct validity study) of 10-12 year-old children in six European countries, i.e. Belgium, Greece, Hungary, the Netherlands, Norway, and Spain. Test-retest reliability was assessed using the intra-class correlation coefficient (ICC) and percentage agreement comparing scores from two measurements, administered one week apart. To assess construct validity, the agreement between questionnaire responses and a subsequent interview was assessed using ICC and percentage agreement.All but one item showed good to excellent test-retest reliability as indicated by ICCs > .60 or percentage agreement ≥ 75%. Construct validity appeared to be good to excellent for 92 out of 121 items, as indicated by ICCs > .60 or percentage agreement ≥ 75%. From the other 29 items, construct validity was moderate for 24 and poor for 5 items. The reliability and construct validity of the items of the ENERGY-parent questionnaire on multiple energy balance-related behaviours, their potential determinants, and parenting practices appears to be good. Based on the results of the validity study, we strongly recommend adapting parts of the ENERGY-parent questionnaire if used in future research.
Reliability of Phase Velocity Measurements of Flexural Acoustic Waves in the Human Tibia In-Vivo.

PubMed

Vogl, Florian; Schnüriger, Karin; Gerber, Hans; Taylor, William R

2016-01-01

Axial-transmission acoustics have shown to be a promising technique to measure individual bone properties and detect bone pathologies. With the ultimate goal being the in-vivo application of such systems, quantification of the key aspects governing the reliability is crucial to bring this method towards clinical use. This work presents a systematic reliability study quantifying the sources of variability and their magnitudes of in-vivo measurements using axial-transmission acoustics. 42 healthy subjects were measured by an experienced operator twice per week, over a four-month period, resulting in over 150000 wave measurements. In a complementary study to assess the influence of different operators performing the measurements, 10 novice operators were trained, and each measured 5 subjects on a single occasion, using the same measurement protocol as in the first part of the study. The estimated standard error for the measurement protocol used to collect the study data was ∼ 17 m/s (∼ 4% of the grand mean) and the index of dependability, as a measure of reliability, was Φ = 0.81. It was shown that the method is suitable for multi-operator use and that the reliability can be improved efficiently by additional measurements with device repositioning, while additional measurements without repositioning cannot improve the reliability substantially. Phase velocity values were found to be significantly higher in males than in females (p < 10-5) and an intra-class correlation coefficient of r = 0.70 was found between the legs of each subject. The high reliability of this non-invasive approach and its intrinsic sensitivity to mechanical properties opens perspectives for the rapid and inexpensive clinical assessment of bone pathologies, as well as for monitoring programmes without any radiation exposure for the patient.
Allied health clinicians using translational research in action to develop a reliable stroke audit tool.

PubMed

Abery, Philip; Kuys, Suzanne; Lynch, Mary; Low Choy, Nancy

2018-05-23

To design and establish reliability of a local stroke audit tool by engaging allied health clinicians within a privately funded hospital. Design: Two-stage study involving a modified Delphi process to inform stroke audit tool development and inter-tester reliability. Allied health clinicians. A modified Delphi process to select stroke guideline recommendations for inclusion in the audit tool. Reliability study: 1 allied health representative from each discipline audited 10 clinical records with sequential admissions to acute and rehabilitation services. Recommendations were admitted to the audit tool when 70% agreement was reached, with 50% set as the reserve agreement. Inter-tester reliability was determined using intra-class correlation coefficients (ICCs) across 10 clinical records. Twenty-two participants (92% female, 50% physiotherapists, 17% occupational therapists) completed the modified Delphi process. Across 6 voting rounds, 8 recommendations reached 70% agreement and 2 reached 50% agreement. Two recommendations (nutrition/hydration; goal setting) were added to ensure representation for all disciplines. Substantial consistency across raters was established for the audit tool applied in acute stroke (ICC .71; range .48 to .90) and rehabilitation (ICC.78; range .60 to .93) services. Allied health clinicians within a privately funded hospital generally agreed in an audit process to develop a reliable stroke audit tool. Allied health clinicians agreed on stroke guideline recommendations to inform a stroke audit tool. The stroke audit tool demonstrated substantial consistency supporting future use for service development. This process, which engages local clinicians, could be adopted by other facilities to design reliable audit tools to identify local service gaps to inform changes to clinical practice. © 2018 John Wiley & Sons, Ltd.
The test-retest reliability and minimal detectable change of spatial and temporal gait variability during usual over-ground walking for younger and older adults.

PubMed

Almarwani, Maha; Perera, Subashan; VanSwearingen, Jessie M; Sparto, Patrick J; Brach, Jennifer S

2016-02-01

Gait variability is a marker of gait performance and future mobility status in older adults. Reliability of gait variability has been examined mainly in community dwelling older adults who are likely to fluctuate over time. The purpose of this study was to compare test-retest reliability and determine minimal detectable change (MDC) of spatial and temporal gait variability in younger and older adults. Forty younger (mean age=26.6 ± 6.0 years) and 46 older adults (mean age=78.1 ± 6.2 years) were included in the study. Gait characteristics were measured twice, approximately 1 week apart, using a computerized walkway (GaitMat II). Participants completed 4 passes on the GaitMat II at their self-selected walking speed. Test-retest reliability was calculated using Intra-class correlation coefficients (ICCs(2,1)), 95% limits of agreement (95% LoA) in conjunction with Bland-Altman plots, relative limits of agreement (LoA%) and standard error of measurement (SEM). The MDC at 90% and 95% level were also calculated. ICCs of gait variability ranged 0.26-0.65 in younger and 0.28-0.74 in older adults. The LoA% and SEM were consistently higher (i.e. less reliable) for all gait variables in older compared to younger adults except SEM for step width. The MDC was consistently larger for all gait variables in older compared to younger adults except step width. ICCs were of limited utility due to restricted ranges in younger adults. Based on absolute reliability measures and MDC, younger had greater test-retest reliability and smaller MDC of spatial and temporal gait variability compared to older adults. Copyright © 2015 Elsevier B.V. All rights reserved.

Depth as an Organizing Force in Pocillopora damicornis: Intra-Reef Genetic Architecture

PubMed Central

Gorospe, Kelvin D.; Karl, Stephen A.

2015-01-01

Relative to terrestrial plants, and despite similarities in life history characteristics, the potential for corals to exhibit intra-reef local adaptation in the form of genetic differentiation along an environmental gradient has received little attention. The potential for natural selection to act on such small scales is likely increased by the ability of coral larval dispersal and settlement to be influenced by environmental cues. Here, we combine genetic, spatial, and environmental data for a single patch reef in Kāne‘ohe Bay, O‘ahu, Hawai‘i, USA in a landscape genetics framework to uncover environmental drivers of intra-reef genetic structuring. The genetic dataset consists of near-exhaustive sampling (n = 2352) of the coral, Pocillopora damicornis at our study site and six microsatellite genotypes. In addition, three environmental parameters – depth and two depth-independent temperature indices – were collected on a 4 m grid across 85 locations throughout the reef. We use ordinary kriging to spatially interpolate our environmental data and estimate the three environmental parameters for each colony. Partial Mantel tests indicate a significant correlation between genetic relatedness and depth while controlling for space. These results are also supported by multi-model inference. Furthermore, spatial Principle Component Analysis indicates a statistically significant genetic cline along a depth gradient. Binning the genetic dataset based on size-class revealed that the correlation between genetic relatedness and depth was significant for new recruits and increased for larger size classes, suggesting a possible role of larval habitat selection as well as selective mortality in structuring intra-reef genetic diversity. That both pre- and post-recruitment processes may be involved points to the adaptive role of larval habitat selection in increasing adult survival. The conservation importance of uncovering intra-reef patterns of genetic diversity is discussed. PMID:25806798
Poor visualization limits diagnosis of proximal junctional kyphosis in adolescent idiopathic scoliosis.

PubMed

Basques, Bryce A; Long, William D; Golinvaux, Nicholas S; Bohl, Daniel D; Samuel, Andre M; Lukasiewicz, Adam M; Webb, Matthew L; Grauer, Jonathan N

2017-06-01

Multiple methods are used to measure proximal junctional angle (PJA) and diagnose proximal junctional kyphosis (PJK) after fusion for adolescent idiopathic scoliosis (AIS); however, there is no gold standard. Previous studies using the three most common measurement methods, upper-instrumented vertebra (UIV)+1, UIV+2, and UIV to T2, have minimized the difficulty in obtaining these measurements, and often exclude patients for which measurements cannot be recorded. The purpose of this study is to assess the technical feasibility of measuring PJA and PJK in a series of AIS patients who have undergone posterior instrumented fusion and to assess the variability in results depending on the measurement technique used. A retrospective cohort study was carried out. There were 460 radiographs from 98 patients with AIS who underwent posterior spinal fusion at a single institution from 2006 through 2012. The outcomes for this study were the ability to obtain a PJA measurement for each method, the ability to diagnose PJK, and the inter- and intra-rater reliability of these measurements. Proximal junctional angle was determined by measuring the sagittal Cobb angle on preoperative and postoperative lateral upright films using the three most common methods (UIV+1, UIV+2, and UIV to T2). The ability to obtain a PJA measurement, the ability to assess PJK, and the total number of patients with a PJK diagnosis were tabulated for each method based on established definitions. Intra- and inter-rater reliability of each measurement method was assessed using intra-class correlation coefficients (ICCs). A total of 460 radiographs from 98 patients were evaluated. The average number of radiographs per patient was 5.3±1.7 (mean±standard deviation), with an average follow-up of 2.1 years (780±562 days). A PJA measurement was only readable on 13%-18% of preoperative filmsand 31%-49% of postoperative films (range based on measurement technique). Only 12%-31% of films were able to be assessed for PJK based on established definitions. The rate of PJK diagnosis ranged from 1% to 29%. Of these diagnoses, 21%-100% disappeared on at least one subsequent film for the given patient. ICC ranges for intra-rater and inter-rater reliability were 0.730-0.799 and 0.794-0.836, respectively. This study suggests significant limitations of the three most common methods of measuring and diagnosing PJK. The results of studies using these methods can be significantly affected based on the exclusion of patients for whom measurements cannot be made and choice of measurement technique. Copyright © 2015 Elsevier Inc. All rights reserved.
Evaluation of the Walking Index for Spinal Cord Injury II (WISCI-II) in children with Spinal Cord Injury (SCI).

PubMed

Calhoun Thielen, C; Sadowsky, C; Vogel, L C; Taylor, H; Davidson, L; Bultman, J; Gaughan, J; Mulcahey, M J

2017-05-01

Mixed methods were used in this study. The appropriateness of the levels of the Walking Index for Spinal Cord Injury II (WISCI-II) for application in children was critically reviewed by physical therapists using the Modified Delphi Technique, and the inter- and intra-rater reliability of the WISCI-II in children was evaluated. To examine the construct validity, and to establish reliability of the WISCI-II related to its use in children with spinal cord injury (SCI). United States of America. Using a Modified Delphi Technique, physical therapists critically reviewed the WISCI-II levels for pediatric utilization. Concurrently, ambulatory children under age 18 years with SCI were evaluated using the WISCI-II on two occasions by the same therapist to establish intra-rater reliability. One trial was photographed and de-identified. Each photograph was reviewed by four different physical therapists who gave WISCI-II scores to establish inter-rater reliability. Summary and descriptive statistics were used to calculate the frequency of yes/no responses for each WISCI-II level question and to determine the percent agreement for each question. Inter- and intra-rater reliability was calculated using interclass correlation coefficients (ICCs) with 95% confidence intervals (CI). Construct validity was confirmed after one Delphi round during which at least 80% agreement was established by 51 physical therapists on the appropriateness of the WISCI-II levels for children. Fifty-two children with SCI aged 2-17 years completed repeated WISCI-II assessments and 40 de-identified photographs were scored by four physical therapists. Intra- and inter-rater reliability was high (ICC=0.997, CI=0.995-0.998 and ICC=0.97, CI=0.95-0.98, respectively). This study demonstrates support for the use of the WISCI-II in ambulatory children with SCI. This study was funded by the Craig H Neilsen Foundation, Spinal Cord Injury Research on the Translation Spectrum, Senior Research Award #282592 (Mulcahey, PI).
Patient reported outcomes in GNE myopathy: incorporating a valid assessment of physical function in a rare disease.

PubMed

Slota, Christina; Bevans, Margaret; Yang, Li; Shrader, Joseph; Joe, Galen; Carrillo, Nuria

2018-05-01

The aim of this analysis was to evaluate the psychometric properties of three patient reported outcome (PRO) measures characterizing physical function in GNE myopathy: the Human Activity Profile, the Inclusion Body Myositis Functional Rating Scale, and the Activities-specific Balance Confidence scale. This analysis used data from 35 GNE myopathy subjects participating in a natural history study. For construct validity, correlational and known-group analyses were between the PROs and physical assessments. Reliability of the PROs between baseline and 6 months was evaluated using the intra-class correlation coefficient model; internal consistency was tested with Cronbach's alpha. The hypothesized moderate positive correlations for construct validity were supported; the strongest correlation was between the human activity profile adjusted activity score and the adult myopathy assessment endurance subscale score (r = 0.81; p < 0.0001). The PROs were able to discriminate between known high and low functioning groups for the adult myopathy assessment tool. Internal consistency of the PROs was high (α > 0.8) and there was strong reliability (ICC >0.62). The PROs are valid and reliable measures of physical function in GNE myopathy and should be incorporated in investigations to better understand the impact of progressive muscle weakness on physical function in this rare disease population. Implications for Rehabilitation GNE myopathy is a rare muscle disease that results in slow progressive muscle atrophy and weakness, ultimately leading to wheelchair use and dependence on a caregiver. There is limited knowledge on the impact of this disease on the health-related quality of life, specifically physical function, of this rare disease population. Three patient reported outcomes have been shown to be valid and reliable in GNE myopathy subjects and should be incorporated in future investigations to better understand how progressive muscle weakness impacts physical functions in this rare disease population. The patient reported outcome scores of GNE myopathy patients indicate a high risk for falls and impaired physical functioning, so it is important clinicians assess and provide interventions for these subjects to maintain their functional capacity.
Validation of the UCLA Scleroderma Clinical Trial Consortium Gastrointestinal Tract Instrument 2.0 in English- and Chinese-speaking patients in a multi-ethnic Singapore systemic sclerosis cohort.

PubMed

Low, Andrea Hsiu Ling; Xin, Xiaohui; Law, Weng Giap; Teng, Gim Gee; Santosa, Amelia; Lim, Anita; Chan, Grace; Ng, Swee Cheng; Thumboo, Julian

2017-07-01

The aim of this study was to (1) translate the Gastrointestinal Tract Instrument (GIT) 2.0 from English to Chinese and (2) validate both versions in a multi-ethnic systemic sclerosis cohort in Singapore (SCORE). The English GIT2.0 was translated to Chinese using a standard forward-backward translation approach. Psychometric evaluation of the GIT2.0 included internal consistency reliability (using Cronbach's alpha), test-retest reliability (using intra-class correlation coefficient (ICC)), scale level factor analysis, and construct validity (using Spearman correlation) against the modified Scleroderma Health Assessment Questionnaire (S-HAQ) and the SF-36 v2. Most of the patients were females (88.6%) and Chinese (78.2%), with mean (SD) age of 51.0 (13.0) years and median disease duration of 4.5 years. We administered English (n = 146) and Chinese (n = 74) GIT2.0. The mean (SD) total GIT score was 0.29 (0.37). There was good internal consistency (Cronbach's alpha >0.70 for all subscales) and good test-retest reliability for the scale and all subscales (ICC 0.71-0.92) except for "diarrhoea" (ICC = 0.54). Our hypothesised a priori construct validity was supported by moderate correlations between the total GIT score and S-HAQ GI subscale (r = 0.446), and the social functioning subscale and SF36v2 role-social domain (r = 0.337), and weak-to-moderate correlation between the emotional subscale and SF-36v2 role-emotional (r = 0.295) and mental health (r = 0.298) domains and mental component summary (r = 0.356). Exploratory factor analysis of the seven subscales yielded a two-factor solution explaining 69.63% of the total variance. This study provides evidence for the reliability and validity of the English and Chinese GIT2.0 to be used in Singapore for research and routine practice.
Isolated glenohumeral range of motion, excluding side-to-side difference in humeral retroversion, in asymptomatic high-school baseball players.

PubMed

Mihata, Teruhisa; Takeda, Atsushi; Kawakami, Takeshi; Itami, Yasuo; Watanabe, Chisato; Doi, Munekazu; Neo, Masashi

2016-06-01

Glenohumeral range of motion is correlated with shoulder capsular condition and is thus considered to be predictive of shoulder pathology. However, in throwing athletes, a side-to-side difference in humeral retroversion makes it difficult to evaluate capsular condition on the basis of glenohumeral range of motion measured by using the conventional technique. The purpose of this study was to measure isolated glenohumeral rotation, excluding side-to-side differences in humeral retroversion, in asymptomatic high-school baseball players. A total of 195 high-school baseball players (52 pitchers and 143 position players; median age, 16 years) and 20 high-school non-throwing athletes (median age, 16 years) without any shoulder symptoms were enroled in this study. Glenohumeral external and internal rotations were measured by using both a conventional technique and our ultrasound-assisted technique. This technique, neutral rotation, was standardized on the basis of the ultrasonographically visualized location of the bicipital groove to exclude side-to-side differences in humeral retroversion from the calculated rotation angle. Intra- and inter-observer agreements of rotational measurements were evaluated by using intra-class correlation coefficients (ICCs). Isolated glenohumeral rotation measurements, excluding side-to-side differences in humeral retroversion, demonstrated excellent intra-observer (ICC > 0.89) and inter-observer (ICC > 0.78) agreements. Isolated glenohumeral internal rotation was significantly less in the dominant shoulder than in the non-dominant shoulder in asymptomatic baseball players (P < 0.001). Isolated glenohumeral external rotation in baseball players was significantly greater than in non-throwing athletes (P < 0.05). In the baseball players, humeral torsion in the dominant shoulder was significantly greater than that in the non-dominant shoulder (P < 0.001), indicating that the retroversion angle was greater in dominant shoulders than in non-dominant shoulders. Isolated glenohumeral external and internal rotations can be measured with high intra- and inter-observer reliability with the exclusion of side-to-side differences in humeral retroversion. Capsular and muscular changes in the throwing shoulder may be better evaluated by using our ultrasound-assisted technique. Cross-sectional study, Level III.
Reliability of lower limb alignment measures using an established landmark-based method with a customized computer software program

PubMed Central

Sled, Elizabeth A.; Sheehy, Lisa M.; Felson, David T.; Costigan, Patrick A.; Lam, Miu; Cooke, T. Derek V.

2010-01-01

The objective of the study was to evaluate the reliability of frontal plane lower limb alignment measures using a landmark-based method by (1) comparing inter- and intra-reader reliability between measurements of alignment obtained manually with those using a computer program, and (2) determining inter- and intra-reader reliability of computer-assisted alignment measures from full-limb radiographs. An established method for measuring alignment was used, involving selection of 10 femoral and tibial bone landmarks. 1) To compare manual and computer methods, we used digital images and matching paper copies of five alignment patterns simulating healthy and malaligned limbs drawn using AutoCAD. Seven readers were trained in each system. Paper copies were measured manually and repeat measurements were performed daily for 3 days, followed by a similar routine with the digital images using the computer. 2) To examine the reliability of computer-assisted measures from full-limb radiographs, 100 images (200 limbs) were selected as a random sample from 1,500 full-limb digital radiographs which were part of the Multicenter Osteoarthritis (MOST) Study. Three trained readers used the software program to measure alignment twice from the batch of 100 images, with two or more weeks between batch handling. Manual and computer measures of alignment showed excellent agreement (intraclass correlations [ICCs] 0.977 – 0.999 for computer analysis; 0.820 – 0.995 for manual measures). The computer program applied to full-limb radiographs produced alignment measurements with high inter- and intra-reader reliability (ICCs 0.839 – 0.998). In conclusion, alignment measures using a bone landmark-based approach and a computer program were highly reliable between multiple readers. PMID:19882339
RELIABILITY CONCERNS IN THE REPEATED COMPUTERIZED ASSESSMENT OF ATTENTION IN CHILDREN

PubMed Central

Zabel, T. Andrew; von Thomsen, Christian; Cole, Carolyn; Martin, Rebecca; Mahone, E. Mark

2010-01-01

Assessment of attentional processes via computerized assessment is frequently used to quantify intra-individual cognitive improvement or decline in response to treatment. However, assessment of intra-individual change is highly dependent on sufficient test reliability. We examined the test–retest reliability of selected variables from one popular computerized continuous performance test (CPT)—i.e., the Conners’ CPT – Second Edition (CPT-II). Participants were 39 healthy children (20 girls) ages 6–18 without intellectual impairment (mean PPVT-III SS = 102.6), LD, or psychiatric disorders (DICA-IV). Test–retest reliability over the 3–8 month interval (mean = 6 months) was acceptable (Intraclass Correlations [ICC] = .82 to .92) on comparison measures (Beery Test of Visual Perception, WISC-IV Block Design, PPVT-III). In contrast, test–retest reliability was only modest for CPT-II raw scores (ICCs ranging from .62 to .82) and T-scores (ICCs ranging from .33 to .65) for variables of interest (Omissions, Commissions, Variability, Hit Reaction Time, and Attentiveness). Using test–retest reliability information published in the CPT-II manual, 90% confidence intervals based on reliable change index (RCI) methodology were constructed to examine the significance of test–retest difference/change scores. Of the participants in this sample of typically developing youth, 30% generated intra-individual changes in T-scores on the Omissions and Attentiveness variables that exceeded the 90% confidence intervals and qualified as “statistically rare” changes in score. These results suggest a considerable degree of normal variability in CPT-II test scores over extended test–retest intervals, and suggest a need for caution when interpreting test score changes in neurologically unstable clinical populations. PMID:19452302
Gait Deviation Index, Gait Profile Score and Gait Variable Score in children with spastic cerebral palsy: Intra-rater reliability and agreement across two repeated sessions.

PubMed

Rasmussen, Helle Mätzke; Nielsen, Dennis Brandborg; Pedersen, Niels Wisbech; Overgaard, Søren; Holsgaard-Larsen, Anders

2015-07-01

The Gait Deviation Index (GDI) and Gait Profile Score (GPS) are the most used summary measures of gait in children with cerebral palsy (CP). However, the reliability and agreement of these indices have not been investigated, limiting their clinimetric quality for research and clinical practice. The aim of this study was to investigate the intra-rater reliability and agreement of summary measures of gait (GDI; GPS; and the Gait Variable Score (GVS) derived from the GPS). The intra-rater reliability and agreement were investigated across two repeated sessions in 18 children aged 5-12 years diagnosed with spastic CP. No systematic bias was observed between the sessions and no heteroscedasticity was observed in Bland-Altman plots. For the GDI and GPS, excellent reliability with intraclass correlation coefficient (ICC) values of 0.8-0.9 was found, while the GVS was found to have fair to good reliability with ICCs of 0.4-0.7. The agreement for the GDI and the logarithmically transformed GPS, in terms of the standard error of measurement as a percentage of the grand mean (SEM%) varied from 4.1 to 6.7%, whilst the smallest detectable change in percent (SDC%) ranged from 11.3 to 18.5%. For the logarithmically transformed GVS, we found a fair to large variation in SEM% from 7 to 29% and in SDC% from 18 to 81%. The GDI and GPS demonstrated excellent reliability and acceptable agreement proving that they can both be used in research and clinical practice. However, the observed large variability for some of the GVS requires cautious consideration when selecting outcome measures. Copyright © 2015 Elsevier B.V. All rights reserved.
Concurrent validity and reliability of torso-worn inertial measurement unit for jump power and height estimation.

PubMed

Rantalainen, Timo; Gastin, Paul B; Spangler, Rhys; Wundersitz, Daniel

2018-09-01

The purpose of the present study was to evaluate the concurrent validity and test-retest repeatability of torso-worn IMU-derived power and jump height in a counter-movement jump test. Twenty-seven healthy recreationally active males (age, 21.9 [SD 2.0] y, height, 1.76 [0.7] m, mass, 73.7 [10.3] kg) wore an IMU and completed three counter-movement jumps a week apart. A force platform and a 3D motion analysis system were used to concurrently measure the jumps and subsequently derive power and jump height (based on take-off velocity and flight time). The IMU significantly overestimated power (mean difference = 7.3 W/kg; P < 0.001) compared to force-platform-derived power but good correspondence between methods was observed (Intra-class correlation coefficient [ICC] = 0.69). IMU-derived power exhibited good reliability (ICC = 0.67). Velocity-derived jump heights exhibited poorer concurrent validity (ICC = 0.72 to 0.78) and repeatability (ICC = 0.68) than flight-time-derived jump heights, which exhibited excellent validity (ICC = 0.93 to 0.96) and reliability (ICC = 0.91). Since jump height and power are closely related, and flight-time-derived jump height exhibits excellent concurrent validity and reliability, flight-time-derived jump height could provide a more desirable measure compared to power when assessing athletic performance in a counter-movement jump with IMUs.
Is One Trial Sufficient to Obtain Excellent Pressure Pain Threshold Reliability in the Low Back of Asymptomatic Individuals? A Test-Retest Study.

PubMed

Balaguier, Romain; Madeleine, Pascal; Vuillerme, Nicolas

2016-01-01

The assessment of pressure pain threshold (PPT) provides a quantitative value related to the mechanical sensitivity to pain of deep structures. Although excellent reliability of PPT has been reported in numerous anatomical locations, its absolute and relative reliability in the lower back region remains to be determined. Because of the high prevalence of low back pain in the general population and because low back pain is one of the leading causes of disability in industrialized countries, assessing pressure pain thresholds over the low back is particularly of interest. The purpose of this study study was (1) to evaluate the intra- and inter- absolute and relative reliability of PPT within 14 locations covering the low back region of asymptomatic individuals and (2) to determine the number of trial required to ensure reliable PPT measurements. Fifteen asymptomatic subjects were included in this study. PPTs were assessed among 14 anatomical locations in the low back region over two sessions separated by one hour interval. For the two sessions, three PPT assessments were performed on each location. Reliability was assessed computing intraclass correlation coefficients (ICC), standard error of measurement (SEM) and minimum detectable change (MDC) for all possible combinations between trials and sessions. Bland-Altman plots were also generated to assess potential bias in the dataset. Relative reliability for both intra- and inter- session was almost perfect with ICC ranged from 0.85 to 0.99. With respect to the intra-session, no statistical difference was reported for ICCs and SEM regardless of the conducted comparisons between trials. Conversely, for inter-session, ICCs and SEM values were significantly larger when two consecutive PPT measurements were used for data analysis. No significant difference was observed for the comparison between two consecutive measurements and three measurements. Excellent relative and absolute reliabilities were reported for both intra- and inter-session. Reliable measurements can be equally achieved when using the mean of two or three consecutive PPT measurements, as usually proposed in the literature, or with only the first one. Although reliability was almost perfect regardless of the conducted comparison between PPT assessments, our results suggest using two consecutive measurements to obtain higher short term absolute reliability.
Development of a questionnaire to evaluate asthma control in Japanese asthma patients.

PubMed

Tohda, Yuji; Hozawa, Soichiro; Tanaka, Hiroshi

2018-01-01

The asthma control questionnaires used in Japan are Japanese translations of those developed outside Japan, and have some limitations; a questionnaire designed to optimally evaluate asthma control levels for Japanese may be necessary. The present study was conducted to validate the Japan Asthma Control Survey (JACS) questionnaire in Japanese asthma patients. A total of 226 adult patients with mild to severe persistent asthma were enrolled and responded to the JACS questionnaire, asthma control questionnaire (ACQ), and Mini asthma quality of life questionnaire (Mini AQLQ) at Weeks 0 and 4. The reliability, validity, and sensitivity/responsiveness of the JACS questionnaire were evaluated. The intra-class correlation coefficients (ICCs) were within the range of 0.55-0.75 for all JACS scores, indicating moderate/substantial reproducibility. For internal consistency, Cronbach's alpha coefficients ranged from 0.76 to 0.92 in total and subscale scores, which were greater than the lower limit of internal consistency. As for factor validity, the cumulative contribution ratio of four main factors was 0.66. For criterion-related validity, the correlation coefficients between the JACS total score and ACQ5, ACQ6, and Mini AQLQ scores were -0.78, -0.78, and 0.77, respectively, showing a significant correlation (p < 0.0001). The JACS questionnaire was validated in terms of reliability and validity. It will be necessary to evaluate the therapeutic efficacy measured by the JACS questionnaire and calculate cutoff values for the asthma control status in a higher number of patients. UMIN000016589. Copyright © 2017 Japanese Society of Allergology. Production and hosting by Elsevier B.V. All rights reserved.
Clinical significance of plasma apolipoprotein F in Japanese healthy and hypertriglyceridemic subjects.

PubMed

Kujiraoka, Takeshi; Nakamoto, Takaaki; Sugimura, Hiroyuki; Iwasaki, Tadao; Ishihara, Mitsuaki; Hoshi, Toshiyasu; Horie, Yasuto; Ogawa, Kazuyuki; Todoroki, Masakatsu; Nakatani, Yuki; Banba, Nobuyuki; Yasu, Takanori; Hattori, Hiroaki

2013-01-01

Apolipoprotein F (apo F), also known as lipid transfer inhibitory protein (LTIP), is a protein component of plasma lipoprotein classes including HDL and functions to inhibit lipid transfer between lipoproteins in vitro. To study the role of plasma apo F, a reliable and sensitive tool for the quantification would be needed. We have developed a sandwich ELISA using two monoclonal antibodies for human plasma apo F, and analyzed apo F concentration in 397 Japanese healthy and 221 hypertriglyceridemic subjects. Our ELISA enables apo F to be assayed in the range of 0.6-25 µg/mL with intra- and inter-assay coefficients of variation less than 3.8% and 7.8%, respectively. In healthy subjects, plasma apo F concentration was 12.5±2.9 µg/mL (mean±SD), and was significantly higher in females than in males (p<0.05). By linear regression analysis in healthy subjects, plasma apo F concentration correlated positively with HDL cholesterol and apo A-I levels, and in males but not in females, negatively with apo B and triglyceride levels. It also correlated negatively with intrinsic CETP activity measured using intrinsic apo B-containing lipoprotein as an acceptor, and positively with PLTP mass and apo J levels. Apo F concentration in hypertriglyceridemic patients (10.3±3.1 µg/mL) was lower than in healthy controls (p<0.0001) and correlated positively with PLTP mass. Our ELISA is reliable and sensitive for the quantification of plasma apo F concentration. This system can be applicable for clinical significance in lipoprotein metabolism and reverse cholesterol transport.
Validation of World Health Organization Assessment Schedule 2.0 in specialized somatic rehabilitation services in Norway.

PubMed

Moen, Vegard Pihl; Drageset, Jorunn; Eide, Geir Egil; Klokkerud, Mari; Gjesdal, Sturla

2017-02-01

The World Health Organization Disability Assessment Schedule (WHODAS) 2.0 is a generic instrument to assess disability covering six domains. The purpose of this study was to investigate the potential of the instrument for monitoring disability in specialized somatic rehabilitation by testing reliability, construct validity and responsiveness of WHODAS 2.0, Norwegian version, among patients with various health conditions. For taxonomy, terminology and definitions, the Consensus-based Standards for the Selection of Health Measurement Instruments were followed. Reproducibility was investigated by the intra-class correlation coefficient (ICC) in a randomly selected sample. Internal consistency was assessed by Cronbach's alpha. Construct validity was evaluated by correlations between WHODAS 2.0 and the Medical Outcomes Study 36-item Short Form, and fit of the hypothesized structure using confirmatory factor analysis (CFA). Responsiveness was evaluated in another randomly selected sample by testing a priori formulated hypotheses. Nine hundred seventy patients were included in the study. Reproducibility and responsiveness were evaluated in 53 and 104 patients, respectively. The ICC for the WHODAS 2.0 domains ranged from 0.63 to 0.84 and was 0.87 for total score. Cronbach's alpha for domains ranged from 0.75 to 0.94 and was 0.93 for total score. For construct validity, 6 of 12 expected correlations were confirmed and CFA did not achieve satisfactory fit indices. For responsiveness, 3 of 8 hypotheses were confirmed. The Norwegian version of WHODAS 2.0 showed moderate to satisfactory reliability and moderate validity in rehabilitation patients. However, the present study indicated possible limitations in terms of responsiveness.
EQ-5D-5L and SF-6D Utility Measures in Symptomatic benign Thyroid Nodules: Acceptability and Psychometric Evaluation.

PubMed

Wong, Carlos K H; Lang, Brian H H; Yu, Hill M S; Lam, Cindy L K

2017-08-01

The aim of this study was to examine the acceptability, validity, and reliability of the EuroQoL Five-Dimension Five-Level (EQ-5D-5L) and Short-Form Six-Dimension (SF-6D) health utility measures in patients with symptomatic benign thyroid nodules. Data from a randomized controlled trial (ClinicalTrials.gov identifier: NCT02398721) of 294 patients with symptomatic benign thyroid nodules were utilized for this psychometric evaluation of health-related quality of life (HR-QOL) measurement. Three HR-QOL questionnaires-the generic 12-item Short Form Health Survey (SF-12v2), EQ-5D-5L, and SF-6D-were interviewer-administered at baseline and 2 weeks afterwards. Responses to SF-6D were transformed to SF-6D utility scores using a Hong Kong population scoring algorithm derived by standard gamble, whereas responses to EQ-5D-5L were mapped onto EQ-5D-3L response via interim mapping algorithms and then converted to EQ-5D-5L utility scores using a Chinese-specific value set. Construct validity was determined by evaluating Spearman correlation between SF-12v2 scores and utility scores. Two-week test-retest reliability was assessed using intra-class correlation coefficient. No significant (>15%) floor and ceiling effects were observed for SF-6D utility scores. The SF-6D utility scores had a moderate Spearman rank correlation with the SF-12v2 domain score providing evidence for adequate construct validity. The SF-6D utility scores showed good test-retest reliability (0.794; range 0.696-0.860). Better reliability was observed in SF-6D utility scores than in EQ-5D-5L utility scores. While the EQ-5D-5L instrument was less reproducible, the SF-6D instrument appeared to be an applicable, valid, and reliable measure in assessing the HR-QOL of Chinese patients with symptomatic benign thyroid nodules. The impact of utility score selection on the effectiveness and cost effectiveness of clinical interventions targeted to these patients needs further exploration. NCT02398721, ClinicalTrials.gov.
Pitfalls and important issues in testing reliability using intraclass correlation coefficients in orthopaedic research.

PubMed

Lee, Kyoung Min; Lee, Jaebong; Chung, Chin Youb; Ahn, Soyeon; Sung, Ki Hyuk; Kim, Tae Won; Lee, Hui Jong; Park, Moon Seok

2012-06-01

Intra-class correlation coefficients (ICCs) provide a statistical means of testing the reliability. However, their interpretation is not well documented in the orthopedic field. The purpose of this study was to investigate the use of ICCs in the orthopedic literature and to demonstrate pitfalls regarding their use. First, orthopedic articles that used ICCs were retrieved from the Pubmed database, and journal demography, ICC models and concurrent statistics used were evaluated. Second, reliability test was performed on three common physical examinations in cerebral palsy, namely, the Thomas test, the Staheli test, and popliteal angle measurement. Thirty patients were assessed by three orthopedic surgeons to explore the statistical methods testing reliability. Third, the factors affecting the ICC values were examined by simulating the data sets based on the physical examination data where the ranges, slopes, and interobserver variability were modified. Of the 92 orthopedic articles identified, 58 articles (63%) did not clarify the ICC model used, and only 5 articles (5%) described all models, types, and measures. In reliability testing, although the popliteal angle showed a larger mean absolute difference than the Thomas test and the Staheli test, the ICC of popliteal angle was higher, which was believed to be contrary to the context of measurement. In addition, the ICC values were affected by the model, type, and measures used. In simulated data sets, the ICC showed higher values when the range of data sets were larger, the slopes of the data sets were parallel, and the interobserver variability was smaller. Care should be taken when interpreting the absolute ICC values, i.e., a higher ICC does not necessarily mean less variability because the ICC values can also be affected by various factors. The authors recommend that researchers clarify ICC models used and ICC values are interpreted in the context of measurement.
A comparative study of software programmes for cross-sectional skeletal muscle and adipose tissue measurements on abdominal computed tomography scans of rectal cancer patients.

PubMed

van Vugt, Jeroen L A; Levolger, Stef; Gharbharan, Arvind; Koek, Marcel; Niessen, Wiro J; Burger, Jacobus W A; Willemsen, Sten P; de Bruin, Ron W F; IJzermans, Jan N M

2017-04-01

The association between body composition (e.g. sarcopenia or visceral obesity) and treatment outcomes, such as survival, using single-slice computed tomography (CT)-based measurements has recently been studied in various patient groups. These studies have been conducted with different software programmes, each with their specific characteristics, of which the inter-observer, intra-observer, and inter-software correlation are unknown. Therefore, a comparative study was performed. Fifty abdominal CT scans were randomly selected from 50 different patients and independently assessed by two observers. Cross-sectional muscle area (CSMA, i.e. rectus abdominis, oblique and transverse abdominal muscles, paraspinal muscles, and the psoas muscle), visceral adipose tissue area (VAT), and subcutaneous adipose tissue area (SAT) were segmented by using standard Hounsfield unit ranges and computed for regions of interest. The inter-software, intra-observer, and inter-observer agreement for CSMA, VAT, and SAT measurements using FatSeg, OsiriX, ImageJ, and sliceOmatic were calculated using intra-class correlation coefficients (ICCs) and Bland-Altman analyses. Cohen's κ was calculated for the agreement of sarcopenia and visceral obesity assessment. The Jaccard similarity coefficient was used to compare the similarity and diversity of measurements. Bland-Altman analyses and ICC indicated that the CSMA, VAT, and SAT measurements between the different software programmes were highly comparable (ICC 0.979-1.000, P < 0.001). All programmes adequately distinguished between the presence or absence of sarcopenia (κ = 0.88-0.96 for one observer and all κ = 1.00 for all comparisons of the other observer) and visceral obesity (all κ = 1.00). Furthermore, excellent intra-observer (ICC 0.999-1.000, P < 0.001) and inter-observer (ICC 0.998-0.999, P < 0.001) agreement for all software programmes were found. Accordingly, excellent Jaccard similarity coefficients were found for all comparisons (mean ≥ 0.964). FatSeg, OsiriX, ImageJ, and sliceOmatic showed an excellent agreement for CSMA, VAT, and SAT measurements on abdominal CT scans. Furthermore, excellent inter-observer and intra-observer agreement were achieved. Therefore, results of studies using these different software programmes can reliably be compared. © 2016 The Authors. Journal of Cachexia, Sarcopenia and Muscle published by John Wiley & Sons Ltd on behalf of the Society on Sarcopenia, Cachexia and Wasting Disorders.
Joint mobilization forces and therapist reliability in subjects with knee osteoarthritis

PubMed Central

Tragord, Bradley S; Gill, Norman W; Silvernail, Jason L; Teyhen, Deydre S; Allison, Stephen C

2013-01-01

Objectives: This study determined biomechanical force parameters and reliability among clinicians performing knee joint mobilizations. Methods: Sixteen subjects with knee osteoarthritis and six therapists participated in the study. Forces were recorded using a capacitive-based pressure mat for three techniques at two grades of mobilization, each with two trials of 15 seconds. Dosage (force–time integral), amplitude, and frequency were also calculated. Analysis of variance was used to analyze grade differences, intraclass correlation coefficients determined reliability, and correlations assessed force associations with subject and rater variables. Results: Grade IV mobilizations produced higher mean forces (P<0.001) and higher dosage (P<0.001), while grade III produced higher maximum forces (P = 0.001). Grade III forces (Newtons) by technique (mean, maximum) were: extension 48, 81; flexion 41, 68; and medial glide 21, 34. Grade IV forces (Newtons) by technique (mean, maximum) were: extension 58, 78; flexion 44, 60; and medial glide 22, 30. Frequency (Hertz) ranged between 0.9–1.1 (grade III) and 1.4–1.6 (grade IV). Intra-clinician reliability was excellent (>0.90). Inter-clinician reliability was moderate for force and dosage, and poor for amplitude and frequency. Discussion: Force measurements were consistent with previously reported ranges and clinical constructs. Grade III and grade IV mobilizations can be distinguished from each other with differences for force and frequency being small, and dosage and amplitude being large. Intra-clinician reliability was excellent for all biomechanical parameters and inter-clinician reliability for dosage, the main variable of clinical interest, was moderate. This study quantified the applied forces among multiple clinicians, which may help determine optimal dosage and standardize care. PMID:24421632
Ultrasound-based motor control training for the pelvic floor pre- and post-prostatectomy: Scoring reliability and skill acquisition.

PubMed

Doorbar-Baptist, Stuart; Adams, Roger; Rebbeck, Trudy

2017-04-01

This study documents a protocol designed to evaluate pelvic floor motor control in men with prostate cancer. It also aims to evaluate the reliability of therapists in rating motor control of pelvic floor muscles (PFMs) using real time ultrasound imaging (RUSI) video clips. We further determine predictors of acquiring motor control. Ninety-one men diagnosed with prostate cancer attending a physiotherapy clinic for pelvic floor exercises were taught detailed pelvic floor motor control exercises by a physiotherapist using trans-abdominal RUSI for biofeedback. A new protocol to rate motor control skill acquisition was developed. Three independent physiotherapists assessed motor control skill attainment by viewing RUSI videos of the contractions. Inter-rater reliability was evaluated using intra-class correlation coefficients. Logistic regression analysis was conducted to identify predictors of successful skill attainment. Acquisition of the skill was compared between pre- and post-operative participants using an independent-group t-test. There was good reliability for rating the RUSI video clips (ICC 0.73 (95%CI 0.59-0.82)) for experienced therapists. Having low BMI and being seen pre-operatively predicted motor skill attainment, accounting for 46.3% of the variance. Significantly more patients trained pre-operatively acquired the skill of pelvic floor control compared with patients initially seen post-operatively (OR 11.87, 95%CI 1.4 to 99.5, p = 0.02). A new protocol to evaluate attainment of pelvic floor control in men with prostate cancer can be assessed reliably from RUSI images, and is most effectively delivered pre-operatively.
Reliability of reported breastfeeding duration among reproductive-aged women from Mexico

PubMed Central

Cupul-Uicab, Lea A.; Gladen, Beth C.; Hernández-Ávila, Mauricio; Longnecker, Matthew P.

2010-01-01

Breastfed children have lower risk of infectious diseases, post-neonatal mortality and chronic diseases later in life. Because epidemiologic studies usually rely on reported history of previous breastfeeding, data on the accuracy and precision of recalled histories allow improved interpretation of the epidemiologic findings. We evaluated the reliability of two reported breastfeeding durations in 567 reproductive-aged women from Mexico using information obtained from nearly identical sets of questions applied at different times after weaning. We compared differences between reports, and examined the intra-class correlation coefficient (ICC) for any and for exclusive breastfeeding (EBF). Logistic regression was used to evaluate the determinants of poor recall (difference between reports of >20%). The reliability of duration of any breastfeeding was high (ICC 0.94). Overall, differences between reports of duration were usually <1 month, and for 385/567, the difference was ≤0.5 months. Predictors of poorer recall were having ≥4 children, and time between reports of >2 months. The only predictor of better recall was greater age of the baby at weaning. The reliability of EBF duration was lower (ICC 0.49). In this population with a relatively long duration of breastfeeding, reliability of any breast-feeding duration was high. Age, education and previous breastfeeding were not important predictors of recall, in contrast to findings in earlier studies. Consistent with previous reports, however, parity and length of recall were associated with poorer recall of duration of any breastfeeding. Future studies that use reported breastfeeding duration may want to consider the effect of these variables on recall. PMID:19292747

Reliability of reflectance measures in passive filters

NASA Astrophysics Data System (ADS)

Saldiva de André, Carmen Diva; Afonso de André, Paulo; Rocha, Francisco Marcelo; Saldiva, Paulo Hilário Nascimento; Carvalho de Oliveira, Regiani; Singer, Julio M.

2014-08-01

Measurements of optical reflectance in passive filters impregnated with a reactive chemical solution may be transformed to ozone concentrations via a calibration curve and constitute a low cost alternative for environmental monitoring, mainly to estimate human exposure. Given the possibility of errors caused by exposure bias, it is common to consider sets of m filters exposed during a certain period to estimate the latent reflectance on n different sample occasions at a certain location. Mixed models with sample occasions as random effects are useful to analyze data obtained under such setups. The intra-class correlation coefficient of the mean of the m measurements is an indicator of the reliability of the latent reflectance estimates. Our objective is to determine m in order to obtain a pre-specified reliability of the estimates, taking possible outliers into account. To illustrate the procedure, we consider an experiment conducted at the Laboratory of Experimental Air Pollution, University of São Paulo, Brazil (LPAE/FMUSP), where sets of m = 3 filters were exposed during 7 days on n = 9 different occasions at a certain location. The results show that the reliability of the latent reflectance estimates for each occasion obtained under homoskedasticity is km = 0.74. A residual analysis suggests that the within-occasion variance for two of the occasions should be different from the others. A refined model with two within-occasion variance components was considered, yielding km = 0.56 for these occasions and km = 0.87 for the remaining ones. To guarantee that all estimates have a reliability of at least 80% we require measurements on m = 10 filters on each occasion.
Translation, reliability, and clinical utility of the Melbourne Assessment 2.

PubMed

Gerber, Corinna N; Plebani, Anael; Labruyère, Rob

2017-10-12

The aims were to (i) provide a German translation of the Melbourne Assessment 2 (MA2), a quantitative test to measure unilateral upper limb function in children with neurological disabilities and (ii) to evaluate its reliability and aspects of clinical utility. After its translation into German and approval of the back translation by the original authors, the MA2 was performed and videotaped twice with 30 children with neuromotor disorders. For each participant, two raters scored the video of the first test for inter-rater reliability. To determine test-retest reliability, one rater additionally scored the video of the second test while the other rater repeated the scoring of the first video to evaluate intra-rater reliability. Time needed for rater training, test administration, and scoring was recorded. The four subscale scores showed excellent intra-, inter-rater, and test-retest reliability with intraclass correlation coefficients of 0.90-1.00 (95%-confidence intervals 0.78-1.00). Score items revealed substantial to almost perfect intra-rater reliability (weighted kappa k w = 0.66-1.00) for the more affected side. Score item inter-rater and test-retest reliability of the same extremity were, with one exception, moderate to almost perfect (k w = 0.42-0.97; k w = 0.40-0.89). Furthermore, the MA2 was feasible and acceptable for patients and clinicians. The MA2 showed excellent subscale and moderate to almost perfect score item reliability. Implications for Rehabilitation There is a lack of high-quality studies about psychometric properties of upper limb measurement tools in the neuropediatric population. The Melbourne Assessment 2 is a promising tool for reliable measurement of unilateral upper limb movement quality in the neuropediatric population. The Melbourne Assessment 2 is acceptable and practicable to therapists and patients for routine use in clinical care.
How reliable are Functional Movement Screening scores? A systematic review of rater reliability.

PubMed

Moran, Robert W; Schneiders, Anthony G; Major, Katherine M; Sullivan, S John

2016-05-01

Several physical assessment protocols to identify intrinsic risk factors for injury aetiology related to movement quality have been described. The Functional Movement Screen (FMS) is a standardised, field-expedient test battery intended to assess movement quality and has been used clinically in preparticipation screening and in sports injury research. To critically appraise and summarise research investigating the reliability of scores obtained using the FMS battery. Systematic literature review. Systematic search of Google Scholar, Scopus (including ScienceDirect and PubMed), EBSCO (including Academic Search Complete, AMED, CINAHL, Health Source: Nursing/Academic Edition), MEDLINE and SPORTDiscus. Studies meeting eligibility criteria were assessed by 2 reviewers for risk of bias using the Quality Appraisal of Reliability Studies checklist. Overall quality of evidence was determined using van Tulder's levels of evidence approach. 12 studies were appraised. Overall, there was a 'moderate' level of evidence in favour of 'acceptable' (intraclass correlation coefficient ≥0.6) inter-rater and intra-rater reliability for composite scores derived from live scoring. For inter-rater reliability of composite scores derived from video recordings there was 'conflicting' evidence, and 'limited' evidence for intra-rater reliability. For inter-rater reliability based on live scoring of individual subtests there was 'moderate' evidence of 'acceptable' reliability (κ≥0.4) for 4 subtests (Deep Squat, Shoulder Mobility, Active Straight-leg Raise, Trunk Stability Push-up) and 'conflicting' evidence for the remaining 3 (Hurdle Step, In-line Lunge, Rotary Stability). This review found 'moderate' evidence that raters can achieve acceptable levels of inter-rater and intra-rater reliability of composite FMS scores when using live ratings. Overall, there were few high-quality studies, and the quality of several studies was impacted by poor study reporting particularly in relation to rater blinding. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/
Validity and reliability of balance assessment software using the Nintendo Wii balance board: usability and validation

PubMed Central

2014-01-01

Background A balance test provides important information such as the standard to judge an individual’s functional recovery or make the prediction of falls. The development of a tool for a balance test that is inexpensive and widely available is needed, especially in clinical settings. The Wii Balance Board (WBB) is designed to test balance, but there is little software used in balance tests, and there are few studies on reliability and validity. Thus, we developed a balance assessment software using the Nintendo Wii Balance Board, investigated its reliability and validity, and compared it with a laboratory-grade force platform. Methods Twenty healthy adults participated in our study. The participants participated in the test for inter-rater reliability, intra-rater reliability, and concurrent validity. The tests were performed with balance assessment software using the Nintendo Wii balance board and a laboratory-grade force platform. Data such as Center of Pressure (COP) path length and COP velocity were acquired from the assessment systems. The inter-rater reliability, the intra-rater reliability, and concurrent validity were analyzed by an intraclass correlation coefficient (ICC) value and a standard error of measurement (SEM). Results The inter-rater reliability (ICC: 0.89-0.79, SEM in path length: 7.14-1.90, SEM in velocity: 0.74-0.07), intra-rater reliability (ICC: 0.92-0.70, SEM in path length: 7.59-2.04, SEM in velocity: 0.80-0.07), and concurrent validity (ICC: 0.87-0.73, SEM in path length: 5.94-0.32, SEM in velocity: 0.62-0.08) were high in terms of COP path length and COP velocity. Conclusion The balance assessment software incorporating the Nintendo Wii balance board was used in our study and was found to be a reliable assessment device. In clinical settings, the device can be remarkably inexpensive, portable, and convenient for the balance assessment. PMID:24912769
Validity and reliability of balance assessment software using the Nintendo Wii balance board: usability and validation.

PubMed

Park, Dae-Sung; Lee, GyuChang

2014-06-10

A balance test provides important information such as the standard to judge an individual's functional recovery or make the prediction of falls. The development of a tool for a balance test that is inexpensive and widely available is needed, especially in clinical settings. The Wii Balance Board (WBB) is designed to test balance, but there is little software used in balance tests, and there are few studies on reliability and validity. Thus, we developed a balance assessment software using the Nintendo Wii Balance Board, investigated its reliability and validity, and compared it with a laboratory-grade force platform. Twenty healthy adults participated in our study. The participants participated in the test for inter-rater reliability, intra-rater reliability, and concurrent validity. The tests were performed with balance assessment software using the Nintendo Wii balance board and a laboratory-grade force platform. Data such as Center of Pressure (COP) path length and COP velocity were acquired from the assessment systems. The inter-rater reliability, the intra-rater reliability, and concurrent validity were analyzed by an intraclass correlation coefficient (ICC) value and a standard error of measurement (SEM). The inter-rater reliability (ICC: 0.89-0.79, SEM in path length: 7.14-1.90, SEM in velocity: 0.74-0.07), intra-rater reliability (ICC: 0.92-0.70, SEM in path length: 7.59-2.04, SEM in velocity: 0.80-0.07), and concurrent validity (ICC: 0.87-0.73, SEM in path length: 5.94-0.32, SEM in velocity: 0.62-0.08) were high in terms of COP path length and COP velocity. The balance assessment software incorporating the Nintendo Wii balance board was used in our study and was found to be a reliable assessment device. In clinical settings, the device can be remarkably inexpensive, portable, and convenient for the balance assessment.
In vivo quantification of amygdala subnuclei using 4.7 T fast spin echo imaging.

PubMed

Aghamohammadi-Sereshki, Arash; Huang, Yushan; Olsen, Fraser; Malykhin, Nikolai V

2018-04-15

The amygdala (AG) is an almond-shaped heterogeneous structure located in the medial temporal lobe. The majority of previous structural Magnetic Resonance Imaging (MRI) volumetric methods for AG measurement have so far only been able to examine this region as a whole. In order to understand the role of the AG in different neuropsychiatric disorders, it is necessary to understand the functional role of its subnuclei. The main goal of the present study was to develop a reliable volumetric method to delineate major AG subnuclei groups using ultra-high resolution high field MRI. 38 healthy volunteers (15 males and 23 females, 21-60 years of age) without any history of medical or neuropsychiatric disorders were recruited for this study. Structural MRI datasets were acquired at 4.7 T Varian Inova MRI system using a fast spin echo (FSE) sequence. The AG was manually segmented into its five major anatomical subdivisions: lateral (La), basal (B), accessory basal (AB) nuclei, and cortical (Co) and centromedial (CeM) groups. Inter-(intra-) rater reliability of our novel volumetric method was assessed using intra-class correlation coefficient (ICC) and Dice's Kappa. Our results suggest that reliable measurements of the AG subnuclei can be obtained by image analysts with experience in AG anatomy. We provided a step-by-step segmentation protocol and reported absolute and relative volumes for the AG subnuclei. Our results showed that the basolateral (BLA) complex occupies seventy-eight percent of the total AG volume, while CeM and Co groups occupy twenty-two percent of the total AG volume. Finally, we observed no hemispheric effects and no gender differences in the total AG volume and the volumes of its subnuclei. Future applications of this method will help to understand the selective vulnerability of the AG subnuclei in neurological and psychiatric disorders. Copyright © 2017 Elsevier Inc. All rights reserved.
Test-retest reliability of speech-evoked auditory brainstem response in healthy children at a low sensation level.

PubMed

Zakaria, Mohd Normani; Jalaei, Bahram

2017-11-01

Auditory brainstem responses evoked by complex stimuli such as speech syllables have been studied in normal subjects and subjects with compromised auditory functions. The stability of speech-evoked auditory brainstem response (speech-ABR) when tested over time has been reported but the literature is limited. The present study was carried out to determine the test-retest reliability of speech-ABR in healthy children at a low sensation level. Seventeen healthy children (6 boys, 11 girls) aged from 5 to 9 years (mean = 6.8 ± 3.3 years) were tested in two sessions separated by a 3-month period. The stimulus used was a 40-ms syllable /da/ presented at 30 dB sensation level. As revealed by pair t-test and intra-class correlation (ICC) analyses, peak latencies, peak amplitudes and composite onset measures of speech-ABR were found to be highly replicable. Compared to other parameters, higher ICC values were noted for peak latencies of speech-ABR. The present study was the first to report the test-retest reliability of speech-ABR recorded at low stimulation levels in healthy children. Due to its good stability, it can be used as an objective indicator for assessing the effectiveness of auditory rehabilitation in hearing-impaired children in future studies. Copyright © 2017 Elsevier B.V. All rights reserved.
Construction and validation of a questionnaire on the knowledge of healthy habits and risk factors for cardiovascular disease in schoolchildren.

PubMed

Cecchetto, Fátima H; Pellanda, Lucia C

2014-01-01

To develop and analyze the reliability and validity of a questionnaire on the knowledge of healthy habits and risk factors for cardiovascular disease (CARDIOKID) to be used in schoolchildren. The study included 145 children aged 7 to 11 years. The measured factors were the knowledge of healthy habits and risk factors for cardiovascular disease. Cronbach's alpha and intra-class correlation coefficient (ICC) were used to verify reliability, and exploratory factor analysis was used to assess the validity of the questionnaire. The sample consisted of 60% females and 40% males. In factorial analysis, the Kaiser-Meyer-Olkin (KMO) test result was measures of sampling adequacy (MSA)=0.81 and Bartlett's test of sphericity was X(2)=(66)=458.64 (p<0.001). In the factorial analysis with varimax rotation, two dimensions were defined. The "healthy habits" dimension was composed of five factors (ICC=0.87 and α=0.93) and the "cardiovascular risk factors" dimension was composed of seven factors (ICC=0.83 and α=0.91). In the individual factor analysis, Cronbach's alphas were between 0.93 and 0.91. Total variance was 46.87%. There were no significant differences between test and retest applications. The questionnaire presented satisfactory validity and reliability (internal consistency and reproducibility), allowing for its use in children. Copyright © 2014 Sociedade Brasileira de Pediatria. Published by Elsevier Editora Ltda. All rights reserved.
Adaptation and Validation of a Nutrition Environment Measures Survey for University Grab-and-Go Establishments.

PubMed

Lo, Brian K C; Minaker, Leia; Chan, Alicia N T; Hrgetic, Jessica; Mah, Catherine L

2016-03-01

To adapt and validate a survey instrument to assess the nutrition environment of grab-and-go establishments at a university campus. A version of the Nutrition Environment Measures Survey for grab-and-go establishments (NEMS-GG) was adapted from existing NEMS instruments and tested for reliability and validity through a cross-sectional assessment of the grab-and-go establishments at the University of Toronto. Product availability, price, and presence of nutrition information were evaluated. Cohen's kappa coefficient and intra-class correlation coefficients (ICC) were assessed for inter-rater reliability, and construct validity was assessed using the known-groups comparison method (via store scores). Fifteen grab-and-go establishments were assessed. Inter-rater reliability was high with an almost perfect agreement for availability (mean κ = 0.995) and store scores (ICC = 0.999). The tool demonstrated good face and construct validity. About half of the venues carried fruit and vegetables (46.7% and 53.3%, respectively). Regular and healthier entrée items were generally the same price. Healthier grains were cheaper than regular options. Six establishments displayed nutrition information. Establishments operated by the university's Food Services consistently scored the highest across all food premise types for nutrition signage, availability, and cost of healthier options. Health promotion strategies are needed to address availability and variety of healthier grab-and-go options in university settings.
A Compact Forearm Crutch Based on Force Sensors for Aided Gait: Reliability and Validity.

PubMed

Chamorro-Moriana, Gema; Sevillano, José Luis; Ridao-Fernández, Carmen

2016-06-21

Frequently, patients who suffer injuries in some lower member require forearm crutches in order to partially unload weight-bearing. These lesions cause pain in lower limb unloading and their progression should be controlled objectively to avoid significant errors in accuracy and, consequently, complications and after effects in lesions. The design of a new and feasible tool that allows us to control and improve the accuracy of loads exerted on crutches during aided gait is necessary, so as to unburden the lower limbs. In this paper, we describe such a system based on a force sensor, which we have named the GCH System 2.0. Furthermore, we determine the validity and reliability of measurements obtained using this tool via a comparison with the validated AMTI (Advanced Mechanical Technology, Inc., Watertown, MA, USA) OR6-7-2000 Platform. An intra-class correlation coefficient demonstrated excellent agreement between the AMTI Platform and the GCH System. A regression line to determine the predictive ability of the GCH system towards the AMTI Platform was found, which obtained a precision of 99.3%. A detailed statistical analysis is presented for all the measurements and also segregated for several requested loads on the crutches (10%, 25% and 50% of body weight). Our results show that our system, designed for assessing loads exerted by patients on forearm crutches during assisted gait, provides valid and reliable measurements of loads.
A Compact Forearm Crutch Based on Force Sensors for Aided Gait: Reliability and Validity

PubMed Central

Chamorro-Moriana, Gema; Sevillano, José Luis; Ridao-Fernández, Carmen

2016-01-01

Frequently, patients who suffer injuries in some lower member require forearm crutches in order to partially unload weight-bearing. These lesions cause pain in lower limb unloading and their progression should be controlled objectively to avoid significant errors in accuracy and, consequently, complications and after effects in lesions. The design of a new and feasible tool that allows us to control and improve the accuracy of loads exerted on crutches during aided gait is necessary, so as to unburden the lower limbs. In this paper, we describe such a system based on a force sensor, which we have named the GCH System 2.0. Furthermore, we determine the validity and reliability of measurements obtained using this tool via a comparison with the validated AMTI (Advanced Mechanical Technology, Inc., Watertown, MA, USA) OR6-7-2000 Platform. An intra-class correlation coefficient demonstrated excellent agreement between the AMTI Platform and the GCH System. A regression line to determine the predictive ability of the GCH system towards the AMTI Platform was found, which obtained a precision of 99.3%. A detailed statistical analysis is presented for all the measurements and also segregated for several requested loads on the crutches (10%, 25% and 50% of body weight). Our results show that our system, designed for assessing loads exerted by patients on forearm crutches during assisted gait, provides valid and reliable measurements of loads. PMID:27338396
Ecologically relevant outcome measure for post-inpatient rehabilitation.

PubMed

Marquez de la Plata, Carlos; Qualls, Devin; Plenger, Patrick; Malec, James F; Hayden, Mary Ellen

2017-01-01

Transfer of skills learned within the clinic environment to patients' home or community is important in post-inpatient brain injury rehabilitation (PBIR). Outcome measures used in PBIR assess level of independence during functional tasks; however, available functional instruments do not quantitate the environment in which the behaviors occur. To examine the reliability and validity of an instrument used to assess patients' functional abilities while quantifying the amount of structure and distractions in the environment. 2501 patients who sustained a traumatic brain injury (TBI) or cerebrovascular accident (CVA) and participated in a multidisciplinary PBIR program between 2006 and 2014 were identified retrospectively for this study. The PERPOS and MPAI-4 were used to assess functional abilities at admission and at discharge. Construct validity was assessed using a bivariate Spearman rho analysis A subsample of 56 consecutive admissions during 2014 were examined to determine inter-rater reliability. Intra-class correlation coefficient (ICC) and Kappa coefficients assessed inter-rater agreement of the total PERPOS and PERPOS subscales respectively. The PERPOS and MPAI-4 demonstrated a strong negative association among both TBI and CVA patients. Kappa scores for the three PERPOS scales each demonstrated good to excellent inter-rater agreement. The ICC for overall PERPOS scores fell in the good agreement range. The PERPOS can be used reliably in PBIR to quantify patients' functional abilities within the context of environmental demands.
Measurement of compartment elasticity using pressure related ultrasound: a method to identify patients with potential compartment syndrome.

PubMed

Sellei, R M; Hingmann, S J; Kobbe, P; Weber, C; Grice, J E; Zimmerman, F; Jeromin, S; Gansslen, A; Hildebrand, F; Pape, H C

2015-01-01

PURPOSE OF THE STUDY Decision-making in treatment of an acute compartment syndrome is based on clinical assessment, supported by invasive monitoring. Thus, evolving compartment syndrome may require repeated pressure measurements. In suspected cases of potential compartment syndromes clinical assessment alone seems to be unreliable. The objective of this study was to investigate the feasibility of a non-invasive application estimating whole compartmental elasticity by ultrasound, which may improve accuracy of diagnostics. MATERIAL AND METHODS In an in-vitro model, using an artificial container simulating dimensions of the human anterior tibial compartment, intracompartmental pressures (p) were raised subsequently up to 80 mm Hg by infusion of saline solution. The compartmental depth (mm) in the cross-section view was measured before and after manual probe compression (100 mm Hg) upon the surface resulting in a linear compartmental displacement (Δd). This was repeated at rising compartmental pressures. The resulting displacements were related to the corresponding intra-compartmental pressures simulated in our model. A hypothesized relationship between pressures related compartmental displacement and the elasticity at elevated compartment pressures was investigated. RESULTS With rising compartmental pressures, a non-linear, reciprocal proportional relation between the displacement (mm) and the intra-compartmental pressure (mm Hg) occurred. The Pearson's coefficient showed a high correlation (r2 = -0.960). The intraobserver reliability value kappa resulted in a statistically high reliability (κ = 0.840). The inter-observer value indicated a fair reliability (κ = 0.640). CONCLUSIONS Our model reveals that a strong correlation between compartmental strain displacements assessed by ultrasound and the intra-compartmental pressure changes occurs. Further studies are required to prove whether this assessment is transferable to human muscle tissue. Determining the complete compartmental elasticity by ultrasound enhancement, this application may improve detection of early signs of potential compartment syndrome. Key words: compartment syndrome, intra-compartmental pressure, non-invasive diagnostic, elasticity measurement, elastography.
Development of an Encompassing Questionnaire for Evaluating the Outcomes Following Total Knee Arthroplasty.

PubMed

Chughtai, Morad; Khlopas, Anton; Thomas, Melbin; Gwam, Chukwuweike U; Jauregui, Julio J; Elmallah, Randa K; Roche, Martin; Delanois, Ronald E

2017-01-10

There are many standardized scales and questionnaires used to evaluate TKA patients; however, individually they do not always assess patients adequately. Consequently, many are used in combinations to provide a thorough evaluation. However, this leads to redundancy, confusion, and an excessive patient time-burden. Therefore, the purpose of this study was to develop a usable combined knee questionnaire that combines questions in a non-redundant manner. Specifically, we aimed to: 1) create a combined knee questionnaire that encompasses questions from multiple systems, while eliminating redundancy; 2) correlate the new system with the existing validated questionnaires; and 3) determine the length of time it takes to administer this new questionnaire. In a previous study, it was determined that the six most commonly cited validated systems to assess the knee were the: Knee Society Score (KSS), The Western Ontario and McMaster Universities Arthritis Index (WOMAC), Knee injury and Osteoarthritis Outcome Score (KOOS), Lower Extremity Functional Scale (LEFS), Activity Rating Scale (ARS), and Short-Form-36 (SF-36). Therefore, we ensured that the new questionnaire encompassed all elements of these systems. After development of the combined questionnaire, we co-administered it to 20 subjects alongside the above validated questionnaires. We then transposed the corresponding answers from the combined questionnaire to each selected validated system to perform an intra-class correlation analysis. In addition, we recorded the length of time it took to administer the new questionnaire and compared it to the time it took to administer the individual validated questionnaires. Intra-class correlation analysis demonstrated statistically significant positive correlations between the KSS, WOMAC, KOOS, LEFS, ARS, SF-36, and the corresponding questions in the combined questionnaire. The mean length of time it took to administer the combined questionnaire (mean, 10.1 minutes, range, 6.6 to 12.6 minutes) was significantly shorter than the time it took to administer the selected validated questionnaires (mean, 21.3 minutes, range, 17.3 to 24.1 minutes). We have proposed an all-encompassing combined knee questionnaire that eliminates redundancy and inefficiency during the evaluation of TKA patients. It is a reliable, time-efficient system that can be utilized to fill out the most commonly used questionnaires for assessing TKA. Standardization and uniform use of this questionnaire may simplify future patient assessment following TKA.
Automatic target recognition using a feature-based optical neural network

NASA Technical Reports Server (NTRS)

Chao, Tien-Hsin

1992-01-01

An optical neural network based upon the Neocognitron paradigm (K. Fukushima et al. 1983) is introduced. A novel aspect of the architectural design is shift-invariant multichannel Fourier optical correlation within each processing layer. Multilayer processing is achieved by iteratively feeding back the output of the feature correlator to the input spatial light modulator and updating the Fourier filters. By training the neural net with characteristic features extracted from the target images, successful pattern recognition with intra-class fault tolerance and inter-class discrimination is achieved. A detailed system description is provided. Experimental demonstration of a two-layer neural network for space objects discrimination is also presented.
Reproducibility of thoracic kyphosis measurements in patients with adolescent idiopathic scoliosis.

PubMed

Ohrt-Nissen, Søren; Cheung, Jason Pui Yin; Hallager, Dennis Winge; Gehrchen, Martin; Kwan, Kenny; Dahl, Benny; Cheung, Kenneth M C; Samartzis, Dino

2017-01-01

Current surgical treatment for adolescent idiopathic scoliosis (AIS) involves correction in both the coronal and sagittal plane, and thorough assessment of these parameters is essential for evaluation of surgical results. However, various definitions of thoracic kyphosis (TK) have been proposed, and the intra- and inter-rater reproducibility of these measures has not been determined. As such, the purpose of the current study was to determine the intra- and inter-rater reproducibility of several TK measurements used in the assessment of AIS. Twenty patients (90% females) surgically treated for AIS with alternate-level pedicle screw fixation were included in the study. Three raters independently evaluated pre- and postoperative standing lateral plain radiographs. For each radiograph, several definitions of TK were measured as well as L1-S1 and nonfixed lumbar lordosis. All variables were measured twice 14 days apart, and a mixed effects model was used to determine the repeatability coefficient (RC), which is a measure of the agreement between repeated measurements. Also, the intra- and inter-rater intra-class correlation coefficient (ICC) was determined as a measure of reliability. Preoperative median Cobb angle was 58° (range 41°-86°), and median surgical curve correction was 68% (range 49-87%). Overall intra-rater RC was highest for T2-T12 and nonfixed TK (11°) and lowest for T4-T12 and T5-T12 (8°). Inter-rater RC was highest for T1-T12, T1-nonfixed, and nonfixed TK (13°) and lowest for T5-T12 (9°). Agreement varied substantially between pre- and postoperative radiographs. Inter-rater ICC was highest for T4-T12 (0.92; 95% CI 0.88-0.95) and T5-T12 (0.92; 95% CI 0.88-0.95) and lowest for T1-nonfixed (0.80; 95% CI 0.72-0.88). Considerable variation for all TK measurements was noted. Intra- and inter-rater reproducibility was best for T4-T12 and T5-T12. Future studies should consider adopting a relevant minimum difference as a limit for true change in TK.
The reliability of a segmentation methodology for assessing intramuscular adipose tissue and other soft-tissue compartments of lower leg MRI images.

PubMed

Karampatos, Sarah; Papaioannou, Alexandra; Beattie, Karen A; Maly, Monica R; Chan, Adrian; Adachi, Jonathan D; Pritchard, Janet M

2016-04-01

Determine the reliability of a magnetic resonance (MR) image segmentation protocol for quantifying intramuscular adipose tissue (IntraMAT), subcutaneous adipose tissue, total muscle and intermuscular adipose tissue (InterMAT) of the lower leg. Ten axial lower leg MRI slices were obtained from 21 postmenopausal women using a 1 Tesla peripheral MRI system. Images were analyzed using sliceOmatic™ software. The average cross-sectional areas of the tissues were computed for the ten slices. Intra-rater and inter-rater reliability were determined and expressed as the standard error of measurement (SEM) (absolute reliability) and intraclass coefficient (ICC) (relative reliability). Intra-rater and inter-rater reliability for IntraMAT were 0.991 (95% confidence interval [CI] 0.978-0.996, p < 0.05) and 0.983 (95% CI 0.958-9.993, p < 0.05), respectively. For the other soft tissue compartments, the ICCs were all >0.90 (p < 0.05). The absolute intra-rater and inter-rater reliability (expressed as SEM) for segmenting IntraMAT were 22.19 mm(2) (95% CI 16.97-32.04) and 78.89 mm(2) (95% CI 60.36-113.92), respectively. This is a reliable segmentation protocol for quantifying IntraMAT and other soft-tissue compartments of the lower leg. A standard operating procedure manual is provided to assist users, and SEM values can be used to estimate sample size and determine confidence in repeated measurements in future research.
Diagnosis of long head of biceps tendinopathy in rotator cuff tear patients: correlation of imaging and arthroscopy data.

PubMed

Rol, Morgane; Favard, Luc; Berhouet, Julien

2018-06-01

The goal of this prospective study was to assess the reliability of pre-operative cross-sectional imaging for the diagnosis of long head of biceps (LHB) tendinopathy in patients with a rotator cuff tear. Cross-sectional imaging with MRI or CT arthrography data from 25 patients operated upon because of a rotator cuff tear between 1 October 2015 and 1 April 2016 was analysed by one experienced orthopaedic surgeon, one experienced radiologist and one orthopaedic resident. The analysis consisted of determining whether the LHB was present, the extrinsic tendon abnormalities (dislocation, tendon coverage) and intrinsic abnormalities (fraying, inflammation, degeneration). These findings were then compared to intra-operative arthroscopy findings, which were used as the benchmark. The interobserver correlation between the three different examiners for the cross-sectional imaging analysis as well as the correlation between the imaging and arthroscopy data were determined. The correlation between the imaging and arthroscopy data was the highest (80%) for the determination of LHB dislocation from the bicipital groove. The other diagnostic elements (subluxation, coverage and tendon degeneration) were difficult to discern with preoperative imaging, and correlated poorly with the arthroscopy findings (45% to 65%). The interobserver correlation was moderate to strong for the diagnosis of extrinsic tendon abnormalities. It was low to moderate for intrinsic abnormalities. Except for LHB dislocation, pre-operative imaging is not sufficient to make a reliable diagnosis of LHB tendinopathy. Arthroscopy remains the gold standard for the management of LHB tendinopathy, as diagnosed intra-operatively.
Intra- and interobserver reliability of quantitative ultrasound measurement of the plantar fascia.

PubMed

Rathleff, Michael Skovdal; Moelgaard, Carsten; Lykkegaard Olesen, Jens

2011-01-01

To determine intra- and interobserver reliability and measurement precision of sonographic assessment of plantar fascia thickness when using one, the mean of two, or the mean of three measurements. Two experienced observers scanned 20 healthy subjects twice with 60 minutes between test and retest. A GE LOGIQe ultrasound scanner was used in the study. The built-in software in the scanner was used to measure the thickness of the plantar fascia (PF). Reliability was calculated using intraclass correlation coefficient (ICC) and limits of agreement (LOA). Intraobserver reliability (ICC) using one measurement was 0.50 for one observer and 0.52 for the other, and using the mean of three measurements intraobserver reliability increased up to 0.77 and 0.67, respectively. Interobserver reliability (ICC) when using one measurement was 0.62 and increased to 0.82 when using the average of three measurements. LOA showed that when using the average of three measurements, LOA decreased to 0.6 mm, corresponding to 17.5% of the mean thickness of the PF. The results showed that reliability increases when using the mean of three measurements compared with one. Limits of agreement based on intratester reliability shows that changes in thickness that are larger than 0.6 mm can be considered actual changes in thickness and not a result of measurement error. Copyright © 2011 Wiley Periodicals, Inc.
Reliability and validity of the upper-body dressing scale in Japanese patients with vascular dementia with hemiparesis.

PubMed

Endo, Arisa; Suzuki, Makoto; Akagi, Atsumi; Chiba, Naoyuki; Ishizaka, Ikuyo; Matsunaga, Atsuhiko; Fukuda, Michinari

2015-03-01

The purpose of this study was to examine the reliability and validity of the Upper-body Dressing Scale (UBDS) for buttoned shirt dressing, which evaluates the learning process of new component actions of upper-body dressing in patients diagnosed with dementia and hemiparesis. This was a preliminary correlational study of concurrent validity and reliability in which 10 vascular dementia patients with hemiparesis were enrolled and assessed repeatedly by six occupational therapists by means of the UBDS and the dressing item of the Functional Independence Measure (FIM). Intraclass correlation coefficient was 0.97 for intra-rater reliability and 0.99 for inter-rater reliability. The level of correlation between UBDS score and FIM dressing item scores was -0.93. UBDS scores for paralytic hand passed into the sleeve and sleeve pulled up beyond the shoulder joint were worse than the scores for the other components of the task. The UBDS has good reliability and validity for vascular dementia patients with hemiparesis. Further research is needed to investigate the relation between UBDS score and the effect of intervention and to clarify sensitivity or responsiveness of the scale to clinical change. Copyright © 2014 John Wiley & Sons, Ltd.

The use of portable 2D echocardiography and 'frame-based' bubble counting as a tool to evaluate diving decompression stress.

PubMed

Germonpré, Peter; Papadopoulou, Virginie; Hemelryck, Walter; Obeid, Georges; Lafère, Pierre; Eckersley, Robert J; Tang, Meng-Xing; Balestra, Costantino

2014-03-01

'Decompression stress' is commonly evaluated by scoring circulating bubble numbers post dive using Doppler or cardiac echography. This information may be used to develop safer decompression algorithms, assuming that the lower the numbers of venous gas emboli (VGE) observed post dive, the lower the statistical risk of decompression sickness (DCS). Current echocardiographic evaluation of VGE, using the Eftedal and Brubakk method, has some disadvantages as it is less well suited for large-scale evaluation of recreational diving profiles. We propose and validate a new 'frame-based' VGE-counting method which offers a continuous scale of measurement. Nine 'raters' of varying familiarity with echocardiography were asked to grade 20 echocardiograph recordings using both the Eftedal and Brubakk grading and the new 'frame-based' counting method. They were also asked to count the number of bubbles in 50 still-frame images, some of which were randomly repeated. A Wilcoxon Spearman ρ calculation was used to assess test-retest reliability of each rater for the repeated still frames. For the video images, weighted kappa statistics, with linear and quadratic weightings, were calculated to measure agreement between raters for the Eftedal and Brubakk method. Bland-Altman plots and intra-class correlation coefficients were used to measure agreement between raters for the frame-based counting method. Frame-based counting showed a better inter-rater agreement than the Eftedal and Brubakk grading, even with relatively inexperienced assessors, and has good intra- and inter-rater reliability. Frame-based bubble counting could be used to evaluate post-dive decompression stress, and offers possibilities for computer-automated algorithms to allow near-real-time counting.
Inter and intra-observer reliability in assessment of the position of the lateral sesamoid in determining the severity of hallux valgus.

PubMed

Panchani, Sunil; Reading, Jonathan; Mehta, Jaysheel

2016-06-01

The position of the lateral sesamoid on standard dorso-plantar weight bearing radiographs, with respect to the lateral cortex of the first metatarsal, has been shown to correlate well with the degree of the hallux valgus angle. This study aimed to assess the inter- and intra-observer error of this new classification system. Five orthopaedic consultants and five trainee orthopaedic surgeons were recruited to assess and document the degree of displacement of the lateral sesamoid on 144 weight-bearing dorso-plantar radiographs on two separate occasions. The severity of hallux valgus was defined as normal (0%), mild (≤50%), moderate (51-≤99%) or severe (≥100%) depending on the percentage displacement of the lateral sesamoid body from the lateral cortical border of the first metatarsal. Consultant intra-observer variability showed good agreement between repeated assessment of the radiographs (mean Kappa=0.75). Intra-observer variability for trainee orthopaedic surgeons also showed good agreement with a mean Kappa=0.73. Intraclass correlations for consultants and trainee surgeons was also high. The new classification system of assessing the severity of hallux valgus shows high inter- and intra-observer variability with good agreement and reproducibility between surgeons of consultant and trainee grades. Copyright © 2015 Elsevier Ltd. All rights reserved.
Reliability and validity of urinary nerve growth factor measurement in women with lower urinary tract symptoms.

PubMed

Vijaya, Gopalan; Cartwright, Rufus; Bhide, Alka; Derpapas, Alexandros; Fernando, Ruwan; Khullar, Vik

2016-11-01

The validity and reliability of measurement of urinary NGF as a diagnostic biomarker in women with lower urinary tract dysfunction (LUTD) is uncertain. We aimed to evaluate both the diagnostic and discriminant validity, and the test-retest reliability of urinary NGF measurement in women with LUTD. Urinary NGF was measured in women with LUTD (n = 205) and asymptomatic subjects (n = 31). Urinary NGF was assayed using an ELISA method and normalized against urinary creatinine. NGF/creatinine ratios were compared between symptom subgroups using Mann-Whitney U test, and between different urodynamic diagnoses using the Kruskal-Wallis test. Receiver Operator Characteristic (ROC) analysis was employed to evaluate the diagnostic performance of urinary NGF. Test-retest reliability of NGF measurement was assessed using intra-class correlation (ICC). Urinary NGF was significantly but non-specifically increased in symptomatic patients when compared to controls (13.33 vs. 2.05 ng NGF/g Cr, P < 0.001). On multivariate logistic regression NGF was a good predictor of patients having OAB or not, however, the adjusted odds ratio only 1.006. ROC analysis demonstrated poor discriminant ability between different symptomatic groups and urodynamic groups. Using a cut off of 13.0 ng NGF/g creatinine the test provides a sensitivity of 81%, but a specificity of only 39% for overactive bladder. The assays demonstrated good test-retest reliability with ICC of 0.889. Although urinary NGF can be reliably assayed, and is increased in various LUTDs, it discriminates poorly between these disorders therefore has very limited potential as a biomarker. Neurourol. Urodynam. 35:944-948, 2016. © 2015 Wiley Periodicals, Inc. © 2015 Wiley Periodicals, Inc.
Development of a Digital-Based Instrument to Assess Perceived Motor Competence in Children: Face Validity, Test-Retest Reliability, and Internal Consistency

PubMed Central

Palmer, Kara K.

2017-01-01

Assessing children’s perceptions of their movement abilities (i.e., perceived competence) is traditionally done using picture scales—Pictorial Scale of Perceived Competence and Acceptance for Young Children or Pictorial Scale of Perceived Movement Skill Competence. Pictures fail to capture the temporal components of movement. To address this limitation, we created a digital-based instrument to assess perceived motor competence: the Digital Scale of Perceived Motor Competence. The purpose of this study was to determine the validity, reliability, and internal consistency of the Digital-based Scale of Perceived Motor Skill Competence. The Digital-based Scale of Perceived Motor Skill Competence is based on the twelve fundamental motor skills from the Test of Gross Motor Development-2nd Edition with a similar layout and item structure as the Pictorial Scale of Perceived Movement Skill Competence. Face Validity of the instrument was examined in Phase I (n = 56; Mage = 8.6 ± 0.7 years, 26 girls). Test-retest reliability and internal consistency were assessed in Phase II (n = 54, Mage = 8.7 years ± 0.5 years, 26 girls). Intra-class correlations (ICC) and Cronbach’s alpha were conducted to determine test-retest reliability and internal consistency for all twelve skills along with locomotor and object control subscales. The Digital Scale of Perceived Motor Competence demonstrates excellent test-retest reliability (ICC = 0.83, total; ICC = 0.77, locomotor; ICC = 0.79, object control) and acceptable/good internal consistency (α = 0.62, total; α = 0.57, locomotor; α = 0.49, object control). Findings provide evidence of the reliability of the three level digital-based instrument of perceived motor competence for older children. PMID:29910408
Demonstration of the test-retest reliability and sensitivity of the Lower Limb Functional Index-10 as a measure of functional recovery post burn injury: a cross-sectional repeated measures study design.

PubMed

Ryland, Margaret E; Grisbrook, Tiffany L; Wood, Fiona M; Phillips, Michael; Edgar, Dale W

2016-01-01

Lower limb burns can significantly delay recovery of function. Measuring lower limb functional outcomes is challenging in the unique burn patient population and necessitates the use of reliable and valid tools. The aims of this study were to examine the test-retest reliability, sensitivity, and internal consistency of Sections 1 and 3 of the Lower Limb Functional Index-10 (LLFI-10) questionnaire for measuring functional ability in patients with lower limb burns over time. Twenty-nine adult patients who had sustained a lower limb burn injury in the previous 12 months completed the test-retest procedure of the study. In addition, the minimal detectable change (MDC) was calculated for Section 1 and 3 of the LLFI-10. Section 1 is focused on the activity limitations experienced by patients with a lower limb disorder whereas Section 3 involves patients indicating their current percentage of pre-injury duties. Section 1 of the LLFI-10 demonstrated excellent test-retest reliability (intra-class correlation coefficient (ICC) 0.98, 95 % CI 0.96-0.99) whilst Section 3 demonstrated high test-retest reliability (ICC 0.88, 95 % CI 0.79-0.94). MDC scores for Sections 1 and 3 were 1.27 points and 30.22 %, respectively. Internal consistency was demonstrated with a significant negative association (r s = -0.83) between Sections 1 and 3 of the LLFI-10 (p < 0.001). This study demonstrates that Section 1 and 3 of the LLFI-10 are reliable for measuring functional ability in patients who have sustained lower limb burns in the previous 12 months, and furthermore, Section 1 is sensitive to changes in patient function over time.
Use of a tibial accelerometer to measure ground reaction force in running: A reliability and validity comparison with force plates.

PubMed

Raper, Damian P; Witchalls, Jeremy; Philips, Elissa J; Knight, Emma; Drew, Michael K; Waddington, Gordon

2018-01-01

The use of microsensor technologies to conduct research and implement interventions in sports and exercise medicine has increased recently. The objective of this paper was to determine the validity and reliability of the ViPerform as a measure of load compared to vertical ground reaction force (GRF) as measured by force plates. Absolute reliability assessment, with concurrent validity. 10 professional triathletes ran 10 trials over force plates with the ViPerform mounted on the mid portion of the medial tibia. Calculated vertical ground reaction force data from the ViPerform was matched to the same stride on the force plate. Bland-Altman (BA) plot of comparative measure of agreement was used to assess the relationship between the calculated load from the accelerometer and the force plates. Reliability was calculated by intra-class correlation coefficients (ICC) with 95% confidence intervals. BA plot indicates minimal agreement between the measures derived from the force plate and ViPerform, with variation at an individual participant plot level. Reliability was excellent (ICC=0.877; 95% CI=0.825-0.917) in calculating the same vertical GRF in a repeated trial. Standard error of measure (SEM) equalled 99.83 units (95% CI=82.10-119.09), which, in turn, gave a minimum detectable change (MDC) value of 276.72 units (95% CI=227.32-330.07). The ViPerform does not calculate absolute values of vertical GRF similar to those measured by a force plate. It does provide a valid and reliable calculation of an athlete's lower limb load at constant velocity. Copyright © 2017 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Reliability and validity of the Performance Recorder 1 for measuring isometric knee flexor and extensor strength.

PubMed

Neil, Sarah E; Myring, Alec; Peeters, Mon Jef; Pirie, Ian; Jacobs, Rachel; Hunt, Michael A; Garland, S Jayne; Campbell, Kristin L

2013-11-01

Muscular strength is a key parameter of rehabilitation programs and a strong predictor of functional capacity. Traditional methods to measure strength, such as manual muscle testing (MMT) and hand-held dynamometry (HHD), are limited by the strength and experience of the tester. The Performance Recorder 1 (PR1) is a strength assessment tool attached to resistance training equipment and may be a time- and cost-effective tool to measure strength in clinical practice that overcomes some limitations of MMT and HHD. However, reliability and validity of the PR1 have not been reported. Test-retest and inter-rater reliability was assessed using the PR1 in healthy adults (n = 15) during isometric knee flexion and extension. Criterion-related validity was assessed through comparison of values obtained from the PR1 and Biodex® isokinetic dynamometer. Test-retest reliability was excellent for peak knee flexion (intra-class correlation coefficient [ICC] of 0.96, 95% CI: 0.85, 0.99) and knee extension (ICC = 0.96, 95% CI: 0.87, 0.99). Inter-rater reliability was also excellent for peak knee flexion (ICC = 0.95, 95% CI: 0.85, 0.99) and peak knee extension (ICC = 0.97, 95% CI: 0.91, 0.99). Validity was moderate for peak knee flexion (ICC = 0.75, 95% CI: 0.38, 0.92) but poor for peak knee extension (ICC = 0.37, 95% CI: 0, 0.73). The PR1 provides a reliable measure of isometric knee flexor and extensor strength in healthy adults that could be used in the clinical setting, but absolute values may not be comparable to strength assessment by gold-standard measures.
Adaptation and reliability of neighborhood environment walkability scale (NEWS) for Iran: A questionnaire for assessing environmental correlates of physical activity

PubMed Central

Hakimian, Pantea; Lak, Azadeh

2016-01-01

Background: In spite of the increased range of inactivity and obesity among Iranian adults, insufficient research has been done on environmental factors influencing physical activity. As a result adapting a subjective (self-report) measurement tool for assessment of physical environment in Iran is critical. Accordingly, in this study Neighborhood Environment Walkability Scale (NEWS) was adapted for Iran and also its reliability was evaluated. Methods: This study was conducted using a systematic adaptation method consisting of 3 steps: translate-back translation procedures, revision by a multidisciplinary panel of local experts and a cognitive study. Then NEWS-Iran was completed among adults aged 18 to 65 years (N=19) with an interval of 15 days. Intra-Class Coefficient (ICC) was used to evaluate the reliability of the adapted questionnaire. Results: NEWS-Iran is an adapted version of NEWS-A (abbreviated) and in the adaptation process five items were added from other versions of NEWS, two subscales were significantly modified for a shorter and more effective questionnaire, and five new items were added about climate factors and site-specific uses. NEWS-Iran showed almost perfect reliability (ICCs: more than 0.8) for all subscales, with items having moderate to almost perfect reliability scores (ICCs: 0.56-0.96). Conclusion: This study introduced NEWS-Iran, which is a reliable version of NEWS for measuring environmental perceptions related to physical activity behavior adapted for Iran. It is the first adapted version of NEWS which demonstrates a systematic adaptation process used by earlier studies. It can be used for other developing countries with similar environmental, social and cultural context. PMID:28210592
Adaptation and reliability of neighborhood environment walkability scale (NEWS) for Iran: A questionnaire for assessing environmental correlates of physical activity.

PubMed

Hakimian, Pantea; Lak, Azadeh

2016-01-01

Background: In spite of the increased range of inactivity and obesity among Iranian adults, insufficient research has been done on environmental factors influencing physical activity. As a result adapting a subjective (self-report) measurement tool for assessment of physical environment in Iran is critical. Accordingly, in this study Neighborhood Environment Walkability Scale (NEWS) was adapted for Iran and also its reliability was evaluated. Methods: This study was conducted using a systematic adaptation method consisting of 3 steps: translate-back translation procedures, revision by a multidisciplinary panel of local experts and a cognitive study. Then NEWS-Iran was completed among adults aged 18 to 65 years (N=19) with an interval of 15 days. Intra-Class Coefficient (ICC) was used to evaluate the reliability of the adapted questionnaire. Results: NEWS-Iran is an adapted version of NEWS-A (abbreviated) and in the adaptation process five items were added from other versions of NEWS, two subscales were significantly modified for a shorter and more effective questionnaire, and five new items were added about climate factors and site-specific uses. NEWS-Iran showed almost perfect reliability (ICCs: more than 0.8) for all subscales, with items having moderate to almost perfect reliability scores (ICCs: 0.56-0.96). Conclusion: This study introduced NEWS-Iran, which is a reliable version of NEWS for measuring environmental perceptions related to physical activity behavior adapted for Iran. It is the first adapted version of NEWS which demonstrates a systematic adaptation process used by earlier studies. It can be used for other developing countries with similar environmental, social and cultural context.
Validity and inter-observer reliability of subjective hand-arm vibration assessments.

PubMed

Coenen, Pieter; Formanoy, Margriet; Douwes, Marjolein; Bosch, Tim; de Kraker, Heleen

2014-07-01

Exposure to mechanical vibrations at work (e.g., due to handling powered tools) is a potential occupational risk as it may cause upper extremity complaints. However, reliable and valid assessment methods for vibration exposure at work are lacking. Measuring hand-arm vibration objectively is often difficult and expensive, while often used information provided by manufacturers lacks detail. Therefore, a subjective hand-arm vibration assessment method was tested on validity and inter-observer reliability. In an experimental protocol, sixteen tasks handling powered tools were executed by two workers. Hand-arm vibration was assessed subjectively by 16 observers according to the proposed subjective assessment method. As a gold standard reference, hand-arm vibration was measured objectively using a vibration measurement device. Weighted κ's were calculated to assess validity, intra-class-correlation coefficients (ICCs) were calculated to assess inter-observer reliability. Inter-observer reliability of the subjective assessments depicting the agreement among observers can be expressed by an ICC of 0.708 (0.511-0.873). The validity of the subjective assessments as compared to the gold-standard reference can be expressed by a weighted κ of 0.535 (0.285-0.785). Besides, the percentage of exact agreement of the subjective assessment compared to the objective measurement was relatively low (i.e., 52% of all tasks). This study shows that subjectively assessed hand-arm vibrations are fairly reliable among observers and moderately valid. This assessment method is a first attempt to use subjective risk assessments of hand-arm vibration. Although, this assessment method can benefit from some future improvement, it can be of use in future studies and in field-based ergonomic assessments. Copyright © 2014 Elsevier Ltd and The Ergonomics Society. All rights reserved.
Validity and reliability of the Spanish version of the DN4 (Douleur Neuropathique 4 questions) questionnaire for differential diagnosis of pain syndromes associated to a neuropathic or somatic component

PubMed Central

Perez, Concepcion; Galvez, Rafael; Huelbes, Silvia; Insausti, Joaquin; Bouhassira, Didier; Diaz, Silvia; Rejas, Javier

2007-01-01

Background This study assesses the validity and reliability of the Spanish version of DN4 questionnaire as a tool for differential diagnosis of pain syndromes associated to a neuropathic (NP) or somatic component (non-neuropathic pain, NNP). Methods A study was conducted consisting of two phases: cultural adaptation into the Spanish language by means of conceptual equivalence, including forward and backward translations in duplicate and cognitive debriefing, and testing of psychometric properties in patients with NP (peripheral, central and mixed) and NNP. The analysis of psychometric properties included reliability (internal consistency, inter-rater agreement and test-retest reliability) and validity (ROC curve analysis, agreement with the reference diagnosis and determination of sensitivity, specificity, and positive and negative predictive values in different subsamples according to type of NP). Results A sample of 164 subjects (99 women, 60.4%; age: 60.4 ± 16.0 years), 94 (57.3%) with NP (36 with peripheral, 32 with central, and 26 with mixed pain) and 70 with NNP was enrolled. The questionnaire was reliable [Cronbach's alpha coefficient: 0.71, inter-rater agreement coefficient: 0.80 (0.71–0.89), and test-retest intra-class correlation coefficient: 0.95 (0.92–0.97)] and valid for a cut-off value ≥ 4 points, which was the best value to discriminate between NP and NNP subjects. Discussion This study, representing the first validation of the DN4 questionnaire into another language different than the original, not only supported its high discriminatory value for identification of neuropathic pain, but also provided supplemental psychometric validation (i.e. test-retest reliability, influence of educational level and pain intensity) and showed its validity in mixed pain syndromes. PMID:18053212
Validity of palatal superimposition of 3-dimensional digital models in cases treated with rapid maxillary expansion and maxillary protraction headgear

PubMed Central

Choi, Jin-Il; Jost-Brinkmann, Paul-Georg; Choi, Dong-Soon; Jang, In-San

2012-01-01

Objective The purpose of this study was to evaluate the validity of the 3-dimensional (3D) superimposition method of digital models in patients who received treatment with rapid maxillary expansion (RME) and maxillary protraction headgear. Methods The material consisted of pre- and post-treatment maxillary dental casts and lateral cephalograms of 30 patients, who underwent RME and maxillary protraction headgear treatment. Digital models were superimposed using the palate as a reference area. The movement of the maxillary central incisor and the first molar was measured on superimposed cephalograms and 3D digital models. To determine whether any difference existed between the 2 measuring techniques, intra-class correlation (ICC) and Bland-Altman plots were analyzed. Results The measurements on the 3D digital models and cephalograms showed a very high correlation in the antero-posterior direction (ICC, 0.956 for central incisor and 0.941 for first molar) and a moderate correlation in the vertical direction (ICC, 0.748 for central incisor and 0.717 for first molar). Conclusions The 3D model superimposition method using the palate as a reference area is as clinically reliable for assessing antero-posterior tooth movement as cephalometric superimposition, even in cases treated with orthopedic appliances, such as RME and maxillary protraction headgear. PMID:23173116
Psychometric testing of the modified Care Dependency Scale (Neuro-CDS).

PubMed

Piredda, Michela; Biagioli, Valentina; Gambale, Giulia; Porcelli, Elisa; Barbaranelli, Claudio; Palese, Alvisa; De Marinis, Maria Grazia

2016-01-01

Effective measures of nursing care dependency in neurorehabilitation are warranted to plan nursing interventions to help patients avoid increasing dependency. The Care Dependency Scale (CDS) is a theory-based, comprehensive tool to evaluate functional disability. This study aimed to modify the CDS for neurological and neurorehabilitation patients (Neuro-CDS) and to test its psychometric properties in adult neurorehabilitation inpatients. Exploratory factor analysis (EFA) was performed using a Maximum Likelihood robust (MLR) estimator. The Barthel Index (BI) was used to evaluate concurrent validity. Stability was measured using the Intra-class Correlation Coefficient (ICC). The sample included 124 patients (mean age = 69.7 years, 54% male). The EFA revealed a two-factor structure with good fit indexes, Factor 1 (Physical care dependence) loaded by 11 items and Factor 2 (Psycho-social care dependence) loaded by 4 items. The correlation between factors was 0.61. Correlations between Factor 1 and the BI and between Factor 2 and the BI were r = 0.843 and r = 0.677, respectively (p < 0.001). The Cronbach's alpha coefficients were 0.99 and 0.88 (Factor 1 and 2). The ICC was 0.98. The Neuro-CDS is multidimensional, valid, reliable, straightforward, and able to measure care dependence in neurorehabilitation patients as a basis for individualized and holistic care.
A disease-specific measure of health-related quality of life for use in adults with immune thrombocytopenic purpura: its development and validation.

PubMed

Mathias, Susan D; Bussel, James B; George, James N; McMillan, Robert; Okano, Gary J; Nichol, Janet L

2007-02-22

No validated disease-specific measures are available to assess health-related quality of life (HRQoL) in adult subjects with immune thrombocytopenic purpura (ITP). Therefore, we sought to develop and validate the ITP-Patient Assessment Questionnaire (ITP-PAQ) for adult subjects with ITP. Information from literature reviews, focus groups with subjects, and clinicians were used to develop 50 ITP-PAQ items. Factor analyses were conducted to develop the scale structure and reduce the number of items. The final 44-item ITP-PAQ, which includes ten scales [Symptoms (S), Bother-Physical Health (B), Fatigue/Sleep (FT), Activity (A), Fear (FR), Psychological Health (PH), Work (W), Social Activity (SA), Women's Reproductive Health (RH), and Overall (QoL)], was self-administered to adult ITP subjects at baseline and 7-10 days later. Test-retest reliability, internal consistency reliability, construct and known groups validity of the final ITP-PAQ were evaluated. Seventy-three subjects with ITP completed the questionnaire twice. Test-retest reliability, as measured by the intra-class correlation, ranged from 0.52-0.90. Internal consistency reliability was demonstrated with Cronbach's alpha for all scales above the acceptable level of 0.70 (range: 0.71-0.92), except for RH (0.66). Construct validity, assessed by correlating ITP-PAQ scales with established measures (Short Form-36 v.1, SF-36 and Center for Epidemiologic Studies Depression Scale, CES-D), was demonstrated through moderate correlations between the ITP-PAQ SA and SF-36 Social Function scales (r = 0.67), and between ITP-PAQ PH and SF-36 Mental Health Scales (r = 0.63). Moderate to strong inter-scale correlations were reported between ITP-PAQ scales and the CES-D, except for the RH scale. Known groups validity was evaluated by comparing mean scores for groups that differed clinically. Statistically significant differences (p < 0.01) were observed when subjects were categorized by treatment status [S, FT, B, A, PH, and QoL, perceived effectiveness of ITP treatment [S], and time elapsed since ITP diagnosis [PH]. Results provide preliminary evidence of the reliability and validity of the ITP-PAQ in adult subjects with ITP. Further work should be conducted to assess the responsiveness and to estimate the minimal clinical important difference of the ITP-PAQ to more fully understand the impact of ITP and its treatments on HRQoL.
Inter- and intrarater reliability of the Chicago Classification in pediatric high-resolution esophageal manometry recordings.

PubMed

Singendonk, M M J; Smits, M J; Heijting, I E; van Wijk, M P; Nurko, S; Rosen, R; Weijenborg, P W; Abu-Assi, R; Hoekman, D R; Kuizenga-Wessel, S; Seiboth, G; Benninga, M A; Omari, T I; Kritas, S

2015-02-01

The Chicago Classification (CC) facilitates interpretation of high-resolution manometry (HRM) recordings. Application of this adult based algorithm to the pediatric population is unknown. We therefore assessed intra and interrater reliability of software-based CC diagnosis in a pediatric cohort. Thirty pediatric solid state HRM recordings (13M; mean age 12.1 ± 5.1 years) assessing 10 liquid swallows per patient were analyzed twice by 11 raters (six experts, five non-experts). Software-placed anatomical landmarks required manual adjustment or removal. Integrated relaxation pressure (IRP4s), distal contractile integral (DCI), contractile front velocity (CFV), distal latency (DL) and break size (BS), and an overall CC diagnosis were software-generated. In addition, raters provided their subjective CC diagnosis. Reliability was calculated with Cohen's and Fleiss' kappa (κ) and intraclass correlation coefficient (ICC). Intra- and interrater reliability of software-generated CC diagnosis after manual adjustment of landmarks was substantial (mean κ = 0.69 and 0.77 respectively) and moderate-substantial for subjective CC diagnosis (mean κ = 0.70 and 0.58 respectively). Reliability of both software-generated and subjective diagnosis of normal motility was high (κ = 0.81 and κ = 0.79). Intra- and interrater reliability were excellent for IRP4s, DCI, and BS. Experts had higher interrater reliability than non-experts for DL (ICC = 0.65 vs ICC = 0.36 respectively) and the software-generated diagnosis diffuse esophageal spasm (DES, κ = 0.64 vs κ = 0.30). Among experts, the reliability for the subjective diagnosis of achalasia and esophageal gastric junction outflow obstruction was moderate-substantial (κ = 0.45-0.82). Inter- and intrarater reliability of software-based CC diagnosis of pediatric HRM recordings was high overall. However, experience was a factor influencing the diagnosis of some motility disorders, particularly DES and achalasia. © 2014 John Wiley & Sons Ltd.
NovoTTF™-100A System (Tumor Treating Fields) transducer array layout planning for glioblastoma: a NovoTAL™ system user study.

PubMed

Chaudhry, Aafia; Benson, Laura; Varshaver, Michael; Farber, Ori; Weinberg, Uri; Kirson, Eilon; Palti, Yoram

2015-11-11

Optune™, previously known as the NovoTTF-100A System™, generates Tumor Treating Fields (TTFields), an effective anti-mitotic therapy for glioblastoma. The system delivers intermediate frequency, alternating electric fields to the supratentorial brain. Patient therapy is personalized by configuring transducer array layout placement on the scalp to the tumor site using MRI measurements and the NovoTAL System. Transducer array layout mapping optimizes therapy by maximizing electric field intensity to the tumor site. This study evaluated physician performance in conducting transducer array layout mapping using the NovoTAL System compared with mapping performed by the Novocure in-house clinical team. Fourteen physicians (7 neuro-oncologists, 4 medical oncologists, and 3 neurosurgeons) evaluated five blinded cases of recurrent glioblastoma and performed head size and tumor location measurements using a standard Digital Imaging and Communications in Medicine reader. Concordance with Novocure measurement and intra- and inter-rater reliability were assessed using relevant correlation coefficients. The study criterion for success was a concordance correlation coefficient (CCC) >0.80. CCC for each physician versus Novocure on 20 MRI measurements was 0.96 (standard deviation, SD ± 0.03, range 0.90-1.00), indicating very high agreement between the two groups. Intra- and inter-rater reliability correlation coefficients were similarly high: 0.83 (SD ±0.15, range 0.54-1.00) and 0.80 (SD ±0.18, range 0.48-1.00), respectively. This user study demonstrated an excellent level of concordance between prescribing physicians and Novocure in-house clinical teams in performing transducer array layout planning. Intra-rater reliability was very high, indicating reproducible performance. Physicians prescribing TTFields, when trained on the NovoTAL System, can independently perform transducer array layout mapping required for the initiation and maintenance of patients on TTFields therapy.
The functional significance of EEG microstates--Associations with modalities of thinking.

PubMed

Milz, P; Faber, P L; Lehmann, D; Koenig, T; Kochi, K; Pascual-Marqui, R D

2016-01-15

The momentary, global functional state of the brain is reflected by its electric field configuration. Cluster analytical approaches consistently extracted four head-surface brain electric field configurations that optimally explain the variance of their changes across time in spontaneous EEG recordings. These four configurations are referred to as EEG microstate classes A, B, C, and D and have been associated with verbal/phonological, visual, subjective interoceptive-autonomic processing, and attention reorientation, respectively. The present study tested these associations via an intra-individual and inter-individual analysis approach. The intra-individual approach tested the effect of task-induced increased modality-specific processing on EEG microstate parameters. The inter-individual approach tested the effect of personal modality-specific parameters on EEG microstate parameters. We obtained multichannel EEG from 61 healthy, right-handed, male students during four eyes-closed conditions: object-visualization, spatial-visualization, verbalization (6 runs each), and resting (7 runs). After each run, we assessed participants' degrees of object-visual, spatial-visual, and verbal thinking using subjective reports. Before and after the recording, we assessed modality-specific cognitive abilities and styles using nine cognitive tests and two questionnaires. The EEG of all participants, conditions, and runs was clustered into four classes of EEG microstates (A, B, C, and D). RMANOVAs, ANOVAs and post-hoc paired t-tests compared microstate parameters between conditions. TANOVAs compared microstate class topographies between conditions. Differences were localized using eLORETA. Pearson correlations assessed interrelationships between personal modality-specific parameters and EEG microstate parameters during no-task resting. As hypothesized, verbal as opposed to visual conditions consistently affected the duration, occurrence, and coverage of microstate classes A and B. Contrary to associations suggested by previous reports, parameters were increased for class A during visualization, and class B during verbalization. In line with previous reports, microstate D parameters were increased during no-task resting compared to the three internal, goal-directed tasks. Topographic differences between conditions included particular sub-regions of components of the metabolic default mode network. Modality-specific personal parameters did not consistently correlate with microstate parameters except verbal cognitive style which correlated negatively with microstate class A duration and positively with class C occurrence. This is the first study that aimed to induce EEG microstate class parameter changes based on their hypothesized functional significance. Beyond the associations of microstate classes A and B with visual and verbal processing, respectively, our results suggest that a finely-tuned interplay between all four EEG microstate classes is necessary for the continuous formation of visual and verbal thoughts. Our results point to the possibility that the EEG microstate classes may represent the head-surface measured activity of intra-cortical sources primarily exhibiting inhibitory functions. However, additional studies are needed to verify and elaborate on this hypothesis. Copyright © 2015 Elsevier Inc. All rights reserved.
Is scaffold hopping a reliable indicator for the ability of computational methods to identify structurally diverse active compounds?

NASA Astrophysics Data System (ADS)

Dimova, Dilyana; Bajorath, Jürgen

2017-07-01

Computational scaffold hopping aims to identify core structure replacements in active compounds. To evaluate scaffold hopping potential from a principal point of view, regardless of the computational methods that are applied, a global analysis of conventional scaffolds in analog series from compound activity classes was carried out. The majority of analog series was found to contain multiple scaffolds, thus enabling the detection of intra-series scaffold hops among closely related compounds. More than 1000 activity classes were found to contain increasing proportions of multi-scaffold analog series. Thus, using such activity classes for scaffold hopping analysis is likely to overestimate the scaffold hopping (core structure replacement) potential of computational methods, due to an abundance of artificial scaffold hops that are possible within analog series.
Development, scoring, and reliability of the Microscale Audit of Pedestrian Streetscapes (MAPS)

PubMed Central

2013-01-01

Background Streetscape (microscale) features of the built environment can influence people’s perceptions of their neighborhoods’ suitability for physical activity. Many microscale audit tools have been developed, but few have published systematic scoring methods. We present the development, scoring, and reliability of the Microscale Audit of Pedestrian Streetscapes (MAPS) tool and its theoretically-based subscales. Methods MAPS was based on prior instruments and was developed to assess details of streetscapes considered relevant for physical activity. MAPS sections (route, segments, crossings, and cul-de-sacs) were scored by two independent raters for reliability analyses. There were 290 route pairs, 516 segment pairs, 319 crossing pairs, and 53 cul-de-sac pairs in the reliability sample. Individual inter-rater item reliability analyses were computed using Kappa, intra-class correlation coefficient (ICC), and percent agreement. A conceptual framework for subscale creation was developed using theory, expert consensus, and policy relevance. Items were grouped into subscales, and subscales were analyzed for inter-rater reliability at tiered levels of aggregation. Results There were 160 items included in the subscales (out of 201 items total). Of those included in the subscales, 80 items (50.0%) had good/excellent reliability, 41 items (25.6%) had moderate reliability, and 18 items (11.3%) had low reliability, with limited variability in the remaining 21 items (13.1%). Seventeen of the 20 route section subscales, valence (positive/negative) scores, and overall scores (85.0%) demonstrated good/excellent reliability and 3 demonstrated moderate reliability. Of the 16 segment subscales, valence scores, and overall scores, 12 (75.0%) demonstrated good/excellent reliability, three demonstrated moderate reliability, and one demonstrated poor reliability. Of the 8 crossing subscales, valence scores, and overall scores, 6 (75.0%) demonstrated good/excellent reliability, and 2 demonstrated moderate reliability. The cul-de-sac subscale demonstrated good/excellent reliability. Conclusions MAPS items and subscales predominantly demonstrated moderate to excellent reliability. The subscales and scoring system represent a theoretically based framework for using these complex microscale data and may be applicable to other similar instruments. PMID:23621947
Validity and reliability of the robotic objective structured assessment of technical skills

PubMed Central

Siddiqui, Nazema Y.; Galloway, Michael L.; Geller, Elizabeth J.; Green, Isabel C.; Hur, Hye-Chun; Langston, Kyle; Pitter, Michael C.; Tarr, Megan E.; Martino, Martin A.

2015-01-01

Objective Objective structured assessments of technical skills (OSATS) have been developed to measure the skill of surgical trainees. Our aim was to develop an OSATS specifically for trainees learning robotic surgery. Study Design This is a multi-institutional study in eight academic training programs. We created an assessment form to evaluate robotic surgical skill through five inanimate exercises. Obstetrics/gynecology, general surgery, and urology residents, fellows, and faculty completed five robotic exercises on a standard training model. Study sessions were recorded and randomly assigned to three blinded judges who scored performance using the assessment form. Construct validity was evaluated by comparing scores between participants with different levels of surgical experience; inter- and intra-rater reliability were also assessed. Results We evaluated 83 residents, 9 fellows, and 13 faculty, totaling 105 participants; 88 (84%) were from obstetrics/gynecology. Our assessment form demonstrated construct validity, with faculty and fellows performing significantly better than residents (mean scores: 89 ± 8 faculty; 74 ± 17 fellows; 59 ± 22 residents, p<0.01). In addition, participants with more robotic console experience scored significantly higher than those with fewer prior console surgeries (p<0.01). R-OSATS demonstrated good inter-rater reliability across all five drills (mean Cronbach's α: 0.79 ± 0.02). Intra-rater reliability was also high (mean Spearman's correlation: 0.91 ± 0.11). Conclusions We developed an assessment form for robotic surgical skill that demonstrates construct validity, inter- and intra-rater reliability. When paired with standardized robotic skill drills this form may be useful to distinguish between levels of trainee performance. PMID:24807319

Some links on this page may take you to non-federal websites. Their policies may differ from this site.