obtain reliable measurements: Topics by Science.gov

Sample records for obtain reliable measurements

The Data Evaluation for Obtaining Accuracy and Reliability

NASA Astrophysics Data System (ADS)

Kim, Chang Geun; Chae, Kyun Shik; Lee, Sang Tae; Bhang, Gun Woong

2012-11-01

Nemours scientific measurement results are flooded from the paper, data book, etc. as fast growing of internet. We meet many different measurement results on the same measurand. In this moment, we are face to choose most reliable one out of them. But it is not easy to choose and use the accurate and reliable data as we do at an ice cream parlor. Even expert users feel difficult to distinguish the accurate and reliable scientific data from huge amount of measurement results. For this reason, the data evaluation is getting more important as the fast growing of internet and globalization. Furthermore the expressions of measurement results are not in standardi-zation. As these need, the international movement has been enhanced. At the first step, the global harmonization of terminology used in metrology and the expression of uncertainty in measurement were published in ISO. These methods are wide spread to many area of science on their measurement to obtain the accuracy and reliability. In this paper, it is introduced that the GUM, SRD and data evaluation on atomic collisions.
Reliability and validity of pendulum test measures of spasticity obtained with the Polhemus tracking system from patients with chronic stroke.

PubMed

Bohannon, Richard W; Harrison, Steven; Kinsella-Shaw, Jeffrey

2009-07-30

Spasticity is a common impairment accompanying stroke. Spasticity of the quadriceps femoris muscle can be quantified using the pendulum test. The measurement properties of pendular kinematics captured using a magnetic tracking system has not been studied among patients who have experienced a stroke. Therefore, this study describes the test-retest reliability and known groups and convergent validity of the pendulum test measures obtained with the Polhemus tracking system. Eight patients with chronic stroke underwent pendulum tests with their affected and unaffected lower limbs, with and without the addition of a 2.2 kg cuff weight at the ankle, using the Polhemus magnetic tracking system. Also measured bilaterally were knee resting angles, Ashworth scores (grades 0-4) of quadriceps femoris muscles, patellar tendon (knee jerk) reflexes (grades 0-4), and isometric knee extension force. Three measures obtained from pendular traces of the affected side were reliable (intraclass correlation coefficient > or = .844). Known groups validity was confirmed by demonstration of a significant difference in the measurements between sides. Convergent validity was supported by correlations > or = .57 between pendulum test measures and other measures reflective of spasticity. Pendulum test measures obtained with the Polhemus tracking system from the affected side of patients with stroke have good test-retest reliability and both known groups and convergent validity.
Reliability and validity of pendulum test measures of spasticity obtained with the Polhemus tracking system from patients with chronic stroke

PubMed Central

Bohannon, Richard W; Harrison, Steven; Kinsella-Shaw, Jeffrey

2009-01-01

Background Spasticity is a common impairment accompanying stroke. Spasticity of the quadriceps femoris muscle can be quantified using the pendulum test. The measurement properties of pendular kinematics captured using a magnetic tracking system has not been studied among patients who have experienced a stroke. Therefore, this study describes the test-retest reliability and known groups and convergent validity of the pendulum test measures obtained with the Polhemus tracking system. Methods Eight patients with chronic stroke underwent pendulum tests with their affected and unaffected lower limbs, with and without the addition of a 2.2 kg cuff weight at the ankle, using the Polhemus magnetic tracking system. Also measured bilaterally were knee resting angles, Ashworth scores (grades 0–4) of quadriceps femoris muscles, patellar tendon (knee jerk) reflexes (grades 0–4), and isometric knee extension force. Results Three measures obtained from pendular traces of the affected side were reliable (intraclass correlation coefficient ≥ .844). Known groups validity was confirmed by demonstration of a significant difference in the measurements between sides. Convergent validity was supported by correlations ≥ .57 between pendulum test measures and other measures reflective of spasticity. Conclusion Pendulum test measures obtained with the Polhemus tracking system from the affected side of patients with stroke have good test-retest reliability and both known groups and convergent validity. PMID:19642989
Reliability of the measures of weight-bearing distribution obtained during quiet stance by digital scales in subjects with and without hemiparesis.

PubMed

de Araujo-Barbosa, Paulo Henrique Ferreira; de Menezes, Lidiane Teles; Costa, Abraão Souza; Couto Paz, Clarissa Cardoso Dos Santos; Fachin-Martins, Emerson

2015-05-01

Described as an alternative way of assessing weight-bearing asymmetries, the measures obtained from digital scales have been used as an index to classify weight-bearing distribution. This study aimed to describe the intra-test and the test/retest reliability of measures in subjects with and without hemiparesis during quiet stance. The percentage of body weight borne by one limb was calculated for a sample of subjects with hemiparesis and for a control group that was matched by gender and age. A two-way analysis of variance was used to verify the intra-test reliability. This analysis was calculated using the differences between the averages of the measures obtained during single, double or triple trials. The intra-class correlation coefficient (ICC) was utilized and data plotted using the Bland-Altman method. The intra-test analysis showed significant differences, only observed in the hemiparesis group, between the measures obtained by single and triple trials. Excellent and moderate ICC values (0.69-0.84) between test and retest were observed in the hemiparesis group, while for control groups ICC values (0.41-0.74) were classified as moderate, progressing from almost poor for measures obtained by a single trial to almost excellent for those obtained by triple trials. In conclusion, good reliability ranging from moderate to excellent classifications was found for participants with and without hemiparesis. Moreover, an improvement of the repeatability was observed with fewer trials for participants with hemiparesis, and with more trials for participants without hemiparesis.
Reliability and Accuracy of Static Parameters Obtained From Ink and Pressure Platform Footprints.

PubMed

Zuil-Escobar, Juan Carlos; Martínez-Cepa, Carmen Belén; Martín-Urrialde, Jose Antonio; Gómez-Conesa, Antonia

2016-09-01

The purpose of this study was to evaluate the accuracy and the intrarater reliability of arch angle (AA), Staheli Index (SI), and Chippaux-Smirak Index (CSI) obtained from ink and pressure platform footprints. We obtained AA, SI, and CSI measurements from ink pedigraph footprints and pressure platform footprints in 40 healthy participants (aged 25.65 ± 5.187 years). Intrarater reliability was calculated for all parameters obtained using the 2 methods. Standard error of measurement and minimal detectable change were also calculated. A repeated-measure analysis of variance was used to identify differences between ink and pressure platform footprints. Intraclass correlation coefficient and Bland and Altman plots were used to assess similar parameters obtained using different methods. Intrarater reliability was >0.9 for all parameters and was slightly higher for the ink footprints. No statistical difference was reported in repeated-measure analysis of variance for any of the parameters. Intraclass correlation coefficient values from AA, SI, and CSI that were obtained using ink footprints and pressure platform footprints were excellent, ranging from 0.797 to 0.829. However, pressure platform overestimated AA and underestimated SI and CSI. Our study revealed that AA, SI, and CSI were similar regardless of whether the ink or pressure platform method was used. In addition, the parameters indicated high intrarater reliability and were reproducible. Copyright © 2016. Published by Elsevier Inc.
Reproducibility and interoperator reliability of obtaining images and measurements of the cervix and uterus with brachytherapy treatment applicators in situ using transabdominal ultrasound.

PubMed

van Dyk, Sylvia; Garth, Margaret; Oates, Amanda; Kondalsamy-Chennakesavan, Srinivas; Schneider, Michal; Bernshaw, David; Narayan, Kailash

2016-01-01

To validate interoperator reliability of brachytherapy radiation therapists (RTs) in obtaining an ultrasound image and measuring the cervix and uterine dimensions using transabdominal ultrasound. Patients who underwent MRI with applicators in situ after the first insertion were included in the study. Imaging was performed by three RTs (RT1, RT2, and RT3) with varying degrees of ultrasound experience. All RTs were required to obtain a longitudinal planning image depicting the applicator in the uterine canal and measure the cervix and uterus. The MRI scan, taken 1 hour after the ultrasound, was used as the reference standard against which all measurements were compared. Measurements were analyzed with intraclass correlation coefficient and Bland-Altman plots. All RTs were able to obtain a suitable longitudinal image for each patient in the study. Mean differences (SD) between MRI and ultrasound measurements obtained by RTs ranged from 3.5 (3.6) to 4.4 (4.23) mm and 0 (3.0) to 0.9 (2.5) mm on the anterior and posterior surface of the cervix, respectively. Intraclass correlation coefficient for absolute agreement between MRI and RTs was >0.9 for all posterior measurement points in the cervix and ranged from 0.41 to 0.92 on the anterior surface. Measurements were not statistically different between RTs at any measurement point. RTs with variable training attained high levels of interoperator reliability when using transabdominal ultrasound to obtain images and measurements of the uterus and cervix with brachytherapy applicators in situ. Access to training and use of a well-defined protocol assist in achieving these high levels of reliability. Copyright © 2016 American Brachytherapy Society. Published by Elsevier Inc. All rights reserved.
Obtaining Reliable Estimates of Ambulatory Physical Activity in People with Parkinson's Disease.

PubMed

Paul, Serene S; Ellis, Terry D; Dibble, Leland E; Earhart, Gammon M; Ford, Matthew P; Foreman, K Bo; Cavanaugh, James T

2016-05-05

We determined the number of days required, and whether to include weekdays and/or weekends, to obtain reliable measures of ambulatory physical activity in people with Parkinson's disease (PD). Ninety-two persons with PD wore a step activity monitor for seven days. The number of days required to obtain a reliable estimate of daily activity was determined from the mean intraclass correlation (ICC2,1) for all possible combinations of 1-6 consecutive days of monitoring. Two days of monitoring were sufficient to obtain reliable daily activity estimates (ICC2,1 > 0.9). Amount (p = 0.03) but not intensity (p = 0.13) of ambulatory activity was greater on weekdays than weekends. Activity prescription based on amount rather than intensity may be more appropriate for people with PD.
Uncertainties in obtaining high reliability from stress-strength models

NASA Technical Reports Server (NTRS)

Neal, Donald M.; Matthews, William T.; Vangel, Mark G.

1992-01-01

There has been a recent interest in determining high statistical reliability in risk assessment of aircraft components. The potential consequences are identified of incorrectly assuming a particular statistical distribution for stress or strength data used in obtaining the high reliability values. The computation of the reliability is defined as the probability of the strength being greater than the stress over the range of stress values. This method is often referred to as the stress-strength model. A sensitivity analysis was performed involving a comparison of reliability results in order to evaluate the effects of assuming specific statistical distributions. Both known population distributions, and those that differed slightly from the known, were considered. Results showed substantial differences in reliability estimates even for almost nondetectable differences in the assumed distributions. These differences represent a potential problem in using the stress-strength model for high reliability computations, since in practice it is impossible to ever know the exact (population) distribution. An alternative reliability computation procedure is examined involving determination of a lower bound on the reliability values using extreme value distributions. This procedure reduces the possibility of obtaining nonconservative reliability estimates. Results indicated the method can provide conservative bounds when computing high reliability. An alternative reliability computation procedure is examined involving determination of a lower bound on the reliability values using extreme value distributions. This procedure reduces the possibility of obtaining nonconservative reliability estimates. Results indicated the method can provide conservative bounds when computing high reliability.
The number of measurements needed to obtain high reliability for traits related to enzymatic activities and photosynthetic compounds in soybean plants infected with Phakopsora pachyrhizi.

PubMed

Oliveira, Tássia Boeno de; Azevedo Peixoto, Leonardo de; Teodoro, Paulo Eduardo; Alvarenga, Amauri Alves de; Bhering, Leonardo Lopes; Campo, Clara Beatriz Hoffmann

2018-01-01

Asian rust affects the physiology of soybean plants and causes losses in yield. Repeatability coefficients may help breeders to know how many measurements are needed to obtain a suitable reliability for a target trait. Therefore, the objectives of this study were to determine the repeatability coefficients of 14 traits in soybean plants inoculated with Phakopsora pachyrhizi and to establish the minimum number of measurements needed to predict the breeding value with high accuracy. Experiments were performed in a 3x2 factorial arrangement with three treatments and two inoculations in a random block design. Repeatability coefficients, coefficients of determination and number of measurements needed to obtain a certain reliability were estimated using ANOVA, principal component analysis based on the covariance matrix and the correlation matrix, structural analysis and mixed model. It was observed that the principal component analysis based on the covariance matrix out-performed other methods for almost all traits. Significant differences were observed for all traits except internal CO2 concentration for the treatment effects. For the measurement effects, all traits were significantly different. In addition, significant differences were found for all Treatment x Measurement interaction traits except coumestrol, chitinase and chlorophyll content. Six measurements were suitable to obtain a coefficient of determination higher than 0.7 for all traits based on principal component analysis. The information obtained from this research will help breeders and physiologists determine exactly how many measurements are needed to evaluate each trait in soybean plants infected by P. pachyrhizi with a desirable reliability.
The number of measurements needed to obtain high reliability for traits related to enzymatic activities and photosynthetic compounds in soybean plants infected with Phakopsora pachyrhizi

PubMed Central

de Oliveira, Tássia Boeno; Teodoro, Paulo Eduardo; de Alvarenga, Amauri Alves; Bhering, Leonardo Lopes; Campo, Clara Beatriz Hoffmann

2018-01-01

Asian rust affects the physiology of soybean plants and causes losses in yield. Repeatability coefficients may help breeders to know how many measurements are needed to obtain a suitable reliability for a target trait. Therefore, the objectives of this study were to determine the repeatability coefficients of 14 traits in soybean plants inoculated with Phakopsora pachyrhizi and to establish the minimum number of measurements needed to predict the breeding value with high accuracy. Experiments were performed in a 3x2 factorial arrangement with three treatments and two inoculations in a random block design. Repeatability coefficients, coefficients of determination and number of measurements needed to obtain a certain reliability were estimated using ANOVA, principal component analysis based on the covariance matrix and the correlation matrix, structural analysis and mixed model. It was observed that the principal component analysis based on the covariance matrix out-performed other methods for almost all traits. Significant differences were observed for all traits except internal CO2 concentration for the treatment effects. For the measurement effects, all traits were significantly different. In addition, significant differences were found for all Treatment x Measurement interaction traits except coumestrol, chitinase and chlorophyll content. Six measurements were suitable to obtain a coefficient of determination higher than 0.7 for all traits based on principal component analysis. The information obtained from this research will help breeders and physiologists determine exactly how many measurements are needed to evaluate each trait in soybean plants infected by P. pachyrhizi with a desirable reliability. PMID:29438380
Measurements using orthodontic analysis software on digital models obtained by 3D scans of plaster casts : Intrarater reliability and validity.

PubMed

Czarnota, Judith; Hey, Jeremias; Fuhrmann, Robert

2016-01-01

The purpose of this work was to determine the reliability and validity of measurements performed on digital models with a desktop scanner and analysis software in comparison with measurements performed manually on conventional plaster casts. A total of 20 pairs of plaster casts reflecting the intraoral conditions of 20 fully dentate individuals were digitized using a three-dimensional scanner (D700; 3Shape). A series of defined parameters were measured both on the resultant digital models with analysis software (Ortho Analyzer; 3Shape) and on the original plaster casts with a digital caliper (Digimatic CD-15DCX; Mitutoyo). Both measurement series were repeated twice and analyzed for intrarater reliability based on intraclass correlation coefficients (ICCs). The results from the digital models were evaluated for their validity against the casts by calculating mean-value differences and associated 95 % limits of agreement (Bland-Altman method). Statistically significant differences were identified via a paired t test. Significant differences were obtained for 16 of 24 tooth-width measurements, for 2 of 5 sites of contact-point displacement in the mandibular anterior segment, for overbite, for maxillary intermolar distance, for Little's irregularity index, and for the summation indices of maxillary and mandibular incisor width. Overall, however, both the mean differences between the results obtained on the digital models versus on the plaster casts and the dispersion ranges associated with these differences suggest that the deviations incurred by the digital measuring technique are not clinically significant. Digital models are adequately reproducible and valid to be employed for routine measurements in orthodontic practice.
Dental measurements and Bolton index reliability and accuracy obtained from 2D digital, 3D segmented CBCT, and 3d intraoral laser scanner

PubMed Central

San José, Verónica; Bellot-Arcís, Carlos; Tarazona, Beatriz; Zamora, Natalia; O Lagravère, Manuel

2017-01-01

Background To compare the reliability and accuracy of direct and indirect dental measurements derived from two types of 3D virtual models: generated by intraoral laser scanning (ILS) and segmented cone beam computed tomography (CBCT), comparing these with a 2D digital model. Material and Methods One hundred patients were selected. All patients’ records included initial plaster models, an intraoral scan and a CBCT. Patients´ dental arches were scanned with the iTero® intraoral scanner while the CBCTs were segmented to create three-dimensional models. To obtain 2D digital models, plaster models were scanned using a conventional 2D scanner. When digital models had been obtained using these three methods, direct dental measurements were measured and indirect measurements were calculated. Differences between methods were assessed by means of paired t-tests and regression models. Intra and inter-observer error were analyzed using Dahlberg´s d and coefficients of variation. Results Intraobserver and interobserver error for the ILS model was less than 0.44 mm while for segmented CBCT models, the error was less than 0.97 mm. ILS models provided statistically and clinically acceptable accuracy for all dental measurements, while CBCT models showed a tendency to underestimate measurements in the lower arch, although within the limits of clinical acceptability. Conclusions ILS and CBCT segmented models are both reliable and accurate for dental measurements. Integration of ILS with CBCT scans would get dental and skeletal information altogether. Key words:CBCT, intraoral laser scanner, 2D digital models, 3D models, dental measurements, reliability. PMID:29410764
Circumferential finger measurements utilizing a torque meter to increase reliability.

PubMed

King, T I

1993-01-01

The purpose of this study was to compare the reliabilities of two methods of measuring finger circumference. Traditionally, finger circumference is determined clinically by the use of a tape measure. In this study, a tape-measure device for recording finger circumference utilizing a torque meter was compared with the traditional method to determine reliability differences. Ninety-two occupational therapists and occupational therapy students obtained circumferential measurements of the author's left index finger at the middle of the proximal phalanx utilizing the two methods. The readings obtained for each method were analyzed to determine the coefficient of variation and to compare their variances. The coefficient of variation for the traditional method was 2.92 and for the device utilizing the torque meter was 0.75. The F ratio was 15.63, which is significant at the 0.01 level. The results of this study indicate greater interrater reliability using a device that can accurately measure torque and allow the therapist to control the amount of tension applied when obtaining circumferential measurements using a tape measure.
Portfolio assessment during medical internships: How to obtain a reliable and feasible assessment procedure?

PubMed

Michels, Nele R M; Driessen, Erik W; Muijtjens, Arno M M; Van Gaal, Luc F; Bossaert, Leo L; De Winter, Benedicte Y

2009-12-01

A portfolio is used to mentor and assess students' clinical performance at the workplace. However, students and raters often perceive the portfolio as a time-consuming instrument. In this study, we investigated whether assessment during medical internship by a portfolio can combine reliability and feasibility. The domain-oriented reliability of 61 double-rated portfolios was measured, using a generalisability analysis with portfolio tasks and raters as sources of variation in measuring the performance of a student. We obtained reliability (Phi coefficient) of 0.87 with this internship portfolio containing 15 double-rated tasks. The generalisability analysis showed that an acceptable level of reliability (Phi = 0.80) was maintained when the amount of portfolio tasks was decreased to 13 or 9 using one and two raters, respectively. Our study shows that a portfolio can be a reliable method for the assessment of workplace learning. The possibility of reducing the amount of tasks or raters while maintaining a sufficient level of reliability suggests an increase in feasibility of portfolio use for both students and raters.
A comparison of manual anthropometric measurements with Kinect-based scanned measurements in terms of precision and reliability.

PubMed

Bragança, Sara; Arezes, Pedro; Carvalho, Miguel; Ashdown, Susan P; Castellucci, Ignacio; Leão, Celina

2018-01-01

Collecting anthropometric data for real-life applications demands a high degree of precision and reliability. It is important to test new equipment that will be used for data collectionOBJECTIVE:Compare two anthropometric data gathering techniques - manual methods and a Kinect-based 3D body scanner - to understand which of them gives more precise and reliable results. The data was collected using a measuring tape and a Kinect-based 3D body scanner. It was evaluated in terms of precision by considering the regular and relative Technical Error of Measurement and in terms of reliability by using the Intraclass Correlation Coefficient, Reliability Coefficient, Standard Error of Measurement and Coefficient of Variation. The results obtained showed that both methods presented better results for reliability than for precision. Both methods showed relatively good results for these two variables, however, manual methods had better results for some body measurements. Despite being considered sufficiently precise and reliable for certain applications (e.g. apparel industry), the 3D scanner tested showed, for almost every anthropometric measurement, a different result than the manual technique. Many companies design their products based on data obtained from 3D scanners, hence, understanding the precision and reliability of the equipment used is essential to obtain feasible results.
Reliability of reflectance measures in passive filters

NASA Astrophysics Data System (ADS)

Saldiva de André, Carmen Diva; Afonso de André, Paulo; Rocha, Francisco Marcelo; Saldiva, Paulo Hilário Nascimento; Carvalho de Oliveira, Regiani; Singer, Julio M.

2014-08-01

Measurements of optical reflectance in passive filters impregnated with a reactive chemical solution may be transformed to ozone concentrations via a calibration curve and constitute a low cost alternative for environmental monitoring, mainly to estimate human exposure. Given the possibility of errors caused by exposure bias, it is common to consider sets of m filters exposed during a certain period to estimate the latent reflectance on n different sample occasions at a certain location. Mixed models with sample occasions as random effects are useful to analyze data obtained under such setups. The intra-class correlation coefficient of the mean of the m measurements is an indicator of the reliability of the latent reflectance estimates. Our objective is to determine m in order to obtain a pre-specified reliability of the estimates, taking possible outliers into account. To illustrate the procedure, we consider an experiment conducted at the Laboratory of Experimental Air Pollution, University of São Paulo, Brazil (LPAE/FMUSP), where sets of m = 3 filters were exposed during 7 days on n = 9 different occasions at a certain location. The results show that the reliability of the latent reflectance estimates for each occasion obtained under homoskedasticity is km = 0.74. A residual analysis suggests that the within-occasion variance for two of the occasions should be different from the others. A refined model with two within-occasion variance components was considered, yielding km = 0.56 for these occasions and km = 0.87 for the remaining ones. To guarantee that all estimates have a reliability of at least 80% we require measurements on m = 10 filters on each occasion.
The reliability and validity of a three-camera foot image system for obtaining foot anthropometrics.

PubMed

O'Meara, Damien; Vanwanseele, Benedicte; Hunt, Adrienne; Smith, Richard

2010-08-01

The purpose was to develop a foot image capture and measurement system with web cameras (the 3-FIS) to provide reliable and valid foot anthropometric measures with efficiency comparable to that of the conventional method of using a handheld anthropometer. Eleven foot measures were obtained from 10 subjects using both methods. Reliability of each method was determined over 3 consecutive days using the intraclass correlation coefficient and root mean square error (RMSE). Reliability was excellent for both the 3-FIS and the handheld anthropometer for the same 10 variables, and good for the fifth metatarsophalangeal joint height. The RMSE values over 3 days ranged from 0.9 to 2.2 mm for the handheld anthropometer, and from 0.8 to 3.6 mm for the 3-FIS. The RMSE values between the 3-FIS and the handheld anthropometer were between 2.3 and 7.4 mm. The 3-FIS required less time to collect and obtain the final variables than the handheld anthropometer. The 3-FIS provided accurate and reproducible results for each of the foot variables and in less time than the conventional approach of a handheld anthropometer.
Validity and reliability of a new ankle dorsiflexion measurement device.

PubMed

Gatt, Alfred; Chockalingam, Nachiappan

2013-08-01

The assessment of the maximum ankle dorsiflexion angle is an important clinical examination procedure. Evidence shows that the traditional goniometer is highly unreliable, and various designs of goniometers to measure the maximum ankle dorsiflexion angle rely on the application of a known force to obtain reliable results. Hence, an innovative ankle dorsiflexion measurement device was designed to make this measurement more reliable by holding the foot in a selected posture without the application of a known moment. To report on the comprehensive validity and reliability testing carried out on the new device. Following validity testing, four different trials to test reliability of the ankle dorsiflexion measurement device were performed. These trials included inter-rater and intra-rater testings with a controlled moment, intra-rater reliability testing with knees flexed and extended without a controlled moment, intra-rater testing with a patient population, and inter-rater reliability testing between four raters of varying experience without controlling moment. All raters were blinded. A series of trials to test intra-rater and inter-rater reliabilities. Intra-rater reliability intraclass correlation coefficient was 0.98 and inter-rater reliability intraclass correlation coefficient (2,1) was 0.953 with a controlled moment. With uncontrolled moment, very high reliability for intra-tester was also achieved (intraclass correlation coefficient = 0.94 with knees extended and intraclass correlation coefficient = 0.95 with knees flexed). For the trial investigating test-retest reliability with actual patients, intraclass correlation coefficient of 0.99 was obtained. In the trial investigating four different raters with uncontrolled moment, intraclass correlation coefficient of 0.91 was achieved. The new ankle dorsiflexion measurement device is a valid and reliable device for measuring ankle dorsiflexion in both healthy subjects and patients, with both controlled and
Validity and Reliability of Scores Obtained on Multiple-Choice Questions: Why Functioning Distractors Matter

ERIC Educational Resources Information Center

Ali, Syed Haris; Carr, Patrick A.; Ruit, Kenneth G.

2016-01-01

Plausible distractors are important for accurate measurement of knowledge via multiple-choice questions (MCQs). This study demonstrates the impact of higher distractor functioning on validity and reliability of scores obtained on MCQs. Freeresponse (FR) and MCQ versions of a neurohistology practice exam were given to four cohorts of Year 1 medical…
Nanoscale deformation measurements for reliability assessment of material interfaces

NASA Astrophysics Data System (ADS)

Keller, Jürgen; Gollhardt, Astrid; Vogel, Dietmar; Michel, Bernd

2006-03-01

With the development and application of micro/nano electronic mechanical systems (MEMS, NEMS) for a variety of market segments new reliability issues will arise. The understanding of material interfaces is the key for a successful design for reliability of MEMS/NEMS and sensor systems. Furthermore in the field of BIOMEMS newly developed advanced materials and well known engineering materials are combined despite of fully developed reliability concepts for such devices and components. In addition the increasing interface-to volume ratio in highly integrated systems and nanoparticle filled materials are challenges for experimental reliability evaluation. New strategies for reliability assessment on the submicron scale are essential to fulfil the needs of future devices. In this paper a nanoscale resolution experimental method for the measurement of thermo-mechanical deformation at material interfaces is introduced. The determination of displacement fields is based on scanning probe microscopy (SPM) data. In-situ SPM scans of the analyzed object (i.e. material interface) are carried out at different thermo-mechanical load states. The obtained images are compared by grayscale cross correlation algorithms. This allows the tracking of local image patterns of the analyzed surface structure. The measurement results are full-field displacement fields with nanometer resolution. With the obtained data the mixed mode type of loading at material interfaces can be analyzed with highest resolution for future needs in micro system and nanotechnology.

A Direct Method for Obtaining Approximate Standard Error and Confidence Interval of Maximal Reliability for Composites with Congeneric Measures

ERIC Educational Resources Information Center

Raykov, Tenko; Penev, Spiridon

2006-01-01

Unlike a substantial part of reliability literature in the past, this article is concerned with weighted combinations of a given set of congeneric measures with uncorrelated errors. The relationship between maximal coefficient alpha and maximal reliability for such composites is initially dealt with, and it is shown that the former is a lower…
Tactile Acuity Charts: A Reliable Measure of Spatial Acuity

PubMed Central

Bruns, Patrick; Camargo, Carlos J.; Campanella, Humberto; Esteve, Jaume; Dinse, Hubert R.; Röder, Brigitte

2014-01-01

For assessing tactile spatial resolution it has recently been recommended to use tactile acuity charts which follow the design principles of the Snellen letter charts for visual acuity and involve active touch. However, it is currently unknown whether acuity thresholds obtained with this newly developed psychophysical procedure are in accordance with established measures of tactile acuity that involve passive contact with fixed duration and control of contact force. Here we directly compared tactile acuity thresholds obtained with the acuity charts to traditional two-point and grating orientation thresholds in a group of young healthy adults. For this purpose, two types of charts, using either Braille-like dot patterns or embossed Landolt rings with different orientations, were adapted from previous studies. Measurements with the two types of charts were equivalent, but generally more reliable with the dot pattern chart. A comparison with the two-point and grating orientation task data showed that the test-retest reliability of the acuity chart measurements after one week was superior to that of the passive methods. Individual thresholds obtained with the acuity charts agreed reasonably with the grating orientation threshold, but less so with the two-point threshold that yielded relatively distinct acuity estimates compared to the other methods. This potentially considerable amount of mismatch between different measures of tactile acuity suggests that tactile spatial resolution is a complex entity that should ideally be measured with different methods in parallel. The simple test procedure and high reliability of the acuity charts makes them a promising complement and alternative to the traditional two-point and grating orientation thresholds. PMID:24504346
The role of test-retest reliability in measuring individual and group differences in executive functioning.

PubMed

Paap, Kenneth R; Sawi, Oliver

2016-12-01

Studies testing for individual or group differences in executive functioning can be compromised by unknown test-retest reliability. Test-retest reliabilities across an interval of about one week were obtained from performance in the antisaccade, flanker, Simon, and color-shape switching tasks. There is a general trade-off between the greater reliability of single mean RT measures, and the greater process purity of measures based on contrasts between mean RTs in two conditions. The individual differences in RT model recently developed by Miller and Ulrich was used to evaluate the trade-off. Test-retest reliability was statistically significant for 11 of the 12 measures, but was of moderate size, at best, for the difference scores. The test-retest reliabilities for the Simon and flanker interference scores were lower than those for switching costs. Standard practice evaluates the reliability of executive-functioning measures using split-half methods based on data obtained in a single day. Our test-retest measures of reliability are lower, especially for difference scores. These reliability measures must also take into account possible day effects that classical test theory assumes do not occur. Measures based on single mean RTs tend to have acceptable levels of reliability and convergent validity, but are "impure" measures of specific executive functions. The individual differences in RT model shows that the impurity problem is worse than typically assumed. However, the "purer" measures based on difference scores have low convergent validity that is partly caused by deficiencies in test-retest reliability. Copyright © 2016 Elsevier B.V. All rights reserved.
Reliable absolute analog code retrieval approach for 3D measurement

NASA Astrophysics Data System (ADS)

Yu, Shuang; Zhang, Jing; Yu, Xiaoyang; Sun, Xiaoming; Wu, Haibin; Chen, Deyun

2017-11-01

The wrapped phase of phase-shifting approach can be unwrapped by using Gray code, but both the wrapped phase error and Gray code decoding error can result in period jump error, which will lead to gross measurement error. Therefore, this paper presents a reliable absolute analog code retrieval approach. The combination of unequal-period Gray code and phase shifting patterns at high frequencies are used to obtain high-frequency absolute analog code, and at low frequencies, the same unequal-period combination patterns are used to obtain the low-frequency absolute analog code. Next, the difference between the two absolute analog codes was employed to eliminate period jump errors, and a reliable unwrapped result can be obtained. Error analysis was used to determine the applicable conditions, and this approach was verified through theoretical analysis. The proposed approach was further verified experimentally. Theoretical analysis and experimental results demonstrate that the proposed approach can perform reliable analog code unwrapping.
Reliability of Pain Measurements Using Computerized Cuff Algometry: A DoloCuff Reliability and Agreement Study.

PubMed

Kvistgaard Olsen, Jack; Fener, Dilay Kesgin; Waehrens, Eva Elisabet; Wulf Christensen, Anton; Jespersen, Anders; Danneskiold-Samsøe, Bente; Bartels, Else Marie

2017-07-01

Computerized pneumatic cuff pressure algometry (CPA) using the DoloCuff is a new method for pain assessment. Intra- and inter-rater reliabilities have not yet been established. Our aim was to examine the inter- and intrarater reliabilities of DoloCuff measures in healthy subjects. Twenty healthy subjects (ages 20 to 29 years) were assessed three times at 24-hour intervals by two trained raters. Inter-rater reliability was established based on the first and second assessments, whereas intrarater reliability was based on the second and third assessments. Subjects were randomized 1:1 to first assessment at either rater 1 or rater 2. The variables of interest were pressure pain threshold (PT), pressure pain tolerance (PTol), and temporal summation index (TSI). Reliability was estimated by a two-way mixed intraclass correlation coefficient (ICC) absolute agreement analysis. Reliability was considered excellent if ICC > 0.75, fair to good if 0.4 < ICC < 0.75, and poor if ICC < 0.4. Bias and random errors between raters and assessments were evaluated using 95% confidence interval (CI) and Bland-Altman plots. Inter-rater reliability for PT, PTol, and TSI was 0.88 (95% CI: 0.69 to 0.95), 0.86 (95% CI: 0.65 to 0.95), and 0.81 (95% CI: 0.42 to 0.94), respectively. The intrarater reliability for PT, PTol, and TSI was 0.81 (95% CI: 0.53 to 0.92), 0.89 (95% CI: 0.74 to 0.96), and 0.75 (95% CI: 0.28 to 0.91), respectively. Inter-rater reliability was excellent for PT, PTol, and TSI. Similarly, the intrarater reliability for PT and PTol was excellent, while borderline excellent/good for TSI. Therefore, the DoloCuff can be used to obtain reliable measures of pressure pain parameters in healthy subjects. © 2016 World Institute of Pain.
The Reliability of a Novel Mobile 3-dimensional Wound Measurement Device.

PubMed

Anghel, Ersilia L; Kumar, Anagha; Bigham, Thomas E; Maselli, Kathryn M; Steinberg, John S; Evans, Karen K; Kim, Paul J; Attinger, Christopher E

2016-11-01

Objective assessment of wound dimensions is essential for tracking progression and determining treatment effectiveness. A reliability study was designed to establish intrarater and interrater reliability of a novel mobile 3-dimensional wound measurement (3DWM) device. Forty-five wounds were assessed by 2 raters using a 3DWM device to obtain length, width, area, depth, and volume measurements. Wounds were also measured manually, using a disposable ruler and digital planimetry. The intraclass correlation coefficient (ICC) was used to establish intrarater and interrater reliability. High levels of intrarater and interrater agreement were observed for area, length, and width; ICC = 0.998, 0.977, 0.955 and 0.999, 0.997, 0.995, respectively. Moderate levels of intrarater (ICC = 0.888) and interrater (ICC = 0.696) agreement were observed for volume. Lastly, depth yielded an intrarater ICC of 0.360 and an interrater ICC of 0.649. Measures from the 3DWM device were highly correlated with those obtained from scaled photography for length, width, and area (ρ = 0.997, 0.988, 0.997, P < 0.001). The 3DWM device yielded correlations of ρ = 0.990, 0.987, 0.996 with P < 0.001 for length, width, and area when compared to manual measurements. The 3DWM device was found to be highly reliable for measuring wound areas for a range of wound sizes and types as compared to manual measurement and digital planimetry. The depth and therefore volume measurement using the 3DWM device was found to have a lower ICC, but volume ICC alone was moderate. Overall, this device offers a mobile option for objective wound measurement in the clinical setting.
Reliability of rehabilitative ultrasonographic imaging for muscle thickness measurement of the rhomboid major.

PubMed

Jeong, Ju Ri; Ko, Young Jun; Ha, Hyun Geun; Lee, Wan Hee

2016-03-01

This study was to establish inter-rater and intrarater reliability of the rehabilitative ultrasonographic imaging (RUSI) technique for muscle thickness measurement of the rhomboid major at rest and with the shoulder abducted to 90°. Twenty-four young adults (eight men, 16 women; right-handed; mean age [±SD], 24·4 years [±2·6]) with no history of neck, shoulder, or arm pain were recruited. Rhomboid major muscle images were obtained in the resting position and with shoulder in 90° abduction using an ultrasonography system with a 7·5-MHz linear transducer. In these two positions, the examiners found the site at which the transducer could be placed. Two examiners obtained the images of all participants in three test sessions at random. Intraclass correlation coefficients (ICC) were used to estimate reliability. All ICCs (95% CI) were >0·75, ranging from 0·93 to 0·98, which indicates good reliability. The ICCs for inter-rater reliability ranged from 0·75 to 0·94. For the absolute value of the difference in the intra-examiner reliability between the right and left ratios, the ICCs ranged from 0·58 to 0·91. In this study, the intra- and interexaminer reliability of muscle thickness measurements of the rhomboid major were good. Therefore, we suggest that muscle thickness measurements of the rhomboid major obtained with the RUSI technique would be useful for clinical rehabilitative assessment. © 2014 Scandinavian Society of Clinical Physiology and Nuclear Medicine. Published by John Wiley & Sons Ltd.
Reliability of the nursing care hour measure: a descriptive study.

PubMed

Klaus, Susan F; Dunton, Nancy; Gajewski, Byron; Potter, Catima

2013-07-01

The nursing care hour has become an international standard unit of measure in research where nurse staffing is a key variable. Until now, there have been no studies verifying whether nursing care hours obtained from hospital data sources can be collected reliably. To examine the processes used by hospitals to generate nursing care hour data and to evaluate inter-rater reliability and guideline compliance with standards of the National Database of Nursing Quality Indicators(®) (NDNQI(®)) and the National Quality Forum. Two-phase descriptive study of all NDNQI hospitals that submitted data in third quarter of 2007. Data for phase I came from an online survey created by the authors to ascertain the processes used by hospitals to collect nursing care hours and their compliance with standardized data collection guidelines. In phase II, inter-rater reliability was measured using intra-class correlations between nursing care hours generated from clock hour files submitted to the study team by participants' payroll/accounting departments and aggregated data submitted previously. Phase I data were obtained from a total of 714 respondents. Nearly half (48%) of all sites use payroll records to obtain nursing care hour data and 70% use one of the standardized methods for converting the bi-weekly hours into months. Unit secretaries were reportedly included in NCH by 17.4% of respondents and only 26.2% of sites could accurately identify the point at which newly hired nurses should be included. The phase II findings (n=11) support the ability of two independent raters to obtain similar results when calculating total nursing care hours according to standard guidelines (ICC=0.76-0.99). Although barriers exist, this study found support for hospitals' abilities to collect reliable nursing care hour data. Copyright © 2012 Elsevier Ltd. All rights reserved.
The Reliability and Validity of the Computerized Double Inclinometer in Measuring Lumbar Mobility

PubMed Central

MacDermid, Joy Christine; Arumugam, Vanitha; Vincent, Joshua Israel; Carroll, Krista L

2014-01-01

Study Design : Repeated measures reliability/validity study. Objectives : To determine the concurrent validity, test-retest, inter-rater and intra-rater reliability of lumbar flexion and extension measurements using the Tracker M.E. computerized dual inclinometer (CDI) in comparison to the modified-modified Schober (MMS) Summary of Background : Numerous studies have evaluated the reliability and validity of the various methods of measuring spinal motion, but the results are inconsistent. Differences in equipment and techniques make it difficult to correlate results. Methods : Twenty subjects with back pain and twenty without back pain were selected through convenience sampling. Two examiners measured sagittal plane lumbar range of motion for each subject. Two separate tests with the CDI and one test with the MMS were conducted. Each test consisted of three trials. Instrument and examiner order was randomly assigned. Intra-class correlations (ICCs 2, 2 and 2, 2) and Pearson correlation coefficients (r) were used to calculate reliability and concurrent validity respectively. Results : Intra-trial reliability was high to very high for both the CDI (ICCs 0.85 - 0.96) and MMS (ICCs 0.84 - 0.98). However, the reliability was poor to moderate, when the CDI unit had to be repositioned either by the same rate (ICCs 0.16 - 0.59) or a different rater (ICCs 0.45 - 0.52). Inter-rater reliability for the MMS was moderate to high (ICCs 0.75 - 0.82) which bettered the moderate correlation obtained for the CDI (ICCs 0.45 - 0.52). Correlations between the CDI and MMS were poor for flexion (0.32; p<0.05) and poor to moderate (-0.42 - -0.51; p<0.05) for extension measurements. Conclusion : When using the CDI, an average of subsequent tests is required to obtain moderate reliability. The MMS was highly reliable than the CDI. The MMS and the CDI measure lumbar movement on a different metric that are not highly related to each other. PMID:25352928
Reliability of abstracting performance measures: results of the cardiac rehabilitation referral and reliability (CR3) project.

PubMed

Thomas, Randal J; Chiu, Jensen S; Goff, David C; King, Marjorie; Lahr, Brian; Lichtman, Steven W; Lui, Karen; Pack, Quinn R; Shahriary, Melanie

2014-01-01

Assessment of the reliability of performance measure (PM) abstraction is an important step in PM validation. Reliability has not been previously assessed for abstracting PMs for the referral of patients to cardiac rehabilitation (CR) and secondary prevention (SP) programs. To help validate these PMs, we carried out a multicenter assessment of their reliability. Hospitals and clinical practices from around the United States were invited to participate in the Cardiac Rehabilitation Referral Reliability (CR3) Project. Twenty-nine hospitals and 23 outpatient centers expressed interest in participating. Seven hospitals and 6 outpatient centers met participation criteria and submitted completed data. Site coordinators identified 35 patients whose charts were reviewed by 2 site abstractors twice, 1 week apart. Percent agreement and the Cohen κ statistic were used to describe intra- and interabstractor reliability for patient eligibility for CR/SP, patient exceptions for CR/SP referral, and documented referral to CR/SP. Results were obtained from within-site data, as well as from pooled data of all inpatient and all outpatient sites. We found that intra-abstractor reliability reflected excellent repeatability (≥ 90% agreement; κ ≥ 0.75) for ratings of CR/SP eligibility, exceptions, and referral, both from pooled and site-specific analyses of inpatient and outpatient data. Similarly, the interabstractor agreement from pooled analysis ranged from good to excellent for the 3 items, although with slightly lower measures of reliability. Abstraction of PMs for CR/SP referral has high reliability, supporting the use of these PMs in quality improvement initiatives aimed at increasing CR/SP delivery to patients with cardiovascular disease.
Modeling reliability measurement of interface on information system: Towards the forensic of rules

NASA Astrophysics Data System (ADS)

Nasution, M. K. M.; Sitompul, Darwin; Harahap, Marwan

2018-02-01

Today almost all machines depend on the software. As a software and hardware system depends also on the rules that are the procedures for its use. If the procedure or program can be reliably characterized by involving the concept of graph, logic, and probability, then regulatory strength can also be measured accordingly. Therefore, this paper initiates an enumeration model to measure the reliability of interfaces based on the case of information systems supported by the rules of use by the relevant agencies. An enumeration model is obtained based on software reliability calculation.
The Reliability of Criterion-Referenced Measures.

ERIC Educational Resources Information Center

Livingston, Samuel A.

The assumptions of the classical test-theory model are used to develop a theory of reliability for criterion-referenced measures which parallels that for norm-referenced measures. It is shown that the Spearman-Brown formula holds for criterion-referenced measures and that the criterion-referenced reliability coefficient can be used to correct…
Reliability and validity in measurement of true humeral retroversion by a three-dimensional cylinder fitting method.

PubMed

Saka, Masayuki; Yamauchi, Hiroki; Hoshi, Kenji; Yoshioka, Toru; Hamada, Hidetoshi; Gamada, Kazuyoshi

2015-05-01

Humeral retroversion is defined as the orientation of the humeral head relative to the distal humerus. Because none of the previous methods used to measure humeral retroversion strictly follow this definition, values obtained by these techniques vary and may be biased by morphologic variations of the humerus. The purpose of this study was 2-fold: to validate a method to define the axis of the distal humerus with a virtual cylinder and to establish the reliability of 3-dimensional (3D) measurement of humeral retroversion by this cylinder fitting method. Humeral retroversion in 14 baseball players (28 humeri) was measured by the 3D cylinder fitting method. The root mean square error was calculated to compare values obtained by a single tester and by 2 different testers using the embedded coordinate system. To establish the reliability, intraclass correlation coefficient (ICC) and precision (standard error of measurement [SEM]) were calculated. The root mean square errors for the humeral coordinate system were <1.0 mm/1.0° for comparison of all translations/rotations obtained by a single tester and <1.0 mm/2.0° for comparison obtained by 2 different testers. Assessment of reliability and precision of the 3D measurement of retroversion yielded an intratester ICC of 0.99 (SEM, 1.0°) and intertester ICC of 0.96 (SEM, 2.8°). The error in measurements obtained by a distal humerus cylinder fitting method was small enough not to affect retroversion measurement. The 3D measurement of retroversion by this method provides excellent intratester and intertester reliability. Copyright © 2015 Journal of Shoulder and Elbow Surgery Board of Trustees. Published by Elsevier Inc. All rights reserved.
Real-time reliability measure-driven multi-hypothesis tracking using 2D and 3D features

NASA Astrophysics Data System (ADS)

Zúñiga, Marcos D.; Brémond, François; Thonnat, Monique

2011-12-01

We propose a new multi-target tracking approach, which is able to reliably track multiple objects even with poor segmentation results due to noisy environments. The approach takes advantage of a new dual object model combining 2D and 3D features through reliability measures. In order to obtain these 3D features, a new classifier associates an object class label to each moving region (e.g. person, vehicle), a parallelepiped model and visual reliability measures of its attributes. These reliability measures allow to properly weight the contribution of noisy, erroneous or false data in order to better maintain the integrity of the object dynamics model. Then, a new multi-target tracking algorithm uses these object descriptions to generate tracking hypotheses about the objects moving in the scene. This tracking approach is able to manage many-to-many visual target correspondences. For achieving this characteristic, the algorithm takes advantage of 3D models for merging dissociated visual evidence (moving regions) potentially corresponding to the same real object, according to previously obtained information. The tracking approach has been validated using video surveillance benchmarks publicly accessible. The obtained performance is real time and the results are competitive compared with other tracking algorithms, with minimal (or null) reconfiguration effort between different videos.
Reliability analysis for digital adolescent idiopathic scoliosis measurements.

PubMed

Kuklo, Timothy R; Potter, Benjamin K; O'Brien, Michael F; Schroeder, Teresa M; Lenke, Lawrence G; Polly, David W

2005-04-01

Analysis of adolescent idiopathic scoliosis (AIS) requires a thorough clinical and radiographic evaluation to completely assess the three-dimensional deformity. Recently, these radiographic parameters have been analyzed for reliability and reproducibility following manual measurements; however, most of these parameters have not been analyzed with regard to digital measurements. The purpose of this study is to determine the intra- and interobserver reliability of common scoliosis radiographic parameters using a digital software measurement program. Thirty sets of preoperative (posteroanterior [PA], lateral, and side-bending [SB]) and postoperative (PA and lateral) radiographs were analyzed by three independent observers on two separate occasions using a software measurement program (PhDx, Albuquerque, NM). Coronal measures included main thoracic (MT) and thoracolumbar-lumbar (TL/L) Cobb, SB MT Cobb, MT and TL/L apical vertical translation (AVT), C7 to center sacral vertical line (CSVL), T1 tilt, LIV tilt, disk below lowest instrumented vertebra (LIV), coronal balance, and Risser, whereas sagittal measures included T2-T5, T5-T12, T2-T12, T10-L2, T12-S1, and sagittal balance. Analysis of variance for repeated measures or Cohen three-way kappa correlation coefficient analysis was performed as appropriate to calculate the intra- and interobserver reliability for each parameter. The majority of the radiographic parameters assessed demonstrated good or excellent intra- and interobserver reliability. The relationship of the LIV to the CSVL (intraobserver kappaa = 0.48-0.78, fair to excellent; interobserver kappaa = 0.34-0.41, fair to poor), interobserver measurement of AVT (rho = 0.49-0.73, low to good), Risser grade (intraobserver rho = 0.41-0.97, low to excellent; interobserver rho = 0.60-0.70, fair to good), intraobserver measurement of the angulation of the disk inferior to the LIV (rho = 0.53-0.88, fair to good), apical Nash-Moe vertebral rotation (intraobserver rho
Measuring deception: test-retest reliability of physicians' self-reported manipulation of reimbursement rules for patients.

PubMed

VanGeest, Jonathan B; Wynia, Matthew K; Cummins, Deborah S; Wilson, Ira B

2002-06-01

This study examined the test-retest reliability of physicians' self-reported manipulation of reimbursement rules for patients. The test-retest reliability of self-report of three specific tactics were examined: (1) exaggerating the severity of patients' conditions, (2) changing a patient's official (billing) diagnosis, and (3) reporting signs or symptoms that patients did not have. The reliability of a scaled summary measure of physicians' manipulation of reimbursement rules was also assessed. Overall, the authors found high levels of test-retest agreement across all three items and the summary measure. These findings suggest that self-report can be used to produce reliable data on this controversial issue. Specifically, the three items reported here can be used to produce a reliable summary measure of physicians' manipulation of reimbursement rules to help patients obtain care that physicians perceive as necessary.
Characterization of perovskite solar cells: Towards a reliable measurement protocol

NASA Astrophysics Data System (ADS)

Zimmermann, Eugen; Wong, Ka Kan; Müller, Michael; Hu, Hao; Ehrenreich, Philipp; Kohlstädt, Markus; Würfel, Uli; Mastroianni, Simone; Mathiazhagan, Gayathri; Hinsch, Andreas; Gujar, Tanaji P.; Thelakkat, Mukundan; Pfadler, Thomas; Schmidt-Mende, Lukas

2016-09-01

Lead halide perovskite solar cells have shown a tremendous rise in power conversion efficiency with reported record efficiencies of over 20% making this material very promising as a low cost alternative to conventional inorganic solar cells. However, due to a differently severe "hysteretic" behaviour during current density-voltage measurements, which strongly depends on scan rate, device and measurement history, preparation method, device architecture, etc., commonly used solar cell measurements do not give reliable or even reproducible results. For the aspect of commercialization and the possibility to compare results of different devices among different laboratories, it is necessary to establish a measurement protocol which gives reproducible results. Therefore, we compare device characteristics derived from standard current density-voltage measurements with stabilized values obtained from an adaptive tracking of the maximum power point and the open circuit voltage as well as characteristics extracted from time resolved current density-voltage measurements. Our results provide insight into the challenges of a correct determination of device performance and propose a measurement protocol for a reliable characterisation which is easy to implement and has been tested on varying perovskite solar cells fabricated in different laboratories.
Reliability of laboratory measurement of human food intake.

PubMed

Laessle, R; Geiermann, L

2012-02-01

The universal eating monitor (UEM) of Kissileff for laboratory measurement of food intake was modified and used with a newly developed special software to compute cumulative intake data. To explore the measurement precision of the UEM an investigation of test-retest-reliability of food intake parameters was conducted. The intake characteristics of 125 males and females were measured repeatedly in the laboratory with a measurement interval of 1 week. Pudding of preferred flavour served as test meal. Test-retest-reliability of intake characteristics ranged from .49 (change of eating rate) to .89 (initial eating rate). All test-retest correlations were highly significant. Sex, BMI and eating habits according to TFEQ-factors had no significant effects on reliability of intake characteristics. The test-retest-reliability of the laboratory intake measures is as good as those of personality questionnaires, where it should be better than .80. Reliability coefficients are valid independent of sex, BMI or trait characteristics of eating behaviour. Copyright © 2011 Elsevier Ltd. All rights reserved.
Reliability and Validity of Nonradiologic Measures of Forward Flexed Posture in Parkinson Disease.

PubMed

Nair, Prajakta; Bohannon, Richard W; Devaney, Laurie; Maloney, Catherine; Romano, Alexis

2017-03-01

To examine the intertester reliability and validity of 5 nonradiologic measures of forward flexed posture in individuals with Parkinson disease (PD). Cross-sectional observational study. University outpatient facility and community centers. Individuals (N=28) with PD with Hoehn and Yahr scores of 1 through 4. Not applicable. Occiput to wall status, tragus to wall distance, C7 to wall distance, photographically derived trunk flexion angle, and inclinometric kyphosis measure. Participants were older adults (mean, 69.7±10.6y) with a 14-month to 15-year (mean, 5.9±3.5y) history of PD. Intertester reliability was excellent for all measures (κ=.89 [cued condition] and 1.0 [relaxed condition] for occiput to wall status; intraclass correlation coefficients, .779-.897 for tragus to wall distance, C7 to wall distance, flexion angle, and inclinometric kyphosis measure). Convergent validity was supported for all measures by significant correlations between the same measures obtained during relaxed and cued conditions (eg, occiput to wall relaxed and cued) and for most measures by significant correlations between measures obtained under the same condition (eg, occiput to wall cued and tragus to wall cued). Significant correlations between tragus to wall distance, C7 to wall distance, flexion angle, and inclinometric kyphosis measure and the Unified Parkinson Disease Rating Scale item 28 (posture) also supported convergent validity. Significant differences between tragus to wall distance, C7 to wall distance, and inclinometric kyphosis measure values under relaxed and cued conditions supported known condition validity. Known group validity was demonstrated by significant differences in tragus to wall distance, C7 to wall distance, and inclinometric kyphosis measure obtained from individuals able and individuals unable to touch their occiput to wall when cued to stand tall. Tragus to wall distance, C7 to wall distance, and inclinometric kyphosis measure are reliable and valid
MEASUREMENT: ACCOUNTING FOR RELIABILITY IN PERFORMANCE ESTIMATES.

PubMed

Waterman, Brian; Sutter, Robert; Burroughs, Thomas; Dunagan, W Claiborne

2014-01-01

When evaluating physician performance measures, physician leaders are faced with the quandary of determining whether departures from expected physician performance measurements represent a true signal or random error. This uncertainty impedes the physician leader's ability and confidence to take appropriate performance improvement actions based on physician performance measurements. Incorporating reliability adjustment into physician performance measurement is a valuable way of reducing the impact of random error in the measurements, such as those caused by small sample sizes. Consequently, the physician executive has more confidence that the results represent true performance and is positioned to make better physician performance improvement decisions. Applying reliability adjustment to physician-level performance data is relatively new. As others have noted previously, it's important to keep in mind that reliability adjustment adds significant complexity to the production, interpretation and utilization of results. Furthermore, the methods explored in this case study only scratch the surface of the range of available Bayesian methods that can be used for reliability adjustment; further study is needed to test and compare these methods in practice and to examine important extensions for handling specialty-specific concerns (e.g., average case volumes, which have been shown to be important in cardiac surgery outcomes). Moreover, it's important to note that the provider group average as a basis for shrinkage is one of several possible choices that could be employed in practice and deserves further exploration in future research. With these caveats, our results demonstrate that incorporating reliability adjustment into physician performance measurements is feasible and can notably reduce the incidence of "real" signals relative to what one would expect to see using more traditional approaches. A physician leader who is interested in catalyzing performance improvement

Reliability of measures of transient evoked otoacoustic emissions with contralateral suppression.

PubMed

Stuart, Andrew; Cobb, Kensi M

2015-01-01

The reliability of measures of transient evoked otoacoustic emissions (TEOAEs) with contralateral suppression was examined. The effect of test session (i.e., initial test; retest without probe removal; retest with probe removal; and retest 1-2 days post initial test), gender, and ear was examined in 14 young adult females and 14 young adult males. TEOAEs were obtained bilaterally with 60 dB peSPL linear click stimuli with and without a contralateral 65 dB SPL broadband noise suppressor. Absolute TEOAE suppression and a normalized index of TEOAE suppression (i.e., percentage of suppression) were examined. Reliability of these measures was assessed with repeated measures linear mixed model analysis of variance, a coefficient of reliability, and Bland-Altman analyses. There were no statistically significant (p>0.05) main effects of test, gender, and ear or interactions for both absolute dB and % TEOAE suppression values. Cronbach's α were greater than 0.90 across the four tests for both TEOAE measures. Mean test differences or bias (i.e., between the initial and subsequent tests) for absolute and % TEOAE suppression ranged from -0.05 to 0.11 dB and -1.5% to 1.1%, respectively. There was no proportional/systematic bias with the mean differences of the first and subsequent measurements. Data herein were consistent with the view that bilateral TEOAE suppression measures are reliable across test sessions of 1-2 days among females and males and may provide a method to monitor medial olivocochlear efferent reflex status over time. Copyright © 2015 Elsevier Inc. All rights reserved.
Validity and reliability of head posture measurement using Microsoft Kinect.

PubMed

Oh, Baek-Lok; Kim, Jongmin; Kim, Jongshin; Hwang, Jeong-Min; Lee, Jehee

2014-11-01

To investigate the validity and reliability of Microsoft Kinect-based head tracker (KHT) for measuring head posture. Considering the cervical range of motion (CROM) as a reference, one-dimensional and three-dimensional (1D and 3D) head postures of 12 normal subjects (28-58 years of age; 6 women and 6 men) were obtained using the KHT. The KHT was validated by Pearson's correlation coefficient and intraclass correlation (ICC) coefficient. Test-retest reliability of the KHT was determined by its 95% limit of agreement (LoA) with the Bland-Altman plot. Face recognition success rate was evaluated for each head posture. Measurements of 1D and 3D head posture performed using the KHT were very close to those of the CROM with correlation coefficients of 0.99 and 0.97 (p<0.05), respectively, as well as with an ICC of >0.99 and 0.98, respectively. The reliability tests of the KHT in terms of 1D and 3D head postures had 95% LoA angles of approximately ±2.5° and ±6.5°, respectively. The KHT showed good agreement with the CROM and relatively favourable test-retest reliability. Considering its high performance, convenience and low cost, KHT could be clinically used as a head posture-measuring system. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Test-retest reliability of posture measurements in adolescents with idiopathic scoliosis.

PubMed

Heitz, Pierre-Henri; Aubin-Fournier, Jean-François; Parent, Éric; Fortin, Carole

2018-05-07

Posture changes are a major consequence of IS (IS). Posture changes can lead to psychosocial and physical impairments in adolescents with IS. Therefore, it is important to assess posture but the test-retest reliability of posture measurements still remains unknown in this population. The primary objective was to determine the test-retest reliability of 25 head and trunk posture indices using the Clinical Photographic Postural Assessment Tool (CPPAT) in adolescents with IS. The secondary objective was to determine the standard error of measurement and the minimal detectable change. This is a prospective test-retest reliability study carried out at two tertiary university hospital centers. Forty-one adolescents with IS, aged 10 to 16 years old with curves 10 to 45 o and treated non-operatively were recruited. Two posture assessments were done using the CPPAT five to 10 days apart following a standardized procedure. Photographs were analyzed with the CPPAT software by digitizing reference landmarks placed on the participant by a physiotherapist evaluator. Generalizability theory was used to obtain a coefficient of dependability, standard error of measurement and the minimal detectable change at the 90% confidence interval. This project was supported by the Canadian Pediatric Spine Society (CPSS: 10000$). There is no study-specific conflicts of interest-associated biases. Fourteen of 25 posture indices had a good reliability (ϕ ≥ 0.78), ten of 25 had moderate reliability (ϕ = 0.55 to 0.74) and one had poor reliability (ϕ = 0.45). The most reliable posture indices were waist angles asymmetry (ϕ = 0.93), right waist angle (ϕ = 0.91) and frontal trunk list (ϕ = 0.92). Right sagittal trunk list was the least reliable posture index (ϕ = 0.45). The MDC 90 values ranged from 2.6 to 10.3° for angular measurements and from 8.4 to 35.1 mm for linear measurements. This study demonstrates that most posture indices, especially the trunk posture indices, are reproducible
Interhemispheric Inhibition Measurement Reliability in Stroke: A Pilot Study

PubMed Central

Cassidy, Jessica M.; Chu, Haitao; Chen, Mo; Kimberley, Teresa J.; Carey, James R.

2016-01-01

Objective Reliable transcranial magnetic stimulation (TMS) measures for probing corticomotor excitability are important when assessing the physiological effects of non-invasive brain stimulation. The primary objective of this study was to examine test-retest reliability of an interhemispheric inhibition (IHI) index measurement in stroke. Materials and Methods Ten subjects with chronic stroke (≥ 6 months) completed two IHI testing sessions per week for three weeks (six testing sessions total). A single investigator measured IHI in the contra- to-ipsilesional primary motor cortex direction and in the opposite direction using bilateral paired-pulse TMS. Weekly sessions were separated by 24 hours with a 1-week washout period separating testing weeks. To determine if motor-evoked potential (MEP) quantification method affected measurement reliability, IHI indices computed from both MEP amplitude and area responses were found. Reliability was assessed with two-way, mixed intraclass correlation coefficients (ICC(3,k)). Standard error of measurement and minimal detectable difference statistics were also determined. Results With the exception of the initial testing week, IHI indices measured in the contra-to-ipsilesional hemisphere direction demonstrated moderate to excellent reliability (ICC = 0.725 – 0.913). Ipsi-to-contralesional IHI indices depicted poor or invalid reliability estimates throughout the three-week testing duration (ICC= −1.153 – 0.105). The overlap of ICC 95% confidence intervals suggested that IHI indices using MEP amplitude vs. area measures did not differ with respect to reliability. Conclusions IHI indices demonstrated varying magnitudes of reliability irrespective of MEP quantification method. Several strategies for improving IHI index measurement reliability are discussed. PMID:27333364
Reliability of quantitative EEG (qEEG) measures and LORETA current source density at 30 days.

PubMed

Cannon, Rex L; Baldwin, Debora R; Shaw, Tiffany L; Diloreto, Dominic J; Phillips, Sherman M; Scruggs, Annie M; Riehl, Timothy C

2012-06-14

There is a growing interest for using quantitative EEG and LORETA current source density in clinical and research settings. Importantly, if these indices are to be employed in clinical settings then the reliability of these measures is of great concern. Neuroguide (Applied Neurosciences) is sophisticated software developed for the analyses of power, and connectivity measures of the EEG as well as LORETA current source density. To date there are relatively few data evaluating topographical EEG reliability contrasts for all 19 channels and no studies have evaluated reliability for LORETA calculations. We obtained 4 min eyes-closed and eyes-opened EEG recordings at 30-day intervals. The EEG was analyzed in Neuroguide and FFT power, coherence and phase was computed for traditional frequency bands (delta, theta, alpha and beta) and LORETA current source density was calculated in 1 Hz increments and summed for total power in eight regions of interest (ROI). In order to obtain a robust measure of reliability we utilized a random effects model with an absolute agreement definition. The results show very good reproducibility for total absolute power and coherence. Phase shows lower reliability coefficients. LORETA current source density shows very good reliability with an average 0.81 for ECB and 0.82 for EOB. Similarly, the eight regions of interest show good to very good agreement across time. Implications for future directions and use of qEEG and LORETA in clinical populations are discussed. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Is One Trial Sufficient to Obtain Excellent Pressure Pain Threshold Reliability in the Low Back of Asymptomatic Individuals? A Test-Retest Study.

PubMed

Balaguier, Romain; Madeleine, Pascal; Vuillerme, Nicolas

2016-01-01

- and inter-session. Reliable measurements can be equally achieved when using the mean of two or three consecutive PPT measurements, as usually proposed in the literature, or with only the first one. Although reliability was almost perfect regardless of the conducted comparison between PPT assessments, our results suggest using two consecutive measurements to obtain higher short term absolute reliability.
Electronics reliability and measurement technology

NASA Technical Reports Server (NTRS)

Heyman, Joseph S. (Editor)

1987-01-01

A summary is presented of the Electronics Reliability and Measurement Technology Workshop. The meeting examined the U.S. electronics industry with particular focus on reliability and state-of-the-art technology. A general consensus of the approximately 75 attendees was that "the U.S. electronics industries are facing a crisis that may threaten their existence". The workshop had specific objectives to discuss mechanisms to improve areas such as reliability, yield, and performance while reducing failure rates, delivery times, and cost. The findings of the workshop addressed various aspects of the industry from wafers to parts to assemblies. Key problem areas that were singled out for attention are identified, and action items necessary to accomplish their resolution are recommended.
Goniometric reliability in a clinical setting. Shoulder measurements.

PubMed

Riddle, D L; Rothstein, J M; Lamb, R L

1987-05-01

The purpose of this study was to examine the intratester and intertester reliabilities for clinical goniometric measurements of shoulder passive range of motion (PROM) using two different sizes of universal goniometers. Patients were measured without controlling therapist goniometric placement technique or patient position during measurements. Repeated PROM measurements of shoulder flexion, extension, abduction, shoulder horizontal abduction, horizontal adduction, lateral (external) rotation, and medial (internal) rotation were taken of two groups of 50 subjects each. The intratester intraclass correlation coefficients (ICCs) for all motions ranged from .87 to .99. The ICCs for the intertester reliability of PROM measurements of horizontal abduction, horizontal adduction, extension, and medial rotation ranged from .26 to .55. The intertester ICCs for PROM measurements of flexion, abduction, and lateral rotation ranged from .84 to .90. Goniometric PROM measurements for the shoulder appear to be highly reliable when taken by the same physical therapist, regardless of the size of the goniometer used. The degree of intertester reliability for these measurements appears to be range-of-motion specific.
Reliability of real-time ultrasound measurement of transversus abdominis thickness in healthy trained subjects.

PubMed

Gnat, Rafael; Saulicz, Edward; Miądowicz, Barbara

2012-08-01

To investigate intra- and inter-rater reliability of the ultrasound measurement of transversus abdominis (TrA) thickness and thickness change (difference between thickness at rest and during contraction) in asymptomatic, trained subjects. To define the number of repeated measurements that provide acceptable level of reliability. To investigate variability of the measurements over time of 5 days and the reliability of duplicate analysis of images. A single-group repeated-measures design was used to assess reliability. Healthy volunteers (n = 10) were subjected to 1-week training in voluntary activation of TrA. Real-time ultrasound imaging and subsequent measurement of the TrA thickness at rest and during voluntary contraction were repeated on Monday, Wednesday and Friday of the next week. Using a single repeated measurement, intraclass correlation coefficients (ICCs) for TrA thickness were: 0.86-0.95 (intra-rater), 0.86-0.92 (inter-rater); and for TrA thickness change: 0.34-0.56 (intra-rater), 0.47-0.61 (inter-rater). Using the mean of three repeated measurements respective values were: 0.97, 0.96-0.98; and 0.81-0.84, 0.80-0.90. No significant differences were found between mean values of TrA thickness as well as thickness change obtained on three consecutive measurement days. Duplicate analysis of the images was highly reliable with ICCs of 0.89-0.99. Two repeated measurements for TrA thickness and at least three measurements for TrA thickness change are needed to achieve acceptable levels of intra- and inter-rater reliability. In healthy trained volunteers TrA thickness and thickness change are relatively stable parameters over a 5-day period. Duplicate analysis of the same images by two blinded observers is reliable.
Effect of image resolution manipulation in rearfoot angle measurements obtained with photogrammetry

PubMed Central

Sacco, I.C.N.; Picon, A.P.; Ribeiro, A.P.; Sartor, C.D.; Camargo-Junior, F.; Macedo, D.O.; Mori, E.T.T.; Monte, F.; Yamate, G.Y.; Neves, J.G.; Kondo, V.E.; Aliberti, S.

2012-01-01

The aim of this study was to investigate the influence of image resolution manipulation on the photogrammetric measurement of the rearfoot static angle. The study design was that of a reliability study. We evaluated 19 healthy young adults (11 females and 8 males). The photographs were taken at 1536 pixels in the greatest dimension, resized into four different resolutions (1200, 768, 600, 384 pixels) and analyzed by three equally trained examiners on a 96-pixels per inch (ppi) screen. An experienced physiotherapist marked the anatomic landmarks of rearfoot static angles on two occasions within a 1-week interval. Three different examiners had marked angles on digital pictures. The systematic error and the smallest detectable difference were calculated from the angle values between the image resolutions and times of evaluation. Different resolutions were compared by analysis of variance. Inter- and intra-examiner reliability was calculated by intra-class correlation coefficients (ICC). The rearfoot static angles obtained by the examiners in each resolution were not different (P > 0.05); however, the higher the image resolution the better the inter-examiner reliability. The intra-examiner reliability (within a 1-week interval) was considered to be unacceptable for all image resolutions (ICC range: 0.08-0.52). The whole body image of an adult with a minimum size of 768 pixels analyzed on a 96-ppi screen can provide very good inter-examiner reliability for photogrammetric measurements of rearfoot static angles (ICC range: 0.85-0.92), although the intra-examiner reliability within each resolution was not acceptable. Therefore, this method is not a proper tool for follow-up evaluations of patients within a therapeutic protocol. PMID:22911379
Effect of image resolution manipulation in rearfoot angle measurements obtained with photogrammetry.

PubMed

Sacco, I C N; Picon, A P; Ribeiro, A P; Sartor, C D; Camargo-Junior, F; Macedo, D O; Mori, E T T; Monte, F; Yamate, G Y; Neves, J G; Kondo, V E; Aliberti, S

2012-09-01

The aim of this study was to investigate the influence of image resolution manipulation on the photogrammetric measurement of the rearfoot static angle. The study design was that of a reliability study. We evaluated 19 healthy young adults (11 females and 8 males). The photographs were taken at 1536 pixels in the greatest dimension, resized into four different resolutions (1200, 768, 600, 384 pixels) and analyzed by three equally trained examiners on a 96-pixels per inch (ppi) screen. An experienced physiotherapist marked the anatomic landmarks of rearfoot static angles on two occasions within a 1-week interval. Three different examiners had marked angles on digital pictures. The systematic error and the smallest detectable difference were calculated from the angle values between the image resolutions and times of evaluation. Different resolutions were compared by analysis of variance. Inter- and intra-examiner reliability was calculated by intra-class correlation coefficients (ICC). The rearfoot static angles obtained by the examiners in each resolution were not different (P > 0.05); however, the higher the image resolution the better the inter-examiner reliability. The intra-examiner reliability (within a 1-week interval) was considered to be unacceptable for all image resolutions (ICC range: 0.08-0.52). The whole body image of an adult with a minimum size of 768 pixels analyzed on a 96-ppi screen can provide very good inter-examiner reliability for photogrammetric measurements of rearfoot static angles (ICC range: 0.85-0.92), although the intra-examiner reliability within each resolution was not acceptable. Therefore, this method is not a proper tool for follow-up evaluations of patients within a therapeutic protocol.
Ultrasound measures of supraspinatus tendon thickness and acromiohumeral distance in rotator cuff tendinopathy are reliable.

PubMed

McCreesh, Karen M; Anjum, Shakeel; Crotty, James M; Lewis, Jeremy S

2016-01-01

Rotator cuff (RC) tendinopathy has been widely ascribed to impingement of the supraspinatus tendon (SsT) in the subacromial space, measured as the acromiohumeral distance (AHD). Ultrasound (US) is suitable for measuring AHD and SsT thickness, but few reliability studies have been carried out in symptomatic populations, and interrater reliability is unconfirmed. This study aimed to examine the intrarater and interrater reliability of US measurements of AHD and SsT thickness in asymptomatic control subjects and patients with RC tendinopathy. Seventy participants were recruited and grouped as healthy controls (n = 25) and RC tendinopathy (n = 45). Repeated US measurements of AHD and SsT thickness were obtained by one rater in both groups and by two raters in the RC tendinopathy group. Intrarater and interrater reliability coefficients were excellent for both measurements (intraclass correlation > 0.92), but the intrarater reliability was superior. The minimal detectable change values in the symptomatic group were 0.7 mm for AHD and 0.6 mm for SsT thickness for a single experienced examiner; the values rose to 1.2 mm and 1.3 mm, respectively, for the pair of examiners. The results support the reliability of US for the measurement of AHD and SsT thickness in patients with symptomatic RC tendinopathy and provide minimal detectable change values for use in future research studies. © 2015 Wiley Periodicals, Inc.
Assessing reliability and validity measures in managed care studies.

PubMed

Montoya, Isaac D

2003-01-01

To review the reliability and validity literature and develop an understanding of these concepts as applied to managed care studies. Reliability is a test of how well an instrument measures the same input at varying times and under varying conditions. Validity is a test of how accurately an instrument measures what one believes is being measured. A review of reliability and validity instructional material was conducted. Studies of managed care practices and programs abound. However, many of these studies utilize measurement instruments that were developed for other purposes or for a population other than the one being sampled. In other cases, instruments have been developed without any testing of the instrument's performance. The lack of reliability and validity information may limit the value of these studies. This is particularly true when data are collected for one purpose and used for another. The usefulness of certain studies without reliability and validity measures is questionable, especially in cases where the literature contradicts itself
Reliability of smartphone-based gait measurements for quantification of physical activity/inactivity levels.

PubMed

Ebara, Takeshi; Azuma, Ryohei; Shoji, Naoto; Matsukawa, Tsuyoshi; Yamada, Yasuyuki; Akiyama, Tomohiro; Kurihara, Takahiro; Yamada, Shota

2017-11-25

Objective measurements using built-in smartphone sensors that can measure physical activity/inactivity in daily working life have the potential to provide a new approach to assessing workers' health effects. The aim of this study was to elucidate the characteristics and reliability of built-in step counting sensors on smartphones for development of an easy-to-use objective measurement tool that can be applied in ergonomics or epidemiological research. To evaluate the reliability of step counting sensors embedded in seven major smartphone models, the 6-minute walk test was conducted and the following analyses of sensor precision and accuracy were performed: 1) relationship between actual step count and step count detected by sensors, 2) reliability between smartphones of the same model, and 3) false detection rates when sitting during office work, while riding the subway, and driving. On five of the seven models, the inter-class correlations coefficient (ICC (3,1) ) showed high reliability with a range of 0.956-0.993. The other two models, however, had ranges of 0.443-0.504 and the relative error ratios of the sensor-detected step count to the actual step count were ±48.7%-49.4%. The level of agreement between the same models was ICC (3,1) : 0.992-0.998. The false detection rates differed between the sitting conditions. These results suggest the need for appropriate regulation of step counts measured by sensors, through means such as correction or calibration with a predictive model formula, in order to obtain the highly reliable measurement results that are sought in scientific investigation.
How to Measure the Onset of Babbling Reliably?

ERIC Educational Resources Information Center

Molemans, Inge; van den Berg, Renate; van Severen, Lieve; Gillis, Steven

2012-01-01

Various measures for identifying the onset of babbling have been proposed in the literature, but a formal definition of the exact procedure and a thorough validation of the sample size required for reliably establishing babbling onset is lacking. In this paper the reliability of five commonly used measures is assessed using a large longitudinal…
Measuring acuity of the approximate number system reliably and validly: the evaluation of an adaptive test procedure

PubMed Central

Lindskog, Marcus; Winman, Anders; Juslin, Peter; Poom, Leo

2013-01-01

Two studies investigated the reliability and predictive validity of commonly used measures and models of Approximate Number System acuity (ANS). Study 1 investigated reliability by both an empirical approach and a simulation of maximum obtainable reliability under ideal conditions. Results showed that common measures of the Weber fraction (w) are reliable only when using a substantial number of trials, even under ideal conditions. Study 2 compared different purported measures of ANS acuity as for convergent and predictive validity in a within-subjects design and evaluated an adaptive test using the ZEST algorithm. Results showed that the adaptive measure can reduce the number of trials needed to reach acceptable reliability. Only direct tests with non-symbolic numerosity discriminations of stimuli presented simultaneously were related to arithmetic fluency. This correlation remained when controlling for general cognitive ability and perceptual speed. Further, the purported indirect measure of ANS acuity in terms of the Numeric Distance Effect (NDE) was not reliable and showed no sign of predictive validity. The non-symbolic NDE for reaction time was significantly related to direct w estimates in a direction contrary to the expected. Easier stimuli were found to be more reliable, but only harder (7:8 ratio) stimuli contributed to predictive validity. PMID:23964256
Intrarater reliability of measuring the patella position by ultrasonography in weight-bearing condition.

PubMed

Chen, Chia Lin; Lo, Chu Ling; Huang, Kai Chu; Huang, Chen Fu

2017-10-01

[Purpose] The aim of this study was to determine the intrarater reliability of using ultrasonography as a measurement tool to assess the patella position in a weight-bearing condition. [Subjects and Methods] Ten healthy adults participated in this study. Ultrasonography was used to assess the patella position during step down with the loading knee in flexion (0° and 20°). The distance between the patella and lateral condyle was measured to represent the patella position on the condylar groove. Two measurements were obtained on the first day and the day after 1 week by the same investigator. [Results] Excellent intrarater reliability, ranging from 0.83 to 0.93, was shown in both conditions. Standard errors of the measurements were 0.5 mm in the straight knee and 0.7 mm in the knee flexion at 20°. Minimal differences in knee flexion at 0° and knee flexion at 20° were 1.5 mm and 1.9 mm, respectively. [Conclusion] Ultrasonography is a reliable assessment tool for evaluating the positional changes of the patella in weight-bearing activities, and it can be easily used by practitioners in the clinical setting.
The determination of measures of software reliability

NASA Technical Reports Server (NTRS)

Maxwell, F. D.; Corn, B. C.

1978-01-01

Measurement of software reliability was carried out during the development of data base software for a multi-sensor tracking system. The failure ratio and failure rate were found to be consistent measures. Trend lines could be established from these measurements that provide good visualization of the progress on the job as a whole as well as on individual modules. Over one-half of the observed failures were due to factors associated with the individual run submission rather than with the code proper. Possible application of these findings for line management, project managers, functional management, and regulatory agencies is discussed. Steps for simplifying the measurement process and for use of these data in predicting operational software reliability are outlined.
Defining ulnar variance in the adolescent wrist: measurement technique and interobserver reliability.

PubMed

Goldfarb, Charles A; Strauss, Nicole L; Wall, Lindley B; Calfee, Ryan P

2011-02-01

The measurement technique for ulnar variance in the adolescent population has not been well established. The purpose of this study was to assess the reliability of a standard ulnar variance assessment in the adolescent population. Four orthopedic surgeons measured 138 adolescent wrist radiographs for ulnar variance using a standard technique. There were 62 male and 76 female radiographs obtained in a standardized fashion for subjects aged 12 to 18 years. Skeletal age was used for analysis. We determined mean variance and assessed for differences related to age and gender. We also determined the interrater reliability. The mean variance was -0.7 mm for boys and -0.4 mm for girls; there was no significant difference between the 2 groups overall. When subdivided by age and gender, the younger group (≤ 15 y of age) was significantly less negative for girls (boys, -0.8 mm and girls, -0.3 mm, p < .05). There was no significant difference between boys and girls in the older group. The greatest difference between any 2 raters was 1 mm; exact agreement was obtained in 72 subjects. Correlations between raters were high (r(p) 0.87-0.97 in boys and 0.82-0.96 for girls). Interrater reliability was excellent (Cronbach's alpha, 0.97-0.98). Standard assessment techniques for ulnar variance are reliable in the adolescent population. Open growth plates did not interfere with this assessment. Young adolescent boys demonstrated a greater degree of negative ulnar variance compared with young adolescent girls. Copyright © 2011 American Society for Surgery of the Hand. Published by Elsevier Inc. All rights reserved.
Topics in Measurement: Reliability and Validity.

ERIC Educational Resources Information Center

Dick, Walter; Hagerty, Nancy

This text was developed on an autoinstructional basis to familiarize the reader with the various interpretations of reliability and validity, their measurement and evaluation, and factors influencing their measurement. The text enables those with prior knowledge of statistics to increase their understanding of variance and correlation. Review…

The Reliability of Anthropometric Measurements Used Preoperatively in Aesthetic Breast Surgery.

PubMed

Isaac, Kathryn V; Murphy, Blake D; Beber, Brett; Brown, Mitchell

2016-04-01

Patient outcomes in aesthetic breast surgery are highly dependent on breast measurements used in preoperative planning. The purpose of this study is to determine the reliability of anthropometric breast measurements. Four raters measured 28 women using 7 measurements: sternal notch to nipple distance (Sn-N), nipple to midline (N-M), nipple to inframammary-fold distance under maximal stretch (N-IMF), breast base width (BW), soft tissue pinch thickness of the upper pole (STPT:UP), STPT at the inframammary fold (STPT:IMF), and anterior pull skin stretch (APSS). Reliability was assessed using intra-class correlation coefficients (ICCs). Inter-rater reliability was excellent for Sn-N, N-M, and BW (ICC = 0.94, 0.90, and 0.76, respectively) and was good for N-IMF (ICC = 0.70). The STPT:UP, STPT:IMF, and APSS measurements were not reliable between raters (ICC < 0.2). Intra-rater reliability was excellent for Sn-N, N-M, and BW for all raters (all ICC > 0.75). The N-IMF intra-rater reliability was excellent in senior raters (ICC > 0.75) and good in junior raters (ICC > 0.6). The STPT:UP, STPT:IMF, and APSS measurements showed fair or poor reliability for most raters (ICC < 0.6). The Sn-N, N-M, and BW measurements are very reliable. Dynamic measurements including APSS, STPT:UP, and STUP:IMF are unreliable. N-IMF is the only reliable dynamic measurement, and its reliability improves with increasing clinical experience. The variable reliability of preoperative measurements must be considered in the planning of aesthetic breast surgery. 4 Diagnostic. © 2015 The American Society for Aesthetic Plastic Surgery, Inc. Reprints and permission: journals.permissions@oup.com.
Reliability of smartphone-based gait measurements for quantification of physical activity/inactivity levels

PubMed Central

Ebara, Takeshi; Azuma, Ryohei; Shoji, Naoto; Matsukawa, Tsuyoshi; Yamada, Yasuyuki; Akiyama, Tomohiro; Kurihara, Takahiro; Yamada, Shota

2017-01-01

Objectives: Objective measurements using built-in smartphone sensors that can measure physical activity/inactivity in daily working life have the potential to provide a new approach to assessing workers' health effects. The aim of this study was to elucidate the characteristics and reliability of built-in step counting sensors on smartphones for development of an easy-to-use objective measurement tool that can be applied in ergonomics or epidemiological research. Methods: To evaluate the reliability of step counting sensors embedded in seven major smartphone models, the 6-minute walk test was conducted and the following analyses of sensor precision and accuracy were performed: 1) relationship between actual step count and step count detected by sensors, 2) reliability between smartphones of the same model, and 3) false detection rates when sitting during office work, while riding the subway, and driving. Results: On five of the seven models, the inter-class correlations coefficient (ICC (3,1)) showed high reliability with a range of 0.956-0.993. The other two models, however, had ranges of 0.443-0.504 and the relative error ratios of the sensor-detected step count to the actual step count were ±48.7%-49.4%. The level of agreement between the same models was ICC (3,1): 0.992-0.998. The false detection rates differed between the sitting conditions. Conclusions: These results suggest the need for appropriate regulation of step counts measured by sensors, through means such as correction or calibration with a predictive model formula, in order to obtain the highly reliable measurement results that are sought in scientific investigation. PMID:28835575
Reliability of infrared thermometric measurements of skin temperature in the hand.

PubMed

Packham, Tara L; Fok, Diana; Frederiksen, Karen; Thabane, Lehana; Buckley, Norman

2012-01-01

Clinical measurement study. Skin temperature asymmetries (STAs) are used in the diagnosis of complex regional pain syndrome (CRPS), but little evidence exists for reliability of the equipment and methods. This study examined the reliability of an inexpensive infrared (IR) thermometer and measurement points in the hand for the study of STA. ST was measured three times at five points on both hands with an IR thermometer by two raters in 20 volunteers (12 normals and 8 CRPS). ST measurement results using IR thermometers support inter-rater reliability: intraclass correlation coefficient (ICC) estimate for single measures 0.80; all ST measurement points were also highly reliable (ICC single measures, 0.83-0.91). The equipment demonstrated excellent reliability, with little difference in the reliability of the five measurement sites. These preliminary findings support their use in future CRPS research. Not applicable. Copyright © 2012 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
Validity and reliability of rectus femoris ultrasound measurements: Comparison of curved-array and linear-array transducers.

PubMed

Hammond, Kendra; Mampilly, Jobby; Laghi, Franco A; Goyal, Amit; Collins, Eileen G; McBurney, Conor; Jubran, Amal; Tobin, Martin J

2014-01-01

Muscle-mass loss augers increased morbidity and mortality in critically ill patients. Muscle-mass loss can be assessed by wide linear-array ultrasound transducers connected to cumbersome, expensive console units. Whether cheaper, hand-carried units equipped with curved-array transducers can be used as alternatives is unknown. Accordingly, our primary aim was to investigate in 15 nondisabled subjects the validity of measurements of rectus femoris cross-sectional area by using a curved-array transducer against a linear-array transducer-the reference-standard technique. In these subjects, we also determined the reliability of measurements obtained by a novice operator versus measurements obtained by an experienced operator. Lastly, the relationship between quadriceps strength and rectus area recorded by two experienced operators with a curved-array transducer was assessed in 17 patients with chronic obstructive pulmonary disease (COPD). In nondisabled subjects, the rectus cross-sectional area measured with the curved-array transducer by the novice and experienced operators was valid (intraclass correlation coefficient [ICC]: 0.98, typical percentage error [%TE]: 3.7%) and reliable (ICC: 0.79, %TE: 9.7%). In the subjects with COPD, both reliability (ICC: 0.99) and repeatability (%TE: 7.6% and 9.8%) were high. Rectus area was related to quadriceps strength in COPD for both experienced operators (coefficient of determination: 0.67 and 0.70). In conclusion, measurements of rectus femoris cross-sectional area recorded with a curved-array transducer connected to a hand-carried unit are valid, reliable, and reproducible, leading us to contend that this technique is suitable for cross-sectional and longitudinal studies.
Reliable intraocular pressure measurement using automated radio-wave telemetry

PubMed Central

Paschalis, Eleftherios I; Cade, Fabiano; Melki, Samir; Pasquale, Louis R; Dohlman, Claes H; Ciolino, Joseph B

2014-01-01

Purpose To present an autonomous intraocular pressure (IOP) measurement technique using a wireless implantable transducer (WIT) and a motion sensor. Methods The WIT optical aid was implanted within the ciliary sulcus of a normotensive rabbit eye after extracapsular clear lens extraction. An autonomous wireless data system (AWDS) comprising of a WIT and an external antenna aided by a motion sensor provided continuous IOP readings. The sensitivity of the technique was determined by the ability to detect IOP changes resulting from the administration of latanoprost 0.005% or dorzolamide 2%, while the reliability was determined by the agreement between baseline and vehicle (saline) IOP. Results On average, 12 diurnal and 205 nocturnal IOP measurements were performed with latanoprost, and 26 diurnal and 205 nocturnal measurements with dorzolamide. No difference was found between mean baseline IOP (13.08±2.2 mmHg) and mean vehicle IOP (13.27±2.1 mmHg) (P=0.45), suggesting good measurement reliability. Both antiglaucoma medications caused significant IOP reduction compared to baseline; latanoprost reduced mean IOP by 10% (1.3±3.54 mmHg; P<0.001), and dorzolamide by 5% (0.62±2.22 mmHg; P<0.001). Use of latanoprost resulted in an overall twofold higher IOP reduction compared to dorzolamide (P<0.001). Repeatability was ±1.8 mmHg, assessed by the variability of consecutive IOP measurements performed in a short period of time (≤1 minute), during which the IOP is not expected to change. Conclusion IOP measurements in conscious rabbits obtained without the need for human interactions using the AWDS are feasible and provide reproducible results. PMID:24531415
Reliable intraocular pressure measurement using automated radio-wave telemetry.

PubMed

Paschalis, Eleftherios I; Cade, Fabiano; Melki, Samir; Pasquale, Louis R; Dohlman, Claes H; Ciolino, Joseph B

2014-01-01

To present an autonomous intraocular pressure (IOP) measurement technique using a wireless implantable transducer (WIT) and a motion sensor. The WIT optical aid was implanted within the ciliary sulcus of a normotensive rabbit eye after extracapsular clear lens extraction. An autonomous wireless data system (AWDS) comprising of a WIT and an external antenna aided by a motion sensor provided continuous IOP readings. The sensitivity of the technique was determined by the ability to detect IOP changes resulting from the administration of latanoprost 0.005% or dorzolamide 2%, while the reliability was determined by the agreement between baseline and vehicle (saline) IOP. On average, 12 diurnal and 205 nocturnal IOP measurements were performed with latanoprost, and 26 diurnal and 205 nocturnal measurements with dorzolamide. No difference was found between mean baseline IOP (13.08±2.2 mmHg) and mean vehicle IOP (13.27±2.1 mmHg) (P=0.45), suggesting good measurement reliability. Both antiglaucoma medications caused significant IOP reduction compared to baseline; latanoprost reduced mean IOP by 10% (1.3±3.54 mmHg; P<0.001), and dorzolamide by 5% (0.62±2.22 mmHg; P<0.001). Use of latanoprost resulted in an overall twofold higher IOP reduction compared to dorzolamide (P<0.001). Repeatability was ±1.8 mmHg, assessed by the variability of consecutive IOP measurements performed in a short period of time (≤1 minute), during which the IOP is not expected to change. IOP measurements in conscious rabbits obtained without the need for human interactions using the AWDS are feasible and provide reproducible results.
The Validity and Reliability of an iPhone App for Measuring Running Mechanics.

PubMed

Balsalobre-Fernández, Carlos; Agopyan, Hovannes; Morin, Jean-Benoit

2017-07-01

The purpose of this investigation was to analyze the validity of an iPhone application (Runmatic) for measuring running mechanics. To do this, 96 steps from 12 different runs at speeds ranging from 2.77-5.55 m·s -1 were recorded simultaneously with Runmatic, as well as with an opto-electronic device installed on a motorized treadmill to measure the contact and aerial time of each step. Additionally, several running mechanics variables were calculated using the contact and aerial times measured, and previously validated equations. Several statistics were computed to test the validity and reliability of Runmatic in comparison with the opto-electronic device for the measurement of contact time, aerial time, vertical oscillation, leg stiffness, maximum relative force, and step frequency. The running mechanics values obtained with both the app and the opto-electronic device showed a high degree of correlation (r = .94-.99, p < .001). Moreover, there was very close agreement between instruments as revealed by the ICC (2,1) (ICC = 0.965-0.991). Finally, both Runmatic and the opto-electronic device showed almost identical reliability levels when measuring each set of 8 steps for every run recorded. In conclusion, Runmatic has been proven to be a highly reliable tool for measuring the running mechanics studied in this work.
Measurement of fatigue: Comparison of the reliability and validity of single-item and short measures to a comprehensive measure.

PubMed

Kim, Hee-Ju; Abraham, Ivo

2017-01-01

Evidence is needed on the clinicometric properties of single-item or short measures as alternatives to comprehensive measures. We examined whether two single-item fatigue measures (i.e., Likert scale, numeric rating scale) or a short fatigue measure were comparable to a comprehensive measure in reliability (i.e., internal consistency and test-retest reliability) and validity (i.e., convergent, concurrent, and predictive validity) in Korean young adults. For this quantitative study, we selected the Functional Assessment of Chronic Illness Therapy-Fatigue for the comprehensive measure and the Profile of Mood States-Brief, Fatigue subscale for the short measure; and constructed two single-item measures. A total of 368 students from four nursing colleges in South Korea participated. We used Cronbach's alpha and item-total correlation for internal consistency reliability and intraclass correlation coefficient for test-retest reliability. We assessed Pearson's correlation with a comprehensive measure for convergent validity, with perceived stress level and sleep quality for concurrent validity and the receiver operating characteristic curve for predictive validity. The short measure was comparable to the comprehensive measure in internal consistency reliability (Cronbach's alpha=0.81 vs. 0.88); test-retest reliability (intraclass correlation coefficient=0.66 vs. 0.61); convergent validity (r with comprehensive measure=0.79); concurrent validity (r with perceived stress=0.55, r with sleep quality=0.39) and predictive validity (area under curve=0.88). Single-item measures were not comparable to the comprehensive measure. A short fatigue measure exhibited similar levels of reliability and validity to the comprehensive measure in Korean young adults. Copyright Â© 2016 Elsevier Ltd. All rights reserved.
Reliability and measurement error of sagittal spinal motion parameters in 220 patients with chronic low back pain using a three-dimensional measurement device.

PubMed

Mieritz, Rune M; Bronfort, Gert; Jakobsen, Markus D; Aagaard, Per; Hartvigsen, Jan

2014-09-01

A basic premise for any instrument measuring spinal motion is that reliable outcomes can be obtained on a relevant sample under standardized conditions. The purpose of this study was to assess the overall reliability and measurement error of regional spinal sagittal plane motion in patients with chronic low back pain (LBP), and then to evaluate the influence of body mass index, examiner, gender, stability of pain, and pain distribution on reliability and measurement error. This study comprises a test-retest design separated by 7 to 14 days. The patient cohort consisted of 220 individuals with chronic LBP. Kinematics of the lumbar spine were sampled during standardized spinal extension-flexion testing using a 6-df instrumented spatial linkage system. Test-retest reliability and measurement error were evaluated using interclass correlation coefficients (ICC(1,1)) and Bland-Altman limits of agreement (LOAs). The overall test-retest reliability (ICC(1,1)) for various motion parameters ranged from 0.51 to 0.70, and relatively wide LOAs were observed for all parameters. Reliability measures in patient subgroups (ICC(1,1)) ranged between 0.34 and 0.77. In general, greater (ICC(1,1)) coefficients and smaller LOAs were found in subgroups with patients examined by the same examiner, patients with a stable pain level, patients with a body mass index less than below 30 kg/m(2), patients who were men, and patients in the Quebec Task Force classifications Group 1. This study shows that sagittal plane kinematic data from patients with chronic LBP may be sufficiently reliable in measurements of groups of patients. However, because of the large LOAs, this test procedure appears unusable at the individual patient level. Furthermore, reliability and measurement error varies substantially among subgroups of patients. Copyright © 2014 Elsevier Inc. All rights reserved.
A Reliable Method to Measure Lip Height Using Photogrammetry in Unilateral Cleft Lip Patients.

PubMed

van der Zeeuw, Frederique; Murabit, Amera; Volcano, Johnny; Torensma, Bart; Patel, Brijesh; Hay, Norman; Thorburn, Guy; Morris, Paul; Sommerlad, Brian; Gnarra, Maria; van der Horst, Chantal; Kangesu, Loshan

2015-09-01

There is still no reliable tool to determine the outcome of the repaired unilateral cleft lip (UCL). The aim of this study was therefore to develop an accurate, reliable tool to measure vertical lip height from photographs. The authors measured the vertical height of the cutaneous and vermilion parts of the lip in 72 anterior-posterior view photographs of 17 patients with repairs to a UCL. Points on the lip's white roll and vermillion were marked on both the cleft and the noncleft sides on each image. Two new concepts were tested. First, photographs were standardized using the horizontal (medial to lateral) eye fissure width (EFW) for calibration. Second, the authors tested the interpupillary line (IPL) and the alar base line (ABL) for their reliability as horizontal lines of reference. Measurements were taken by 2 independent researchers, at 2 different time points each. Overall 2304 data points were obtained and analyzed. Results showed that the method was very effective in measuring the height of the lip on the cleft side with the noncleft side. When using the IPL, inter- and intra-rater reliability was 0.99 to 1.0, with the ABL it varied from 0.91 to 0.99 with one exception at 0.84. The IPL was easier to define because in some subjects the overhanging nasal tip obscured the alar base and gave more consistent measurements possibly because the reconstructed alar base was sometimes indistinct. However, measurements from the IPL can only give the percentage difference between the left and right sides of the lip, whereas those from the ABL can also give exact measurements. Patient examples were given that show how the measurements correlate with clinical assessment. The authors propose this method of photogrammetry with the innovative use of the IPL as a reliable horizontal plane and use of the EFW for calibration as a useful and reliable tool to assess the outcome of UCL repair.
Reliability and validity of the test of incremental respiratory endurance measures of inspiratory muscle performance in COPD.

PubMed

Formiga, Magno F; Roach, Kathryn E; Vital, Isabel; Urdaneta, Gisel; Balestrini, Kira; Calderon-Candelario, Rafael A; Campos, Michael A; Cahalin, Lawrence P

2018-01-01

The Test of Incremental Respiratory Endurance (TIRE) provides a comprehensive assessment of inspiratory muscle performance by measuring maximal inspiratory pressure (MIP) over time. The integration of MIP over inspiratory duration (ID) provides the sustained maximal inspiratory pressure (SMIP). Evidence on the reliability and validity of these measurements in COPD is not currently available. Therefore, we assessed the reliability, responsiveness and construct validity of the TIRE measures of inspiratory muscle performance in subjects with COPD. Test-retest reliability, known-groups and convergent validity assessments were implemented simultaneously in 81 male subjects with mild to very severe COPD. TIRE measures were obtained using the portable PrO2 device, following standard guidelines. All TIRE measures were found to be highly reliable, with SMIP demonstrating the strongest test-retest reliability with a nearly perfect intraclass correlation coefficient (ICC) of 0.99, while MIP and ID clustered closely together behind SMIP with ICC values of about 0.97. Our findings also demonstrated known-groups validity of all TIRE measures, with SMIP and ID yielding larger effect sizes when compared to MIP in distinguishing between subjects of different COPD status. Finally, our analyses confirmed convergent validity for both SMIP and ID, but not MIP. The TIRE measures of MIP, SMIP and ID have excellent test-retest reliability and demonstrated known-groups validity in subjects with COPD. SMIP and ID also demonstrated evidence of moderate convergent validity and appear to be more stable measures in this patient population than the traditional MIP.
Developing safety performance functions incorporating reliability-based risk measures.

PubMed

Ibrahim, Shewkar El-Bassiouni; Sayed, Tarek

2011-11-01

Current geometric design guides provide deterministic standards where the safety margin of the design output is generally unknown and there is little knowledge of the safety implications of deviating from these standards. Several studies have advocated probabilistic geometric design where reliability analysis can be used to account for the uncertainty in the design parameters and to provide a risk measure of the implication of deviation from design standards. However, there is currently no link between measures of design reliability and the quantification of safety using collision frequency. The analysis presented in this paper attempts to bridge this gap by incorporating a reliability-based quantitative risk measure such as the probability of non-compliance (P(nc)) in safety performance functions (SPFs). Establishing this link will allow admitting reliability-based design into traditional benefit-cost analysis and should lead to a wider application of the reliability technique in road design. The present application is concerned with the design of horizontal curves, where the limit state function is defined in terms of the available (supply) and stopping (demand) sight distances. A comprehensive collision and geometric design database of two-lane rural highways is used to investigate the effect of the probability of non-compliance on safety. The reliability analysis was carried out using the First Order Reliability Method (FORM). Two Negative Binomial (NB) SPFs were developed to compare models with and without the reliability-based risk measures. It was found that models incorporating the P(nc) provided a better fit to the data set than the traditional (without risk) NB SPFs for total, injury and fatality (I+F) and property damage only (PDO) collisions. Copyright © 2011 Elsevier Ltd. All rights reserved.
Validity and reliability of intraoral scanners compared to conventional gypsum models measurements: a systematic review.

PubMed

Aragón, Mônica L C; Pontes, Luana F; Bichara, Lívia M; Flores-Mir, Carlos; Normando, David

2016-08-01

The development of 3D technology and the trend of increasing the use of intraoral scanners in dental office routine lead to the need for comparisons with conventional techniques. To determine if intra- and inter-arch measurements from digital dental models acquired by an intraoral scanner are as reliable and valid as the similar measurements achieved from dental models obtained through conventional intraoral impressions. An unrestricted electronic search of seven databases until February 2015. Studies that focused on the accuracy and reliability of images obtained from intraoral scanners compared to images obtained from conventional impressions. After study selection the QUADAS risk of bias assessment tool for diagnostic studies was used to assess the risk of bias (RoB) among the included studies. Four articles were included in the qualitative synthesis. The scanners evaluated were OrthoProof, Lava, iOC intraoral, Lava COS, iTero and D250. These studies evaluated the reliability of tooth widths, Bolton ratio measurements, and image superimposition. Two studies were classified as having low RoB; one had moderate RoB and the remaining one had high RoB. Only one study evaluated the time required to complete clinical procedures and patient's opinion about the procedure. Patients reported feeling more comfortable with the conventional dental impression method. Associated costs were not considered in any of the included study. Inter- and intra-arch measurements from digital models produced from intraoral scans appeared to be reliable and accurate in comparison to those from conventional impressions. This assessment only applies to the intraoral scanners models considered in the finally included studies. Digital models produced by intraoral scan eliminate the need of impressions materials; however, currently, longer time is needed to take the digital images. PROSPERO (CRD42014009702). None. © The Author 2016. Published by Oxford University Press on behalf of the European
Comparison of Reliability Measures under Factor Analysis and Item Response Theory

ERIC Educational Resources Information Center

Cheng, Ying; Yuan, Ke-Hai; Liu, Cheng

2012-01-01

Reliability of test scores is one of the most pervasive psychometric concepts in measurement. Reliability coefficients based on a unifactor model for continuous indicators include maximal reliability rho and an unweighted sum score-based omega, among many others. With increasing popularity of item response theory, a parallel reliability measure pi…
Quantitative measurement of hypertrophic scar: interrater reliability and concurrent validity.

PubMed

Nedelec, Bernadette; Correa, José A; Rachelska, Grazyna; Armour, Alexis; LaSalle, Léo

2008-01-01

Research into the pathophysiology and treatment of hypertrophic scar (HSc) remains limited by the heterogeneity of scar and the imprecision with which its severity is measured. The objective of this study was to test the interrater reliability and concurrent validity of the Cutometer measurement of elasticity, the Mexameter measurement of erythema and pigmentation, and total thickness measure of the DermaScan C relative to the modified Vancouver Scar Scale (mVSS) in patient-matched normal skin, normal scar, and HSc. Three independent investigators evaluated 128 sites (severe HSc, moderate or mild HSc, donor site, and normal skin) on 32 burn survivors using all of the above measurement tools. The intraclass correlation coefficient, which was used to measure interrater reliability, reflects the inherent amount of error in the measure and is considered acceptable when it is >0.75. Interrater reliability of the totals of the height, pliability, and vascularity subscales of the mVSS fell below the acceptable limit ( congruent with0.50). The individual subscales of the mVSS fell well below the acceptable level (< or =0.3). The Cutometer reading of elasticity provided acceptable reliability (>0.89) for each study site with the exception of severe scar. Mexameter and DermaScan C reliability measurements were acceptable for all sites (>0.82). Concurrent validity correlations with the mVSS were significant except for the comparison of the mVSS pliability subscale and the Cutometer maximum deformation measure comparison in severe scar. In conclusion, the Mexameter and DermaScan C measurements of scar color and thickness of all sites, as well as the Cutometer measurement of elasticity in all but the most severe scars shows high interrater reliability. Their significant concurrent validity with the mVSS confirms that these tools are measuring the same traits as the mVSS, and in a more objective way.
Objective measurements of excess skin in post bariatric patients--inter-rater reliability.

PubMed

Biörserud, Christina; Fagevik Olsén, Monika; Elander, Anna; Wiklund, Malin

2016-01-01

An ability to reliably assess excess skin after massive weight loss using well-described and transferrable methods is important. The aim of this trial was to evaluate inter-rater reliability of ptosis and circumference measurements in patients with excess skin after bariatric surgery. Twenty-five postbariatric patients were included in the study, and their excess skin was measured 18 months after surgery. A protocol was designed to measure excess skin in a standardised way. To evaluate the inter-rater reliability in the measuring protocol, all patients were measured twice, by a specialist nurse and a specialist physiotherapist. All circumference measurements on different body parts had an ICC > 0.9, indicating high reliability. Furthermore, all breast and abdominal ptosis measurements had high reliability. In contrast, visual evaluation of abdominal ptosis had poor reliability. Measurements of ptoses on different body parts had an ICC > 0.6. There were no systematic differences between the results of the two testers, except for measurements of the buttocks and maximal knee circumference. The measuring protocol presented in this study has high reliability and, therefore, represents a useful instrument to provide a consistent and objective assessment of excess skin in the postbariatric patient.
Reliability and validity of the test of incremental respiratory endurance measures of inspiratory muscle performance in COPD

PubMed Central

Formiga, Magno F; Roach, Kathryn E; Vital, Isabel; Urdaneta, Gisel; Balestrini, Kira; Calderon-Candelario, Rafael A

2018-01-01

Purpose The Test of Incremental Respiratory Endurance (TIRE) provides a comprehensive assessment of inspiratory muscle performance by measuring maximal inspiratory pressure (MIP) over time. The integration of MIP over inspiratory duration (ID) provides the sustained maximal inspiratory pressure (SMIP). Evidence on the reliability and validity of these measurements in COPD is not currently available. Therefore, we assessed the reliability, responsiveness and construct validity of the TIRE measures of inspiratory muscle performance in subjects with COPD. Patients and methods Test–retest reliability, known-groups and convergent validity assessments were implemented simultaneously in 81 male subjects with mild to very severe COPD. TIRE measures were obtained using the portable PrO2 device, following standard guidelines. Results All TIRE measures were found to be highly reliable, with SMIP demonstrating the strongest test–retest reliability with a nearly perfect intraclass correlation coefficient (ICC) of 0.99, while MIP and ID clustered closely together behind SMIP with ICC values of about 0.97. Our findings also demonstrated known-groups validity of all TIRE measures, with SMIP and ID yielding larger effect sizes when compared to MIP in distinguishing between subjects of different COPD status. Finally, our analyses confirmed convergent validity for both SMIP and ID, but not MIP. Conclusion The TIRE measures of MIP, SMIP and ID have excellent test–retest reliability and demonstrated known-groups validity in subjects with COPD. SMIP and ID also demonstrated evidence of moderate convergent validity and appear to be more stable measures in this patient population than the traditional MIP. PMID:29805255
Reliability and Validity of Advanced Phonics Measures

ERIC Educational Resources Information Center

Doty, Sara J.; Hixson, Michael D.; Decker, Dawn M.; Reynolds, Jennifer L.; Drevon, Daniel D.

2015-01-01

Two studies explored the technical adequacy of various measures of advanced phonics skills. In Study 1, the advanced phonics measures consisted of pseudowords, real words, or a combination of both. Participants included 39 students in the third grade. Test-retest correlations for all measures were above 0.8 and interrater reliability was high.…
Toward a Common Language for Measuring Patient Mobility in the Hospital: Reliability and Construct Validity of Interprofessional Mobility Measures.

PubMed

Hoyer, Erik H; Young, Daniel L; Klein, Lisa M; Kreif, Julie; Shumock, Kara; Hiser, Stephanie; Friedman, Michael; Lavezza, Annette; Jette, Alan; Chan, Kitty S; Needham, Dale M

2018-02-01

Daily Living Scale. The results were obtained from inpatients in the neuroscience department of a single hospital. The AM-PAC IMSF and JH-HLM had excellent interrater reliability and test-retest reliability for both physical therapists and nurses. The evaluation of convergent validity suggested that AM-PAC IMSF and JH-HLM measured constructs of patient mobility and physical functioning. © 2017 American Physical Therapy Association
Validity and reliability of a low-cost digital dynamometer for measuring isometric strength of lower limb.

PubMed

Romero-Franco, Natalia; Jiménez-Reyes, Pedro; Montaño-Munuera, Juan A

2017-11-01

Lower limb isometric strength is a key parameter to monitor the training process or recognise muscle weakness and injury risk. However, valid and reliable methods to evaluate it often require high-cost tools. The aim of this study was to analyse the concurrent validity and reliability of a low-cost digital dynamometer for measuring isometric strength in lower limb. Eleven physically active and healthy participants performed maximal isometric strength for: flexion and extension of ankle, flexion and extension of knee, flexion, extension, adduction, abduction, internal and external rotation of hip. Data obtained by the digital dynamometer were compared with the isokinetic dynamometer to examine its concurrent validity. Data obtained by the digital dynamometer from 2 different evaluators and 2 different sessions were compared to examine its inter-rater and intra-rater reliability. Intra-class correlation (ICC) for validity was excellent in every movement (ICC > 0.9). Intra and inter-tester reliability was excellent for all the movements assessed (ICC > 0.75). The low-cost digital dynamometer demonstrated strong concurrent validity and excellent intra and inter-tester reliability for assessing isometric strength in the main lower limb movements.

Agreement, the F-Measure, and Reliability in Information Retrieval

PubMed Central

Hripcsak, George; Rothschild, Adam S.

2005-01-01

Information retrieval studies that involve searching the Internet or marking phrases usually lack a well-defined number of negative cases. This prevents the use of traditional interrater reliability metrics like the κ statistic to assess the quality of expert-generated gold standards. Such studies often quantify system performance as precision, recall, and F-measure, or as agreement. It can be shown that the average F-measure among pairs of experts is numerically identical to the average positive specific agreement among experts and that κ approaches these measures as the number of negative cases grows large. Positive specific agreement—or the equivalent F-measure—may be an appropriate way to quantify interrater reliability and therefore to assess the reliability of a gold standard in these studies. PMID:15684123
Agreement and reliability of pelvic floor measurements during contraction using three-dimensional pelvic floor ultrasound and virtual reality.

PubMed

Speksnijder, L; Rousian, M; Steegers, E A P; Van Der Spek, P J; Koning, A H J; Steensma, A B

2012-07-01

Virtual reality is a novel method of visualizing ultrasound data with the perception of depth and offers possibilities for measuring non-planar structures. The levator ani hiatus has both convex and concave aspects. The aim of this study was to compare levator ani hiatus volume measurements obtained with conventional three-dimensional (3D) ultrasound and with a virtual reality measurement technique and to establish their reliability and agreement. 100 symptomatic patients visiting a tertiary pelvic floor clinic with a normal intact levator ani muscle diagnosed on translabial ultrasound were selected. Datasets were analyzed using a rendered volume with a slice thickness of 1.5 cm at the level of minimal hiatal dimensions during contraction. The levator area (in cm(2)) was measured and multiplied by 1.5 to get the levator ani hiatus volume in conventional 3D ultrasound (in cm(3)). Levator ani hiatus volume measurements were then measured semi-automatically in virtual reality (cm(3) ) using a segmentation algorithm. An intra- and interobserver analysis of reliability and agreement was performed in 20 randomly chosen patients. The mean difference between levator ani hiatus volume measurements performed using conventional 3D ultrasound and virtual reality was 0.10 (95% CI, - 0.15 to 0.35) cm(3). The intraclass correlation coefficient (ICC) comparing conventional 3D ultrasound with virtual reality measurements was > 0.96. Intra- and interobserver ICCs for conventional 3D ultrasound measurements were > 0.94 and for virtual reality measurements were > 0.97, indicating good reliability for both. Levator ani hiatus volume measurements performed using virtual reality were reliable and the results were similar to those obtained with conventional 3D ultrasonography. Copyright © 2012 ISUOG. Published by John Wiley & Sons, Ltd.
Reliability of Various Measurement Stations for Determining Plantar Fascia Thickness and Echogenicity.

PubMed

Bisi-Balogun, Adebisi; Cassel, Michael; Mayer, Frank

2016-04-13

This study aimed to determine the relative and absolute reliability of ultrasound (US) measurements of the thickness and echogenicity of the plantar fascia (PF) at different measurement stations along its length using a standardized protocol. Twelve healthy subjects (24 feet) were enrolled. The PF was imaged in the longitudinal plane. Subjects were assessed twice to evaluate the intra-rater reliability. A quantitative evaluation of the thickness and echogenicity of the plantar fascia was performed using Image J, a digital image analysis and viewer software. A sonography evaluation of the thickness and echogenicity of the PF showed a high relative reliability with an Intra class correlation coefficient of ≥0.88 at all measurement stations. However, the measurement stations for both the PF thickness and echogenicity which showed the highest intraclass correlation coefficient (ICCs) did not have the highest absolute reliability. Compared to other measurement stations, measuring the PF thickness at 3 cm distal and the echogenicity at a region of interest 1 cm to 2 cm distal from its insertion at the medial calcaneal tubercle showed the highest absolute reliability with the least systematic bias and random error. Also, the reliability was higher using a mean of three measurements compared to one measurement. To reduce discrepancies in the interpretation of the thickness and echogenicity measurements of the PF, the absolute reliability of the different measurement stations should be considered in clinical practice and research rather than the relative reliability with the ICC.
Reliability of Various Measurement Stations for Determining Plantar Fascia Thickness and Echogenicity

PubMed Central

Bisi-Balogun, Adebisi; Cassel, Michael; Mayer, Frank

2016-01-01

This study aimed to determine the relative and absolute reliability of ultrasound (US) measurements of the thickness and echogenicity of the plantar fascia (PF) at different measurement stations along its length using a standardized protocol. Twelve healthy subjects (24 feet) were enrolled. The PF was imaged in the longitudinal plane. Subjects were assessed twice to evaluate the intra-rater reliability. A quantitative evaluation of the thickness and echogenicity of the plantar fascia was performed using Image J, a digital image analysis and viewer software. A sonography evaluation of the thickness and echogenicity of the PF showed a high relative reliability with an Intra class correlation coefficient of ≥0.88 at all measurement stations. However, the measurement stations for both the PF thickness and echogenicity which showed the highest intraclass correlation coefficient (ICCs) did not have the highest absolute reliability. Compared to other measurement stations, measuring the PF thickness at 3 cm distal and the echogenicity at a region of interest 1 cm to 2 cm distal from its insertion at the medial calcaneal tubercle showed the highest absolute reliability with the least systematic bias and random error. Also, the reliability was higher using a mean of three measurements compared to one measurement. To reduce discrepancies in the interpretation of the thickness and echogenicity measurements of the PF, the absolute reliability of the different measurement stations should be considered in clinical practice and research rather than the relative reliability with the ICC. PMID:27089369
Measurement and Reliability of Response Inhibition

PubMed Central

Congdon, Eliza; Mumford, Jeanette A.; Cohen, Jessica R.; Galvan, Adriana; Canli, Turhan; Poldrack, Russell A.

2012-01-01

Response inhibition plays a critical role in adaptive functioning and can be assessed with the Stop-signal task, which requires participants to suppress prepotent motor responses. Evidence suggests that this ability to inhibit a prepotent motor response (reflected as Stop-signal reaction time (SSRT)) is a quantitative and heritable measure of interindividual variation in brain function. Although attention has been given to the optimal method of SSRT estimation, and initial evidence exists in support of its reliability, there is still variability in how Stop-signal task data are treated across samples. In order to examine this issue, we pooled data across three separate studies and examined the influence of multiple SSRT calculation methods and outlier calling on reliability (using Intra-class correlation). Our results suggest that an approach which uses the average of all available sessions, all trials of each session, and excludes outliers based on predetermined lenient criteria yields reliable SSRT estimates, while not excluding too many participants. Our findings further support the reliability of SSRT, which is commonly used as an index of inhibitory control, and provide support for its continued use as a neurocognitive phenotype. PMID:22363308
Generalizability Theory as a Unifying Framework of Measurement Reliability in Adolescent Research

ERIC Educational Resources Information Center

Fan, Xitao; Sun, Shaojing

2014-01-01

In adolescence research, the treatment of measurement reliability is often fragmented, and it is not always clear how different reliability coefficients are related. We show that generalizability theory (G-theory) is a comprehensive framework of measurement reliability, encompassing all other reliability methods (e.g., Pearson "r,"…
Intra- and interobserver reliability of quantitative ultrasound measurement of the plantar fascia.

PubMed

Rathleff, Michael Skovdal; Moelgaard, Carsten; Lykkegaard Olesen, Jens

2011-01-01

To determine intra- and interobserver reliability and measurement precision of sonographic assessment of plantar fascia thickness when using one, the mean of two, or the mean of three measurements. Two experienced observers scanned 20 healthy subjects twice with 60 minutes between test and retest. A GE LOGIQe ultrasound scanner was used in the study. The built-in software in the scanner was used to measure the thickness of the plantar fascia (PF). Reliability was calculated using intraclass correlation coefficient (ICC) and limits of agreement (LOA). Intraobserver reliability (ICC) using one measurement was 0.50 for one observer and 0.52 for the other, and using the mean of three measurements intraobserver reliability increased up to 0.77 and 0.67, respectively. Interobserver reliability (ICC) when using one measurement was 0.62 and increased to 0.82 when using the average of three measurements. LOA showed that when using the average of three measurements, LOA decreased to 0.6 mm, corresponding to 17.5% of the mean thickness of the PF. The results showed that reliability increases when using the mean of three measurements compared with one. Limits of agreement based on intratester reliability shows that changes in thickness that are larger than 0.6 mm can be considered actual changes in thickness and not a result of measurement error. Copyright © 2011 Wiley Periodicals, Inc.
Reliability of lower limb alignment measures using an established landmark-based method with a customized computer software program

PubMed Central

Sled, Elizabeth A.; Sheehy, Lisa M.; Felson, David T.; Costigan, Patrick A.; Lam, Miu; Cooke, T. Derek V.

2010-01-01

The objective of the study was to evaluate the reliability of frontal plane lower limb alignment measures using a landmark-based method by (1) comparing inter- and intra-reader reliability between measurements of alignment obtained manually with those using a computer program, and (2) determining inter- and intra-reader reliability of computer-assisted alignment measures from full-limb radiographs. An established method for measuring alignment was used, involving selection of 10 femoral and tibial bone landmarks. 1) To compare manual and computer methods, we used digital images and matching paper copies of five alignment patterns simulating healthy and malaligned limbs drawn using AutoCAD. Seven readers were trained in each system. Paper copies were measured manually and repeat measurements were performed daily for 3 days, followed by a similar routine with the digital images using the computer. 2) To examine the reliability of computer-assisted measures from full-limb radiographs, 100 images (200 limbs) were selected as a random sample from 1,500 full-limb digital radiographs which were part of the Multicenter Osteoarthritis (MOST) Study. Three trained readers used the software program to measure alignment twice from the batch of 100 images, with two or more weeks between batch handling. Manual and computer measures of alignment showed excellent agreement (intraclass correlations [ICCs] 0.977 – 0.999 for computer analysis; 0.820 – 0.995 for manual measures). The computer program applied to full-limb radiographs produced alignment measurements with high inter- and intra-reader reliability (ICCs 0.839 – 0.998). In conclusion, alignment measures using a bone landmark-based approach and a computer program were highly reliable between multiple readers. PMID:19882339
Reliability of Wearable Inertial Measurement Units to Measure Physical Activity in Team Handball.

PubMed

Luteberget, Live S; Holme, Benjamin R; Spencer, Matt

2018-04-01

To assess the reliability and sensitivity of commercially available inertial measurement units to measure physical activity in team handball. Twenty-two handball players were instrumented with 2 inertial measurement units (OptimEye S5; Catapult Sports, Melbourne, Australia) taped together. They participated in either a laboratory assessment (n = 10) consisting of 7 team handball-specific tasks or field assessment (n = 12) conducted in 12 training sessions. Variables, including PlayerLoad™ and inertial movement analysis (IMA) magnitude and counts, were extracted from the manufacturers' software. IMA counts were divided into intensity bands of low (1.5-2.5 m·s -1 ), medium (2.5-3.5 m·s -1 ), high (>3.5 m·s -1 ), medium/high (>2.5 m·s -1 ), and total (>1.5 m·s -1 ). Reliability between devices and sensitivity was established using coefficient of variation (CV) and smallest worthwhile difference (SWD). Laboratory assessment: IMA magnitude showed a good reliability (CV = 3.1%) in well-controlled tasks. CV increased (4.4-6.7%) in more-complex tasks. Field assessment: Total IMA counts (CV = 1.8% and SWD = 2.5%), PlayerLoad (CV = 0.9% and SWD = 2.1%), and their associated variables (CV = 0.4-1.7%) showed a good reliability, well below the SWD. However, the CV of IMA increased when categorized into intensity bands (2.9-5.6%). The reliability of IMA counts was good when data were displayed as total, high, or medium/high counts. A good reliability for PlayerLoad and associated variables was evident. The CV of the previously mentioned variables was well below the SWD, suggesting that OptimEye's inertial measurement unit and its software are sensitive for use in team handball.
Precision of lumbar intervertebral measurements: does a computer-assisted technique improve reliability?

PubMed

Pearson, Adam M; Spratt, Kevin F; Genuario, James; McGough, William; Kosman, Katherine; Lurie, Jon; Sengupta, Dilip K

2011-04-01

Comparison of intra- and interobserver reliability of digitized manual and computer-assisted intervertebral motion measurements and classification of "instability." To determine if computer-assisted measurement of lumbar intervertebral motion on flexion-extension radiographs improves reliability compared with digitized manual measurements. Many studies have questioned the reliability of manual intervertebral measurements, although few have compared the reliability of computer-assisted and manual measurements on lumbar flexion-extension radiographs. Intervertebral rotation, anterior-posterior (AP) translation, and change in anterior and posterior disc height were measured with a digitized manual technique by three physicians and by three other observers using computer-assisted quantitative motion analysis (QMA) software. Each observer measured 30 sets of digital flexion-extension radiographs (L1-S1) twice. Shrout-Fleiss intraclass correlation coefficients for intra- and interobserver reliabilities were computed. The stability of each level was also classified (instability defined as >4 mm AP translation or 10° rotation), and the intra- and interobserver reliabilities of the two methods were compared using adjusted percent agreement (APA). Intraobserver reliability intraclass correlation coefficients were substantially higher for the QMA technique THAN the digitized manual technique across all measurements: rotation 0.997 versus 0.870, AP translation 0.959 versus 0.557, change in anterior disc height 0.962 versus 0.770, and change in posterior disc height 0.951 versus 0.283. The same pattern was observed for interobserver reliability (rotation 0.962 vs. 0.693, AP translation 0.862 vs. 0.151, change in anterior disc height 0.862 vs. 0.373, and change in posterior disc height 0.730 vs. 0.300). The QMA technique was also more reliable for the classification of "instability." Intraobserver APAs ranged from 87 to 97% for QMA versus 60% to 73% for digitized manual
Establishing Immediate Reliability of Sonographic Measurements of the Transversus Abdominis in Asymptomatic Adults Performing Upright Loaded Functional Tasks in a Clinical Context Without Delayed Recorded Measurement.

PubMed

McPherson, Sue; Watson, Todd; Pate, Lindsey

2016-08-01

This study examined the reliability of sonographic measurements of the transversus abdominis of adults without low back pain during upright loaded functional tasks in real time, without relying on delayed recorded images. A single-group repeated-measures reliability study was conducted on 12 healthy participants without low back pain. Six of these adults reported a prior history of abdominal drawing-in maneuver training without sonographic measurement. The participants performed 3 trials of neutral standing, a loaded forward reach, and a loaded box lift under rest and with abdominal drawing-in maneuver instructions; task order was randomized. Transversus abdominis thickness measurements were obtained by an experienced rater using B-mode sonography in real-time via electronic calipers twice on the same static image during all trials by a rater. The rater was masked to group assignment and on-screen measurement output and required to respond to trivia questions between repeated measurements. The participants included 6 male and 6 female adults with a mean age ± SD of 26.3 ± 3.7 years. Intra-rater intraclass correlation coefficients (2,3) were high and precise for the rater's first and second measurements for all tasks and instruction conditions for mean transversus abdominis thickness and percent change in thickness measurements (eg, ranges were 0.968-0.997 for intraclass correlation coefficients, 0.01-0.21 mm for standard errors of the measurement, and 0.01-0.58 mm for minimal detectable changes). Calipers cleared by the rater or a research assistant produced similar findings of excellent reliability and precision. High intra-rater reliability and precision of transversus abdominis thickness measurements were obtained by a physical therapist in real time from asymptomatic adults performing upright loaded functional tasks under rest and with abdominal drawing-in maneuver instructions.
Comparing the reliability of a trigonometric technique to goniometry and inclinometry in measuring ankle dorsiflexion.

PubMed

Sidaway, Ben; Euloth, Tracey; Caron, Heather; Piskura, Matthew; Clancy, Jessica; Aide, Alyson

2012-07-01

The purpose of this study was to compare the reliability of three previously used techniques for the measurement of ankle dorsiflexion ROM, open-chained goniometry, closed-chained goniometry, and inclinometry, to a novel trigonometric technique. Twenty-one physiotherapy students used four techniques (open-chained goniometry, closed-chained goniometry, inclinometry, and trigonometry) to assess dorsiflexion range of motion in 24 healthy volunteers. All student raters underwent training to establish competence in the four techniques. Raters then measured dorsiflexion with a randomly assigned measuring technique four times over two sessions, one week apart. Data were analyzed using a technique by session analysis of variance, technique measurement variability being the primary index of reliability. Comparisons were also made between the measurements derived from the four techniques and those obtained from a computerized video analysis system. Analysis of the rater measurement variability around the technique means revealed significant differences between techniques with the least variation being found in the trigonometric technique. Significant differences were also found between the technique means but no differences between sessions were evident. The trigonometric technique produced mean ROMs closest in value to those derived from computer analysis. Application of the trigonometric technique resulted in the least variability in measurement across raters and consequently should be considered for use when changes in dorsiflexion ROM need to be reliably assessed. Copyright © 2012 Elsevier B.V. All rights reserved.
Measuring the Pain Area: An Intra- and Inter-Rater Reliability Study Using Image Analysis Software.

PubMed

Dos Reis, Felipe Jose Jandre; de Barros E Silva, Veronica; de Lucena, Raphaela Nunes; Mendes Cardoso, Bruno Alexandre; Nogueira, Leandro Calazans

2016-01-01

Pain drawings have frequently been used for clinical information and research. The aim of this study was to investigate intra- and inter-rater reliability of area measurements performed on pain drawings. Our secondary objective was to verify the reliability when using computers with different screen sizes, both with and without mouse hardware. Pain drawings were completed by patients with chronic neck pain or neck-shoulder-arm pain. Four independent examiners participated in the study. Examiners A and B used the same computer with a 16-inch screen and wired mouse hardware. Examiner C used a notebook with a 16-inch screen and no mouse hardware, and Examiner D used a computer with an 11.6-inch screen and a wireless mouse. Image measurements were obtained using GIMP and NIH ImageJ computer programs. The length of all the images was measured using GIMP software to a set scale in ImageJ. Thus, each marked area was encircled and the total surface area (cm(2) ) was calculated for each pain drawing measurement. A total of 117 areas were identified and 52 pain drawings were analyzed. The intrarater reliability between all examiners was high (ICC = 0.989). The inter-rater reliability was also high. No significant differences were observed when using different screen sizes or when using or not using the mouse hardware. This suggests that the precision of these measurements is acceptable for the use of this method as a measurement tool in clinical practice and research. © 2014 World Institute of Pain.
Automated reliability assessment for spectroscopic redshift measurements

NASA Astrophysics Data System (ADS)

Jamal, S.; Le Brun, V.; Le Fèvre, O.; Vibert, D.; Schmitt, A.; Surace, C.; Copin, Y.; Garilli, B.; Moresco, M.; Pozzetti, L.

2018-03-01

Context. Future large-scale surveys, such as the ESA Euclid mission, will produce a large set of galaxy redshifts (≥106) that will require fully automated data-processing pipelines to analyze the data, extract crucial information and ensure that all requirements are met. A fundamental element in these pipelines is to associate to each galaxy redshift measurement a quality, or reliability, estimate. Aim. In this work, we introduce a new approach to automate the spectroscopic redshift reliability assessment based on machine learning (ML) and characteristics of the redshift probability density function. Methods: We propose to rephrase the spectroscopic redshift estimation into a Bayesian framework, in order to incorporate all sources of information and uncertainties related to the redshift estimation process and produce a redshift posterior probability density function (PDF). To automate the assessment of a reliability flag, we exploit key features in the redshift posterior PDF and machine learning algorithms. Results: As a working example, public data from the VIMOS VLT Deep Survey is exploited to present and test this new methodology. We first tried to reproduce the existing reliability flags using supervised classification in order to describe different types of redshift PDFs, but due to the subjective definition of these flags (classification accuracy 58%), we soon opted for a new homogeneous partitioning of the data into distinct clusters via unsupervised classification. After assessing the accuracy of the new clusters via resubstitution and test predictions (classification accuracy 98%), we projected unlabeled data from preliminary mock simulations for the Euclid space mission into this mapping to predict their redshift reliability labels. Conclusions: Through the development of a methodology in which a system can build its own experience to assess the quality of a parameter, we are able to set a preliminary basis of an automated reliability assessment for
Is radiographic measurement of bony landmarks reliable for lateral meniscal sizing?

PubMed

Yoon, Jung-Ro; Kim, Taik-Seon; Lim, Hong-Chul; Lim, Hyung-Tae; Yang, Jae-Hyuk

2011-03-01

The accuracy of meniscal measurement methods is still in debate. The authors' protocol for radiologic measurements will provide reproducible bony landmarks, and this measurement method of the lateral tibial plateau will correlate with the actual anatomic value. Controlled laboratory study. Twenty-five samples of fresh lateral meniscus with attached proximal tibia were obtained during total knee arthroplasty. Each sample was obtained without damage to the meniscus and bony attachment sites. The inclusion criterion was mild to moderate osteoarthritis in patients with mechanical axis deviation of less than 15°. Knees with lateral compartment osteoarthritic change or injured or degenerated menisci were excluded. For the lateral tibial plateau length measurements, the radiographic beam was angled 10° caudally at neutral rotation, which allowed differentiation of the lateral plateau cortical margins from the medial plateau. The transition points were identified and used for length measurement. The values of length were then compared with the conventional Pollard method and the anatomic values. The width measurement was done according to Pollard's protocol. For each knee, the percentage deviation from the anatomic dimension was recorded. Intraobserver error and interobserver error were calculated. The deviation of the authors' radiographic length measurements from anatomic dimensions was 1.4 ± 1.1 mm. The deviation of Pollard's radiographic length measurements was 4.1 ± 2.0 mm. With respect to accuracy-which represents the frequency of measurements that fall within 10% of measurements-the accuracy of authors' length was 98%, whereas for Pollard's method it was 40%. There was a good correlation between anatomic meniscal dimensions and each radiologic plateau dimensions for lateral meniscal width (R(2) = .790) and the authors' lateral meniscal length (R(2) = .823) and fair correlation for Pollard's lateral meniscal length (R(2) = .660). The reliability of each
Measurement of fetal head descent using the 'angle of progression' on transperineal ultrasound imaging is reliable regardless of fetal head station or ultrasound expertise.

PubMed

Dückelmann, A M; Bamberg, C; Michaelis, S A M; Lange, J; Nonnenmacher, A; Dudenhausen, J W; Kalache, K D

2010-02-01

To assess whether ultrasound experience or fetal head station affects the reliability of measurement of fetal head descent using the angle of progression on intrapartum ultrasound images obtained by a single experienced operator, and to determine reliability of measurements when images were acquired by different operators with variable ultrasound experience. One experienced obstetrician performed 44 transperineal ultrasound examinations of women at term and in prolonged second stage of labor with the fetus in the occipitoanterior position. Three midwives without ultrasound experience, three obstetricians with < 5 years' experience and three obstetricians with > 10 years' experience measured fetal head descent based on the angle of progression in the images obtained. The angle of progression was measured by two obstetricians in independent ultrasound examinations of 24 laboring women at term with the fetus in the cephalic position to allow assessment of the reliability of image acquisition. Intraclass correlation coefficients (ICCs) with 95% confidence interval (CI) were used to evaluate interobserver reliability and Bland-Altman analysis was used to assess interobserver agreement. In total, 444 measurements were performed and compared. Interobserver reliability with respect to offline image analysis was substantial (overall ICC, 0.72; 95% CI, 0.63-0.81). ICCs were 0.82 (95% CI, 0.70-0.89), 0.81 (95% CI, 0.71-0.88) and 0.61 (95% CI, 0.43-074) for observers with > 10 years', < 5 years' and no ultrasound experience, respectively. There were no significant differences between ICCs among observer groups according to ultrasound experience. Fetal head station did not affect reliability. Bland-Altman analysis indicated reasonable agreement between measurements obtained by two different operators with > 10 years' and < 5 years' ultrasound experience (bias, -1.09 degrees ; 95% limits of agreement, -8.76 to 6.58). The reliability of measurement of the angle of progression
Reliability of Measurements Performed by Community-Drawn Anthropometrists from Rural Ethiopia

PubMed Central

Ayele, Berhan; Aemere, Abaineh; Gebre, Teshome; Tadesse, Zerihun; Stoller, Nicole E.; See, Craig W.; Yu, Sun N.; Gaynor, Bruce D.; McCulloch, Charles E.; Porco, Travis C.; Emerson, Paul M.; Lietman, Thomas M.; Keenan, Jeremy D.

2012-01-01

Background Undernutrition is an important risk factor for childhood mortality, and remains a major problem facing many developing countries. Millennium Development Goal 1 calls for a reduction in underweight children, implemented through a variety of interventions. To adequately judge the impact of these interventions, it is important to know the reproducibility of the main indicators for undernutrition. In this study, we trained individuals from rural communities in Ethiopia in anthropometry techniques and measured intra- and inter-observer reliability. Methods and Findings We trained 6 individuals without prior anthropometry experience to perform weight, height, and middle upper arm circumference (MUAC) measurements. Two anthropometry teams were dispatched to 18 communities in rural Ethiopia and measurements performed on all consenting pre-school children. Anthropometry teams performed a second independent measurement on a convenience sample of children in order to assess intra-anthropometrist reliability. Both teams measured the same children in 2 villages to assess inter-anthropometrist reliability. We calculated several metrics of measurement reproducibility, including the technical error of measurement (TEM) and relative TEM. In total, anthropometry teams performed measurements on 606 pre-school children, 84 of which had repeat measurements performed by the same team, and 89 of which had measurements performed by both teams. Intra-anthropometrist TEM (and relative TEM) were 0.35 cm (0.35%) for height, 0.05 kg (0.39%) for weight, and 0.18 cm (1.27%) for MUAC. Corresponding values for inter-anthropometrist reliability were 0.67 cm (0.75%) for height, 0.09 kg (0.79%) for weight, and 0.22 kg (1.53%) for MUAC. Inter-anthropometrist measurement error was greater for smaller children than for larger children. Conclusion Measurements of height and weight were more reproducible than measurements of MUAC and measurements of larger children were more reliable than those
Reliability of Total Test Scores When Considered as Ordinal Measurements

ERIC Educational Resources Information Center

Biswas, Ajoy Kumar

2006-01-01

This article studies the ordinal reliability of (total) test scores. This study is based on a classical-type linear model of observed score (X), true score (T), and random error (E). Based on the idea of Kendall's tau-a coefficient, a measure of ordinal reliability for small-examinee populations is developed. This measure is extended to large…
Inter-rater Reliability of Real-Time Ultrasound to Measure Acromiohumeral Distance.

PubMed

Mackenzie, Tanya Anne; Bdaiwi, Alya H; Herrington, Lee; Cools, Ann

2016-07-01

Real-time ultrasound (RTUS) has been suggested as a reliable measure of acromiohumeral distance. However, to date, no vigorous assessment and reporting of inter-rater reliability of this method has been performed with the shoulder in a neutral position or with active and passive arm abduction. To assess intrasession inter-rater reliability of using RTUS to measure acromiohumeral distance with the shoulder in a neutral position and with 60° active and passive abduction. Inter-rater intrasession reliability of repeated measures. Human performance laboratory. Twenty persons (12 male and 8 female) with an average age of 29.86 years (standard deviation, 7.8). In an inter-rater, intrasession study, RTUS was used to measure the acromiohumeral distance with the shoulder in a neutral position and with 60° of both active and passive abduction. Acromiohumeral distance. Intraclass correlation coefficient (ICC)2.1 scores ranged between 0.65-0.88 (standard error of the mean = 0.81-1.2 mm and minimal detectable differences with 95% confidence = 2.2-2.3 mm) for inter-rater intrasession reliability. RTUS was found to have fair to good inter-rater reliability as a tool to measure acromiohumeral distance with the shoulder in a neutral position and with 60° of both active and passive arm abduction. Copyright © 2016 American Academy of Physical Medicine and Rehabilitation. Published by Elsevier Inc. All rights reserved.
Measurements of Weight Bearing Asymmetry Using the Nintendo Wii Fit Balance Board Are Not Reliable for Older Adults and Individuals With Stroke.

PubMed

Liuzzo, Derek M; Peters, Denise M; Middleton, Addie; Lanier, Wes; Chain, Rebecca; Barksdale, Brittany; Fritz, Stacy L

Clinicians and researchers have used bathroom scales, balance performance monitors with feedback, postural scale analysis, and force platforms to evaluate weight bearing asymmetry (WBA). Now video game consoles offer a novel alternative for assessing this construct. By using specialized software, the Nintendo Wii Fit balance board can provide reliable measurements of WBA in healthy, young adults. However, reliability of measurements obtained using only the factory settings to assess WBA in older adults and individuals with stroke has not been established. To determine whether measurements of WBA obtained using the Nintendo Wii Fit balance board and default settings are reliable in older adults and individuals with stroke. Weight bearing asymmetry was assessed using the Nintendo Wii Fit balance board in 2 groups of participants-individuals older than 65 years (n = 41) and individuals with stroke (n = 41). Participants were given a standardized set of instructions and were not provided auditory or visual feedback. Two trials were performed. Intraclass correlation coefficients (ICC), standard error of measure (SEM), and minimal detectable change (MDC) scores were determined for each group. The ICC for the older adults sample was 0.59 (0.35-0.76) with SEM95 = 6.2% and MDC95 = 8.8%. The ICC for the sample including individuals with stroke was 0.60 (0.47-0.70) with SEM95 = 9.6% and MDC95 = 13.6%. Although measurements of WBA obtained using the Nintendo Wii Fit balance board, and its default factory settings, demonstrate moderate reliability in older adults and individuals with stroke, the relatively high associated SEM and MDC values substantially reduce the clinical utility of the Nintendo Wii Fit balance board as an assessment tool for WBA. Weight bearing asymmetry cannot be measured reliably in older adults and individuals with stroke using the Nintendo Wii Fit balance board without the use of specialized software.

Reliability of human-supervised formant-trajectory measurement for forensic voice comparison.

PubMed

Zhang, Cuiling; Morrison, Geoffrey Stewart; Ochoa, Felipe; Enzinger, Ewald

2013-01-01

Acoustic-phonetic approaches to forensic voice comparison often include human-supervised measurement of vowel formants, but the reliability of such measurements is a matter of concern. This study assesses the within- and between-supervisor variability of three sets of formant-trajectory measurements made by each of four human supervisors. It also assesses the validity and reliability of forensic-voice-comparison systems based on these measurements. Each supervisor's formant-trajectory system was fused with a baseline mel-frequency cepstral-coefficient system, and performance was assessed relative to the baseline system. Substantial improvements in validity were found for all supervisors' systems, but some supervisors' systems were more reliable than others.
Reliability of EEG Interactions Differs between Measures and Is Specific for Neurological Diseases

PubMed Central

Höller, Yvonne; Butz, Kevin; Thomschewski, Aljoscha; Schmid, Elisabeth; Uhl, Andreas; Bathke, Arne C.; Zimmermann, Georg; Tomasi, Santino O.; Nardone, Raffaele; Staffen, Wolfgang; Höller, Peter; Leitinger, Markus; Höfler, Julia; Kalss, Gudrun; Taylor, Alexandra C.; Kuchukhidze, Giorgi; Trinka, Eugen

2017-01-01

Alterations of interaction (connectivity) of the EEG reflect pathological processes in patients with neurologic disorders. Nevertheless, it is questionable whether these patterns are reliable over time in different measures of interaction and whether this reliability of the measures is the same across different patient populations. In order to address this topic we examined 22 patients with mild cognitive impairment, five patients with subjective cognitive complaints, six patients with right-lateralized temporal lobe epilepsy, seven patients with left lateralized temporal lobe epilepsy, and 20 healthy controls. We calculated 14 measures of interaction from two EEG-recordings separated by 2 weeks. In order to characterize test-retest reliability, we correlated these measures for each group and compared the correlations between measures and between groups. We found that both measures of interaction as well as groups differed from each other in terms of reliability. The strongest correlation coefficients were found for spectrum, coherence, and full frequency directed transfer function (average rho > 0.9). In the delta (2–4 Hz) range, reliability was lower for mild cognitive impairment compared to healthy controls and left lateralized temporal lobe epilepsy. In the beta (13–30 Hz), gamma (31–80 Hz), and high gamma (81–125 Hz) frequency ranges we found decreased reliability in subjective cognitive complaints compared to mild cognitive impairment. In the gamma and high gamma range we found increased reliability in left lateralized temporal lobe epilepsy patients compared to healthy controls. Our results emphasize the importance of documenting reliability of measures of interaction, which may vary considerably between measures, but also between patient populations. We suggest that studies claiming clinical usefulness of measures of interaction should provide information on the reliability of the results. In addition, differences between patient groups in
Measuring the Cobb angle with the iPhone in kyphoses: a reliability study.

PubMed

Jacquot, Frederic; Charpentier, Axelle; Khelifi, Sofiane; Gastambide, Daniel; Rigal, Regis; Sautet, Alain

2012-08-01

Smartphones have gained widespread use in the healthcare field to fulfill a variety of tasks. We developed a small iPhone application to take advantage of the built-in position sensor to measure angles in a variety of spinal deformities. We present a reliability study of this tool in measuring kyphotic angles. Radiographs taken from 20 different patients' charts were presented to a panel of six operators at two different times. Radiographs were measured with the protractor and the iPhone application and statistical analysis was applied to measure intraclass correlation coefficients between both measurement methods, and to measure intra- and interobserver reliability The intraclass correlation coefficient calculated between methods (i.e. CobbMeter application on the iPhone versus standard method with the protractor) was 0.963 for all measures, indicating excellent correlation was obtained between the CobbMeter application and the standard method. The interobserver correlation coefficient was 0.965. The intraobserver ICC was 0.977, indicating excellent reproductibility of measurements at different times for all operators. The interobserver ICC between fellowship trained senior surgeons and general orthopaedic residents was 0.989. Consistently, the ICC for intraobserver and interobserver correlations was higher with the CobbMeter application than with the regular protractor method. This difference was not statistically significant. Measuring kyphotic angles with the iPhone application appears to be a valid procedure and is in no way inferior to the standard way of measuring the Cobb angle in kyphotic deformities.
Measuring eating disorder attitudes and behaviors: a reliability generalization study

PubMed Central

2014-01-01

Background Although score reliability is a sample-dependent characteristic, researchers often only report reliability estimates from previous studies as justification for employing particular questionnaires in their research. The present study followed reliability generalization procedures to determine the mean score reliability of the Eating Disorder Inventory and its most commonly employed subscales (Drive for Thinness, Bulimia, and Body Dissatisfaction) and the Eating Attitudes Test as a way to better identify those characteristics that might impact score reliability. Methods Published studies that used these measures were coded based on their reporting of reliability information and additional study characteristics that might influence score reliability. Results Score reliability estimates were included in 26.15% of studies using the EDI and 36.28% of studies using the EAT. Mean Cronbach’s alphas for the EDI (total score = .91; subscales = .75 to .89), EAT-40 (total score = .81) and EAT-26 (total score = .86; subscales = .56 to .80) suggested variability in estimated internal consistency. Whereas some EDI subscales exhibited higher score reliability in clinical eating disorder samples than in nonclinical samples, other subscales did not exhibit these differences. Score reliability information for the EAT was primarily reported for nonclinical samples, making it difficult to characterize the effect of type of sample on these measures. However, there was a tendency for mean score reliability to be higher in the adult (vs. adolescent) samples and in female (vs. male) samples. Conclusions Overall, this study highlights the importance of assessing and reporting internal consistency during every test administration because reliability is affected by characteristics of the participants being examined. PMID:24764530
The reliability of three devices used for measuring vertical jump height.

PubMed

Nuzzo, James L; Anning, Jonathan H; Scharfenberg, Jessica M

2011-09-01

The purpose of this investigation was to assess the intrasession and intersession reliability of the Vertec, Just Jump System, and Myotest for measuring countermovement vertical jump (CMJ) height. Forty male and 39 female university students completed 3 maximal-effort CMJs during 2 testing sessions, which were separated by 24-48 hours. The height of the CMJ was measured from all 3 devices simultaneously. Systematic error, relative reliability, absolute reliability, and heteroscedasticity were assessed for each device. Systematic error across the 3 CMJ trials was observed within both sessions for males and females, and this was most frequently observed when the CMJ height was measured by the Vertec. No systematic error was discovered across the 2 testing sessions when the maximum CMJ heights from the 2 sessions were compared. In males, the Myotest demonstrated the best intrasession reliability (intraclass correlation coefficient [ICC] = 0.95; SEM = 1.5 cm; coefficient of variation [CV] = 3.3%) and intersession reliability (ICC = 0.88; SEM = 2.4 cm; CV = 5.3%; limits of agreement = -0.08 ± 4.06 cm). Similarly, in females, the Myotest demonstrated the best intrasession reliability (ICC = 0.91; SEM = 1.4 cm; CV = 4.5%) and intersession reliability (ICC = 0.92; SEM = 1.3 cm; CV = 4.1%; limits of agreement = 0.33 ± 3.53 cm). Additional analysis revealed that heteroscedasticity was present in the CMJ when measured from all 3 devices, indicating that better jumpers demonstrate greater fluctuations in CMJ scores across testing sessions. To attain reliable CMJ height measurements, practitioners are encouraged to familiarize athletes with the CMJ technique and then allow the athletes to complete numerous repetitions until performance plateaus, particularly if the Vertec is being used.
The Reliability and Validity of Using Regression Residuals to Measure Institutional Effectiveness in Promoting Degree Completion

ERIC Educational Resources Information Center

Horn, Aaron S.; Lee, Giljae

2016-01-01

A relatively simple way of measuring institutional effectiveness in relation to degree completion is to estimate the difference between an actual and predicted graduation rate, but the reliability and validity of this method have not been thoroughly examined. Longitudinal data were obtained from IPEDS for both public and private not-for-profit…
Radiographic measurement reliability of lumbar lordosis in ankylosing spondylitis.

PubMed

Lee, Jung Sub; Goh, Tae Sik; Park, Shi Hwan; Lee, Hong Seok; Suh, Kuen Tak

2013-04-01

Intraobserver and interobserver reliabilities of the several different methods to measure lumbar lordosis have been reported. However, it has not been studied sofar in patients with ankylosing spondylitis (AS). We evaluated the inter and intraobserver reliabilities of six specific measures of global lumbar lordosis in patients with AS. Ninety-one consecutive patients with AS who met the most recently modified New York criteria were enrolled and underwent anteroposterior and lateral radiographs of whole spine. The radiographs were divided into non-ankylosis (no bony bridge in the lumbar spine), incomplete ankylosis (lumbar spines were partially connected by bony bridge) and complete ankylosis groups to evaluate the reliability of the Cobb L1-S1, Cobb L1-L5, centroid, posterior tangent L1-S1, posterior tangent L1-L5, and TRALL methods. The radiographs were composed of 39 non-ankylosis, 27 incomplete ankylosis and 25 complete ankylosis. Intra- and inter-class correlation coefficients (ICCs) of all six methods were generally high. The ICCs were all ≥0.77 (excellent) for the six radiographic methods in the combined group. However, a comparison of the ICCs, 95 % confidence intervals and mean absolute difference (MAD) between groups with varying degrees of ankylosis showed that the reliability of the lordosis measurements decreased in proportion to the severity of ankylosis. The Cobb L1-S1, Cobb L1-L5 and posterior tangent L1-S1 method demonstrated higher ICCs for both inter and intraobserver comparisons and the other methods showed lower ICCs in all groups. The intraobserver MAD was similar in the Cobb L1-S1 and Cobb L1-L5 (2.7°-4.3°), but the other methods showed higher intraobserver MAD. Interobserver MAD of Cobb L1-L5 only showed low in all group. These results are the first to provide a reliability analysis of different global lumbar lordosis measurement methods in AS. The findings in this study demonstrated that the Cobb L1-L5 method is reliable for measuring
RELIABILITY OF ANKLE-FOOT MORPHOLOGY, MOBILITY, STRENGTH, AND MOTOR PERFORMANCE MEASURES.

PubMed

Fraser, John J; Koldenhoven, Rachel M; Saliba, Susan A; Hertel, Jay

2017-12-01

Assessment of foot posture, morphology, intersegmental mobility, strength and motor control of the ankle-foot complex are commonly used clinically, but measurement properties of many assessments are unclear. To determine test-retest and inter-rater reliability, standard error of measurement, and minimal detectable change of morphology, joint excursion and play, strength, and motor control of the ankle-foot complex. Reliability study. 24 healthy, recreationally-active young adults without history of ankle-foot injury were assessed by two clinicians on two occasions, three to ten days apart. Measurement properties were assessed for foot morphology (foot posture index, total and truncated length, width, arch height), joint excursion (weight-bearing dorsiflexion, rearfoot and hallux goniometry, forefoot inclinometry, 1 st metatarsal displacement) and joint play, strength (handheld dynamometry), and motor control rating during intrinsic foot muscle (IFM) exercises. Clinician order was randomized using a Latin Square. The clinicians performed independent examinations and did not confer on the findings for the duration of the study. Test-retest and inter-tester reliability and agreement was assessed using intraclass correlation coefficients (ICC 2,k ) and weighted kappa ( K w ). Test-retest reliability ICC were as follows: morphology: .80-1.00, joint excursion: .58-.97, joint play: -.67-.84, strength: .67-.92, IFM motor rating: K W -.01-.71. Inter-rater reliability ICC were as follows: morphology: .81-1.00, joint excursion: .32-.97, joint play: -1.06-1.00, strength: .53-.90, and IFM motor rating: K w .02-.56. Measures of ankle-foot posture, morphology, joint excursion, and strength demonstrated fair to excellent test-retest and inter-rater reliability. Test-retest reliability for rating of perceived difficulty and motor performance was good to excellent for short-foot, toe-spread-out, and hallux exercises and poor to fair for lesser toe extension. Joint play measures had
Behavioral Scale Reliability and Measurement Invariance Evaluation Using Latent Variable Modeling

ERIC Educational Resources Information Center

Raykov, Tenko

2004-01-01

A latent variable modeling approach to reliability and measurement invariance evaluation for multiple-component measuring instruments is outlined. An initial discussion deals with the limitations of coefficient alpha, a frequently used index of composite reliability. A widely and readily applicable structural modeling framework is next described…
Inter-Rater Reliability of Cyclotorsion Measurements Using Fundus Photography.

PubMed

Dysli, Muriel; Kanku, Madeleine; Traber, Ghislaine L

2018-04-01

The foveo-papillary angle (FPA) on fundus photographs is the accepted standard for the measurement of ocular cyclotorsion. We assessed the inter-rater reliability of this method in healthy subjects and in patients with trochlear nerve palsies. In this methodological study, fundus photographs of healthy subjects and of patients with trochlear nerve palsies were made with a fundus camera (Zeiss Fundus Camera FF 450 plus, Jena, Germany). Three independent observers measured the FPA on the fundus photographs of all subjects in synedra View (synedra View 16, Version 16.0.0.11, Innsbruck, Austria). One hundred and four eyes of 52 subjects (26 healthy controls and 26 patients) were assessed. The mean FPA of the healthy controls was 5.80 degrees (°) [± 0.44 standard error of the mean (SEM)] compared to 11.55° (± 0.80 SEM) for patients with trochlear nerve palsies. The inter-rater reliability of all measured FPAs showed an intraclass correlation coefficient (ICC) of 0.98 (95% CI 0.97 - 0.98). The inter-rater reliability of objective cyclotorsion measurements using fundus photographs was very high. Georg Thieme Verlag KG Stuttgart · New York.
Reliability of reference distances used in photogrammetry.

PubMed

Aksu, Muge; Kaya, Demet; Kocadereli, Ilken

2010-07-01

To determine the reliability of the reference distances used for photogrammetric assessment. The sample consisted of 100 subjects with mean ages of 22.97 +/- 2.98 years. Five lateral and four frontal parameters were measured directly on the subjects' faces. For photogrammetric assessment, two reference distances for the profile view and three reference distances for the frontal view were established. Standardized photographs were taken and all the parameters that had been measured directly on the face were measured on the photographs. The reliability of the reference distances was checked by comparing direct and indirect values of the parameters obtained from the subjects' faces and photographs. Repeated measure analysis of variance (ANOVA) and Bland-Altman analyses were used for statistical assessment. For profile measurements, the indirect values measured were statistically different from the direct values except for Sn-Sto in male subjects and Prn-Sn and Sn-Sto in female subjects. The indirect values of Prn-Sn and Sn-Sto were reliable in both sexes. The poorest results were obtained in the indirect values of the N-Sn parameter for female subjects and the Sn-Me parameter for male subjects according to the Sa-Sba reference distance. For frontal measurements, the indirect values were statistically different from the direct values in both sexes except for one in male subjects. The indirect values measured were not statistically different from the direct values for Go-Go. The indirect values of Ch-Ch were reliable in male subjects. The poorest results were obtained according to the P-P reference distance. For profile assessment, the T-Ex reference distance was reliable for Prn-Sn and Sn-Sto in both sexes. For frontal assessment, Ex-Ex and En-En reference distances were reliable for Ch-Ch in male subjects.
Inter-arch digital model vs. manual cast measurements: Accuracy and reliability.

PubMed

Kiviahde, Heikki; Bukovac, Lea; Jussila, Päivi; Pesonen, Paula; Sipilä, Kirsi; Raustia, Aune; Pirttiniemi, Pertti

2017-06-28

The purpose of this study was to evaluate the accuracy and reliability of inter-arch measurements using digital dental models and conventional dental casts. Thirty sets of dental casts with permanent dentition were examined. Manual measurements were done with a digital caliper directly on the dental casts, and digital measurements were made on 3D models by two independent examiners. Intra-class correlation coefficients (ICC), a paired sample t-test or Wilcoxon signed-rank test, and Bland-Altman plots were used to evaluate intra- and inter-examiner error and to determine the accuracy and reliability of the measurements. The ICC values were generally good for manual and excellent for digital measurements. The Bland-Altman plots of all the measurements showed good agreement between the manual and digital methods and excellent inter-examiner agreement using the digital method. Inter-arch occlusal measurements on digital models are accurate and reliable and are superior to manual measurements.
Stress Rupture Life Reliability Measures for Composite Overwrapped Pressure Vessels

NASA Technical Reports Server (NTRS)

Murthy, Pappu L. N.; Thesken, John C.; Phoenix, S. Leigh; Grimes-Ledesma, Lorie

2007-01-01

Composite Overwrapped Pressure Vessels (COPVs) are often used for storing pressurant gases onboard spacecraft. Kevlar (DuPont), glass, carbon and other more recent fibers have all been used as overwraps. Due to the fact that overwraps are subjected to sustained loads for an extended period during a mission, stress rupture failure is a major concern. It is therefore important to ascertain the reliability of these vessels by analysis, since the testing of each flight design cannot be completed on a practical time scale. The present paper examines specifically a Weibull statistics based stress rupture model and considers the various uncertainties associated with the model parameters. The paper also examines several reliability estimate measures that would be of use for the purpose of recertification and for qualifying flight worthiness of these vessels. Specifically, deterministic values for a point estimate, mean estimate and 90/95 percent confidence estimates of the reliability are all examined for a typical flight quality vessel under constant stress. The mean and the 90/95 percent confidence estimates are computed using Monte-Carlo simulation techniques by assuming distribution statistics of model parameters based also on simulation and on the available data, especially the sample sizes represented in the data. The data for the stress rupture model are obtained from the Lawrence Livermore National Laboratories (LLNL) stress rupture testing program, carried out for the past 35 years. Deterministic as well as probabilistic sensitivities are examined.
Reliability: A Comparison of Absence Measures.

DTIC Science & Technology

1985-09-01

28 8. Sick Leave Frequency Index ... . .. . . . . . . . 29 J.v -. ~~~~i j.- .i...- - AFIT/GSM/LSB/85S-19 Abstract Absenteeism is an important and...RELIABILITY: A COMPARISON OF ABSENCE MEASURES I. Literature Review Introduction Absenteeism may be defined as undesired work absence. This withdrawal...behavior is an important organizational problem because of probable reduced labor productivity and increased training costs. * Absenteeism is almost
Validity and intra-rater reliability of an Android phone application to measure cervical range-of-motion

PubMed Central

2014-01-01

Background Concurrent validity and intra-rater reliability using a customized Android phone application to measure cervical-spine range-of-motion (ROM) has not been previously validated against a gold-standard three-dimensional motion analysis (3DMA) system. Findings Twenty-one healthy individuals (age:31 ± 9.1 years, male:11) participated, with 16 re-examined for intra-rater reliability 1–7 days later. An Android phone was fixed on a helmet, which was then securely fastened on the participant’s head. Cervical-spine ROM in flexion, extension, lateral flexion and rotation were performed in sitting with concurrent measurements obtained from both a 3DMA system and the phone. The phone demonstrated moderate to excellent (ICC = 0.53-0.98, Spearman ρ = 0.52-0.98) concurrent validity for ROM measurements in cervical flexion, extension, lateral-flexion and rotation. However, cervical rotation demonstrated both proportional and fixed bias. Excellent intra-rater reliability was demonstrated for cervical flexion, extension and lateral flexion (ICC = 0.82-0.90), but poor for right- and left-rotation (ICC = 0.05-0.33) using the phone. Possible reasons for the outcome are that flexion, extension and lateral-flexion measurements are detected by gravity-dependent accelerometers while rotation measurements are detected by the magnetometer which can be adversely affected by surrounding magnetic fields. Conclusion The results of this study demonstrate that the tested Android phone application is valid and reliable to measure ROM of the cervical-spine in flexion, extension and lateral-flexion but not in rotation likely due to magnetic interference. The clinical implication of this study is that therapists should be mindful of the plane of measurement when using the Android phone to measure ROM of the cervical-spine. PMID:24742001
Validity and intra-rater reliability of an android phone application to measure cervical range-of-motion.

PubMed

Quek, June; Brauer, Sandra G; Treleaven, Julia; Pua, Yong-Hao; Mentiplay, Benjamin; Clark, Ross Allan

2014-04-17

Concurrent validity and intra-rater reliability using a customized Android phone application to measure cervical-spine range-of-motion (ROM) has not been previously validated against a gold-standard three-dimensional motion analysis (3DMA) system. Twenty-one healthy individuals (age:31 ± 9.1 years, male:11) participated, with 16 re-examined for intra-rater reliability 1-7 days later. An Android phone was fixed on a helmet, which was then securely fastened on the participant's head. Cervical-spine ROM in flexion, extension, lateral flexion and rotation were performed in sitting with concurrent measurements obtained from both a 3DMA system and the phone.The phone demonstrated moderate to excellent (ICC = 0.53-0.98, Spearman ρ = 0.52-0.98) concurrent validity for ROM measurements in cervical flexion, extension, lateral-flexion and rotation. However, cervical rotation demonstrated both proportional and fixed bias. Excellent intra-rater reliability was demonstrated for cervical flexion, extension and lateral flexion (ICC = 0.82-0.90), but poor for right- and left-rotation (ICC = 0.05-0.33) using the phone. Possible reasons for the outcome are that flexion, extension and lateral-flexion measurements are detected by gravity-dependent accelerometers while rotation measurements are detected by the magnetometer which can be adversely affected by surrounding magnetic fields. The results of this study demonstrate that the tested Android phone application is valid and reliable to measure ROM of the cervical-spine in flexion, extension and lateral-flexion but not in rotation likely due to magnetic interference. The clinical implication of this study is that therapists should be mindful of the plane of measurement when using the Android phone to measure ROM of the cervical-spine.
Obtaining Reliable Predictions of Terrestrial Energy Coupling From Real-Time Solar Wind Measurements

NASA Technical Reports Server (NTRS)

Weimer, Daniel R.

2002-01-01

Measurements of the interplanetary magnetic field (IMF) from the ACE (Advanced Composition Explorer), Wind, IMP-8 (Interplanetary Monitoring Platform), and Geotail spacecraft have revealed that the IMF variations are contained in phase planes that are tilted with respect to the propagation direction, resulting in continuously variable changes in propagation times between spacecraft, and therefore, to the Earth. Techniques for using 'minimum variance analysis' have been developed in order to be able to measure the phase front tilt angles, and better predict the actual propagation times from the L1 orbit to the Earth, using only the real-time IMF measurements from one spacecraft. The use of empirical models with the IMF measurements at L1 from ACE (or future satellites) for predicting 'space weather' effects has also been demonstrated.
Measurement-based reliability/performability models

NASA Technical Reports Server (NTRS)

Hsueh, Mei-Chen

1987-01-01

Measurement-based models based on real error-data collected on a multiprocessor system are described. Model development from the raw error-data to the estimation of cumulative reward is also described. A workload/reliability model is developed based on low-level error and resource usage data collected on an IBM 3081 system during its normal operation in order to evaluate the resource usage/error/recovery process in a large mainframe system. Thus, both normal and erroneous behavior of the system are modeled. The results provide an understanding of the different types of errors and recovery processes. The measured data show that the holding times in key operational and error states are not simple exponentials and that a semi-Markov process is necessary to model the system behavior. A sensitivity analysis is performed to investigate the significance of using a semi-Markov process, as opposed to a Markov process, to model the measured system.
Responsiveness to change and reliability of measurement of radiographic joint space width in osteoarthritis of the knee: a systematic review.

PubMed

Reichmann, W M; Maillefert, J F; Hunter, D J; Katz, J N; Conaghan, P G; Losina, E

2011-05-01

The goal of this systematic review was to report the responsiveness to change and reliability of conventional radiographic joint space width (JSW) measurement. We searched the PubMed and Embase databases using the following search criteria: [osteoarthritis (OA) (MeSH)] AND (knee) AND (X-ray OR radiography OR diagnostic imaging OR radiology OR disease progression) AND (joint space OR JSW or disease progression). We assessed responsiveness by calculating the standardized response mean (SRM). We assessed reliability using intra- and inter-reader intra-class correlation (ICC) and coefficient of variation (CV). Random-effects models were used to pool results from multiple studies. Results were stratified by study duration, design, techniques of obtaining radiographs, and measurement method. We identified 998 articles using the search terms. Of these, 32 articles (43 estimates) reported data on responsiveness of JSW measurement and 24 (50 estimates) articles reported data on measures of reliability. The overall pooled SRM was 0.33 [95% confidence interval (CI): 0.26, 0.41]. Responsiveness of change in JSW measurement was improved substantially in studies of greater than 2 years duration (0.57). Further stratifying this result in studies of greater than 2 years duration, radiographs obtained with the knee in a flexed position yielded an SRM of 0.71. Pooled intra-reader ICC was estimated at 0.97 (95% CI: 0.92, 1.00) and the intra-reader CV estimated at 3.0 (95% CI: 2.0, 4.0). Pooled inter-reader ICC was estimated at 0.93 (95% CI: 0.86, 0.99) and the inter-reader CV estimated at 3.4% (95% CI: 1.3%, 5.5%). Measurement of JSW obtained from radiographs in persons with knee is reliable. These data will be useful to clinicians who are planning RCTs where the change in minimum JSW is the outcome of interest. Copyright © 2011 Osteoarthritis Research Society International. Published by Elsevier Ltd. All rights reserved.
Nutrition environment measures survey-vending: development, dissemination, and reliability.

PubMed

Voss, Carol; Klein, Susan; Glanz, Karen; Clawson, Margaret

2012-07-01

Researchers determined a need to develop an instrument to assess the vending machine environment that was comparably reliable and valid to other Nutrition Environment Measures Survey tools and that would provide consistent and comparable data for businesses, schools, and communities. Tool development, reliability testing, and dissemination of the Nutrition Environment Measures Survey-Vending (NEMS-V) involved a collaboration of students, professionals, and community leaders. Interrater reliability testing showed high levels of agreement among trained raters on the products and evaluations of products. NEMS-V can benefit public health partners implementing policy and environmental change initiatives as a part of their community wellness activities. The vending machine project will support a policy calling for state facilities to provide a minimum of 30% of foods and beverages in vending machines as healthy options, based on NEMS-V criteria, which will be used as a model for other businesses.

Measurements of Weight Bearing Asymmetry Using the Nintendo Wii Fit Balance Board Are Not Reliable for Older Adults and Individuals With Stroke

PubMed Central

Liuzzo, Derek M.; Peters, Denise M.; Middleton, Addie; Lanier, Wes; Chain, Rebecca; Barksdale, Brittany; Fritz, Stacy L.

2015-01-01

Background Clinicians and researchers have used bathroom scales, balance performance monitors with feedback, postural scale analysis, and force platforms to evaluate weight bearing asymmetry (WBA). Now video game consoles offer a novel alternative for assessing this construct. By using specialized software, the Nintendo Wii Fit balance board can provide reliable measurements of WBA in healthy, young adults. However, reliability of measurements obtained using only the factory settings to assess WBA in older adults and individuals with stroke has not been established. Purpose To determine whether measurements of WBA obtained using the Nintendo Wii Fit balance board and default settings are reliable in older adults and individuals with stroke. Methods Weight bearing asymmetry was assessed using the Nintendo Wii Fit balance board in 2 groups of participants—individuals older than 65 years (n = 41) and individuals with stroke (n = 41). Participants were given a standardized set of instructions and were not provided auditory or visual feedback. Two trials were performed. Intraclass correlation coefficients (ICC), standard error of measure (SEM), and minimal detectable change (MDC) scores were determined for each group. Results The ICC for the older adults sample was 0.59 (0.35–0.76) with SEM95= 6.2% and MDC95= 8.8%. The ICC for the sample including individuals with stroke was 0.60 (0.47–0.70) with SEM95= 9.6% and MDC95= 13.6%. Discussion Although measurements of WBA obtained using the Nintendo Wii Fit balance board, and its default factory settings, demonstrate moderate reliability in older adults and individuals with stroke, the relatively high associated SEM and MDC values substantially reduce the clinical utility of the Nintendo Wii Fit balance board as an assessment tool for WBA. Conclusions Weight bearing asymmetry cannot be measured reliably in older adults and individuals with stroke using the Nintendo Wii Fit balance board without the use of specialized
Reliability of fish size estimates obtained from multibeam imaging sonar

USGS Publications Warehouse

Hightower, Joseph E.; Magowan, Kevin J.; Brown, Lori M.; Fox, Dewayne A.

2013-01-01

Multibeam imaging sonars have considerable potential for use in fisheries surveys because the video-like images are easy to interpret, and they contain information about fish size, shape, and swimming behavior, as well as characteristics of occupied habitats. We examined images obtained using a dual-frequency identification sonar (DIDSON) multibeam sonar for Atlantic sturgeon Acipenser oxyrinchus oxyrinchus, striped bass Morone saxatilis, white perch M. americana, and channel catfish Ictalurus punctatus of known size (20–141 cm) to determine the reliability of length estimates. For ranges up to 11 m, percent measurement error (sonar estimate – total length)/total length × 100 varied by species but was not related to the fish's range or aspect angle (orientation relative to the sonar beam). Least-square mean percent error was significantly different from 0.0 for Atlantic sturgeon (x̄ = −8.34, SE = 2.39) and white perch (x̄ = 14.48, SE = 3.99) but not striped bass (x̄ = 3.71, SE = 2.58) or channel catfish (x̄ = 3.97, SE = 5.16). Underestimating lengths of Atlantic sturgeon may be due to difficulty in detecting the snout or the longer dorsal lobe of the heterocercal tail. White perch was the smallest species tested, and it had the largest percent measurement errors (both positive and negative) and the lowest percentage of images classified as good or acceptable. Automated length estimates for the four species using Echoview software varied with position in the view-field. Estimates tended to be low at more extreme azimuthal angles (fish's angle off-axis within the view-field), but mean and maximum estimates were highly correlated with total length. Software estimates also were biased by fish images partially outside the view-field and when acoustic crosstalk occurred (when a fish perpendicular to the sonar and at relatively close range is detected in the side lobes of adjacent beams). These sources of
Reliability, agreement, and validity of digital weighing scale with MatScan in limb load measurement.

PubMed

Kumar, Senthil N S; Omar, Baharudin; Htwe, Ohnmar; Joseph, Leonard H; Krishnan, Jagannathan; Jafarzedah Esfehani, Ali; Min, Lee L

2014-01-01

Limb loading measurements serve as an objective evaluation of asymmetrical weight bearing in the lower limb. Digital weighing scales (DWSs) could be used in clinical settings for measurement of static limb loading. However, ambiguity exists whether limb loading measurements of DWSs are comparable with a standard tool such as MatScan. A cross-sectional study composed of 33 nondisabled participants was conducted to investigate the reliability, agreement, and validity of DWSs with MatScan in static standing. Amounts of weight distribution and plantar pressure on the individual lower limb were measured using two DWSs (A, B) and MatScan during eyes open (EO) and eyes closed (EC) conditions. The results showed that intra- and interrater reliability (3, 1) were excellent (0.94-0.97) within and between DWS A and B. Bland-Altman plot revealed good agreement between DWS and MatScan in EO and EC conditions. The area under the receiver operating characteristic curve was significant and identified as 0.68 (p = 0.01). The measurements obtained with DWSs are valid and in agreement with MatScan measurements. Hence, DWSs could be used interchangeably with MatScan and could provide clinicians an objective measurement of limb loading suitable for clinical settings.
Reliability measurement during software development. [for a multisensor tracking system

NASA Technical Reports Server (NTRS)

Hecht, H.; Sturm, W. A.; Trattner, S.

1977-01-01

During the development of data base software for a multi-sensor tracking system, reliability was measured. The failure ratio and failure rate were found to be consistent measures. Trend lines were established from these measurements that provided good visualization of the progress on the job as a whole as well as on individual modules. Over one-half of the observed failures were due to factors associated with the individual run submission rather than with the code proper. Possible application of these findings for line management, project managers, functional management, and regulatory agencies is discussed. Steps for simplifying the measurement process and for use of these data in predicting operational software reliability are outlined.
Reliability measures in item response theory: manifest versus latent correlation functions.

PubMed

Milanzi, Elasma; Molenberghs, Geert; Alonso, Ariel; Verbeke, Geert; De Boeck, Paul

2015-02-01

For item response theory (IRT) models, which belong to the class of generalized linear or non-linear mixed models, reliability at the scale of observed scores (i.e., manifest correlation) is more difficult to calculate than latent correlation based reliability, but usually of greater scientific interest. This is not least because it cannot be calculated explicitly when the logit link is used in conjunction with normal random effects. As such, approximations such as Fisher's information coefficient, Cronbach's α, or the latent correlation are calculated, allegedly because it is easy to do so. Cronbach's α has well-known and serious drawbacks, Fisher's information is not meaningful under certain circumstances, and there is an important but often overlooked difference between latent and manifest correlations. Here, manifest correlation refers to correlation between observed scores, while latent correlation refers to correlation between scores at the latent (e.g., logit or probit) scale. Thus, using one in place of the other can lead to erroneous conclusions. Taylor series based reliability measures, which are based on manifest correlation functions, are derived and a careful comparison of reliability measures based on latent correlations, Fisher's information, and exact reliability is carried out. The latent correlations are virtually always considerably higher than their manifest counterparts, Fisher's information measure shows no coherent behaviour (it is even negative in some cases), while the newly introduced Taylor series based approximations reflect the exact reliability very closely. Comparisons among the various types of correlations, for various IRT models, are made using algebraic expressions, Monte Carlo simulations, and data analysis. Given the light computational burden and the performance of Taylor series based reliability measures, their use is recommended. © 2014 The British Psychological Society.
Obtaining reliable phase-gradient delays from otoacoustic emission data.

PubMed

Shera, Christopher A; Bergevin, Christopher

2012-08-01

Reflection-source otoacoustic emission phase-gradient delays are widely used to obtain noninvasive estimates of cochlear function and properties, such as the sharpness of mechanical tuning and its variation along the length of the cochlear partition. Although different data-processing strategies are known to yield different delay estimates and trends, their relative reliability has not been established. This paper uses in silico experiments to evaluate six methods for extracting delay trends from reflection-source otoacoustic emissions (OAEs). The six methods include both previously published procedures (e.g., phase smoothing, energy-weighting, data exclusion based on signal-to-noise ratio) and novel strategies (e.g., peak-picking, all-pass factorization). Although some of the methods perform well (e.g., peak-picking), others introduce substantial bias (e.g., phase smoothing) and are not recommended. In addition, since standing waves caused by multiple internal reflection can complicate the interpretation and compromise the application of OAE delays, this paper develops and evaluates two promising signal-processing strategies, the first based on time-frequency filtering using the continuous wavelet transform and the second on cepstral analysis, for separating the direct emission from its subsequent reflections. Altogether, the results help to resolve previous disagreements about the frequency dependence of human OAE delays and the sharpness of cochlear tuning while providing useful analysis methods for future studies.
High day-to-day reliability in lower leg volume measured by water displacement.

PubMed

Pasley, Jeffrey D; O'Connor, Patrick J

2008-07-01

The day-to-day reliability of lower leg volume is poorly documented. This investigation determined the day-to-day reliability of lower leg volume (soleus and gastrocnemius) measured using water displacement. Thirty young adults (15 men and 15 women) had their right lower leg volume measured by water displacement on five separate occasions. The participants performed normal activities of daily living and were measured at the same time of day after being seated for 30 min. The results revealed a high day-to-day reliability for lower leg volume. The mean percentage change in lower leg volume across days compared to day 1 ranged between 0 and 0.37%. The mean within subjects coefficient of variation in lower leg volume was 0.72% and the coefficient of variation for the entire sample across days ranged from 5.66 to 6.32%. A two way mixed model intraclass correlation (30 subjects x 5 days) showed that the lower leg volume measurement was highly reliable (ICC = 0.972). Foot and total lower leg volumes showed similarly high reliability. Water displacement offers a cost effective and reliable solution for the measurement of lower leg edema across days.
Reliability and validity of the Performance Recorder 1 for measuring isometric knee flexor and extensor strength.

PubMed

Neil, Sarah E; Myring, Alec; Peeters, Mon Jef; Pirie, Ian; Jacobs, Rachel; Hunt, Michael A; Garland, S Jayne; Campbell, Kristin L

2013-11-01

Muscular strength is a key parameter of rehabilitation programs and a strong predictor of functional capacity. Traditional methods to measure strength, such as manual muscle testing (MMT) and hand-held dynamometry (HHD), are limited by the strength and experience of the tester. The Performance Recorder 1 (PR1) is a strength assessment tool attached to resistance training equipment and may be a time- and cost-effective tool to measure strength in clinical practice that overcomes some limitations of MMT and HHD. However, reliability and validity of the PR1 have not been reported. Test-retest and inter-rater reliability was assessed using the PR1 in healthy adults (n = 15) during isometric knee flexion and extension. Criterion-related validity was assessed through comparison of values obtained from the PR1 and Biodex® isokinetic dynamometer. Test-retest reliability was excellent for peak knee flexion (intra-class correlation coefficient [ICC] of 0.96, 95% CI: 0.85, 0.99) and knee extension (ICC = 0.96, 95% CI: 0.87, 0.99). Inter-rater reliability was also excellent for peak knee flexion (ICC = 0.95, 95% CI: 0.85, 0.99) and peak knee extension (ICC = 0.97, 95% CI: 0.91, 0.99). Validity was moderate for peak knee flexion (ICC = 0.75, 95% CI: 0.38, 0.92) but poor for peak knee extension (ICC = 0.37, 95% CI: 0, 0.73). The PR1 provides a reliable measure of isometric knee flexor and extensor strength in healthy adults that could be used in the clinical setting, but absolute values may not be comparable to strength assessment by gold-standard measures.
Ultrasound virtual endoscopy: Polyp detection and reliability of measurement in an in vitro study with pig intestine specimens

PubMed Central

Liu, Jin-Ya; Chen, Li-Da; Cai, Hua-Song; Liang, Jin-Yu; Xu, Ming; Huang, Yang; Li, Wei; Feng, Shi-Ting; Xie, Xiao-Yan; Lu, Ming-De; Wang, Wei

2016-01-01

AIM: To present our initial experience regarding the feasibility of ultrasound virtual endoscopy (USVE) and its measurement reliability for polyp detection in an in vitro study using pig intestine specimens. METHODS: Six porcine intestine specimens containing 30 synthetic polyps underwent USVE, computed tomography colonography (CTC) and optical colonoscopy (OC) for polyp detection. The polyp measurement defined as the maximum polyp diameter on two-dimensional (2D) multiplanar reformatted (MPR) planes was obtained by USVE, and the absolute measurement error was analyzed using the direct measurement as the reference standard. RESULTS: USVE detected 29 (96.7%) of 30 polyps, remaining a 7-mm one missed. There was one false-positive finding. Twenty-six (89.7%) of 29 reconstructed images were clearly depicted, while 29 (96.7%) of 30 polyps were displayed on CTC with one false-negative finding. In OC, all the polyps were detected. The intraclass correlation coefficient was 0.876 (95%CI: 0.745-0.940) for measurements obtained with USVE. The pooled absolute measurement errors ± the standard deviations of the depicted polyps with actual sizes ≤ 5 mm, 6-9 mm, and ≥ 10 mm were 1.9 ± 0.8 mm, 0.9 ± 1.2 mm, and 1.0 ± 1.4 mm, respectively. CONCLUSION: USVE is reliable for polyp detection and measurement in in vitro study. PMID:27022217
A Study on the Reliability of Sasang Constitutional Body Trunk Measurement

PubMed Central

Jang, Eunsu; Kim, Jong Yeol; Lee, Haejung; Kim, Honggie; Baek, Younghwa; Lee, Siwoo

2012-01-01

Objective. Body trunk measurement for human plays an important diagnostic role not only in conventional medicine but also in Sasang constitutional medicine (SCM). The Sasang constitutional body trunk measurement (SCBTM) consists of the 5-widths and the 8-circumferences which are standard locations currently employed in the SCM society. This study suggests to what extent a comprehensive training can improve the reliability of the SCBTM. Methods. We recruited 10 male subjects and 5 male observers with no experience of anthropometric measurement. We conducted measurements twice before and after a comprehensive training. Relative technical error of measurement (%TEMs) was produced to assess intra and inter observer reliabilities. Results. Post-training intra-observer %TEMs of the SCBTM were 0.27% to 1.85% reduced from 0.27% to 6.26% in pre-training, respectively. Post-training inter-observer %TEMs of those were 0.56% to 1.66% reduced from 1.00% to 9.60% in pre-training, respectively. Post-training % total TEMs which represent the whole reliability were 0.68% to 2.18% reduced from maximum value of 10.18%. Conclusion. A comprehensive training makes the SCBTM more reliable, hence giving a sufficiently confident diagnostic tool. It is strongly recommended to give a comprehensive training in advance to take the SCBTM. PMID:21822442
Reliability of quadriceps surface electromyography measurements is improved by two vs. single site recordings.

PubMed

Balshaw, T G; Fry, A; Maden-Wilkinson, T M; Kong, P W; Folland, J P

2017-06-01

The reliability of surface electromyography (sEMG) is typically modest even with rigorous methods, and therefore further improvements in sEMG reliability are desirable. This study compared the between-session reliability (both within participant absolute reliability and between-participant relative reliability) of sEMG amplitude from single vs. average of two distinct recording sites, for individual muscle (IM) and whole quadriceps (WQ) measures during voluntary and evoked contractions. Healthy males (n = 20) performed unilateral isometric knee extension contractions: voluntary maximum and submaximum (60%), as well as evoked twitch contractions on two separate days. sEMG was recorded from two distinct sites on each superficial quadriceps muscle. Averaging two recording sites vs. using single site measures improved reliability for IM and WQ measurements during voluntary (16-26% reduction in within-participant coefficient of variation, CV W ) and evoked contractions (40-56% reduction in CV W ). For sEMG measurements from large muscles, averaging the recording of two distinct sites is recommended as it improves within-participant reliability. This improved sensitivity has application to clinical and research measurement of sEMG amplitude.
Reliability of anthropometric measurements in European preschool children: the ToyBox-study.

PubMed

De Miguel-Etayo, P; Mesana, M I; Cardon, G; De Bourdeaudhuij, I; Góźdź, M; Socha, P; Lateva, M; Iotova, V; Koletzko, B V; Duvinage, K; Androutsos, O; Manios, Y; Moreno, L A

2014-08-01

The ToyBox-study aims to develop and test an innovative and evidence-based obesity prevention programme for preschoolers in six European countries: Belgium, Bulgaria, Germany, Greece, Poland and Spain. In multicentre studies, anthropometric measurements using standardized procedures that minimize errors in the data collection are essential to maximize reliability of measurements. The aim of this paper is to describe the standardization process and reliability (intra- and inter-observer) of height, weight and waist circumference (WC) measurements in preschoolers. All technical procedures and devices were standardized and centralized training was given to the fieldworkers. At least seven children per country participated in the intra- and inter-observer reliability testing. Intra-observer technical error ranged from 0.00 to 0.03 kg for weight and from 0.07 to 0.20 cm for height, with the overall reliability being above 99%. A second training was organized for WC due to low reliability observed in the first training. Intra-observer technical error for WC ranged from 0.12 to 0.71 cm during the first training and from 0.05 to 1.11 cm during the second training, and reliability above 92% was achieved. Epidemiological surveys need standardized procedures and training of researchers to reduce measurement error. In the ToyBox-study, very good intra- and-inter-observer agreement was achieved for all anthropometric measurements performed. © 2014 World Obesity.
Estimating Measures of Pass-Fail Reliability from Parallel Half-Tests.

ERIC Educational Resources Information Center

Woodruff, David J.; Sawyer, Richard L.

Two methods for estimating measures of pass-fail reliability are derived, by which both theta and kappa may be estimated from a single test administration. The methods require only a single test administration and are computationally simple. Both are based on the Spearman-Brown formula for estimating stepped-up reliability. The non-distributional…
Reliability of cervical lordosis measurement techniques on long-cassette radiographs.

PubMed

Janusz, Piotr; Tyrakowski, Marcin; Yu, Hailong; Siemionow, Kris

2016-11-01

Lateral radiographs are commonly used to assess cervical sagittal alignment. Three assessment methods have been described and are commonly utilized in clinical practice. These methods are described for perfect lateral cervical radiographs, however in everyday practice radiograph quality varies. The aim of this study was to compare the reliability and reproducibility of 3 cervical lordosis (CL) measurement methods. Forty-four standing lateral radiographs were randomly chosen from a lateral long-cassette radiograph database. Measurements of CL were performed with: Cobb method C2-C7 (CM), C2-C7 posterior tangent method (PTM), sum of posterior tangent method for each segment (SPTM). Three independent orthopaedic surgeons measured CL using the three methods on 44 lateral radiographs. One researcher used the three methods to measured CL three times at 4-week time intervals. Agreement between the methods as well as their intra- and interobserver reliability were tested and quantified by intraclass correlation coefficient (ICC) and median error for a single measurement (SEM). ICC of 0.75 or more reflected an excellent agreement/reliability. The results were compared with repeated ANOVA test, with p < 0.05 considered as significant. All methods revealed excellent intra- and interobserver reliability. Agreement (ICC, SEM) between three methods was (0.89°, 3.44°), between CM and SPTM was (0.82°, 4.42°), between CM and PTM was (0.80°, 4.80°) and between PTM and SPTM was (0.99°, 1.10°). Mean values CL for a CM, PTM, SPTM were 10.5° ± 13.9°, 17.5° ± 15.6° and 17.7° ± 15.9° (p < 0.0001), respectively. The significant difference was between CM vs PTM (p < 0.0001) and CM vs SPTM (p < 0.0001), but not between PTM vs SPTM (p > 0.05). All three methods appeared to be highly reliable. Although, high agreement between all measurement methods was shown, we do not recommend using Cobb measurement method interchangeably with PTM or SPTM within a single
Relating Measurement Invariance, Cross-Level Invariance, and Multilevel Reliability.

PubMed

Jak, Suzanne; Jorgensen, Terrence D

2017-01-01

Data often have a nested, multilevel structure, for example when data are collected from children in classrooms. This kind of data complicate the evaluation of reliability and measurement invariance, because several properties can be evaluated at both the individual level and the cluster level, as well as across levels. For example, cross-level invariance implies equal factor loadings across levels, which is needed to give latent variables at the two levels a similar interpretation. Reliability at a specific level refers to the ratio of true score variance over total variance at that level. This paper aims to shine light on the relation between reliability, cross-level invariance, and strong factorial invariance across clusters in multilevel data. Specifically, we will illustrate how strong factorial invariance across clusters implies cross-level invariance and perfect reliability at the between level in multilevel factor models.
A Model for Estimating the Reliability and Validity of Criterion-Referenced Measures.

ERIC Educational Resources Information Center

Edmonston, Leon P.; Randall, Robert S.

A decision model designed to determine the reliability and validity of criterion referenced measures (CRMs) is presented. General procedures which pertain to the model are discussed as to: Measures of relationship, Reliability, Validity (content, criterion-oriented, and construct validation), and Item Analysis. The decision model is presented in…
Reliability and Validity of Finger Strength and Endurance Measurements in Rock Climbing

ERIC Educational Resources Information Center

Michailov, Michail Lubomirov; Baláš, Jirí; Tanev, Stoyan Kolev; Andonov, Hristo Stoyanov; Kodejška, Jan; Brown, Lee

2018-01-01

Purpose: An advanced system for the assessment of climbing-specific performance was developed and used to: (a) investigate the effect of arm fixation (AF) on construct validity evidence and reliability of climbing-specific finger-strength measurement; (b) assess reliability of finger-strength and endurance measurements; and (c) evaluate the…
Ultrasonographic measurement of the acromiohumeral distance in spinal cord injury: Reliability and effects of shoulder positioning.

PubMed

Lin, Yen-Sheng; Boninger, Michael L; Day, Kevin A; Koontz, Alicia M

2015-11-01

To investigate the reliability of ultrasonographic measurement of acromiohumeral distance (AHD) and the effects of shoulder positioning on AHD among manual wheelchair users (MWUs) with spinal cord injury (SCI) and an able-bodied control group. Ten MWUs with SCI and 10 able-bodied subjects participated in this study. The ultrasonographic measurements of AHD from each subject were obtained by two raters during passive and active scapular plane arm elevation in neutral, 45°, 90° with and without resistance and in a weight relief raise position. The measurements were recorded again by each rater using the same procedures after a 30-minute time interval. All raters were blinded to each other's measurements. University Laboratories and Veteran Affairs Healthcare System. Intra-rater (intraclass correlation coefficient, ICC > 0.83) and inter-rater (ICC > 0.78) reliability was excellent for both the MWUs with SCI and able-bodied groups across all arm positions except for the 45° position in the control group for one of the raters (intra-rater: ICC < 0.40 and inter-rater: ICC < 0.60). AHD significantly reduced when the shoulder was in the 90° arm elevated positions with or without resistance. Findings from our study demonstrated that ultrasonography is a reliable means to evaluate AHD in both able bodied and individuals with SCI, who are known to have significant shoulder pathology. This technique could be used to develop reference measures and to identify changes in AHD caused by interventions.
Between-day reliability of time-to-contact measures used to assess postural stability.

PubMed

Wheat, Jonathan S; Haddad, Jeffrey M; Scaife, Robert

2012-02-01

Traditional measures of postural stability consider movement of the center of pressure (COP) or the center of mass (COM) without regard to the boundary of support (BOS). A potentially more appropriate measure is postural time-to-contact (TtC) which defines the spatio-temporal proximity of the COM or COP to the BOS. Given the increasing popularity of TtC measures, it is important to determine their reliability. Therefore, the purpose of this study was to determine the effects of the number of trials and trial duration on the reliability of postural TtC measures. COP data were collected (100 Hz) in 16 young healthy participants during 10 trials (60-s duration) of quiet standing with eyes open on two occasions - seven days apart. Postural TtC of each trial was calculated using two different methods. The intersession reliability of the TtC measures was assessed by calculating between session intraclass correlation coefficients (ICC(2,1)) using different combinations of the number of trials (1-10) and trial duration (10, 20, 30, 40, 50 and 60s). Both TtC methods were very reliable. Additionally, both measures of TtC were more reliable than the standard deviation of the anterior-posterior COP and slightly more reliable than path length. This difference was most pronounced when fewer and shorter trials were used. Copyright © 2011 Elsevier B.V. All rights reserved.
Comparison of Temperature Measurements in the Middle Atmosphere by Satellite with Profiles Obtained by Meteorological Rockets

NASA Technical Reports Server (NTRS)

Goldberg, Richard A.; Schmidlin, Francis J.; Feofilov, Artem; Bedrick, M.; Rose, R. Lynn

2012-01-01

Measurements using the inflatable falling sphere technique have occasionally been used to obtain temperature results from density data and thereby provide comparison with temperature profiles obtained by satellite sounders in the mesosphere and stratosphere. To insure density measurements within narrow time frames and close in space, the inflatable falling sphere is launched within seconds of the nearly overhead satellite pass. Sphere measurements can be used to validate remotely measured temperatures but also have the advantage of measuring small-scale atmospheric features. Even so, with the dearth of remaining falling spheres available (the manufacture of these systems has been discontinued), it may be time to consider whether the remote measurements are mature enough to stand alone. Three field studies are considered, one in 2003 from Northern Sweden, and two in 2010 from the vicinity of Kwajalein Atoll in the South Pacific and from Barking Sands, Hawaii. All three sites are used to compare temperature retrievals between satellite and in situ falling spheres. The major satellite instruments employed are SABER, MLS, and AIRS. The comparisons indicate that remotely measured temperatures mimic the sphere temperature measurements quite well. The data also confirm that satellite retrievals, while not always at the exact location required for detailed studies in space and time, compare sufficiently well to be highly useful. Although the falling sphere will provide a measurement at a specific location and time, satellites only pass a given location daily or less frequently. This report reveals that averaged satellite measurements can provide temperatures and densities comparable to those obtained from the falling sphere, thereby providing a reliable measure of global temperature

Reliability of Performance-Based Clinical Measurements to Assess Shoulder Girdle Kinematics and Positioning: Systematic Review.

PubMed

D'hondt, Norman E; Kiers, Henri; Pool, Jan J M; Hacquebord, Sijmen T; Terwee, Caroline B; Veeger, Dirkjan H E J

2017-01-01

Deviant shoulder girdle movement is suggested as an eminent factor in the etiology of shoulder pain. Reliable measurements of shoulder girdle kinematics are a prerequisite for optimizing clinical management strategies. The purpose of this study was to evaluate the reliability, measurement error, and internal consistency of measurements with performance-based clinical tests for shoulder girdle kinematics and positioning in patients with shoulder pain. The MEDLINE, Embase, CINAHL, and SPORTDiscus databases were systematically searched from inception to August 2015. Articles published in Dutch, English, or German were included if they involved the evaluation of at least one of the measurement properties of interest. Two reviewers independently evaluated the methodological quality per studied measurement property with the 4-point-rating scale of the COSMIN (COnsensus-based Standards for the selection of health Measurement INstruments) checklist, extracted data, and assessed the adequacy of the measurement properties. Forty studies comprising more than 30 clinical tests were included. Actual reported measurements of the tests were categorized into: (1) positional measurement methods, (2) measurement methods to determine dynamic characteristics, and (3) tests to diagnose impairments of shoulder girdle function. Best evidence synthesis of the tests was performed per measurement for each measurement property. All studies had significant limitations, including incongruence between test description and actual reported measurements and a lack of reporting on minimal important change. In general, the methodological quality of the selected studies was fair to poor. High-quality evidence indicates that measurements obtained with the Modified Scapular Assistance Test are not reliable for clinical use. Sound recommendations for the use of other tests could not be made due to inadequate evidence. Across studies, diversity in description, performance, and interpretation of similar
Reliability of Scores Obtained from Self-, Peer-, and Teacher-Assessments on Teaching Materials Prepared by Teacher Candidates

ERIC Educational Resources Information Center

Nalbantoglu Yilmaz, Funda

2017-01-01

This study aims to determine the reliability of scores obtained from self-, peer-, and teacher-assessments in terms of teaching materials prepared by teacher candidates. The study group of this research constitutes 56 teacher candidates. In the scope of research, teacher candidates were asked to develop teaching material related to their study.…
Reliability of heart rate measures during walking before and after running maximal efforts.

PubMed

Boullosa, D A; Barros, E S; del Rosso, S; Nakamura, F Y; Leicht, A S

2014-11-01

Previous studies on HR recovery (HRR) measures have utilized the supine and the seated postures. However, the most common recovery mode in sport and clinical settings after running exercise is active walking. The aim of the current study was to examine the reliability of HR measures during walking (4 km · h(-1)) before and following a maximal test. Twelve endurance athletes performed an incremental running test on 2 days separated by 48 h. Absolute (coefficient of variation, CV, %) and relative [Intraclass correlation coefficient, (ICC)] reliability of time domain and non-linear measures of HR variability (HRV) from 3 min recordings, and HRR parameters over 5 min were assessed. Moderate to very high reliability was identified for most HRV indices with short-term components of time domain and non-linear HRV measures demonstrating the greatest reliability before (CV: 12-22%; ICC: 0.73-0.92) and after exercise (CV: 14-32%; ICC: 0.78-0.91). Most HRR indices and parameters of HRR kinetics demonstrated high to very high reliability with HR values at a given point and the asymptotic value of HR being the most reliable (CV: 2.5-10.6%; ICC: 0.81-0.97). These findings demonstrate these measures as reliable tools for the assessment of autonomic control of HR during walking before and after maximal efforts. © Georg Thieme Verlag KG Stuttgart · New York.
Reliability and concurrent validity of postural asymmetry measurement in adolescent idiopathic scoliosis

PubMed Central

Prowse, Ashleigh; Aslaksen, Berit; Kierkegaard, Marie; Furness, James; Gerdhem, Paul; Abbott, Allan

2017-01-01

AIM To investigate the reliability and concurrent validity of the Baseline® Body Level/Scoliosis meter for adolescent idiopathic scoliosis postural assessment in three anatomical planes. METHODS This is an observational reliability and concurrent validity study of adolescent referrals to the Orthopaedic department for scoliosis screening at Karolinska University Hospital, Stockholm, Sweden between March-May 2012. A total of 31 adolescents with idiopathic scoliosis (13.6 ± 0.6 years old) of mild-moderate curvatures (25° ± 12°) were consecutively recruited. Measurement of cervical, thoracic and lumbar curvatures, pelvic and shoulder tilt, and axial thoracic rotation (ATR) were performed by two trained physiotherapists in one day. The intraclass correlation coefficient (ICC) was used to determine the inter-examiner reliability (ICC2,1) and the intra-rater reliability (ICC3,3) of the Baseline® Body Level/Scoliosis meter. Spearman’s correlation analyses were used to estimate concurrent validity between the Baseline® Body Level/Scoliosis meter and Gold Standard Cobb angles from radiographs and the Orthopaedic Systems Inc. Scoliometer. RESULTS There was excellent reliability between examiners for thoracic kyphosis (ICC2,1 = 0.94), ATR (ICC2,1 = 0.92) and lumbar lordosis (ICC2,1 = 0.79). There was adequate reliability between examiners for cervical lordosis (ICC2,1 = 0.51), however poor reliability for pelvic and shoulder tilt. Both devices were reproducible in the measurement of ATR when repeated by one examiner (ICC3,3 0.98-1.00). The device had a good correlation with the Scoliometer (rho = 0.78). When compared with Cobb angle from radiographs, there was a moderate correlation for ATR (rho = 0.627). CONCLUSION The Baseline® Body Level/Scoliosis meter provides reliable transverse and sagittal cervical, thoracic and lumbar measurements and valid transverse plan measurements of mild-moderate scoliosis deformity. PMID:28144582
Reliability and concurrent validity of postural asymmetry measurement in adolescent idiopathic scoliosis.

PubMed

Prowse, Ashleigh; Aslaksen, Berit; Kierkegaard, Marie; Furness, James; Gerdhem, Paul; Abbott, Allan

2017-01-18

To investigate the reliability and concurrent validity of the Baseline ® Body Level/Scoliosis meter for adolescent idiopathic scoliosis postural assessment in three anatomical planes. This is an observational reliability and concurrent validity study of adolescent referrals to the Orthopaedic department for scoliosis screening at Karolinska University Hospital, Stockholm, Sweden between March-May 2012. A total of 31 adolescents with idiopathic scoliosis (13.6 ± 0.6 years old) of mild-moderate curvatures (25° ± 12°) were consecutively recruited. Measurement of cervical, thoracic and lumbar curvatures, pelvic and shoulder tilt, and axial thoracic rotation (ATR) were performed by two trained physiotherapists in one day. The intraclass correlation coefficient (ICC) was used to determine the inter-examiner reliability (ICC2,1) and the intra-rater reliability (ICC3,3) of the Baseline ® Body Level/Scoliosis meter. Spearman's correlation analyses were used to estimate concurrent validity between the Baseline ® Body Level/Scoliosis meter and Gold Standard Cobb angles from radiographs and the Orthopaedic Systems Inc. Scoliometer. There was excellent reliability between examiners for thoracic kyphosis (ICC2,1 = 0.94), ATR (ICC2,1 = 0.92) and lumbar lordosis (ICC2,1 = 0.79). There was adequate reliability between examiners for cervical lordosis (ICC2,1 = 0.51), however poor reliability for pelvic and shoulder tilt. Both devices were reproducible in the measurement of ATR when repeated by one examiner (ICC3,3 0.98-1.00). The device had a good correlation with the Scoliometer (rho = 0.78). When compared with Cobb angle from radiographs, there was a moderate correlation for ATR (rho = 0.627). The Baseline ® Body Level/Scoliosis meter provides reliable transverse and sagittal cervical, thoracic and lumbar measurements and valid transverse plan measurements of mild-moderate scoliosis deformity.
Reliability of Two Smartphone Applications for Radiographic Measurements of Hallux Valgus Angles.

PubMed

Mattos E Dinato, Mauro Cesar; Freitas, Marcio de Faria; Milano, Cristiano; Valloto, Elcio; Ninomiya, André Felipe; Pagnano, Rodrigo Gonçalves

The objective of the present study was to assess the reliability of 2 smartphone applications compared with the traditional goniometer technique for measurement of radiographic angles in hallux valgus and the time required for analysis with the different methods. The radiographs of 31 patients (52 feet) with a diagnosis of hallux valgus were analyzed. Four observers, 2 with >10 years' experience in foot and ankle surgery and 2 in-training surgeons, measured the hallux valgus angle and intermetatarsal angle using a manual goniometer technique and 2 smartphone applications (Hallux Angles and iPinPoint). The interobserver and intermethod reliability were estimated using intraclass correlation coefficients (ICCs), and the time required for measurement of the angles among the 3 methods was compared using the Friedman test. A very good or good interobserver reliability was found among the 4 observers measuring the hallux valgus angle and intermetatarsal angle using the goniometer (ICC 0.913 and 0.821, respectively) and iPinPoint (ICC 0.866 and 0.638, respectively). Using the Hallux Angles application, a very good interobserver reliability was found for measurements of the hallux valgus angle (ICC 0.962) and intermetatarsal angle (ICC 0.935) only among the more experienced observers. The time required for the measurements was significantly shorter for the measurements using both smartphone applications compared with the goniometer method. One smartphone application (iPinPoint) was reliable for measurements of the hallux valgus angles by either experienced or nonexperienced observers. The use of these tools might save time in the evaluation of radiographic angles in the hallux valgus. Copyright © 2016 American College of Foot and Ankle Surgeons. Published by Elsevier Inc. All rights reserved.
Reliability of segmental accelerations measured using a new wireless gait analysis system.

PubMed

Kavanagh, Justin J; Morrison, Steven; James, Daniel A; Barrett, Rod

2006-01-01

The purpose of this study was to determine the inter- and intra-examiner reliability, and stride-to-stride reliability, of an accelerometer-based gait analysis system which measured 3D accelerations of the upper and lower body during self-selected slow, preferred and fast walking speeds. Eight subjects attended two testing sessions in which accelerometers were attached to the head, neck, lower trunk, and right shank. In the initial testing session, two different examiners attached the accelerometers and performed the same testing procedures. A single examiner repeated the procedure in a subsequent testing session. All data were collected using a new wireless gait analysis system, which features near real-time data transmission via a Bluetooth network. Reliability for each testing condition (4 locations, 3 directions, 3 speeds) was quantified using a waveform similarity statistic known as the coefficient of multiple determination (CMD). CMD's ranged from 0.60 to 0.98 across all test conditions and were not significantly different for inter-examiner (0.86), intra-examiner (0.87), and stride-to-stride reliability (0.86). The highest repeatability for the effect of location, direction and walking speed were for the shank segment (0.94), the vertical direction (0.91) and the fast walking speed (0.91), respectively. Overall, these results indicate that a high degree of waveform repeatability was obtained using a new gait system under test-retest conditions involving single and dual examiners. Furthermore, differences in acceleration waveform repeatability associated with the reapplication of accelerometers were small in relation to normal motor variability.
Ultrasound measures of tendon thickness: Intra-rater, Inter-rater and Inter-machine reliability.

PubMed

Del Baño-Aledo, María Elena; Martínez-Payá, Jacinto Javier; Ríos-Díaz, José; Mejías-Suárez, Silvia; Serrano-Carmona, Sergio; de Groot-Ferrando, Ana

2017-01-01

Ultrasound imaging is often used by physiotherapists and other healthcare professionals but the reliability of image acquisition with different ultrasound machines is unknown. The objective was to compare the intra-rater, inter-rater and intermachine reliability of thickness measurements of the plantar fascia (PF), Achilles tendon (AT), patellar tendon (PT) and elbow common extensor tendon (ECET) with musculoskeletal ultrasound imaging (MSUS). Tendon thickness was measured in four anatomical structures (14 participants, 28 images per tendon) by two sonographers and with two different ultrasound machines. Intraclass Correlation Coefficients (ICCs) and Bland-Altman plots were calculated. The standard error of measurement (SEM) and minimum detectable difference (MDD) were calculated. Inter-rater reliability was excellent for AT (ICC=0.98; 95% CI= 0.96-0.99) and very good for PT (ICC=0.85; 95% CI = 0.67-0.93) and ECET (ICC=0.81; 95% CI= 0.72-0.94). Reliability for PF was moderate, with an ICC of 0.63 (CI 95%= 0.20-0.83). Bland-Altman plot for inter-machine reliability showed a mean difference of 1 m for PF measurements and a mean difference of 4 m and 20 m for AT and PT. The relative SEMs were below 7% and the MDCs were below 0.7 mm. The MSUS reliability in measuring thickness of the four tendons is confirmed by the homogeneous readings intra sonographers, between operators and between different machines. Level of evidence: Tendon thickness can be measured reliably on different ultrasound devices, which is an important step forward in the use of this technique in daily clinical practice and research. III.
Reliability of ultrasound thickness measurement of the abdominal muscles during clinical isometric endurance tests.

PubMed

ShahAli, Shabnam; Arab, Amir Massoud; Talebian, Saeed; Ebrahimi, Esmaeil; Bahmani, Andia; Karimi, Noureddin; Nabavi, Hoda

2015-07-01

The study was designed to evaluate the intra-examiner reliability of ultrasound (US) thickness measurement of abdominal muscles activity when supine lying and during two isometric endurance tests in subjects with and without Low back pain (LBP). A total of 19 women (9 with LBP, 10 without LBP) participated in the study. Within-day reliability of the US thickness measurements at supine lying and the two isometric endurance tests were assessed in all subjects. The intra-class correlation coefficient (ICC) was used to assess the relative reliability of thickness measurement. The standard error of measurement (SEM), minimal detectable change (MDC) and the coefficient of variation (CV) were used to evaluate the absolute reliability. Results indicated high ICC scores (0.73-0.99) and also small SEM and MDC scores for within-day reliability assessment. The Bland-Altman plots of agreement in US measurement of the abdominal muscles during the two isometric endurance tests demonstrated that 95% of the observations fall between the limits of agreement for test and retest measurements. Together the results indicate high intra-tester reliability for the US measurement of the thickness of abdominal muscles in all the positions tested. According to the study's findings, US imaging can be used as a reliable method for assessment of abdominal muscles activity in supine lying and the two isometric endurance tests employed, in participants with and without LBP. Copyright © 2014 Elsevier Ltd. All rights reserved.
The Reliability of Lumbar Lordosis Measurements Using a Flexible-Rule.

DTIC Science & Technology

The purpose of this study was to examine the intra-rater and intra-rater reliability of lumbar lordosis measurements taken with a flexible-rule. Two...coefficients (ICC) were used to determine the degree of agreement between measurements. The results suggest that measurements of lumbar lordosis with a
Reliability and validity of the AutoCAD software method in lumbar lordosis measurement

PubMed Central

Letafatkar, Amir; Amirsasan, Ramin; Abdolvahabi, Zahra; Hadadnezhad, Malihe

2011-01-01

Objective The aim of this study was to determine the reliability and validity of the AutoCAD software method in lumbar lordosis measurement. Methods Fifty healthy volunteers with a mean age of 23 ± 1.80 years were enrolled. A lumbar lateral radiograph was taken on all participants, and the lordosis was measured according to the Cobb method. Afterward, the lumbar lordosis degree was measured via AutoCAD software and flexible ruler methods. The current study is accomplished in 2 parts: intratester and intertester evaluations of reliability as well as the validity of the flexible ruler and software methods. Results Based on the intraclass correlation coefficient, AutoCAD's reliability and validity in measuring lumbar lordosis were 0.984 and 0.962, respectively. Conclusions AutoCAD showed to be a reliable and valid method to measure lordosis. It is suggested that this method may replace those that are costly and involve health risks, such as radiography, in evaluating lumbar lordosis. PMID:22654681
Reliability and validity of the AutoCAD software method in lumbar lordosis measurement.

PubMed

Letafatkar, Amir; Amirsasan, Ramin; Abdolvahabi, Zahra; Hadadnezhad, Malihe

2011-12-01

The aim of this study was to determine the reliability and validity of the AutoCAD software method in lumbar lordosis measurement. Fifty healthy volunteers with a mean age of 23 ± 1.80 years were enrolled. A lumbar lateral radiograph was taken on all participants, and the lordosis was measured according to the Cobb method. Afterward, the lumbar lordosis degree was measured via AutoCAD software and flexible ruler methods. The current study is accomplished in 2 parts: intratester and intertester evaluations of reliability as well as the validity of the flexible ruler and software methods. Based on the intraclass correlation coefficient, AutoCAD's reliability and validity in measuring lumbar lordosis were 0.984 and 0.962, respectively. AutoCAD showed to be a reliable and valid method to measure lordosis. It is suggested that this method may replace those that are costly and involve health risks, such as radiography, in evaluating lumbar lordosis.
A new approach to the measurement of pelvic asymmetry: proposed methods and reliability.

PubMed

Gnat, Rafael; Biały, Maciej

2015-05-01

This is a methodological study presenting a novel method of pelvic asymmetry (PA) measurement for use in the research laboratory setting. The purpose of the study is (1) to establish intrarater and interrater reliability of the proposed measures of PA, (2) to verify the influence of repeated measurements on the reliability, and (3) to assess correlation between the proposed measures of PA. Twelve healthy volunteers participated, and 2 teams of raters were involved. Registration of anatomic landmarks' positions in the optical motion capture system was repeated 3 times. Two asymmetry indexes were calculated: for pelvic torsion and for lateral pelvic tilt. Interclass correlation coefficients (ICCs), standard errors of measurement, and smallest detectable differences were used to describe the intrarater and interrater reliability of the 2 indexes. After 2 repeated registrations of pelvic landmarks' positions, the reliability of our asymmetry indexes was good and excellent. The ICCs for intrarater reliability ranged from 0.96 to 0.97; the ICCs for interrater reliability ranged 0.81 to 0.90. There was moderate, nonsignificant correlation between asymmetry indexes for pelvis torsion and for lateral pelvic tilt (r = 0.45, P = .14). The 2 proposed asymmetry indexes showed good and excellent intrarater and interrater reliability after 2 repeated registrations of pelvic landmarks' positions and thus may be useful in the research laboratory setting. However, these indexes are not strongly correlated, which suggests that the 2 types of PA may constitute different clinical entities. Copyright © 2015 National University of Health Sciences. Published by Elsevier Inc. All rights reserved.
A digital photographic measurement method for quantifying foot posture: validity, reliability, and descriptive data.

PubMed

Cobb, Stephen C; James, C Roger; Hjertstedt, Matthew; Kruk, James

2011-01-01

Although abnormal foot posture long has been associated with lower extremity injury risk, the evidence is equivocal. Poor intertester reliability of traditional foot measures might contribute to the inconsistency. To investigate the validity and reliability of a digital photographic measurement method (DPMM) technology, the reliability of DPMM-quantified foot measures, and the concurrent validity of the DPMM with clinical-measurement methods (CMMs) and to report descriptive data for DPMM measures with moderate to high intratester and intertester reliability. Descriptive laboratory study. Biomechanics research laboratory. A total of 159 people participated in 3 groups. Twenty-eight people (11 men, 17 women; age = 25 ± 5 years, height = 1.71 ± 0.10 m, mass = 77.6 ± 17.3 kg) were recruited for investigation of intratester and intertester reliability of the DPMM technology; 20 (10 men, 10 women; age = 24 ± 2 years, height = 1.71 ± 0.09 m, mass = 76 ± 16 kg) for investigation of DPMM and CMM reliability and concurrent validity; and 111 (42 men, 69 women; age = 22.8 ± 4.7 years, height = 168.5 ± 10.4 cm, mass = 69.8 ± 13.3 kg) for development of a descriptive data set of the DPMM foot measurements with moderate to high intratester and intertester reliabilities. The dimensions of 10 model rectangles and the 28 participants' feet were measured, and DPMM foot posture was measured in the 111 participants. Two clinicians assessed the DPMM and CMM foot measures of the 20 participants. Validity and reliability were evaluated using mean absolute and percentage errors and intraclass correlation coefficients. Descriptive data were computed from the DPMM foot posture measures. The DPMM technology intratester and intertester reliability intraclass correlation coefficients were 1.0 for each tester and variable. Mean absolute errors were equal to or less than 0.2 mm for the bottom and right-side variables and 0.1° for the
Reliability of anthropometric measurements in young male and female artistic gymnasts.

PubMed

Siatras, Theophanis; Skaperda, Malamati; Mameletzi, Dimitra

2010-12-01

Body dimensions and body composition of children participating in artistic activities, such as gymnastics and many types of dancing, are important factors in performance improvement. The present study aimed to determine the reliability of a series of selected anthropometric measurements in young male and female gymnasts. Segment lengths, body breadths, circumferences, and skinfold thickness were measured in 20 young gymnasts by the same experienced examiner, using portable and easy-to-use instruments. All parameters were measured twice (test-retest) under the same conditions within a week's period. The high intra-class correlation coefficient (ICC) values ranging from 0.87 to 0.99, as well as the low coefficient of variation (CV) values (<5.3%), affirmed that the selected measurements were highly reliable. The technical error of measurement (TEM) values for lengths and breadths were 0.15 to 0.80 cm, for circumferences 0.22 to 1 cm, and for skinfold thickness 0.33 to 0.58 mm. The high test-retest ICC and the low CV and TEM values confirmed the reliability of all anthropometric measurements in young artistic gymnasts. Therefore, these measurements could contribute to further research in this field of investigation, helping to monitor young artistic gymnasts' growth status and identify specific characteristics for increased performance in this sport.
Using LISREL to Evaluate Measurement Models and Scale Reliability.

ERIC Educational Resources Information Center

Fleishman, John; Benson, Jeri

1987-01-01

LISREL program was used to examine measurement model assumptions and to assess reliability of Coopersmith Self-Esteem Inventory for Children, Form B. Data on 722 third-sixth graders from over 70 schools in large urban school district were used. LISREL program assessed (1) nature of basic measurement model for scale, (2) scale invariance across…
A Digital Photographic Measurement Method for Quantifying Foot Posture: Validity, Reliability, and Descriptive Data

PubMed Central

Cobb, Stephen C.; James, C. Roger; Hjertstedt, Matthew; Kruk, James

2011-01-01

Abstract Context: Although abnormal foot posture long has been associated with lower extremity injury risk, the evidence is equivocal. Poor intertester reliability of traditional foot measures might contribute to the inconsistency. Objectives: To investigate the validity and reliability of a digital photographic measurement method (DPMM) technology, the reliability of DPMM-quantified foot measures, and the concurrent validity of the DPMM with clinical-measurement methods (CMMs) and to report descriptive data for DPMM measures with moderate to high intratester and intertester reliability. Design: Descriptive laboratory study. Setting: Biomechanics research laboratory. Patients or Other Participants: A total of 159 people participated in 3 groups. Twenty-eight people (11 men, 17 women; age = 25 ± 5 years, height = 1.71 ± 0.10 m, mass = 77.6 ± 17.3 kg) were recruited for investigation of intratester and intertester reliability of the DPMM technology; 20 (10 men, 10 women; age = 24 ± 2 years, height = 1.71 ± 0.09 m, mass = 76 ± 16 kg) for investigation of DPMM and CMM reliability and concurrent validity; and 111 (42 men, 69 women; age = 22.8 ± 4.7 years, height = 168.5 ± 10.4 cm, mass = 69.8 ± 13.3 kg) for development of a descriptive data set of the DPMM foot measurements with moderate to high intratester and intertester reliabilities. Intervention(s): The dimensions of 10 model rectangles and the 28 participants' feet were measured, and DPMM foot posture was measured in the 111 participants. Two clinicians assessed the DPMM and CMM foot measures of the 20 participants. Main Outcome Measure(s): Validity and reliability were evaluated using mean absolute and percentage errors and intraclass correlation coefficients. Descriptive data were computed from the DPMM foot posture measures. Results: The DPMM technology intratester and intertester reliability intraclass correlation coefficients were 1.0 for
Ultrasonographic measurement of the acromiohumeral distance in spinal cord injury: Reliability and effects of shoulder positioning

PubMed Central

Lin, Yen-Sheng; Boninger, Michael L.; Day, Kevin A.

2015-01-01

Objective To investigate the reliability of ultrasonographic measurement of acromiohumeral distance (AHD) and the effects of shoulder positioning on AHD among manual wheelchair users (MWUs) with spinal cord injury (SCI) and an able-bodied control group. Methods Ten MWUs with SCI and 10 able-bodied subjects participated in this study. The ultrasonographic measurements of AHD from each subject were obtained by two raters during passive and active scapular plane arm elevation in neutral, 45°, 90° with and without resistance and in a weight relief raise position. The measurements were recorded again by each rater using the same procedures after a 30-minute time interval. All raters were blinded to each other's measurements. Setting University Laboratories and Veteran Affairs Healthcare System. Results Intra-rater (intraclass correlation coefficient, ICC > 0.83) and inter-rater (ICC > 0.78) reliability was excellent for both the MWUs with SCI and able-bodied groups across all arm positions except for the 45° position in the control group for one of the raters (intra-rater: ICC < 0.40 and inter-rater: ICC < 0.60). AHD significantly reduced when the shoulder was in the 90° arm elevated positions with or without resistance. Conclusion Findings from our study demonstrated that ultrasonography is a reliable means to evaluate AHD in both able bodied and individuals with SCI, who are known to have significant shoulder pathology. This technique could be used to develop reference measures and to identify changes in AHD caused by interventions. PMID:24968117
Reliability of plasma lipopolysaccharide-binding protein (LBP) from repeated measures in healthy adults.

PubMed

Citronberg, Jessica S; Wilkens, Lynne R; Lim, Unhee; Hullar, Meredith A J; White, Emily; Newcomb, Polly A; Le Marchand, Loïc; Lampe, Johanna W

2016-09-01

Plasma lipopolysaccharide-binding protein (LBP), a measure of internal exposure to bacterial lipopolysaccharide, has been associated with several chronic conditions and may be a marker of chronic inflammation; however, no studies have examined the reliability of this biomarker in a healthy population. We examined the temporal reliability of LBP measured in archived samples from participants in two studies. In Study one, 60 healthy participants had blood drawn at two time points: baseline and follow-up (either three, six, or nine months). In Study two, 24 individuals had blood drawn three to four times over a seven-month period. We measured LBP in archived plasma by ELISA. Test-retest reliability was estimated by calculating the intraclass correlation coefficient (ICC). Plasma LBP concentrations showed moderate reliability in Study one (ICC 0.60, 95 % CI 0.43-0.75) and Study two (ICC 0.46, 95 % CI 0.26-0.69). Restricting the follow-up period improved reliability. In Study one, the reliability of LBP over a three-month period was 0.68 (95 % CI: 0.41-0.87). In Study two, the ICC of samples taken ≤seven days apart was 0.61 (95 % CI 0.29-0.86). Plasma LBP concentrations demonstrated moderate test-retest reliability in healthy individuals with reliability improving over a shorter follow-up period.
A method of bias correction for maximal reliability with dichotomous measures.

PubMed

Penev, Spiridon; Raykov, Tenko

2010-02-01

This paper is concerned with the reliability of weighted combinations of a given set of dichotomous measures. Maximal reliability for such measures has been discussed in the past, but the pertinent estimator exhibits a considerable bias and mean squared error for moderate sample sizes. We examine this bias, propose a procedure for bias correction, and develop a more accurate asymptotic confidence interval for the resulting estimator. In most empirically relevant cases, the bias correction and mean squared error correction can be performed simultaneously. We propose an approximate (asymptotic) confidence interval for the maximal reliability coefficient, discuss the implementation of this estimator, and investigate the mean squared error of the associated asymptotic approximation. We illustrate the proposed methods using a numerical example.

Measuring older adults' sedentary time: reliability, validity, and responsiveness.

PubMed

Gardiner, Paul A; Clark, Bronwyn K; Healy, Genevieve N; Eakin, Elizabeth G; Winkler, Elisabeth A H; Owen, Neville

2011-11-01

With evidence that prolonged sitting has deleterious health consequences, decreasing sedentary time is a potentially important preventive health target. High-quality measures, particularly for use with older adults, who are the most sedentary population group, are needed to evaluate the effect of sedentary behavior interventions. We examined the reliability, validity, and responsiveness to change of a self-report sedentary behavior questionnaire that assessed time spent in behaviors common among older adults: watching television, computer use, reading, socializing, transport and hobbies, and a summary measure (total sedentary time). In the context of a sedentary behavior intervention, nonworking older adults (n = 48, age = 73 ± 8 yr (mean ± SD)) completed the questionnaire on three occasions during a 2-wk period (7 d between administrations) and wore an accelerometer (ActiGraph model GT1M) for two periods of 6 d. Test-retest reliability (for the individual items and the summary measure) and validity (self-reported total sedentary time compared with accelerometer-derived sedentary time) were assessed during the 1-wk preintervention period, using Spearman (ρ) correlations and 95% confidence intervals (CI). Responsiveness to change after the intervention was assessed using the responsiveness statistic (RS). Test-retest reliability was excellent for television viewing time (ρ (95% CI) = 0.78 (0.63-0.89)), computer use (ρ (95% CI) = 0.90 (0.83-0.94)), and reading (ρ (95% CI) = 0.77 (0.62-0.86)); acceptable for hobbies (ρ (95% CI) = 0.61 (0.39-0.76)); and poor for socializing and transport (ρ < 0.45). Total sedentary time had acceptable test-retest reliability (ρ (95% CI) = 0.52 (0.27-0.70)) and validity (ρ (95% CI) = 0.30 (0.02-0.54)). Self-report total sedentary time was similarly responsive to change (RS = 0.47) as accelerometer-derived sedentary time (RS = 0.39). The summary measure of total sedentary time has good repeatability and modest validity and is
Validity and reliability of a scale to measure genital body image.

PubMed

Zielinski, Ruth E; Kane-Low, Lisa; Miller, Janis M; Sampselle, Carolyn

2012-01-01

Women's body image dissatisfaction extends to body parts usually hidden from view--their genitals. Ability to measure genital body image is limited by lack of valid and reliable questionnaires. We subjected a previously developed questionnaire, the Genital Self Image Scale (GSIS) to psychometric testing using a variety of methods. Five experts determined the content validity of the scale. Then using four participant groups, factor analysis was performed to determine construct validity and to identify factors. Further construct validity was established using the contrasting groups approach. Internal consistency and test-retest reliability was determined. Twenty one of 29 items were considered content valid. Two items were added based on expert suggestions. Factor analysis was undertaken resulting in four factors, identified as Genital Confidence, Appeal, Function, and Comfort. The revised scale (GSIS-20) included 20 items explaining 59.4% of the variance. Women indicating an interest in genital cosmetic surgery exhibited significantly lower scores on the GSIS-20 than those who did not. The final 20 item scale exhibited internal reliability across all sample groups as well as test-retest reliability. The GSIS-20 provides a measure of genital body image demonstrating reliability and validity across several populations of women.
Analysis of linear measurements on 3D surface models using CBCT data segmentation obtained by automatic standard pre-set thresholds in two segmentation software programs: an in vitro study.

PubMed

Poleti, Marcelo Lupion; Fernandes, Thais Maria Freire; Pagin, Otávio; Moretti, Marcela Rodrigues; Rubira-Bullen, Izabel Regina Fischer

2016-01-01

The aim of this in vitro study was to evaluate the reliability and accuracy of linear measurements on three-dimensional (3D) surface models obtained by standard pre-set thresholds in two segmentation software programs. Ten mandibles with 17 silica markers were scanned for 0.3-mm voxels in the i-CAT Classic (Imaging Sciences International, Hatfield, PA, USA). Twenty linear measurements were carried out by two observers two times on the 3D surface models: the Dolphin Imaging 11.5 (Dolphin Imaging & Management Solutions, Chatsworth, CA, USA), using two filters(Translucent and Solid-1), and in the InVesalius 3.0.0 (Centre for Information Technology Renato Archer, Campinas, SP, Brazil). The physical measurements were made by another observer two times using a digital caliper on the dry mandibles. Excellent intra- and inter-observer reliability for the markers, physical measurements, and 3D surface models were found (intra-class correlation coefficient (ICC) and Pearson's r ≥ 0.91). The linear measurements on 3D surface models by Dolphin and InVesalius software programs were accurate (Dolphin Solid-1 > InVesalius > Dolphin Translucent). The highest absolute and percentage errors were obtained for the variable R1-R1 (1.37 mm) and MF-AC (2.53 %) in the Dolphin Translucent and InVesalius software, respectively. Linear measurements on 3D surface models obtained by standard pre-set thresholds in the Dolphin and InVesalius software programs are reliable and accurate compared with physical measurements. Studies that evaluate the reliability and accuracy of the 3D models are necessary to ensure error predictability and to establish diagnosis, treatment plan, and prognosis in a more realistic way.
Sonographic measurements of the achilles tendon, plantar fascia, and heel fat pad are reliable: A test-retest intra- and intertester study.

PubMed

Johannsen, Finn; Jensen, Signe; Stallknecht, Sandra E; Olsen, Lars Otto; Magnusson, S Peter

2016-10-01

To determine intra- and interobserver reliability and precision of sonographic (US) scanning in measuring thickness of the Achilles tendon, plantar fascia, and heel fat pad in patients with heel pain. Seventeen consecutive patients referred with heel pain were included. Two evaluators blinded to the diagnosis performed independently US scanning of both feet without any dialogue with the patient. The examiner left the room, and the next examiner entered. All patients had two US scans performed by each examiner. Two months later, the US images were randomly presented to the evaluators for measurements. Reliability and agreement were assessed by calculation of intraclass correlation coefficient (ICC), 95% limits of agreement (LOA), and typical error (TE). LOA was calculated as a percentage of the mean thickness of each structure to obtain a unitless parameter. We found excellent intratester reliability (ICC 0.78-0.98) and good intertester reliability using one measurement (ICC 0.72-0.91) and excellent (ICC 0.85-0.95) when using average of two measurements. The intratester agreements were good with LOA: 9.5-23.4% and TE: 3.4-8.4%. The intertester agreements were acceptable using one measurement with LOA: 16.1-36.4%, and better using two measurements with LOA: 14.4-33.2%. US is a reliable technique of measurement in the daily clinic, and one single measurement is sufficient. In research, we recommend that the same observer performs the US measurements, if one single scanning is preferred; if more researchers are involved, the average measurement of two US scans is recommended. © 2016 Wiley Periodicals, Inc. J Clin Ultrasound 44:480-486, 2016. © 2016 Wiley Periodicals, Inc.
A Measure for the Reliability of a Rating Scale Based on Longitudinal Clinical Trial Data

ERIC Educational Resources Information Center

Laenen, Annouschka; Alonso, Ariel; Molenberghs, Geert

2007-01-01

A new measure for reliability of a rating scale is introduced, based on the classical definition of reliability, as the ratio of the true score variance and the total variance. Clinical trial data can be employed to estimate the reliability of the scale in use, whenever repeated measurements are taken. The reliability is estimated from the…
Reliability analysis and fault-tolerant system development for a redundant strapdown inertial measurement unit. [inertial platforms

NASA Technical Reports Server (NTRS)

Motyka, P.

1983-01-01

A methodology is developed and applied for quantitatively analyzing the reliability of a dual, fail-operational redundant strapdown inertial measurement unit (RSDIMU). A Markov evaluation model is defined in terms of the operational states of the RSDIMU to predict system reliability. A 27 state model is defined based upon a candidate redundancy management system which can detect and isolate a spectrum of failure magnitudes. The results of parametric studies are presented which show the effect on reliability of the gyro failure rate, both the gyro and accelerometer failure rates together, false alarms, probability of failure detection, probability of failure isolation, and probability of damage effects and mission time. A technique is developed and evaluated for generating dynamic thresholds for detecting and isolating failures of the dual, separated IMU. Special emphasis is given to the detection of multiple, nonconcurrent failures. Digital simulation time histories are presented which show the thresholds obtained and their effectiveness in detecting and isolating sensor failures.
Reliability of in-Shoe Plantar Pressure Measurements in Rheumatoid Arthritis Patients

ERIC Educational Resources Information Center

Vidmar, Gaj; Novak, Primoz

2009-01-01

Plantar pressures measurement is a frequently used method in rehabilitation and related research. Metric characteristics of the F-Scan system have been assessed from different standpoints and in different patients, but not its reliability in rheumatoid arthritis patients. Therefore, our objective was to assess reliability of the F-Scan plantar…
Obtaining Reliable Predictions of Terrestrial Energy Coupling From Real-Time Solar Wind Measurement

NASA Technical Reports Server (NTRS)

Weimer, Daniel R.

2001-01-01

The first draft of a manuscript titled "Variable time delays in the propagation of the interplanetary magnetic field" has been completed, for submission to the Journal of Geophysical Research. In the preparation of this manuscript all data and analysis programs had been updated to the highest temporal resolution possible, at 16 seconds or better. The program which computes the "measured" IMF propagation time delays from these data has also undergone another improvement. In another significant development, a technique has been developed in order to predict IMF phase plane orientations, and the resulting time delays, using only measurements from a single satellite at L1. The "minimum variance" method is used for this computation. Further work will be done on optimizing the choice of several parameters for the minimum variance calculation.
Reliability and validity of non-radiographic methods of thoracic kyphosis measurement: a systematic review.

PubMed

Barrett, Eva; McCreesh, Karen; Lewis, Jeremy

2014-02-01

A wide array of instruments are available for non-invasive thoracic kyphosis measurement. Guidelines for selecting outcome measures for use in clinical and research practice recommend that properties such as validity and reliability are considered. This systematic review reports on the reliability and validity of non-invasive methods for measuring thoracic kyphosis. A systematic search of 11 electronic databases located studies assessing reliability and/or validity of non-invasive thoracic kyphosis measurement techniques. Two independent reviewers used a critical appraisal tool to assess the quality of retrieved studies. Data was extracted by the primary reviewer. The results were synthesized qualitatively using a level of evidence approach. 27 studies satisfied the eligibility criteria and were included in the review. The reliability, validity and both reliability and validity were investigated by sixteen, two and nine studies respectively. 17/27 studies were deemed to be of high quality. In total, 15 methods of thoracic kyphosis were evaluated in retrieved studies. All investigated methods showed high (ICC ≥ .7) to very high (ICC ≥ .9) levels of reliability. The validity of the methods ranged from low to very high. The strongest levels of evidence for reliability exists in support of the Debrunner kyphometer, Spinal Mouse and Flexicurve index, and for validity supports the arcometer and Flexicurve index. Further reliability and validity studies are required to strengthen the level of evidence for the remaining methods of measurement. This should be addressed by future research. Copyright © 2013 Elsevier Ltd. All rights reserved.
The concurrent validity and reliability of a low-cost, high-speed camera-based method for measuring the flight time of vertical jumps.

PubMed

Balsalobre-Fernández, Carlos; Tejero-González, Carlos M; del Campo-Vecino, Juan; Bavaresco, Nicolás

2014-02-01

Flight time is the most accurate and frequently used variable when assessing the height of vertical jumps. The purpose of this study was to analyze the validity and reliability of an alternative method (i.e., the HSC-Kinovea method) for measuring the flight time and height of vertical jumping using a low-cost high-speed Casio Exilim FH-25 camera (HSC). To this end, 25 subjects performed a total of 125 vertical jumps on an infrared (IR) platform while simultaneously being recorded with a HSC at 240 fps. Subsequently, 2 observers with no experience in video analysis analyzed the 125 videos independently using the open-license Kinovea 0.8.15 software. The flight times obtained were then converted into vertical jump heights, and the intraclass correlation coefficient (ICC), Bland-Altman plot, and Pearson correlation coefficient were calculated for those variables. The results showed a perfect correlation agreement (ICC = 1, p < 0.0001) between both observers' measurements of flight time and jump height and a highly reliable agreement (ICC = 0.997, p < 0.0001) between the observers' measurements of flight time and jump height using the HSC-Kinovea method and those obtained using the IR system, thus explaining 99.5% (p < 0.0001) of the differences (shared variance) obtained using the IR platform. As a result, besides requiring no previous experience in the use of this technology, the HSC-Kinovea method can be considered to provide similarly valid and reliable measurements of flight time and vertical jump height as more expensive equipment (i.e., IR). As such, coaches from many sports could use the HSC-Kinovea method to measure the flight time and height of their athlete's vertical jumps.
Validity and reliability of food security measures.

PubMed

Cafiero, Carlo; Melgar-Quiñonez, Hugo R; Ballard, Terri J; Kepple, Anne W

2014-12-01

This paper reviews some of the existing food security indicators, discussing the validity of the underlying concept and the expected reliability of measures under reasonably feasible conditions. The main objective of the paper is to raise awareness on existing trade-offs between different qualities of possible food security measurement tools that must be taken into account when such tools are proposed for practical application, especially for use within an international monitoring framework. The hope is to provide a timely, useful contribution to the process leading to the definition of a food security goal and the associated monitoring framework within the post-2015 Development Agenda. © 2014 New York Academy of Sciences.
Insightful practice: a reliable measure for medical revalidation

PubMed Central

Guthrie, Bruce; Sullivan, Frank M; Mercer, Stewart W; Russell, Andrew; Bruce, David A

2012-01-01

Background Medical revalidation decisions need to be reliable if they are to reassure on the quality and safety of professional practice. This study tested an innovative method in which general practitioners (GPs) were assessed on their reflection and response to a set of externally specified feedback. Setting and participants 60 GPs and 12 GP appraisers in the Tayside region of Scotland, UK. Methods A feedback dataset was specified as (1) GP-specific data collected by GPs themselves (patient and colleague opinion; open book self-evaluated knowledge test; complaints) and (2) Externally collected practice-level data provided to GPs (clinical quality and prescribing safety). GPs' perceptions of whether the feedback covered UK General Medical Council specified attributes of a ‘good doctor’ were examined using a mapping exercise. GPs' professionalism was examined in terms of appraiser assessment of GPs' level of insightful practice, defined as: engagement with, insight into and appropriate action on feedback data. The reliability of assessment of insightful practice and subsequent recommendations on GPs' revalidation by face-to-face and anonymous assessors were investigated using Generalisability G-theory. Main outcome measures Coverage of General Medical Council attributes by specified feedback and reliability of assessor recommendations on doctors' suitability for revalidation. Results Face-to-face assessment proved unreliable. Anonymous global assessment by three appraisers of insightful practice was highly reliable (G=0.85), as were revalidation decisions using four anonymous assessors (G=0.83). Conclusions Unlike face-to-face appraisal, anonymous assessment of insightful practice offers a valid and reliable method to decide GP revalidation. Further validity studies are needed. PMID:22653078
Reliability generalization of the Multigroup Ethnic Identity Measure-Revised (MEIM-R).

PubMed

Herrington, Hayley M; Smith, Timothy B; Feinauer, Erika; Griner, Derek

2016-10-01

[Correction Notice: An Erratum for this article was reported in Vol 63(5) of Journal of Counseling Psychology (see record 2016-33161-001). The name of author Erika Feinauer was misspelled as Erika Feinhauer. All versions of this article have been corrected.] Individuals' strength of ethnic identity has been linked with multiple positive indicators, including academic achievement and overall psychological well-being. The measure researchers use most often to assess ethnic identity, the Multigroup Ethnic Identity Measure (MEIM), underwent substantial revision in 2007. To inform scholars investigating ethnic identity, we performed a reliability generalization analysis on data from the revised version (MEIM-R) and compared it with data from the original MEIM. Random-effects weighted models evaluated internal consistency coefficients (Cronbach's alpha). Reliability coefficients for the MEIM-R averaged α = .88 across 37 samples, a statistically significant increase over the average of α = .84 for the MEIM across 75 studies. Reliability coefficients for the MEIM-R did not differ across study and participant characteristics such as sample gender and ethnic composition. However, consistently lower reliability coefficients averaging α = .81 were found among participants with low levels of education, suggesting that greater attention to data reliability is warranted when evaluating the ethnic identity of individuals such as middle-school students. Future research will be needed to ascertain whether data with other measures of aspects of personal identity (e.g., racial identity, gender identity) also differ as a function of participant level of education and associated cognitive or maturation processes. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Reliability of intraventricular pressure measurement with fiberoptic or solid-state transducers: avoidance of a methodological error.

PubMed

Raabe, A; Stöckel, R; Hohrein, D; Schöche, J

1998-01-01

The failure of intraventricular pressure measurement in cases of catheter blockage or dislodgement is thought to be eliminated by using intraventricular microtransducers. We report on an avoidable methodological error that may affect the reliability of intraventricular pressure measurement with these devices. Intraventricular fiberoptic or solid-state devices were implanted in 43 patients considered to be at risk for developing catheter occlusion. Two different types were used, i.e., devices in which the transducer is placed inside the ventriculostomy catheter (Type A) and devices in which the transducer is integrated in the external surface of the catheter (Type B). Type A devices were used in 15 patients and Type B devices in 28 patients. Pressure recordings were checked at bedside for the validity and reliability of the measurement. Of the 15 patients treated with Type A devices, no reliable pressure recordings were able to be obtained in three patients in whom ventricular punctures were not successful. In 4 of the remaining 12 patients, periods of erroneous pressure readings were detected. After opening of cerebrospinal fluid drainage, all Type A devices failed to reflect real intraventricular pressure. In patients treated with Type B devices, no erroneous pressure recordings were able to be identified, irrespective of whether cerebrospinal fluid drainage was performed. Even when ventricular puncture failed, pressure measurement was correct each time. Transducers that are simply placed inside the ventriculostomy catheter require fluid-coupling. They may fail, either during cerebrospinal fluid drainage or when the catheter is blocked or placed within the parenchyma.
Reliability Concerns in Measuring Respondent Skin Tone by Interviewer Observation

PubMed Central

Hannon, Lance; DeFina, Robert

2016-01-01

The current study assesses the intercoder reliability of one of the most important skin tone measurement instruments—the Massey–Martin scale. This scale is used in several high-profile social surveys, but has not yet been psychometrically evaluated. The current evaluation is only possible because, for the first time, the General Social Survey’s 2010–2014 panel used the instrument to guide interviewers’ skin tone observation of the same respondents in two different years (2012 and 2014). Despite the widespread use of the Massey–Martin scale to investigate potential effects of skin tone on social attitudes and outcomes, the data suggest that the measure has low intercoder reliability. Implications for researchers and survey practitioners are discussed. PMID:27274576
Validity and reliability of the G-Cog device for kinematic measurements.

PubMed

Chiementin, X; Crequy, S; Bertucci, W

2013-11-01

The aim of this study was to test the validity and the reliability of the G-Cog which is a new BMX powermeter allowing for the measurements of the acceleration on X-Y-Z axis (250 Hz) at the BMX rear wheel. These measurements allow computing lateral, angular, linear acceleration, angular, linear velocity and the distance. Mechanical measurements at submaximal intensities in standardized laboratory conditions and during maximal exercises in the field conditions were performed to analyse the reliability of the G-Cog accelerometers. The performances were evaluated in comparison with an industrial accelerometer and with 2 powermeters, the SRM and PowerTap. Our results in laboratory conditions show that the G-Cog measurements have low value of variation coefficient (CV=2.35%). These results suggest that the G-cog accelerometers measurements are reproducible. The ratio limits of agreement of the rear hub angular velocity differences between the SRM and the G-Cog were 1.010 × ÷ 1.024 (95%CI=0.986-1.034) and between PowerTap and G-Cog were 0.993 × ÷ 1.019 (95%CI=0.974-1.012). In conclusion, our results suggest that the G-Cog angular velocity measurements are valid and reliable compared with SRM and PowerTap and could be used to analyse the kinematics during BMX actual conditions. © Georg Thieme Verlag KG Stuttgart · New York.
Improved reliability of pH measurements.

PubMed

Spitzer, Petra; Werner, Barbara

2002-11-01

Measurements of pH are performed on a large scale at laboratory level, and in industry. To meet the quality-control requirements and other technical specifications there is a need for traceability in measurement results. The prerequisite for the international acceptance of analytical data is reliability. To measure means to compare. Comparability entails use of recognised references to which the standard buffer solutions used for calibration of pH meter-electrode assemblies can be traced. The new recommendation on the measurement of pH recently published as a provisional document by the International Union on Pure and Applied Chemistry (IUPAC) enables traceability for measured pH values to a conventional reference frame which is recognised world-wide. The primary method for pH will be described. If analytical data are to be accepted internationally it is necessary to demonstrate the equivalence of the national traceability structures, including national measurement standards. For the first time key comparisons for pH have been performed by the Consultative Committee for Amount of Substance (CCQM, set up by the International Bureau of Weights and Measures, BIPM) to assess the equivalence of the national measurement procedures used to determine the pH of primary standard buffer solutions. The results of the first key comparison on pH CCQM-K9, and other international initiatives to improve the consistency of the results of measurement for pH, are reported.
Reliability and validity: Part II.

PubMed

Davis, Debora Winders

2004-01-01

Determining measurement reliability and validity involves complex processes. There is usually room for argument about most instruments. It is important that the researcher clearly describes the processes upon which she made the decision to use a particular instrument, and presents the evidence available showing that the instrument is reliable and valid for the current purposes. In some cases, the researcher may need to conduct pilot studies to obtain evidence upon which to decide whether the instrument is valid for a new population or a different setting. In all cases, the researcher must present a clear and complete explanation for the choices, she has made regarding reliability and validity. The consumer must then judge the degree to which the researcher has provided adequate and theoretically sound rationale. Although I have tried to touch on most of the important concepts related to measurement reliability and validity, it is beyond the scope of this column to be exhaustive. There are textbooks devoted entirely to specific measurement issues if readers require more in-depth knowledge.
Comparison of Automated Brain Volume Measures obtained with NeuroQuant and FreeSurfer.

PubMed

Ochs, Alfred L; Ross, David E; Zannoni, Megan D; Abildskov, Tracy J; Bigler, Erin D

2015-01-01

To examine intermethod reliabilities and differences between FreeSurfer and the FDA-cleared congener, NeuroQuant, both fully automated methods for structural brain MRI measurements. MRI scans from 20 normal control subjects, 20 Alzheimer's disease patients, and 20 mild traumatically brain-injured patients were analyzed with NeuroQuant and with FreeSurfer. Intermethod reliability was evaluated. Pairwise correlation coefficients, intraclass correlation coefficients, and effect size differences were computed. NeuroQuant versus FreeSurfer measures showed excellent to good intermethod reliability for the 21 regions evaluated (r: .63 to .99/ICC: .62 to .99/ES: -.33 to 2.08) except for the pallidum (r/ICC/ES = .31/.29/-2.2) and cerebellar white matter (r/ICC/ES = .31/.31/.08). Volumes reported by NeuroQuant were generally larger than those reported by FreeSurfer with the whole brain parenchyma volume reported by NeuroQuant 6.50% larger than the volume reported by FreeSurfer. There was no systematic difference in results between the 3 subgroups. NeuroQuant and FreeSurfer showed good to excellent intermethod reliability in volumetric measurements for all brain regions examined with the only exceptions being the pallidum and cerebellar white matter. This finding was robust for normal individuals, patients with Alzheimer's disease, and patients with mild traumatic brain injury. Copyright © 2015 by the American Society of Neuroimaging.
Improving Metrological Reliability of Information-Measuring Systems Using Mathematical Modeling of Their Metrological Characteristics

NASA Astrophysics Data System (ADS)

Kurnosov, R. Yu; Chernyshova, T. I.; Chernyshov, V. N.

2018-05-01

The algorithms for improving the metrological reliability of analogue blocks of measuring channels and information-measuring systems are developed. The proposed algorithms ensure the optimum values of their metrological reliability indices for a given analogue circuit block solution.

Comparison of reliability and responsiveness of patient-reported clinical outcome measures in knee osteoarthritis rehabilitation.

PubMed

Williams, Valerie J; Piva, Sara R; Irrgang, James J; Crossley, Chad; Fitzgerald, G Kelley

2012-08-01

Secondary analysis, pretreatment-posttreatment observational study. To compare the reliability and responsiveness of the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC), the Knee Outcome Survey activities of daily living subscale (KOS-ADL), and the Lower Extremity Functional Scale (LEFS) in individuals with knee osteoarthritis (OA). The WOMAC is the current standard in patient-reported measures of function in patients with knee OA. The KOS-ADL and LEFS were designed for potential use in patients with knee OA. If the KOS-ADL and LEFS are to be considered viable alternatives to the WOMAC for measuring patient-reported function in individuals with knee OA, they should have measurement properties comparable to the WOMAC. It would also be important to determine whether either of these instruments may be superior to the WOMAC in terms of reliability or responsiveness in this population. Data from 168 subjects with knee OA, who participated in a rehabilitation program, were used in the analyses. Reliability and responsiveness of each outcome measure were estimated at follow-ups of 2, 6, and 12 months. Reliability was estimated by calculating the intraclass correlation coefficient (ICC2,1) for subjects who were unchanged in status from baseline at each follow-up time, based on a global rating of change score. To examine responsiveness, the standard error of the measurement, minimal detectable change, minimal clinically important difference, and the Guyatt responsiveness index were calculated for each outcome measure at each follow-up time. All 3 outcome measures demonstrated reasonable reliability and responsiveness to change. Reliability and responsiveness tended to decrease somewhat with increasing follow-up time. There were no substantial differences between outcome measures for reliability or any of the 3 measures of responsiveness at any follow-up time. The results do not indicate that one outcome measure is more reliable or responsive than
Reliability of Physical Activity Measures During Free-Living Activities in People After Total Knee Arthroplasty.

PubMed

Almeida, Gustavo J; Irrgang, James J; Fitzgerald, G Kelley; Jakicic, John M; Piva, Sara R

2016-06-01

Few instruments that measure physical activity (PA) can accurately quantify PA performed at light and moderate intensities, which is particularly relevant in older adults. The evidence of their reliability in free-living conditions is limited. The study objectives were: (1) to determine the test-retest reliability of the Actigraph (ACT), SenseWear Armband (SWA), and Community Healthy Activities Model Program for Seniors (CHAMPS) questionnaire in assessing free-living PA at light and moderate intensities in people after total knee arthroplasty; (2) to compare the reliability of the 3 instruments relative to each other; and (3) to determine the reliability of commonly used monitoring time frames (24 hours, waking hours, and 10 hours from awakening). A one-group, repeated-measures design was used. Participants wore the activity monitors for 2 weeks, and the CHAMPS questionnaire was completed at the end of each week. Test-retest reliability was determined by using the intraclass correlation coefficient (ICC [2,k]) to compare PA measures from one week with those from the other week. Data from 28 participants who reported similar PA during the 2 weeks were included in the analysis. The mean age of these participants was 69 years (SD=8), and 75% of them were women. Reliability ranged from moderate to excellent for the ACT (ICC=.75-.86) and was excellent for the SWA (ICC=.93-.95) and the CHAMPS questionnaire (ICC=.86-.92). The 95% confidence intervals (95% CI) of the ICCs from the SWA were the only ones within the excellent reliability range (.85-.98). The CHAMPS questionnaire showed systematic bias, with less PA being reported in week 2. The reliability of PA measures in the waking-hour time frame was comparable to that in the 24-hour time frame and reflected most PA performed during this period. Reliability may be lower for time intervals longer than 1 week. All PA measures showed good reliability. The reliability of the ACT was lower than those of the SWA and the CHAMPS
Toward Obtaining Reliable Particulate Air Quality Information from Satellites

NASA Astrophysics Data System (ADS)

Strawa, A. W.; Chatfield, R. B.; Legg, M.; Esswein, R.; Justice, E.

2009-12-01

Air quality agencies use ground sites to monitor air quality, providing accurate information at particular points. Using measurements from satellite imagery has the potential to provide air quality information in a timely manner with better spatial resolution and at a lower cost that can also useful for model validation. While previous studies show acceptable correlations between Aerosol Optical Depth (AOD) derived from MODIS and surface Particulate Matter (PM) measurements on the eastern US, the data do not correlate well in the western US (Al-Saadi et al., 2005; Engle-Cox et al., 2004) . This paper seeks to improve the AOD-PM correlations by using advanced statistical analysis techniques. Our study area is the San Joaquin Valley in California because air quality in this region has failed to meet state and federal attainment standards for PM for the past several years. A previous investigation found good correlation of the AOD values between MODIS, MISR and AERONET, but poor correlations (R2 ~ 0.02) between satellite-based AOD and surface PM2.5 measurements. PM2.5 measurements correlated somewhat better (R2 ~ 0.18) with MODIS-derived AOD using the Deep Blue surface reflectance algorithm (Hsu et al., 2006) rather than the standard MODIS algorithm. This level of correlation is not adequate for reliable air quality measurements. Pelletier et al. (2007) used generalized additive models (GAMs) and meteorological data to improve the correlation between PM and AERONET AOD in western Europe. Additive models are more flexible than linear models and the functional relationships can be plotted to give a sense of the relationship between the predictor and the response. In this paper we use GAMs to improve surface PM2.5 to MODIS-AOD correlations. For example, we achieve an R2 ~ 0.44 using a GAM that includes the Deep Blue AOD, and day of year as parameters. Including NOx observations, improves the R2 ~ 0.64. Surprisingly Ångström exponent did not prove to be a significant
Estimation of reliable range of electron temperature measurements with sets of given optical bandpass filters for KSTAR Thomson scattering system based on synthetic Thomson data

NASA Astrophysics Data System (ADS)

Kim, K.-h.; Oh, T.-s.; Park, K.-r.; Lee, J. H.; Ghim, Y.-c.

2017-11-01

One factor determining the reliability of measurements of electron temperature using a Thomson scattering (TS) system is transmittance of the optical bandpass filters in polychromators. We investigate the system performance as a function of electron temperature to determine reliable range of measurements for a given set of the optical bandpass filters. We show that such a reliability, i.e., both bias and random errors, can be obtained by building a forward model of the KSTAR TS system to generate synthetic TS data with the prescribed electron temperature and density profiles. The prescribed profiles are compared with the estimated ones to quantify both bias and random errors.
Software Reliability, Measurement, and Testing. Volume 2. Guidebook for Software Reliability Measurement and Testing

DTIC Science & Technology

1992-04-01

contractor’s existing data collection, analysis and corrective action system shall be utilized, with modification only as necessary to meet the...either from test or from analysis of field data . The procedures of MIL-STD-756B assume that the reliability of a 18 DEFINE IDENTIFY SOFTWARE LIFE CYCLE...to generate sufficient data to report a statistically valid reliability figure for a class of software. Casual data gathering accumulates data more
The better way to determine the validity, reliability, objectivity and accuracy of measuring devices.

PubMed

Pazira, Parvin; Rostami Haji-Abadi, Mahdi; Zolaktaf, Vahid; Sabahi, Mohammadfarzan; Pazira, Toomaj

2016-06-08

In relation to statistical analysis, studies to determine the validity, reliability, objectivity and precision of new measuring devices are usually incomplete, due in part to using only correlation coefficient and ignoring the data dispersion. The aim of this study was to demonstrate the best way to determine the validity, reliability, objectivity and accuracy of an electro-inclinometer or other measuring devices. Another purpose of this study is to answer the question of whether reliability and objectivity represent accuracy of measuring devices. The validity of an electro-inclinometer was examined by mechanical and geometric methods. The objectivity and reliability of the device was assessed by calculating Cronbach's alpha for repeated measurements by three raters and by measurements on the same person by mechanical goniometer and the electro-inclinometer. Measurements were performed on "hip flexion with the extended knee" and "shoulder abduction with the extended elbow." The raters measured every angle three times within an interval of two hours. The three-way ANOVA was used to determine accuracy. The results of mechanical and geometric analysis showed that validity of the electro-inclinometer was 1.00 and level of error was less than one degree. Objectivity and reliability of electro-inclinometer was 0.999, while objectivity of mechanical goniometer was in the range of 0.802 to 0.966 and the reliability was 0.760 to 0.961. For hip flexion, the difference between raters in joints angle measurement by electro-inclinometer and mechanical goniometer was 1.74 and 16.33 degree (P<0.05), respectively. The differences for shoulder abduction measurement by electro-inclinometer and goniometer were 0.35 and 4.40 degree (P<0.05). Although both the objectivity and reliability are acceptable, the results showed that measurement error was very high in the mechanical goniometer. Therefore, it can be concluded that objectivity and reliability alone cannot determine the accuracy
Can Reliability of Multiple Component Measuring Instruments Depend on Response Option Presentation Mode?

ERIC Educational Resources Information Center

Menold, Natalja; Raykov, Tenko

2016-01-01

This article examines the possible dependency of composite reliability on presentation format of the elements of a multi-item measuring instrument. Using empirical data and a recent method for interval estimation of group differences in reliability, we demonstrate that the reliability of an instrument need not be the same when polarity of the…
Reliability of a Measure of Institutional Discrimination against Minorities

DTIC Science & Technology

1979-12-01

samples are presented. The first is based upon classical statistical theory and the second derives from a series of computer-generated Monte Carlo...Institutional racism and sexism . Englewood Cliffs, N. J.: Prentice-Hall, Inc., 1978. Hays, W. L. and Winkler, R. L. Statistics : probability, inference... statistical measure of the e of institutional discrimination are discussed. Two methods of dealing with the problem of reliability of the measure in small
GT3X+ accelerometer placement affects the reliability of step-counts measured during running and pedal-revolution counts measured during bicycling.

PubMed

Gatti, Anthony A; Stratford, Paul W; Brenneman, Elora C; Maly, Monica R

2016-01-01

Accelerometers provide a measure of step-count. Reliability and validity of step-count and pedal-revolution count measurements by the GT3X+ accelerometer, placed at different anatomical locations, is absent in the literature. The purpose of this study was to investigate the reliability and validity of step and pedal-revolution counts produced by the GT3X+ placed at different anatomical locations during running and bicycling. Twenty-two healthy adults (14 men and 8 women) completed running and bicycling activity bouts (5 minutes each) while wearing 6 accelerometers: 2 each at the waist, thigh and shank. Accelerometer and video data were collected during activity. Excellent reliability and validity were found for measurements taken from accelerometers mounted at the waist and shank during running (Reliability: intraclass correlation (ICC) ≥ 0.99; standard error of measurement (SEM) ≤1.0 steps; Pearson ≥ 0.99) and at the thigh and shank during bicycling (Reliability: ICC ≥ 0.99; SEM ≤1.0 revolutions; Pearson ≥ 0.99). Excellent reliability was found between measurements taken at the waist and shank during running (ICC ≥ 0.98; SEM ≤1.6 steps) and between measurements taken at the thigh and shank during bicycling (ICC ≥ 0.99; SEM ≤1.0 revolutions). These data suggest that the GT3X+ can be used for measuring step-count during running and pedal-revolution count during bicycling. Only shank placement is recommended for both activities.
Measuring the anaesthesia clinical learning environment at the department level is feasible and reliable.

PubMed

Castanelli, D J; Smith, N A

2017-05-01

The learning environment describes the context and culture in which trainees learn. In order to establish the feasibility and reliability of measuring the anaesthetic learning environment in individual departments we implemented a previously developed instrument in hospitals across New South Wales. We distributed the instrument to trainees from 25 anaesthesia departments and supplied summarized results to individual departments. Exploratory and confirmatory factor analyses were performed to assess internal structure validity and generalizability theory was used to calculate reliability. The number of trainees required for acceptable precision in results was determined using the standard error of measurement. We received 172 responses (59% response rate). Suitable internal structure validity was confirmed. Measured reliability was acceptable (G-coefficient 0.69) with nine trainees per department. Eight trainees were required for a 95% confidence interval of plus or minus 0.25 in the mean total score. Eight trainees as assessors also allow a 95% confidence interval of approximately plus or minus 0.3 in the subscale mean scores. Results for individual departments varied, with scores below the expected level recorded on individual subscales, particularly the 'teaching' subscale. Our results confirm that, using this instrument, individual departments can obtain acceptable precision in results with achievable trainee numbers. Additionally, with the exception of departments with few trainees, implementation proved feasible across a training region. Repeated use would allow departments or accrediting bodies to monitor their individual learning environment and the impact of changes such as the introduction of new curricular elements, or local initiatives to improve trainee experience. © The Author 2017. Published by Oxford University Press on behalf of the British Journal of Anaesthesia. All rights reserved. For Permissions, please email: journals.permissions@oup.com
A scale for measuring hygiene behavior: development, reliability and validity.

PubMed

Stevenson, Richard J; Case, Trevor I; Hodgson, Deborah; Porzig-Drummond, Renata; Barouei, Javad; Oaten, Megan J

2009-09-01

There is currently no general self-report measure for assessing hygiene behavior. This article details the development and testing of such a measure. In studies 1 to 4, a total of 855 participants were used for scale and subscale development and for reliability and validity testing. The latter involved establishing the relationships between self-reported hygiene behavior and existing measures, hand hygiene behavior, illness rates, and a physiological marker of immune function. In study 5, a total of 507 participants were used to assess the psychometric properties of the final revised version of the scale. The final 23-item scale comprised 5 subscales: general, household, food-related, handwashing technique, and personal hygiene. Studies 1 to 4 confirmed the scale's reliability and validity, and study 5 confirmed the scale's 5-factor structure. The scale is potentially suitable for multiple uses, in various settings, and for experimental and correlational approaches.
Reliability and accuracy analysis of a new semiautomatic radiographic measurement software in adult scoliosis.

PubMed

Aubin, Carl-Eric; Bellefleur, Christian; Joncas, Julie; de Lanauze, Dominic; Kadoury, Samuel; Blanke, Kathy; Parent, Stefan; Labelle, Hubert

2011-05-20

Radiographic software measurement analysis in adult scoliosis. To assess the accuracy as well as the intra- and interobserver reliability of measuring different indices on preoperative adult scoliosis radiographs using a novel measurement software that includes a calibration procedure and semiautomatic features to facilitate the measurement process. Scoliosis requires a careful radiographic evaluation to assess the deformity. Manual and computer radiographic process measures have been studied extensively to determine the reliability and reproducibility in adolescent idiopathic scoliosis. Most studies rely on comparing given measurements, which are repeated by the same user or by an expert user. A given measure with a small intra- or interobserver error might be deemed as good repeatability, but all measurements might not be truly accurate because the ground-truth value is often unknown. Thorough accuracy assessment of radiographic measures is necessary to assess scoliotic deformities, compare these measures at different stages or to permit valid multicenter studies. Thirty-four sets of adult scoliosis digital radiographs were measured two times by three independent observers using a novel radiographic measurement software that includes semiautomatic features to facilitate the measurement process. Twenty different measures taken from the Spinal Deformity Study Group radiographic measurement manual were performed on the coronal and sagittal images. Intra- and intermeasurer reliability for each measure was assessed. The accuracy of the measurement software was also assessed using a physical spine model in six different scoliotic configurations as a true reference. The majority of the measures demonstrated good to excellent intra- and intermeasurer reliability, except for sacral obliquity. The standard variation of all the measures was very small: ≤ 4.2° for Cobb angles, ≤ 4.2° for the kyphosis, ≤ 5.7° for the lordosis, ≤ 3.9° for the pelvic angles, and �
Research Measures for Dyscalculia: A Validity and Reliability Study.

ERIC Educational Resources Information Center

Geiman, R. M.

1986-01-01

This study sought to evaluate a measure of dyscalculia to determine its validity and reliability. It also tested use of the instrument with seventh graders and ascertained where errors attributed to dyscalculia were also present in an average sample of seventh graders. Results varied. (MNS)
Validity and reliability of the Diagnostic Adaptive Behaviour Scale.

PubMed

Tassé, M J; Schalock, R L; Balboni, G; Spreat, S; Navas, P

2016-01-01

The Diagnostic Adaptive Behaviour Scale (DABS) is a new standardised adaptive behaviour measure that provides information for evaluating limitations in adaptive behaviour for the purpose of determining a diagnosis of intellectual disability. This article presents validity evidence and reliability data for the DABS. Validity evidence was based on comparing DABS scores with scores obtained on the Vineland Adaptive Behaviour Scale, second edition. The stability of the test scores was measured using a test and retest, and inter-rater reliability was assessed by computing the inter-respondent concordance. The DABS convergent validity coefficients ranged from 0.70 to 0.84, while the test-retest reliability coefficients ranged from 0.78 to 0.95, and the inter-rater concordance as measured by intraclass correlation coefficients ranged from 0.61 to 0.87. All obtained validity and reliability indicators were strong and comparable with the validity and reliability coefficients of the most commonly used adaptive behaviour instruments. These results and the advantages of the DABS for clinician and researcher use are discussed. © 2015 MENCAP and International Association of the Scientific Study of Intellectual and Developmental Disabilities and John Wiley & Sons Ltd.
An Examination of the True Reliability of Lower Limb Stiffness Measures During Overground Hopping.

PubMed

Diggin, David; Anderson, Ross; Harrison, Andrew J

2016-06-01

Evidence suggests reports describing the reliability of leg-spring (kleg) and joint stiffness (kjoint) measures are contaminated by artifacts originating from digital filtering procedures. In addition, the intraday reliability of kleg and kjoint requires investigation. This study examined the effects of experimental procedures on the inter- and intraday reliability of kleg and kjoint. Thirty-two participants completed 2 trials of single-legged hopping at 1.5, 2.2, and 3.0 Hz at the same time of day across 3 days. On the final test day a fourth experimental bout took place 6 hours before or after participants' typical testing time. Kinematic and kinetic data were collected throughout. Stiffness was calculated using models of kleg and kjoint. Classifications of measurement agreement were established using thresholds for absolute and relative reliability statistics. Results illustrated that kleg and kankle exhibited strong agreement. In contrast, kknee and khip demonstrated weak-to-moderate consistency. Results suggest limits in kjoint reliability persist despite employment of appropriate filtering procedures. Furthermore, diurnal fluctuations in lower-limb muscle-tendon stiffness exhibit little effect on intraday reliability. The present findings support the existence of kleg as an attractor state during hopping, achieved through fluctuations in kjoint variables. Limits to kjoint reliability appear to represent biological function rather than measurement artifact.
The Reliability and Validity of Measures of Gait Variability in Community-Dwelling Older Adults

PubMed Central

Brach, Jennifer S.; Perera, Subashan; Studenski, Stephanie; Newman, Anne B.

2009-01-01

Objective To examine the test-retest reliability and concurrent validity of variability of gait characteristics. Design Cross-sectional study. Setting Research laboratory. Participants Older adults (N=558) from the Cardiovascular Health Study. Interventions Not applicable. Main Outcome Measures Gait characteristics were measured using a 4-m computerized walkway. SD determined from the steps recorded were used as the measures of variability. Intraclass correlation coefficients (ICC) were calculated to examine test-retest reliability of a 4-m walk and two 4-m walks. To establish concurrent validity, the measures of gait variability were compared across levels of health, functional status, and physical activity using independent t tests and analysis of variances. Results Gait variability measures from the two 4-m walks demonstrated greater test-retest reliability than those from the single 4-m walk (ICC=.22–.48 and ICC=.40–.63, respectively). Greater step length and stance time variability were associated with poorer health, functional status and physical activity (P<.05). Conclusions Gait variability calculated from a limited number of steps has fair to good test-retest reliability and concurrent validity. Reliability of gait variability calculated from a greater number of steps should be assessed to determine if the consistency can be improved. PMID:19061741
TEST–RETEST RELIABILITY OF CAPABILITY MEASUREMENT IN THE UK GENERAL POPULATION

PubMed Central

Al-Janabi, Hareth; Flynn, Terry N; Peters, Tim J; Bryan, Stirling; Coast, Joanna

2015-01-01

Although philosophically attractive, it may be difficult, in practice, to measure individuals' capabilities (what they are able to do in their lives) as opposed to their functionings (what they actually do). To examine whether capability information could be reliably self-reported, we administered a measure of self-reported capability (the Investigating Choice Experiments Capability Measure for Adults, ICECAP-A) on two occasions, 2 weeks apart, alongside a self-reported health measure (the EuroQol Five Dimensional Questionnaire with 3 levels, EQ-5D-3L). We found that respondents were able to report capabilities with a moderate level of consistency, although somewhat less reliably than their health status. The more socially orientated nature of some of the capability questions may account for the difference. © 2014 The Authors Health Economics Published by John Wiley & Sons Ltd. PMID:25204621
A simple and reliable sensor for accurate measurement of angular speed for low speed rotating machinery

NASA Astrophysics Data System (ADS)

Kuosheng, Jiang; Guanghua, Xu; Tangfei, Tao; Lin, Liang; Yi, Wang; Sicong, Zhang; Ailing, Luo

2014-01-01

This paper presents the theory and implementation of a novel sensor system for measuring the angular speed (AS) of a shaft rotating at a very low speed range, nearly zero speed. The sensor system consists mainly of an eccentric sleeve rotating with the shaft on which the angular speed to be measured, and an eddy current displacement sensor to obtain the profile of the sleeve for AS calculation. When the shaft rotates at constant speed the profile will be a pure sinusoidal trace. However, the profile will be a phase modulated signal when the shaft speed is varied. By applying a demodulating procedure, the AS can be obtained in a straightforward manner. The sensor system was validated experimentally based on a gearbox test rig and the result shows that the AS obtained are consistent with that obtained by a conventional encoder. However, the new sensor gives very smooth and stable traces of the AS, demonstrating its higher accuracy and reliability in obtaining the AS of the low speed operations with speed-up and down transients. In addition, the experiment also shows that it is easy and cost-effective to be realised in different applications such as condition monitoring and process control.
The concurrent validity and intrarater reliability of the Microsoft Kinect to measure thoracic kyphosis.

PubMed

Quek, June; Brauer, Sandra G; Treleaven, Julia; Clark, Ross A

2017-09-01

This study aims to investigate the concurrent validity and intrarater reliability of the Microsoft Kinect to measure thoracic kyphosis against the Flexicurve. Thirty-three healthy individuals (age: 31±11.0 years, men: 17, height: 170.2±8.2 cm, weight: 64.2±12.0 kg) participated, with 29 re-examined for intrarater reliability 1-7 days later. Thoracic kyphosis was measured using the Flexicurve and the Microsoft Kinect consecutively in both standing and sitting positions. Both the kyphosis index and angle were calculated. The Microsoft Kinect showed excellent concurrent validity (intraclass correlation coefficient=0.76-0.82) and reliability (intraclass correlation coefficient=0.81-0.98) for measuring thoracic kyphosis (angle and index) in both standing and sitting postures. This study is the first to show that the Microsoft Kinect has excellent validity and intrarater reliability to measure thoracic kyphosis, which is promising for its use in the clinical setting.
Reliability of shoulder internal rotation passive range of motion measurements in the supine versus sidelying position.

PubMed

Lunden, Jason B; Muffenbier, Mike; Giveans, M Russell; Cieminski, Cort J

2010-09-01

Clinical measurement, reliability. To compare intrarater and interrater reliability of shoulder internal rotation (IR) passive range of motion measurements utilizing a standard supine position and a sidelying position. Glenohumeral IR range of motion deficits are often noted in patients with shoulder pathology. Excellent intrarater reliability has been found when measuring this motion. However, interrater reliability has been reported as poor to fair. Some clinicians currently use a sidelying position for IR stretching with patients who have shoulder pathology. However, no objective data exist for IR passive range of motion measured in this sidelying position, either in terms of reliability or normative values. Seventy subjects (mean age, 36.8 years), with (n = 19) and without (n = 51) shoulder pathology, were included in this study. Shoulder IR passive range of motion of the dominant shoulder or involved shoulder was measured by 2 investigators in 2 positions: (1) a standard supine position, with the shoulder at 90 degrees of abduction, and (2) in sidelying on the tested side, with the shoulder flexed to 90 degrees . Intrarater reliability for supine measurements was good to excellent (ICC3,1 = 0.70-0.93) and for sidelying measurements was excellent (ICC3,1 = 0.94-0.98). Interrater reliability was fair to good for the supine measurement (ICC2,2 = 0.74-0.81) and good to excellent for the sidelying measurement (ICC2,2 = 0.88-0.96). The mean (range) value of the dominant shoulder sidelying IR passive range of motion was 40 degrees (11 degrees to 69 degrees ) for healthy subjects and 25 degrees (-16 degrees to 49 degrees) for subjects with shoulder pathology. For subjects with shoulder pathology, measurements of shoulder IR made in the sidelying position had superior intrarater and interrater reliability compared to those in the standard supine position.

A weighted reliability measure for phonetic transcription.

PubMed

Oller, D Kimbrough; Ramsdell, Heather L

2006-12-01

The purpose of the present work is to describe and illustrate the utility of a new tool for assessment of transcription agreement. Traditional measures have not characterized overall transcription agreement with sufficient resolution, specifically because they have often treated all phonetic differences between segments in transcriptions as equivalent, thus constituting an unweighted approach to agreement assessment. The measure the authors have developed calculates a weighted transcription agreement value based on principles derived from widely accepted tenets of phonological theory. To investigate the utility of the new measure, 8 coders transcribed samples of speech and infant vocalizations. Comparing the transcriptions through a computer-based implementation of the new weighted and the traditional unweighted measures, they investigated the scaling properties of both. The results illustrate better scaling with the weighted measure, in particular because the weighted measure is not subject to the floor effects that occur with the traditional measure when applied to samples that are difficult to transcribe. Furthermore, the new weighted measure shows orderly relations in degree of agreement across coded samples of early canonical-stage babbling, early meaningful speech in English, and 3 adult languages. The authors conclude that the weighted measure may provide improved foundations for research on phonetic transcription and for monitoring of transcription reliability.
Speech-evoked activation in adult temporal cortex measured using functional near-infrared spectroscopy (fNIRS): Are the measurements reliable?

PubMed

Wiggins, Ian M; Anderson, Carly A; Kitterick, Pádraig T; Hartley, Douglas E H

2016-09-01

Functional near-infrared spectroscopy (fNIRS) is a silent, non-invasive neuroimaging technique that is potentially well suited to auditory research. However, the reliability of auditory-evoked activation measured using fNIRS is largely unknown. The present study investigated the test-retest reliability of speech-evoked fNIRS responses in normally-hearing adults. Seventeen participants underwent fNIRS imaging in two sessions separated by three months. In a block design, participants were presented with auditory speech, visual speech (silent speechreading), and audiovisual speech conditions. Optode arrays were placed bilaterally over the temporal lobes, targeting auditory brain regions. A range of established metrics was used to quantify the reproducibility of cortical activation patterns, as well as the amplitude and time course of the haemodynamic response within predefined regions of interest. The use of a signal processing algorithm designed to reduce the influence of systemic physiological signals was found to be crucial to achieving reliable detection of significant activation at the group level. For auditory speech (with or without visual cues), reliability was good to excellent at the group level, but highly variable among individuals. Temporal-lobe activation in response to visual speech was less reliable, especially in the right hemisphere. Consistent with previous reports, fNIRS reliability was improved by averaging across a small number of channels overlying a cortical region of interest. Overall, the present results confirm that fNIRS can measure speech-evoked auditory responses in adults that are highly reliable at the group level, and indicate that signal processing to reduce physiological noise may substantially improve the reliability of fNIRS measurements. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
The Reliability and Stability of an Inferred Phylogenetic Tree from Empirical Data.

PubMed

Katsura, Yukako; Stanley, Craig E; Kumar, Sudhir; Nei, Masatoshi

2017-03-01

The reliability of a phylogenetic tree obtained from empirical data is usually measured by the bootstrap probability (Pb) of interior branches of the tree. If the bootstrap probability is high for most branches, the tree is considered to be reliable. If some interior branches show relatively low bootstrap probabilities, we are not sure that the inferred tree is really reliable. Here, we propose another quantity measuring the reliability of the tree called the stability of a subtree. This quantity refers to the probability of obtaining a subtree (Ps) of an inferred tree obtained. We then show that if the tree is to be reliable, both Pb and Ps must be high. We also show that Ps is given by a bootstrap probability of the subtree with the closest outgroup sequence, and computer program RESTA for computing the Pb and Ps values will be presented. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Reliability of measuring hip abductor strength following total knee arthroplasty using a hand-held dynamometer.

PubMed

Schache, Margaret B; McClelland, Jodie A; Webster, Kate E

2016-01-01

To investigate the test-retest reliability of measuring hip abductor strength in patients with total knee arthroplasty (TKA) using a hand-held dynamometer (HHD) with two different types of resistance: belt and manual resistance. Test-retest reliability of 30 subjects (17 female, 13 male, 71.9 ± 7.4 years old), 9.2 ± 2.7 days post TKA was measured using belt and therapist resistance. Retest reliability was calculated with intra-class coefficients (ICC3,1) and 95% confidence intervals (CI) for both the group average and the individual scores. A paired t-test assessed whether a difference existed between the belt and therapist methods of resistance. ICCs were 0.82 and 0.80 for the belt and therapist resisted methods, respectively. Hip abductor strength increases of 8 N (14%) for belt resisted and 14 N (17%) for therapist resisted measurements of the group average exceeded the 95% CI and may represent real change. For individuals, hip abductor strength increases of 33 N (72%) (belt resisted) and 57 N (79%) (therapist resisted) could be interpreted as real change. Hip abductor strength can be reliably measured using HHD in the clinical setting with the described protocol. Belt resistance demonstrated slightly higher test-retest reliability. Reliable measurement of hip abductor muscle strength in patients with TKA is important to ensure deficiencies are addressed in rehabilitation programs and function is maximized. Hip abductor strength can be reliably measured with a hand-held dynamometer in the clinical setting using manual or belt resistance.
Reliability of EEG Measures of Interaction: A Paradigm Shift Is Needed to Fight the Reproducibility Crisis

PubMed Central

Höller, Yvonne; Uhl, Andreas; Bathke, Arne; Thomschewski, Aljoscha; Butz, Kevin; Nardone, Raffaele; Fell, Jürgen; Trinka, Eugen

2017-01-01

Measures of interaction (connectivity) of the EEG are at the forefront of current neuroscientific research. Unfortunately, test-retest reliability can be very low, depending on the measure and its estimation, the EEG-frequency of interest, the length of the signal, and the population under investigation. In addition, artifacts can hamper the continuity of the EEG signal, and in some clinical situations it is impractical to exclude artifacts. We aimed to examine factors that moderate test-retest reliability of measures of interaction. The study involved 40 patients with a range of neurological diseases and memory impairments (age median: 60; range 21–76; 40% female; 22 mild cognitive impairment, 5 subjective cognitive complaints, 13 temporal lobe epilepsy), and 20 healthy controls (age median: 61.5; range 23–74; 70% female). We calculated 14 measures of interaction based on the multivariate autoregressive model from two EEG-recordings separated by 2 weeks. We characterized test-retest reliability by correlating the measures between the two EEG-recordings for variations of data length, data discontinuity, artifact exclusion, model order, and frequency over all combinations of channels and all frequencies, individually for each subject, yielding a correlation coefficient for each participant. Excluding artifacts had strong effects on reliability of some measures, such as classical, real valued coherence (~0.1 before, ~0.9 after artifact exclusion). Full frequency directed transfer function was highly reliable and robust against artifacts. Variation of data length decreased reliability in relation to poor adjustment of model order and signal length. Variation of discontinuity had no effect, but reliabilities were different between model orders, frequency ranges, and patient groups depending on the measure. Pathology did not interact with variation of signal length or discontinuity. Our results emphasize the importance of documenting reliability, which may vary
Reliability of Phase Velocity Measurements of Flexural Acoustic Waves in the Human Tibia In-Vivo.

PubMed

Vogl, Florian; Schnüriger, Karin; Gerber, Hans; Taylor, William R

2016-01-01

Axial-transmission acoustics have shown to be a promising technique to measure individual bone properties and detect bone pathologies. With the ultimate goal being the in-vivo application of such systems, quantification of the key aspects governing the reliability is crucial to bring this method towards clinical use. This work presents a systematic reliability study quantifying the sources of variability and their magnitudes of in-vivo measurements using axial-transmission acoustics. 42 healthy subjects were measured by an experienced operator twice per week, over a four-month period, resulting in over 150000 wave measurements. In a complementary study to assess the influence of different operators performing the measurements, 10 novice operators were trained, and each measured 5 subjects on a single occasion, using the same measurement protocol as in the first part of the study. The estimated standard error for the measurement protocol used to collect the study data was ∼ 17 m/s (∼ 4% of the grand mean) and the index of dependability, as a measure of reliability, was Φ = 0.81. It was shown that the method is suitable for multi-operator use and that the reliability can be improved efficiently by additional measurements with device repositioning, while additional measurements without repositioning cannot improve the reliability substantially. Phase velocity values were found to be significantly higher in males than in females (p < 10-5) and an intra-class correlation coefficient of r = 0.70 was found between the legs of each subject. The high reliability of this non-invasive approach and its intrinsic sensitivity to mechanical properties opens perspectives for the rapid and inexpensive clinical assessment of bone pathologies, as well as for monitoring programmes without any radiation exposure for the patient.
Reliability of the American Medical Association guides' model for measuring spinal range of motion. Its implication for whole-person impairment rating.

PubMed

Nitschke, J E; Nattrass, C L; Disler, P B; Chou, M J; Ooi, K T

1999-02-01

Repeated measures design for intra- and interrater reliability. To determine the intra- and interrater reliability of the lumbar spine range of motion measured with a dual inclinometer, and the thoracolumbar spine range of motion measured with a long-arm goniometer, as recommended in the American Medical Association Guides. The American Medical Association Guides (2nd and 4th editions) recommend using measurements of thoracolumbar and lumbar range of movement, respectively, to estimate the percentage of permanent impairment in patients with chronic low back pain. However, the reliability of this method of estimating impairment has not been determined. In all, 34 subjects participated in the study, 21 women with a mean age of 40.1 years (SD, +/- 11.1) and 13 men with a mean age of 47.7 years (SD, +/- 12.1). Measures of thoracolumbar flexion, extension, lateral flexion, and rotation were obtained with a long-arm goniometer. Lumbar flexion, extension, and lateral flexion were measured with a dual inclinometer. Measurements were taken by two examiners on one occasion and by one examiner on two occasions approximately 1 week apart. The results showed poor intra- and interrater reliability for all measurements taken with both instruments. Measurement error expressed in degrees showed that measurements taken by different raters exhibited systematic as well as random differences. As a result, subjects measured by two different examiners on the same day, with either instrument, could give impairment ratings ranging between 0% and 18% of the whole person (excluding rotation), in which percentage impairment is calculated using the average range of motion and the average systematic and random error in degrees for the group for each movement (flexion, extension, and lateral flexion). The poor reliability of the American Medical Association Guides' spinal range of motion model can result in marked variation in the percentage of whole-body impairment. These findings have
Blood pressure measurement reliability among different racial-ethnic groups in a stroke prevention study.

PubMed

Estol, Conrado J; Bath, Philip M W; Gorelick, Philip B; Cotton, Daniel; Martin, Renee H; Weber, Michael A; Dahlof, Bjorn

2014-10-01

High blood pressure (BP) is commonly not diagnosed, and patients do not achieve target values when treated. Among 20,000 patients encompassing most races-ethnicities, we evaluated BP measurements and treatment response in a stroke prevention trial. Our goal was to identify BP measurement differences between clinical trial and patient determinations and among the racial-ethnic groups. A total of 20,332 patients with ischemic stroke were randomized to receive antiplatelet treatment and 80 mg of telmisartan versus placebo. BP measurements were obtained at the first clinic visit and then 1 and 3 months later and every 6 months thereafter. One week after the first clinic visit, patients were requested to report a BP measurement obtained elsewhere. Measurements at the trial clinics were obtained with the same electronic device. Statistical analysis was used to detect significant differences. The mean patient age was 66 years; 36% were women, and race-ethnicity comprised 58% Whites, 33% Asian, 4.9% Hispanic, and 4% Black. Overall, 74% of patients were hypertensive. BP varied between the race-ethnicity groups, being highest in Hispanics (145/85) and lowest in Blacks (144/82). BP at visits clinic 1, nonclinic 1A, and clinic 2 were, respectively, 144/84, 137/80, and 139/81 mmHg, with the difference between visits 1-2 and visit 1A being significant. BPs were normal in 42% of the cases at visit 1A, and of these, only 44% were normal at visit 1 and 57.6% were normal on visit 2. Similar findings were noted for all race-ethnicity groups. BP values varied among race-ethnicities and showed differences between clinic and patient measurements. This finding questions the reliability of self-reported BP and has implications for BP management in daily clinical practice.
Reliability and Validity Testing of the Physical Resilience Measure

ERIC Educational Resources Information Center

Resnick, Barbara; Galik, Elizabeth; Dorsey, Susan; Scheve, Ann; Gutkin, Susan

2011-01-01

Objective: The purpose of this study was to test reliability and validity of the Physical Resilience Scale. Methods: A single-group repeated measure design was used and 130 older adults from three different housing sites participated. Participants completed the Physical Resilience Scale, Hardy-Gill Resilience Scale, 14-item Resilience Scale,…
Reliability of the Q Force; a mobile instrument for measuring isometric quadriceps muscle strength.

PubMed

Douma, K W; Regterschot, G R H; Krijnen, W P; Slager, G E C; van der Schans, C P; Zijlstra, W

2016-01-01

The ability to generate muscle strength is a pre-requisite for all human movement. Decreased quadriceps muscle strength is frequently observed in older adults and is associated with a decreased performance and activity limitations. To quantify the quadriceps muscle strength and to monitor changes over time, instruments and procedures with a sufficient reliability are needed. The Q Force is an innovative mobile muscle strength measurement instrument suitable to measure in various degrees of extension. Measurements between 110 and 130° extension present the highest values and the most significant increase after training. The objective of this study is to determine the test-retest reliability of muscle strength measurements by the Q Force in older adults in 110° extension. Forty-one healthy older adults, 13 males and 28 females were included in the study. Mean (SD) age was 81.9 (4.89) years. Isometric muscle strength of the Quadriceps muscle was assessed with the Q Force at 110° of knee extension. Participants were measured at two sessions with a three to eight day interval between sessions. To determine relative reliability, the intraclass correlation coefficient (ICC) was calculated. To determine absolute reliability, Bland and Altman Limits of Agreement (LOA) were calculated and t-tests were performed. Relative reliability of the Q Force is good to excellent as all ICC coefficients are higher than 0.75. Generally a large 95 % LOA, reflecting only moderate absolute reliability, is found as exemplified for the peak torque left leg of -18.6 N to 33.8 N and the right leg of -9.2 N to 26.4 N was between 15.7 and 23.6 Newton representing 25.2 % to 39.9 % of the size of the mean. Small systematic differences in mean were found between measurement session 1 and 2. The present study shows that the Q Force has excellent relative test-retest reliability, but limited absolute test-retest reliability. Since the Q Force is relatively cheap and mobile it is suitable for
Interrater Reliability of mHealth App Rating Measures: Analysis of Top Depression and Smoking Cessation Apps.

PubMed

Powell, Adam C; Torous, John; Chan, Steven; Raynor, Geoffrey Stephen; Shwarts, Erik; Shanahan, Meghan; Landman, Adam B

2016-02-10

There are over 165,000 mHealth apps currently available to patients, but few have undergone an external quality review. Furthermore, no standardized review method exists, and little has been done to examine the consistency of the evaluation systems themselves. We sought to determine which measures for evaluating the quality of mHealth apps have the greatest interrater reliability. We identified 22 measures for evaluating the quality of apps from the literature. A panel of 6 reviewers reviewed the top 10 depression apps and 10 smoking cessation apps from the Apple iTunes App Store on these measures. Krippendorff's alpha was calculated for each of the measures and reported by app category and in aggregate. The measure for interactiveness and feedback was found to have the greatest overall interrater reliability (alpha=.69). Presence of password protection (alpha=.65), whether the app was uploaded by a health care agency (alpha=.63), the number of consumer ratings (alpha=.59), and several other measures had moderate interrater reliability (alphas>.5). There was the least agreement over whether apps had errors or performance issues (alpha=.15), stated advertising policies (alpha=.16), and were easy to use (alpha=.18). There were substantial differences in the interrater reliabilities of a number of measures when they were applied to depression versus smoking apps. We found wide variation in the interrater reliability of measures used to evaluate apps, and some measures are more robust across categories of apps than others. The measures with the highest degree of interrater reliability tended to be those that involved the least rater discretion. Clinical quality measures such as effectiveness, ease of use, and performance had relatively poor interrater reliability. Subsequent research is needed to determine consistent means for evaluating the performance of apps. Patients and clinicians should consider conducting their own assessments of apps, in conjunction with
Validity and reliability of a novel measure of activity performance and participation.

PubMed

Murgatroyd, Phil; Karimi, Leila

2016-01-01

To develop and evaluate an innovative clinician-rated measure, which produces global numerical ratings of activity performance and participation. Repeated measures study with 48 community-dwelling participants investigating clinical sensibility, comprehensiveness, practicality, inter-rater reliability, responsiveness, sensitivity and concurrent validity with Barthel Index. Important clinimetric characteristics including comprehensiveness and ease of use were rated >8/10 by clinicians. Inter-rater reliability was excellent on the summary scores (intraclass correlation of 0.95-0.98). There was good evidence that the new outcome measure distinguished between known high and low functional scoring groups, including both responsiveness to change and sensitivity at the same time point in numerous tests. Concurrent validity with the Barthel Index was fair to high (Spearman Rank Order Correlation 0.32-0.85, p > 0.05). The new measure's summary scores were nearly twice as responsive to change compared with the Barthel Index. Other more detailed data could also be generated by the new measure. The Activity Performance Measure is an innovative outcome instrument that showed good clinimetric qualities in this initial study. Some of the results were strong, given the sample size, and further trial and evaluation is appropriate. Implications for Rehabilitation The Activity Performance Measure is an innovative outcome measure covering activity performance and participation. In an initial evaluation, it showed good clinimetric qualities including responsiveness to change, sensitivity, practicality, clinical sensibility, item coverage, inter-rater reliability and concurrent validity with the Barthel Index. Further trial and evaluation is appropriate.
Test-retest reliability of behavioral measures of impulsive choice, impulsive action, and inattention

PubMed Central

Weafer, Jessica; Baggott, Matthew J.; de Wit, Harriet

2014-01-01

Behavioral measures of impulsivity are widely used in substance abuse research, yet relatively little attention has been devoted to establishing their psychometric properties, especially their reliability over repeated administration. The current study examined the test-retest reliability of a battery of standardized behavioral impulsivity tasks, including measures of impulsive choice (delay discounting, probability discounting, and the Balloon Analogue Risk Task), impulsive action (the stop signal task, the go/no-go task, and commission errors on the continuous performance task), and inattention (attention lapses on a simple reaction time task and omission errors on the continuous performance task). Healthy adults (n=128) performed the battery on two separate occasions. Reliability estimates for the individual tasks ranged from moderate to high, with Pearson correlations within the specific impulsivity domains as follows: impulsive choice (r = .76 - .89, ps < .001); impulsive action (r = .65 - .73, ps < .001); and inattention (r = .38-.42, ps < .001). Additionally, the influence of day-to-day fluctuations in mood as measured by the Profile of Mood States was assessed in relation to variability in performance on each of the behavioral tasks. Change in performance on the delay discounting task was significantly associated with change in positive mood and arousal. No other behavioral measures were significantly associated with mood. In sum, the current analysis demonstrates that behavioral measures of impulsivity are reliable measures and thus can be confidently used to assess various facets of impulsivity as intermediate phenotypes for drug abuse. PMID:24099351
Reliability of Upright and Supine Power Measurements Using an Inertial Load Cycle Ergometer

NASA Technical Reports Server (NTRS)

Wickwire, P. J.; Leach, M.; Ryder, J.; Ploutz-Snyder, R.; Ploutz-Snyder, L.

2011-01-01

Practical, reliable, and time efficient methods of measuring muscular power are desirable for both research and applied testing situations. The inertial-load cycling method (ILC; Power/Cycle, Austin, TX) requires subjects to pedal as fast as possible against the inertial load of a flywheel for only 3-5 seconds, which could help reduce the time and effort required for maximal power testing. PURPOSE: 1) To test the intramachine reliability of ILC over 3 separate sessions, 2) to compare postural stance (upright vs. supine) during testing, and 3) to compare the maximal power (Pmax) output measured using ILC to that obtained from traditional isokinetic and leg press testing. METHODS: Subjects (n = 12) were tested on 4 non-consecutive days. The following tests were done on the first day of testing: isometric knee extension, isokinetic knee extension at several speeds, isokinetic power/endurance at 180/sec (Biodex System 4), leg press maximal isometric force, and leg press power/endurance. The other 3 days consisted exclusively of ILC testing. Subjects performed 6 ILC tests in an upright position and 6 ILC tests in a supine position on each day. The starting position was counterbalanced. Mixed-effects linear modeling was used to determine if any differences existed between testing days and between upright and supine for Pmax and revolutions per minute at Pmax (RPMpk). Mixed-modeling was also used to calculate intraclass correlation coefficients (ICC) to determine the reliability of the ILC on each testing day for Pmax and RPMpk (ICCs were calculated separately for upright and supine). gKendall fs Tau a h was used to determine the association between ILC Pmax and isokinetic and leg press data. RESULTS: For Pmax, significant differences were found between days 1 and 2 (upright: p = 0.018; supine: p = 0.014) and between days 1 and 3 (upright: p = 0.001; supine: p = 0.002), but not between days 2 and 3 (upright: p = 0.422; supine: p = 0.501). Pmax ICC values were greater than
Test-retest reliability of 3D ultrasound measurements of the thoracic spine.

PubMed

Fölsch, Christian; Schlögel, Stefanie; Lakemeier, Stefan; Wolf, Udo; Timmesfeld, Nina; Skwara, Adrian

2012-05-01

To explore the reliability of the Zebris CMS 20 ultrasound analysis system with pointer application for measuring end-range flexion, end-range extension, and neutral kyphosis angle of the thoracic spine. The study was performed within the School of Physiotherapy in cooperation with the Orthopedic Department at a University Hospital. The thoracic spines of 28 healthy subjects were measured. Measurements for neutral kyphosis angle, end-range flexion, and end-range extension were taken once at each time point. The bone landmarks were palpated by one examiner and marked with a pointer containing 2 transmitters using a frequency of 40 kHz. A third transmitter was fixed to the pelvis, and 3 microphones were used as receiver. The real angle was calculated by the software. Bland-Altman plots with 95% limits of agreement, intraclass correlations (ICC), standard deviations of mean measurements, and standard error of measurements were used for statistical analyses. The test-retest reliability in this study was measured within a 24-hour interval. Statistical parameters were used to judge reliability. The mean kyphosis angle was 44.8° with a standard deviation of 17.3° at the first measurement and a mean of 45.8° with a standard deviation of 16.2° the following day. The ICC was high at 0.95 for the neutral kyphosis angle, and the Bland-Altman 95% limits of agreement were within clinical acceptable margins. The ICC was 0.71 for end-range flexion and 0.34 for end-range extension, whereas the Bland-Altman 95% limits of agreement were wider than with the static measurement of kyphosis. Compared with static measurements, the analysis of motion with 3-dimensional ultrasound showed an increased standard deviation for test-retest measurements. The test-retest reliability of ultrasound measuring of the neutral kyphosis angle of the thoracic spine was demonstrated within 24 hours. Bland-Altman 95% limits of agreement and the standard deviation of differences did not appear to be
Reliability-Productivity Curve, a Tool for Adaptation Measures Identification

NASA Astrophysics Data System (ADS)

Chávez-Jiménez, A.; Granados, A.; Garrote, L. M.

2015-12-01

Due to climate change effects, water scarcity problems would intensify in several regions. These problems are going to impact negatively in the water low-priority demands, since these will be reduced in favor of those with high-priority. An example would be the reduction of agriculture water resources in favor of the urban ones. Then, it is important the evaluation of adaptation measures for a better water resources management. An important tool to face this challenge is the economic valuation of the water demands' impact within a water resources system. In agriculture this valuation is usually performed through the water productivity evaluation. The water productivity evaluation requires detailed information regarding the different crops like the applied technology, the agricultural supplies management, the water availability, etc. This is a restriction for an evaluation at basin scale due to the difficulty of gathers this level of detailed information. Besides, only the water availability is taken into account, but not the period when the water is distributed (i.e. water resources reliability). Water resources reliability is one of the most important variables in water resources management. This research proposes a methodology to determine the agriculture water productivity, using as variables the crops information, the crops price, the water resources availability, and the water resources reliability, at a basin scale. This methodology would allow identifying general water resources adaptation measures, providing the basis for further detailed studies in critical regions.
Reliability of the Kinetic Measures under Different Heel Conditions during Normal Walking

ERIC Educational Resources Information Center

Liu, Yuanlong; Wang, Yong Tai

2004-01-01

The purpose of this study was to determine and compare the reliability of 3 dimension reaction forces and impulses in walking with 3 different heel shoe conditions. These results suggest that changing the height of the heels affects mainly the reliability of the ground reaction force and impulse measures on the medial and lateral dimension and not…
Computing Reliabilities Of Ceramic Components Subject To Fracture

NASA Technical Reports Server (NTRS)

Nemeth, N. N.; Gyekenyesi, J. P.; Manderscheid, J. M.

1992-01-01

CARES calculates fast-fracture reliability or failure probability of macroscopically isotropic ceramic components. Program uses results from commercial structural-analysis program (MSC/NASTRAN or ANSYS) to evaluate reliability of component in presence of inherent surface- and/or volume-type flaws. Computes measure of reliability by use of finite-element mathematical model applicable to multiple materials in sense model made function of statistical characterizations of many ceramic materials. Reliability analysis uses element stress, temperature, area, and volume outputs, obtained from two-dimensional shell and three-dimensional solid isoparametric or axisymmetric finite elements. Written in FORTRAN 77.
Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 5. Technical Report #1204

ERIC Educational Resources Information Center

Park, Bitnara Jasmine; Irvin, P. Shawn; Lai, Cheng-Fei; Alonzo, Julie; Tindal, Gerald

2012-01-01

In this technical report, we present the results of a reliability study of the fifth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 2. Technical Report #1201

ERIC Educational Resources Information Center

Lai, Cheng-Fei; Irvin, P. Shawn; Alonzo, Julie; Park, Bitnara Jasmine; Tindal, Gerald

2012-01-01

In this technical report, we present the results of a reliability study of the second-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 4. Technical Report #1203

ERIC Educational Resources Information Center

Park, Bitnara Jasmine; Irvin, P. Shawn; Alonzo, Julie; Lai, Cheng-Fei; Tindal, Gerald

2012-01-01

In this technical report, we present the results of a reliability study of the fourth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 6. Technical Report #1205

ERIC Educational Resources Information Center

Irvin, P. Shawn; Alonzo, Julie; Park, Bitnara Jasmine; Lai, Cheng-Fei; Tindal, Gerald

2012-01-01

In this technical report, we present the results of a reliability study of the sixth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 7. Technical Report #1206

ERIC Educational Resources Information Center

Irvin, P. Shawn; Alonzo, Julie; Lai, Cheng-Fei; Park, Bitnara Jasmine; Tindal, Gerald

2012-01-01

In this technical report, we present the results of a reliability study of the seventh-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 3. Technical Report #1202

ERIC Educational Resources Information Center

Lai, Cheng-Fei; Irvin, P. Shawn; Park, Bitnara Jasmine; Alonzo, Julie; Tindal, Gerald

2012-01-01

In this technical report, we present the results of a reliability study of the third-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…
Test-retest reliability of sudden ankle inversion measurements in subjects with healthy ankle joints.

PubMed

Eechaute, Christophe; Vaes, Peter; Duquet, William; Van Gheluwe, Bart

2007-01-01

Sudden ankle inversion tests have been used to investigate whether the onset of peroneal muscle activity is delayed in patients with chronically unstable ankle joints. Before interpreting test results of latency times in patients with chronic ankle instability and healthy subjects, the reliability of these measures must be first demonstrated. To investigate the test-retest reliability of variables measured during a sudden ankle inversion movement in standing subjects with healthy ankle joints. Validation study. Research laboratory. 15 subjects with healthy ankle joints (30 ankles). Subjects stood on an ankle inversion platform with both feet tightly fixed to independently moveable trapdoors. An unexpected sudden ankle inversion of 50 degrees was imposed. We measured latency and motor response times and electromechanical delay of the peroneus longus muscle, along with the time and angular position of the first and second decelerating moments, the mean and maximum inversion speed, and the total inversion time. Correlation coefficients and standard error of measurements were calculated. Intraclass correlation coefficients ranged from 0.17 for the electromechanical delay of the peroneus longus muscle (standard error of measurement = 2.7 milliseconds) to 0.89 for the maximum inversion speed (standard error of measurement = 34.8 milliseconds). The reliability of the latency and motor response times of the peroneus longus muscle, the time of the first and second decelerating moments, and the mean and maximum inversion speed was acceptable in subjects with healthy ankle joints and supports the investigation of the reliability of these measures in subjects with chronic ankle instability. The lower reliability of the electromechanical delay of the peroneus longus muscle and the angular positions of both decelerating moments calls the use of these variables into question.
Night-to-night arousal variability and interscorer reliability of arousal measurements.

PubMed

Loredo, J S; Clausen, J L; Ancoli-Israel, S; Dimsdale, J E

1999-11-01

Measurement of arousals from sleep is clinically important, however, their definition is not well standardized, and little data exist on reliability. The purpose of this study is to determine factors that affect arousal scoring reliability and night-to-night arousal variability. The night-to-night arousal variability and interscorer reliability was assessed in 20 subjects with and without obstructive sleep apnea undergoing attended polysomnography during two consecutive nights. Five definitions of arousal were studied, assessing duration of electroencephalographic (EEG) frequency changes, increases in electromyographic (EMG) activity and leg movement, association with respiratory events, as well as the American Sleep Disorders Association (ASDA) definition of arousals. NA. NA. NA. Interscorer reliability varied with the definition of arousal and ranged from an Intraclass correlation (ICC) of 0.19 to 0.92. Arousals that included increases in EMG activity or leg movement had the greatest reliability, especially when associated with respiratory events (ICC 0.76 to 0.92). The ASDA arousal definition had high interscorer reliability (ICC 0.84). Reliability was lowest for arousals consisting of EEG changes lasting <3 seconds (ICC 0.19 to 0.37). The within subjects night-to-night arousal variability was low for all arousal definitions In a heterogeneous population, interscorer arousal reliability is enhanced by increases in EMG activity, leg movements, and respiratory events and decreased by short duration EEG arousals. The arousal index night-to-night variability was low for all definitions.
Comparison of macro and micro Raman measurement for reliable quantitative analysis of pharmaceutical polymorphs.

PubMed

Paiva, Eduardo M; da Silva, Vitor H; Poppi, Ronei J; Pereira, Claudete F; Rohwedder, Jarbas J R

2018-05-12

This work reports on the use of micro- and macro-Raman measurements for quantification of mebendazole (MBZ) polymorphs A, B, and C in mixtures. Three Raman spectrophotometers were studied with a laser spot size of 3, 80 and 100 μm and spectral resolutions of 3.9, 9 and 4 cm -1 , respectively. The samples studied were ternary mixtures varying the MBZ polymorphs A and C from 0 to 100% and polymorph B from 0 to 30%. Partial Least Squares (PLS) regression models were developed using the pre-processing spectra (2nd derivative) of the ternary mixtures. The best performance was obtained when the macro-Raman configuration was applied, obtaining RMSEP values of 1.68%, 1.24% and 2.03% w/w for polymorphs A, B, and C, respectively. In general, micro-Raman presented worst results for MBZ polymorphs prediction because the spectra obtained with this configuration does not represent the bulk proportion of mixtures, which have different particle morphologies and sizes. In addition, the influence of these particle features on micro-Raman measurements was also studied. Finally, the results demonstrated that reliable analytical quantifying of MBZ polymorphs can be reached using a laser with wider area illuminated, thus enabling acquisition of more reproductive and representative spectra of the mixtures. Copyright © 2018 Elsevier B.V. All rights reserved.
Low-Budget Instrumentation of a Conventional Leg Press to Measure Reliable Isometric-Strength Capacity.

PubMed

Baur, Heiner; Groppa, Alessia Severina; Limacher, Regula; Radlinger, Lorenz

2016-02-02

Maximum strength and rate of force development (RFD) are 2 important strength characteristics for everyday tasks and athletic performance. Measurements of both parameters must be reliable. Expensive isokinetic devices with isometric modes are often used. The possibility of cost-effective measurements in a practical setting would facilitate quality control. The purpose of this study was to assess the reliability of measurements of maximum isometric strength (Fmax) and RFD on a conventional leg press. Sixteen subjects (23 ± 2 y, 1.68 ± 0.05 m, 59 ± 5 kg) were tested twice within 1 session. After warm-up, subjects performed 2 times 5 trials eliciting maximum voluntary isometric contractions on an instrumented leg press (1- and 2-legged randomized). Fmax (N) and RFD (N/s) were extracted from force-time curves. Reliability was determined for Fmax and RFD by calculating the intraclass correlation coefficient (ICC), the test-retest variability (TRV), and the bias and limits of agreement. Reliability measures revealed good to excellent ICCs of .80-.93. TRV showed mean differences between measurement sessions of 0.4-6.9%. The systematic error was low compared with the absolute mean values (Fmax 5-6%, RFD 1-4%). The implementation of a force transducer into a conventional leg press provides a viable procedure to assess Fmax and RFD. Both performance parameters can be assessed with good to excellent reliability allowing quality control of interventions.
Test-retest reliability of behavioral measures of impulsive choice, impulsive action, and inattention.

PubMed

Weafer, Jessica; Baggott, Matthew J; de Wit, Harriet

2013-12-01

Behavioral measures of impulsivity are widely used in substance abuse research, yet relatively little attention has been devoted to establishing their psychometric properties, especially their reliability over repeated administration. The current study examined the test-retest reliability of a battery of standardized behavioral impulsivity tasks, including measures of impulsive choice (i.e., delay discounting, probability discounting, and the Balloon Analogue Risk Task), impulsive action (i.e., the stop signal task, the go/no-go task, and commission errors on the continuous performance task), and inattention (i.e., attention lapses on a simple reaction time task and omission errors on the continuous performance task). Healthy adults (n = 128) performed the battery on two separate occasions. Reliability estimates for the individual tasks ranged from moderate to high, with Pearson correlations within the specific impulsivity domains as follows: impulsive choice (r range: .76-.89, ps < .001); impulsive action (r range: .65-.73, ps < .001); and inattention (r range: .38-.42, ps < .001). Additionally, the influence of day-to-day fluctuations in mood, as measured by the Profile of Mood States, was assessed in relation to variability in performance on each of the behavioral tasks. Change in performance on the delay discounting task was significantly associated with change in positive mood and arousal. No other behavioral measures were significantly associated with mood. In sum, the current analysis demonstrates that behavioral measures of impulsivity are reliable measures and thus can be confidently used to assess various facets of impulsivity as intermediate phenotypes for drug abuse.
Validity and reliability of the iPhone to measure rib hump in scoliosis.

PubMed

Balg, Frederic; Juteau, Mathieu; Theoret, Chantal; Svotelis, Amy; Grenier, Guillaume

2014-12-01

This was a prospective blinded validity and reliability analysis. The aim of this study was validation and reliability evaluation of the Scoligauge iPhone app. The scoliometer is used to clinically measure the rib hump in scoliosis as a means to evaluate the axial trunk rotation. The increasing availability of smartphone with built-in accelerometer led to the development of a vast number of applications to measure angles. Of these, the Scoligauge mimics a scoliometer. The aim of this study was to compare the validity of the Scoligauge iPhone application without an associated adapter with the traditional scoliometer and to test the reliability of the application in a clinical setting. Two observers measured the rib hump deformity on 34 consecutive patients with idiopathic scoliosis with an average Cobb angle of 24.2 ± 13.5 degrees (range, 4 to 65 degrees). Measurements were made with an iPhone without the adapter and with a scoliometer. The validity as well as the interobserver and intraobserver reliability were calculated using the intraclass coefficient (ICC) and the Bland-Altman test. The mean difference between the scoliometer and the Scoligauge application was 0.4 degrees [95% confidence interval (CI) of ± 3.1 degrees] with an ICC of 0.947 (P < 0.001). The intraobserver and interobserver ICC were 0.961 (P < 0.001) and 0.901 (P < 0.001), respectively. The mean intraobserver difference was 0.0 degrees (95% CI of ± 2.7 degrees) and the mean interobserver difference was 0.1 degrees (95% CI of ± 4.4 degrees). The intraobserver and interobserver reliability of the Scoligauge iPhone app, as well as its validity compared with the scoliometer, are excellent. The mean differences between measurements are small and clinically not significant. Thus, the Scoligauge application is valid for clinical evaluation even without special adapter. Level I (Diagnostic Study).
Accuracy and reliability of Chile's National Air Quality Information System for measuring particulate matter: Beta attenuation monitoring issue.

PubMed

Toro A, Richard; Campos, Claudia; Molina, Carolina; Morales S, Raul G E; Leiva-Guzmán, Manuel A

2015-09-01

A critical analysis of Chile's National Air Quality Information System (NAQIS) is presented, focusing on particulate matter (PM) measurement. This paper examines the complexity, availability and reliability of monitoring station information, the implementation of control systems, the quality assurance protocols of the monitoring station data and the reliability of the measurement systems in areas highly polluted by particulate matter. From information available on the NAQIS website, it is possible to confirm that the PM2.5 (PM10) data available on the site correspond to 30.8% (69.2%) of the total information available from the monitoring stations. There is a lack of information regarding the measurement systems used to quantify air pollutants, most of the available data registers contain gaps, almost all of the information is categorized as "preliminary information" and neither standard operating procedures (operational and validation) nor assurance audits or quality control of the measurements are reported. In contrast, events that cause saturation of the monitoring detectors located in northern and southern Chile have been observed using beta attenuation monitoring. In these cases, it can only be concluded that the PM content is equal to or greater than the saturation concentration registered by the monitors and that the air quality indexes obtained from these measurements are underestimated. This occurrence has been observed in 12 (20) public and private stations where PM2.5 (PM10) is measured. The shortcomings of the NAQIS data have important repercussions for the conclusions obtained from the data and for how the data are used. However, these issues represent opportunities for improving the system to widen its use, incorporate comparison protocols between equipment, install new stations and standardize the control system and quality assurance. Copyright © 2015 Elsevier Ltd. All rights reserved.
Between-day reliability of centre of pressure measures for balance assessment in hemiplegic stroke patients.

PubMed

Gasq, David; Labrunée, Marc; Amarantini, David; Dupui, Philippe; Montoya, Richard; Marque, Philippe

2014-03-21

Stroke patients have impaired postural balance that increases the risk of falls and impairs their mobility. Assessment of postural balance is commonly carried out by recording centre of pressure (CoP) displacements, but the lack of data concerning reliability of these measures compromises their interpretation. The purpose of this study was to investigate the between-day reliability of six CoP-based variables, in order to provide i) reliability data for monitoring postural sway and weight-bearing asymmetry of stroke patients in clinical practice and ii) consistent assessment method of measurement error for applications in physical medicine and rehabilitation. Postural balance of 20 stroke patients was assessed in quiet standing on a force platform, in two sessions, 7 days apart. Six CoP-based variables were collected in eyes open and eyes closed conditions: postural sway was assessed with mean and standart deviation of CoP-velocity, CoP-velocity along the mediolateral and anteroposterior axes, and confidence ellipse area (CE(AREA)); weight-bearing asymmetry was assessed with mean CoP position along the mediolateral axis (CoP(ML)). The intraclass correlation coefficient (ICC) was used to determine the level of agreement between test-retest. Small real difference (SRD), corresponding to the smallest change that indicates a real improvement for a single individual, was used to determine the extent of measurement error. ICCs were satisfactory (>0.9) for all CoP-based variables, except for CE(AREA) in eyes open condition and CoP(ML) (<0.8). The SRDs (eyes open/closed conditions) were: 6.1/9.5 mm.s(-1) for mean velocity; 12.3/12.2 mm.s(-1) for standard deviation of CoP-velocity; 3.6/5.5 mm.s(-1) and 4.9/7.3 mm.s(-1) for CoP-velocity in mediolateral and anteroposterior axes, respectively; 17.4/21.4 mm for CoP(ML). Because CE(AREA) showed heteroscedasticity of measurement error distribution, SRD (eyes open/closed conditions) was expressed as a percentage (121/75%) and a
Measuring Quadriceps strength in adults with severe or moderate intellectual and visual disabilities: Feasibility and reliability.

PubMed

Dijkhuizen, Annemarie; Douma, Rob K; Krijnen, Wim P; van der Schans, Cees P; Waninge, Aly

2018-05-30

A feasible and reliable instrument to measure strength in persons with severe intellectual and visual disabilities (SIVD) is lacking. The aim of our study was to determine feasibility, learning period and reliability of three strength tests. Twenty-nine participants with SIVD performed the Minimum Sit-to-Stand Height test (MSST), the Leg Extension test (LE) and the 30 seconds Chair-Stand test (30sCS), once per week for 5 weeks. Feasibility was determined by the percentage of successful measurements; learning effect by using paired t test between two consecutive measurements; test-retest reliability by intraclass correlation coefficient and Limits of Agreement and, correlations by Pearson correlations. A sufficient feasibility and learning period of the tests was shown. The methods had sufficient test-retest reliability and moderate-to-sufficient correlations. The MSST, the LE, and the 30sCS are feasible tests for measuring muscle strength in persons with SIVD, having sufficient test re-test reliability. © 2018 John Wiley & Sons Ltd.
Reliability and Validity of the Behavioral Addiction Measure for Video Gaming.

PubMed

Sanders, James L; Williams, Robert J

2016-01-01

Most tests of video game addiction have weak construct validity and limited ability to correctly identify people in denial. The purpose of the present research was to investigate the reliability and validity of a new test of video game addiction (Behavioral Addiction Measure-Video Gaming [BAM-VG]) that was developed in part to address these deficiencies. Regular adult video gamers (n = 506) were recruited from a Canadian online panel and completed a survey containing three measures of excessive video gaming (BAM-VG; DSM-5 criteria for Internet Gaming Disorder [IGD]; and the IGD-20), as well as questions concerning extensiveness of video game involvement and self-report of problems associated with video gaming. One month later, they were reassessed for the purposes of establishing test-retest reliability. The BAM-VG demonstrated good internal consistency as well as 1 month test-retest reliability. Criterion-related validity was demonstrated by significant correlations with the following: time spent playing, self-identification of video game problems, and scores on other instruments designed to assess video game addiction (DSM-5 IGD, IGD-20). Consistent with the theory, principal component analysis identified two components underlying the BAM-VG that roughly correspond with impaired control and significant negative consequences deriving from this impaired control. Together with its excellent construct validity and other technical features, the BAM-VG represents a reliable and valid test of video game addiction.
Acromiohumeral distance measurement in rotator cuff tendinopathy: is there a reliable, clinically applicable method? A systematic review.

PubMed

McCreesh, Karen M; Crotty, James M; Lewis, Jeremy S

2015-03-01

Narrowing of the subacromial space has been noted as a common feature of rotator cuff (RC) tendinopathy. It has been implicated in the development of symptoms and forms the basis for some surgical and rehabilitation approaches. Various radiological methods have been used to measure the subacromial space, which is represented by a two-dimensional measurement of acromiohumeral distance (AHD). A reliable method of measurement could be used to assess the impact of rehabilitation or surgical interventions for RC tendinopathy; however, there are no published reviews assessing the reliability of AHD measurement. The aim of this review was to systematically assess the evidence for the intrarater and inter-rater reliability of radiological methods of measuring AHD, in order to identify the most reliable method for use in RC tendinopathy. An electronic literature search was carried out and studies describing the reliability of any radiological method of measuring AHD in either healthy or RC tendinopathy groups were included. Eighteen studies met the inclusion criteria and were appraised by two reviewers using the Quality Appraisal for reliability Studies checklist. Eight studies were deemed to be of high methodological quality. Study weaknesses included lack of tester blinding, inadequate description of tester experience, lack of inclusion of symptomatic populations, poor reporting of statistical methods and unclear diagnosis. There was strong evidence for the reliability of ultrasound for measuring AHD, with moderate evidence for MRI and CT measures and conflicting evidence for radiographic methods. Overall, there was lack of research in RC tendinopathy populations, with only six studies including participants with shoulder pain. The results support the reliability of ultrasound and CT or MRI for the measurement of AHD; however, more studies in symptomatic populations are required. The reliability of AHD measurement using radiographs has not been supported by the studies
Feasibility and Reliability of Body Composition Measurements in Adults with Severe Intellectual and Sensory Disabilities

ERIC Educational Resources Information Center

Waninge, A.; van der Weide, W.; Evenhuis, I. J.; van Wijck, R.; van der Schans, C. P.

2009-01-01

Background: Anthropometric measurements are widely used to reliably quantify body composition and to estimate risks of overweight in healthy subjects and in patients. However, information about the reliability of anthropometric measurements in subjects with severe intellectual and sensory disabilities is lacking. Objective: The purpose of this…
Interrater Reliability of mHealth App Rating Measures: Analysis of Top Depression and Smoking Cessation Apps

PubMed Central

Chan, Steven; Raynor, Geoffrey Stephen; Shwarts, Erik; Shanahan, Meghan; Landman, Adam B

2016-01-01

Background There are over 165,000 mHealth apps currently available to patients, but few have undergone an external quality review. Furthermore, no standardized review method exists, and little has been done to examine the consistency of the evaluation systems themselves. Objective We sought to determine which measures for evaluating the quality of mHealth apps have the greatest interrater reliability. Methods We identified 22 measures for evaluating the quality of apps from the literature. A panel of 6 reviewers reviewed the top 10 depression apps and 10 smoking cessation apps from the Apple iTunes App Store on these measures. Krippendorff’s alpha was calculated for each of the measures and reported by app category and in aggregate. Results The measure for interactiveness and feedback was found to have the greatest overall interrater reliability (alpha=.69). Presence of password protection (alpha=.65), whether the app was uploaded by a health care agency (alpha=.63), the number of consumer ratings (alpha=.59), and several other measures had moderate interrater reliability (alphas>.5). There was the least agreement over whether apps had errors or performance issues (alpha=.15), stated advertising policies (alpha=.16), and were easy to use (alpha=.18). There were substantial differences in the interrater reliabilities of a number of measures when they were applied to depression versus smoking apps. Conclusions We found wide variation in the interrater reliability of measures used to evaluate apps, and some measures are more robust across categories of apps than others. The measures with the highest degree of interrater reliability tended to be those that involved the least rater discretion. Clinical quality measures such as effectiveness, ease of use, and performance had relatively poor interrater reliability. Subsequent research is needed to determine consistent means for evaluating the performance of apps. Patients and clinicians should consider conducting their
Measuring awareness of financial skills: reliability and validity of a new measure.

PubMed

Cramer, K; Tuokko, H A; Mateer, C A; Hultsch, D F

2004-03-01

This paper examines the psychometric properties of a three-part (participant, informant, and performance) Measure for assessing Awareness of Financial Skills (MAFS). The MAFS was administered to 10 seniors with dementia and 25 well-functioning seniors, and their informants. Measures of cognitive functioning, social desirability, neuroticism, and perceived control were administered to each participant to allow for an assessment of validity. Internal consistency estimates for the participant and informant questionnaires were found to be 0.92 and 0.97, respectively. Convergent validity analysis indicated that performance on this measure was related to level of cognitive functioning, with higher level of unawareness associated with decreased cognitive ability. Discriminant validity analysis showed that performance on this measure was not related to social desirability or neuroticism. This study provides evidence that the MAFS is a reliable and valid tool for assessing awareness of financial skills in older adults.
Evaluation of Scale Reliability with Binary Measures Using Latent Variable Modeling

ERIC Educational Resources Information Center

Raykov, Tenko; Dimitrov, Dimiter M.; Asparouhov, Tihomir

2010-01-01

A method for interval estimation of scale reliability with discrete data is outlined. The approach is applicable with multi-item instruments consisting of binary measures, and is developed within the latent variable modeling methodology. The procedure is useful for evaluation of consistency of single measures and of sum scores from item sets…
The reliability and validity of hand-held refractometry water content measures of hydrogel lenses.

PubMed

Nichols, Jason J; Mitchell, G Lynn; Good, Gregory W

2003-06-01

To investigate within- and between-examiner reliability and validity of hand-held refractometry water content measures of hydrogel lenses. Nineteen lenses of various nominal water contents were examined by two examiners on two occasions separated by 1 hour. An Atago N2 hand-held refractometer was used for all water content measures. Lenses were presented in a random order to each examiner by a third party, and examiners were masked to any potential lens identifiers. Intraclass correlation coefficients (ICC), 95% limits of agreement, and Wilcoxon signed rank test were used to characterize the within- and between-examiner reliability and validity of lens water content measures. Within-examiner reliability was excellent (ICC, 0.97; 95% limits of agreement, -3.6% to +5.7%), and the inter-visit mean difference of 1.1 +/- 2.4% was not biased (p = 0.08). Between-examiner reliability was also excellent (ICC, 0.98; 95% limits of agreement, -4.1% to +3.9%). The mean difference between examiners was -0.1 +/- 2.1% (p = 0.83). The mean difference between the nominally reported water content and our water content measures was -2.1 +/- 1.7% (p < 0.001); the 95% limits of agreement for this difference were -5.4% to +1.1%. There is good reliability within and between examiners in measuring water content of hydrogel lenses. However, with our sample of lenses, examiners tended to overestimate the nominal water content of hydrogel lenses. As discussed, this bias may be associated with the Brix scale used in refractometry and is material dependent. Therefore, investigators may need to account for bias when measuring hydrogel lens water content via hand-held refractometry.

The Vanderbilt Holistic Face Processing Test: A short and reliable measure of holistic face processing

PubMed Central

Richler, Jennifer J.; Floyd, R. Jackie; Gauthier, Isabel

2014-01-01

Efforts to understand individual differences in high-level vision necessitate the development of measures that have sufficient reliability, which is generally not a concern in group studies. Holistic processing is central to research on face recognition and, more recently, to the study of individual differences in this area. However, recent work has shown that the most popular measure of holistic processing, the composite task, has low reliability. This is particularly problematic for the recent surge in interest in studying individual differences in face recognition. Here, we developed and validated a new measure of holistic face processing specifically for use in individual-differences studies. It avoids some of the pitfalls of the standard composite design and capitalizes on the idea that trial variability allows for better traction on reliability. Across four experiments, we refine this test and demonstrate its reliability. PMID:25228629
Compensation method for obtaining accurate, sub-micrometer displacement measurements of immersed specimens using electronic speckle interferometry.

PubMed

Fazio, Massimo A; Bruno, Luigi; Reynaud, Juan F; Poggialini, Andrea; Downs, J Crawford

2012-03-01

We proposed and validated a compensation method that accounts for the optical distortion inherent in measuring displacements on specimens immersed in aqueous solution. A spherically-shaped rubber specimen was mounted and pressurized on a custom apparatus, with the resulting surface displacements recorded using electronic speckle pattern interferometry (ESPI). Point-to-point light direction computation is achieved by a ray-tracing strategy coupled with customized B-spline-based analytical representation of the specimen shape. The compensation method reduced the mean magnitude of the displacement error induced by the optical distortion from 35% to 3%, and ESPI displacement measurement repeatability showed a mean variance of 16 nm at the 95% confidence level for immersed specimens. The ESPI interferometer and numerical data analysis procedure presented herein provide reliable, accurate, and repeatable measurement of sub-micrometer deformations obtained from pressurization tests of spherically-shaped specimens immersed in aqueous salt solution. This method can be used to quantify small deformations in biological tissue samples under load, while maintaining the hydration necessary to ensure accurate material property assessment.
Complexity of GPs' explanations about mental health problems: development, reliability, and validity of a measure

PubMed Central

Cape, John; Morris, Elena; Burd, Mary; Buszewicz, Marta

2008-01-01

Background How GPs understand mental health problems determines their treatment choices; however, measures describing GPs' thinking about such problems are not currently available. Aim To develop a measure of the complexity of GP explanations of common mental health problems and to pilot its reliability and validity. Design of study A qualitative development of the measure, followed by inter-rater reliability and validation pilot studies. Setting General practices in North London. Method Vignettes of simulated consultations with patients with mental health problems were videotaped, and an anchored measure of complexity of psychosocial explanation in response to these vignettes was developed. Six GPs, four psychologists, and two lay people viewed the vignettes. Their responses were rated for complexity, both using the anchored measure and independently by two experts in primary care mental health. In a second reliability and revalidation study, responses of 50 GPs to two vignettes were rated for complexity. The GPs also completed a questionnaire to determine their interest and training in mental health, and they completed the Depression Attitudes Questionnaire. Results Inter-rater reliability of the measure of complexity of explanation in both pilot studies was satisfactory (intraclass correlation coefficient = 0.78 and 0.72). The measure correlated with expert opinion as to what constitutes a complex explanation, and the responses of psychologists, GPs, and lay people differed in measured complexity. GPs with higher complexity scores had greater interest, more training in mental health, and more positive attitudes to depression. Conclusion Results suggest that the complexity of GPs' psychosocial explanations about common mental health problems can be reliably and validly assessed by this new standardised measure. PMID:18505616
Peer Evaluation Can Reliably Measure Local Knowledge

ERIC Educational Resources Information Center

Reyes-García, Victoria; Díaz-Reviriego, Isabel; Duda, Romain; Fernández-Llamazares, Álvaro; Gallois, Sandrine; Guèze, Maximilien; Napitupulu, Lucentezza; Pyhälä, Aili

2016-01-01

We assess the consistency of measures of individual local ecological knowledge obtained through peer evaluation against three standard measures: identification tasks, structured questionnaires, and self-reported skills questionnaires. We collected ethnographic information among the Baka (Congo), the Punan (Borneo), and the Tsimane' (Amazon) to…
Test-retest reliability of the safe driving behavior measure for community-dwelling elderly drivers.

PubMed

Song, Chiang-Soon; Lee, Joo-Hyun; Han, Sang-Woo

2016-06-01

[Purpose] The Safe Driving Behavior Measure (SDBM) is a self-report measurement tools that assesses the safe-driving behaviors of the elderly. The purpose of this study was to evaluate the test-retest reliability of the SDBM among community-dwelling elderly drivers. [Subjects and Methods] A total of sixty-one community-dwelling elderly were enrolled to investigate the reliability of the SDBM. The SDBM was assessed in two sessions that were conducted three days apart in a quiet and well-organized assessment room. That test-retest reliability of overall scores and three domain scores of the SDBM were statistically evaluated using intraclass correlation coefficients [ICC (2.1)]. Pearson correlation coefficients were used to quantify bivariate associations among the three domains of the SDBM. [Results] The SDBM demonstrated excellent rest-retest reliability for community-dwelling elderly drivers. The Cronbach alpha coefficients of the three domains of person-vehicle (0.979), person-environment (0.944), and person-vehicle-environment (0.971) of the SDBM indicate high internal consistency. [Conclusion] The results of this study suggest that the SDBM is a reliable measure for evaluating the safe- driving of automobiles by community-dwelling elderly, and is adequate for detecting changes in scores in clinical settings.
Reliability and group differences in quantitative cervicothoracic measures among individuals with and without chronic neck pain

PubMed Central

2012-01-01

Background Clinicians frequently rely on subjective categorization of impairments in mobility, strength, and endurance for clinical decision-making; however, these assessments are often unreliable and lack sensitivity to change. The objective of this study was to determine the inter-rater reliability, minimum detectable change (MDC), and group differences in quantitative cervicothoracic measures for individuals with and without chronic neck pain (NP). Methods Nineteen individuals with NP and 20 healthy controls participated in this case control study. Two physical therapists performed a 30-minute examination on separate days. A handheld dynamometer, gravity inclinometer, ruler, and stopwatch were used to quantify cervical range of motion (ROM), cervical muscle strength and endurance, and scapulothoracic muscle length and strength, respectively. Results Intraclass correlation coefficients for inter-rater reliability were significantly greater than zero for most impairment measures, with point estimates ranging from 0.45 to 0.93. The NP group exhibited reduced cervical ROM (P ≤ 0.012) and muscle strength (P ≤ 0.038) in most movement directions, reduced cervical extensor endurance (P = 0.029), and reduced rhomboid and middle trapezius muscle strength (P ≤ 0.049). Conclusions Results demonstrate the feasibility of obtaining objective cervicothoracic impairment measures with acceptable inter-rater agreement across time. The clinical utility of these measures is supported by evidence of impaired mobility, strength, and endurance among patients with NP, with corresponding MDC values that can help establish benchmarks for clinically significant change. PMID:23114092
Reliability and validity of cervical position measurements in individuals with and without chronic neck pain.

PubMed

Dunleavy, Kim; Neil, Joseph; Tallon, Allison; Adamo, Diane E

2015-09-01

The cervical range of motion device (CROM) has been shown to provide reliable forward head position (FHP) measurement when the upper cervical angle (UCA) is controlled. However, measurement without UCA standardization is reflective of habitual patterns. Criterion validity has not been reported. The purposes of this study were to establish: (1) criterion validity of CROM FHP and UCA compared to Optotrak data, (2) relative reliability and minimal detectable change (MDC95) in patients with and without cervical pain, and (3) to compare UCA and FHP in patients with and without pain in habitual postures. (1) Within-subjects single session concurrent criterion validity design. Simultaneous CROM and OP measurement was conducted in habitual sitting posture in 16 healthy young adults. (2) Reliability and MDC95 of UCA and FHP were calculated from three trials. (3) Values for adults over 35 years with cervical pain and age-matched healthy controls were compared. (1) Forward head position distances were moderately correlated and UCA angles were highly correlated. The mean (standard deviation) differences can be expected to vary between 1·48 cm (1·74) for FHP and -1·7 (2·46)° for UCA. (2) Reliability for CROM FHP measurements were good to excellent (no pain) and moderate (pain). Cervical range of motion FHP MDC95 was moderately low (no pain), and moderate (pain). Reliability for CROM UCA measurements was excellent and MDC95 low for both groups. There was no difference in FHP distances between the pain and no pain groups, UCA was significantly more extended in the pain group (P<0·05). Cervical range of motion FHP measurements were only moderately correlated with Optotrak data, and limits of agreement (LOA) and MDC95 were relatively large. There was also no difference in CROM FHP distance between older symptomatic and asymptomatic individuals. Cervical range of motion FHP measurement is therefore not recommended as a clinical outcome measure. Cervical range of motion UCA
Reliability and validity of cervical position measurements in individuals with and without chronic neck pain

PubMed Central

Neil, Joseph; Tallon, Allison; Adamo, Diane E.

2015-01-01

Objectives The cervical range of motion device (CROM) has been shown to provide reliable forward head position (FHP) measurement when the upper cervical angle (UCA) is controlled. However, measurement without UCA standardization is reflective of habitual patterns. Criterion validity has not been reported. The purposes of this study were to establish: (1) criterion validity of CROM FHP and UCA compared to Optotrak data, (2) relative reliability and minimal detectable change (MDC95) in patients with and without cervical pain, and (3) to compare UCA and FHP in patients with and without pain in habitual postures. Methods (1) Within-subjects single session concurrent criterion validity design. Simultaneous CROM and OP measurement was conducted in habitual sitting posture in 16 healthy young adults. (2) Reliability and MDC95 of UCA and FHP were calculated from three trials. (3) Values for adults over 35 years with cervical pain and age-matched healthy controls were compared. Results (1) Forward head position distances were moderately correlated and UCA angles were highly correlated. The mean (standard deviation) differences can be expected to vary between 1·48 cm (1·74) for FHP and −1·7 (2·46)° for UCA. (2) Reliability for CROM FHP measurements were good to excellent (no pain) and moderate (pain). Cervical range of motion FHP MDC95 was moderately low (no pain), and moderate (pain). Reliability for CROM UCA measurements was excellent and MDC95 low for both groups. There was no difference in FHP distances between the pain and no pain groups, UCA was significantly more extended in the pain group (P<0·05). Discussion Cervical range of motion FHP measurements were only moderately correlated with Optotrak data, and limits of agreement (LOA) and MDC95 were relatively large. There was also no difference in CROM FHP distance between older symptomatic and asymptomatic individuals. Cervical range of motion FHP measurement is therefore not recommended as a clinical outcome
Measurement of body temperature in adult patients: comparative study of accuracy, reliability and validity of different devices.

PubMed

Rubia-Rubia, J; Arias, A; Sierra, A; Aguirre-Jaime, A

2011-07-01

We compared a range of alternative devices with core body temperature measured at the pulmonary artery to identify the most valid and reliable instrument for measuring temperature in routine conditions in health services. 201 patients from the intensive care unit of the Candelaria University Hospital, Canary Islands, admitted to hospital between April 2006 and July 2007. All patients (or their families) gave informed consent. Readings from gallium-in-glass, reactive strip and digital in axilla, infra-red ear and frontal thermometers were compared with the pulmonary artery core temperature simultaneously. External factors suspected of having an influence on the differences were explored. The cut-off point readings for each thermometer were fixed for the maximum negative predictive value in comparison with the core temperature. The validity, reliability, accuracy, external influence, the waste they generated, ease of use, speed, durability, security, comfort and cost of each thermometer was evaluated. An ad hoc overall valuation score was obtained from these parameters for each instrument. For an error of ± 0.2°C and concordance with respect to fever, the gallium-in-glass thermometer gave the best results. The largest area under the receiver operating characteristic (ROC) curve is obtained by the digital axillar thermometer with probe (0.988 ± 0.007). The minimum difference between readings was given by the infrared ear thermometer, in comparison with the core temperature (-0.1 ± 0.3°C). Age, weight, level of conscience, male sex, environmental temperature and vaso-constrictor medication increases the difference in the readings and fever treatment reduces it, although this is not the same for all thermometers. The compact digital axillar thermometer and the digital thermometer with probe obtained the highest overall valuation score. If we only evaluate the aspects of validity, reliability, accuracy and external influence, the best thermometer would be the
Inter and intra-rater reliability of mobile device goniometer in measuring lumbar flexion range of motion.

PubMed

Bedekar, Nilima; Suryawanshi, Mayuri; Rairikar, Savita; Sancheti, Parag; Shyam, Ashok

2014-01-01

Evaluation of range of motion (ROM) is integral part of assessment of musculoskeletal system. This is required in health fitness and pathological conditions; also it is used as an objective outcome measure. Several methods are described to check spinal flexion range of motion. Different methods for measuring spine ranges have their advantages and disadvantages. Hence, a new device was introduced in this study using the method of dual inclinometer to measure lumbar spine flexion range of motion (ROM). To determine Intra and Inter-rater reliability of mobile device goniometer in measuring lumbar flexion range of motion. iPod mobile device with goniometer software was used. The part being measure i.e the back of the subject was suitably exposed. Subject was standing with feet shoulder width apart. Spinous process of second sacral vertebra S2 and T12 were located, these were used as the reference points and readings were taken. Three readings were taken for each: inter-rater reliability as well as the intra-rater reliability. Sufficient rest was given between each flexion movement. Intra-rater reliability using ICC was r=0.920 and inter-rater r=0.812 at CI 95%. Validity r=0.95. Mobile device goniometer has high intra-rater reliability. The inter-rater reliability was moderate. This device can be used to assess range of motion of spine flexion, representing uni-planar movement.
Measures of Reliability in Behavioral Observation: The Advantage of "Real Time" Data Acquisition.

ERIC Educational Resources Information Center

Hollenbeck, Albert R.; Slaby, Ronald G.

Two observers who were using an electronic digital data acquisition system were spot checked for reliability at random times over a four month period. Between-and within-observer reliability was assessed for frequency, duration, and duration-per-event measures of four infant behaviors. The results confirmed the problem of observer drift--the…
Reliability of cystometrically obtained intravesical pressures in patients with neurogenic bladders.

PubMed

Hess, Marika J; Lim, Lance; Yalla, Subbarao V

2002-01-01

Urodynamic studies in patients with neurogenic bladder detect and categorize neurourodynamic states, identify the risk for urologic sequelae, and determine the necessity for interventions. Because urodynamic studies serves as a prognostic indicator and guides patient management, pressure measurements during the study must accurately represent bladder function under physiologic conditions. Because nonphysiologic bladder filling used during conventional urodynamic studies may alter the bladder's accommodative properties, we studied how closely the intravesical pressures obtained before filling cystometry resembled those obtained during the filling phase of the cystometrogram. Twenty-two patients (21 men, 1 woman) with neurogenic bladders underwent standard urodynamic studies. A 16F triple-lumen catheter was inserted into the bladder, and the intravesical pressures were recorded (physiologic volume-specific pressures, PVSP). After emptying the bladder, an equal volume of normal saline solution was reinfused, and the pressures were recorded again (cystometric volume-specific pressure, CVSP). All patients underwent routine fluoroscopically assisted urodynamic testing. The PVSP and the CVSP were compared using the Wilcoxon signed ranks test. P value of .05 was significant. The mean PVSP was 14.5 cmH2O (range, 4-42 cmH2O) and mean CVSP was 20.6 cmH2O (range, 6-70 cmH2O). The CVSP was significantly higher than the PVSP (P = .01). Filling pressures during cystometry (CVSP) were significantly higher than the pressures measured at rest (PVSP). This study also suggests a strong correlation between PVSP and CVSP.
The reliability of non-invasive biophysical outcome measures for evaluating normal and hyperkeratotic foot skin.

PubMed

Hashmi, Farina; Wright, Ciaran; Nester, Christopher; Lam, Sharon

2015-01-01

Hyperkeratosis of foot skin is a common skin problem affecting people of different ages. The clinical presentation of this condition can range from dry flaky skin, which can lead to fissures, to hard callused skin which is often painful and debilitating. The purpose of this study was to test the reliability of certain non-invasive skin measurement devices on foot skin in normal and hyperkeratotic states, with a view to confirming their use as quantitative outcome measures in future clinical trials. Twelve healthy adult participants with a range of foot skin conditions (xerotic skin, heel fissures and plantar calluses) were recruited to the study. Measurements of normal and hyperkeratotic skin sites were taken using the following devices: Corneometer® CM 825, Cutometer® 580 MPA, Reviscometer® RVM 600, Visioline® VL 650 Quantiride® and Visioscan® VC 98, by two investigators on two consecutive days. The intra and inter rater reliability and standard error of measurement for each device was calculated. The data revealed the majority of the devices to be reliable measurement tools for normal and hyperkeratotic foot skin (ICC values > 0.6). The surface evaluation parameters for skin: SEsc and SEsm have greater reliability compared to the SEr measure. The Cutometer® is sensitive to soft tissue movement within the probe, therefore measurement of plantar soft tissue areas should be approached with caution. Reviscometer® measures on callused skin demonstrated an unusually high degree of error. These results confirm the intra and inter rater reliability of the Corneometer®, Cutometer®, Visioline® and Visioscan® in quantifying specific foot skin biophysical properties.
Measuring standing balance in multiple sclerosis: Further progress towards an automatic and reliable method in clinical practice.

PubMed

Keune, Philipp M; Young, William R; Paraskevopoulos, Ioannis T; Hansen, Sascha; Muenssinger, Jana; Oschmann, Patrick; Müller, Roy

2017-08-15

Balance deficits in multiple sclerosis (MS) are often monitored by means of observer-rated tests. These may provide reliable data, but may also be time-consuming, subject to inter-rater variability, and potentially insensitive to mild fluctuations throughout the clinical course. On the other hand, laboratory assessments are often not available. The Nintendo Wii Balance Board (WBB) may represent a low-cost solution. The purpose of the current study was to examine the methodological quality of WBB data in MS (internal consistency, test-retest reliability), convergent validity with observer-rated tests (Berg Balance Scale, BBS; Timed-Up and Go Test, TUG), and discriminative validity concerning clinical status (Expanded Disability Status Scale, EDSS). Standing balance was assessed with the WBB for 4min in 63 MS patients at two assessment points, four months apart. Additionally, patients were examined with the BBS, TUG and the EDSS. A period of 4min on the WBB provided data characterized by excellent internal consistency and test-retest reliability. Significant correlations between WBB data and results of the BBS and TUG were obtained after merely 2min on the board. An EDSS median-split revealed that higher EDSS values (>3) were associated with significantly increased postural sway on the WBB. WBB measures reflecting postural sway are methodologically robust in MS, involving excellent internal consistency and test-retest reliability. They are also characterized by convergent validity with other considerably lengthier observer-rated balance measures (BBS) and sensitive to broader clinical characteristics (EDSS). The WBB may hence represent an effective, easy-to-use monitoring tool for MS patients in clinical practice. Copyright © 2017 Elsevier B.V. All rights reserved.
The reliability of multidimensional neuropsychological measures: from alpha to omega.

PubMed

Watkins, Marley W

To demonstrate that Coefficient omega, a model-based estimate, is more a more appropriate index of reliability than coefficient alpha for the multidimensional scales that are commonly employed by neuropsychologists. As an illustration, a structural model of an overarching general factor and four first-order factors for the WAIS-IV based on the standardization sample of 2200 participants was identified and omega coefficients were subsequently computed for WAIS-IV composite scores. Alpha coefficients were ≥ .90 and omega coefficients ranged from .75 to .88 for WAIS-IV factor index scores, indicating that the blend of general and group factor variance in each index score created a reliable multidimensional composite. However, the amalgam of variance from general and group factors did not allow the precision of Full Scale IQ (FSIQ) and factor index scores to be disentangled. In contrast, omega hierarchical coefficients were low for all four factor index scores (.10-.41), indicating that most of the reliable variance of each factor index score was due to the general intelligence factor. In contrast, the omega hierarchical coefficient for the FSIQ score was .84. Meaningful interpretation of WAIS-IV factor index scores as unambiguous indicators of group factors is imprecise, thereby fostering unreliable identification of neurocognitive strengths and weaknesses, whereas the WAIS-IV FSIQ score can be interpreted as a reliable measure of general intelligence. It was concluded that neuropsychologists should base their clinical decisions on reliable scores as indexed by coefficient omega.
Validity and reliability of a Dutch version of the Foot and Ankle Ability Measure.

PubMed

Weel, Hanneke; Zwiers, Ruben; Azim, Donija; Sierevelt, Inger N; Haverkamp, Daniel; van Dijk, C Niek; Kerkhoffs, Gino M M J

2016-04-01

The aim of the study was to develop a Dutch language version of the Foot and Ankle Ability Measure (FAAM) and evaluate its measurement properties according to the consensus-based standards for the selection of health measurement instruments (COSMIN) definitions. A forward-backward translation procedure was performed and subsequently the Dutch version of the FAAM was evaluated for its reliability and validity in 369 patients with a variety of foot and ankle complaints. The reliability was assessed by calculating the intraclass correlation coefficients (ICC, test-retest reliability), Cronbach's alpha (internal consistency), the standard error of measurement and the minimal detectable change (MDC). Additionally, this was done for athletes. The construct validity was assessed by the use of Spearman's correlation coefficient between FAAM domains and similar and contradictory domains of the Foot and Ankle Outcome Score, Short Form 36 and the Numeric Rating Scale for pain. The ICC of the subscales ranged from 0.62 to 0.86. Cronbach's alpha's minimum was 0.97. At individual level, the MDC ranged from 23.9 to 44.7 and at group level from 2.77 to 4.32. In the subgroup of athletes, the reliability was higher. The hypothesized correlations of the construct validity were supported by an 80% confirmation rate. The Dutch version of the FAAM met adequate measurement properties, although the reliability is not optimal. The FAAM-Sport subscale is more useful in athletes and the FAAM-Sport % seems not to contribute. In athletes with various foot and ankle symptoms, the FAAM can be used for functional assessment and follow-up at group level. For the general population, the FAAM is less appropriate. Diagnostic study, Level I.
Alberta infant motor scale: reliability and validity when used on preterm infants in Taiwan.

PubMed

Jeng, S F; Yau, K I; Chen, L C; Hsiao, S F

2000-02-01

The goal of this study was to examine the reliability and validity of measurements obtained with the Alberta Infant Motor Scale (AIMS) for evaluation of preterm infants in Taiwan. Two independent groups of preterm infants were used to investigate the reliability (n=45) and validity (n=41) for the AIMS. In the reliability study, the AIMS was administered to the infants by a physical therapist, and infant performance was videotaped. The performance was then rescored by the same therapist and by 2 other therapists to examine the intrarater and interrater reliability. In the validity study, the AIMS and the Bayley Motor Scale were administered to the infants at 6 and 12 months of age to examine criterion-related validity. Intraclass correlation coefficients (ICCs) for intrarater and interrater reliability of measurements obtained with the AIMS were high (ICC=.97-.99). The AIMS scores correlated with the Bayley Motor Scale scores at 6 and 12 months (r=.78 and.90), although the AIMS scores at 6 months were only moderately predictive of the motor function at 12 months (r=.56). The results suggest that measurements obtained with the AIMS have acceptable reliability and concurrent validity but limited predictive value for evaluating preterm Taiwanese infants.
Dynamic footprint measurement collection technique and intrarater reliability: ink mat, paper pedography, and electronic pedography.

PubMed

Fascione, Jeanna M; Crews, Ryan T; Wrobel, James S

2012-01-01

Identifying the variability of footprint measurement collection techniques and the reliability of footprint measurements would assist with appropriate clinical foot posture appraisal. We sought to identify relationships between these measures in a healthy population. On 30 healthy participants, midgait dynamic footprint measurements were collected using an ink mat, paper pedography, and electronic pedography. The footprints were then digitized, and the following footprint indices were calculated with photo digital planimetry software: footprint index, arch index, truncated arch index, Chippaux-Smirak Index, and Staheli Index. Differences between techniques were identified with repeated-measures analysis of variance with post hoc test of Scheffe. In addition, to assess practical similarities between the different methods, intraclass correlation coefficients (ICCs) were calculated. To assess intrarater reliability, footprint indices were calculated twice on 10 randomly selected ink mat footprint measurements, and the ICC was calculated. Dynamic footprint measurements collected with an ink mat significantly differed from those collected with paper pedography (ICC, 0.85-0.96) and electronic pedography (ICC, 0.29-0.79), regardless of the practical similarities noted with ICC values (P = .00). Intrarater reliability for dynamic ink mat footprint measurements was high for the footprint index, arch index, truncated arch index, Chippaux-Smirak Index, and Staheli Index (ICC, 0.74-0.99). Footprint measurements collected with various techniques demonstrate differences. Interchangeable use of exact values without adjustment is not advised. Intrarater reliability of a single method (ink mat) was found to be high.
How to measure wisdom: content, reliability, and validity of five measures

PubMed Central

Glück, Judith; König, Susanne; Naschenweng, Katja; Redzanowski, Uwe; Dorner, Lara; Straßer, Irene; Wiedermann, Wolfgang

2013-01-01

Wisdom is a field of growing interest both inside and outside academic psychology, and researchers are increasingly interested in using measures of wisdom in their work. However, wisdom is a highly complex construct, and its various operationalizations are based on quite different definitions. Which measure a researcher chooses for a particular research project may have a strong influence on the results. This study compares four well-established measures of wisdom—the Self-Assessed Wisdom Scale (Webster, 2003, 2007), the Three-Dimensional Wisdom Scale (Ardelt, 2003), the Adult Self-Transcendence Inventory (Levenson et al., 2005), and the Berlin Wisdom Paradigm (Baltes and Smith, 1990; Baltes and Staudinger, 2000)—with respect to content, reliability, factorial structure, and construct validity (relationships to wisdom nomination, interview-based wisdom ratings, and correlates of wisdom). The sample consisted of 47 wisdom nominees and 123 control participants. While none of the measures performed “better” than the others by absolute standards, recommendations are given for researchers to select the most suitable measure for their substantive interests. In addition, a “Brief Wisdom Screening Scale” is introduced that contains those 20 items from the three self-report scales that were most highly correlated with the common factor across the scales. PMID:23874310
Reliability and Measurement Error of Tensiomyography to Assess Mechanical Muscle Function: A Systematic Review.

PubMed

Martín-Rodríguez, Saúl; Loturco, Irineu; Hunter, Angus M; Rodríguez-Ruiz, David; Munguia-Izquierdo, Diego

2017-12-01

Martín-Rodríguez, S, Loturco, I, Hunter, AM, Rodríguez-Ruiz, D, and Munguia-Izquierdo, D. Reliability and measurement error of tensiomyography to assess mechanical muscle function: A systematic review. J Strength Cond Res 31(12): 3524-3536, 2017-Interest in studying mechanical skeletal muscle function through tensiomyography (TMG) has increased in recent years. This systematic review aimed to (a) report the reliability and measurement error of all TMG parameters (i.e., maximum radial displacement of the muscle belly [Dm], contraction time [Tc], delay time [Td], half-relaxation time [½ Tr], and sustained contraction time [Ts]) and (b) to provide critical reflection on how to perform accurate and appropriate measurements for informing clinicians, exercise professionals, and researchers. A comprehensive literature search was performed of the Pubmed, Scopus, Science Direct, and Cochrane databases up to July 2017. Eight studies were included in this systematic review. Meta-analysis could not be performed because of the low quality of the evidence of some studies evaluated. Overall, the review of the 9 studies involving 158 participants revealed high relative reliability (intraclass correlation coefficient [ICC]) for Dm (0.91-0.99); moderate-to-high ICC for Ts (0.80-0.96), Tc (0.70-0.98), and ½ Tr (0.77-0.93); and low-to-high ICC for Td (0.60-0.98), independently of the evaluated muscles. In addition, absolute reliability (coefficient of variation [CV]) was low for all TMG parameters except for ½ Tr (CV = >20%), whereas measurement error indexes were high for this parameter. In conclusion, this study indicates that 3 of the TMG parameters (Dm, Td, and Tc) are highly reliable, whereas ½ Tr demonstrate insufficient reliability, and thus should not be used in future studies.

The reliability of dual-energy X-ray absorptiometry measurements of bone mineral density in the metatarsals.

PubMed

Fuller, Joel T; Archer, Jane; Buckley, Jonathan D; Tsiros, Margarita D; Thewlis, Dominic

2016-01-01

To investigate the reliability of a simple, efficient technique for measuring bone mineral density (BMD) in the metatarsals using dual-energy X-ray absorptiometry (DXA). BMD of the right foot of 32 trained male distance runners was measured using a DXA scanner with the foot in the plantar position. Separate regions of interest (ROI) were used to assess the BMD of each metatarsal shaft (1st-5th) for each participant. ROI analysis was repeated by the same investigator to determine within-scan intra-rater reliability and by a different investigator to determine within-scan inter-rater reliability. Repeat DXA scans were undertaken for ten participants to assess between-scan intra-rater reliability. Assessment of BMD was consistently most reliable for the first metatarsal across all domains of reliability assessed (intra-class correlation coefficient [ICC] ≥0.97; coefficient of variation [CV] ≤1.5%; limits of agreement [LOA] ≤4.2%). Reasonable levels of intra-rater reliability were also achieved for the second and fifth metatarsals (ICC ≥0.90; CV ≤4.2%; LOA ≤11.9%). Poorer levels of reliability were demonstrated for the third (ICC ≥0.64; CV ≤8.2%; LOA ≤23.6%) and fourth metatarsals (ICC ≥0.67; CV ≤9.6%; LOA ≤27.5%). BMD was greatest in the first and second metatarsals (P < 0.01). Reliable measurements of BMD were achieved for the first, second and fifth metatarsals.
Timeline historical review of income and financial transactions: a reliable assessment of personal finances.

PubMed

Black, Anne C; Serowik, Kristin L; Ablondi, Karen M; Rosen, Marc I

2013-01-01

The need for accurate and reliable information about income and resources available to individuals with psychiatric disabilities is critical for the assessment of need and evaluation of programs designed to alleviate financial hardship or affect finance allocation. Measurement of finances is ubiquitous in studies of economics, poverty, and social services. However, evidence has demonstrated that these measures often contain error. We compare the 1-week test-retest reliability of income and finance data from 24 adult psychiatric outpatients using assessment-as-usual (AAU) and a new instrument, the Timeline Historical Review of Income and Financial Transactions (THRIFT). Reliability estimates obtained with the THRIFT for Income (0.77), Expenses (0.91), and Debt (0.99) domains were significantly better than those obtained with AAU. Reliability estimates for Balance did not differ. THRIFT reduced measurement error and provided more reliable information than AAU for assessment of personal finances in psychiatric patients receiving Social Security benefits. The instrument also may be useful with other low-income groups.
Inter- and intraobserver reliability assessment of the axial trunk rotation: manual versus smartphone-aided measurement tools.

PubMed

Qiao, Jun; Xu, Leilei; Zhu, Zezhang; Zhu, Feng; Liu, Zhen; Qian, Bangping; Qiu, Yong

2014-10-11

Scoliogauge, has been developed for the measurement of ATR on iPhone smartphones. This study was to evaluate the reliability for the smartphone-aided ATR measurement method and to compare its reliability with that of the manual method. Sixty-four AIS patients with single thoracic or lumbar curve participated in this study. Of these patients, thirty-two patients had main thoracic scoliosis while other thirty-two had main thoracolumbar/lumbar scoliosis. Two spine surgeons performed the measurements with Scoliometer and Scoliogauge. The Scoliogauge measurements were conducted on an iPhone 4 smartphone. The intraclass correlation coefficient (ICC) 2-way mixed model on absolute agreement was used to analyze the reliability categorized according to regions: thoracic or lumbar, and Cobb angles: <20 degrees and >40 degrees. ICC < 0.40 is considered as poor, 0.40-0.59 as fair, 0.60-0.74 as good, and 0.75-1.00 as excellent. The overall intraobserver variability was 0.954 and the overall interobserver variability was 0.943 for the scoliometer set, whereas the intraobserver variability was 0.965 and interobserver variability was 0.964 for the scoliogauge set. Both the intraobserver and interobserver ICCs reached the excellent value in the 2 sets for both observers. The mean Cobb angle of thoracic curves in patients with main thoracic scoliosis was similar to that of lumbar curves in those with main thoracolumbar/lumbar scoliosis (35.7 degrees vs. 36.1 degrees). The intraobserver and interobserver reliability was similar between two groups (thoracic vs. lumbar) in the 2 sets. There were 21 patients having Cobb angles < 20 degrees, while 20 patients >40 degrees. The intraobserver and interobserver reliability was better in severe curve(>40 degrees) group. Smartphone-aided measurement for ATR showed excellent reliability, and the reliability of measurement with either scoliometer or scoliogauge could be influenced by Cobb angle that reliability was better for curves with
Accuracy and reliability of observational gait analysis data: judgments of push-off in gait after stroke.

PubMed

McGinley, Jennifer L; Goldie, Patricia A; Greenwood, Kenneth M; Olney, Sandra J

2003-02-01

Physical therapists routinely observe gait in clinical practice. The purpose of this study was to determine the accuracy and reliability of observational assessments of push-off in gait after stroke. Eighteen physical therapists and 11 subjects with hemiplegia following a stroke participated in the study. Measurements of ankle power generation were obtained from subjects following stroke using a gait analysis system. Concurrent videotaped gait performances were observed by the physical therapists on 2 occasions. Ankle power generation at push-off was scored as either normal or abnormal using two 11-point rating scales. These observational ratings were correlated with the measurements of peak ankle power generation. A high correlation was obtained between the observational ratings and the measurements of ankle power generation (mean Pearson r=.84). Interobserver reliability was moderately high (mean intraclass correlation coefficient [ICC (2,1)]=.76). Intraobserver reliability also was high, with a mean ICC (2,1) of.89 obtained. Physical therapists were able to make accurate and reliable judgments of push-off in videotaped gait of subjects following stroke using observational assessment. Further research is indicated to explore the accuracy and reliability of data obtained with observational gait analysis as it occurs in clinical practice.
Reliability of Pressure Ulcer Rates: How Precisely Can We Differentiate Among Hospital Units, and Does the Standard Signal-Noise Reliability Measure Reflect This Precision?

PubMed

Staggs, Vincent S; Cramer, Emily

2016-08-01

Hospital performance reports often include rankings of unit pressure ulcer rates. Differentiating among units on the basis of quality requires reliable measurement. Our objectives were to describe and apply methods for assessing reliability of hospital-acquired pressure ulcer rates and evaluate a standard signal-noise reliability measure as an indicator of precision of differentiation among units. Quarterly pressure ulcer data from 8,199 critical care, step-down, medical, surgical, and medical-surgical nursing units from 1,299 US hospitals were analyzed. Using beta-binomial models, we estimated between-unit variability (signal) and within-unit variability (noise) in annual unit pressure ulcer rates. Signal-noise reliability was computed as the ratio of between-unit variability to the total of between- and within-unit variability. To assess precision of differentiation among units based on ranked pressure ulcer rates, we simulated data to estimate the probabilities of a unit's observed pressure ulcer rate rank in a given sample falling within five and ten percentiles of its true rank, and the probabilities of units with ulcer rates in the highest quartile and highest decile being identified as such. We assessed the signal-noise measure as an indicator of differentiation precision by computing its correlations with these probabilities. Pressure ulcer rates based on a single year of quarterly or weekly prevalence surveys were too susceptible to noise to allow for precise differentiation among units, and signal-noise reliability was a poor indicator of precision of differentiation. To ensure precise differentiation on the basis of true differences, alternative methods of assessing reliability should be applied to measures purported to differentiate among providers or units based on quality. © 2016 The Authors. Research in Nursing & Health published by Wiley Periodicals, Inc. © 2016 The Authors. Research in Nursing & Health published by Wiley Periodicals, Inc.
Reliability of Pressure Ulcer Rates: How Precisely Can We Differentiate Among Hospital Units, and Does the Standard Signal‐Noise Reliability Measure Reflect This Precision?

PubMed Central

Cramer, Emily

2016-01-01

Abstract Hospital performance reports often include rankings of unit pressure ulcer rates. Differentiating among units on the basis of quality requires reliable measurement. Our objectives were to describe and apply methods for assessing reliability of hospital‐acquired pressure ulcer rates and evaluate a standard signal‐noise reliability measure as an indicator of precision of differentiation among units. Quarterly pressure ulcer data from 8,199 critical care, step‐down, medical, surgical, and medical‐surgical nursing units from 1,299 US hospitals were analyzed. Using beta‐binomial models, we estimated between‐unit variability (signal) and within‐unit variability (noise) in annual unit pressure ulcer rates. Signal‐noise reliability was computed as the ratio of between‐unit variability to the total of between‐ and within‐unit variability. To assess precision of differentiation among units based on ranked pressure ulcer rates, we simulated data to estimate the probabilities of a unit's observed pressure ulcer rate rank in a given sample falling within five and ten percentiles of its true rank, and the probabilities of units with ulcer rates in the highest quartile and highest decile being identified as such. We assessed the signal‐noise measure as an indicator of differentiation precision by computing its correlations with these probabilities. Pressure ulcer rates based on a single year of quarterly or weekly prevalence surveys were too susceptible to noise to allow for precise differentiation among units, and signal‐noise reliability was a poor indicator of precision of differentiation. To ensure precise differentiation on the basis of true differences, alternative methods of assessing reliability should be applied to measures purported to differentiate among providers or units based on quality. © 2016 The Authors. Research in Nursing & Health published by Wiley Periodicals, Inc. PMID:27223598
Ultrasonographic measurements of lower trapezius muscle thickness at rest and during isometric contraction: a reliability study.

PubMed

Talbott, Nancy R; Witt, Dexter W

2014-07-01

The purpose of this study was to determine the intra-rater reliability and inter-rater reliability of ultrasound imaging (USI) thickness measurements of the lower trapezius (LT) at rest and during active contractions when the transverse process and the lamina were used as reference sites for the measurement process. Twenty healthy individuals between the ages of 22 and 32 years volunteered. With the subject prone and the shoulder in 145° of abduction, images of the LT were taken bilaterally by one examiner as the subject: (1) rested; (2) actively held the test position; and (3) actively held the test position while holding a weight. Ten subjects returned and testing was repeated by the same examiner and by a second examiner. LT thickness measurements were recorded at the level of the transverse process and at the level of the lamina. Intra-class correlation coefficients (ICC) for within session intra-rater reliability (ICC3,3) ranged from 0.951 to 0.986 for both measurement sites while between session intra-rater reliability (ICC3,2) ranged from 0.935 to 0.962. Within session inter-rater reliability (ICC2,2) ranged from 0.934 to 0.973. USI can be used to reliably measure LT thickness at rest, during active contraction and during active contraction when holding a weight. The described protocol can be utilized during shoulder examinations to provide an additional assessment tool for monitoring changes in LT thickness.
Three-dimensional and thermal surface imaging produces reliable measures of joint shape and temperature: a potential tool for quantifying arthritis

PubMed Central

Spalding, Steven J; Kwoh, C Kent; Boudreau, Robert; Enama, Joseph; Lunich, Julie; Huber, Daniel; Denes, Louis; Hirsch, Raphael

2008-01-01

Introduction The assessment of joints with active arthritis is a core component of widely used outcome measures. However, substantial variability exists within and across examiners in assessment of these active joint counts. Swelling and temperature changes, two qualities estimated during active joint counts, are amenable to quantification using noncontact digital imaging technologies. We sought to explore the ability of three dimensional (3D) and thermal imaging to reliably measure joint shape and temperature. Methods A Minolta 910 Vivid non-contact 3D laser scanner and a Meditherm med2000 Pro Infrared camera were used to create digital representations of wrist and metacarpalphalangeal (MCP) joints. Specialized software generated 3 quantitative measures for each joint region: 1) Volume; 2) Surface Distribution Index (SDI), a marker of joint shape representing the standard deviation of vertical distances from points on the skin surface to a fixed reference plane; 3) Heat Distribution Index (HDI), representing the standard error of temperatures. Seven wrists and 6 MCP regions from 5 subjects with arthritis were used to develop and validate 3D image acquisition and processing techniques. HDI values from 18 wrist and 9 MCP regions were obtained from 17 patients with active arthritis and compared to data from 10 wrist and MCP regions from 5 controls. Standard deviation (SD), coefficient of variation (CV), and intraclass correlation coefficients (ICC) were calculated for each quantitative measure to establish their reliability. CVs for volume and SDI were <1.3% and ICCs were greater than 0.99. Results Thermal measures were less reliable than 3D measures. However, significant differences were observed between control and arthritis HDI values. Two case studies of arthritic joints demonstrated quantifiable changes in swelling and temperature corresponding with changes in symptoms and physical exam findings. Conclusion 3D and thermal imaging provide reliable measures of
Construction of reliable protein-protein interaction networks with a new interaction generality measure.

PubMed

Saito, Rintaro; Suzuki, Harukazu; Hayashizaki, Yoshihide

2003-04-12

Recent screening techniques have made large amounts of protein-protein interaction data available, from which biologically important information such as the function of uncharacterized proteins, the existence of novel protein complexes, and novel signal-transduction pathways can be discovered. However, experimental data on protein interactions contain many false positives, making these discoveries difficult. Therefore computational methods of assessing the reliability of each candidate protein-protein interaction are urgently needed. We developed a new 'interaction generality' measure (IG2) to assess the reliability of protein-protein interactions using only the topological properties of their interaction-network structure. Using yeast protein-protein interaction data, we showed that reliable protein-protein interactions had significantly lower IG2 values than less-reliable interactions, suggesting that IG2 values can be used to evaluate and filter interaction data to enable the construction of reliable protein-protein interaction networks.
Reliability of center of pressure measures within and between sessions in individuals post-stroke and healthy controls.

PubMed

Gray, Vicki L; Ivanova, Tanya D; Garland, S Jayne

2014-01-01

Knowing the reliability of the center of pressure (COP) is important for interpreting balance deficits post-stroke, especially when the balance deficits can necessitate the use of short duration trials. The novel aspect of this reliability study was to examine the center of pressure measures using two adjacent force platforms between and within sessions in stroke and controls. After stroke, it is important to understand the contribution of the paretic and non-paretic leg to the motor control of standing balance. Because there is a considerable body of knowledge on COP reliability on a single platform, we chose to examine reliability using two adjacent platforms which has not been examined previously in stroke. Twenty participants post-stroke and 22 controls performed an arm raise, load drop and quiet stance balance task while standing on two adjacent force platforms, on two separate days. Intraclass correlations coefficient (ICC2,1) and percentage standard error of measurement (SEM%) were calculated for COP velocity, ellipse area, anterior-posterior (AP) displacement, and medial-lateral (ML) displacement. Between sessions, COP velocity was the most reliable with high ICCs and low SEM% across groups and tasks and ellipse area was less reliable with low ICCs across groups and tasks. COP measures were less reliable during the arm raise than load drop post-stroke. Within session reliability was high for COP velocity and ML displacement requiring no more than six trials across tasks. The COP velocity was the most reliable measure with high ICCs between sessions and the high reliability was achieved with fewer trials in both groups in a single session. Copyright © 2014 Elsevier B.V. All rights reserved.
Harmonization Process and Reliability Assessment of Anthropometric Measurements in the Elderly EXERNET Multi-Centre Study

PubMed Central

Gómez-Cabello, Alba; Vicente-Rodríguez, Germán; Albers, Ulrike; Mata, Esmeralda; Rodriguez-Marroyo, Jose A.; Olivares, Pedro R.; Gusi, Narcis; Villa, Gerardo; Aznar, Susana; Gonzalez-Gross, Marcela; Casajús, Jose A.; Ara, Ignacio

2012-01-01

Background The elderly EXERNET multi-centre study aims to collect normative anthropometric data for old functionally independent adults living in Spain. Purpose To describe the standardization process and reliability of the anthropometric measurements carried out in the pilot study and during the final workshop, examining both intra- and inter-rater errors for measurements. Materials and Methods A total of 98 elderly from five different regions participated in the intra-rater error assessment, and 10 different seniors living in the city of Toledo (Spain) participated in the inter-rater assessment. We examined both intra- and inter-rater errors for heights and circumferences. Results For height, intra-rater technical errors of measurement (TEMs) were smaller than 0.25 cm. For circumferences and knee height, TEMs were smaller than 1 cm, except for waist circumference in the city of Cáceres. Reliability for heights and circumferences was greater than 98% in all cases. Inter-rater TEMs were 0.61 cm for height, 0.75 cm for knee-height and ranged between 2.70 and 3.09 cm for the circumferences measured. Inter-rater reliabilities for anthropometric measurements were always higher than 90%. Conclusion The harmonization process, including the workshop and pilot study, guarantee the quality of the anthropometric measurements in the elderly EXERNET multi-centre study. High reliability and low TEM may be expected when assessing anthropometry in elderly population. PMID:22860013
Measuring Eccentric Strength of the Shoulder External Rotators Using a Handheld Dynamometer: Reliability and Validity

PubMed Central

Johansson, Fredrik R.; Skillgate, Eva; Lapauw, Mattis L.; Clijmans, Dorien; Deneulin, Valentijn P.; Palmans, Tanneke; Engineer, Human Kinetic; Cools, Ann M.

2015-01-01

Context Shoulder strength assessment plays an important role in the clinical examination of the shoulder region. Eccentric strength measurements are of special importance in guiding the clinician in injury prevention or return-to-play decisions after injury. Objective To examine the absolute and relative reliability and validity of a standardized eccentric strength-measurement protocol for the glenohumeral external rotators. Design Descriptive laboratory study. Setting Testing environment at the Department of Rehabilitation Sciences and Physiotherapy of Ghent University, Belgium. Patients or Other Participants Twenty-five healthy participants (9 men and 16 women) without any history of shoulder pain were tested by 2 independent assessors using a handheld dynamometer (HHD) and underwent an isokinetic testing procedure. Intervention(s) The clinical protocol used an HHD, a DynaPort accelerometer to measure acceleration and angular velocity of testing 30°/s over 90° of range of motion, and a Biodex dynamometer to measure isokinetic activity. Main Outcome Measure(s) Three eccentric strength measurements: (1) tester 1 with the HHD, (2) tester 2 with the HHD, and (3) Biodex isokinetic strength measurement. Results The intratester reliability was excellent (0.879 and 0.858), whereas the intertester reliability was good, with an intraclass correlation coefficient between testers of 0.714. Pearson product moment correlation coefficients of 0.78 and 0.70 were noted between the HHD and the isokinetic data, showing good validity of this new procedure. Conclusions Standardized eccentric rotator cuff strength can be tested and measured in the clinical setting with good-to-excellent reliability and validity using an HHD. PMID:25974381
Accuracy and reliability testing of two methods to measure internal rotation of the glenohumeral joint.

PubMed

Hall, Justin M; Azar, Frederick M; Miller, Robert H; Smith, Richard; Throckmorton, Thomas W

2014-09-01

We compared accuracy and reliability of a traditional method of measurement (most cephalad vertebral spinous process that can be reached by a patient with the extended thumb) to estimates made with the shoulder in abduction to determine if there were differences between the two methods. Six physicians with fellowship training in sports medicine or shoulder surgery estimated measurements in 48 healthy volunteers. Three were randomly chosen to make estimates of both internal rotation measurements for each volunteer. An independent observer made objective measurements on lateral scoliosis films (spinous process method) or with a goniometer (abduction method). Examiners were blinded to objective measurements as well as to previous estimates. Intraclass coefficients for interobserver reliability for the traditional method averaged 0.75, indicating good agreement among observers. The difference in vertebral level estimated by the examiner and the actual radiographic level averaged 1.8 levels. The intraclass coefficient for interobserver reliability for the abduction method averaged 0.81 for all examiners, indicating near-perfect agreement. Confidence intervals indicated that estimates were an average of 8° different from the objective goniometer measurements. Pearson correlation coefficients of intraobserver reliability for the abduction method averaged 0.94, indicating near-perfect agreement within observers. Confidence intervals demonstrated repeated estimates between 5° and 10° of the original. Internal rotation estimates made with the shoulder abducted demonstrated interobserver reliability superior to that of spinous process estimates, and reproducibility was high. On the basis of this finding, we now take glenohumeral internal rotation measurements with the shoulder in abduction and use a goniometer to maximize accuracy and objectivity. Copyright © 2014 Journal of Shoulder and Elbow Surgery Board of Trustees. Published by Mosby, Inc. All rights reserved.
Measuring the Environment for Friendliness Toward Physical Activity: A Comparison of the Reliability of 3 Questionnaires

PubMed Central

Brownson, Ross C.; Chang, Jen Jen; Eyler, Amy A.; Ainsworth, Barbara E.; Kirtland, Karen A.; Saelens, Brian E.; Sallis, James F.

2004-01-01

Objectives. We tested the reliability of 3 instruments that assessed social and physical environments. Methods. We conducted a test–retest study among US adults (n = 289). We used telephone survey methods to measure suitableness of the perceived (vs objective) environment for recreational physical activity and nonmotorized transportation. Results. Most questions in our surveys that attempted to measure specific characteristics of the built environment showed moderate to high reliability. Questions about the social environment showed lower reliability than those that assessed the physical environment. Certain blocks of questions appeared to be selectively more reliable for urban or rural respondents. Conclusions. Despite differences in content and in response formats, all 3 surveys showed evidence of reliability, and most items are now ready for use in research and in public health surveillance. PMID:14998817
Reliability and Validity of an Internet-based Questionnaire Measuring Lifetime Physical Activity

PubMed Central

De Vera, Mary A.; Ratzlaff, Charles; Doerfling, Paul; Kopec, Jacek

2010-01-01

Lifetime exposure to physical activity is an important construct for evaluating associations between physical activity and disease outcomes, given the long induction periods in many chronic diseases. The authors' objective in this study was to evaluate the measurement properties of the Lifetime Physical Activity Questionnaire (L-PAQ), a novel Internet-based, self-administered instrument measuring lifetime physical activity, among Canadian men and women in 2005–2006. Reliability was examined using a test-retest study. Validity was examined in a 2-part study consisting of 1) comparisons with previously validated instruments measuring similar constructs, the Lifetime Total Physical Activity Questionnaire (LT-PAQ) and the Chasan-Taber Physical Activity Questionnaire (CT-PAQ), and 2) a priori hypothesis tests of constructs measured by the L-PAQ. The L-PAQ demonstrated good reliability, with intraclass correlation coefficients ranging from 0.67 (household activity) to 0.89 (sports/recreation). Comparison between the L-PAQ and the LT-PAQ resulted in Spearman correlation coefficients ranging from 0.41 (total activity) to 0.71 (household activity); comparison between the L-PAQ and the CT-PAQ yielded coefficients of 0.58 (sports/recreation), 0.56 (household activity), and 0.50 (total activity). L-PAQ validity was further supported by observed relations between the L-PAQ and sociodemographic variables, consistent with a priori hypotheses. Overall, the L-PAQ is a useful instrument for assessing multiple domains of lifetime physical activity with acceptable reliability and validity. PMID:20876666
Reliability and validity of an internet-based questionnaire measuring lifetime physical activity.

PubMed

De Vera, Mary A; Ratzlaff, Charles; Doerfling, Paul; Kopec, Jacek

2010-11-15

Lifetime exposure to physical activity is an important construct for evaluating associations between physical activity and disease outcomes, given the long induction periods in many chronic diseases. The authors' objective in this study was to evaluate the measurement properties of the Lifetime Physical Activity Questionnaire (L-PAQ), a novel Internet-based, self-administered instrument measuring lifetime physical activity, among Canadian men and women in 2005-2006. Reliability was examined using a test-retest study. Validity was examined in a 2-part study consisting of 1) comparisons with previously validated instruments measuring similar constructs, the Lifetime Total Physical Activity Questionnaire (LT-PAQ) and the Chasan-Taber Physical Activity Questionnaire (CT-PAQ), and 2) a priori hypothesis tests of constructs measured by the L-PAQ. The L-PAQ demonstrated good reliability, with intraclass correlation coefficients ranging from 0.67 (household activity) to 0.89 (sports/recreation). Comparison between the L-PAQ and the LT-PAQ resulted in Spearman correlation coefficients ranging from 0.41 (total activity) to 0.71 (household activity); comparison between the L-PAQ and the CT-PAQ yielded coefficients of 0.58 (sports/recreation), 0.56 (household activity), and 0.50 (total activity). L-PAQ validity was further supported by observed relations between the L-PAQ and sociodemographic variables, consistent with a priori hypotheses. Overall, the L-PAQ is a useful instrument for assessing multiple domains of lifetime physical activity with acceptable reliability and validity.
The reliability of a severity rating scale to measure stuttering in an unfamiliar language.

PubMed

Hoffman, Laura; Wilson, Linda; Copley, Anna; Hewat, Sally; Lim, Valerie

2014-06-01

With increasing multiculturalism, speech-language pathologists (SLPs) are likely to work with stuttering clients from linguistic backgrounds that differ from their own. No research to date has estimated SLPs' reliability when measuring severity of stuttering in an unfamiliar language. Therefore, this study was undertaken to estimate the reliability of SLPs' use of a 9-point severity rating (SR) scale, to measure severity of stuttering in a language that was different from their own. Twenty-six Australian SLPs rated 20 speech samples (10 Australian English [AE] and 10 Mandarin) of adults who stutter using a 9-point SR scale on two separate occasions. Judges showed poor agreement when using the scale to measure stuttering in Mandarin samples. Results also indicated that 50% of individual judges were unable to reliably measure the severity of stuttering in AE. The results highlight the need for (a) SLPs to develop intra- and inter-judge agreement when using the 9-point SR scale to measure severity of stuttering in their native language (in this case AE) and in unfamiliar languages; and (b) research into the development and evaluation of practice and/or training packages to assist SLPs to do so.
The Brighton musculoskeletal Patient-Reported Outcome Measure (BmPROM): An assessment of validity, reliability, and responsiveness.

PubMed

Bryant, Elizabeth; Murtagh, Shemane; Finucane, Laura; McCrum, Carol; Mercer, Christopher; Smith, Toby; Canby, Guy; Rowe, David A; Moore, Ann P

2018-05-11

In response for the need of a freely available, stand-alone, validated outcome measure for use within musculoskeletal (MSK) physiotherapy practice, sensitive enough to measure clinical effectiveness, we developed an MSK patient reported outcome measure. This study examined the validity and reliability of the newly developed Brighton musculoskeletal Patient-Reported Outcome Measure (BmPROM) within physiotherapy outpatient settings. Two hundred twenty-four patients attending physiotherapy outpatient departments in South East England with an MSK condition participated in this study. The BmPROM was assessed for user friendliness (rated feedback, N = 224), reliability (internal consistency and test-retest reliability, n = 42), validity (internal and external construct validity, N = 224), and responsiveness (internal, n = 25). Exploratory factor analysis indicated that a two-factor model provides a good fit to the data. Factors were representative of "Functionality" and "Wellbeing". Correlations observed between the BmPROM and SF-36 domains provided evidence of convergent validity. Reliability results indicated that both subscales were internally consistent with alphas above the acceptable limits for both "Functionality" (α = .85, 95% CI [.81, .88]) and 'Wellbeing' (α = .80, 95% CI [.75, .84]). Test-retest analyses (n = 42) demonstrated a high degree of reliability between "Functionality" (ICC = .84; 95% CI [.72, .91]) and "Wellbeing" scores (ICC = .84; 95% CI [.72, .91]). Further examination of test-retest reliability through the Bland-Altman analysis demonstrated that the difference between "Functionality" and "Wellbeing" test scores did not vary as a function of absolute test score. Large treatment effect sizes were found for both subscales (Functionality d = 1.10; Wellbeing 1.03). The BmPROM is a reliable and valid outcome measure for use in evaluating physiotherapy treatment of MSK conditions. Copyright © 2018 John Wiley & Sons, Ltd.
The reliability and reproducibility of cephalometric measurements: a comparison of conventional and digital methods

PubMed Central

AlBarakati, SF; Kula, KS; Ghoneima, AA

2012-01-01

Objective The aim of this study was to assess the reliability and reproducibility of angular and linear measurements of conventional and digital cephalometric methods. Methods A total of 13 landmarks and 16 skeletal and dental parameters were defined and measured on pre-treatment cephalometric radiographs of 30 patients. The conventional and digital tracings and measurements were performed twice by the same examiner with a 6 week interval between measurements. The reliability within the method was determined using Pearson's correlation coefficient (r2). The reproducibility between methods was calculated by paired t-test. The level of statistical significance was set at p < 0.05. Results All measurements for each method were above 0.90 r2 (strong correlation) except maxillary length, which had a correlation of 0.82 for conventional tracing. Significant differences between the two methods were observed in most angular and linear measurements except for ANB angle (p = 0.5), angle of convexity (p = 0.09), anterior cranial base (p = 0.3) and the lower anterior facial height (p = 0.6). Conclusion In general, both methods of conventional and digital cephalometric analysis are highly reliable. Although the reproducibility of the two methods showed some statistically significant differences, most differences were not clinically significant. PMID:22184624
The reliability of a simplified water displacement instrument: a method for measuring arm volume.

PubMed

Sagen, Ase; Kåresen, Rolf; Risberg, May Arna

2005-01-01

To present a new water displacement measurement, the Simplified Water Displacement Instrument (SWDI), and to evaluate its intra- and intertester reliability. Reliability design. Hospital setting. Fifty-six healthy people were studied. Intratester reliability was evaluated once a week for 4 weeks in 20 women and 10 men. Intertester reliability was assessed by 2 physical therapists in 26 people. Not applicable. Coefficients of variation (CVs) and intraclass correlation coefficients (ICCs). The intratester reliability showed a CV range of 2.2% to 2.6% and an ICC range of .98 to .99. The intertester reliability showed a CV of 1.3% and an ICC of .99. There was a significant increase in arm volume in men compared with women. There were no significant differences in changes in volume over the 4 weeks. There was a significant greater right arm volume (3.3%) among the right-handed subjects (P<.001). Both intra- and intertester reliability were satisfactory for the SWDI.

Reliability of Eustachian tube function measurements in a hypobaric and hyperbaric pressure chamber.

PubMed

Meyer, M F; Jansen, S; Mordkovich, O; Hüttenbrink, K-B; Beutner, D

2017-12-01

Measurement of the Eustachian tube (ET) function is a challenge. The demand for a precise and meaningful diagnostic tool increases-especially because more and more operative therapies are being offered without objective evidence. The measurement of the ET function by continuous impedance recording in a pressure chamber is an established method, although the reliability of the measurements is still unclear. Twenty-five participants (50 ears) were exposed to phases of compression and decompression in a hypo- and hyperbaric pressure chamber. The ET function reflecting parameters-ET opening pressure (ETOP), ET opening duration (ETOD) and ET opening frequency (ETOF)-were determined under exactly the same preconditions three times in a row. The intraclass correlation coefficient (ICC) and Bland and Altman plot were used to assess test-retest reliability. ICCs revealed a high correlation for ETOP and ETOF in phases of decompression (passive equalisation) as well as ETOD and ETOP in phases of compression (active induced equalisation). Very high correlation could be shown for ETOD in decompression and ETOF in compression phases. The Bland and Altman graphs could show that measurements provide results within a 95 % confidence interval in compression and decompression phases. We conclude that measurements in a pressure chamber are a very valuable tool in terms of estimating the ET opening and closing function. Measurements show some variance comparing participants, but provide reliable results within a 95 % confidence interval in retest. This study is the basis for enabling efficacy measurements of ET treatment modalities. © 2017 John Wiley & Sons Ltd.
The Reliability and Validity of Discrete and Continuous Measures of Psychopathology: A Quantitative Review

ERIC Educational Resources Information Center

Markon, Kristian E.; Chmielewski, Michael; Miller, Christopher J.

2011-01-01

In 2 meta-analyses involving 58 studies and 59,575 participants, we quantitatively summarized the relative reliability and validity of continuous (i.e., dimensional) and discrete (i.e., categorical) measures of psychopathology. Overall, results suggest an expected 15% increase in reliability and 37% increase in validity through adoption of a…
Reliability of recurrence quantification analysis measures of the center of pressure during standing in individuals with musculoskeletal disorders.

PubMed

Mazaheri, Masood; Negahban, Hossein; Salavati, Mahyar; Sanjari, Mohammad Ali; Parnianpour, Mohamad

2010-09-01

Although the application of nonlinear tools including recurrence quantification analysis (RQA) has increasingly grown in the recent years especially in balance-disordered populations, there have been few studies which determine their measurement properties. Therefore, a methodological study was performed to estimate the intersession and intrasession reliability of some dynamic features provided by RQA for nonlinear analysis of center of pressure (COP) signals recorded during quiet standing in a sample of patients with musculoskeletal disorders (MSDs) including low back pain (LBP), anterior cruciate ligament (ACL) injury and functional ankle instability (FAI). The subjects completed postural measurements with three levels of difficulty (rigid surface-eyes open, rigid surface-eyes closed, and foam surface-eyes closed). Four RQA measures (% recurrence, % determinism, entropy, and trend) were extracted from the recurrence plot. Relative reliability of these measures was assessed using intraclass correlation coefficient and absolute reliability using standard error of measurement and coefficient of variation. % Determinism and entropy were the most reliable features of RQA for the both intersession and intrasession reliability measures. High level of reliability of % determinism and entropy in this preliminary investigation may show their clinical promise for discriminative and evaluative purposes of balance performance. 2010 IPEM. Published by Elsevier Ltd. All rights reserved.
Reliability of light microscopy and a computer-assisted replica measurement technique for evaluating the fit of dental copings.

PubMed

Rudolph, Heike; Ostertag, Silke; Ostertag, Michael; Walter, Michael H; Luthardt, Ralph Gunnar; Kuhn, Katharina

2018-02-01

The aim of this in vitro study was to assess the reliability of two measurement systems for evaluating the marginal and internal fit of dental copings. Sixteen CAD/CAM titanium copings were produced for a prepared maxillary canine. To modify the CAD surface model using different parameters (data density; enlargement in different directions), varying fit was created. Five light-body silicone replicas representing the gap between the canine and the coping were made for each coping and for each measurement method: (1) light microscopy measurements (LMMs); and (2) computer-assisted measurements (CASMs) using an optical digitizing system. Two investigators independently measured the marginal and internal fit using both methods. The inter-rater reliability [intraclass correlation coefficient (ICC)] and agreement [Bland-Altman (bias) analyses]: mean of the differences (bias) between two measurements [the closer to zero the mean (bias) is, the higher the agreement between the two measurements] were calculated for several measurement points (marginal-distal, marginal-buccal, axial-buccal, incisal). For the LMM technique, one investigator repeated the measurements to determine repeatability (intra-rater reliability and agreement). For inter-rater reliability, the ICC was 0.848-0.998 for LMMs and 0.945-0.999 for CASMs, depending on the measurement point. Bland-Altman bias was -15.7 to 3.5 μm for LMMs and -3.0 to 1.9 μm for CASMs. For LMMs, the marginal-distal and marginal-buccal measurement points showed the lowest ICC (0.848/0.978) and the highest bias (-15.7 μm/-7.6 μm). With the intra-rater reliability and agreement (repeatability) for LMMs, the ICC was 0.970-0.998 and bias was -1.3 to 2.3 μm. LMMs showed lower interrater reliability and agreement at the marginal measurement points than CASMs, which indicates a more subjective influence with LMMs at these measurement points. The values, however, were still clinically acceptable. LMMs showed very high intra
Reliability of light microscopy and a computer-assisted replica measurement technique for evaluating the fit of dental copings

PubMed Central

Rudolph, Heike; Ostertag, Silke; Ostertag, Michael; Walter, Michael H.; LUTHARDT, Ralph Gunnar; Kuhn, Katharina

2018-01-01

Abstract The aim of this in vitro study was to assess the reliability of two measurement systems for evaluating the marginal and internal fit of dental copings. Material and Methods Sixteen CAD/CAM titanium copings were produced for a prepared maxillary canine. To modify the CAD surface model using different parameters (data density; enlargement in different directions), varying fit was created. Five light-body silicone replicas representing the gap between the canine and the coping were made for each coping and for each measurement method: (1) light microscopy measurements (LMMs); and (2) computer-assisted measurements (CASMs) using an optical digitizing system. Two investigators independently measured the marginal and internal fit using both methods. The inter-rater reliability [intraclass correlation coefficient (ICC)] and agreement [Bland-Altman (bias) analyses]: mean of the differences (bias) between two measurements [the closer to zero the mean (bias) is, the higher the agreement between the two measurements] were calculated for several measurement points (marginal-distal, marginal-buccal, axial-buccal, incisal). For the LMM technique, one investigator repeated the measurements to determine repeatability (intra-rater reliability and agreement). Results For inter-rater reliability, the ICC was 0.848-0.998 for LMMs and 0.945-0.999 for CASMs, depending on the measurement point. Bland-Altman bias was −15.7 to 3.5 μm for LMMs and −3.0 to 1.9 μm for CASMs. For LMMs, the marginal-distal and marginal-buccal measurement points showed the lowest ICC (0.848/0.978) and the highest bias (-15.7 μm/-7.6 μm). With the intra-rater reliability and agreement (repeatability) for LMMs, the ICC was 0.970-0.998 and bias was −1.3 to 2.3 μm. Conclusion LMMs showed lower interrater reliability and agreement at the marginal measurement points than CASMs, which indicates a more subjective influence with LMMs at these measurement points. The values, however, were still
Strength Measurements in Acute Hamstring Injuries: Intertester Reliability and Prognostic Value of Handheld Dynamometry.

PubMed

Reurink, Gustaaf; Goudswaard, Gert Jan; Moen, Maarten H; Tol, Johannes L; Verhaar, Jan A N; Weir, Adam

2016-08-01

Study Design Cohort study, repeated measures. Background Although hamstring strength measurements are used for assessing prognosis and monitoring recovery after hamstring injury, their actual clinical relevance has not been established. Handheld dynamometry (HHD) is a commonly used method of measuring muscle strength. The reliability of HHD has not been determined in athletes with acute hamstring injuries. Objectives To determine the intertester reliability and the prognostic value of hamstring HHD strength measurement in acute hamstring injuries. Methods We measured knee flexion strength with HHD in 75 athletes at 2 visits, at baseline (within 5 days of hamstring injury) and follow-up (5 to 7 days after the baseline measurement). We assessed isometric hamstring strength in 15° and 90° of knee flexion. Reliability analysis testing was performed by 2 testers independently at the follow-up visit. We recorded the time needed to return to play (RTP) up to 6 months following baseline. Results The intraclass correlation coefficients of the strength measurements in injured hamstrings were between 0.75 and 0.83. There was a statistically significant but weak correlation between the time to RTP and the strength deficit at 15° of knee flexion measured at baseline (Spearman r = 0.25, P = .045) and at the follow-up visit (Spearman r = 0.26, P = .034). Up to 7% of the variance in time to RTP is explained by this strength deficit. None of the other strength variables were significantly correlated with time to RTP. Conclusion Hamstring strength can be reliably measured with HHD in athletes with acute hamstring injuries. The prognostic value of strength measurements is limited, as there is only a weak association between the time to RTP and hamstring strength deficit after acute injury. Level of Evidence Prognosis, level 4. J Orthop Sports Phys Ther 2016;46(8):689-696. Epub 12 May 2016. doi:10.2519/jospt.2016.6363.
Reliability of 3-Dimensional Measures of Single-Leg Cross Drop Landing Across 3 Different Institutions

PubMed Central

DiCesare, Christopher A.; Bates, Nathaniel A.; Barber Foss, Kim D.; Thomas, Staci M.; Wordeman, Samuel C.; Sugimoto, Dai; Roewer, Benjamin D.; Medina McKeon, Jennifer M.; Di Stasi, Stephanie; Noehren, Brian W.; Ford, Kevin R.; Kiefer, Adam W.; Hewett, Timothy E.; Myer, Gregory D.

2015-01-01

Background: Anterior cruciate ligament (ACL) injuries are physically and financially devastating but affect a relatively small percentage of the population. Prospective identification of risk factors for ACL injury necessitates a large sample size; therefore, study of this injury would benefit from a multicenter approach. Purpose: To determine the reliability of kinematic and kinetic measures of a single-leg cross drop task across 3 institutions. Study Design: Controlled laboratory study. Methods: Twenty-five female high school volleyball players participated in this study. Three-dimensional motion data of each participant performing the single-leg cross drop were collected at 3 institutions over a period of 4 weeks. Coefficients of multiple correlation were calculated to assess the reliability of kinematic and kinetic measures during the landing phase of the movement. Results: Between-centers reliability for kinematic waveforms in the frontal and sagittal planes was good, but moderate in the transverse plane. Between-centers reliability for kinetic waveforms was good in the sagittal, frontal, and transverse planes. Conclusion: Based on these findings, the single-leg cross drop task has moderate to good reliability of kinematic and kinetic measures across institutions after implementation of a standardized testing protocol. Clinical Relevance: Multicenter collaborations can increase study numbers and generalize results, which is beneficial for studies of relatively rare phenomena, such as ACL injury. An important step is to determine the reliability of risk assessments across institutions before a multicenter collaboration can be initiated. PMID:26779550
Region of Interest Correction Factors Improve Reliability of Diffusion Imaging Measures Within and Across Scanners and Field Strengths

PubMed Central

Venkatraman, Vijay K; Gonzalez, Christopher E.; Landman, Bennett; Goh, Joshua; Reiter, David A.; An, Yang; Resnick, Susan M.

2017-01-01

Diffusion tensor imaging (DTI) measures are commonly used as imaging markers to investigate individual differences in relation to behavioral and health-related characteristics. However, the ability to detect reliable associations in cross-sectional or longitudinal studies is limited by the reliability of the diffusion measures. Several studies have examined reliability of diffusion measures within (i.e. intra-site) and across (i.e. inter-site) scanners with mixed results. Our study compares the test-retest reliability of diffusion measures within and across scanners and field strengths in cognitively normal older adults with a follow-up interval less than 2.25 years. Intra-class correlation (ICC) and coefficient of variation (CoV) of fractional anisotropy (FA) and mean diffusivity (MD) were evaluated in sixteen white matter and twenty-six gray matter bilateral regions. The ICC for intra-site reliability (0.32 to 0.96 for FA and 0.18 to 0.95 for MD in white matter regions; 0.27 to 0.89 for MD and 0.03 to 0.79 for FA in gray matter regions) and inter-site reliability (0.28 to 0.95 for FA in white matter regions, 0.02 to 0.86 for MD in gray matter regions) with longer follow-up intervals were similar to earlier studies using shorter follow-up intervals. The reliability of across field strengths comparisons was lower than intra- and inter-site reliability. Within and across scanner comparisons showed that diffusion measures were more stable in larger white matter regions (> 1500 mm3). For gray matter regions, the MD measure showed stability in specific regions and was not dependent on region size. Linear correction factor estimated from cross-sectional or longitudinal data improved the reliability across field strengths. Our findings indicate that investigations relating diffusion measures to external variables must consider variable reliability across the distinct regions of interest and that correction factors can be used to improve consistency of measurement across
Reliability of BOD POD Measurements Remains High After a Short-Duration Low-Carbohydrate Diet.

PubMed

Greer, Beau Kjerulf; Edsall, Kathleen M; Greer, Anna E

2016-04-01

The purpose of the current study was to determine whether expected changes in body weight via a 3-day low-carbohydrate (LC) diet will disrupt the reliability of air displacement plethysmography measurements via BOD POD. Twenty-four subjects recorded their typical diets for 3 days before BOD POD and 7-site skinfold analyses. Subjects were matched for lean body mass and divided into low-CHO (LC) and control (CON) groups. The LC group was given instruction intended to prevent more than 50 grams/day of carbohydrate consumption for 3 consecutive days, and the CON group replicated their previously recorded diet. Body composition measurements were repeated after dietary intervention. Test-retest reliability measures were significant (p < .01) and high for body fat percentage in both the LC and the CON groups (rs = .993 and .965, respectively). Likewise, skinfold analysis for body fat percentage reliability was high in both groups (rs = .996 and .997, respectively). There were significant differences between 1st and 2nd BOD POD measurements for body mass (72.9 ± 13.3 vs. 72.1 ± 13.0 kg [M ± SD]) and body volume (69.0 ± 12.7-68.1 ± 12.2 L) in the LC group (p < .05). However, there were no differences (p > .05) in BOD POD-determined body fat percentage, lean body mass, or fat mass between the 1st and 2nd trial in either group. Body composition measures via BOD POD and 7-site skinfolds remain reliable after 3 days of an LC diet despite significant decreases in body mass.
Reliability and validity of the Pragmatics Observational Measure (POM): a new observational measure of pragmatic language for children.

PubMed

Cordier, Reinie; Munro, Natalie; Wilkes-Gillan, Sarah; Speyer, Renée; Pearce, Wendy M

2014-07-01

There is a need for a reliable and valid assessment of childhood pragmatic language skills during peer-peer interactions. This study aimed to evaluate the psychometric properties of a newly developed pragmatic assessment, the Pragmatic Observational Measure (POM). The psychometric properties of the POM were investigated from observational data of two studies - study 1 involved 342 children aged 5-11 years (108 children with ADHD; 108 typically developing playmates; 126 children in the control group), and study 2 involved 9 children with ADHD who attended a 7-week play-based intervention. The psychometric properties of the POM were determined based on the COnsensus-based Standards for the selection of health status Measurement INstruments (COSMIN) taxonomy of psychometric properties and definitions for health-related outcomes; the Pragmatic Protocol was used as the reference tool against which the POM was evaluated. The POM demonstrated sound psychometric properties in all the reliability, validity and interpretability criteria against which it was assessed. The findings showed that the POM is a reliable and valid measure of pragmatic language skills of children with ADHD between the age of 5 and 11 years and has clinical utility in identifying children with pragmatic language difficulty. Copyright © 2014 The Authors. Published by Elsevier Ltd.. All rights reserved.
TEPC measurements obtained on the Mir space station.

PubMed

Bottollier-Depois, J F; Siegrist, M; Petrov, V M; Shurshakov, V V; Bengin, V; Koslova, S B

2002-10-01

Measurements of the radiation environment inside the Mir space station were performed with a tissue equivalent proportional counter (TEPC) during the Antares mission in 1992, and over a long period following it. Interesting results concerning radiation measurements show (a) the South Atlantic Anomaly crossing, (b) the increase of radiation near the poles, and (c) the effects of solar particle events (the most important one occurring in early November 1992). This data also provides information about the dose and the quality factor of the radiation to which the cosmonauts were exposed during different missions. These data are compared with measurements obtained using a solid state detector. c2002 Elsevier Science Ltd. All rights reserved.
Is the MARS questionnaire a reliable measure of medication adherence in childhood asthma?

PubMed

Garcia-Marcos, Patricia W; Brand, Paul L P; Kaptein, Adrian A; Klok, Ted

2016-12-01

To assess the reliability of the Medication Adherence Report Scale (MARS-5) for assessing adherence in clinical practice and research. Prospective cohort study following electronically measured inhaled corticosteroids (ICS) adherence for 1 year in 2-13-year-old children with persistent asthma. The relationship between electronically measured adherence and MARS-5 scores (ranging from 5 to 25) was assessed by Spearman's rank correlation coefficient. A ROC (receiver operating characteristic) curve was performed testing MARS-5 against electronically measured adherence. Sensitivity, specificity, positive and negative likelihood ratios of the closest MARS-5 cut-off values to the top left-hand corner of the ROC curve were calculated. High MARS scores were obtained (median 24, interquartile range 22-24). Despite a statistically significant correlation between MARS-5 and electronically assessed adherence (Spearman's rho = 0.47; p < 0.0001), there was considerable variation of adherence rates at every MARS-5 score. The area under the ROC curve was 0.7188. A MARS-5 score ≥23 had the best predictive ability for electronically assessed adherence, but positive and negative likelihood ratios were too small to be useful (1.65 and 0.27, respectively). Self-report using MARS-5 is too inaccurate to be a useful measure of adherence in children with asthma, both in clinical practice and in research.
Reliability of a single objective measure in assessing sleepiness.

PubMed

Sunwoo, Bernie Y; Jackson, Nicholas; Maislin, Greg; Gurubhagavatula, Indira; George, Charles F; Pack, Allan I

2012-01-01

To evaluate reliability of single objective tests in assessing sleepiness. Subjects who completed polysomnography underwent a 4-nap multiple sleep latency test (MSLT) the following day. Prior to each nap opportunity on MSLT, subjects performed the psychomotor vigilance test (PVT) and divided attention driving task (DADT). Results of single versus multiple test administrations were compared using the intraclass correlation coefficient (ICC) and adjusted for test administration order effects to explore time of day effects. Measures were explored as continuous and binary (i.e., impaired or not impaired). Community-based sample evaluated at a tertiary, university-based sleep center. 372 adult commercial vehicle operators oversampled for increased obstructive sleep apnea risk. N/A. AS CONTINUOUS MEASURES, ICC WERE AS FOLLOWS: MSLT 0.45, PVT median response time 0.69, PVT number of lapses 0.51, 10-min DADT tracking error 0.87, 20-min DADT tracking error 0.90. Based on binary outcomes, ICC were: MSLT 0.63, PVT number of lapses 0.85, 10-min DADT 0.95, 20-min DADT 0.96. Statistically significant time of day effects were seen in both the MSLT and PVT but not the DADT. Correlation between ESS and different objective tests was strongest for MSLT, range [-0.270 to -0.195] and persisted across all time points. Single DADT and PVT administrations are reliable measures of sleepiness. A single MSLT administration can reasonably discriminate individuals with MSL < 8 minutes. These results support the use of a single administration of some objective tests of sleepiness when performed under controlled conditions in routine clinical care.
Intra-rater reliability and agreement of various methods of measurement to assess dorsiflexion in the Weight Bearing Dorsiflexion Lunge Test (WBLT) among female athletes.

PubMed

Langarika-Rocafort, Argia; Emparanza, José Ignacio; Aramendi, José F; Castellano, Julen; Calleja-González, Julio

2017-01-01

To examine the intra-observer reliability and agreement between five methods of measurement for dorsiflexion during Weight Bearing Dorsiflexion Lunge Test and to assess the degree of agreement between three methods in female athletes. Repeated measurements study design. Volleyball club. Twenty-five volleyball players. Dorsiflexion was evaluated using five methods: heel-wall distance, first toe-wall distance, inclinometer at tibia, inclinometer at Achilles tendon and the dorsiflexion angle obtained by a simple trigonometric function. For the statistical analysis, agreement was studied using the Bland-Altman method, the Standard Error of Measurement and the Minimum Detectable Change. Reliability analysis was performed using the Intraclass Correlation Coefficient. Measurement methods using the inclinometer had more than 6° of measurement error. The angle calculated by trigonometric function had 3.28° error. The reliability of inclinometer based methods had ICC values < 0.90. Distance based methods and trigonometric angle measurement had an ICC values > 0.90. Concerning the agreement between methods, there was from 1.93° to 14.42° bias, and from 4.24° to 7.96° random error. To assess DF angle in WBLT, the angle calculated by a trigonometric function is the most repeatable method. The methods of measurement cannot be used interchangeably. Copyright © 2016 Elsevier Ltd. All rights reserved.
A systematic review of statistical methods used to test for reliability of medical instruments measuring continuous variables.

PubMed

Zaki, Rafdzah; Bulgiba, Awang; Nordin, Noorhaire; Azina Ismail, Noor

2013-06-01

Reliability measures precision or the extent to which test results can be replicated. This is the first ever systematic review to identify statistical methods used to measure reliability of equipment measuring continuous variables. This studyalso aims to highlight the inappropriate statistical method used in the reliability analysis and its implication in the medical practice. In 2010, five electronic databases were searched between 2007 and 2009 to look for reliability studies. A total of 5,795 titles were initially identified. Only 282 titles were potentially related, and finally 42 fitted the inclusion criteria. The Intra-class Correlation Coefficient (ICC) is the most popular method with 25 (60%) studies having used this method followed by the comparing means (8 or 19%). Out of 25 studies using the ICC, only 7 (28%) reported the confidence intervals and types of ICC used. Most studies (71%) also tested the agreement of instruments. This study finds that the Intra-class Correlation Coefficient is the most popular method used to assess the reliability of medical instruments measuring continuous outcomes. There are also inappropriate applications and interpretations of statistical methods in some studies. It is important for medical researchers to be aware of this issue, and be able to correctly perform analysis in reliability studies.
Inter-rater reliability of kinesthetic measurements with the KINARM robotic exoskeleton.

PubMed

Semrau, Jennifer A; Herter, Troy M; Scott, Stephen H; Dukelow, Sean P

2017-05-22

Kinesthesia (sense of limb movement) has been extremely difficult to measure objectively, especially in individuals who have survived a stroke. The development of valid and reliable measurements for proprioception is important to developing a better understanding of proprioceptive impairments after stroke and their impact on the ability to perform daily activities. We recently developed a robotic task to evaluate kinesthetic deficits after stroke and found that the majority (~60%) of stroke survivors exhibit significant deficits in kinesthesia within the first 10 days post-stroke. Here we aim to determine the inter-rater reliability of this robotic kinesthetic matching task. Twenty-five neurologically intact control subjects and 15 individuals with first-time stroke were evaluated on a robotic kinesthetic matching task (KIN). Subjects sat in a robotic exoskeleton with their arms supported against gravity. In the KIN task, the robot moved the subjects' stroke-affected arm at a preset speed, direction and distance. As soon as subjects felt the robot begin to move their affected arm, they matched the robot movement with the unaffected arm. Subjects were tested in two sessions on the KIN task: initial session and then a second session (within an average of 18.2 ± 13.8 h of the initial session for stroke subjects), which were supervised by different technicians. The task was performed both with and without the use of vision in both sessions. We evaluated intra-class correlations of spatial and temporal parameters derived from the KIN task to determine the reliability of the robotic task. We evaluated 8 spatial and temporal parameters that quantify kinesthetic behavior. We found that the parameters exhibited moderate to high intra-class correlations between the initial and retest conditions (Range, r-value = [0.53-0.97]). The robotic KIN task exhibited good inter-rater reliability. This validates the KIN task as a reliable, objective method for quantifying
Reliability and validity of quantifying absolute muscle hardness using ultrasound elastography.

PubMed

Chino, Kentaro; Akagi, Ryota; Dohi, Michiko; Fukashiro, Senshi; Takahashi, Hideyuki

2012-01-01

Muscle hardness is a mechanical property that represents transverse muscle stiffness. A quantitative method that uses ultrasound elastography for quantifying absolute human muscle hardness has been previously devised; however, its reliability and validity have not been completely verified. This study aimed to verify the reliability and validity of this quantitative method. The Young's moduli of seven tissue-mimicking materials (in vitro; Young's modulus range, 20-80 kPa; increments of 10 kPa) and the human medial gastrocnemius muscle (in vivo) were quantified using ultrasound elastography. On the basis of the strain/Young's modulus ratio of two reference materials, one hard and one soft (Young's moduli of 7 and 30 kPa, respectively), the Young's moduli of the tissue-mimicking materials and medial gastrocnemius muscle were calculated. The intra- and inter-investigator reliability of the method was confirmed on the basis of acceptably low coefficient of variations (≤6.9%) and substantially high intraclass correlation coefficients (≥0.77) obtained from all measurements. The correlation coefficient between the Young's moduli of the tissue-mimicking materials obtained using a mechanical method and ultrasound elastography was 0.996, which was equivalent to values previously obtained using magnetic resonance elastography. The Young's moduli of the medial gastrocnemius muscle obtained using ultrasound elastography were within the range of values previously obtained using magnetic resonance elastography. The reliability and validity of the quantitative method for measuring absolute muscle hardness using ultrasound elastography were thus verified.
ImageJ: A Free, Easy, and Reliable Method to Measure Leg Ulcers Using Digital Pictures.

PubMed

Aragón-Sánchez, Javier; Quintana-Marrero, Yurena; Aragón-Hernández, Cristina; Hernández-Herero, María José

2017-12-01

Wound measurement to document the healing course of chronic leg ulcers has an important role in the management of these patients. Digital cameras in smartphones are readily available and easy to use, and taking pictures of wounds is becoming a routine in specialized departments. Analyzing digital pictures with appropriate software provides clinicians a quick, clean, and easy-to-use tool for measuring wound area. A set of 25 digital pictures of plain foot and leg ulcers was the basis of this study. Photographs were taken placing a ruler next to the wound in parallel with the healthy skin with the iPhone 6S (Apple Inc, Cupertino, CA), which has a camera of 12 megapixels using the flash. The digital photographs were visualized with ImageJ 1.45s freeware (National Institutes of Health, Rockville, MD; http://imagej.net/ImageJ ). Wound area measurement was carried out by 4 raters: head of the department, wound care nurse, physician, and medical student. We assessed intra- and interrater reliability using the interclass correlation coefficient. To determine intraobserver reliability, 2 of the raters repeated the measurement of the set 1 week after the first reading. The interrater model displayed an interclass correlation coefficient of 0.99 with 95% confidence interval of 0.999 to 1.000, showing excellent reliability. The intrarater model of both examiners showed excellent reliability. In conclusion, analyzing digital images of leg ulcers with ImageJ estimates wound area with excellent reliability. This method provides a free, rapid, and accurate way to measure wounds and could routinely be used to document wound healing in daily clinical practice.
The Design, Development, and Reliability Testing of a New Innovative Device to Measure Ankle Joint Dorsiflexion.

PubMed

Charles, James

2016-09-02

In clinical and research settings, ankle joint dorsiflexion needs to be reliably measured. Dorsiflexion is often measured by goniometry, but the intrarater and interrater reliability of this technique have been reported to be poor. Many devices to measure dorsiflexion have been developed for clinical and research use. An evaluation of 12 current tools showed that none met all of the desirable criteria. The purpose of this study was to design and develop a device that rates highly in all of the criteria and that can be proved to be highly reliable. While supine on a treatment table, 14 participants had a foot placed in the Charles device and ankle joint dorsiflexion measured and recorded three times with a digital inclinometer. The mean of the three readings was determined to be the ankle joint dorsiflexion. The analysis used was intraclass correlation coefficient (ICC). There was very little difference in ICC single or average measures between left and right feet, so data were pooled (N = 28). The single-measure ICC was 0.998 (95% confidence interval, 0.996-0.998). The average-measure ICC was 0.998 (95% confidence interval, 0.995-0.999). Limits of agreement for the average measure were also very good: -1.30° to 1.65°. The Charles device meets all of the desirable criteria and has many innovative features, increasing its appropriateness for clinical and research applications. It has a suitable design for measuring dorsiflexion and high intrarater and interrater reliability.
Reproducibility of the measurement of central corneal thickness in healthy subjects obtained with the optical low coherence reflectometry pachymeter and comparison with the ultrasonic pachymetry.

PubMed

Garza-Leon, Manuel; Plancarte-Lozano, Eduardo; Valle-Penella, Agustín Del; Guzmán-Martínez, María de Lourdes; Villarreal-González, Andrés

2018-01-01

Corneal pachymetry is widely used for refractive surgery and follow up in keratoconus, accurate measurement is essential for a safe surgery. To assess intraobserver reliability of central corneal thickness (CCT) measurements using optical low-coherence reflectometry (OLCR) technology and its agreement with ultrasonic pachymeter (US). Randomized and prospective comparative evaluation of diagnostic technology. One randomly healthy eye of subjects was scanned three times with both devices. Intraobserver within-subject standard deviation (Sw), coefficient of variation (CVw) and intraclass correlation coefficient (ICC) were obtained for reliability analysis; for study agreement, data were analyzed using the paired-sample t test and the Bland-Altman LoA method. The mean of three scans of each equipment was used to assess the LoA. The study enrolled 30 eyes of 30 subjects with average age of 28.70 ± 8.06 years. For repeatability, the Sw were 3.41 and 5.96 µ, the intraobserver CVw was 2 and 4% and ICC 0.991 and 0.988, for OLCR and US respectively. The mean CCT difference between OLCR and US was 8.90 ± 9.03 µ (95% confidence interval: 5.52-2.27 µ), and the LoA was 35.40 µ. OLCR technology provided reliable intraobserver CCT measurements. Both pachymetry measurements may be used interchangeably with minimum calibration adjustment. Copyright: © 2018 Permanyer.

Reliability of pelvic floor measurements on three- and four-dimensional ultrasound during and after first pregnancy: implications for training.

PubMed

van Veelen, G A; Schweitzer, K J; van der Vaart, C H

2013-11-01

To evaluate the reliability of measurements of the levator hiatus and levator-urethra gap (LUG) using three/four-dimensional (3D/4D) transperineal ultrasound in women during their first pregnancy and 6 months postpartum, and to assess the learning process for these measurements. An inexperienced observer was taught to perform measurements of the levator hiatus and LUG by an experienced observer. After training, 3D/4D ultrasound volume datasets of 40 women in the first trimester were analyzed by these two observers. Another training session then took place and both observers repeated the analyses of the same volume datasets. Finally, analyses of 40 volume datasets of the women 6 months postpartum were performed by both observers. Intra- and interobserver reliability were determined by intraclass correlation coefficients (ICC) with 95% CIs. For levator hiatal measurements, in the women during their first pregnancy the interobserver reliability was substantial to almost perfect after both the first and second training session (ICC, 0.62-0.83 and 0.71-0.89, respectively, for anteroposterior diameter, transverse diameter and area at rest, on contraction and on Valsalva) and the intraobserver reliability was substantial to almost perfect for both observers. For these measurements performed once the women had delivered, interobserver reliability was moderate to almost perfect. For LUG measurements performed during pregnancy, interobserver reliability was slight to moderate after the first training session (ICC, 0.14-0.54), but improved after the second training session (ICC, 0.38-0.71), and intraobserver reliability was moderate to substantial for the experienced observer and slight to moderate for the inexperienced observer. For these measurements performed when the women had delivered, interobserver reliability was fair to moderate. The levator hiatus and LUG can be measured reliably using 3D/4D ultrasound in primigravid and primiparous women. The technique to measure
Reliability of Leg and Vertical Stiffness During High Speed Treadmill Running.

PubMed

Pappas, Panagiotis; Dallas, Giorgos; Paradisis, Giorgos

2017-04-01

In research, the accurate and reliable measurement of leg and vertical stiffness could contribute to valid interpretations. The current study aimed at determining the intraparticipant variability (ie, intraday and interday reliabilities) of leg and vertical stiffness, as well as related parameters, during high speed treadmill running, using the "sine-wave" method. Thirty-one males ran on a treadmill at 6.67 m∙s -1 , and the contact and flight times were measured. To determine the intraday reliability, three 10-s running bouts with 10-min recovery were performed. In addition, to examine the interday reliability, three 10-s running bouts on 3 separate days with 48-h interbout intervals were performed. The reliability statistics included repeated-measure analysis of variance, average intertrial correlations, intraclass correlation coefficients (ICCs), Cronbach's α reliability coefficient, and the coefficient of variation (CV%). Both intraday and interday reliabilities were high for leg and vertical stiffness (ICC > 0.939 and CV < 4.3%), as well as related variables (ICC > 0.934 and CV < 3.9%). It was thus inferred that the measurements of leg and vertical stiffness, as well as the related parameters obtained using the "sine-wave" method during treadmill running at 6.67 m∙s -1 , were highly reliable, both within and across days.
Reliability and responsiveness of the gross motor function measure-88 in children with cerebral palsy.

PubMed

Ko, Jooyeon; Kim, MinYoung

2013-03-01

The Gross Motor Function Measure (GMFM-88) is commonly used in the evaluation of gross motor function in children with cerebral palsy (CP). The relative reliability of GMFM-88 has been assessed in children with CP. However, little information is available regarding the absolute reliability or responsiveness of GMFM-88. The purpose of this study was to determine the absolute and relative reliability and the responsiveness of the GMFM-88 in evaluating gross motor function in children with CP. A clinical measurement design was used. Ten raters scored the GMFM-88 in 84 children (mean age=3.7 years, SD=1.9, range=10 months to 9 years 9 months) from video records across all Gross Motor Function Classification System (GMFCS) levels to establish interrater reliability. Two raters participated to assess intrarater reliability. Responsiveness was determined from 3 additional assessments after the baseline assessment. The interrater and intrarater intraclass correlation coefficients (ICCs) with 95% confidence intervals, standard error of measurement (SEM), smallest real difference (SRD), effect size (ES), and standardized response mean (SRM) were calculated. The relative reliability of the GMFM was excellent (ICCs=.952-1.000). The SEM and SRD for total score of the GMFM were acceptable (1.60 and 3.14, respectively). Additionally, the ES and SRM of the dimension goal scores increased gradually in the 3 follow-up assessments (GMFCS levels I and II: ES=0.5, 0.6, and 0.8 and SRM=1.3, 1.8, and 2.0; GMFCS levels III-V: ES=0.4, 0.7, and 0.9 and SRM=1.5, 1.7, and 2.0). Children over 10 years of age with CP were not included in this study, so the results should not be generalized to all children with CP. Both the reliability and the responsiveness of the GMFM-88 are reasonable for measuring gross motor function in children with CP.
Cyber Physical Systems for User Reliability Measurements in a Sharing Economy Environment

PubMed Central

Seo, Aria; Kim, Yeichang

2017-01-01

As the sharing economic market grows, the number of users is also increasing but many problems arise in terms of reliability between providers and users in the processing of services. The existing methods provide shared economic systems that judge the reliability of the provider from the viewpoint of the user. In this paper, we have developed a system for establishing mutual trust between providers and users in a shared economic environment to solve existing problems. In order to implement a system that can measure and control users’ situation in a shared economic environment, we analyzed the necessary factors in a cyber physical system (CPS). In addition, a user measurement system based on a CPS structure in a sharing economic environment is implemented through analysis of the factors to consider when constructing a CPS. PMID:28805709
Cyber Physical Systems for User Reliability Measurements in a Sharing Economy Environment.

PubMed

Seo, Aria; Jeong, Junho; Kim, Yeichang

2017-08-13

As the sharing economic market grows, the number of users is also increasing but many problems arise in terms of reliability between providers and users in the processing of services. The existing methods provide shared economic systems that judge the reliability of the provider from the viewpoint of the user. In this paper, we have developed a system for establishing mutual trust between providers and users in a shared economic environment to solve existing problems. In order to implement a system that can measure and control users' situation in a shared economic environment, we analyzed the necessary factors in a cyber physical system (CPS). In addition, a user measurement system based on a CPS structure in a sharing economic environment is implemented through analysis of the factors to consider when constructing a CPS.
Measurement of Latent Variables with Different Rating Scales: Testing Reliability and Measurement Equivalence by Varying the Verbalization and Number of Categories

ERIC Educational Resources Information Center

Menold, Natalja; Tausch, Anja

2016-01-01

Effects of rating scale forms on cross-sectional reliability and measurement equivalence were investigated. A randomized experimental design was implemented, varying category labels and number of categories. The participants were 800 students at two German universities. In contrast to previous research, reliability assessment method was used,…
Reliability and Validity of the Turkish Version of the Voice-Related Quality of Life Measure.

PubMed

Tezcaner, Zahide Çiler; Aksoy, Songül

2017-03-01

This study aims to test the validity and reliability of the Turkish version of the Voice-Related Quality of Life (V-RQOL) questionnaire. This is a nonrandomized, prospective study with control group. The questionnaire was administered to 249 individuals-130 with vocal complaint and 119 without-with a mean age of 37.8 ± 12.3 years. The Turkish version of the Voice Handicap Index (VHI) and perceptual voice evaluation measures were also administered at 2-14 days for retest reliability. The instrument was submitted to validity and reliability evaluation. The V-RQOL measure showed a strong internal consistency and test-retest reliability; the Cronbach's alpha coefficient for the overall V-RQOL was 0.969, the physical functioning domain was 0.949, and the social-emotional domain was 0.940. In the test-retest reliability test, the overall V-RQOL was found to be 0.989. The construct validity of the V-RQOL was determined based on the strength and direction of its relation to the VHI and the perceptual voice evaluation measure. The higher the VHI level, the lower the physical functioning, social-emotional, and overall score levels of the V-RQOL (r = -0.927, r = -0.912, r = -0.944, respectively; P < 0.001). Following the perceptual voice self-assessment, a statistically significant difference was found between the V-RQOL scores of individuals who defined their voices as good, very good, and perfect, and those who defined their voices as bad and very bad (P < 0.001). The results suggest that the Turkish version of the V-RQOL measure has reliability and validity and may play a crucial role in evaluating Turkish-speaking patients with voice disorders. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Pneumothorax size measurements on digital chest radiographs: Intra- and inter- rater reliability.

PubMed

Thelle, Andreas; Gjerdevik, Miriam; Grydeland, Thomas; Skorge, Trude D; Wentzel-Larsen, Tore; Bakke, Per S

2015-10-01

Detailed and reliable methods may be important for discussions on the importance of pneumothorax size in clinical decision-making. Rhea's method is widely used to estimate pneumothorax size in percent based on chest X-rays (CXRs) from three measure points. Choi's addendum is used for anterioposterior projections. The aim of this study was to examine the intrarater and interrater reliability of the Rhea and Choi method using digital CXR in the ward based PACS monitors. Three physicians examined a retrospective series of 80 digital CXRs showing pneumothorax, using Rhea and Choi's method, then repeated in a random order two weeks later. We used the analysis of variance technique by Eliasziw et al. to assess the intrarater and interrater reliability in altogether 480 estimations of pneumothorax size. Estimated pneumothorax sizes ranged between 5% and 100%. The intrarater reliability coefficient was 0.98 (95% one-sided lower-limit confidence interval C 0.96), and the interrater reliability coefficient was 0.95 (95% one-sided lower-limit confidence interval 0.93). This study has shown that the Rhea and Choi method for calculating pneumothorax size has high intrarater and interrater reliability. These results are valid across gender, side of pneumothorax and whether the patient is diagnosed with primary or secondary pneumothorax. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Evaluating test-retest reliability in patient-reported outcome measures for older people: A systematic review.

PubMed

Park, Myung Sook; Kang, Kyung Ja; Jang, Sun Joo; Lee, Joo Yun; Chang, Sun Ju

2018-03-01

This study aimed to evaluate the components of test-retest reliability including time interval, sample size, and statistical methods used in patient-reported outcome measures in older people and to provide suggestions on the methodology for calculating test-retest reliability for patient-reported outcomes in older people. This was a systematic literature review. MEDLINE, Embase, CINAHL, and PsycINFO were searched from January 1, 2000 to August 10, 2017 by an information specialist. This systematic review was guided by both the Preferred Reporting Items for Systematic Reviews and Meta-Analyses checklist and the guideline for systematic review published by the National Evidence-based Healthcare Collaborating Agency in Korea. The methodological quality was assessed by the Consensus-based Standards for the selection of health Measurement Instruments checklist box B. Ninety-five out of 12,641 studies were selected for the analysis. The median time interval for test-retest reliability was 14days, and the ratio of sample size for test-retest reliability to the number of items in each measure ranged from 1:1 to 1:4. The most frequently used statistical methods for continuous scores was intraclass correlation coefficients (ICCs). Among the 63 studies that used ICCs, 21 studies presented models for ICC calculations and 30 studies reported 95% confidence intervals of the ICCs. Additional analyses using 17 studies that reported a strong ICC (>0.09) showed that the mean time interval was 12.88days and the mean ratio of the number of items to sample size was 1:5.37. When researchers plan to assess the test-retest reliability of patient-reported outcome measures for older people, they need to consider an adequate time interval of approximately 13days and the sample size of about 5 times the number of items. Particularly, statistical methods should not only be selected based on the types of scores of the patient-reported outcome measures, but should also be described clearly in
Reliability of the ecSatter Inventory as a tool to measure eating competence.

PubMed

Stotts, Jodi L; Lohse, Barbara

2007-01-01

To examine the reliability of the ecSatter Inventory (ecSI), a measure of eating competence. Self-report questionnaires were administered in person or by mail. Retesting occurred 2 to 6 weeks after completion of the first questionnaire. Both administrations of the questionnaire were completed by 259 participants who were mostly food secure, white females with some college education; mean age was 26.9 +/- 10.4 years. Test-retest reliability and internal consistency. Spearman's rank correlation coefficients to estimate test-retest reliability and Cronbach alpha coefficients to estimate internal consistency. Spearman's rank correlation coefficient for ecSI total score was 0.68; subscale coefficients were 0.70 for eating attitudes, 0.70 for contextual skills, 0.65 for food acceptance, and 0.52 for internal regulation. Cronbach alpha coefficient for ecSI total score was 0.77. Subscale alphas coefficients were 0.80 for eating attitudes, 0.69 for contextual skills, 0.68 for food acceptance, and 0.66 for internal regulation. This study provides psychometric evidence about the reliability of ecSI as a measure of eating competence in this sample. Although some ecSI items may require revision, results suggest that the instrument may be used to evaluate nutrition education designed to improve eating competence.
Measurement of glenohumeral joint translation using real-time ultrasound imaging: A physiotherapist and sonographer intra-rater and inter-rater reliability study.

PubMed

Rathi, Sangeeta; Taylor, Nicholas F; Gee, Jamie; Green, Rodney A

2016-12-01

Ultrasonography is an economical and non-invasive method for measuring real-time joint movements. Although physiotherapists are increasingly using ultrasound imaging for rotator cuff disorders, there is a lack of evidence on their reliability in using ultrasonography to measure glenohumeral translation. The aim of this study was to evaluate the reliability of a physiotherapist in measuring anterior and posterior glenohumeral joint translation with ultrasound. Study design: within day reliability. Anterior and posterior glenohumeral translations were measured at rest, in response to passive accessory motion testing force, and with isometric internal and external rotation in 12 young healthy adults. All the measurements were made in real time by a physiotherapist and an experienced sonographer in two positions (neutral and abducted) and in two views (anterior and posterior). Intra-rater and inter-rater reliability were expressed using intraclass correlation coefficients (ICC) and measurement error (mm). Intra-rater reliability was good for both raters (ICC P : 0.86-0.98; ICC S : 0.85-0.96). The inter-rater reliability between the physiotherapist and sonographer was moderate to good for posterior measurements (ICC 0.50-0.75) and poor to moderate for anterior measurements (ICC 0.31-0.53). For both intra-rater and inter-rater measurements, posterior translation was more reliable than the anterior translation with smaller measurement errors (posterior: 0.1-0.2 mm, anterior: 0.2-0.3 mm). A physiotherapist with minimal training was reliable in measuring glenohumeral joint translations. The ultrasound method was reliable for repeated measurement of both anterior and posterior glenohumeral translations with posterior measurements being more reliable than anterior. This method is recommended for future research to investigate the stabilising role of rotator cuff muscles. Copyright © 2016 Elsevier Ltd. All rights reserved.
Reliability Measurement for Mixed Mode Failures of 33/11 Kilovolt Electric Power Distribution Stations

PubMed Central

Alwan, Faris M.; Baharum, Adam; Hassan, Geehan S.

2013-01-01

The reliability of the electrical distribution system is a contemporary research field due to diverse applications of electricity in everyday life and diverse industries. However a few research papers exist in literature. This paper proposes a methodology for assessing the reliability of 33/11 Kilovolt high-power stations based on average time between failures. The objective of this paper is to find the optimal fit for the failure data via time between failures. We determine the parameter estimation for all components of the station. We also estimate the reliability value of each component and the reliability value of the system as a whole. The best fitting distribution for the time between failures is a three parameter Dagum distribution with a scale parameter and shape parameters and . Our analysis reveals that the reliability value decreased by 38.2% in each 30 days. We believe that the current paper is the first to address this issue and its analysis. Thus, the results obtained in this research reflect its originality. We also suggest the practicality of using these results for power systems for both the maintenance of power systems models and preventive maintenance models. PMID:23936346
[Study of the reliability in one dimensional size measurement with digital slit lamp microscope].

PubMed

Wang, Tao; Qi, Chaoxiu; Li, Qigen; Dong, Lijie; Yang, Jiezheng

2010-11-01

To study the reliability of digital slit lamp microscope as a tool for quantitative analysis in one dimensional size measurement. Three single-blinded observers acquired and repeatedly measured the images with a size of 4.00 mm and 10.00 mm on the vernier caliper, which simulatated the human eye pupil and cornea diameter under China-made digital slit lamp microscope in the objective magnification of 4 times, 10 times, 16 times, 25 times, 40 times and 4 times, 10 times, 16 times, respectively. The correctness and precision of measurement were compared. The images with 4 mm size were measured by three investigators and the average values were located between 3.98 to 4.06. For the images with 10.00 mm size, the average values fell within 10.00 ~ 10.04. Measurement results of 4.00 mm images showed, except A4, B25, C16 and C25, significant difference was noted between the measured value and the true value. Regarding measurement results of 10.00 mm iamges indicated, except A10, statistical significance was found between the measured value and the true value. In terms of comparing the results of the same size measured at different magnifications by the same investigator, except for investigators A's measurements of 10.00 mm dimension, the measurement results by all the remaining investigators presented statistical significance at different magnifications. Compared measurements of the same size with different magnifications, measurements of 4.00 mm in 4-fold magnification had no significant difference among the investigators', the remaining results were statistically significant. The coefficient of variation of all measurement results were less than 5%; as magnification increased, the coefficient of variation decreased. The measurement of digital slit lamp microscope in one-dimensional size has good reliability,and should be performed for reliability analysis before used for quantitative analysis to reduce systematic errors.
Camera-based measurement of respiratory rates is reliable.

PubMed

Becker, Christoph; Achermann, Stefan; Rocque, Mukul; Kirenko, Ihor; Schlack, Andreas; Dreher-Hummel, Thomas; Zumbrunn, Thomas; Bingisser, Roland; Nickel, Christian H

2017-06-01

Respiratory rate (RR) is one of the most important vital signs used to detect whether a patient is in critical condition. It is part of many risk scores and its measurement is essential for triage of patients in emergency departments. It is often not recorded as measurement is cumbersome and time-consuming. We intended to evaluate the accuracy of camera-based measurements as an alternative measurement to the current practice of manual counting. We monitored the RR of healthy male volunteers with a camera-based prototype application and simultaneously by manual counting and by capnography, which was considered the gold standard. The four assessors were mutually blinded. We simulated normoventilation, hypoventilation and hyperventilation as well as deep, normal and superficial breathing depths to assess potential clinical settings. The volunteers were assessed while being undressed, wearing a T-shirt or a winter coat. In total, 20 volunteers were included. The results of camera-based measurements of RRs and capnography were in close agreement throughout all clothing styles and respiratory patterns (Pearson's correlation coefficient, r=0.90-1.00, except for one scenario, in which the volunteer breathed slowly dressed in a winter coat r=0.84). In the winter-coat scenarios, the camera-based prototype application was superior to human counters. In our pilot study, we found that camera-based measurements delivered accurate and reliable results. Future studies need to show that camera-based measurements are a secure alternative for measuring RRs in clinical settings as well.
Measurement of Harm Outcomes in Older Adults after Hospital Discharge: Reliability and Validity

PubMed Central

Douglas, Alison; Letts, Lori; Eva, Kevin; Richardson, Julie

2012-01-01

Objectives. Defining and validating a measure of safety contributes to further validation of clinical measures. The objective was to define and examine the psychometric properties of the outcome “incidents of harm.” Methods. The Incident of Harm Caregiver Questionnaire was administered to caregivers of older adults discharged from hospital by telephone. Caregivers completed daily logs for one month and medical charts were examined. Results. Test-retest reliability (n = 38) was high for the occurrence of an incident of harm (yes/no; kappa = 1.0) and the type of incident (agreement = 100%). Validation against daily logs found no disagreement regarding occurrence or types of incidents. Validation with medical charts found no disagreement regarding incident occurrence and disagreement in half regarding incident type. Discussion. The data support the Incident of Harm Caregiver Questionnaire as a reliable and valid estimation of incidents for this sample and are important to researchers as a method to measure safety when validating clinical measures. PMID:22649728
Application of Thermo-Mechanical Measurements of Plastic Packages for Reliability Evaluation of PEMS

NASA Technical Reports Server (NTRS)

Sharma, Ashok K.; Teverovsky, Alexander

2004-01-01

Thermo-mechanical analysis (TMA) is typically employed for measurements of the glass transition temperature (Tg) and coefficients of thermal expansion (CTE) in molding compounds used in plastic encapsulated microcircuits (PEMs). Application of TMA measurements directly to PEMs allows anomalies to be revealed in deformation of packages with temperature, and thus indicates possible reliability concerns related to thermo-mechanical integrity and stability of the devices. In this work, temperature dependencies of package deformation were measured in several types of PEMs that failed environmental stress testing including temperature cycling, highly accelerated stress testing (HAST) in humid environments, and bum-in (BI) testing. Comparison of thermo-mechanical characteristics of packages and molding compounds in the failed parts allowed for explanation of the observed failures. The results indicate that TMA of plastic packages might be used for quality evaluation of PEMs intended for high-reliability applications.
Linguistic validation and reliability properties are weak investigated of most dementia-specific quality of life measurements-a systematic review.

PubMed

Dichter, Martin Nikolaus; Schwab, Christian G G; Meyer, Gabriele; Bartholomeyczik, Sabine; Halek, Margareta

2016-02-01

For people with dementia, the concept of quality of life (Qol) reflects the disease's impact on the whole person. Thus, Qol is an increasingly used outcome measure in dementia research. This systematic review was performed to identify available dementia-specific Qol measurements and to assess the quality of linguistic validations and reliability studies of these measurements (PROSPERO 2013: CRD42014008725). The MEDLINE, CINAHL, EMBASE, PsycINFO, and Cochrane Methodology Register databases were systematically searched without any date restrictions. Forward and backward citation tracking were performed on the basis of selected articles. A total of 70 articles addressing 19 dementia-specific Qol measurements were identified; nine measurements were adapted to nonorigin countries. The quality of the linguistic validations varied from insufficient to good. Internal consistency was the most frequently tested reliability property. Most of the reliability studies lacked internal validity. Qol measurements for dementia are insufficiently linguistic validated and not well tested for reliability. None of the identified measurements can be recommended without further research. The application of international guidelines and quality criteria is strongly recommended for the performance of linguistic validations and reliability studies of dementia-specific Qol measurements. Copyright © 2016 Elsevier Inc. All rights reserved.
A systematic review of reliable and valid tools for the measurement of patient participation in healthcare.

PubMed

Phillips, Nicole Margaret; Street, Maryann; Haesler, Emily

2016-02-01

Patient participation in healthcare is recognised internationally as essential for consumer-centric, high-quality healthcare delivery. Its measurement as part of continuous quality improvement requires development of agreed standards and measurable indicators. This systematic review sought to identify strategies to measure patient participation in healthcare and to report their reliability and validity. In the context of this review, patient participation was constructed as shared decision-making, acknowledging the patient as having critical knowledge regarding their own health and care needs and promoting self-care/autonomy. Following a comprehensive search, studies reporting reliability or validity of an instrument used in a healthcare setting to measure patient participation, published in English between January 2004 and March 2014 were eligible for inclusion. From an initial search, which identified 1582 studies, 156 studies were retrieved and screened against inclusion criteria. Thirty-three studies reporting 24 patient participation measurement tools met inclusion criteria, and were critically appraised. The majority of studies were descriptive psychometric studies using prospective, cross-sectional designs. Almost all the tools completed by patients, family caregivers, observers or more than one stakeholder focused on aspects of patient-professional communication. Few tools designed for completion by patients or family caregivers provided valid and reliable measures of patient participation. There was low correlation between many of the tools and other measures of patient satisfaction. Few reliable and valid tools for measurement of patient participation in healthcare have been recently developed. Of those reported in this review, the dyadic Observing Patient Involvement in Decision Making (dyadic-OPTION) tool presents the most promise for measuring core components of patient participation. There remains a need for further study into valid, reliable and
The intra- and inter-assessor reliability of measurement of functional outcome by lameness scoring in horses.

PubMed

Fuller, Catherine J; Bladon, Bruce M; Driver, Adam J; Barr, Alistair R S

2006-03-01

The objective of this study was to assess the reliability of lameness scoring in horses. One veterinary surgeon examined nineteen lame horses on four occasions. Gait was recorded by camcorder, and scored from 0 to 10 ranging from sound to non-weight bearing lameness. A global score of overall change in lameness during the study was also determined for each horse. To measure intra-assessor reliability of the scoring systems, one veterinary surgeon scored videotapes of the horses' gaits on two occasions. To measure inter-assessor reliability, three veterinary surgeons viewed the videotapes, assigning individual lameness scores plus global scores to each horse. Reliability of individual lameness scoring was good intra-assessor, but only just within our acceptable limit inter-assessor. However, global scoring of change in lameness throughout the study was found to be reliable overall. Since clinician scoring is commonly used to assess lameness in horses, this is an important finding, fundamental to future clinical studies.
Beyond alpha: an empirical examination of the effects of different sources of measurement error on reliability estimates for measures of individual differences constructs.

PubMed

Schmidt, Frank L; Le, Huy; Ilies, Remus

2003-06-01

On the basis of an empirical study of measures of constructs from the cognitive domain, the personality domain, and the domain of affective traits, the authors of this study examine the implications of transient measurement error for the measurement of frequently studied individual differences variables. The authors clarify relevant reliability concepts as they relate to transient error and present a procedure for estimating the coefficient of equivalence and stability (L. J. Cronbach, 1947), the only classical reliability coefficient that assesses all 3 major sources of measurement error (random response, transient, and specific factor errors). The authors conclude that transient error exists in all 3 trait domains and is especially large in the domain of affective traits. Their findings indicate that the nearly universal use of the coefficient of equivalence (Cronbach's alpha; L. J. Cronbach, 1951), which fails to assess transient error, leads to overestimates of reliability and undercorrections for biases due to measurement error.

How to obtain traceability on optical radiation measurements?

NASA Astrophysics Data System (ADS)

Matamoros García, Carlos H.

2006-02-01

Traceability to national standards provides confidence in measurements results, granting a guaranty when carrying out governmental rules and when demonstrating conformity with quality requirements such as ISO 9000 or ISO/IEC 17025 (and the Mexican equivalent standards). The appropriate traceability contributes with confidence of the quality of products or services. This paper presents different ways to obtain traceability in Mexico for the optical radiation measurements, mentioning some applications, and highlighting the necessity of having traceability to the appropriate units of the SI. Additionally it present the national standards maintained by Centro Nacional de Metrologia (CENAM), the national metrology institute in Mexico, that give the technical support to Mexican measurements in this field and the international recognition that the personal of the Optics and Radiometry Division had gained in 10 years of development.
Reliability of the Inverse Water Volumetry Method to Measure the Volume of the Upper Limb.

PubMed

Beek, Martinus A; te Slaa, Alexander; van der Laan, Lijckle; Mulder, Paul G H; Rutten, Harm J T; Voogd, Adri C; Luiten, Ernest J T; Gobardhan, Paul D

2015-06-01

Lymphedema of the upper extremity is a common side effect of lymph node dissection or irradiation of the axilla. Several techniques are being applied in order to examine the presence and severity of lymphedema. Measurement of circumference of the upper extremity is most frequently performed. An alternative is the water-displacement method. The aim of this study was to determine the reliability and the reproducibility of the "Inverse Water Volumetry apparatus" (IWV-apparatus) for the measurement of arm volumes. The IWV-apparatus is based on the water-displacement method. Measurements were performed by three breast cancer nurse practitioners on ten healthy volunteers in three weekly sessions. The intra-class correlation coefficient, defined as the ratio of the subject component to the total variance, equaled 0.99. The reliability index is calculated as 0.14 kg. This indicates that only changes in a patient's arm volume measurement of more than 0.14 kg would represent a true change in arm volume, which is about 6% of the mean arm volume of 2.3 kg. The IWV-apparatus proved to be a reliable and reproducible method to measure arm volume.
Measurement methods to assess diastasis of the rectus abdominis muscle (DRAM): A systematic review of their measurement properties and meta-analytic reliability generalisation.

PubMed

van de Water, A T M; Benjamin, D R

2016-02-01

Systematic literature review. Diastasis of the rectus abdominis muscle (DRAM) has been linked with low back pain, abdominal and pelvic dysfunction. Measurement is used to either screen or to monitor DRAM width. Determining which methods are suitable for screening and monitoring DRAM is of clinical value. To identify the best methods to screen for DRAM presence and monitor DRAM width. AMED, Embase, Medline, PubMed and CINAHL databases were searched for measurement property studies of DRAM measurement methods. Population characteristics, measurement methods/procedures and measurement information were extracted from included studies. Quality of all studies was evaluated using 'quality rating criteria'. When possible, reliability generalisation was conducted to provide combined reliability estimations. Thirteen studies evaluated measurement properties of the 'finger width'-method, tape measure, calipers, ultrasound, CT and MRI. Ultrasound was most evaluated. Methodological quality of these studies varied widely. Pearson's correlations of r = 0.66-0.79 were found between calipers and ultrasound measurements. Calipers and ultrasound had Intraclass Correlation Coefficients (ICC) of 0.78-0.97 for test-retest, inter- and intra-rater reliability. The 'finger width'-method had weighted Kappa's of 0.73-0.77 for test-retest reliability, but moderate agreement (63%; weighted Kappa = 0.53) between raters. Comparing calipers and ultrasound, low measurement error was found (above the umbilicus), and the methods had good agreement (83%; weighted Kappa = 0.66) for discriminative purposes. The available information support ultrasound and calipers as adequate methods to assess DRAM. For other methods limited measurement information of low to moderate quality is available and further evaluation of their measurement properties is required. Copyright © 2015 Elsevier Ltd. All rights reserved.
Reliability and reproducibility analysis of the Cobb angle and assessing sagittal plane by computer-assisted and manual measurement tools.

PubMed

Wu, Weifei; Liang, Jie; Du, Yuanli; Tan, Xiaoyi; Xiang, Xuanping; Wang, Wanhong; Ru, Neng; Le, Jinbo

2014-02-06

Although many studies on reliability and reproducibility of measurement have been performed on coronal Cobb angle, few results about reliability and reproducibility are reported on sagittal alignment measurement including the pelvis. We usually use SurgimapSpine software to measure the Cobb angle in our studies; however, there are no reports till date on its reliability and reproducible measurements. Sixty-eight standard standing posteroanterior whole-spine radiographs were reviewed. Three examiners carried out the measurements independently under the settings of manual measurement on X-ray radiographies and SurgimapSpine software on the computer. Parameters measured included pelvic incidence, sacral slope, pelvic tilt, Lumbar lordosis (LL), thoracic kyphosis, and coronal Cobb angle. SPSS 16.0 software was used for statistical analyses. The means, standard deviations, intraclass and interclass correlation coefficient (ICC), and 95% confidence intervals (CI) were calculated. There was no notable difference between the two tools (P = 0.21) for the coronal Cobb angle. In the sagittal plane parameters, the ICC of intraobserver reliability for the manual measures varied from 0.65 (T2-T5 angle) to 0.95 (LL angle). Further, for SurgimapSpine tool, the ICC ranged from 0.75 to 0.98. No significant difference in intraobserver reliability was found between the two measurements (P > 0.05). As for the interobserver reliability, measurements with SurgimapSpine tool had better ICC (0.71 to 0.98 vs 0.59 to 0.96) and Pearson's coefficient (0.76 to 0.99 vs 0.60 to 0.97). The reliability of SurgimapSpine measures was significantly higher in all parameters except for the coronal Cobb angle where the difference was not significant (P > 0.05). Although the differences between the two methods are very small, the results of this study indicate that the SurgimapSpine measurement is an equivalent measuring tool to the traditional manual in coronal Cobb angle, but is advantageous in spino
Reliability and reproducibility analysis of the Cobb angle and assessing sagittal plane by computer-assisted and manual measurement tools

PubMed Central

2014-01-01

Background Although many studies on reliability and reproducibility of measurement have been performed on coronal Cobb angle, few results about reliability and reproducibility are reported on sagittal alignment measurement including the pelvis. We usually use SurgimapSpine software to measure the Cobb angle in our studies; however, there are no reports till date on its reliability and reproducible measurements. Methods Sixty-eight standard standing posteroanterior whole-spine radiographs were reviewed. Three examiners carried out the measurements independently under the settings of manual measurement on X-ray radiographies and SurgimapSpine software on the computer. Parameters measured included pelvic incidence, sacral slope, pelvic tilt, Lumbar lordosis (LL), thoracic kyphosis, and coronal Cobb angle. SPSS 16.0 software was used for statistical analyses. The means, standard deviations, intraclass and interclass correlation coefficient (ICC), and 95% confidence intervals (CI) were calculated. Results There was no notable difference between the two tools (P = 0.21) for the coronal Cobb angle. In the sagittal plane parameters, the ICC of intraobserver reliability for the manual measures varied from 0.65 (T2–T5 angle) to 0.95 (LL angle). Further, for SurgimapSpine tool, the ICC ranged from 0.75 to 0.98. No significant difference in intraobserver reliability was found between the two measurements (P > 0.05). As for the interobserver reliability, measurements with SurgimapSpine tool had better ICC (0.71 to 0.98 vs 0.59 to 0.96) and Pearson’s coefficient (0.76 to 0.99 vs 0.60 to 0.97). The reliability of SurgimapSpine measures was significantly higher in all parameters except for the coronal Cobb angle where the difference was not significant (P > 0.05). Conclusion Although the differences between the two methods are very small, the results of this study indicate that the SurgimapSpine measurement is an equivalent measuring tool to the traditional manual
Reliability of Corneal Dynamic Scheimpflug Analyser Measurements in Virgin and Post-PRK Eyes

PubMed Central

Chen, Xiangjun; Stojanovic, Aleksandar; Hua, Yanjun; Eidet, Jon Roger; Hu, Di; Wang, Jingting; Utheim, Tor Paaske

2014-01-01

Purpose To determine the measurement reliability of CorVis ST, a dynamic Scheimpflug analyser, in virgin and post-photorefractive keratectomy (PRK) eyes and compare the results between these two groups. Methods Forty virgin eyes and 42 post-PRK eyes underwent CorVis ST measurements performed by two technicians. Repeatability was evaluated by comparing three consecutive measurements by technician A. Reproducibility was determined by comparing the first measurement by technician A with one performed by technician B. Intraobserver and interobserver intraclass correlation coefficients (ICCs) were calculated. Univariate analysis of covariance (ANCOVA) was used to compare measured parameters between virgin and post-PRK eyes. Results The intraocular pressure (IOP), central corneal thickness (CCT) and 1st applanation time demonstrated good intraobserver repeatability and interobserver reproducibility (ICC≧0.90) in virgin and post-PRK eyes. The deformation amplitude showed a good or close to good repeatability and reproducibility in both groups (ICC≧0.88). The CCT correlated positively with 1st applanation time (r = 0.437 and 0.483, respectively, p<0.05) and negatively with deformation amplitude (r = −0.384 and −0.375, respectively, p<0.05) in both groups. Compared to post-PRK eyes, virgin eyes showed longer 1st applanation time (7.29±0.21 vs. 6.96±0.17 ms, p<0.05) and lower deformation amplitude (1.06±0.07 vs. 1.17±0.08 mm, p<0.05). Conclusions CorVis ST demonstrated reliable measurements for CCT, IOP, and 1st applanation time, as well as relatively reliable measurement for deformation amplitude in both virgin and post-PRK eyes. There were differences in 1st applanation time and deformation amplitude between virgin and post-PRK eyes, which may reflect corneal biomechanical changes occurring after the surgery in the latter. PMID:25302580
Reliability of corneal dynamic scheimpflug analyser measurements in virgin and post-PRK eyes.

PubMed

Chen, Xiangjun; Stojanovic, Aleksandar; Hua, Yanjun; Eidet, Jon Roger; Hu, Di; Wang, Jingting; Utheim, Tor Paaske

2014-01-01

To determine the measurement reliability of CorVis ST, a dynamic Scheimpflug analyser, in virgin and post-photorefractive keratectomy (PRK) eyes and compare the results between these two groups. Forty virgin eyes and 42 post-PRK eyes underwent CorVis ST measurements performed by two technicians. Repeatability was evaluated by comparing three consecutive measurements by technician A. Reproducibility was determined by comparing the first measurement by technician A with one performed by technician B. Intraobserver and interobserver intraclass correlation coefficients (ICCs) were calculated. Univariate analysis of covariance (ANCOVA) was used to compare measured parameters between virgin and post-PRK eyes. The intraocular pressure (IOP), central corneal thickness (CCT) and 1st applanation time demonstrated good intraobserver repeatability and interobserver reproducibility (ICC ≧ 0.90) in virgin and post-PRK eyes. The deformation amplitude showed a good or close to good repeatability and reproducibility in both groups (ICC ≧ 0.88). The CCT correlated positively with 1st applanation time (r = 0.437 and 0.483, respectively, p<0.05) and negatively with deformation amplitude (r = -0.384 and -0.375, respectively, p<0.05) in both groups. Compared to post-PRK eyes, virgin eyes showed longer 1st applanation time (7.29 ± 0.21 vs. 6.96 ± 0.17 ms, p<0.05) and lower deformation amplitude (1.06 ± 0.07 vs. 1.17 ± 0.08 mm, p < 0.05). CorVis ST demonstrated reliable measurements for CCT, IOP, and 1st applanation time, as well as relatively reliable measurement for deformation amplitude in both virgin and post-PRK eyes. There were differences in 1st applanation time and deformation amplitude between virgin and post-PRK eyes, which may reflect corneal biomechanical changes occurring after the surgery in the latter.
Intra-rater Reliability of Arm and Hand Muscle Strength Measurements in Persons With Late Effects of Polio.

PubMed

Brogårdh, Christina; Flansbjer, Ulla-Britt; Carlsson, Håkan; Lexell, Jan

2015-10-01

Muscle weakness in the upper limb is common in persons with late effects of polio. To be able to measure muscle strength and follow changes over time, reliable measurements are needed. To evaluate the intra-rater reliability of isometric and isokinetic arm and hand muscle strength measurements in persons with late effects of polio. A test-retest design. A university hospital outpatient clinic. Twenty-eight persons (mean age 68 years, SD 11 years) with late effects of polio in their upper limbs. Isometric shoulder abduction, isokinetic concentric elbow flexion and extension, isometric elbow flexion, and isometric grip strength were measured twice, 14 days apart. Reliability was evaluated with the intra-class correlation coefficient, the mean difference between the test sessions (d¯), together with the 95% confidence intervals for d¯ , the standard error of measurement (SEM and SEM%), the smallest real difference (SRD and SRD%), and Bland-Altman graphs. A fixed dynamometer (Biodex) was used to measure arm strength and an electronic dynamometer (GRIP-it) was used to measure grip strength. Intra-rater reliability was high, with intra-class correlation coefficients between 0.87 and 0.98. The SEM%, representing the smallest change for a group of persons, ranged from 7%-24% for all strength measurements, and the SRD%, representing the smallest change for an individual person, ranged from 20%-67%. Muscle strength in the upper limbs can be reliably measured in persons with late effects of polio. However, the measurement errors indicate that the method is more suitable to detect changes in muscle strength for a group of persons than for an individual person. Copyright © 2015 American Academy of Physical Medicine and Rehabilitation. Published by Elsevier Inc. All rights reserved.
Reliability and Validity of Quantifying Absolute Muscle Hardness Using Ultrasound Elastography

PubMed Central

Chino, Kentaro; Akagi, Ryota; Dohi, Michiko; Fukashiro, Senshi; Takahashi, Hideyuki

2012-01-01

Muscle hardness is a mechanical property that represents transverse muscle stiffness. A quantitative method that uses ultrasound elastography for quantifying absolute human muscle hardness has been previously devised; however, its reliability and validity have not been completely verified. This study aimed to verify the reliability and validity of this quantitative method. The Young’s moduli of seven tissue-mimicking materials (in vitro; Young’s modulus range, 20–80 kPa; increments of 10 kPa) and the human medial gastrocnemius muscle (in vivo) were quantified using ultrasound elastography. On the basis of the strain/Young’s modulus ratio of two reference materials, one hard and one soft (Young’s moduli of 7 and 30 kPa, respectively), the Young’s moduli of the tissue-mimicking materials and medial gastrocnemius muscle were calculated. The intra- and inter-investigator reliability of the method was confirmed on the basis of acceptably low coefficient of variations (≤6.9%) and substantially high intraclass correlation coefficients (≥0.77) obtained from all measurements. The correlation coefficient between the Young’s moduli of the tissue-mimicking materials obtained using a mechanical method and ultrasound elastography was 0.996, which was equivalent to values previously obtained using magnetic resonance elastography. The Young’s moduli of the medial gastrocnemius muscle obtained using ultrasound elastography were within the range of values previously obtained using magnetic resonance elastography. The reliability and validity of the quantitative method for measuring absolute muscle hardness using ultrasound elastography were thus verified. PMID:23029231
Intra- and inter-tester reliability and validity of normal finger size measurement using the Japanese ring gauge system.

PubMed

Suzuki, T; Sato, Y; Sotome, S; Arai, H; Arai, A; Yoshida, H

2017-06-01

This study was designed to investigate the reliability and validity of measurements of finger diameters with a ring gauge. A reliability study enrolled two independent samples (50 participants and seven examiners in Study I; 26 participants and 26 examiners in Study II). The sizes of each participant's little fingers were measured twice with a ring gauge by each examiner. To investigate the validity of the measurements, five hand therapists compared the finger size and hand volume of 30 participants with the ring gauge and with a figure-of-eight technique (Study III). The intra-class correlation coefficient for intra-observer reliability ranged from 0.97 to 0.99 in Study I, and 0.90 to 0.97 in Study II. The intra-class correlation coefficient for inter-observer reliability was 0.95 in Study I and 0.94 in Study II. The validity study showed a Pearson product moment correlation coefficient of 0.75. The ring gauge showed high reliability and validity for measurement of finger size. III, diagnostic.
Quantifying Digital Ulcers in Systemic Sclerosis: Reliability of Computer-Assisted Planimetry in Measuring Lesion Size.

PubMed

Simpson, V; Hughes, M; Wilkinson, J; Herrick, A L; Dinsdale, G

2018-03-01

Digital ulcers are a major problem in patients with systemic sclerosis (SSc), causing severe pain and impairment of hand function. In addition, digital ulcers heal slowly and sometimes become infected, which can lead to gangrene and necessitate amputation if appropriate intervention is not taken. A reliable, objective method for assessing digital ulcer healing or progression is needed in both the clinical and research arenas. This study was undertaken to compare 2 computer-assisted planimetry methods of measurement of digital ulcer area on photographs (ellipse and freehand regions of interest [ROIs]), and to assess the reliability of photographic calibration and the 2 methods of area measurement. Photographs were taken of 107 digital ulcers in 36 patients with SSc spectrum disease. Three raters assessed the photographs. Custom software allowed raters to calibrate photograph dimensions and draw ellipse or freehand ROIs. The shapes and dimensions of the ROIs were saved for further analysis. Calibration (by a single rater performing 5 repeats per image) produced an intraclass correlation coefficient (intrarater reliability) of 0.99. The mean ± SD areas of digital ulcers assessed using ellipse and freehand ROIs were 18.7 ± 20.2 mm 2 and 17.6 ± 19.3 mm 2 , respectively. Intrarater and interrater reliability of the ellipse ROI were 0.97 and 0.77, respectively. For the freehand ROI, the intrarater and interrater reliability were 0.98 and 0.76, respectively. Our findings indicate that computer-assisted planimetry methods applied to SSc-related digital ulcers can be extremely reliable. Further work is needed to move toward applying these methods as outcome measures for clinical trials and in clinical settings. © 2017, American College of Rheumatology.
Psychometric Properties of Performance-based Measurements of Functional Capacity: Test-Retest Reliability, Practice Effects, and Potential Sensitivity to Change

PubMed Central

Leifker, Feea R.; Patterson, Thomas L.; Bowie, Christopher R.; Mausbach, Brent T.; Harvey, Philip D.

2010-01-01

Performance-based measures of the ability to perform social and everyday living skills are being more widely used to assess functional capacity in people with serious mental illnesses such as schizophrenia and bipolar disorder. Since they are also being used as outcome measures in pharmacological and cognitive remediation studies aimed at cognitive impairments in schizophrenia, understanding their measurement properties and potential sensitivity to change is important. In this study, the test-retest reliability, practice effects, and reliable change indices of two different performance-based functional capacity measures, the UCSD Performance-based skills assessment (UPSA) and Social skills performance assessment (SSPA) were examined over several different retest intervals in two different samples of people with schizophrenia (n’s=238 and 116) and a healthy comparison sample (n=109). These psychometric properties were compared to those of a neuropsychological assessment battery. Test-retest reliabilities of the long form of the UPSA ranged from r=.63 to r=.80 over follow-up periods up to 36 months in people with schizophrenia, while brief UPSA reliabilities ranged from r=.66 to r=.81. Test-retest reliability of the NP performance scores ranged from r=.77 to r=.79. Test-retest reliabilities of the UPSA were lower in healthy controls, while NP performance was slightly more reliable. SSPA test-retest reliability was lower. Practice effect sizes ranged from .05 to .16 for the UPSA and .07 to .19 for the NP assessment in patients, with HC having more practice effects. Reliable change intervals were consistent across NP and both FC measures, indicating equal potential for detection of change. These performance-based measures of functional capacity appear to have similar potential to be sensitive to change compared to NP performance in people with schizophrenia. PMID:20399613
A Practical Solution to Optimizing the Reliability of Teaching Observation Measures under Budget Constraints

ERIC Educational Resources Information Center

Meyer, J. Patrick; Liu, Xiang; Mashburn, Andrew J.

2014-01-01

Researchers often use generalizability theory to estimate relative error variance and reliability in teaching observation measures. They also use it to plan future studies and design the best possible measurement procedures. However, designing the best possible measurement procedure comes at a cost, and researchers must stay within their budget…
Validity and Reliability of Criterion-Referenced Measures: Issues and Procedures for Special Educators.

ERIC Educational Resources Information Center

Harris, Larry P.; Wolf, Steven R.

1979-01-01

The article focuses on the controversy over norm-referenced v criterion-referenced measures (CRM) in assessment of learning disorders. The authors contend that while the reliability of CRMs is generally indisputable, the validity of measures designed from local curricula is still dependent on the intuitive judgments of teachers. (Author/SBH)
Obtaining soil hydraulic parameters from data assimilation under different climatic/soil conditions

USDA-ARS?s Scientific Manuscript database

Obtaining reliable soil hydraulic properties is essential to correctly simulating soil water content (SWC), which is a key component of countless applications such as agricultural management, soil remediation, aquifer protection, etc. Soil hydraulic properties can be measured in the laboratory; howe...
Test-retest reliability of a handheld dynamometer for measurement of isometric cervical muscle strength.

PubMed

Vannebo, Katrine Tranaas; Iversen, Vegard Moe; Fimland, Marius Steiro; Mork, Paul Jarle

2018-03-02

There is a lack of test-retest reliability studies of measurements of cervical muscle strength, taking into account gender and possible learning effects. To investigate test-retest reliability of measurement of maximal isometric cervical muscle strength by handheld dynamometry. Thirty women (age 20-58 years) and 28 men (age 20-60 years) participated in the study. Maximal isometric strength (neck flexion, neck extension, and right/left lateral flexion) was measured on three separate days at least five days apart by one evaluator. Intra-rater consistency tended to improve from day 1-2 measurements to day 2-3 measurements in both women and men. In women, the intra-class correlation coefficients (ICC) for day 2 to day 3 measurements were 0.91 (95% confidence interval [CI], 0.82-0.95) for neck flexion, 0.88 (95% CI, 0.76-0.94) for neck extension, 0.84 (95% CI, 0.68-0.92) for right lateral flexion, and 0.89 (95% CI, 0.78-0.95) for left lateral flexion. The corresponding ICCs among men were 0.86 (95% CI, 0.72-0.93) for neck flexion, 0.93 (95% CI, 0.85-0.97) for neck extension, 0.82 (95% CI, 0.65-0.91) for right lateral flexion and 0.73 (95% CI, 0.50-0.87) for left lateral flexion. This study describes a reliable and easy-to-administer test for assessing maximal isometric cervical muscle strength.
Intra-rater reliability of hallux flexor strength measures using the Nintendo Wii Balance Board.

PubMed

Quek, June; Treleaven, Julia; Brauer, Sandra G; O'Leary, Shaun; Clark, Ross A

2015-01-01

The purpose of this study was to investigate the intra-rater reliability of a new method in combination with the Nintendo Wii Balance Board (NWBB) to measure the strength of hallux flexor muscle. Thirty healthy individuals (age: 34.9 ± 12.9 years, height: 170.4 ± 10.5 cm, weight: 69.3 ± 15.3 kg, female = 15) participated. Repeated testing was completed within 7 days. Participants performed strength testing in sitting using a wooden platform in combination with the NWBB. This new method was set up to selectively recruit an intrinsic muscle of the foot, specifically the flexor hallucis brevis muscle. Statistical analysis was performed using intra-class coefficients and ordinary least product analysis. To estimate measurement error, standard error of measurement (SEM), minimal detectable change (MDC) and percentage error were calculated. Results indicate excellent intra-rater reliability (ICC = 0.982, CI = 0.96-0.99) with an absence of systematic bias. SEM, MDC and percentage error value were 0.5, 1.4 and 12 % respectively. This study demonstrates that a new method in combination with the NWBB application is reliable to measure hallux flexor strength and has potential to be used for future research and clinical application.
Sensitivity to Mental Effort and Test-Retest Reliability of Heart Rate Variability Measures in Healthy Seniors

PubMed Central

Mukherjee, Shalini; Yadav, Rajeev; Yung, Iris; Zajdel, Daniel P.; Oken, Barry S.

2011-01-01

Objectives To determine 1) whether heart rate variability (HRV) was a sensitive and reliable measure in mental effort tasks carried out by healthy seniors and 2) whether non-linear approaches to HRV analysis, in addition to traditional time and frequency domain approaches were useful to study such effects. Methods Forty healthy seniors performed two visual working memory tasks requiring different levels of mental effort, while ECG was recorded. They underwent the same tasks and recordings two weeks later. Traditional and 13 non-linear indices of HRV including Poincaré, entropy and detrended fluctuation analysis (DFA) were determined. Results Time domain (especially mean R-R interval/RRI), frequency domain and, among nonlinear parameters- Poincaré and DFA were the most reliable indices. Mean RRI, time domain and Poincaré were also the most sensitive to different mental effort task loads and had the largest effect size. Conclusions Overall, linear measures were the most sensitive and reliable indices to mental effort. In non-linear measures, Poincaré was the most reliable and sensitive, suggesting possible usefulness as an independent marker in cognitive function tasks in healthy seniors. Significance A large number of HRV parameters was both reliable as well as sensitive indices of mental effort, although the simple linear methods were the most sensitive. PMID:21459665
Quantitative measurement of hypertrophic scar: intrarater reliability, sensitivity, and specificity.

PubMed

Nedelec, Bernadette; Correa, José A; Rachelska, Grazyna; Armour, Alexis; LaSalle, Léo

2008-01-01

The comparison of scar evaluation over time requires measurement tools with acceptable intrarater reliability and the ability to discriminate skin characteristics of interest. The objective of this study was to evaluate the intrarater reliability and sensitivity and specificity of the Cutometer, the Mexameter, and the DermaScan C relative to the modified Vancouver Scar Scale (mVSS) in patient-matched normal skin, normal scar (donor sites), and hypertrophic scar (HSc). A single investigator evaluated four tissue types (severe HSc, less severe HSc, donor site, and normal skin) in 30 burn survivors with all four measurement tools. The intraclass correlation coefficient (ICC) for the Cutometer was acceptable (> or =0.75) for the maximum deformation measure for the donor site and normal skin (>0.78) but was below the acceptable range for the HSc sites and all other parameters. The ICC for the Mexameter erythema (>0.75) and melanin index (>0.89) and the DermaScan C total thickness measurement (>0.82) were acceptable for all sites. The ICC for the total of the height, pliability, and vascularity subscales of the mVSS was acceptable (0.81) for normal scar but below the acceptable range for the scar sites. The DermaScan C was clearly able to discriminate HSc from normal scar and normal skin based on the total thickness measure. The Cutometer was less discriminating but was still able to discriminate HSc from normal scar and normal skin. The Mexameter erythema index was not a good discriminator of HSc and normal scar. Receiver operating characteristic curves were generated to establish the best cutoff point for the DermaScan C total thickness and the Cutometer maximum deformation, which were 2.034 and 0.387 mm, respectively. This study showed that although the Cutometer, the DermaScan C, and the Mexameter have measurement properties that make them attractive substitutes for the mVSS, caution must be used when interpreting results since the Cutometer has a ceiling effect when
Health measurement using the ICF: Test-retest reliability study of ICF codes and qualifiers in geriatric care

PubMed Central

Okochi, Jiro; Utsunomiya, Sakiko; Takahashi, Tai

2005-01-01

Background The International Classification of Functioning, Disability and Health (ICF) was published by the World Health Organization (WHO) to standardize descriptions of health and disability. Little is known about the reliability and clinical relevance of measurements using the ICF and its qualifiers. This study examines the test-retest reliability of ICF codes, and the rate of immeasurability in long-term care settings of the elderly to evaluate the clinical applicability of the ICF and its qualifiers, and the ICF checklist. Methods Reliability of 85 body function (BF) items and 152 activity and participation (AP) items of the ICF was studied using a test-retest procedure with a sample of 742 elderly persons from 59 institutional and at home care service centers. Test-retest reliability was estimated using the weighted kappa statistic. The clinical relevance of the ICF was estimated by calculating immeasurability rate. The effect of the measurement settings and evaluators' experience was analyzed by stratification of these variables. The properties of each item were evaluated using both the kappa statistic and immeasurability rate to assess the clinical applicability of WHO's ICF checklist in the elderly care setting. Results The median of the weighted kappa statistics of 85 BF and 152 AP items were 0.46 and 0.55 respectively. The reproducibility statistics improved when the measurements were performed by experienced evaluators. Some chapters such as genitourinary and reproductive functions in the BF domain and major life area in the AP domain contained more items with lower test-retest reliability measures and rated as immeasurable than in the other chapters. Some items in the ICF checklist were rated as unreliable and immeasurable. Conclusion The reliability of the ICF codes when measured with the current ICF qualifiers is relatively low. The result in increase in reliability according to evaluators' experience suggests proper education will have positive

Reliability of perceived neighbourhood conditions and the effects of measurement error on self-rated health across urban and rural neighbourhoods.

PubMed

Pruitt, Sandi L; Jeffe, Donna B; Yan, Yan; Schootman, Mario

2012-04-01

Limited psychometric research has examined the reliability of self-reported measures of neighbourhood conditions, the effect of measurement error on associations between neighbourhood conditions and health, and potential differences in the reliabilities between neighbourhood strata (urban vs rural and low vs high poverty). We assessed overall and stratified reliability of self-reported perceived neighbourhood conditions using five scales (social and physical disorder, social control, social cohesion, fear) and four single items (multidimensional neighbouring). We also assessed measurement error-corrected associations of these conditions with self-rated health. Using random-digit dialling, 367 women without breast cancer (matched controls from a larger study) were interviewed twice, 2-3 weeks apart. Test-retest (intraclass correlation coefficients (ICC)/weighted κ) and internal consistency reliability (Cronbach's α) were assessed. Differences in reliability across neighbourhood strata were tested using bootstrap methods. Regression calibration corrected estimates for measurement error. All measures demonstrated satisfactory internal consistency (α ≥ 0.70) and either moderate (ICC/κ=0.41-0.60) or substantial (ICC/κ=0.61-0.80) test-retest reliability in the full sample. Internal consistency did not differ by neighbourhood strata. Test-retest reliability was significantly lower among rural (vs urban) residents for two scales (social control, physical disorder) and two multidimensional neighbouring items; test-retest reliability was higher for physical disorder and lower for one multidimensional neighbouring item among the high (vs low) poverty strata. After measurement error correction, the magnitude of associations between neighbourhood conditions and self-rated health were larger, particularly in the rural population. Research is needed to develop and test reliable measures of perceived neighbourhood conditions relevant to the health of rural populations.
Reliability of Maximal Strength Testing in Novice Weightlifters

NASA Technical Reports Server (NTRS)

Loehr, James A.; Lee, Stuart M. C.; Feiveson, Alan H.; Ploutz-Snyder, Lori L.

2009-01-01

The one repetition maximum (1RM) is a criterion measure of muscle strength. However, the reliability of 1RM testing in novice subjects has received little attention. Understanding this information is crucial to accurately interpret changes in muscle strength. To evaluate the test-retest reliability of a squat (SQ), heel raise (HR), and deadlift (DL) 1RM in novice subjects. Twenty healthy males (31 plus or minus 5 y, 179.1 plus or minus 6.1 cm, 81.4 plus or minus 10.6 kg) with no weight training experience in the previous six months participated in four 1RM testing sessions, with each session separated by 5-7 days. SQ and HR 1RM were conducted using a smith machine; DL 1RM was assessed using free weights. Session 1 was considered a familiarization and was not included in the statistical analyses. Repeated measures analysis of variance with Tukey fs post-hoc tests were used to detect between-session differences in 1RM (p.0.05). Test-retest reliability was evaluated by intraclass correlation coefficients (ICC). During Session 2, the SQ and DL 1RM (SQ: 90.2 }4.3, DL: 75.9 }3.3 kg) were less than Session 3 (SQ: 95.3 }4.1, DL: 81.5 plus or minus 3.5 kg) and Session 4 (SQ: 96.6 }4.0, DL: 82.4 }3.9 kg), but there were no differences between Session 3 and Session 4. HR 1RM measured during Session 2 (150.1 }3.7 kg) and Session 3 (152.5 }3.9 kg) were not different from one another, but both were less than Session 4 (157.5 }3.8 kg). The reliability (ICC) of 1RM measures for Sessions 2-4 were 0.88, 0.83, and 0.87, for SQ, HR, and DL, respectively. When considering only Sessions 3 and 4, the reliability was 0.93, 0.91, and 0.86 for SQ, HR, and DL, respectively. One familiarization session and 2 test sessions (for SQ and DL) were required to obtain excellent reliability (ICC greater than or equal to 0.90) in 1RM values with novice subjects. We were unable to attain this level of reliability following 3 HR testing sessions therefore additional sessions may be required to obtain an
Reliability and Validity of Computerized Force Platform Measures of Balance Function in Healthy Older Adults.

PubMed

Harro, Cathy C; Garascia, Chelsea

2018-01-10

Postural control declines with aging and is an independent risk factor for falls in older adults. Objective examination of balance function is warranted to direct fall prevention strategies. Force platform (FP) systems provide quantitative measures of postural control and analysis of different aspects of balance. The purpose of this study was to examine the reliability and validity of FP measures in healthy older adults. This study enrolled 46 healthy elderly adults, mean age 67.67 (5.1) years, who had no history of falls. They were assessed on 3 standardized tests on the NeuroCom Equitest FP system: limits of stability (LOS), motor control test (MCT), and sensory organization test (SOT). The test battery was administered twice within a 10-day period for test-retest reliability; intraclass correlation coefficients (ICCs), standard error of measurement (SEM), and minimal detectable change based on a 95% confidence interval (MDC95) were calculated. FP measures were compared with criterion clinical balance (Mini-BESTest and Functional Gait Assessment) and gait (10-m walk and 6-minute walk) measures to examine concurrent validity using Pearson correlation coefficients. Multiple linear regression analysis examined whether age and activity level were associated with FP performance. The α level was set at P < .05. SOT composite equilibrium scores, MCT average latency, and LOS end point excursion measures all demonstrated excellent test-retest reliability (ICC = 0.90, 0.85, and 0.77, respectively), whereas moderate to good reliability was found for SOT vestibular ratio score (ICC = 0.71). There was large variability in performance in this healthy elderly cohort, resulting in relatively large MDC95 for these measures, especially for the LOS test. Fair correlations were found between LOS end point excursion and clinical balance and gait measures (r = 0.31-0.49), and between MCT average latency and gait measures only (r = -0.32). No correlations were found between SOT
Assessment of the intraobserver and interobserver reliability of a communicating vessels volumeter to measure wrist-hand volume.

PubMed

de Carvalho, Rogério Mendonca; Perez, Maria Del Carmen Janerio; Miranda, Fausto

2012-10-01

Traditional volumetry based on Archimedes' principle is the gold standard for the measurement of limb volume, but the routine use of this technique is discouraged because of several disadvantages. The purpose of this study was to evaluate intraobserver and interobserver reliability of direct measurements of wrist-hand volume using a new communicating vessels volumeter based on Pascal's law. A reliability study was conducted. To evaluate the reliability of the communicating vessels volumeter in generating measurements, 30 hands of 15 participants (9 women, 6 men) were measured 3 times each by 3 observers, totaling 270 volumetric results. Measurement time was short (X =3 minutes 42 seconds). The intraclass correlation coefficient (ICC) was .9977 for observer 1 and .9976 for observers 2 and 3. The interobserver ICC was .9998. The standard error of measurement was about 3 mL for all observers; the interobserver result was 1 mL. The interrater coefficient of variance (CV) was 1.15% for the series of 9 measurements collected for each segment; the intrarater CV was 1.20%. Limitations No swollen hands were measured, and measurements were not compared with the gold standard technique. Thus, accuracy of the new volumeter was not determined in this study. A new device has been developed for plethysmography of the extremities, and the results of its use to measure the volume of the wrist-hand segment were reliable in both intraobserver and interobserver analyses.
Transient-evoked and distortion product otoacoustic emissions: A short-term test-retest reliability study.

PubMed

Keppler, Hannah; Dhooge, Ingeborg; Maes, Leen; D'haenens, Wendy; Bockstael, Annelies; Philips, Birgit; Swinnen, Freya; Vinck, Bart

2010-02-01

Knowledge regarding the variability of transient-evoked otoacoustic emissions (TEOAEs) and distortion product otoacoustic emissions (DPOAEs) is essential in clinical settings and improves their utility in monitoring hearing status over time. In the current study, TEOAEs and DPOAEs were measured with commercially available OAE-equipment in 56 normally-hearing ears during three sessions. Reliability was analysed for the retest measurement without probe-refitting, the immediate retest measurement with probe-refitting, and retest measurements after one hour and one week. The highest reliability was obtained in the retest measurement without probe-refitting, and decreased with increasing time-interval between measurements. For TEOAEs, the lowest reliability was seen at half-octave frequency bands 1.0 and 1.4 kHz; whereas for DPOAEs half-octave frequency band 8.0 kHz had also poor reliability. Higher primary tone level combination for DPOAEs yielded to a better reliability of DPOAE amplitudes. External environmental noise seemed to be the dominating noise source in normal-hearing subjects, decreasing the reliability of emission amplitudes especially in the low-frequency region.
Quantitative outcome measures for systemic sclerosis-related Microangiopathy - Reliability of image acquisition in Nailfold Capillaroscopy.

PubMed

Dinsdale, Graham; Moore, Tonia; O'Leary, Neil; Berks, Michael; Roberts, Christopher; Manning, Joanne; Allen, John; Anderson, Marina; Cutolo, Maurizio; Hesselstrand, Roger; Howell, Kevin; Pizzorni, Carmen; Smith, Vanessa; Sulli, Alberto; Wildt, Marie; Taylor, Christopher; Murray, Andrea; Herrick, Ariane L

2017-09-01

Nailfold capillaroscopic parameters hold increasing promise as outcome measures for clinical trials in systemic sclerosis (SSc). Their inclusion as outcomes would often naturally require capillaroscopy images to be captured at several time points during any one study. Our objective was to assess repeatability of image acquisition (which has been little studied), as well as of measurement. 41 patients (26 with SSc, 15 with primary Raynaud's phenomenon) and 10 healthy controls returned for repeat high-magnification (300×) videocapillaroscopy mosaic imaging of 10 digits one week after initial imaging (as part of a larger study of reliability). Images were assessed in a random order by an expert blinded observer and 4 outcome measures extracted: (1) overall image grade and then (where possible) distal vessel locations were marked, allowing (2) vessel density (across the whole nailfold) to be calculated (3) apex width measurement and (4) giant vessel count. Intra-rater, intra-visit and intra-rater inter-visit (baseline vs. 1week) reliability were examined in 475 and 392 images respectively. A linear, mixed-effects model was used to estimate variance components, from which intra-class correlation coefficients (ICCs) were determined. Intra-visit and inter-visit reliability estimates (ICCs) were (respectively): overall image grade, 0.97 and 0.90; vessel density, 0.92 and 0.65; mean vessel width, 0.91 and 0.79; presence of giant capillary, 0.68 and 0.56. These estimates were conditional on each parameter being measurable. Within-operator image analysis and acquisition are reproducible. Quantitative nailfold capillaroscopy, at least with a single observer, provides reliable outcome measures for clinical studies including randomised controlled trials. Copyright © 2017 Elsevier Inc. All rights reserved.
Reliability and construct validity of the Instrument to Measure the Impact of Valve Heart Disease on the Patient's Daily Life

PubMed Central

dos Anjos, Daniela Brianne Martins; Rodrigues, Roberta Cunha Matheus; Padilha, Kátia Melissa; Pedrosa, Rafaela Batista dos Santos; Gallani, Maria Cecília Bueno Jayme

2016-01-01

ABSTRACT Objective: evaluate the practicality, acceptability and the floor and ceiling effects, estimate the reliability and verify the convergent construct's validity with the instrument called the Heart Valve Disease Impact on daily life (IDCV) of the valve disease in patients with mitral and or aortic heart valve disease. Method: data was obtained from 86 heart valve disease patients through 3 phases: a face to face interview for a socio-demographic and clinic characterization and then other two done through phone calls of the interviewed patients for application of the instrument (test and repeat test). Results: as for the practicality and acceptability, the instrument was applied with an average time of 9,9 minutes and with 110% of responses, respectively. Ceiling and floor effects observed for all domains, especially floor effect. Reliability was tested using the test - repeating pattern to give evidence of temporal stability of the measurement. Significant negative correlations with moderate to strong magnitude were found between the score of the generic question about the impact of the disease and the scores of IDCV, which points to the validity of the instrument convergent construct. Conclusion: the instrument to measure the impact of valve heart disease on the patient's daily life showed evidence of reliability and validity when applied to patients with heart valve disease. PMID:27992024
The Development of a Practical and Reliable Assessment Measure for Atopic Dermatitis (ADAM).

ERIC Educational Resources Information Center

Charman, Denise; Varigos, George; Horne, David J. de L.; Oberklaid, Frank

1999-01-01

A study was conducted in Australia to develop a reliable, valid, and practical measure of atopic dermatitis. The test development process and validity evaluation with two doctors and 51 patients are discussed. Results suggest that operational definitions of the scales need to be defined more clearly. The measure satisfies assumptions for a partial…
Inter- and intra-rater reliability of calliper-based lymph node measurement in dogs with peripheral nodal lymphomas.

PubMed

Childress, M O; Fulkerson, C M; Lahrman, S A; Weng, H-Y

2016-08-01

The purpose of this study was to assess reliability of lymph node measurements between and within raters in dogs with nodal lymphomas. Three raters measured lymph nodes from 20 dogs twice prior to and once after administering chemotherapy. Sum tumour volume (TV) and sum longest diameter (LD) of all lymph nodes at each time point, and the percent change in measurements following chemotherapy, were calculated for each dog. Inter- and intra-rater reliability were assessed with the intraclass correlation coefficient (ICC). ICC for inter-rater sum TV and sum LD prior to chemotherapy were 0.86 and 0.80, respectively. ICC for inter-rater sum TV and sum LD after chemotherapy were 0.95 and 0.91, respectively. ICC for percent change in sum TV and sum LD were 0.96 and 0.94, respectively. ICC for intra-rater reliability ranged from 0.90 to 0.98 for each rater. Inter- and intra-rater reliability in measurements among the three raters was good to excellent. © 2014 John Wiley & Sons Ltd.
Reliability measurement for mixed mode failures of 33/11 kilovolt electric power distribution stations.

PubMed

Alwan, Faris M; Baharum, Adam; Hassan, Geehan S

2013-01-01

The reliability of the electrical distribution system is a contemporary research field due to diverse applications of electricity in everyday life and diverse industries. However a few research papers exist in literature. This paper proposes a methodology for assessing the reliability of 33/11 Kilovolt high-power stations based on average time between failures. The objective of this paper is to find the optimal fit for the failure data via time between failures. We determine the parameter estimation for all components of the station. We also estimate the reliability value of each component and the reliability value of the system as a whole. The best fitting distribution for the time between failures is a three parameter Dagum distribution with a scale parameter [Formula: see text] and shape parameters [Formula: see text] and [Formula: see text]. Our analysis reveals that the reliability value decreased by 38.2% in each 30 days. We believe that the current paper is the first to address this issue and its analysis. Thus, the results obtained in this research reflect its originality. We also suggest the practicality of using these results for power systems for both the maintenance of power systems models and preventive maintenance models.
The reliability analysis of a separated, dual fail operational redundant strapdown IMU. [inertial measurement unit

NASA Technical Reports Server (NTRS)

Motyka, P.

1983-01-01

A methodology for quantitatively analyzing the reliability of redundant avionics systems, in general, and the dual, separated Redundant Strapdown Inertial Measurement Unit (RSDIMU), in particular, is presented. The RSDIMU is described and a candidate failure detection and isolation system presented. A Markov reliability model is employed. The operational states of the system are defined and the single-step state transition diagrams discussed. Graphical results, showing the impact of major system parameters on the reliability of the RSDIMU system, are presented and discussed.
Sample size planning for composite reliability coefficients: accuracy in parameter estimation via narrow confidence intervals.

PubMed

Terry, Leann; Kelley, Ken

2012-11-01

Composite measures play an important role in psychology and related disciplines. Composite measures almost always have error. Correspondingly, it is important to understand the reliability of the scores from any particular composite measure. However, the point estimates of the reliability of composite measures are fallible and thus all such point estimates should be accompanied by a confidence interval. When confidence intervals are wide, there is much uncertainty in the population value of the reliability coefficient. Given the importance of reporting confidence intervals for estimates of reliability, coupled with the undesirability of wide confidence intervals, we develop methods that allow researchers to plan sample size in order to obtain narrow confidence intervals for population reliability coefficients. We first discuss composite reliability coefficients and then provide a discussion on confidence interval formation for the corresponding population value. Using the accuracy in parameter estimation approach, we develop two methods to obtain accurate estimates of reliability by planning sample size. The first method provides a way to plan sample size so that the expected confidence interval width for the population reliability coefficient is sufficiently narrow. The second method ensures that the confidence interval width will be sufficiently narrow with some desired degree of assurance (e.g., 99% assurance that the 95% confidence interval for the population reliability coefficient will be less than W units wide). The effectiveness of our methods was verified with Monte Carlo simulation studies. We demonstrate how to easily implement the methods with easy-to-use and freely available software. ©2011 The British Psychological Society.
Measuring Outcomes for Dysphagia: Validity and Reliability of the European Portuguese Eating Assessment Tool (P-EAT-10).

PubMed

Nogueira, Dália Santos; Ferreira, Pedro Lopes; Reis, Elizabeth Azevedo; Lopes, Inês Sousa

2015-10-01

The purpose of this study was to evaluate the validity and the reliability of the European Portuguese version of the EAT-10 (P-EAT-10). This research was conducted in three phases: (i) cultural and linguistic adaptation; (ii) feasibility and reliability test; and (iii) validity tests. The final sample was formed by a cohort of 520 subjects. The P-EAT-10 index was compared for socio-demographic and clinic variables. It was also compared for both dysphagic and non-dysphagic groups as well as for the results of the 3Oz wst. Lastly, the P-EAT-10 scores were correlated with the EuroQol Group Portuguese EQ-5D index. The Cronbach's α obtained for the P-EAT-10 scale was 0.952 and it remained excellent even if any item was deleted. The item-total and the intraclass correlation coefficients were very good. The P-EAT-10 mean of the non-dysphagic cohort was 0.56 and that of the dysphagic cohort was 14.26, the mean comparison between the 3Oz wst groups and the P-EAT-10 scores were significant. A significant higher perception of QoL was also found among the non-dysphagic subjects. P-EAT-10 is a valid and reliable measure that may be used to document dysphagia which makes it useful both for screening in clinical practice and in research.
Inertial Measurement Units for Clinical Movement Analysis: Reliability and Concurrent Validity

PubMed Central

Nicholas, Kevin; Sparkes, Valerie; Sheeran, Liba; Davies, Jennifer L

2018-01-01

The aim of this study was to investigate the reliability and concurrent validity of a commercially available Xsens MVN BIOMECH inertial-sensor-based motion capture system during clinically relevant functional activities. A clinician with no prior experience of motion capture technologies and an experienced clinical movement scientist each assessed 26 healthy participants within each of two sessions using a camera-based motion capture system and the MVN BIOMECH system. Participants performed overground walking, squatting, and jumping. Sessions were separated by 4 ± 3 days. Reliability was evaluated using intraclass correlation coefficient and standard error of measurement, and validity was evaluated using the coefficient of multiple correlation and the linear fit method. Day-to-day reliability was generally fair-to-excellent in all three planes for hip, knee, and ankle joint angles in all three tasks. Within-day (between-rater) reliability was fair-to-excellent in all three planes during walking and squatting, and poor-to-high during jumping. Validity was excellent in the sagittal plane for hip, knee, and ankle joint angles in all three tasks and acceptable in frontal and transverse planes in squat and jump activity across joints. Our results suggest that the MVN BIOMECH system can be used by a clinician to quantify lower-limb joint angles in clinically relevant movements. PMID:29495600
The reliability and validity of fatigue measures during short-duration maximal-intensity intermittent cycling.

PubMed

Glaister, Mark; Stone, Michael H; Stewart, Andrew M; Hughes, Michael; Moir, Gavin L

2004-08-01

The purpose of the present study was to assess the reliability and validity of fatigue measures, as derived from 4 separate formulae, during tests of repeat sprint ability. On separate days over a 3-week period, 2 groups of 7 recreationally active men completed 6 trials of 1 of 2 maximal (20 x 5 seconds) intermittent cycling tests with contrasting recovery periods (10 or 30 seconds). All trials were conducted on a friction-braked cycle ergometer, and fatigue scores were derived from measures of mean power output for each sprint. Apart from formula 1, which calculated fatigue from the percentage difference in mean power output between the first and last sprint, all remaining formulae produced fatigue scores that showed a reasonably good level of test-retest reliability in both intermittent test protocols (intraclass correlation range: 0.78-0.86; 95% likely range of true values: 0.54-0.97). Although between-protocol differences in the magnitude of the fatigue scores suggested good construct validity, within-protocol differences highlighted limitations with each formula. Overall, the results support the use of the percentage decrement score as the most valid and reliable measure of fatigue during brief maximal intermittent work.
Reliability and Validity Assessment of a Linear Position Transducer

PubMed Central

Garnacho-Castaño, Manuel V.; López-Lastra, Silvia; Maté-Muñoz, José L.

2015-01-01

The objectives of the study were to determine the validity and reliability of peak velocity (PV), average velocity (AV), peak power (PP) and average power (AP) measurements were made using a linear position transducer. Validity was assessed by comparing measurements simultaneously obtained using the Tendo Weightlifting Analyzer Systemi and T-Force Dynamic Measurement Systemr (Ergotech, Murcia, Spain) during two resistance exercises, bench press (BP) and full back squat (BS), performed by 71 trained male subjects. For the reliability study, a further 32 men completed both lifts using the Tendo Weightlifting Analyzer Systemz in two identical testing sessions one week apart (session 1 vs. session 2). Intraclass correlation coefficients (ICCs) indicating the validity of the Tendo Weightlifting Analyzer Systemi were high, with values ranging from 0.853 to 0.989. Systematic biases and random errors were low to moderate for almost all variables, being higher in the case of PP (bias ±157.56 W; error ±131.84 W). Proportional biases were identified for almost all variables. Test-retest reliability was strong with ICCs ranging from 0.922 to 0.988. Reliability results also showed minimal systematic biases and random errors, which were only significant for PP (bias -19.19 W; error ±67.57 W). Only PV recorded in the BS showed no significant proportional bias. The Tendo Weightlifting Analyzer Systemi emerged as a reliable system for measuring movement velocity and estimating power in resistance exercises. The low biases and random errors observed here (mainly AV, AP) make this device a useful tool for monitoring resistance training. Key points This study determined the validity and reliability of peak velocity, average velocity, peak power and average power measurements made using a linear position transducer The Tendo Weight-lifting Analyzer Systemi emerged as a reliable system for measuring movement velocity and power. PMID:25729300
A refined definition improves the measurement reliability of the tip-apex distance.

PubMed

Sakagoshi, Daigo; Sawaguchi, Takeshi; Shima, Yosuke; Inoue, Daisuke; Oshima, Takeshi; Goldhahn, Sabine

2016-07-01

Tip-apex distance (TAD) is reported as a good predictor for cut-outs of lag screws and spiral blades in the treatment of intertrochanteric fractures, and surgeons are advised to strive for TAD within 20 mm. However, the femoral neck axis and the position of the lower limb in the lateral radiograph are not clearly defined and can lead to measurement errors. We propose a refined TAD by defining these factors. The objective of this study was to analyze the reliability of this refined TAD. The radiographs of 130 prospective cases with unstable trochanteric fractures were used for the analysis of the refined TAD. The refined TAD was independently measured by 2 raters with clinical experience of more than 10 years (rater 1, 2) and 2 raters with much less clinical experience (rater 3, 4) after they received a training about the new measurement method. Intraclass correlation coefficient (ICC [2,4]) was calculated to assess the interrater reliability. The mean refined TADs were 18.2:18.4:18.2:18.2 mm for rater 1:2:3:4. There was a strong correlation among all four raters (ICC 0.998, (95% CI: 0.998, 0.999). Regardless of the clinical experience of raters, the refined TAD is a reliable tool and can be used to develop new TAD recommendations for predicting failure of fixation. Future studies with larger samples are needed to evaluate the predictive value of the refined TAD. Copyright © 2016 The Japanese Orthopaedic Association. Published by Elsevier B.V. All rights reserved.
Intra-rater and inter-rater reliability of ultrasonographic measurements of acromion-greater tuberosity distance in patients with post-stroke hemiplegia.

PubMed

Kumar, Praveen; Cruziah, Reynold; Bradley, Michael; Gray, Selena; Swinkels, Annette

2016-06-01

Glenohumeral subluxation (GHS) is reported in up to 81% of patients with stroke. Ultrasonographic measurements of GHS by measuring the acromion-greater tuberosity (AGT) have been found to be reliable for experienced raters. The primary aim was to assess the intra-rater reliability of measurements of AGT distance in people with stroke following a short course of rater training. A secondary aim was to compare the inter-rater reliability of these measurements between novice and experienced raters. Patients with stroke (n = 16; 5 men, 11 women; 74 ± 10 years) with 1-sided weakness who gave informed consent were recruited. Ultrasonographic measurements were recorded at the bedside by two physiotherapists with patients seated upright in a hospital chair. Reliability was assessed by intra-class correlation coefficients (ICCs) and the standard error of measurements (SEM). Minimum detectable change (MDC90) scores were used to estimate the magnitude of change that is likely to exceed measurement error. Mean ± SD AGT distances on the affected and unaffected sides for rater 1 were 2.2 ± 0.7 and 1.7 ± 0.4 cm, respectively. Corresponding values for rater 2 were 2.5 ± 0.6 and 2.0 ± 0.4 cm. Intra-class correlation coefficient values for the affected and unaffected shoulders for rater 1 were 0.96 and 0.91, respectively. Corresponding values for rater 2 were 0.95 and 0.90.SEM and MDC90 for both affected and unaffected shoulders were ≤ 0.2 cm. Inter-rater reliability coefficients were 0.86 (affected) and 0.76 (unaffected) shoulders. Ultrasonographic measurement of AGT distance demonstrates excellent intra-rater reliability for a novice rater. Inter-rater reliability of ultrasonographic measurement of AGT also demonstrates good reliability between novice and experienced raters.
The validation of a swimming turn wall-contact-time measurement system: a touchpad application reliability study.

PubMed

Brackley, Victoria; Ball, Kevin; Tor, Elaine

2018-05-12

The effectiveness of the swimming turn is highly influential to overall performance in competitive swimming. The push-off or wall contact, within the turn phase, is directly involved in determining the speed the swimmer leaves the wall. Therefore, it is paramount to develop reliable methods to measure the wall-contact-time during the turn phase for training and research purposes. The aim of this study was to determine the concurrent validity and reliability of the Pool Pad App to measure wall-contact-time during the freestyle and backstroke tumble turn. The wall-contact-times of nine elite and sub-elite participants were recorded during their regular training sessions. Concurrent validity statistics included the standardised typical error estimate, linear analysis and effect sizes while the intraclass correlating coefficient (ICC) was used for the reliability statistics. The standardised typical error estimate resulted in a moderate Cohen's d effect size with an R 2 value of 0.80 and the ICC between the Pool Pad and 2D video footage was 0.89. Despite these measurement differences, the results from this concurrent validity and reliability analyses demonstrated that the Pool Pad is suitable for measuring wall-contact-time during the freestyle and backstroke tumble turn within a training environment.
Assessing the Reliability of Curriculum-Based Measurement: An Application of Latent Growth Modeling

ERIC Educational Resources Information Center

Yeo, Seungsoo; Kim, Dong-Il; Branum-Martin, Lee; Wayman, Miya Miura; Espin, Christine A.

2012-01-01

The purpose of this study was to demonstrate the use of Latent Growth Modeling (LGM) as a method for estimating reliability of Curriculum-Based Measurement (CBM) progress-monitoring data. The LGM approach permits the error associated with each measure to differ at each time point, thus providing an alternative method for examining of the…

Reliable and valid NEWS for Chinese seniors: measuring perceived neighborhood attributes related to walking.

PubMed

Cerin, Ester; Sit, Cindy Hp; Cheung, Man-Chin; Ho, Sai-Yin; Lee, Lok-Chun Janet; Chan, Wai-Man

2010-11-25

The effects of the built environment on walking in seniors have not been studied in an Asian context. To examine these effects, valid and reliable measures are needed. The aim of this study was to develop and validate a questionnaire of perceived neighborhood characteristics related to walking appropriate for Chinese seniors (Neighborhood Environment Walkability Scale for Chinese Seniors, NEWS-CS). It was based on the Neighborhood Environment Walkability Scale - Abbreviated (NEWS-A), a validated measure of perceived built environment developed in the USA for adults. A secondary study aim was to establish the generalizability of the NEWS-A to an Asian high-density urban context and a different age group. A multidisciplinary panel of experts adapted the original NEWS-A to reflect the built environment of Hong Kong and needs of seniors. The translated instrument was pre-tested on a sample of 50 Chinese-speaking senior residents (65+ years). The final version of the NEWS-CS was interviewer-administered to 484 seniors residing in four selected Hong Kong districts varying in walkability and socio-economic status. Ninety-two participants completed the questionnaire on two separate occasions, 2-3 weeks apart. Test-rest reliability indices were estimated for each item and subscale of the NEWS-CS. Confirmatory factor analysis was used to develop the measurement model of the NEWS-CS and cross-validate that of the NEWS-A. The final version of the NEWS-CS consisted of 14 subscales and four single items (76 items). Test-retest reliability was moderate to good (ICC > 50 or % agreement > 60) except for four items measuring distance to destinations. The originally-proposed measurement models of the NEWS-A and NEWS-CS required 2-3 theoretically-justifiable modifications to fit the data well. The NEWS-CS possesses sufficient levels of reliability and factorial validity to be used for measuring perceived neighborhood environment in Chinese seniors. Further work is needed to assess its
Reliability, precision, and gender differences in knee internal/external rotation proprioception measurements.

PubMed

Nagai, Takashi; Sell, Timothy C; Abt, John P; Lephart, Scott M

2012-11-01

To develop and assess the reliability and precision of knee internal/external rotation (IR/ER) threshold to detect passive motion (TTDPM) and determine if gender differences exist. Test-retest for the reliability/precision and cross-sectional for gender comparisons. University neuromuscular and human performance research laboratory. Ten subjects for the reliability and precision aim. Twenty subjects (10 males and 10 females) for gender comparisons. All TTDPM tests were performed using a multi-mode dynamometer. Subjects performed TTDPM at two knee positions (near IR or ER end-range). Intraclass correlation coefficient (ICC (3,k)) and standard error of measurement (SEM) were used to evaluate the reliability and precision. Independent t-tests were used to compare genders. TTDPM toward IR and ER at two knee positions. Intrasession and intersession reliability and precision were good (ICC=0.68-0.86; SEM=0.22°-0.37°). Females had significantly diminished TTDPM toward IR at IR-test position (males: 0.77°±0.14°, females: 1.18°±0.46°, p=0.021) and TTDPM toward IR at the ER-test position (males: 0.87°±0.13°, females: 1.36°±0.58°, p=0.026). No other significant gender differences were found (p>0.05). The current IR/ER TTDPM methods are reliable and accurate for the test-retest or cross-section research design. Gender differences were found toward IR where the ACL acts as the secondary restraint. Copyright © 2011 Elsevier Ltd. All rights reserved.
Reliability of length measurements collected by community nurses and health volunteers in rural growth monitoring and promotion services.

PubMed

Laar, Matilda E; Marquis, Grace S; Lartey, Anna; Gray-Donald, Katherine

2018-02-17

Length measurements are important in growth, monitoring and promotion (GMP) for the surveillance of a child's weight-for-length and length-for-age. These two indices provide an indication of a child's risk of becoming wasted or stunted, and are more informative about a child's growth than the widely used weight-for-age index (underweight). Although the introduction of length measurements in GMP is recommended by the World Health Organization, concerns about the reliability of length measurements collected in rural outreach settings have been expressed by stakeholders. Our aim was to describe the reliability and challenges associated with community health personnel measuring length for rural outreach GMP activities. Two reliability studies (A and B), using 10 children less than 24 months each, were conducted in the GMP services of a rural district in Ghana. Fifteen nurses and 15 health volunteers (HV) with no prior experience in length measurements were trained. Intra- and inter-observer technical error of measurement (TEM), average bias from expert anthropometrist, and coefficient of reliability (R) of length measurements were assessed and compared across sessions. Observations and interviews were used to understand the ability and experiences of health personnel with measuring length at outreach GMP. Inter-observer TEM was larger than intra-observer TEM for both nurses and HV at both sessions and was unacceptably (compared to error standards) high in both groups at both time points. Average biases from expert's measurements were within acceptable limits, however, both groups tended to underestimate length measurements. The R for lengths collected by nurses (92.3%) was higher at session B compared to that of HV (87.5%). Length measurements taken by nurses and HV, and those taken by an experienced anthropometrist at GMP sessions were of moderate agreement (kappa = 0.53, p < 0.0001). The reliability of length measurements improved after two refresher
Relative and absolute reliability of measures of linoleic acid-derived oxylipins in human plasma.

PubMed

Gouveia-Figueira, Sandra; Bosson, Jenny A; Unosson, Jon; Behndig, Annelie F; Nording, Malin L; Fowler, Christopher J

2015-09-01

Modern analytical techniques allow for the measurement of oxylipins derived from linoleic acid in biological samples. Most validatory work has concerned extraction techniques, repeated analysis of aliquots from the same biological sample, and the influence of external factors such as diet and heparin treatment upon their levels, whereas less is known about the relative and absolute reliability of measurements undertaken on different days. A cohort of nineteen healthy males were used, where samples were taken at the same time of day on two occasions, at least 7 days apart. Relative reliability was assessed using Lin's concordance correlation coefficients (CCC) and intraclass correlation coefficients (ICC). Absolute reliability was assessed by Bland-Altman analyses. Nine linoleic acid oxylipins were investigated. ICC and CCC values ranged from acceptable (0.56 [13-HODE]) to poor (near zero [9(10)- and 12(13)-EpOME]). Bland-Altman limits of agreement were in general quite wide, ranging from ±0.5 (12,13-DiHOME) to ±2 (9(10)-EpOME; log10 scale). It is concluded that relative reliability of linoleic acid-derived oxylipins varies between lipids with compounds such as the HODEs showing better relative reliability than compounds such as the EpOMEs. These differences should be kept in mind when designing and interpreting experiments correlating plasma levels of these lipids with factors such as age, body mass index, rating scales etc. Copyright © 2015 Elsevier Inc. All rights reserved.
The validity and reliability of a novel activity monitor as a measure of walking

PubMed Central

Ryan, C G; Grant, P M; Tigbe, W W; Granat, M H

2006-01-01

Background The accurate measurement of physical activity is crucial to understanding the relationship between physical activity and disease prevention and treatment. Objective The primary purpose of this study was to investigate the validity and reliability of the activPAL physical activity monitor in measuring step number and cadence. Methods The ability of the activPAL monitor to measure step number and cadence in 20 healthy adults (age 34.5±6.9 years; BMI 26.8±4.8 (mean±SD)) was evaluated against video observation. Concurrently, the accuracy of two commonly used pedometers, the Yamax Digi‐Walker SW‐200 and the Omron HJ‐109‐E, was compared to observation for measuring step number. Participants walked on a treadmill at five different speeds (0.90, 1.12, 1.33, 1.56, and 1.78 m/s) and outdoors at three self selected speeds (slow, normal, and fast). Results At all speeds, inter device reliability was excellent for the activPAL (ICC (2,1)⩾0.99) for both step number and cadence. The absolute percentage error for the activPAL was <1.11% for step number and cadence regardless of walking speed. The accuracy of the pedometers was adversely affected by slow walking speeds. Conclusion The activPAL monitor is a valid and reliable measure of walking in healthy adults. Its accuracy is not influenced by walking speed. The activPAL may be a useful device in sports medicine. PMID:16825270
Reliability, precision, and measurement in the context of data from ability tests, surveys, and assessments

NASA Astrophysics Data System (ADS)

Fisher, W. P., Jr.; Elbaum, B.; Coulter, A.

2010-07-01

Reliability coefficients indicate the proportion of total variance attributable to differences among measures separated along a quantitative continuum by a testing, survey, or assessment instrument. Reliability is usually considered to be influenced by both the internal consistency of a data set and the number of items, though textbooks and research papers rarely evaluate the extent to which these factors independently affect the data in question. Probabilistic formulations of the requirements for unidimensional measurement separate consistency from error by modelling individual response processes instead of group-level variation. The utility of this separation is illustrated via analyses of small sets of simulated data, and of subsets of data from a 78-item survey of over 2,500 parents of children with disabilities. Measurement reliability ultimately concerns the structural invariance specified in models requiring sufficient statistics, parameter separation, unidimensionality, and other qualities that historically have made quantification simple, practical, and convenient for end users. The paper concludes with suggestions for a research program aimed at focusing measurement research more on the calibration and wide dissemination of tools applicable to individuals, and less on the statistical study of inter-variable relations in large data sets.
A comparison of the reliability of make versus break testing in measuring palmar abduction strength of the thumb.

PubMed

Lim, J X; Toh, R X; Chook, S K H; Sebastin, S J; Karjalainen, T

2014-06-01

Previous studies have established the role of quantitative measurements of palmar abduction strength of the thumb (PAST). This study compares the reliability of the 'make' versus the 'break' test in measuring PAST in healthy volunteers. In a 'make' test, the body part being tested is positioned at the start of its range of motion and the participant is asked to exert his/her maximal force. In a 'break' test, increasing force is applied to a body part after it has completed its range of motion, until the joint being tested gives way. PAST was measured in both hands in 100 healthy volunteers using a handheld device. Two examiners measured PAST using both the 'make' and 'break' test to determine inter-rater reliability. The tests were repeated in 30 volunteers 6 weeks after the initial testing to determine intra-rater reliability. Our results showed that the 'make' test has better inter and intra-rater reliability.
Sensitivity to mental effort and test-retest reliability of heart rate variability measures in healthy seniors.

PubMed

Mukherjee, Shalini; Yadav, Rajeev; Yung, Iris; Zajdel, Daniel P; Oken, Barry S

2011-10-01

To determine (1) whether heart rate variability (HRV) was a sensitive and reliable measure in mental effort tasks carried out by healthy seniors and (2) whether non-linear approaches to HRV analysis, in addition to traditional time and frequency domain approaches were useful to study such effects. Forty healthy seniors performed two visual working memory tasks requiring different levels of mental effort, while ECG was recorded. They underwent the same tasks and recordings 2 weeks later. Traditional and 13 non-linear indices of HRV including Poincaré, entropy and detrended fluctuation analysis (DFA) were determined. Time domain, especially mean R-R interval (RRI), frequency domain and, among non-linear parameters - Poincaré and DFA were the most reliable indices. Mean RRI, time domain and Poincaré were also the most sensitive to different mental effort task loads and had the largest effect size. Overall, linear measures were the most sensitive and reliable indices to mental effort. In non-linear measures, Poincaré was the most reliable and sensitive, suggesting possible usefulness as an independent marker in cognitive function tasks in healthy seniors. A large number of HRV parameters was both reliable as well as sensitive indices of mental effort, although the simple linear methods were the most sensitive. Copyright © 2011 International Federation of Clinical Neurophysiology. Published by Elsevier Ireland Ltd. All rights reserved.
Reliability of a store observation tool in measuring availability of alcohol and selected foods.

PubMed

Cohen, Deborah A; Schoeff, Diane; Farley, Thomas A; Bluthenthal, Ricky; Scribner, Richard; Overton, Adrian

2007-11-01

Alcohol and food items can compromise or contribute to health, depending on the quantity and frequency with which they are consumed. How much people consume may be influenced by product availability and promotion in local retail stores. We developed and tested an observational tool to objectively measure in-store availability and promotion of alcoholic beverages and selected food items that have an impact on health. Trained observers visited 51 alcohol outlets in Los Angeles and southeastern Louisiana. Using a standardized instrument, two independent observations were conducted documenting the type of outlet, the availability and shelf space for alcoholic beverages and selected food items, the purchase price of standard brands, the placement of beer and malt liquor, and the amount of in-store alcohol advertising. Reliability of the instrument was excellent for measures of item availability, shelf space, and placement of malt liquor. Reliability was lower for alcohol advertising, beer placement, and items that measured the "least price" of apples and oranges. The average kappa was 0.87 for categorical items and the average intraclass correlation coefficient was 0.83 for continuous items. Overall, systematic observation of the availability and promotion of alcoholic beverages and food items was feasible, acceptable, and reliable. Measurement tools such as the one we evaluated should be useful in studies of the impact of availability of food and beverages on consumption and on health outcomes.
Inter-rater reliability of measures to characterize the tobacco retail environment in Mexico.

PubMed

Hall, Marissa G; Kollath-Cattano, Christy; Reynales-Shigematsu, Luz Myriam; Thrasher, James F

2015-01-01

To evaluate the inter-rater reliability of a data collection instrument to assess the tobacco retail environment in Mexico, after major marketing regulations were implemented. In 2013, two data collectors independently evaluated 21 stores in two census tracts, through a data collection instrument that assessed the presence of price promotions, whether single cigarettes were sold, the number of visible advertisements, the presence of signage prohibiting the sale of cigarettes to minors, and characteristics of cigarette pack displays. We evaluated the inter-rater reliability of the collected data, through the calculation of metrics such as intraclass correlation coefficient, percent agreement, Cohen's kappa and Krippendorff's alpha. Most measures demonstrated substantial or perfect inter-rater reliability. Our results indicate the potential utility of the data collection instrument for future point-of-sale research.
Reliability of internal oblique elbow radiographs for measuring displacement of medial epicondyle humerus fractures: a cadaveric study.

PubMed

Gottschalk, Hilton P; Bastrom, Tracey P; Edmonds, Eric W

2013-01-01

Standard elbow radiographs (AP and lateral views) are not accurate enough to measure true displacement of medial epicondyle fractures of the humerus. The amount of perceived displacement has been used to determine treatment options. This study assesses the utility of internal oblique radiographs for measurement of true displacement in these fractures. A medial epicondyle fracture was created in a cadaveric specimen. Displacement of the fragment (mm) was set at 5, 10, and 15 in line with the vector of the flexor pronator mass. The fragment was sutured temporarily in place. Radiographs were obtained at 0 (AP), 15, 30, 45, 60, 75, and 90 degrees (lateral) of internal rotation, with the elbow in set positions of flexion. This was done with and without radio-opaque markers placed on the fragment and fracture bed. The 45 and 60 degrees internal oblique radiographs were then presented to 5 separate reviewers (of different levels of training) to evaluate intraobserver and interobserver agreement. Change in elbow position did not affect the perceived displacement (P=0.82) with excellent intraobserver reliability (intraclass correlation coefficient range, 0.979 to 0.988) and interobserver agreement of 0.953. The intraclass correlation coefficient for intraobserver reliability on 45 degrees internal oblique films for all groups ranged from 0.985 to 0.998, with interobserver agreement of 0.953. For predicting displacement, the observers were 60% accurate in predicting the true displacement on the 45 degrees internal oblique films and only 35% accurate using the 60 degrees internal oblique view. Standardizing to a 45 degrees internal oblique radiograph of the elbow (regardless of elbow flexion) can augment the treating surgeon's ability to determine true displacement. At this degree of rotation, the measured number can be multiplied by 1.4 to better estimate displacement. The addition of a 45 degrees internal oblique radiograph in medial humeral epicondyle fractures has good
Intrarater reliability of hand held dynamometry in measuring lower extremity isometric strength using a portable stabilization device.

PubMed

Jackson, Steven M; Cheng, M Samuel; Smith, A Russell; Kolber, Morey J

2017-02-01

Hand held dynamometry (HHD) is a more objective way to quantify muscle force production (MP) compared to traditional manual muscle testing. HHD reliability can be negatively impacted by both the strength of the tester and the subject particularly in the lower extremities due to larger muscle groups. The primary aim of this investigation was to assess intrarater reliability of HHD with use of a portable stabilization device for lower extremity MP in an athletic population. Isometric lower extremity strength was measured for bilateral lower extremities including hip abductors, external rotators, adductors, knee extensors, and ankle plantar flexors was measured in a sample of healthy recreational runners (8 male, 7 females, = 30 limbs) training for a marathon. These measurements were assessed using an intrasession intrarater reliability design. Intraclass correlation coefficients (ICC) were calculated using 3,1 model based on the single rater design. The standard error of measurement (SEM) for each muscle group was also calculated. ICC were excellent ranging from ICC (3,1) = 0.93-0.98 with standard error of measurements ranging from 0.58 to 17.2 N. This study establishes the use of a HHD with a portable stabilization device as demonstrating good reliability within testers for measuring lower extremity muscle performance in an active healthy population. Copyright © 2016 Elsevier Ltd. All rights reserved.
Development, reliability, and validity of the Alberta Perinatal Stroke Project Parental Outcome Measure.

PubMed

Bemister, Taryn B; Brooks, Brian L; Kirton, Adam

2014-07-01

Perinatal stroke is a leading cause of cerebral palsy and lifelong disability, although parent and family outcomes have not yet been studied in this specific population. The Alberta Perinatal Stroke Project Parental Outcome Measure was developed as a 26-item questionnaire on the impact of perinatal stroke on parents and families. The items were derived from expert opinion and scientific literature on issues salient to parents of children with perinatal stroke, including guilt and blame, which are not well captured in existing measures of family impact. Data were collected from 82 mothers and 28 fathers who completed the Parental Outcome Measure and related questionnaires (mean age, 39.5 years; mean child age, 7.4 years). Analyses examined the Parental Outcome Measure's internal consistency, test-retest reliability, validity, and factor structure. The Parental Outcome Measure demonstrated three unique theoretical constructs: Psychosocial Impact, Guilt, and Blame. The Parental Outcome Measure has excellent internal consistency (Cronbach α = 0.91) and very good test-retest reliability more than 2-5 weeks (r = 0.87). Regarding validity, the Parental Outcome Measure is sensitive to condition severity, accounts for additional variance in parent outcomes, and strongly correlates with measures of anxiety, depression, stress, quality of life, family functioning, and parent adjustment. The Parental Outcome Measure contributes to the literature as the first brief measure of family impact designed for parents of children with perinatal stroke. Copyright © 2014 Elsevier Inc. All rights reserved.
Immersion in Cold-Water Evaluation (ICE) and self-reported cold intolerance are reliable but unrelated measures.

PubMed

Traynor, Robyn; MacDermid, Joy C

2008-09-01

Intolerance to the cold is common following peripheral nerve injury and surgery of the upper extremity. Despite its prevalence, the exact pathophysiology and natural history of this condition are not well understood. Subjective, self-report questionnaires have been created and validated as reliable measures of post-traumatic cold intolerance. The difficulty currently lies in assigning an objective measure to this predominantly subjective phenomenon. The present study evaluated the test-retest reliability of a proposed objective measure of cold intolerance, the Immersion in Cold-water Evaluation (ICE), and its correlation with subjective measures in healthy control subjects. Two age groups were also compared to investigate the effect of age on cold intolerance and temperature recovery. On two separate testing days, subjects completed three health-related questionnaires and submersed their dominant hands in cold water. The temperature of their second and fifth digits was monitored during recovery. Both the objective cold-provocation testing and the subjective self-report questionnaires were highly reliable albeit not significantly correlated. No significant temperature recovery trend was noted between the age groups. Post-traumatic cold intolerance is postulated to have both a vascular and neural etiology among other contributing causes. The protocol studied here was centered predominantly on the former etiology, examining peripheral blood flow and associated temperature recovery. This study established ICE as a reliable means to objectively measure cold response, supplementing information provided by previously validated self-report methods.
The Nutrition Literacy Assessment Instrument is a Valid and Reliable Measure of Nutrition Literacy in Adults with Chronic Disease.

PubMed

Gibbs, Heather D; Ellerbeck, Edward F; Gajewski, Byron; Zhang, Chuanwu; Sullivan, Debra K

2018-03-01

To test the reliability and validity of the Nutrition Literacy Assessment Instrument (NLit) in adult primary care and identify the relationship between nutrition literacy and diet quality. This instrument validation study included a cross-sectional sample participating in up to 2 visits 1 month apart. A total of 429 adults with nutrition-related chronic disease were recruited from clinics and a patient registry affiliated with a Midwestern university medical center. Nutrition literacy was measured by the NLit, which was composed of 6 subscales: nutrition and health, energy sources in food, food label and numeracy, household food measurement, food groups, and consumer skills. Diet quality was measured by Healthy Eating Index-2010 with nutrient data from Diet History Questionnaire II surveys. The researchers measured factor validity and reliability by using binary confirmatory factor analysis; test-retest reliability was measured by Pearson r and the intraclass correlation coefficient, and relationships between nutrition literacy and diet quality were analyzed by linear regression. The NLit demonstrated substantial factor validity and reliability (0.97; confidence interval, 0.96-0.98) and test-retest reliability (0.88; confidence interval, 0.85-0.90). Nutrition literacy was the most significant predictor of diet quality (β = .17; multivariate coefficient = 0.10; P < .001). The NLit is a valid and reliable tool for measuring nutrition literacy in adult primary care patients. Copyright © 2017 Society for Nutrition Education and Behavior. Published by Elsevier Inc. All rights reserved.
The reliability of the Extra Load Index as a measure of relative load carriage economy.

PubMed

Hudson, Sean; Cooke, Carlton; Lloyd, Ray

2017-09-01

The aim of this study was to measure the reliability of the extra load index (ELI) as a method for assessing relative load carriage economy. Seventeen volunteers (12 males, 5 females) performed walking trials at 3 km·h -1 , 6 km·h -1 and a self-selected speed. Trial conditions were repeated 7 days later to assess test-retest reliability. Trials involved four 4-minute periods of walking, each separated by 5 min of rest. The initial stage was performed unloaded followed in a randomised order by a second unloaded period and walking with backpacks of 7 and 20 kg. Results show ELI values did not differ significantly between trials for any of the speeds (p = 0.46) with either of the additional loads (p = 0.297). The systematic bias, limits of agreement and coefficients of variation were small in all trial conditions. We conclude the ELI appears to be a reliable measure of relative load carriage economy. Practitioner Summary: This paper demonstrates that the ELI is a reliable measure of load carriage economy at a range of walking speeds with both a light and heavy load. The ELI, therefore, represents a useful tool for comparing the relative economy associated with different load carriage systems.
Validity and Reliability of Thai Version of the Foot and Ankle Ability Measure (FAAM) Subjective Form.

PubMed

Arunakul, Marut; Arunakul, Preeyaphan; Suesiritumrong, Chakhrist; Angthong, Chayanin; Chernchujit, Bancha

2015-06-01

Self-administered questionnaires have become an important aspect for clinical outcome assessment of foot and ankle-related problems. The Foot and Ankle Ability Measure (FAAM) subjective form is a region-specific questionnaire that is widely used and has sufficient validity and reliability from previous studies. Translate the original English version of FAAM into a Thai version and evaluate the validity and reliability of Thai FAAM in patients with foot and ankle-related problems. The FAAM subjective form was translated into Thai using forward-backward translation protocol. Afterward, reliability and validity were tested. Following responses from 60 consecutive patients on two questionnaires, the Thai FAAM subjective form and the short form (SF)-36, were used. The validity was tested by correlating the scores from both questionnaires. The reliability was adopted by measuring the test-retest reliability and internal consistency. Thai FAAM score including activity of daily life (ADL) and Sport subscale demonstrated the sufficient correlations with physical functioning (PF) and physical composite score (PCS) domains of the SF-36 (statistically significant with p < 0.001 level and ≥ 0.5 values). The result of reliability revealed highly intra-class correlation coefficient as 0.8 and 0.77, respectively from test-retest study. The internal consistency was strong (Cronbach alpha = 0.94 and 0.88, respectively). The Thai version of FAAM subjective form retained the characteristics of the original version and has proved a reliable evaluation instrument for patients with foot and ankle-related problems.
The use of evidence-based guidance to enable reliable and accurate measurements of the home environment

PubMed Central

Atwal, Anita; McIntyre, Anne

2017-01-01

Introduction High quality guidance in home strategies is needed to enable older people to measure their home environment and become involved in the provision of assistive devices and to promote consistency among professionals. This study aims to investigate the reliability of such guidance and its ability to promote accuracy of results when measurements are taken by both older people and professionals. Method Twenty-five health professionals and 26 older people participated in a within-group design to test the accuracy of measurements taken (that is, person’s popliteal height, baths, toilets, beds, stairs and chairs). Data were analysed with descriptive analysis and the Wilcoxon test. The intra-rater reliability was assessed by correlating measurements taken at two different times with guidance use. Results The intra-rater reliability analysis revealed statistical significance (P < 0.05) for all measurements except for the bath internal width. The guidance enabled participants to take 90% of measurements that they were not able to complete otherwise, 80.55% of which lay within the acceptable suggested margin of variation. Accuracy was supported by the significant reduction in the standard deviation of the actual measurements and accuracy scores. Conclusion This evidence-based guidance can be used in its current format by older people and professionals to facilitate appropriate measurements. Yet, some users might need help from carers or specialists depending on their impairments. PMID:29386701
The use of evidence-based guidance to enable reliable and accurate measurements of the home environment.

PubMed

Spiliotopoulou, Georgia; Atwal, Anita; McIntyre, Anne

2018-01-01

High quality guidance in home strategies is needed to enable older people to measure their home environment and become involved in the provision of assistive devices and to promote consistency among professionals. This study aims to investigate the reliability of such guidance and its ability to promote accuracy of results when measurements are taken by both older people and professionals. Twenty-five health professionals and 26 older people participated in a within-group design to test the accuracy of measurements taken (that is, person's popliteal height, baths, toilets, beds, stairs and chairs). Data were analysed with descriptive analysis and the Wilcoxon test. The intra-rater reliability was assessed by correlating measurements taken at two different times with guidance use. The intra-rater reliability analysis revealed statistical significance ( P < 0.05) for all measurements except for the bath internal width. The guidance enabled participants to take 90% of measurements that they were not able to complete otherwise, 80.55% of which lay within the acceptable suggested margin of variation. Accuracy was supported by the significant reduction in the standard deviation of the actual measurements and accuracy scores. This evidence-based guidance can be used in its current format by older people and professionals to facilitate appropriate measurements. Yet, some users might need help from carers or specialists depending on their impairments.
The intelligibility in Context Scale: validity and reliability of a subjective rating measure.

PubMed

McLeod, Sharynne; Harrison, Linda J; McCormack, Jane

2012-04-01

To describe a new measure of functional intelligibility, the Intelligibility in Context Scale (ICS), and evaluate its validity, reliability, and sensitivity using 3 clinical measures of severity of speech sound disorder: (a) percentage of phonemes correct (PPC), (b) percentage of consonants correct (PCC), and (c) percentage of vowels correct (PVC). Speech skills of 120 preschool children (109 with parent-/teacher-identified concern about how they talked and made speech sounds and 11 with no identified concern) were assessed with the Diagnostic Evaluation of Articulation and Phonology (Dodd, Hua, Crosbie, Holm, & Ozanne, 2002). Parents completed the 7-item ICS, which rates the degree to which children's speech is understood by different communication partners (parents, immediate family, extended family, friends, acquaintances, teachers, and strangers) on a 5-point scale. Parents' ratings showed that most children were always (5) or usually (4) understood by parents, immediate family, and teachers, but only sometimes (3) by strangers. Factor analysis confirmed the internal consistency of the ICS items; therefore, ratings were averaged to form an overall intelligibility score. The ICS had high internal reliability (α = .93), sensitivity, and construct validity. Criterion validity was established through significant correlations between the ICS and PPC (r = .54), PCC (r = .54), and PVC (r = .36). The ICS is a promising new measure of functional intelligibility. These data provide initial support for the ICS as an easily administered, valid, and reliable estimate of preschool children's intelligibility when speaking with people of varying levels of familiarity and authority.

Reliability and validity of urinary nerve growth factor measurement in women with lower urinary tract symptoms.

PubMed

Vijaya, Gopalan; Cartwright, Rufus; Bhide, Alka; Derpapas, Alexandros; Fernando, Ruwan; Khullar, Vik

2016-11-01

The validity and reliability of measurement of urinary NGF as a diagnostic biomarker in women with lower urinary tract dysfunction (LUTD) is uncertain. We aimed to evaluate both the diagnostic and discriminant validity, and the test-retest reliability of urinary NGF measurement in women with LUTD. Urinary NGF was measured in women with LUTD (n = 205) and asymptomatic subjects (n = 31). Urinary NGF was assayed using an ELISA method and normalized against urinary creatinine. NGF/creatinine ratios were compared between symptom subgroups using Mann-Whitney U test, and between different urodynamic diagnoses using the Kruskal-Wallis test. Receiver Operator Characteristic (ROC) analysis was employed to evaluate the diagnostic performance of urinary NGF. Test-retest reliability of NGF measurement was assessed using intra-class correlation (ICC). Urinary NGF was significantly but non-specifically increased in symptomatic patients when compared to controls (13.33 vs. 2.05 ng NGF/g Cr, P < 0.001). On multivariate logistic regression NGF was a good predictor of patients having OAB or not, however, the adjusted odds ratio only 1.006. ROC analysis demonstrated poor discriminant ability between different symptomatic groups and urodynamic groups. Using a cut off of 13.0 ng NGF/g creatinine the test provides a sensitivity of 81%, but a specificity of only 39% for overactive bladder. The assays demonstrated good test-retest reliability with ICC of 0.889. Although urinary NGF can be reliably assayed, and is increased in various LUTDs, it discriminates poorly between these disorders therefore has very limited potential as a biomarker. Neurourol. Urodynam. 35:944-948, 2016. © 2015 Wiley Periodicals, Inc. © 2015 Wiley Periodicals, Inc.
Test-Retest Reliability of Respiratory Resistance Measured with the Airflow Perturbation Device

ERIC Educational Resources Information Center

Gallena, Sally K.; Solomon, Nancy Pearl; Johnson, Arthur T.; Vossoughi, Jafar; Tian, Wei

2014-01-01

Purpose: In this study, the authors aimed to determine reliability of the airflow perturbation device (APD) to measure respiratory resistance within and across sessions during resting tidal (RTB) and postexercise breathing in healthy athletes, and during RTB across trials within a session in athletes with paradoxical vocal fold motion (PVFM)…
Reliability analysis of a phaser measurement unit using a generalized fuzzy lambda-tau(GFLT) technique.

PubMed

Komal

2018-05-01

Nowadays power consumption is increasing day-by-day. To fulfill failure free power requirement, planning and implementation of an effective and reliable power management system is essential. Phasor measurement unit(PMU) is one of the key device in wide area measurement and control systems. The reliable performance of PMU assures failure free power supply for any power system. So, the purpose of the present study is to analyse the reliability of a PMU used for controllability and observability of power systems utilizing available uncertain data. In this paper, a generalized fuzzy lambda-tau (GFLT) technique has been proposed for this purpose. In GFLT, system components' uncertain failure and repair rates are fuzzified using fuzzy numbers having different shapes such as triangular, normal, cauchy, sharp gamma and trapezoidal. To select a suitable fuzzy number for quantifying data uncertainty, system experts' opinion have been considered. The GFLT technique applies fault tree, lambda-tau method, fuzzified data using different membership functions, alpha-cut based fuzzy arithmetic operations to compute some important reliability indices. Furthermore, in this study ranking of critical components of the system using RAM-Index and sensitivity analysis have also been performed. The developed technique may be helpful to improve system performance significantly and can be applied to analyse fuzzy reliability of other engineering systems. Copyright © 2018 ISA. Published by Elsevier Ltd. All rights reserved.
Is computed tomography an accurate and reliable method for measuring total knee arthroplasty component rotation?

PubMed

Figueroa, José; Guarachi, Juan Pablo; Matas, José; Arnander, Magnus; Orrego, Mario

2016-04-01

Computed tomography (CT) is widely used to assess component rotation in patients with poor results after total knee arthroplasty (TKA). The purpose of this study was to simultaneously determine the accuracy and reliability of CT in measuring TKA component rotation. TKA components were implanted in dry-bone models and assigned to two groups. The first group (n = 7) had variable femoral component rotations, and the second group (n = 6) had variable tibial tray rotations. CT images were then used to assess component rotation. Accuracy of CT rotational assessment was determined by mean difference, in degrees, between implanted component rotation and CT-measured rotation. Intraclass correlation coefficient (ICC) was applied to determine intra-observer and inter-observer reliability. Femoral component accuracy showed a mean difference of 2.5° and the tibial tray a mean difference of 3.2°. There was good intra- and inter-observer reliability for both components, with a femoral ICC of 0.8 and 0.76, and tibial ICC of 0.68 and 0.65, respectively. CT rotational assessment accuracy can differ from true component rotation by approximately 3° for each component. It does, however, have good inter- and intra-observer reliability.
Test-retest reliability of the assessment of postural stability in typically developing children and in hearing impaired children.

PubMed

De Kegel, A; Dhooge, I; Cambier, D; Baetens, T; Palmans, T; Van Waelvelde, H

2011-04-01

The purpose of this study was to establish test-retest reliability of centre of pressure (COP) measurements obtained by an AccuGait portable forceplate (ACG), mean COG sway velocity measured by a Basic Balance Master (BBM) and clinical balance tests in children with and without balance difficulties. 49 typically developing children and 23 hearing impaired children, with a higher risk for stability problems, between 6 and 12 years of age participated. Each child performed the modified Clinical Test of Sensory Interaction on Balance (mCTSIB), Unilateral Stance (US) and Tandem Stance on ACG, mCTSIB and US on BBM and clinical balance tests: one-leg standing, balance beam walking and one-leg hopping. All subjects completed 2 test sessions on 2 different days in the same week assessed by the same examiner. Among COP measurements obtained by the ACG, mean sway velocity was the most reliable parameter with all ICCs higher than 0.72. The standard deviation (SD) of sway velocity, sway area, SD of anterior-posterior and SD of medio-lateral COP data showed moderate to excellent reliability with ICCs between 0.55 and 0.96 but some caution must be taken into account in some conditions. BBM is less reliable but clinical balance tests are as reliable as ACG. Hearing impaired children exhibited better relative reliability (ICC) and comparable absolute reliability (SEM) for most balance parameters compared to typically developing children. Reliable information regarding postural stability of typically developing children and hearing impaired children may be obtained utilizing COP measurements generated by an AccuGait system and clinical balance tests. Copyright © 2011 Elsevier B.V. All rights reserved.
Validity and Reliability of Surface Electromyography in the Assessment of Primary Muscle Tension Dysphonia.

PubMed

Khoddami, Seyyedeh Maryam; Talebian, Saeed; Izadi, Farzad; Ansari, Noureddin Nakhostin

2017-05-01

The study aims to evaluate the reliability and the discriminative validity of surface electromyography (sEMG) in the assessment of patients with primary muscle tension dysphonia (MTD). The study design is cross-sectional. Fifteen patients with primary MTD (mean age: 34.07 ± 10.99 years) and 15 healthy volunteers (mean age: 34.53 ± 10.63 years) were included. All participants underwent evaluation of sEMG to record the electrical activity of the thyrohyoid and cricothyroid muscles. The outcome measures were the root mean square (RMS), activity peak, duration, and time to the peak activity, which were obtained during /a/ and /i/ prolongation for test-retest reliability. The test-retest reliability was good to excellent for the RMS and peak activity measures (intraclass correlation coefficient [agreement] [ICC agreement ] = 0.49-0.98). The reliability for the activity duration was poor to excellent (ICC agreement = 0.19-0.9). Poor test-retest reliability was found for the time to peak measure (ICC agreement = 0.15-0.37). The standard error of measurement for all sEMG measures was between 0.41 and 2.05. The smallest detectable change (SDC) was calculated between 1.13 and 5.66. The highest SDC values were obtained for the peak and the lowest SDCs were documented for the duration (5.66 and 1.13, respectively). All sEMG measures were not able to discriminate between the MTD patients and healthy subjects (P > 0.05). The sEMG is a reliable tool to measure the RMS, the peak activity, and the activity duration in primary MTD. However, it is not able to discriminate the patients with primary MTD from healthy subjects. Copyright © 2017 The Voice Foundation. Published by Elsevier Inc. All rights reserved.
Reliability and Correlation of Static and Dynamic Foot Arch Measurement in a Healthy Pediatric Population.

PubMed

Scholz, Timo; Zech, Astrid; Wegscheider, Karl; Lezius, Susanne; Braumann, Klaus-Michael; Sehner, Susanne; Hollander, Karsten

2017-09-01

Measurement of the medial longitudinal foot arch in children is a controversial topic, as there are many different methods without a definite standard procedure. The purpose of this study was to 1) investigate intraday and interrater reliability regarding dynamic arch index and static arch height, 2) explore the correlation between both arch indices, and 3) examine the variation of the medial longitudinal arch at two different times of the day. Eighty-six children (mean ± SD age, 8.9 ± 1.9 years) participated in the study. Dynamic footprint data were captured with a pedobarographic platform. For static arch measurements, a specially constructed caliper was used to assess heel-to-toe length and dorsum height. A mixed model was established to determine reliability and variation. Reliability was found to be excellent for the static arch height index in sitting (intraday, 0.90; interrater, 0.80) and standing positions (0.88 and 0.85) and for the dynamic arch index (both 1.00). There was poor correlation between static and dynamic assessment of the medial longitudinal arch (standing dynamic arch index, r = -0.138; sitting dynamic arch index, r = -0.070). Static measurements were found to be significantly influenced by the time of day (P < .001), whereas the dynamic arch index was unchanged (P = .845). This study revealed some further important findings. The static arch height index is influenced by gender (P = .004), whereas dynamic arch index is influenced by side (P = .011) and body mass index (P < .001). Dynamic and static foot measurements are reliable for medial longitudinal foot arch assessment in children. The variation of static arch measurements during the day has to be kept in mind. For clinical purposes, static and dynamic arch data should be interpreted separately.
Reliability and validity of a treatment adherence measure for child psychiatric rehabilitation.

PubMed

Williams, Nathaniel J; Green, Philip

2012-09-01

Treatment adherence, defined as the degree to which practitioners implemented prescribed program principles and activities and avoided proscribed activities, has been an area of growing interest in mental health services for children with severe emotional and behavioral disorders. This study evaluated the reliability and validity of a treatment adherence measure for child psychiatric rehabilitation (CPSR). Parents of children receiving CPSR (n = 79) or psychotherapy (n = 27) completed the Children's Psychosocial Rehabilitation Treatment Adherence Measure (CTAM) and a measure of 2-week session impact. Psychiatric rehabilitation (PSR) supervisors identified PSR practitioners with reputations for high or low adherence to the model. The CTAM's discriminant validity was assessed by using known-groups procedures and predictive validity by examining its relationship to 2-week session impact. The CTAM demonstrated excellent internal consistency (α = .92), discriminant validity (p = .002, d = .72; p = .021, d = .59), and predictive validity (B = 2.24, SE = .31, p < .001), accounting for 28% of the child-level variance in 2-week session impact. Findings suggest the CTAM is a reliable and valid measure of treatment adherence for CPSR programs with a skill-teaching focus. Providers and agencies should take steps to enhance treatment adherence because it may be an important predictor of children's short-term response to CPSR. (PsycINFO Database Record (c) 2012 APA, all rights reserved).
Validity and reliability of Patient-Reported Outcomes Measurement Information System (PROMIS) Instruments in Osteoarthritis

PubMed Central

Broderick, Joan E.; Schneider, Stefan; Junghaenel, Doerte U.; Schwartz, Joseph E.; Stone, Arthur A.

2013-01-01

Objective Evaluation of known group validity, ecological validity, and test-retest reliability of four domain instruments from the Patient Reported Outcomes Measurement System (PROMIS) in osteoarthritis (OA) patients. Methods Recruitment of an osteoarthritis sample and a comparison general population (GP) through an Internet survey panel. Pain intensity, pain interference, physical functioning, and fatigue were assessed for 4 consecutive weeks with PROMIS short forms on a daily basis and compared with same-domain Computer Adaptive Test (CAT) instruments that use a 7-day recall. Known group validity (comparison of OA and GP), ecological validity (comparison of aggregated daily measures with CATs), and test-retest reliability were evaluated. Results The recruited samples matched (age, sex, race, ethnicity) the demographic characteristics of the U.S. sample for arthritis and the 2009 Census for the GP. Compliance with repeated measurements was excellent: > 95%. Known group validity for CATs was demonstrated with large effect sizes (pain intensity: 1.42, pain interference: 1.25, and fatigue: .85). Ecological validity was also established through high correlations between aggregated daily measures and weekly CATs (≥ .86). Test-retest validity (7-day) was very good (≥ .80). Conclusion PROMIS CAT instruments demonstrated known group and ecological validity in a comparison of osteoarthritis patients with a general population sample. Adequate test-retest reliability was also observed. These data provide encouraging initial data on the utility of these PROMIS instruments for clinical and research outcomes in osteoarthritis patients. PMID:23592494
Reliability and Validity of the Dyadic Observed Communication Scale (DOCS).

PubMed

Hadley, Wendy; Stewart, Angela; Hunter, Heather L; Affleck, Katelyn; Donenberg, Geri; Diclemente, Ralph; Brown, Larry K

2013-02-01

We evaluated the reliability and validity of the Dyadic Observed Communication Scale (DOCS) coding scheme, which was developed to capture a range of communication components between parents and adolescents. Adolescents and their caregivers were recruited from mental health facilities for participation in a large, multi-site family-based HIV prevention intervention study. Seventy-one dyads were randomly selected from the larger study sample and coded using the DOCS at baseline. Preliminary validity and reliability of the DOCS was examined using various methods, such as comparing results to self-report measures and examining interrater reliability. Results suggest that the DOCS is a reliable and valid measure of observed communication among parent-adolescent dyads that captures both verbal and nonverbal communication behaviors that are typical intervention targets. The DOCS is a viable coding scheme for use by researchers and clinicians examining parent-adolescent communication. Coders can be trained to reliably capture individual and dyadic components of communication for parents and adolescents and this complex information can be obtained relatively quickly.
Test-retest reliability of sensor-based sit-to-stand measures in young and older adults.

PubMed

Regterschot, G Ruben H; Zhang, Wei; Baldus, Heribert; Stevens, Martin; Zijlstra, Wiebren

2014-01-01

This study investigated test-retest reliability of sensor-based sit-to-stand (STS) peak power and other STS measures in young and older adults. In addition, test-retest reliability of the sensor method was compared to test-retest reliability of the Timed Up and Go Test (TUGT) and Five-Times-Sit-to-Stand Test (FTSST) in older adults. Ten healthy young female adults (20-23 years) and 31 older adults (21 females; 73-94 years) participated in two assessment sessions separated by 3-8 days. Vertical peak power was assessed during three (young adults) and five (older adults) normal and fast STS trials with a hybrid motion sensor worn on the hip. Older adults also performed the FTSST and TUGT. The average sensor-based STS peak power of the normal STS trials and the average sensor-based STS peak power of the fast STS trials showed excellent test-retest reliability in young adults (intra-class correlation (ICC)≥0.90; zero in 95% confidence interval of mean difference between test and retest (95%CI of D); standard error of measurement (SEM)≤6.7% of mean peak power) and older adults (ICC≥0.91; zero in 95%CI of D; SEM≤9.9%). Test-retest reliability of sensor-based STS peak power and TUGT (ICC=0.98; zero in 95%CI of D; SEM=8.5%) was comparable in older adults, test-retest reliability of the FTSST was lower (ICC=0.73; zero outside 95%CI of D; SEM=14.4%). Sensor-based STS peak power demonstrated excellent test-retest reliability and may therefore be useful for clinical assessment of functional status and fall risk. Copyright © 2014 Elsevier B.V. All rights reserved.
Measuring teacher self-report on classroom practices: Construct validity and reliability of the Classroom Strategies Scale-Teacher Form.

PubMed

Reddy, Linda A; Dudek, Christopher M; Fabiano, Gregory A; Peters, Stephanie

2015-12-01

This article presents information about the construct validity and reliability of a new teacher self-report measure of classroom instructional and behavioral practices (the Classroom Strategies Scales-Teacher Form; CSS-T). The theoretical underpinnings and empirical basis for the instructional and behavioral management scales are presented. Information is provided about the construct validity, internal consistency, test-retest reliability, and freedom from item-bias of the scales. Given previous investigations with the CSS Observer Form, it was hypothesized that internal consistency would be adequate and that confirmatory factor analyses (CFA) of CSS-T data from 293 classrooms would offer empirical support for the CSS-T's Total, Composite and subscales, and yield a similar factor structure to that of the CSS Observer Form. Goodness-of-fit indices of χ2/df, Root Mean Square Error of Approximation, Goodness of Fit Index, and Adjusted Goodness of Fit Index suggested satisfactory fit of proposed CFA models whereas the Comparative Fit Index did not. Internal consistency estimates of .93 and .94 were obtained for the Instructional Strategies and Behavioral Strategies Total scales respectively. Adequate test-retest reliability was found for instructional and behavioral total scales (r = .79, r = .84, percent agreement 93% and 93%). The CSS-T evidences freedom from item bias on important teacher demographics (age, educational degree, and years of teaching experience). Implications of results are discussed. (c) 2015 APA, all rights reserved).
Establishing Reliability and Construct Validity for an Instrument to Measure Environmental Connectedness

ERIC Educational Resources Information Center

Beery, Thomas H.

2013-01-01

The purpose of this preliminary study is to establish a reliable and valid measure of environmental connectedness (EC) to allow for further exploration of the Swedish Outdoor Recreation in Change national survey data. The Nordic concept of friluftsliv (nature-based outdoor recreation) and the environmental psychology concept of EC are explored to…
Generalizability Theory Reliability of Written Expression Curriculum-Based Measurement in Universal Screening

ERIC Educational Resources Information Center

Keller-Margulis, Milena A.; Mercer, Sterett H.; Thomas, Erin L.

2016-01-01

The purpose of this study was to examine the reliability of written expression curriculum-based measurement (WE-CBM) in the context of universal screening from a generalizability theory framework. Students in second through fifth grade (n = 145) participated in the study. The sample included 54% female students, 49% White students, 23% African…
Validity and Reliability of Assessing Body Composition Using a Mobile Application.

PubMed

Macdonald, Elizabeth Z; Vehrs, Pat R; Fellingham, Gilbert W; Eggett, Dennis; George, James D; Hager, Ronald

2017-12-01

The purpose of this study was to determine the validity and reliability of the LeanScreen (LS) mobile application that estimates percent body fat (%BF) using estimates of circumferences from photographs. The %BF of 148 weight-stable adults was estimated once using dual-energy x-ray absorptiometry (DXA). Each of two administrators assessed the %BF of each subject twice using the LS app and manually measured circumferences. A mixed-model ANOVA and Bland-Altman analyses were used to compare the estimates of %BF obtained from each method. Interrater and intrarater reliabilities values were determined using multiple measurements taken by each of the two administrators. The LS app and manually measured circumferences significantly underestimated (P < 0.05) the %BF determined using DXA by an average of -3.26 and -4.82 %BF, respectively. The LS app (6.99 %BF) and manually measured circumferences (6.76 %BF) had large limits of agreement. All interrater and intrarater reliability coefficients of estimates of %BF using the LS app and manually measured circumferences exceeded 0.99. The estimates of %BF from manually measured circumferences and the LS app were highly reliable. However, these field measures are not currently recommended for the assessment of body composition because of significant bias and large limits of agreements.
Flow dichroism as a reliable method to measure the hydrodynamic aspect ratio of gold nanoparticles.

PubMed

Reddy, Naveen Krishna; Pérez-Juste, Jorge; Pastoriza-Santos, Isabel; Lang, Peter R; Dhont, Jan K G; Liz-Marzán, Luis M; Vermant, Jan

2011-06-28

Particle shape plays an important role in controlling the optical, magnetic, and mechanical properties of nanoparticle suspensions as well as nanocomposites. However, characterizing the size, shape, and the associated polydispersity of nanoparticles is not straightforward. Electron microscopy provides an accurate measurement of the geometric properties, but sample preparation can be laborious, and to obtain statistically relevant data many particles need to be analyzed separately. Moreover, when the particles are suspended in a fluid, it is important to measure their hydrodynamic properties, as they determine aspects such as diffusion and the rheological behavior of suspensions. Methods that evaluate the dynamics of nanoparticles such as light scattering and rheo-optical methods accurately provide these hydrodynamic properties, but do necessitate a sufficient optical response. In the present work, three different methods for characterizing nonspherical gold nanoparticles are critically compared, especially taking into account the complex optical response of these particles. The different methods are evaluated in terms of their versatility to asses size, shape, and polydispersity. Among these, the rheo-optical technique is shown to be the most reliable method to obtain hydrodynamic aspect ratio and polydispersity for nonspherical gold nanoparticles for two reasons. First, the use of the evolution of the orientation angle makes effects of polydispersity less important. Second, the use of an external flow field gives a mathematically more robust relation between particle motion and aspect ratio, especially for particles with relatively small aspect ratios.
Neurology objective structured clinical examination reliability using generalizability theory

PubMed Central

Park, Yoon Soo; Lukas, Rimas V.; Brorson, James R.

2015-01-01

Objectives: This study examines factors affecting reliability, or consistency of assessment scores, from an objective structured clinical examination (OSCE) in neurology through generalizability theory (G theory). Methods: Data include assessments from a multistation OSCE taken by 194 medical students at the completion of a neurology clerkship. Facets evaluated in this study include cases, domains, and items. Domains refer to areas of skill (or constructs) that the OSCE measures. G theory is used to estimate variance components associated with each facet, derive reliability, and project the number of cases required to obtain a reliable (consistent, precise) score. Results: Reliability using G theory is moderate (Φ coefficient = 0.61, G coefficient = 0.64). Performance is similar across cases but differs by the particular domain, such that the majority of variance is attributed to the domain. Projections in reliability estimates reveal that students need to participate in 3 OSCE cases in order to increase reliability beyond the 0.70 threshold. Conclusions: This novel use of G theory in evaluating an OSCE in neurology provides meaningful measurement characteristics of the assessment. Differing from prior work in other medical specialties, the cases students were randomly assigned did not influence their OSCE score; rather, scores varied in expected fashion by domain assessed. PMID:26432851
Neurology objective structured clinical examination reliability using generalizability theory.

PubMed

Blood, Angela D; Park, Yoon Soo; Lukas, Rimas V; Brorson, James R

2015-11-03

This study examines factors affecting reliability, or consistency of assessment scores, from an objective structured clinical examination (OSCE) in neurology through generalizability theory (G theory). Data include assessments from a multistation OSCE taken by 194 medical students at the completion of a neurology clerkship. Facets evaluated in this study include cases, domains, and items. Domains refer to areas of skill (or constructs) that the OSCE measures. G theory is used to estimate variance components associated with each facet, derive reliability, and project the number of cases required to obtain a reliable (consistent, precise) score. Reliability using G theory is moderate (Φ coefficient = 0.61, G coefficient = 0.64). Performance is similar across cases but differs by the particular domain, such that the majority of variance is attributed to the domain. Projections in reliability estimates reveal that students need to participate in 3 OSCE cases in order to increase reliability beyond the 0.70 threshold. This novel use of G theory in evaluating an OSCE in neurology provides meaningful measurement characteristics of the assessment. Differing from prior work in other medical specialties, the cases students were randomly assigned did not influence their OSCE score; rather, scores varied in expected fashion by domain assessed. © 2015 American Academy of Neurology.
An Evaluation of Available Models for Estimating the Reliability and Validity of Criterion Referenced Measures.

ERIC Educational Resources Information Center

Oakland, Thomas

New strategies for evaluation criterion referenced measures (CRM) are discussed. These strategies examine the following issues: (1) the use of normed referenced measures (NRM) as CRM and then estimating the reliability and validity of such measures in terms of variance from an arbitrarily specified criterion score, (2) estimation of the…
Assuring reliability program effectiveness.

NASA Technical Reports Server (NTRS)

Ball, L. W.

1973-01-01

An attempt is made to provide simple identification and description of techniques that have proved to be most useful either in developing a new product or in improving reliability of an established product. The first reliability task is obtaining and organizing parts failure rate data. Other tasks are parts screening, tabulation of general failure rates, preventive maintenance, prediction of new product reliability, and statistical demonstration of achieved reliability. Five principal tasks for improving reliability involve the physics of failure research, derating of internal stresses, control of external stresses, functional redundancy, and failure effects control. A final task is the training and motivation of reliability specialist engineers.

Reliability of a survey tool for measuring consumer nutrition environment in urban food stores.

PubMed

Hosler, Akiko S; Dharssi, Aliza

2011-01-01

Despite the increase in the volume and importance of food environment research, there is a general lack of reliable measurement tools. This study presents the development and reliability assessment of a tool for measuring consumer nutrition environment in urban food stores. Cross-sectional design. A racially diverse downtown portion (6 ZIP code areas) in Albany, New York. A sample of 39 food stores was visited by our research team in 2009 to 2010. These stores were randomly selected from 123 eligible food stores identified through multiple government lists and ground-truthing. The Food Retail Outlet Survey Tool was developed to assess the presence of selected food and nonfood items, placement, milk prices, physical characteristics of the store, policy implementation, and advertisements on outside windows. For in-store items, agreement of observations between experienced and lightly trained surveyors was assessed. For window advertisement assessments, inter-method agreement (on-site sketch vs digital photo), and inter-rater agreement (both on-site) among lightly trained surveyors were evaluated. Percent agreement, Kappa, and prevalence-adjusted bias-adjusted kappa were calculated for in-store observations. Interclass correlation coefficients were calculated for window observations. Twenty-seven of the 47 in-store items had 100% agreement. The prevalence-adjusted bias-adjusted kappa indicated excellent agreement (≥0.90) on all items, except aisle width (0.74) and dark-green/orange colored fresh vegetables (0.85). The store type (nonconvenience store), the order of visits (first half), and the time to complete survey (>10 minutes) were associated with lower reliability in these 2 items. Both the inter-method and inter-rater agreements for window advertisements were uniformly high (intraclass correlation coefficient ranged 0.94-1.00), indicating high reliability. The Food Retail Outlet Survey Tool is a reliable tool for quickly measuring consumer nutrition
Reliability and criterion validity of measurements using a smart phone-based measurement tool for the transverse rotation angle of the pelvis during single-leg lifting.

PubMed

Jung, Sung-Hoon; Kwon, Oh-Yun; Jeon, In-Cheol; Hwang, Ui-Jae; Weon, Jong-Hyuck

2018-01-01

The purposes of this study were to determine the intra-rater test-retest reliability of a smart phone-based measurement tool (SBMT) and a three-dimensional (3D) motion analysis system for measuring the transverse rotation angle of the pelvis during single-leg lifting (SLL) and the criterion validity of the transverse rotation angle of the pelvis measurement using SBMT compared with a 3D motion analysis system (3DMAS). Seventeen healthy volunteers performed SLL with their dominant leg without bending the knee until they reached a target placed 20 cm above the table. This study used a 3DMAS, considered the gold standard, to measure the transverse rotation angle of the pelvis to assess the criterion validity of the SBMT measurement. Intra-rater test-retest reliability was determined using the SBMT and 3DMAS using intra-class correlation coefficient (ICC) [3,1] values. The criterion validity of the SBMT was assessed with ICC [3,1] values. Both the 3DMAS (ICC = 0.77) and SBMT (ICC = 0.83) showed excellent intra-rater test-retest reliability in the measurement of the transverse rotation angle of the pelvis during SLL in a supine position. Moreover, the SBMT showed an excellent correlation with the 3DMAS (ICC = 0.99). Measurement of the transverse rotation angle of the pelvis using the SBMT showed excellent reliability and criterion validity compared with the 3DMAS.
"Reliability generalization of the Multigroup Ethnic Identity Measure-Revised (MEIM-R)": Correction to Herrington et al. (2016).

PubMed

2016-10-01

Reports an error in "Reliability Generalization of the Multigroup Ethnic Identity Measure-Revised (MEIM-R)" by Hayley M. Herrington, Timothy B. Smith, Erika Feinauer and Derek Griner ( Journal of Counseling Psychology , Advanced Online Publication, Mar 17, 2016, np). The name of author Erika Feinauer was misspelled as Erika Feinhauer. All versions of this article have been corrected. (The following abstract of the original article appeared in record 2016-13160-001.) Individuals' strength of ethnic identity has been linked with multiple positive indicators, including academic achievement and overall psychological well-being. The measure researchers use most often to assess ethnic identity, the Multigroup Ethnic Identity Measure (MEIM), underwent substantial revision in 2007. To inform scholars investigating ethnic identity, we performed a reliability generalization analysis on data from the revised version (MEIM-R) and compared it with data from the original MEIM. Random-effects weighted models evaluated internal consistency coefficients (Cronbach's alpha). Reliability coefficients for the MEIM-R averaged α = .88 across 37 samples, a statistically significant increase over the average of α = .84 for the MEIM across 75 studies. Reliability coefficients for the MEIM-R did not differ across study and participant characteristics such as sample gender and ethnic composition. However, consistently lower reliability coefficients averaging α = .81 were found among participants with low levels of education, suggesting that greater attention to data reliability is warranted when evaluating the ethnic identity of individuals such as middle-school students. Future research will be needed to ascertain whether data with other measures of aspects of personal identity (e.g., racial identity, gender identity) also differ as a function of participant level of education and associated cognitive or maturation processes. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Impact of Alzheimer's Disease on Caregiver Questionnaire: internal consistency, convergent validity, and test-retest reliability of a new measure for assessing caregiver burden.

PubMed

Cole, Jason C; Ito, Diane; Chen, Yaozhu J; Cheng, Rebecca; Bolognese, Jennifer; Li-McLeod, Josephine

2014-09-04

There is a lack of validated instruments to measure the level of burden of Alzheimer's disease (AD) on caregivers. The Impact of Alzheimer's Disease on Caregiver Questionnaire (IADCQ) is a 12-item instrument with a seven-day recall period that measures AD caregiver's burden across emotional, physical, social, financial, sleep, and time aspects. Primary objectives of this study were to evaluate psychometric properties of IADCQ administered on the Web and to determine most appropriate scoring algorithm. A national sample of 200 unpaid AD caregivers participated in this study by completing the Web-based version of IADCQ and Short Form-12 Health Survey Version 2 (SF-12v2™). The SF-12v2 was used to measure convergent validity of IADCQ scores and to provide an understanding of the overall health-related quality of life of sampled AD caregivers. The IADCQ survey was also completed four weeks later by a randomly selected subgroup of 50 participants to assess test-retest reliability. Confirmatory factor analysis (CFA) was implemented to test the dimensionality of the IADCQ items. Classical item-level and scale-level psychometric analyses were conducted to estimate psychometric characteristics of the instrument. Test-retest reliability was performed to evaluate the instrument's stability and consistency over time. Virtually none (2%) of the respondents had either floor or ceiling effects, indicating the IADCQ covers an ideal range of burden. A single-factor model obtained appropriate goodness of fit and provided evidence that a simple sum score of the 12 items of IADCQ can be used to measure AD caregiver's burden. Scales-level reliability was supported with a coefficient alpha of 0.93 and an intra-class correlation coefficient (for test-retest reliability) of 0.68 (95% CI: 0.50-0.80). Low-moderate negative correlations were observed between the IADCQ and scales of the SF-12v2. The study findings suggest the IADCQ has appropriate psychometric characteristics as a
Reliability of capturing foot parameters using digital scanning and the neutral suspension casting technique

PubMed Central

2011-01-01

Background A clinical study was conducted to determine the intra and inter-rater reliability of digital scanning and the neutral suspension casting technique to measure six foot parameters. The neutral suspension casting technique is a commonly utilised method for obtaining a negative impression of the foot prior to orthotic fabrication. Digital scanning offers an alternative to the traditional plaster of Paris techniques. Methods Twenty one healthy participants volunteered to take part in the study. Six casts and six digital scans were obtained from each participant by two raters of differing clinical experience. The foot parameters chosen for investigation were cast length (mm), forefoot width (mm), rearfoot width (mm), medial arch height (mm), lateral arch height (mm) and forefoot to rearfoot alignment (degrees). Intraclass correlation coefficients (ICC) with 95% confidence intervals (CI) were calculated to determine the intra and inter-rater reliability. Measurement error was assessed through the calculation of the standard error of the measurement (SEM) and smallest real difference (SRD). Results ICC values for all foot parameters using digital scanning ranged between 0.81-0.99 for both intra and inter-rater reliability. For neutral suspension casting technique inter-rater reliability values ranged from 0.57-0.99 and intra-rater reliability values ranging from 0.36-0.99 for rater 1 and 0.49-0.99 for rater 2. Conclusions The findings of this study indicate that digital scanning is a reliable technique, irrespective of clinical experience, with reduced measurement variability in all foot parameters investigated when compared to neutral suspension casting. PMID:21375757
Increasing Reliability of Direct Observation Measurement Approaches in Emotional and/or Behavioral Disorders Research Using Generalizability Theory

ERIC Educational Resources Information Center

Gage, Nicholas A.; Prykanowski, Debra; Hirn, Regina

2014-01-01

Reliability of direct observation outcomes ensures the results are consistent, dependable, and trustworthy. Typically, reliability of direct observation measurement approaches is assessed using interobserver agreement (IOA) and the calculation of observer agreement (e.g., percentage of agreement). However, IOA does not address intraobserver…
Sensitivity, reliability and the effects of diurnal variation on a test battery of field usable upper limb fatigue measures.

PubMed

Yung, Marcus; Wells, Richard P

2017-07-01

Fatigue has been linked to deficits in production quality and productivity and, if of long duration, work-related musculoskeletal disorders. It may thus be a useful risk indicator and design and evaluation tool. However, there is limited information on the test-retest reliability, the sensitivity and the effects of diurnal fluctuation on field usable fatigue measures. This study reports on an evaluation of 11 measurement tools and their 14 parameters. Eight measures were found to have test-retest ICC values greater than 0.8. Four measures were particularly responsive during an intermittent fatiguing condition. However, two responsive measures demonstrated rhythmic behaviour, with significant time effects from 08:00 to mid-afternoon and early evening. Action tremor, muscle mechanomyography and perceived fatigue were found to be most reliable and most responsive; but additional analytical considerations might be required when interpreting daylong responses of MMG and action tremor. Practitioner Summary: This paper presents findings from test-retest and daylong reliability and responsiveness evaluations of 11 fatigue measures. This paper suggests that action tremor, muscle mechanomyography and perceived fatigue were most reliable and most responsive. However, mechanomyography and action tremor may be susceptible to diurnal changes.
Reliability of Measurement of Glenohumeral Internal Rotation, External Rotation, and Total Arc of Motion in 3 Test Positions

PubMed Central

Kevern, Mark A.; Beecher, Michael; Rao, Smita

2014-01-01

Context: Athletes who participate in throwing and racket sports consistently demonstrate adaptive changes in glenohumeral-joint internal and external rotation in the dominant arm. Measurements of these motions have demonstrated excellent intrarater and poor interrater reliability. Objective: To determine intrarater reliability, interrater reliability, and standard error of measurement for shoulder internal rotation, external rotation, and total arc of motion using an inclinometer in 3 testing procedures in National Collegiate Athletic Association Division I baseball and softball athletes. Design: Cross-sectional study. Setting: Athletic department. Patients or Other Participants Thirty-eight players participated in the study. Shoulder internal rotation, external rotation, and total arc of motion were measured by 2 investigators in 3 test positions. The standard supine position was compared with a side-lying test position, as well as a supine test position without examiner overpressure. Results: Excellent intrarater reliability was noted for all 3 test positions and ranges of motion, with intraclass correlation coefficient values ranging from 0.93 to 0.99. Results for interrater reliability were less favorable. Reliability for internal rotation was highest in the side-lying position (0.68) and reliability for external rotation and total arc was highest in the supine-without-overpressure position (0.774 and 0.713, respectively). The supine-with-overpressure position yielded the lowest interrater reliability results in all positions. The side-lying position had the most consistent results, with very little variation among intraclass correlation coefficient values for the various test positions. Conclusions: The results of our study clearly indicate that the side-lying test procedure is of equal or greater value than the traditional supine-with-overpressure method. PMID:25188316
[Reliability and validity of Meaningful Life Measure-Chinese Revised in Chinese college students].

PubMed

Xiao, Rong; Lai, Qiao-Zhen; Yang, Jia-Ping

2016-04-20

To test the reliability and validity of Meaningful Life Measure-Chinese Revised (MLM-CR) in Chinese college students. A total of 1035 college students were evaluated with MLM-CR, Satisfaction with Life Scale (SWLS), Purpose in Life (PIL) and Patient Health Questionnaire-2 (PHQ-2), and 120 of the students were examined with PIL-SF twice. All the items in MLM-CR had good discrimination indexes (r=0.753-0.838, P<0.001). Confirmatory factor analysis confirmed the hypothesized five-factor model of MLM-CR (Χ 2 /df=3.4, GFI=0.946, AGFI=0.924, RMR=0.069, NFI=0.953, CFI=0.966, RMSEA=0.048). The total internal consistency reliability of MLM-CR was 0.942, and the alpha coefficients of the 5 dimensions ranged from 0.782 to 0.877; the total split-half reliability was 0.920, and the split-half reliability of the 5 dimensions ranged from 0.752 to 0.830; the total test-retest reliability was 0.871, and the test-retest reliability of the 5 dimensions ranged from 0.783 to 0.805. The criterion validity of MLM-CR in correlation with SWLS, PIL and PHQ-2 was 0.66, 0.755 and -0.388, respectively (P<0.01). The Average score of MLM-CR of the college students was 5.20∓0.90, and the scores were significantly higher in female students than in the male students (P<0.001). MLM-CR has good psychometric properties for application in comprehensive evaluation of personal meaning in life.
Reliability and Validity of Goal Orientation in Exercise Measure (GOEM)--Turkish Version

ERIC Educational Resources Information Center

Ersöz, Gözde; Müftüler, Mine; Lapa, Tennur Yerlisu; Tümer, Adile

2017-01-01

The aim of this study was to examine validity and reliability of the Turkish version of the Goal Orientation in Exercise Measure (GOEM). There were 408 participants who were regularly exercising and their age ranged from 17 to 61 years old. The psychometric characteristics of the scale were investigated using exploratory factor analysis (EFA),…
Reliability of photogrammetry in the evaluation of the postural aspects of individuals with structural scoliosis.

PubMed

Saad, Karen Ruggeri; Colombo, Alexandra Siqueira; Ribeiro, Ana Paula; João, Sílvia Maria Amado

2012-04-01

The purpose of this study was to investigate the reliability of photogrammetry in the measurement of the postural deviations in individuals with idiopathic scoliosis. Twenty participants with scoliosis (17 women and three men), with a mean age of 23.1 ± 9 yrs, were photographed from the posterior and lateral views. The postural aspects were measured with CorelDRAW software. High inter-rater and test-retest reliability indices were found. It was observed that with more severity of scoliosis, greater were the variations between the thoracic kyphosis and lumbar lordosis measures obtained by the same examiner from the left lateral view photographs. A greater body mass index (BMI) was associated with greater variability of the trunk rotation measures obtained by two independent examiners from the right, lateral view (r = 0.656; p = 0.002). The severity of scoliosis was also associated with greater inter-rater variability measures of trunk rotation obtained from the left, lateral view (r = 0.483; p = 0.036). Photogrammetry demonstrated to be a reliable method for the measurement of postural deviations from the posterior and lateral views of individuals with idiopathic scoliosis and could be complementarily employed for the assessment procedures, which could reduce the number of X-rays used for the follow-up assessments of these individuals. Copyright © 2011 Elsevier Ltd. All rights reserved.
The Chinese-version of the CARE Measure reliably differentiates between doctors in primary care: a cross-sectional study in Hong Kong

PubMed Central

2011-01-01

Background The Consultation and Relational Empathy (CARE) Measure is a widely used patient-rated experience measure which has recently been translated into Chinese and has undergone preliminary qualitative and quantitative validation. The objective of this study was to determine the reliability of the Chinese-version of the CARE Measure in reliably differentiating between doctors in a primary care setting in Hong Kong Methods Data were collected from 984 primary care patients attending 20 doctors with differing levels of training in family medicine in 5 public clinics in Hong Kong. The acceptability of the Chinese-CARE measure to patients was assessed. The reliability of the measure in discriminating effectively between doctors was analysed by Generalisability-theory (G-Theory) Results The items in the Chinese-CARE measure were regarded as important by patients and there were few 'not applicable' responses. The measure showed high internal reliability (coefficient 0.95) and effectively differentiated between doctors with only 15-20 patient ratings per doctor (inter-rater reliability > 0.8). Doctors' mean CARE measure scores varied widely, ranging from 24.1 to 45.9 (maximum possible score 50) with a mean of 34.6. CARE Measure scores were positively correlated with level of training in family medicine (Spearman's rho 0.493, p < 0.05). Conclusion These data demonstrate the acceptability, feasibility and reliability of using the Chinese-CARE Measure in primary care in Hong Kong to differentiate between doctors interpersonal competencies. Training in family medicine appears to enhance these key interpersonal skills. PMID:21631927
The Effects of Participation Rate on the Internal Reliability of Peer Nomination Measures

ERIC Educational Resources Information Center

Marks, Peter E. L.; Babcock, Ben; Cillessen, Antonius H. N.; Crick, Nicki R.

2013-01-01

Although low participation rates have historically been considered problematic in peer nomination research, some researchers have recently argued that small proportions of participants can, in fact, provide adequate sociometric data. The current study used a classical measurement perspective to investigate the internal reliability (Cronbach's…
Software Reliability 2002

NASA Technical Reports Server (NTRS)

Wallace, Dolores R.

2003-01-01

In FY01 we learned that hardware reliability models need substantial changes to account for differences in software, thus making software reliability measurements more effective, accurate, and easier to apply. These reliability models are generally based on familiar distributions or parametric methods. An obvious question is 'What new statistical and probability models can be developed using non-parametric and distribution-free methods instead of the traditional parametric method?" Two approaches to software reliability engineering appear somewhat promising. The first study, begin in FY01, is based in hardware reliability, a very well established science that has many aspects that can be applied to software. This research effort has investigated mathematical aspects of hardware reliability and has identified those applicable to software. Currently the research effort is applying and testing these approaches to software reliability measurement, These parametric models require much project data that may be difficult to apply and interpret. Projects at GSFC are often complex in both technology and schedules. Assessing and estimating reliability of the final system is extremely difficult when various subsystems are tested and completed long before others. Parametric and distribution free techniques may offer a new and accurate way of modeling failure time and other project data to provide earlier and more accurate estimates of system reliability.
Reliability and validity of gait analysis by android-based smartphone.

PubMed

Nishiguchi, Shu; Yamada, Minoru; Nagai, Koutatsu; Mori, Shuhei; Kajiwara, Yuu; Sonoda, Takuya; Yoshimura, Kazuya; Yoshitomi, Hiroyuki; Ito, Hiromu; Okamoto, Kazuya; Ito, Tatsuaki; Muto, Shinyo; Ishihara, Tatsuya; Aoyama, Tomoki

2012-05-01

Smartphones are very common devices in daily life that have a built-in tri-axial accelerometer. Similar to previously developed accelerometers, smartphones can be used to assess gait patterns. However, few gait analyses have been performed using smartphones, and their reliability and validity have not been evaluated yet. The purpose of this study was to evaluate the reliability and validity of a smartphone accelerometer. Thirty healthy young adults participated in this study. They walked 20 m at their preferred speeds, and their trunk accelerations were measured using a smartphone and a tri-axial accelerometer that was secured over the L3 spinous process. We developed a gait analysis application and installed it in the smartphone to measure the acceleration. After signal processing, we calculated the gait parameters of each measurement terminal: peak frequency (PF), root mean square (RMS), autocorrelation peak (AC), and coefficient of variance (CV) of the acceleration peak intervals. Remarkable consistency was observed in the test-retest reliability of all the gait parameter results obtained by the smartphone (p<0.001). All the gait parameter results obtained by the smartphone showed statistically significant and considerable correlations with the same parameter results obtained by the tri-axial accelerometer (PF r=0.99, RMS r=0.89, AC r=0.85, CV r=0.82; p<0.01). Our study indicates that the smartphone with gait analysis application used in this study has the capacity to quantify gait parameters with a degree of accuracy that is comparable to that of the tri-axial accelerometer.
Reliability and Validity of a New Physical Activity Self-Report Measure for Younger Children

ERIC Educational Resources Information Center

Belton, Sarahjane; Mac Donncha, Ciaran

2010-01-01

The purpose of this study was to assess the test-retest reliability and validity of a new Youth Physical Activity Self-Report measure. Heart rate and direct observation were employed as criterion measures with a sample of 79 children (aged 7-9 years). Spearman's rho correlation between self reported activity intensity and heart rate was 0.87 for…
Reliability of metatarsophalangeal and ankle joint torque measurements by an innovative device.

PubMed

Man, Hok-Sum; Leung, Aaron Kam-Lun; Cheung, Jason Tak-Man; Sterzing, Thorsten

2016-07-01

The toe flexor muscles maintain body balance during standing and provide push-off force during walking, running, and jumping. Additionally, they are important contributing structures to maintain normal foot function. Thus, weakness of these muscles may cause poor balance, inefficient locomotion and foot deformities. The quantification of metatarsophalangeal joint (MPJ) stiffness is valuable as it is considered as a confounding factor in toe flexor muscles function. MPJ and ankle joint stiffness measurement is still largely depended on manual skills as current devices do not have good control on alignment, angular joint speed and displacement during measurement. Therefore, this study introduces an innovative dynamometer and protocol procedures for MPJ and ankle Joint torque measurement with precise and reliable foot alignment, angular joint speed and displacement control. Within-day and between-day test-retest experiments on MPJ and ankle joint torque measurement were conducted on ten and nine healthy male subjects respectively. The mean peak torques of MPJ and ankle joint of between-day and within-day measurement were 1.50±0.38Nm/deg and 1.19±0.34Nm/deg. The corresponding torques of the ankle joint were 8.24±2.20Nm/deg and 7.90±3.18Nm/deg respectively. Intraclass-correlation coefficients (ICC) of averaged peak torque of both joints of between-day and within-day test-retest experiments were ranging from 0.91 to 0.96, indicating the innovative device is systematic and reliable for the measurements and can be used for multiple scientific and clinical purposes. Copyright © 2016 Elsevier B.V. All rights reserved.
Validity, Reliability, and Feasibility of Durometer Measurements of Scleroderma Skin Disease in a Multicenter Treatment Trial

PubMed Central

MERKEL, PETER A.; SILLIMAN, NANCY P.; DENTON, CHRISTOPHER P.; FURST, DANIEL E.; KHANNA, DINESH; EMERY, PAUL; HSU, VIVIEN M.; STREISAND, JAMES B.; POLISSON, RICHARD P.; ÅKESSON, ANITA; COPPOCK, JOHN; van den HOOGEN, FRANK; HERRICK, ARIANE; MAYES, MAUREEN D.; VEALE, DOUGLAS; SEIBOLD, JAMES R.; BLACK, CAROL M.; KORN, JOSEPH H.

2013-01-01

Objective To determine the validity, reliability, and feasibility of durometer measurements of skin hardness as an outcome measure in clinical trials of scleroderma. Methods Skin hardness was measured during a multicenter treatment trial for scleroderma using handheld digital durometers with a continuous scale. Skin thickness was measured by modified Rodnan skin score (MRSS). Other outcome data collected included the Scleroderma Health Assessment Questionnaire. In a reliability exercise in advance of the trial, 9 investigators examined the same 5 scleroderma patients by MRSS and durometry. Results Forty-three patients with early diffuse cutaneous systemic sclerosis were studied at 11 international centers (mean age 49 years [range 24–76], median disease duration 6.4 months [range 0.3–23], and median baseline MRSS 22 [range 11–38]). The reliability of durometer measurements was excellent, with high interobserver intraclass correlation coefficients (ICCs) (0.82–0.92), and each result was greater than the corresponding skin site ICCs for MRSS (0.54–0.85). Baseline durometer scores correlated well with MRSS (r = 0.69, P < 0.0001), patient self-assessments of skin disease (r = 0.69, P < 0.0001), and Health Assessment Questionnaire (HAQ) disability scores (r = 0.34, P = 0.03). Change in durometer scores correlated with change in MRSS (r = 0.70, P < 0.0001), change in patient self-assessments of skin disease (r = 0.52, P = 0.003), and change in HAQ disability scores (r = 0.42, P = 0.017). The effect size was greater for durometry than for MRSS or patient self-assessment. Conclusion Durometer measurements of skin hardness in patients with scleroderma are reliable, simple, accurate, demonstrate good sensitivity to change compared with traditional skin scoring, and reflect patients' self-assessments of their disease. Durometer measurements are valid, objective, and scalable, and should be considered for use as a complementary outcome measure to skin scoring in
Estimating the reliability of repeatedly measured endpoints based on linear mixed-effects models. A tutorial.

PubMed

Van der Elst, Wim; Molenberghs, Geert; Hilgers, Ralf-Dieter; Verbeke, Geert; Heussen, Nicole

2016-11-01

There are various settings in which researchers are interested in the assessment of the correlation between repeated measurements that are taken within the same subject (i.e., reliability). For example, the same rating scale may be used to assess the symptom severity of the same patients by multiple physicians, or the same outcome may be measured repeatedly over time in the same patients. Reliability can be estimated in various ways, for example, using the classical Pearson correlation or the intra-class correlation in clustered data. However, contemporary data often have a complex structure that goes well beyond the restrictive assumptions that are needed with the more conventional methods to estimate reliability. In the current paper, we propose a general and flexible modeling approach that allows for the derivation of reliability estimates, standard errors, and confidence intervals - appropriately taking hierarchies and covariates in the data into account. Our methodology is developed for continuous outcomes together with covariates of an arbitrary type. The methodology is illustrated in a case study, and a Web Appendix is provided which details the computations using the R package CorrMixed and the SAS software. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
The Reliability of Pharyngeal High Resolution Manometry with Impedance for Derivation of Measures of Swallowing Function in Healthy Volunteers

PubMed Central

Omari, Taher I.; Savilampi, Johanna; Kokkinn, Karmen; Schar, Mistyka; Lamvik, Kristin; Doeltgen, Sebastian; Cock, Charles

2016-01-01

Purpose. We evaluated the intra- and interrater agreement and test-retest reliability of analyst derivation of swallow function variables based on repeated high resolution manometry with impedance measurements. Methods. Five subjects swallowed 10 × 10 mL saline on two occasions one week apart producing a database of 100 swallows. Swallows were repeat-analysed by six observers using software. Swallow variables were indicative of contractility, intrabolus pressure, and flow timing. Results. The average intraclass correlation coefficients (ICC) for intra- and interrater comparisons of all variable means showed substantial to excellent agreement (intrarater ICC 0.85–1.00; mean interrater ICC 0.77–1.00). Test-retest results were less reliable. ICC for test-retest comparisons ranged from slight to excellent depending on the class of variable. Contractility variables differed most in terms of test-retest reliability. Amongst contractility variables, UES basal pressure showed excellent test-retest agreement (mean ICC 0.94), measures of UES postrelaxation contractile pressure showed moderate to substantial test-retest agreement (mean Interrater ICC 0.47–0.67), and test-retest agreement of pharyngeal contractile pressure ranged from slight to substantial (mean Interrater ICC 0.15–0.61). Conclusions. Test-retest reliability of HRIM measures depends on the class of variable. Measures of bolus distension pressure and flow timing appear to be more test-retest reliable than measures of contractility. PMID:27190520

Reliability and validity of the Dutch version of the Consultation and Relational Empathy Measure in primary care.

PubMed

van Dijk, Inge; Scholten Meilink Lenferink, Nick; Lucassen, Peter L B J; Mercer, Stewart W; van Weel, Chris; Olde Hartman, Tim C; Speckens, Anne E M

2017-02-01

Empathy is an essential skill in doctor-patient communication with positive effects on compliance, patient satisfaction and symptom duration. There are no validated patient-rated empathy measures available in Dutch. To investigate the validity and reliability of a Dutch version of the Consultation and Relational Empathy (CARE) Measure, a widely used 10-item patient-rated questionnaire of physician empathy. After translation and back translation, the Dutch CARE Measure was distributed among patients from 19 general practitioners in 5 primary care centers. Tests of internal reliability and validity included Cronbach's alpha, item total correlations and factor analysis. Seven items of the QUality Of care Through the patient's Eyes (QUOTE) questionnaire assessing 'affective performance' of the physician were included in factor analysis and used to investigate convergent validity. Of the 800 distributed questionnaires, 655 (82%) were returned. Acceptability and face validity were supported by a low number of 'does not apply' responses (range 0.2%-11.9%). Internal reliability was high (Cronbach's alpha 0.974). Corrected item total correlations were at a minimum of 0.837. Factor analysis on the 10 items of the CARE Measure and 7 QUOTE items resulted in two factors (Eigenvalue > 1), the first containing the CARE Measure items and the second containing the QUOTE items. Convergent construct validity between the CARE Measure and QUOTE was confirmed with a modest positive correlation (r = 0.34, n = 654, P < 0.001). The findings support the preliminary validity and reliability of the Dutch CARE Measure. Future research is required to investigate divergent validity and discriminant ability between doctors. © The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Reliability as Argument

ERIC Educational Resources Information Center

Parkes, Jay

2007-01-01

Reliability consists of both important social and scientific values and methods for evidencing those values, though in practice methods are often conflated with the values. With the two distinctly understood, a reliability argument can be made that articulates the particular reliability values most relevant to the particular measurement situation…
Isometric and isokinetic muscle strength in the upper extremity can be reliably measured in persons with chronic stroke.

PubMed

Ekstrand, Elisabeth; Lexell, Jan; Brogårdh, Christina

2015-09-01

To evaluate the test-retest reliability of isometric and isokinetic muscle strength measurements in the upper extremity after stroke. A test-retest design. Forty-five persons with mild to moderate paresis in the upper extremity > 6 months post-stroke. Isometric arm strength (shoulder abduction, elbow flexion), isokinetic arm strength (elbow extension/flexion) and isometric grip strength were measured with electronic dynamometers. Reliability was evaluated with intra-class correlation coefficients (ICC), changes in the mean, standard error of measurements (SEM) and smallest real differences (SRD). Reliability was high (ICCs: 0.92-0.97). The absolute and relative (%) SEM ranged from 2.7 Nm (5.6%) to 3.0 Nm (9.4%) for isometric arm strength, 2.6 Nm (7.4%) to 2.9 Nm (12.6%) for isokinetic arm strength, and 22.3 N (7.6%) to 26.4 N (9.2%) for grip strength. The absolute and relative (%) SRD ranged from 7.5 Nm (15.5%) to 8.4 Nm (26.1%) for isometric arm strength, 7.1 Nm (20.6%) to 8.0 Nm (34.8%) for isokinetic arm strength, and 61.8 N (21.0%) to 73.3 N (25.6%) for grip strength. Muscle strength in the upper extremity can be reliably measured in persons with chronic stroke. Isometric measurements yield smaller measurement errors than isokinetic measurements and might be preferred, but the choice depends on the research question.
A comparison of computer-assisted and manual wound size measurement.

PubMed

Thawer, Habiba A; Houghton, Pamela E; Woodbury, M Gail; Keast, David; Campbell, Karen

2002-10-01

Accurate and precise wound measurements are a critical component of every wound assessment. To examine the reliability and validity of a new computerized technique for measuring human and animal wounds, chronic human wounds (N = 45) and surgical animal wounds (N = 38) were assessed using manual and computerized techniques. Using intraclass correlation coefficients, intrarater and interrater reliability of surface area measurements obtained using the computerized technique were compared to those obtained using acetate tracings and planimetry. A single measurement of surface area using either technique produced excellent intrarater and interrater reliability for both human and animal wounds, but the computerized technique was more precise than the manual technique for measuring the surface area of animal wounds. For both types of wounds and measurement techniques, intrarater and interrater reliability improved when the average of three repeated measurements was obtained. The precision of each technique with human wounds and the precision of the manual technique with animal wounds also improved when three repeated measurement results were averaged. Concurrent validity between the two techniques was excellent for human wounds but poor for the smaller animal wounds, regardless of whether single or the average of three repeated surface area measurements was used. The computerized technique permits reliable and valid assessment of the surface area of both human and animal wounds.
A neuropsychological instrument measuring age-related cerebral decline in older drivers: development, reliability, and validity of MedDrive

PubMed Central

Vaucher, Paul; Cardoso, Isabel; Veldstra, Janet L.; Herzig, Daniela; Herzog, Michael; Mangin, Patrice; Favrat, Bernard

2014-01-01

When facing age-related cerebral decline, older adults are unequally affected by cognitive impairment without us knowing why. To explore underlying mechanisms and find possible solutions to maintain life-space mobility, there is a need for a standardized behavioral test that relates to behaviors in natural environments. The aim of the project described in this paper was therefore to provide a free, reliable, transparent, computer-based instrument capable of detecting age-related changes on visual processing and cortical functions for the purposes of research into human behavior in computational transportation science. After obtaining content validity, exploring psychometric properties of the developed tasks, we derived (Study 1) the scoring method for measuring cerebral decline on 106 older drivers aged ≥70 years attending a driving refresher course organized by the Swiss Automobile Association to test the instrument's validity against on-road driving performance (106 older drivers). We then validated the derived method on a new sample of 182 drivers (Study 2). We then measured the instrument's reliability having 17 healthy, young volunteers repeat all tests included in the instrument five times (Study 3) and explored the instrument's psychophysical underlying functions on 47 older drivers (Study 4). Finally, we tested the instrument's responsiveness to alcohol and effects on performance on a driving simulator in a randomized, double-blinded, placebo, crossover, dose-response, validation trial including 20 healthy, young volunteers (Study 5). The developed instrument revealed good psychometric properties related to processing speed. It was reliable (ICC = 0.853) and showed reasonable association to driving performance (R2 = 0.053), and responded to blood alcohol concentrations of 0.5 g/L (p = 0.008). Our results suggest that MedDrive is capable of detecting age-related changes that affect processing speed. These changes nevertheless do not necessarily affect
General Aviation Aircraft Reliability Study

NASA Technical Reports Server (NTRS)

Pettit, Duane; Turnbull, Andrew; Roelant, Henk A. (Technical Monitor)

2001-01-01

This reliability study was performed in order to provide the aviation community with an estimate of Complex General Aviation (GA) Aircraft System reliability. To successfully improve the safety and reliability for the next generation of GA aircraft, a study of current GA aircraft attributes was prudent. This was accomplished by benchmarking the reliability of operational Complex GA Aircraft Systems. Specifically, Complex GA Aircraft System reliability was estimated using data obtained from the logbooks of a random sample of the Complex GA Aircraft population.
Concurrent validity and reliability of wireless instrumented insoles measuring postural balance and temporal gait parameters.

PubMed

Oerbekke, Michiel S; Stukstette, Mirelle J; Schütte, Kurt; de Bie, Rob A; Pisters, Martijn F; Vanwanseele, Benedicte

2017-01-01

The OpenGo seems promising to take gait analysis out of laboratory settings due to its capability of long-term measurements and mobility. However, the OpenGo's concurrent validity and reliability need to be assessed to determine if the instrument is suitable for validation in patient samples. Twenty healthy volunteers participated. Center of pressure data were collected under eyes open and closed conditions with participants performing unilateral stance trials on the gold standard (AMTI OR6-7 force plate) while wearing the OpenGo. Temporal gait data (stance time, gait cycle time, and cadence) were collected at a self-selected comfortable walking speed with participants performing test-retest trials on an instrumented treadmill while wearing the OpenGo. Validity was assessed using Bland-Altman plots. Reliability was assessed with Intraclass Correlation Coefficient (2,1) and smallest detectable changes were calculated. Negative means of differences were found in all measured parameters, illustrating lower scores for the OpenGo on average. The OpenGo showed negative upper limits of agreement in center of pressure parameters on the mediolateral axis. Temporal reliability ICCs ranged from 0.90-0.93. Smallest detectable changes for both stance times were 0.04 (left) and 0.05 (right) seconds, for gait cycle time 0.08s, and for cadence 4.5 steps per minute. The OpenGo is valid and reliable for the measurement of temporal gait parameters during walking. Measurements of center of pressure parameters during unilateral stance are not considered valid. The OpenGo seems a promising instrument for clinically screening and monitoring temporal gait parameters in patients, however validation in patient populations is needed. Copyright Â© 2016 Elsevier B.V. All rights reserved.
Caregiver-proxy reliability of the Igbo-culture adapted Maleka Stroke Community Reintegration Measure: a validation study.

PubMed

Okoye, Emmanuel Chiebuka; Awhen, Peter Agba; Akosile, Christopher Olusanjo; Maruf, Fatai Adesina; Iheukwumere, Ngozi; Egwuonwu, Afamefuna Victor

2017-09-01

This study was designed to determine the caregiver-proxy reliability of the Igbo-culture adapted urban version of the Maleka Stroke Community Reintegration Measure (I-MSCRIM). This was a validation study involving 74 consenting stroke survivors and their 74 primary informal caregivers consecutively recruited from selected tertiary hospitals in South-East Nigeria (Igboland). The I-MSCRIM was researcher-administered to the participants. Obtained data was analyzed using frequency counts, percentages, range, mean, standard deviation, Spearman rank order correlation, Mann-Whitney U test, Kruskal-Wallis test and Intra-class Correlation Coefficient. Alpha level was set at 0.05. The mean ages of the stroke survivors (55.4% males) and their primary informal caregivers (41.9% males) were 50.14 ± 12.24 and 31.93 ± 10.9 years respectively. There was no significant difference in the community reintegration (CR) scores as rated by stroke survivors and their primary informal caregivers (p > 0.05). The correlations between stroke survivors' and primary informal caregivers' rated CR scores were all adequate and acceptable (ICC = 0.602-0.917). The discrepancy in the total CR scores between the two ratings was significantly influenced by primary informal caregivers' educational attainment (k = 13.15; p < 0.01). The I-MSCRIM has acceptable caregiver-proxy reliability among Igbo stroke survivors in South-East Nigeria. This suggests that primary informal caregivers of stroke survivors can reliably estimate the CR of their care recipients when I-MSCRIM is administered to them. This will be useful when a stroke survivor cannot respond to I-MSCRIM.
Estimating the Reliability of Single-Item Life Satisfaction Measures: Results from Four National Panel Studies

ERIC Educational Resources Information Center

Lucas, Richard E.; Donnellan, M. Brent

2012-01-01

Life satisfaction is often assessed using single-item measures. However, estimating the reliability of these measures can be difficult because internal consistency coefficients cannot be calculated. Existing approaches use longitudinal data to isolate occasion-specific variance from variance that is either completely stable or variance that…
Interrater and intrarater reliability in the measurement of ankle joint dorsiflexion is independent of examiner experience and technique used.

PubMed

Kim, Paul Jeong; Peace, Ruth; Mieras, Jamie; Thoms, Tanya; Freeman, Denise; Page, Jeffrey

2011-01-01

Goniometric measurement is currently being used as a diagnostic and outcomes assessment tool for ankle joint dorsiflexion. Despite its common use, its interrater and intrarater reliability has been questioned. This is a prospective study examining whether the experience of the examiner or the technique used affects the interrater and intrarater reliability for measuring ankle joint dorsiflexion. Fourteen asymptomatic individuals (8 male and 6 female) with a mean age of 28.2 years (range, 23-52) were enrolled into this study. The years of clinical experience of the five examiners averaged 10.4 years (range, 0-26). Four examiners used a modified Root, Weed and Orien method of measuring ankle joint dorsiflexion. The fifth examiner utilized a nonstandardized technique. A standard goniometer was used for bilateral measurements of ankle joint dorsiflexion with the knee extended and flexed. All five examiners repeated each measurement three times during each of the three sessions, with each session spaced at least 1 week apart. The interclass correlation coefficient reveals a moderate intrarater and poor interrater reliability in ankle joint dorsiflexion measurements using a standard goniometer. More importantly, further analysis indicates that the use of a standardized technique for measurement of ankle joint dorsiflexion or years of clinical experience does not increase the intrarater or interrater reliability. The utility of the goniometric measurement of ankle joint dorsiflexion may be limited.
Validity and Reliability of a New Device (WIMU®) for Measuring Hamstring Muscle Extensibility.

PubMed

Muyor, José M

2017-09-01

The aims of the current study were 1) to evaluate the validity of the WIMU ® system for measuring hamstring muscle extensibility in the passive straight leg raise (PSLR) test using an inclinometer for the criterion and 2) to determine the test-retest reliability of the WIMU ® system to measure hamstring muscle extensibility during the PSLR test. 55 subjects were evaluated on 2 separate occasions. Data from a Unilever inclinometer and WIMU ® system were collected simultaneously. Intraclass correlation coefficients (ICCs) for the validity were very high (0.983-1); a very low systematic bias (-0.21°--0.42°), random error (0.05°-0.04°) and standard error of the estimate (0.43°-0.34°) were observed (left-right leg, respectively) between the 2 devices (inclinometer and the WIMU ® system). The R 2 between the devices was 0.999 (p<0.001) in both the left and right legs. The test-retest reliability of the WIMU ® system was excellent, with ICCs ranging from 0.972-0.995, low coefficients of variation (0.01%), and a low standard error of the estimate (0.19-0.31°). The WIMU ® system showed strong concurrent validity and excellent test-retest reliability for the evaluation of hamstring muscle extensibility in the PSLR test. © Georg Thieme Verlag KG Stuttgart · New York.
Reliability and variability of day-to-day vault training measures in artistic gymnastics.

PubMed

Bradshaw, Elizabeth; Hume, Patria; Calton, Mark; Aisbett, Brad

2010-06-01

Inter-day training reliability and variability in artistic gymnastics vaulting was determined using a customised infra-red timing gate and contact mat timing system. Thirteen Australian high performance gymnasts (eight males and five females) aged 11-23 years were assessed during two consecutive days of normal training. Each gymnast completed a number of vault repetitions per daily session. Inter-day variability of vault run-up velocities (at -18 to -12 m, -12 to -6 m, -6 to -2 m, and -2 to 0 m from the nearest edge of the beat board), and board contact, pre-flight, and table contact times were determined using mixed modelling statistics to account for random (within-subject variability) and fixed effects (gender, number of subjects, number of trials). The difference in the mean (Mdiff) and Cohen's effect sizes for reliability assessment and intra-class correlation coefficients, and the coefficient of variation percentage (CV%) were calculated for variability assessment. Approach velocity (-18 to -2m, CV = 2.4-7.8%) and board contact time (CV = 3.5%) were less variable measures when accounting for day-to-day performance differences, than pre-flight time (CV = 17.7%) and table contact time (CV = 20.5%). While pre-flight and table contact times are relevant training measures, approach velocity and board contact time are more reliable when quantifying vaulting performance.
Test-Retest Reliability of Measures Commonly Used to Measure Striatal Dysfunction across Multiple Testing Sessions: A Longitudinal Study.

PubMed

Palmer, Clare E; Langbehn, Douglas; Tabrizi, Sarah J; Papoutsi, Marina

2017-01-01

Cognitive impairment is common amongst many neurodegenerative movement disorders such as Huntington's disease (HD) and Parkinson's disease (PD) across multiple domains. There are many tasks available to assess different aspects of this dysfunction, however, it is imperative that these show high test-retest reliability if they are to be used to track disease progression or response to treatment in patient populations. Moreover, in order to ensure effects of practice across testing sessions are not misconstrued as clinical improvement in clinical trials, tasks which are particularly vulnerable to practice effects need to be highlighted. In this study we evaluated test-retest reliability in mean performance across three testing sessions of four tasks that are commonly used to measure cognitive dysfunction associated with striatal impairment: a combined Simon Stop-Signal Task; a modified emotion recognition task; a circle tracing task; and the trail making task. Practice effects were seen between sessions 1 and 2 across all tasks for the majority of dependent variables, particularly reaction time variables; some, but not all, diminished in the third session. Good test-retest reliability across all sessions was seen for the emotion recognition, circle tracing, and trail making test. The Simon interference effect and stop-signal reaction time (SSRT) from the combined-Simon-Stop-Signal task showed moderate test-retest reliability, however, the combined SSRT interference effect showed poor test-retest reliability. Our results emphasize the need to use control groups when tracking clinical progression or use pre-baseline training on tasks susceptible to practice effects.
Reliability of a new method for measuring coronal trunk imbalance, the axis-line-angle technique.

PubMed

Zhang, Rui-Fang; Liu, Kun; Wang, Xue; Liu, Qian; He, Jia-Wei; Wang, Xiang-Yang; Yan, Zhi-Han

2015-12-01

Accurate determination of the extent of trunk imbalance in the coronal plane plays a key role in an evaluation of patients with trunk imbalance, such as patients with adolescent idiopathic scoliosis. An established, widely used practice in evaluating trunk imbalance is to drop a plumb line from the C7 vertebra to a key reference axis, the central sacral vertical line (CSVL) in full-spine standing anterioposterior radiographs, and measuring the distance between them, the C7-CSVL. However, measuring the CSVL is subject to intraobserver differences, is error-prone, and is of poor reliability. Therefore, the development of a different way to measure trunk imbalance is needed. This study aimed to describe a new method to measure coronal trunk imbalance, the axis-line-angle technique (ALAT), which measures the angle at the intersection between the C7 plumb line and an axis line drawn from the vertebral centroid of the C7 to the middle of the superior border of the symphysis pubis, and to compare the reliability of the ALAT with that of the C7-CSVL. A prospective study at a university hospital was used. The patient sample consisted of sixty-nine consecutively enrolled men and women patients, aged 10-18 years, who had trunk imbalance defined as C7-CSVL longer than 20 mm on computed full-spine standing anterioposterior radiographs. Data were analyzed to determine the correlation between C7-CSVL and ALAT measurements and to determine intraobserver and interobserver reliabilities. Using a picture archiving and communication system, three radiologists independently evaluated trunk imbalance on the 69 computed radiographs by measuring the C7-CSVL and by measuring the angle determined by the ALAT. Data were analyzed to determine the correlations between the two measures of trunk imbalance, and to determine intraobserver and interobserver reliabilities of each of them. Overall results from the measurements by the C7-CSVL and the ALAT were significantly moderately correlated
The reliability of in-training assessment when performance improvement is taken into account.

PubMed

van Lohuizen, Mirjam T; Kuks, Jan B M; van Hell, Elisabeth A; Raat, A N; Stewart, Roy E; Cohen-Schotanus, Janke

2010-12-01

During in-training assessment students are frequently assessed over a longer period of time and therefore it can be expected that their performance will improve. We studied whether there really is a measurable performance improvement when students are assessed over an extended period of time and how this improvement affects the reliability of the overall judgement. In-training assessment results were obtained from 104 students on rotation at our university hospital or at one of the six affiliated hospitals. Generalisability theory was used in combination with multilevel analysis to obtain reliability coefficients and to estimate the number of assessments needed for reliable overall judgement, both including and excluding performance improvement. Students' clinical performance ratings improved significantly from a mean of 7.6 at the start to a mean of 7.8 at the end of their clerkship. When taking performance improvement into account, reliability coefficients were higher. The number of assessments needed to achieve a reliability of 0.80 or higher decreased from 17 to 11. Therefore, when studying reliability of in-training assessment, performance improvement should be considered.
Is a sphygmomanometer a valid and reliable tool to measure the isometric strength of hip muscles? A systematic review.

PubMed

Toohey, Liam Anthony; De Noronha, Marcos; Taylor, Carolyn; Thomas, James

2015-02-01

Muscle strength measurement is a key component of physiotherapists' assessment and is frequently used as an outcome measure. A sphygmomanometer is an instrument commonly used to measure blood pressure that can be potentially used as a tool to assess isometric muscle strength. To systematically review the evidence on the reliability and validity of a sphygmomanometer for measuring isometric strength of hip muscles. A literature search was conducted across four databases. Studies were eligible if they presented data on reliability and/or validity, used a sphygmomanometer to measure isometric muscle strength of the hip region, and were peer reviewed. The individual studies were evaluated for quality using a standardized critical appraisal tool. A total of 644 articles were screened for eligibility, with five articles chosen for inclusion. The use of a sphygmomanometer to objectively assess isometric muscle strength of the hip muscles appears to be reliable with intraclass correlation coefficient values ranging from 0.66 to 0.94 in elderly and young populations. No studies were identified that have assessed the validity of a sphygmomanometer. The sphygmomanometer appears to be reliable for assessment of isometric muscle strength around the hip joint, but further research is warranted to establish its validity.
A Pilot Study Examining the Test-Retest and Internal Consistency Reliability of the ABLLS-R

ERIC Educational Resources Information Center

Partington, James W.; Bailey, Autumn; Partington, Scott W.

2018-01-01

The literature contains a variety of assessment tools for measuring the skills of individuals with autism or other developmental delays, but most lack adequate empirical evidence supporting their reliability and validity. The current pilot study sought to examine the reliability of scores obtained from the Assessment of Basic Language and Learning…
Reliability of tristimulus colourimetry in the assessment of cutaneous bruise colour.

PubMed

Scafide, Katherine N; Sheridan, Daniel J; Taylor, Laura A; Hayat, Matthew J

2016-06-01

Bruising is one of the most common types of injury clinicians observe among victims of violence and other trauma patients. However, research has shown commonly used qualitative description of cutaneous bruise colour via the naked eye is subjective and unreliable. No published work has formally evaluated the reliability of tristimulus colourimetry as an alternative for assessing bruise colour, despite its clinical and research applications in accurately assessing skin colour. The purpose of this study was to systematically evaluate the test-retest and inter-observer reliability of tristimulus colourimetry in the assessment of cutaneous bruise colour. Two researchers obtained repeated tristimulus colourimetry measures of cutaneous bruises with participants of diverse skin colour. Measures were obtained using the Minolta CR-400 Chomameter. Commission Internationale d'Eclairage (CIE) L*a*b* colour space was used. Data was analysed using intraclass correlation coefficients (ICC), Cronbach's alpha, and minimal detectable change (MDC) on all three L*a*b* values. The colorimeter demonstrated excellent test-retest or intra-rater reliability (L* ICC=0.999; a* ICC=0.973; b* ICC=0.892) and inter-rater reliability (L* ICC=0.997; a* ICC=0.976; b* ICC=0.982). With consistent placement, the tristimulus colourimetry is reliable for the objective assessment and documentation of cutaneous bruise colour for purposes of clinical practice and research. Recommendations for use in practice/research are provided. Copyright © 2016 Elsevier Ltd. All rights reserved.
The validity, reliability, and utility of the iButton® for measurement of body temperature circadian rhythms in sleep/wake research.

PubMed

Hasselberg, Michael J; McMahon, James; Parker, Kathy

2013-01-01

Changes in core body temperature due to heat transfer through the skin have a major influence on sleep regulation. Traditional measures of skin temperature are often complicated by extensive wiring and are not practical for use in normal living conditions. This review describes studies examining the reliability, validity and utility of the iButton®, a wireless peripheral thermometry device, in sleep/wake research. A review was conducted of English language literature on the iButton as a measure of circadian body temperature rhythms associated with the sleep/wake cycle. Seven studies of the iButtton as a measure of human body temperature were included. The iButton was found to be a reliable and valid measure of body temperature. Its application to human skin was shown to be comfortable and tolerable with no significant adverse reactions. Distal skin temperatures were negatively correlated with sleep/wake activity, and the temperature gradient between the distal and proximal skin (DPG) was identified as an accurate physiological correlate of sleep propensity. Methodological issues included site of data logger placement, temperature masking factors, and temperature data analysis. The iButton is an inexpensive, wireless data logger that can be used to obtain a valid measurement of human skin temperature. It is a practical alternative to traditional measures of circadian rhythms in sleep/wake research. Further research is needed to determine the utility of the iButton in vulnerable populations, including those with neurodegenerative disorders and memory impairment and pediatric populations. Copyright © 2011 Elsevier B.V. All rights reserved.
Reliability of new software in measuring cervical multifidus diameters and shoulder muscle strength in a synchronized way; an ultrasonographic study

PubMed Central

Rahnama, Leila; Rezasoltani, Asghar; Khalkhali-Zavieh, Minoo; Rahnama, Behnam; Noori-Kochi, Farhang

2015-01-01

OBJECTIVES: This study was conducted with the purpose of evaluating the inter-session reliability of new software to measure the diameters of the cervical multifidus muscle (CMM), both at rest and during isometric contractions of the shoulder abductors in subjects with neck pain and in healthy individuals. METHOD: In the present study, the reliability of measuring the diameters of the CMM with the Sonosynch software was evaluated by using 24 participants, including 12 subjects with chronic neck pain and 12 healthy individuals. The anterior-posterior diameter (APD) and the lateral diameter (LD) of the CMM were measured in a resting state and then repeated during isometric contraction of the shoulder abductors. Measurements were taken on separate occasions 3 to 7 days apart in order to determine inter-session reliability. Intraclass correlation coefficient (ICC), standard error of measurement (SEM), and smallest detectable difference (SDD) were used to evaluate the relative and absolute reliability, respectively. RESULTS: The Sonosynch software has shown to be highly reliable in measuring the diameters of the CMM both in healthy subjects and in those with neck pain. The ICCs 95% CI for APD ranged from 0.84 to 0.94 in subjects with neck pain and from 0.86 to 0.94 in healthy subjects. For LD, the ICC 95% CI ranged from 0.64 to 0.95 in subjects with neck pain and from 0.82 to 0.92 in healthy subjects. CONCLUSIONS: Ultrasonographic measurement of the diameters of the CMM using Sonosynch has proved to be reliable especially for APD in healthy subjects as well as subjects with neck pain. PMID:26443975

The Children's Play Therapy Instrument (CPTI). Description, development, and reliability studies.

PubMed

Kernberg, P F; Chazan, S E; Normandin, L

1998-01-01

The Children's Play Therapy Instrument (CPTI), its development, and reliability studies are described. The CPTI is a new instrument to examine a child's play activity in individual psychotherapy. Three independent raters used the CPTI to rate eight videotaped play therapy vignettes. Results were compared with the authors' consensual scores from a preliminary study. Generally good to excellent levels of interrater reliability were obtained for the independent raters on intraclass correlation coefficients for ordinal categories of the CPTI. Likewise, kappa levels were acceptable to excellent for nominal categories of the scale. The CPTI holds promise to become a reliable measure of play activity in child psychotherapy. Further research is needed to assess discriminant validity of the CPTI for use as a diagnostic tool and as a measure of process and outcome.
Measuring reliable change in cognition using the Edinburgh Cognitive and Behavioural ALS Screen (ECAS).

PubMed

Crockford, Christopher; Newton, Judith; Lonergan, Katie; Madden, Caoifa; Mays, Iain; O'Sullivan, Meabhdh; Costello, Emmet; Pinto-Grau, Marta; Vajda, Alice; Heverin, Mark; Pender, Niall; Al-Chalabi, Ammar; Hardiman, Orla; Abrahams, Sharon

2018-02-01

Cognitive impairment affects approximately 50% of people with amyotrophic lateral sclerosis (ALS). Research has indicated that impairment may worsen with disease progression. The Edinburgh Cognitive and Behavioural ALS Screen (ECAS) was designed to measure neuropsychological functioning in ALS, with its alternate forms (ECAS-A, B, and C) allowing for serial assessment over time. The aim of the present study was to establish reliable change scores for the alternate forms of the ECAS, and to explore practice effects and test-retest reliability of the ECAS's alternate forms. Eighty healthy participants were recruited, with 57 completing two and 51 completing three assessments. Participants were administered alternate versions of the ECAS serially (A-B-C) at four-month intervals. Intra-class correlation analysis was employed to explore test-retest reliability, while analysis of variance was used to examine the presence of practice effects. Reliable change indices (RCI) and regression-based methods were utilized to establish change scores for the ECAS alternate forms. Test-retest reliability was excellent for ALS Specific, ALS Non-Specific, and ECAS Total scores of the combined ECAS A, B, and C (all > .90). No significant practice effects were observed over the three testing sessions. RCI and regression-based methods produced similar change scores. The alternate forms of the ECAS possess excellent test-retest reliability in a healthy control sample, with no significant practice effects. The use of conservative RCI scores is recommended. Therefore, a change of ≥8, ≥4, and ≥9 for ALS Specific, ALS Non-Specific, and ECAS Total score is required for reliable change.
Reliability of the Star Excursion Balance Test and Two New Similar Protocols to Measure Trunk Postural Control.

PubMed

López-Plaza, Diego; Juan-Recio, Casto; Barbado, David; Ruiz-Pérez, Iñaki; Vera-Garcia, Francisco J

2018-05-18

Although the Star Excursion Balance test (SEBT) has shown a good intrasession reliability, the intersession reliability of this test has not been deeply studied. Furthermore, there is an evident high influence of the lower limbs in the performance of the SEBT, so even if it has been used to measure core stability, it is possibly not the most suitable measurement. The aims of this study were to (1) to assess the absolute and relative between-session reliability of the SEBT and 2 novel variations of this test to assess trunk postural control while sitting, ie, the Star Excursion Sitting Test (SEST) and the Star Excursion Timing Test (SETT); and (2) to analyze the relationships between these 3 test scores. Correlational and reliability test-retest study. Controlled laboratory environment. Twenty-seven physically active men (age: 24.54 ± 3.05 years). Relative and absolute reliability of the SEBT, SEST, and SETT were calculated through the intraclass correlation coefficient (ICC) and standard error of measurement (SEM), respectively. A Pearson correlation analysis was carried out between the variables of the 3 tests. Maximum normalized reach distances were assessed for different SEBT and SEST directions. In addition, composite indexes were calculated for SEBT, SEST, and SETT. The SEBT (dominant leg: ICC = 0.87 [0.73-0.94], SEM = 2.12 [1.66-2.93]; nondominant leg: ICC = 0.74 [0.50-0.87], SEM = 3.23 [2.54-4.45]), SEST (ICC = 0.85 [0.68-0.92], SEM = 1.27 [1.03-1.80]), and SETT (ICC = 0.61 [0.30-0.80], SEM = 2.31 [1.82-3.17]) composite indexes showed moderate-to-high 1-month reliability. A learning effect was detected for some SEBT and SEST directions and for SEST and SETT composite indexes. No significant correlations were found between SEBT and its 2 variations (r ≤ .366; P > .05). A significant correlation was found between the SEST and SETT composite indexes (r = .520; P > .01). SEBT, SEST, and SETT are reliable field protocols to measure postural control. However
A Most Probable Point-Based Method for Reliability Analysis, Sensitivity Analysis and Design Optimization

NASA Technical Reports Server (NTRS)

Hou, Gene J.-W; Newman, Perry A. (Technical Monitor)

2004-01-01

A major step in a most probable point (MPP)-based method for reliability analysis is to determine the MPP. This is usually accomplished by using an optimization search algorithm. The minimum distance associated with the MPP provides a measurement of safety probability, which can be obtained by approximate probability integration methods such as FORM or SORM. The reliability sensitivity equations are derived first in this paper, based on the derivatives of the optimal solution. Examples are provided later to demonstrate the use of these derivatives for better reliability analysis and reliability-based design optimization (RBDO).
Reliability analysis for radiographic measures of lumbar lordosis in adult scoliosis: a case–control study comparing 6 methods

PubMed Central

Hong, Jae Young; Modi, Hitesh N.; Hur, Chang Yong; Song, Hae Ryong; Park, Jong Hoon

2010-01-01

Several methods are used to measure lumbar lordosis. In adult scoliosis patients, the measurement is difficult due to degenerative changes in the vertebral endplate as well as the coronal and sagittal deformity. We did the observational study with three examiners to determine the reliability of six methods for measuring the global lumbar lordosis in adult scoliosis patients. Ninety lateral lumbar radiographs were collected for the study. The radiographs were divided into normal (Cobb < 10°), low-grade (Cobb 10°–19°), high-grade (Cobb ≥ 20°) group to determine the reliability of Cobb L1–S1, Cobb L1–L5, centroid, posterior tangent L1–S1, posterior tangent L1–L5 and TRALL method in adult scoliosis. The 90 lateral radiographs were measured twice by each of the three examiners using the six measurement methods. The data was analyzed to determine the inter- and intra-observer reliability. In general, for the six radiographic methods, the inter- and intra-class correlation coefficients (ICCs) were all ≥0.82. A comparison of the ICCs and 95% CI for the inter- and intra-observer reliability between the groups with varying degrees of scoliosis showed that, the reliability of the lordosis measurement decreased with increasing severity of scoliosis. In Cobb L1–S1, centroid and posterior tangent L1–S1 methods, the ICCs were relatively lower in the high-grade scoliosis group (≥0.60). And, the mean absolute difference (MAD) in these methods was high in the high-grade scoliosis group (≤7.17°). However, in the Cobb L1–L5 and posterior tangent L1–L5 method, the ICCs were ≥0.86 in all groups. And, in the TRALL method, the ICCs were ≥0.76 in all groups. In addition, in the Cobb L1–L5 and posterior tangent L1–L5 method, the MAD was ≤3.63°. And, in the TRALL method, the MAD was ≤3.84° in all groups. We concluded that the Cobb L1–L5 and the posterior tangent L1–L5 methods are reliable methods for measuring the global lumbar lordosis
Reliability issues in human brain temperature measurement

PubMed Central

2009-01-01

brain temperature sensors are fundamentally accurate and the measurements are precise to within 0.1 to 0.2°C. Subtle dissociation between brain and body temperature in excess of 0.1 to 0.2°C is likely to be real. Body temperature sensors need to be secured in position to ensure that measurements are reliable. PMID:19573241
Aircraft and ground vehicle friction measurements obtained under winter runway conditions

NASA Technical Reports Server (NTRS)

Yager, Thomas J.

1989-01-01

Tests with specially instrumented NASA B-737 and B-727 aircraft together with several different ground friction measuring devices have been conducted for a variety of runway surface types and wetness conditions. This effort is part of the Joint FAA/NASA Aircraft/Ground Vehicle Runway Friction Program aimed at obtaining a better understanding of aircraft ground handling performance under adverse weather conditions, and defining relationships between aircraft and ground vehicle tire friction measurements. Aircraft braking performance on dry, wet, snow-, and ice-covered runway conditions is discussed together with ground vehicle friction data obtained under similar runway conditions. For the wet, compacted snow- and ice-covered runway conditions, the relationship between ground vehicles and aircraft friction data is identified. The influence of major test parameters on friction measurements such as speed, test tire characteristics, and surface contaminant-type are discussed. The test results indicate that use of properly maintained and calibrated ground vehicles for monitoring runway friction conditions should be encouraged particularly under adverse weather conditions.
Reliability of the hand held dynamometer in measuring muscle strength in people with interstitial lung disease.

PubMed

Dowman, Leona; McDonald, Christine F; Hill, Catherine J; Lee, Annemarie; Barker, Kathryn; Boote, Claire; Glaspole, Ian; Goh, Nicole; Southcott, Annemarie; Burge, Angela; Ndongo, Rebecca; Martin, Alicia; Holland, Anne E

2016-09-01

To evaluate the inter-rater and intra-rater reliability of the hand held dynamometer in measuring muscle strength in people with interstitial lung disease (ILD). Test retest reliability of hand-held dynamometry for elbow flexor and knee extensor strength between two independent raters and two testing sessions. Physiotherapy department within a tertiary hospital. Thirty participants with ILD of varying aetiology were included. Twenty participants completed the inter-rater reliability protocol (10 idiopathic pulmonary fibrosis, mean (SD) age 73 (10) years, 11 male) and 21 participants completed the intra-rater reliability protocol (10 idiopathic pulmonary fibrosis, mean age 71 (10) years, 11 male). Mean muscle strength (kg). Agreement between the two raters and testing sessions was analyzed using Bland-Altman plots and reliability was estimated using intraclass correlation coefficients (ICC). For elbow flexor strength there was a mean difference between raters of -0.6kg (limits of agreement (LOA) -5.6 to 4.4kg) and within raters of -0.3kg (LOA -2.8 to 2.3kg). The ICCs were 0.95 and 0.98, respectively. For knee extensor strength there was a mean difference between raters of -1.5kg (LOA -6.9 to 3.9kg) and within raters of -0.7kg (LOA -3.9 to 2.4kg). The ICCs were 0.95 and 0.97, respectively. Hand-held dynamometry is reliable in measuring elbow flexor and knee extensor strength in people with ILD. Copyright © 2015 Chartered Society of Physiotherapy. Published by Elsevier Ltd. All rights reserved.
Reliability and validity assessment of administrative databases in measuring the quality of rectal cancer management.

PubMed

Corbellini, Carlo; Andreoni, Bruno; Ansaloni, Luca; Sgroi, Giovanni; Martinotti, Mario; Scandroglio, Ildo; Carzaniga, Pierluigi; Longoni, Mauro; Foschi, Diego; Dionigi, Paolo; Morandi, Eugenio; Agnello, Mauro

2018-01-01

Measurement and monitoring of the quality of care using a core set of quality measures are increasing in health service research. Although administrative databases include limited clinical data, they offer an attractive source for quality measurement. The purpose of this study, therefore, was to evaluate the completeness of different administrative data sources compared to a clinical survey in evaluating rectal cancer cases. Between May 2012 and November 2014, a clinical survey was done on 498 Lombardy patients who had rectal cancer and underwent surgical resection. These collected data were compared with the information extracted from administrative sources including Hospital Discharge Dataset, drug database, daycare activity data, fee-exemption database, and regional screening program database. The agreement evaluation was performed using a set of 12 quality indicators. Patient complexity was a difficult indicator to measure for lack of clinical data. Preoperative staging was another suboptimal indicator due to the frequent missing administrative registration of tests performed. The agreement between the 2 data sources regarding chemoradiotherapy treatments was high. Screening detection, minimally invasive techniques, length of stay, and unpreventable readmissions were detected as reliable quality indicators. Postoperative morbidity could be a useful indicator but its agreement was lower, as expected. Healthcare administrative databases are large and real-time collected repositories of data useful in measuring quality in a healthcare system. Our investigation reveals that the reliability of indicators varies between them. Ideally, a combination of data from both sources could be used in order to improve usefulness of less reliable indicators.
Measurement of impulsive choice in rats: Same and alternate form test-retest reliability and temporal tracking

PubMed Central

Peterson, Jennifer R.; Hill, Catherine C.; Kirkpatrick, Kimberly

2016-01-01

Impulsive choice is typically measured by presenting smaller-sooner (SS) versus larger-later (LL) rewards, with biases towards the SS indicating impulsivity. The current study tested rats on different impulsive choice procedures with LL delay manipulations to assess same-form and alternate-form test-retest reliability. In the systematic-GE procedure (Green & Estle, 2003), the LL delay increased after several sessions of training; in the systematic-ER procedure (Evenden & Ryan, 1996), the delay increased within each session; and in the adjusting-M procedure (Mazur, 1987), the delay changed after each block of trials within a session based on each rat’s choices in the previous block. In addition to measuring choice behavior, we also assessed temporal tracking of the LL delays using the median times of responding during LL trials. The two systematic procedures yielded similar results in both choice and temporal tracking measures following extensive training, whereas the adjusting procedure resulted in relatively more impulsive choices and poorer temporal tracking. Overall, the three procedures produced acceptable same form test-retest reliability over time, but the adjusting procedure did not show significant alternate form test-retest reliability with the other two procedures. The results suggest that systematic procedures may supply better measurements of impulsive choice in rats. PMID:25490901
Test-retest reliability and minimal detectable change of two simplified 3-point balance measures in patients with stroke.

PubMed

Chen, Yi-Miau; Huang, Yi-Jing; Huang, Chien-Yu; Lin, Gong-Hong; Liaw, Lih-Jiun; Lee, Shih-Chieh; Hsieh, Ching-Lin

2017-10-01

The 3-point Berg Balance Scale (BBS-3P) and 3-point Postural Assessment Scale for Stroke Patients (PASS-3P) were simplified from the BBS and PASS to overcome the complex scoring systems. The BBS-3P and PASS-3P were more feasible in busy clinical practice and showed similarly sound validity and responsiveness to the original measures. However, the reliability of the BBS-3P and PASS-3P is unknown limiting their utility and the interpretability of scores. We aimed to examine the test-retest reliability and minimal detectable change (MDC) of the BBS-3P and PASS-3P in patients with stroke. Cross-sectional study. The rehabilitation departments of a medical center and a community hospital. A total of 51 chronic stroke patients (64.7% male). Both balance measures were administered twice 7 days apart. The test-retest reliability of both the BBS-3P and PASS-3P were examined by intraclass correlation coefficients (ICC). The MDC and its percentage over the total score (MDC%) of each measure was calculated for examining the random measurement errors. The ICC values of the BBS-3P and PASS-3P were 0.99 and 0.97, respectively. The MDC% (MDC) of the BBS-3P and PASS-3P were 9.1% (5.1 points) and 8.4% (3.0 points), respectively, indicating that both measures had small and acceptable random measurement errors. Our results showed that both the BBS-3P and the PASS-3P had good test-retest reliability, with small and acceptable random measurement error. These two simplified 3-level balance measures can provide reliable results over time. Our findings support the repeated administration of the BBS-3P and PASS-3P to monitor the balance of patients with stroke. The MDC values can help clinicians and researchers interpret the change scores more precisely.
Validity and Reliability of a New Instrument to Measure Cancer-Related Fatigue in Adolescents

PubMed Central

Hinds, Pamela S.; Hockenberry, Marilyn; Tong, Xin; Rai, Shesh N.; Gattuso, Jamie S.; McCarthy, Kathleen; Pui, Ching-Hon; Srivastava, Deo Kumar

2008-01-01

Adolescents undergoing treatment for cancer rate fatigue as their most prevalent and intense cancer- and treatment-related effect. Parents and staff rate it similarly. Despite its reported prevalence, intensity, and distressing effects, cancer-related fatigue in adolescents is not routinely assessed during or after cancer treatment. We contend that the insufficient clinical attention is primarily due to the lack of a reliable and valid self-report instrument with which adolescent cancer-related fatigue can be measured. Our aim was to determine the reliability and construct validity of a new instrument and its ability to measure change in fatigue over time. Initial testing involved 64 adolescents undergoing curative treatment of cancer who completed the Fatigue Scale-Adolescent (FS-A) at two to four key points in treatment in one of four studies. Internal consistency estimates ranged from 0.67 to 0.95. Validity estimates involving the FS-A with the parent version ranged from 0.13 to 0.76; estimates involving the staff version and the Reynolds Depression Scale were 0.27 and 0.87 respectively. Additional validity findings included significant fatigue differences between anemic and non-anemic patients (P = 0.042) and the emergence of four factors in an exploratory factor analysis. Findings further indicate that the FS-A can be used to measure change over time (t = 2.55, P <0.01). In summary, the FS-A has moderate to strong reliability and impressive validity coefficients for a new research instrument. PMID:17629669
Is intra-bladder pressure measurement a reliable indicator for raised intra-abdominal pressure? A prospective comparative study.

PubMed

Al-Abassi, Abdulla Ahmed; Al Saadi, Azan Saleh; Ahmed, Faisal

2018-06-19

Intra-abdominal pressure (IAP) can be measured by several indirect methods; however, the urinary bladder is largely preferred. The aim of this study was to compare intra-bladder pressure (IBP) at different levels of IAPs and assess its reliability as an indirect method for IAP measurement. We compared IBP with IAP in twenty-one patients undergoing laparoscopic cholecystectomy under general anesthesia. Measurements were recorded at increasing levels of insufflation pressures to approximately 22 mmHg. Pearson's correlation coefficient was calculated to establish the relationship between the two pressure measurements and Bland-Altman analysis was used to assess the limits of agreement between the two methods of measurements. The urinary bladder pressures reflected well the pressures in the abdominal cavity. Pearson correlation coefficient showed a good correlation between the two measurement techniques (r = 0.966, p < 0.0001) and Bland-Altman analysis indicated that the 95% limits of agreement between the two methods ranged from - 2.83 to 2.64. This range is accepted both clinically and according to the recommendations of the World Society of Abdominal Compartment Syndrome (WSACS). Our study showed that IBP measurement is a simple, minimally invasive method that may reliably estimates IAP in patients placed in supine position. Measurements for pressures higher than 12 mmHg may be less reliable. When applied clinically, this should alert the clinician to take safety measures to avoid abdominal compartment syndrome (ACS).
Reliability and validity of the brief multidimensional measure of religiousness/spirituality among adolescents.

PubMed

Harris, Sion Kim; Sherritt, Lon R; Holder, David W; Kulig, John; Shrier, Lydia A; Knight, John R

2008-12-01

Developed for use in health research, the Brief Multidimensional Measure of Religiousness/Spirituality (BMMRS) consists of brief measures of a broad range of religiousness and spirituality (R/S) dimensions. It has established psychometric properties among adults, but little is known about its appropriateness for use with adolescents. We assessed the psychometric properties of the BMMRS among adolescents. We recruited a racially diverse (85% non-White) sample of 305 adolescents aged 12-18 years (median 16 yrs, IQR 14-17) from 3 urban medical clinics; 93 completed a retest 1 week later. We assessed internal consistency and test-retest reliability. We assessed construct validity by examining how well the measures discriminated groups expected to differ based on self-reported religious preference, and how they related to a hypothesized correlate, depressive symptoms. Religious preference was categorized into "No religion/Atheist" (11%), "Don't know/Confused" (9%), or "Named a religion" (80%). Responses to multi-item measures were generally internally consistent (alpha > or = 0.70 for 12/16 measures) and stable over 1 week (intraclass correlation coefficients > or = 0.70 for 14/16). Forgiveness, Negative R/S Coping, and Commitment items showed lower internal cohesiveness. Scores on most measures were higher (p < 0.05) among those who "Named a religion" compared to the "No religion/Atheist" group. Forgiveness, Commitment, and Anticipated Support from members of one's congregation were inversely correlated with depressive symptoms, while BMMRS measures assessing negative R/S experiences (Negative R/S Coping, Negative Interactions with others in congregation, Loss in Faith) were positively correlated with depressive symptoms. These findings suggest that most BMMRS measures are reliable and valid for use among adolescents.
Intra-observer reproducibility and interobserver reliability of the radiographic parameters in the Spinal Deformity Study Group's AIS Radiographic Measurement Manual.

PubMed

Dang, Natasha Radhika; Moreau, Marc J; Hill, Douglas L; Mahood, James K; Raso, James

2005-05-01

Retrospective cross-sectional assessment of the reproducibility and reliability of radiographic parameters. To measure the intra-examiner and interexaminer reproducibility and reliability of salient radiographic features. The management and treatment of adolescent idiopathic scoliosis (AIS) depends on accurate and reproducible radiographic measurements of the deformity. Ten sets of radiographs were randomly selected from a sample of patients with AIS, with initial curves between 20 degrees and 45 degrees. Fourteen measures of the deformity were measured from posteroanterior and lateral radiographs by 2 examiners, and were repeated 5 times at intervals of 3-5 days. Intra-examiner and interexaminer differences were examined. The parameters include measures of curve size, spinal imbalance, sagittal kyphosis and alignment, maximum apical vertebral rotation, T1 tilt, spondylolysis/spondylolisthesis, and skeletal age. Intra-examiner reproducibility was generally excellent for parameters measured from the posteroanterior radiographs but only fair to good for parameters from the lateral radiographs, in which some landmarks were not clearly visible. Of the 13 parameters observed, 7 had excellent interobserver reliability. The measurements from the lateral radiograph were less reproducible and reliable and, thus, may not add value to the assessment of AIS. Taking additional measures encourages a systematic and comprehensive assessment of spinal radiographs.
Test-Retest Reliability of Self-Reported Sexual Health Measures among US Hispanic Adolescents

ERIC Educational Resources Information Center

Jerman, Petra; Berglas, Nancy F.; Rohrbach, Louise A.; Constantine, Norman A.

2016-01-01

Objective: Although Hispanic adolescents in the USA are often the focus of sexual health interventions, their response to survey measures has rarely been assessed within evaluation studies. This study documents the test-retest reliability of a wide range of self-reported sexual health values, attitudes, knowledge and behaviours among Hispanic…
Quantifying frontal plane knee motion during single limb squats: reliability and validity of 2-dimensional measures.

PubMed

Gwynne, Craig R; Curran, Sarah A

2014-12-01

Clinical assessment of lower limb kinematics during dynamic tasks may identify individuals who demonstrate abnormal movement patterns that may lead to etiology of exacerbation of knee conditions such as patellofemoral joint (PFJt) pain. The purpose of this study was to determine the reliability, validity and associated measurement error of a clinically appropriate two-dimensional (2-D) procedure of quantifying frontal plane knee alignment during single limb squats. Nine female and nine male recreationally active subjects with no history of PFJt pain had frontal plane limb alignment assessed using three-dimensional (3-D) motion analysis and digital video cameras (2-D analysis) while performing single limb squats. The association between 2-D and 3-D measures was quantified using Pearson's product correlation coefficients. Intraclass correlation coefficients (ICCs) were determined for within- and between-session reliability of 2-D data and standard error of measurement (SEM) was used to establish measurement error. Frontal plane limb alignment assessed with 2-D analysis demonstrated good correlation compared with 3-D methods (r = 0.64 to 0.78, p < 0.001). Within-session (0.86) and between-session ICCs (0.74) demonstrated good reliability for 2-D measures and SEM scores ranged from 2° to 4°. 2-D measures have good consistency and may provide a valid measure of lower limb alignment when compared to existing 3-D methods. Assessment of lower limb kinematics using 2-D methods may be an accurate and clinically useful alternative to 3-D motion analysis when identifying individuals who demonstrate abnormal movement patterns associated with PFJt pain. 2b.
A new approach for improving reliability of personal navigation devices under harsh GNSS signal conditions.

PubMed

Dhital, Anup; Bancroft, Jared B; Lachapelle, Gérard

2013-11-07

In natural and urban canyon environments, Global Navigation Satellite System (GNSS) signals suffer from various challenges such as signal multipath, limited or lack of signal availability and poor geometry. Inertial sensors are often employed to improve the solution continuity under poor GNSS signal quality and availability conditions. Various fault detection schemes have been proposed in the literature to detect and remove biased GNSS measurements to obtain a more reliable navigation solution. However, many of these methods are found to be sub-optimal and often lead to unavailability of reliability measures, mostly because of the improper characterization of the measurement errors. A robust filtering architecture is thus proposed which assumes a heavy-tailed distribution for the measurement errors. Moreover, the proposed filter is capable of adapting to the changing GNSS signal conditions such as when moving from open sky conditions to deep canyons. Results obtained by processing data collected in various GNSS challenged environments show that the proposed scheme provides a robust navigation solution without having to excessively reject usable measurements. The tests reported herein show improvements of nearly 15% and 80% for position accuracy and reliability, respectively, when applying the above approach.
A New Approach for Improving Reliability of Personal Navigation Devices under Harsh GNSS Signal Conditions

PubMed Central

Dhital, Anup; Bancroft, Jared B.; Lachapelle, Gérard

2013-01-01

In natural and urban canyon environments, Global Navigation Satellite System (GNSS) signals suffer from various challenges such as signal multipath, limited or lack of signal availability and poor geometry. Inertial sensors are often employed to improve the solution continuity under poor GNSS signal quality and availability conditions. Various fault detection schemes have been proposed in the literature to detect and remove biased GNSS measurements to obtain a more reliable navigation solution. However, many of these methods are found to be sub-optimal and often lead to unavailability of reliability measures, mostly because of the improper characterization of the measurement errors. A robust filtering architecture is thus proposed which assumes a heavy-tailed distribution for the measurement errors. Moreover, the proposed filter is capable of adapting to the changing GNSS signal conditions such as when moving from open sky conditions to deep canyons. Results obtained by processing data collected in various GNSS challenged environments show that the proposed scheme provides a robust navigation solution without having to excessively reject usable measurements. The tests reported herein show improvements of nearly 15% and 80% for position accuracy and reliability, respectively, when applying the above approach. PMID:24212120
RELIABILITY AND APPLICABILITY OF DSTS AND BOTTOMHOLE PRESSURE MEASUREMENTS IN TEXAS GULF TERTIARY FORMATIONS

EPA Science Inventory

Pressure data gathered from drillstem tests (DSTs) and bottomhole pressure measurements provide critical information toward formation and can be used for an assessment of prevailing pressure regimes and their influence on the migration potential of formation fluids. Reliability o...

A Measure of Perceived Argument Strength: Reliability and Validity

PubMed Central

Zhao, Xiaoquan; Strasser, Andrew; Cappella, Joseph N.; Lerman, Caryn; Fishbein, Martin

2014-01-01

Studies of the content of persuasive messages in which the central arguments of the message are scrutinized have traditionally relied on the technique of thought-listing to assess argument strength. Although the validity of the thought-listing procedure is well documented, its utility can be limited in situations involving non-adult populations and sensitive topics. In this paper we present a self-reported scale that can be used to assess perceived argument strength in contexts where thought-listing may be less appropriate. This scale taps into perceived argument strength from multiple points of view, including but also extending beyond the potential of the argument to elicit positive and negative thoughts. Reliability and validity of this scale were assessed in health communication contexts involving anti-drug PSAs directed at adolescents and anti-smoking PSAs targeting adults. Evidence of convergence between this scale and the thought-listing technique was also obtained using the classical comprehensive exam arguments. PMID:25568663
Questionnaire for measuring organisational attributes in dental-care practices: psychometric properties and test-retest reliability.

PubMed

Goetz, Katja; Hasse, Philipp; Szecsenyi, Joachim; Campbell, Stephen M

2016-04-01

The consideration of organisational aspects, such as shared goals and clear communication, within the health care team is important to ensure good quality care. In primary health care, the instrument Survey of Organizational Attributes for Primary Care (SOAPC) is available to measure organisational attributes of care. However, there is no instrument available for dental care. The aim of the present study was to investigate psychometric properties and test-retest reliability of the version of SOAPC adapted for dental care, namely the Survey of Organizational Attributes in Dental Care (SOADC). The SOADC consists of 21 items in the following four subscales: communication; decision making; stress/chaos; and history of change. Convergent construct validity was measured using the job satisfaction scale. A total of 287 dental-care practices were asked to participate in the validation study. Psychometric properties and test-retest reliability were observed. A total of 43 dental-care practices responded to the survey. At baseline, 178 dental-care staff completed the questionnaire, and 4 weeks later 138 did so. Internal consistency, measured by Cronbach's alpha, was 0.718 or higher in the subscales. The test-retest reliability for each subscale and the overall SOADC score demonstrated good correlations over the 4-week test-retest interval, except for 'history of change'. A strong correlation with the aggregated job-satisfaction scale showed high convergent construct validity of SOADC. The consideration of organisational aspects from the perspective of dental-care teams is important for providing good quality of care. The SOADC is a reliable instrument with good psychometric properties and is suitable for the evaluation of organisational attributes in dental-care practices. © 2015 FDI World Dental Federation.
Sustaining Reliability on Accountability Measures at The Johns Hopkins Hospital.

PubMed

Pronovost, Peter J; Holzmueller, Christine G; Callender, Tiffany; Demski, Renee; Winner, Laura; Day, Richard; Austin, J Matthew; Berenholtz, Sean M; Miller, Marlene R

2016-02-01

In 2012 Johns Hopkins Medicine leaders challenged their health system to reliably deliver best practice care linked to nationally vetted core measures and achieve The Joint Commission Top Performer on Key Quality Measures ®program recognition and the Delmarva Foundation award. Thus, the Armstrong Institute for Patient Safety and Quality implemented an initiative to ensure that ≥96% of patients received care linked to measures. Nine low-performing process measures were targeted for improvement-eight Joint Commission accountability measures and one Delmarva Foundation core measure. In the initial evaluation at The Johns Hopkins Hospital, all accountability measures for the Top Performer program reached the required ≥95% performance, gaining them recognition by The Joint Commission in 2013. Efforts were made to sustain performance of accountability measures at The Johns Hopkins Hospital. Improvements were sustained through 2014 using the following conceptual framework: declare and communicate goals, create an enabling infrastructure, engage clinicians and connect them in peer learning communities, report transparently, and create accountability systems. One part of the accountability system was for teams to create a sustainability plan, which they presented to senior leaders. To support sustained improvements, Armstrong Institute leaders added a project management office for all externally reported quality measures and concurrent reviewers to audit performance on care processes for certain measure sets. The Johns Hopkins Hospital sustained performance on all accountability measures, and now more than 96% of patients receive recommended care consistent with nationally vetted quality measures. The initiative methods enabled the transition of quality improvement from an isolated project to a way of leading an organization.
Reliability and limits of agreement of circumferential, water displacement, and optoelectronic volumetry in the measurement of upper limb lymphedema.

PubMed

Deltombe, T; Jamart, J; Recloux, S; Legrand, C; Vandenbroeck, N; Theys, S; Hanson, P

2007-03-01

We conducted a reliability comparison study to determine the intrarater and inter-rater reliability and the limits of agreement of the volume estimated by circumferential measurements using the frustum sign method and the disk model method, by water displacement volumetry, and by infrared optoelectronic volumetry in the assessment of upper limb lymphedema. Thirty women with lymphedema following axillary lymph node dissection surgery for breast cancer surgery were enrolled. In each patient, the volumes of the upper limbs were estimated by three physical therapists using circumference measurements, water displacement and optoelectronic volumetry. One of the physical therapists performed each measure twice. Intraclass correlation coefficients (ICCs), relative differences, and limits of agreement were determined. Intrarater and interrater reliability ICCs ranged from 0.94 to 1. Intrarater relative differences were 1.9% for the disk model method, 3.2% for the frustum sign model method, 2.9% for water displacement volumetry, and 1.5% for optoelectronic volumetry. Intrarater reliability was always better than interrater, except for the optoelectronic method. Intrarater and interrater limits of agreement were calculated for each technique. The disk model method and optoelectronic volumetry had better reliability than the frustum sign method and water displacement volumetry, which is usually considered to be the gold standard. In terms of low-cost, simplicity, and reliability, we recommend the disk model method as the method of choice in clinical practice. Since intrarater reliability was always better than interrater reliability (except for optoelectronic volumetry), patients should therefore, ideally, always be evaluated by the same therapist. Additionally, the limits of agreement must be taken into account when determining the response of a patient to treatment.
Reliability and validity of a measure of role functioning among people with psychiatric disabilities.

PubMed

Harris, Meredith; Gladman, Beverley; Hennessy, Nicole; Lloyd, Chris; Mowry, Bryan; Waghorn, Geoffrey

2011-06-01

The aim was to investigate the reliability and validity of the Socially Valued Role Classification Scale (SRCS), a domain-specific measure of role functioning designed for use with community residents with psychiatric disabilities. Test-retest reliability, concurrent validity, face validity, consumer and clinician acceptability and utility were examined. Sixty community residents with schizophrenia or schizoaffective disorder participated in this study where the SRCS was administered by telephone. Test-retest reliability showed good or very good agreement for subscale scores (intraclass correlations (ICCs): 0.78-0.89) and for items capturing amount of participation in domain-specific activities (ICC: 0.67-1.00). Greater variation was observed for items capturing assistance required with activities (κ: 0.40-0.75), and standard of activities performed (κ: 0.43-1.00). Concurrent validity was supported by moderate to very good associations in the directions expected. Face validity, user acceptability and utility in telephone interviews were adequate. These findings add to previous psychometric evidence and support the continued development of the SRCS for use in community mental health settings. The SRCS has promising utility for occupational therapists involved in psychiatric rehabilitation outcome measurement. © 2011 Queensland Health - QCMHR. Australian Occupational Therapy Journal © 2011 Australian Association of Occupational Therapists.
Reliability and Validity of Objective Measures of Physical Activity in Youth With Cerebral Palsy Who Are Ambulatory.

PubMed

O'Neil, Margaret E; Fragala-Pinkham, Maria; Lennon, Nancy; George, Ameeka; Forman, Jeffrey; Trost, Stewart G

2016-01-01

Physical therapy for youth with cerebral palsy (CP) who are ambulatory includes interventions to increase functional mobility and participation in physical activity (PA). Thus, reliable and valid measures are needed to document PA in youth with CP. The purpose of this study was to evaluate the inter-instrument reliability and concurrent validity of 3 accelerometer-based motion sensors with indirect calorimetry as the criterion for measuring PA intensity in youth with CP. Fifty-seven youth with CP (mean age=12.5 years, SD=3.3; 51% female; 49.1% with spastic hemiplegia) participated. Inclusion criteria were: aged 6 to 20 years, ambulatory, Gross Motor Function Classification System (GMFCS) levels I through III, able to follow directions, and able to complete the full PA protocol. Protocol activities included standardized activity trials with increasing PA intensity (resting, writing, household chores, active video games, and walking at 3 self-selected speeds), as measured by weight-relative oxygen uptake (in mL/kg/min). During each trial, participants wore bilateral accelerometers on the upper arms, waist/hip, and ankle and a portable indirect calorimeter. Intraclass coefficient correlations (ICCs) were calculated to evaluate inter-instrument reliability (left-to-right accelerometer placement). Spearman correlations were used to examine concurrent validity between accelerometer output (activity and step counts) and indirect calorimetry. Friedman analyses of variance with post hoc pair-wise analyses were conducted to examine the validity of accelerometers to discriminate PA intensity across activity trials. All accelerometers exhibited excellent inter-instrument reliability (ICC=.94-.99) and good concurrent validity (rho=.70-.85). All accelerometers discriminated PA intensity across most activity trials. This PA protocol consisted of controlled activity trials. Accelerometers provide valid and reliable measures of PA intensity among youth with CP. © 2016 American
Brazilian Version of the Functional Assessment Measure: Cross-Cultural Adaptation and Reliability Evaluation

ERIC Educational Resources Information Center

Lourenco Jorge, Liliana; Garcia Marchi, Flavia Helena; Portela Hara, Ana Clara; Battistella, Linamara R.

2011-01-01

The objective of this prospective study was to perform a cross-cultural adaptation of the Functional Assessment Measure (FAM) into Brazilian Portuguese, and to assess the test-retest reliability. The instrument was translated, back-translated, pretested, and reviewed by a committee. The Brazilian version was assessed in 61 brain-injury patients.…
Toward increased reliability in the electric power industry: direct temperature measurement in transformers using fiber optic sensors

NASA Astrophysics Data System (ADS)

McDonald, Greg

1998-09-01

Optimal loading, prevention of catastrophic failures and reduced maintenance costs are some of the benefits of accurate determination of hot spot winding temperatures in medium and high power transformers. Temperature estimates obtained using current theoretical models are not always accurate. Traditional technology (IR, thermocouples...) are unsuitable or inadequate for direct measurement. Nortech fiber-optic temperature sensors offer EMI immunity and chemical resistance and are a proven solution to the problem. The Nortech sensor's measurement principle is based on variations in the spectral absorption of a fiber-mounted semiconductor chip and probes are interchangeable with no need for recalibration. Total length of probe + extension can be up to several hundred meters allowing system electronics to be located in the control room or mounted in the transformer instrumentation cabinet. All of the sensor materials withstand temperatures up to 250 degree(s)C and have demonstrated excellent resistance to the harsh transformer environment (hot oil, kerosene). Thorough study of the problem and industry collaboration in testing and installation allows Nortech to identify and meet the need for durable probes, leak-proof feedthroughs, standard computer interfaces and measurement software. Refined probe technology, the method's simplicity and reliable calibration are all assets that should lead to growing acceptance of this type of direct measuring in the electric power industry.
Measuring surface topography with scanning electron microscopy. I. EZEImage: a program to obtain 3D surface data.

PubMed

Ponz, Ezequiel; Ladaga, Juan Luis; Bonetto, Rita Dominga

2006-04-01

Scanning electron microscopy (SEM) is widely used in the science of materials and different parameters were developed to characterize the surface roughness. In a previous work, we studied the surface topography with fractal dimension at low scale and two parameters at high scale by using the variogram, that is, variance vs. step log-log graph, of a SEM image. Those studies were carried out with the FERImage program, previously developed by us. To verify the previously accepted hypothesis by working with only an image, it is indispensable to have reliable three-dimensional (3D) surface data. In this work, a new program (EZEImage) to characterize 3D surface topography in SEM has been developed. It uses fast cross correlation and dynamic programming to obtain reliable dense height maps in a few seconds which can be displayed as an image where each gray level represents a height value. This image can be used for the FERImage program or any other software to obtain surface topography characteristics. EZEImage also generates anaglyph images as well as characterizes 3D surface topography by means of a parameter set to describe amplitude properties and three functional indices for characterizing bearing and fluid properties.
Validity and reliability of an instrumented leg-extension machine for measuring isometric muscle strength of the knee extensors.

PubMed

Ruschel, Caroline; Haupenthal, Alessandro; Jacomel, Gabriel Fernandes; Fontana, Heiliane de Brito; Santos, Daniela Pacheco dos; Scoz, Robson Dias; Roesler, Helio

2015-05-20

Isometric muscle strength of knee extensors has been assessed for estimating performance, evaluating progress during physical training, and investigating the relationship between isometric and dynamic/functional performance. To assess the validity and reliability of an adapted leg-extension machine for measuring isometric knee extensor force. Validity (concurrent approach) and reliability (test and test-retest approach) study. University laboratory. 70 healthy men and women aged between 20 and 30 y (39 in the validity study and 31 in the reliability study). Intraclass correlation coefficient (ICC) values calculated for the maximum voluntary isometric torque of knee extensors at 30°, 60°, and 90°, measured with the prototype and with an isokinetic dynamometer (ICC2,1, validity study) and measured with the prototype in test and retest sessions, scheduled from 48 h to 72 h apart (ICC1,1, reliability study). In the validity analysis, the prototype showed good agreement for measurements at 30° (ICC2,1 = .75, SEM = 18.2 Nm) and excellent agreement for measurements at 60° (ICC2,1 = .93, SEM = 9.6 Nm) and at 90° (ICC2,1 = .94, SEM = 8.9 Nm). Regarding the reliability analysis, between-days' ICC1,1 were good to excellent, ranging from .88 to .93. Standard error of measurement and minimal detectable difference based on test-retest ranged from 11.7 Nm to 18.1 Nm and 32.5 Nm to 50.1 Nm, respectively, for the 3 analyzed knee angles. The analysis of validity and repeatability of the prototype for measuring isometric muscle strength has shown to be good or excellent, depending on the knee joint angle analyzed. The new instrument, which presents a relative low cost and easiness of transportation when compared with an isokinetic dynamometer, is valid and provides consistent data concerning isometric strength of knee extensors and, for this reason, can be used for practical, clinical, and research purposes.
Reliability of a Cryoscopic Micro-Osmometer Using 15-µL Plasma Samples to Measure Hydration Status in Varied Environmental Conditions

ERIC Educational Resources Information Center

Scanlan, Aaron T.; Richter-Stretton, Gina L.; Madueno, Maria C.; Borges, Nattai R.; Fenning, Andrew S.

2017-01-01

Measurement of plasma osmolality (P[subscript osm]) remains popular for assessing hydration status in exercise science. However, a controlled reliability assessment of micro-osmometry using small sample volumes to measure Posm remains to be performed. This study aimed to examine the reliability of a cryoscopic micro-osmometer requiring 15-µL…
Reliability of a two-wavelength autofluorescence technique by Heidelberg Spectralis to measure macular pigment optical density in Asian subjects.

PubMed

Obana, Akira; Gellermann, Werner; Gohto, Yuko; Seto, Takahiko; Sasano, Hiroyuki; Tanito, Masaki; Okazaki, Shigetoshi

2018-03-01

This study evaluates the accuracy of an objective two-wavelength fundus autofluorescence technique for the purpose of measuring the macular pigment optical density (MPOD) in Asian pigmented eyes. Potential differences between MPOD values obtained via autofluorescence technique and subjective heterochromatic photometry (HFP) were examined. Inter-examiner reproducibility between three examiners and test-retest reliability over five time points were also explored. Subjects were 27 healthy Japanese volunteers aged 24 to 58 (mean ± standard deviation, 40.2 ± 9.0) years. An MPOD module of the Spectralis MultiColor instrument configuration (Spectralis-MP) was used for the autofluorescence technique, and a Macular Metrics Densitometer (MM) was used for HFP. The mean MPOD values at 0.25° and 0.5° eccentricities using the Spectralis-MP were 0.51 ± 0.12 and 0.48 ± 0.13, respectively. In comparison, the MM based values were 0.72 ± 0.23 and 0.61 ± 0.25, respectively. High correlations between the Spectralis-MP and MM instrument were found (Pearson's correlation coefficients of 0.73 and 0.87 at 0.25° and 0.5° eccentricities, respectively), but there was a systematic bias: the MPOD values by MM method were significantly higher than those by Spectralis-MP at 0.25° eccentricity. High inter-examiner reproducibility and test-retest reliability were found for MM measurements at 0.5° eccentricity, but not at 0.25°. The Spectralis-MP showed less inter-examiner and test-retest variability than the MM instrument at 0.25° and 0.5° eccentricities. We conclude that the Spectralis-MP, given its high agreement with the HFP method and due to its higher reproducibility and reliability, is well suited for clinical measurements of MPOD levels in Asian pigmented eyes. Copyright © 2018. Published by Elsevier Ltd.
Reliability and validity of measures used in assessing dental anxiety in 5- to 15-year-old Croatian children.

PubMed

Majstorovic, M; Veerkamp, J S; Skrinjaric, I

2003-12-01

The aim of the study was to evaluate reliability and validity of different questionnaires and predict related causes, as concomitant factors in assessing different aspects of children's dental anxiety. Children were interviewed on dental anxiety, dispositional risk factors and satisfaction with the dentist after dental treatment had been accomplished. Parents were interviewed on dental anxiety as well. The study population included 165 children (91 boys) aged 5 to 15 years, referred to a university dental clinic by general dental practitioners because of a history of fear and uncooperative behaviour during previous dental visits. Children were treated by two dentists, both experienced in treating fearful children. Statistical analysis was performed in Statistics for Windows, Release 5.5 and Release 7.5. Pearson's correlation coefficients were calculated for validity and Cronbach alpha for reliability of the measures. Spearman Brown prophecy formula was used for correction of the alpha scores. Results The children's total average CFSS-DS score was 27.02, with no significant difference with respect to gender. The highest Cronbach alpha scores regarding reliability were obtained for the S-DAI, the CFSS-DS and the PDAS. Pearson's correlations regarding validity presented significant correlations between the CMFQ, the CDAS and the S-DAI, between the OAS, the CDAS and the S-DAI, as well as between the OAS and the DVSS-SV. Previous negative medical experience had significant influence on children's dental anxiety, supporting Rachman's conditioning theory. Anxious children were more likely to show behaviour problems (aggression) and more introvert in expressing their judgement regarding the dentist. Both the S-DAI and the CFSS-DS, which were standardized in the Croatian population sample, showed the highest reliability in assessment of children's dental anxiety.
Measuring disability: a systematic review of the validity and reliability of the Global Activity Limitations Indicator (GALI).

PubMed

Van Oyen, Herman; Bogaert, Petronille; Yokota, Renata T C; Berger, Nicolas

2018-01-01

GALI or Global Activity Limitation Indicator is a global survey instrument measuring participation restriction. GALI is the measure underlying the European indicator Healthy Life Years (HLY). Gali has a substantial policy use within the EU and its Member States. The objective of current paper is to bring together what is known from published manuscripts on the validity and the reliability of GALI. Following the PRISMA guidelines, two search strategies (PUBMED, Google Scholar) were combined to identify manuscripts published in English with publication date 2000 or beyond. Articles were classified as reliability studies, concurrent or predictive validity studies, in national or international populations. Four cross-sectional studies (of which 2 international) studied how GALI relates to other health measures (concurrent validity). A dose-response effect by GALI severity level on the association with the other health status measures was observed in the national studies. The 2 international studies (SHARE, EHIS) concluded that the odds of reporting participation restriction was higher in subjects with self-reported or observed functional limitations. In SHARE, the size of the Odds Ratio's (ORs) in the different countries was homogeneous, while in EHIS the size of the ORs varied more strongly. For the predictive validity, subjects were followed over time (4 studies of which one international). GALI proved, both in national and international data, to be a consistent predictor of future health outcomes both in terms of mortality and health care expenditure. As predictors of mortality, the two distinct health concepts, self-rated health and GALI, acted independently and complementary of each other. The one reliability study identified reported a sufficient reliability of GALI. GALI as inclusive one question instrument fits all conceptual characteristics specified for a global measure on participation restriction. In none of the studies, included in the review, there was
Reliability and concurrent and construct validity of the Strategies for Weight Management measure for adults.

PubMed

Kolodziejczyk, Julia K; Norman, Gregory J; Rock, Cheryl L; Arredondo, Elva M; Roesch, Scott C; Madanat, Hala; Patrick, Kevin

2016-01-01

This study evaluates the reliability and validity of the strategies for weight management (SWM) measure, a questionnaire that assesses weight management strategies for adults. The SWM includes 20 items that are categorized within the following subscales: (1) energy intake, (2) energy expenditure, (3) self-monitoring, and (4) self-regulation. Baseline and 6-month data were collected from 404 overweight/obese adults (mean age=22±3.8 years, 68% ethnic minority) enrolled in a randomized controlled trial aiming to reduce weight by improving diet and physical activity behaviours. Reliability and validity were assessed for each subscale separately. Cronbach alpha was conducted to assess reliability. Concurrent, construct I (sensitivity to the study treatment condition), and construct II (relationship to the outcomes) validity were assessed using linear regressions with the following outcome measures: weight, self-reported diet, and weekly energy expenditure. All subscales showed strong internal consistency. The strength of the validity evidence depended on subscale and validity type. The strongest validity evidence was concurrent validity of the energy intake and energy expenditure subscales; construct I validity of the energy intake and self-monitoring subscales; and construct II validity of the energy intake, energy expenditure, and self-regulation subscales. Results indicate that the SWM can be used to assess weight management strategies among an ethnically diverse sample of adults as each subscale showed evidence of reliability and select types of validity. As validity is an accumulation of evidence over multiple studies, this study provides initial reliability and validity evidence in one population segment. Copyright © 2015 Asia Oceania Association for the Study of Obesity. Published by Elsevier Ltd. All rights reserved.
TEST-retest reliability of kinetic variables measured on campus board in sport climbers.

PubMed

Abreu, Edgardo Alvares de Campos; Araújo, Sílvia Ribeiro Santos; Cançado, Gustavo Henrique da Cunha Peixoto; Andrade, André Gustavo Pereira de; Chagas, Mauro Heleno; Menzel, Hans-Joachim Karl

2018-05-16

Sport climbers frequently use campus board (CB) to improve their upper limb strength under similar conditions of high-difficulty sport climbing routes. The objective of this study was to assess the test-retest reliability of peak force and impulse measured using a CB instrumented with two load cells on starting holds. The same evaluator examined 22 climbers on two days with 48 h between the assessments. The participants performed five concentric lunges (CL) and five lunges with stretch-shortening cycle with 1 min intervals between repetitions and 10 min between exercises. All variables were associated with significant intraclass correlation coefficient (ICC) values (p = 0.001), and none variable showed systematic errors (p > 0.05). Peak force ICC was higher than 0.88, and the standard error of measurement (SEM%) was less than 5%. Impulse ICC for the CL was greater than 0.90, and the SEM% was less than 14%. We conclude that the kinetic variables measured using the CB were reliable. The ability of the hands to maintain contact with the holds (peak force) and the abilities of the arms and shoulders vertically move the centre of mass (impulse) should be taken into account by coaches on CB training prescription as well for further research.
Tissue tonometry is a simple, objective measure for pliability of burn scar: is it reliable?

PubMed

Lye, Ian; Edgar, Dale W; Wood, Fiona M; Carroll, Sara

2006-01-01

Objective measurement of burn scar response to treatment is important to facilitate individual patient care, research, and service development. This work examines the validity and reliability of the tonometer as a means of quantifying scar pliability. Ten burn survivors were recruited into the study. Triplicate measures were taken for each of four scar and one normal skin point. The pliability score from the Vancouver Scar Scale also was used as a comparison. The tonometer demonstrated a high degree of reliability (intraclass correlation coefficients 0.91-0.94). It also was shown to provide a valid measure of pliability by quantifying decreased tissue deformation for scar (2.04 +/- 0.45 mm) compared with normal tissue (3.02 +/- 0.92 mm; t = 4.28, P = .004) and a moderate correlation with Vancouver Scar Scale scores. The tissue tonometer provides a repeatable, objective index of burn scar pliability. Using the methods described, it is a simple, clinically useful technique for monitoring an individual's scar.
Reliability and discriminative validity of sudden ankle inversion measurements in patients with chronic ankle instability.

PubMed

Eechaute, Christophe; Vaes, Peter; Duquet, William; Van Gheluwe, Bart

2009-07-01

Studies investigating peroneal muscle reaction times in chronically unstable ankle joints present conflicting results. The degree of reliability and accuracy of these measurements is unknown in patients with chronic ankle instability (CAI). 40 patients with CAI and 30 healthy subjects were tested using a sudden ankle inversion of 50 degrees while standing on a trapdoor device. Sudden ankle inversion measurements were registered using electromyography, accelerometry and electrogoniometry. For reliability testing, intra-class coefficients (ICCs; model 3,1) and standard errors of measurements of the latency time, motor response time and electromechanical delay of the peroneus longus muscle, the time and angular position of onset of decelerations, the mean and maximum inversion speed and the total inversion time were calculated in 15 patients with CAI. To assess between-group differences, t-tests for independent samples (p<.05) were used. ICCs ranged from .20 (angular position of onset of the second deceleration) to .98 (electromechanical delay of the peroneus longus muscle). Significant between-group differences were observed in only 2 of the 12 variables (for the electromechanical delay of the peroneus longus muscle, p=.001; time of onset of the second deceleration, p=.040). The latency time and motor response time of the peroneus longus muscle, the total inversion time and the mean inversion speed demonstrate acceptable reliability in healthy subjects and patients. The latency time and motor response time of the peroneus longus muscle are not delayed in patients with CAI. Ankle inversion measurements are not discriminative for CAI.
The PedsQL 4.0 as a school population health measure: feasibility, reliability, and validity.

PubMed

Varni, James W; Burwinkle, Tasha M; Seid, Michael

2006-03-01

The application of health-related quality of life (HRQOL) as a school population health measure may facilitate risk assessment and resource allocation, the tracking of student health at the school and district level, the identification of health disparities among schoolchildren, and the determination of health outcomes from interventions and policy decisions at the school, district, and county level. To determine the feasibility, reliability, and validity of the 23-item PedsQL 4.0 (Pediatric Quality of Life Inventory) Generic Core Scales as a school population health measure for children and adolescents. Survey conducted in 304 classes at 18 elementary schools, 4 middle schools, and 3 high schools within a large metropolitan school district. The PedsQL 4.0 Generic Core Scales (Physical, Emotional, Social, School Functioning) were completed by 2437 children ages 8-18 and 4227 parents of children ages 5-18. The PedsQL 4.0 evidenced minimal missing responses, achieved excellent reliability for the Total Scale Score (alpha = 0.89 child, 0.92, parent report), and distinguished between healthy children and children with chronic health conditions. The PedsQL 4.0 was related to indicators of socioeconomic status (SES) at the school and district level. The PedsQL School Functioning Scale was significantly correlated with standardized achievement scores based on the Stanford 9. The results demonstrate the feasibility, reliability and validity of the PedsQL 4.0 Generic Core Scales as a school population health measure. The implications of measuring HRQOL in schoolchildren at the school, district, and county level for identifying and ameliorating health disparities are discussed.
Assessing the Reliability of Skills Measured by the SAT®. Research Notes. RN-24

ERIC Educational Resources Information Center

Ewing, Maureen; Huff, Kristen; Andrews, Melissa; King, Kinda

2005-01-01

In connection with the new SAT that was introduced in March 2005, research has been under way to investigate the feasibility of providing examinees with score reports that contain feedback on skills measured by the critical reading, mathematics, and writing sections of the test. The main purpose of this study was to estimate the reliability of the…

Validity and reliability of GPS and LPS for measuring distances covered and sprint mechanical properties in team sports

PubMed Central

Baumgart, Christian; Polglaze, Ted; Freiwald, Jürgen

2018-01-01

This study aimed to investigate the validity and reliability of global (GPS) and local (LPS) positioning systems for measuring distances covered and sprint mechanical properties in team sports. Here, we evaluated two recently released 18 Hz GPS and 20 Hz LPS technologies together with one established 10 Hz GPS technology. Six male athletes (age: 27±2 years; VO2max: 48.8±4.7 ml/min/kg) performed outdoors on 10 trials of a team sport-specific circuit that was equipped with double-light timing gates. The circuit included various walking, jogging, and sprinting sections that were performed either in straight-lines or with changes of direction. During the circuit, athletes wore two devices of each positioning system. From the reported and filtered velocity data, the distances covered and sprint mechanical properties (i.e., the theoretical maximal horizontal velocity, force, and power output) were computed. The sprint mechanical properties were modeled via an inverse dynamic approach applied to the center of mass. The validity was determined by comparing the measured and criterion data via the typical error of estimate (TEE), whereas the reliability was examined by comparing the two devices of each technology (i.e., the between-device reliability) via the coefficient of variation (CV). Outliers due to measurement errors were statistically identified and excluded from validity and reliability analyses. The 18 Hz GPS showed better validity and reliability for determining the distances covered (TEE: 1.6–8.0%; CV: 1.1–5.1%) and sprint mechanical properties (TEE: 4.5–14.3%; CV: 3.1–7.5%) than the 10 Hz GPS (TEE: 3.0–12.9%; CV: 2.5–13.0% and TEE: 4.1–23.1%; CV: 3.3–20.0%). However, the 20 Hz LPS demonstrated superior validity and reliability overall (TEE: 1.0–6.0%; CV: 0.7–5.0% and TEE: 2.1–9.2%; CV: 1.6–7.3%). For the 10 Hz GPS, 18 Hz GPS, and 20 Hz LPS, the relative loss of data sets due to measurement errors was 10.0%, 20.0%, and 15
Validity and reliability of GPS and LPS for measuring distances covered and sprint mechanical properties in team sports.

PubMed

Hoppe, Matthias W; Baumgart, Christian; Polglaze, Ted; Freiwald, Jürgen

2018-01-01

This study aimed to investigate the validity and reliability of global (GPS) and local (LPS) positioning systems for measuring distances covered and sprint mechanical properties in team sports. Here, we evaluated two recently released 18 Hz GPS and 20 Hz LPS technologies together with one established 10 Hz GPS technology. Six male athletes (age: 27±2 years; VO2max: 48.8±4.7 ml/min/kg) performed outdoors on 10 trials of a team sport-specific circuit that was equipped with double-light timing gates. The circuit included various walking, jogging, and sprinting sections that were performed either in straight-lines or with changes of direction. During the circuit, athletes wore two devices of each positioning system. From the reported and filtered velocity data, the distances covered and sprint mechanical properties (i.e., the theoretical maximal horizontal velocity, force, and power output) were computed. The sprint mechanical properties were modeled via an inverse dynamic approach applied to the center of mass. The validity was determined by comparing the measured and criterion data via the typical error of estimate (TEE), whereas the reliability was examined by comparing the two devices of each technology (i.e., the between-device reliability) via the coefficient of variation (CV). Outliers due to measurement errors were statistically identified and excluded from validity and reliability analyses. The 18 Hz GPS showed better validity and reliability for determining the distances covered (TEE: 1.6-8.0%; CV: 1.1-5.1%) and sprint mechanical properties (TEE: 4.5-14.3%; CV: 3.1-7.5%) than the 10 Hz GPS (TEE: 3.0-12.9%; CV: 2.5-13.0% and TEE: 4.1-23.1%; CV: 3.3-20.0%). However, the 20 Hz LPS demonstrated superior validity and reliability overall (TEE: 1.0-6.0%; CV: 0.7-5.0% and TEE: 2.1-9.2%; CV: 1.6-7.3%). For the 10 Hz GPS, 18 Hz GPS, and 20 Hz LPS, the relative loss of data sets due to measurement errors was 10.0%, 20.0%, and 15.8%, respectively. This study shows that
Practical Issues in Implementing Software Reliability Measurement

NASA Technical Reports Server (NTRS)

Nikora, Allen P.; Schneidewind, Norman F.; Everett, William W.; Munson, John C.; Vouk, Mladen A.; Musa, John D.

1999-01-01

Many ways of estimating software systems' reliability, or reliability-related quantities, have been developed over the past several years. Of particular interest are methods that can be used to estimate a software system's fault content prior to test, or to discriminate between components that are fault-prone and those that are not. The results of these methods can be used to: 1) More accurately focus scarce fault identification resources on those portions of a software system most in need of it. 2) Estimate and forecast the risk of exposure to residual faults in a software system during operation, and develop risk and safety criteria to guide the release of a software system to fielded use. 3) Estimate the efficiency of test suites in detecting residual faults. 4) Estimate the stability of the software maintenance process.
Test-retest reliability of biodex system 4 pro for isometric ankle-eversion and -inversion measurement.

PubMed

Tankevicius, Gediminas; Lankaite, Doanata; Krisciunas, Aleksandras

2013-08-01

The lack of knowledge about isometric ankle testing indicates the need for research in this area. to assess test-retest reliability and to determine the optimal position for isometric ankle-eversion and -inversion testing. Test-retest reliability study. Isometric ankle eversion and inversion were assessed in 3 different dynamometer foot-plate positions: 0°, 7°, and 14° of inversion. Two maximal repetitions were performed at each angle. Both limbs were tested (40 ankles in total). The test was performed 2 times with a period of 7 d between the tests. University hospital. The study was carried out on 20 healthy athletes with no history of ankle sprains. Reliability was assessed using intraclass correlation coefficient (ICC2,1); minimal detectable change (MDC) was calculated using a 95% confidence interval. Paired t test was used to measure statistically significant changes, and P <.05 was considered statistically significant. Eversion and inversion peak torques showed high ICCs in all 3 angles (ICC values .87-.96, MDC values 3.09-6.81 Nm). Eversion peak torque was the smallest when testing at the 0° angle and gradually increased, reaching maximum values at 14° angle. The increase of eversion peak torque was statistically significant at 7 ° and 14° of inversion. Inversion peak torque showed an opposite pattern-it was the smallest when measured at the 14° angle and increased at the other 2 angles; statistically significant changes were seen only between measures taken at 0° and 14°. Isometric eversion and inversion testing using the Biodex 4 Pro system is a reliable method. The authors suggest that the angle of 7° of inversion is the best for isometric eversion and inversion testing.
Assessment of lumbosacral kyphosis in spondylolisthesis: a computer-assisted reliability study of six measurement techniques

PubMed Central

Glavas, Panagiotis; Mac-Thiong, Jean-Marc; Parent, Stefan; de Guise, Jacques A.

2008-01-01

Although recognized as an important aspect in the management of spondylolisthesis, there is no consensus on the most reliable and optimal measure of lumbosacral kyphosis (LSK). Using a custom computer software, four raters evaluated 60 standing lateral radiographs of the lumbosacral spine during two sessions at a 1-week interval. The sample size consisted of 20 normal, 20 low and 20 high grade spondylolisthetic subjects. Six parameters were included for analysis: Boxall’s slip angle, Dubousset’s lumbosacral angle (LSA), the Spinal Deformity Study Group’s (SDSG) LSA, dysplastic SDSG LSA, sagittal rotation (SR), kyphotic Cobb angle (k-Cobb). Intra- and inter-rater reliability for all parameters was assessed using intra-class correlation coefficients (ICC). Correlations between parameters and slip percentage were evaluated with Pearson coefficients. The intra-rater ICC’s for all the parameters ranged between 0.81 and 0.97 and the inter-rater ICC’s were between 0.74 and 0.98. All parameters except sagittal rotation showed a medium to large correlation with slip percentage. Dubousset’s LSA and the k-Cobb showed the largest correlations (r = −0.78 and r = −0.50, respectively). SR was associated with the weakest correlation (r = −0.10). All other parameters had medium correlations with percent slip (r = 0.31–0.43). All measurement techniques provided excellent inter- and intra-rater reliability. Dubousset’s LSA showed the strongest correlation with slip grade. This parameter can be used in the clinical setting with PACS software capabilities to assess LSK. A computer-assisted technique is recommended in order to increase the reliability of the measurement of LSK in spondylolisthesis. PMID:19015898
RELIABILITY AND VALIDITY OF SUBJECTIVE ASSESSMENT OF LUMBAR LORDOSIS IN CONVENTIONAL RADIOGRAPHY.

PubMed

Ruhinda, E; Byanyima, R K; Mugerwa, H

2014-10-01

Reliability and validity studies of different lumbar curvature analysis and measurement techniques have been documented however there is limited literature on the reliability and validity of subjective visual analysis. Radiological assessment of lumbar lordotic curve aids in early diagnosis of conditions even before neurologic changes set in. To ascertain the level of reliability and validity of subjective assessment of lumbar lordosis in conventional radiography. A blinded, repeated-measures diagnostic test was carried out on lumbar spine x-ray radiographs. Radiology Department at Joint Clinical Research Centre (JCRC), Mengo-Kampala-Uganda. Seventy (70) lateral lumbar x-ray films were used for this study and were obtained from the archive of JCRC radiology department at Butikiro house, Mengo-Kampala. Poor observer agreement, both inter- and intra-observer, with kappa values of 0.16 was found. Inter-observer agreement was poorer than intra-observer agreement. Kappa values significantly rose when the lumbar lordosis was clustered into four categories without grading each abnormality. The results confirm that subjective assessment of lumbar lordosis has low reliability and validity. Film quality has limited influence on the observer reliability. This study further shows that fewer scale categories of lordosis abnormalities produce better observer reliability.
Reliability of the detailed assessment of speed of handwriting on Flemish children.

PubMed

Simons, Johan; Probst, Michel

2014-01-01

This study evaluates the reliability of the Detailed Assessment of Speed of Handwriting (DASH) in a Dutch-speaking sample of children. The sample included 650 boys and 513 girls (age range = 9-16 years). Handwriting speed measurements were obtained using the DASH. Interrater agreement, test-retest reliability, and internal consistency were calculated; gender and age effects were analyzed. Interrater agreement shows excellent reliability with intraclass correlation coefficients of at least 0.94. Test-retest correlations ranged from r = 0.65 to r = 0.81. The internal consistency measures, calculated with Cronbach's alpha, were between 0.88 and 0.94. Both gender and age have a significant effect on handwriting speed, with F (7.1144) = 17.43 (P < .001) for gender and F (7.1144) = 21.8 (P < .001) for age. The DASH is a reliable assessment tool to evaluate handwriting speed of Dutch-speaking children. There is a tendency of girls to write faster than boys.
The reliability of the Associate Platinum digital foot scanner in measuring previously developed footprint characteristics: a technical note.

PubMed

Papuga, M Owen; Burke, Jeanmarie R

2011-02-01

An ink pad and paper, pressure-sensitive platforms, and photography have previously been used to collect footprint data used in clinical assessment. Digital scanners have been widely used more recently to collect such data. The purpose of this study was to evaluate the intra- and interrater reliability of a flatbed digital image scanning technology to capture footprint data. This study used a repeated-measures design on 32 (16 male 16 female) healthy subjects. The following measured indices of footprint were recorded from 2-dimensional images of the plantar surface of the foot recorded with an Associate Platinum (Foot Levelers Inc, Roanoke, VA) digital foot scanner: Staheli index, Chippaux-Smirak index, arch angle, and arch index. Intraclass correlation coefficient (ICC) values were calculated to evaluate intrarater, interday, and interclinician reliability. The ICC values for intrarater reliability were greater than or equal to .817, indicating an excellent level of reproducibility in assessing the collected images. Analyses of variance revealed that there were no significant differences between raters for each index (P > .05). The ICC values also indicated excellent reliability (.881-.971) between days and clinicians in all but one of the indices of footprint, arch angle (.689), with good reliability between clinicians. The full-factorial analysis of variance model did not reveal any interaction effects (P > .05), which indicated that indices of footprint were not changing across days and clinicians. Scanning technology used in this study demonstrated good intra- and interrater reliability measurements of footprint indices, as demonstrated by high ICC values. Copyright © 2011 National University of Health Sciences. Published by Mosby, Inc. All rights reserved.
Reliable Gene Expression Measurements from Fine Needle Aspirates of Pancreatic Tumors

PubMed Central

Anderson, Michelle A.; Brenner, Dean E.; Scheiman, James M.; Simeone, Diane M.; Singh, Nalina; Sikora, Matthew J.; Zhao, Lili; Mertens, Amy N.; Rae, James M.

2010-01-01

Background and aims: Biomarker use for pancreatic cancer diagnosis has been impaired by a lack of samples suitable for reliable quantitative RT-PCR (qRT-PCR). Fine needle aspirates (FNAs) from pancreatic masses were studied to define potential causes of RNA degradation and develop methods for accurately measuring gene expression. Methods: Samples from 32 patients were studied. RNA degradation was assessed by using a multiplex PCR assay for varying lengths of glyceraldehyde-3-phosphate dehydrogenase, and effects on qRT-PCR were determined by using a 150-bp and a 80-bp amplicon for RPS6. Potential causes of and methods to circumvent RNA degradation were studied by using FNAs from a pancreatic cancer xenograft. Results: RNA extracted from pancreatic mass FNAs was extensively degraded. Fragmentation was related to needle bore diameter and could not be overcome by alterations in aspiration technique. Multiplex PCR for glyceraldehyde-3-phosphate dehydrogenase could distinguish samples that were suitable for qRT-PCR. The use of short PCR amplicons (<100 bp) provided reliable gene expression analysis from FNAs. When appropriate samples were used, the assay was highly reproducible for gene copy number with minimal (0.0003 or about 0.7% of total) variance. Conclusions: The degraded properties of endoscopic FNAs markedly affect the accuracy of gene expression measurements. Our novel approach to designate specimens “informative” for qRT-PCR allowed accurate molecular assessment for the diagnosis of pancreatic diseases. PMID:20709792
Day-to-day reliability of gait characteristics in rats.

PubMed

Raffalt, Peter C; Nielsen, Louise R; Madsen, Stefan; Munk Højberg, Laurits; Pingel, Jessica; Nielsen, Jens Bo; Wienecke, Jacob; Alkjær, Tine

2018-04-27

The purpose of the present study was to determine the day-to-day reliability in stride characteristics in rats during treadmill walking obtained with two-dimensional (2D) motion capture. Kinematics were recorded from 26 adult rats during walking at 8 m/min, 12 m/min and 16 m/min on two separate days. Stride length, stride time, contact time, swing time and hip, knee and ankle joint range of motion were extracted from 15 strides. The relative reliability was assessed using intra-class correlation coefficients (ICC(1,1)) and (ICC(3,1)). The absolute reliability was determined using measurement error (ME). Across walking speeds, the relative reliability ranged from fair to good (ICCs between 0.4 and 0.75). The ME was below 91 mm for strides lengths, below 55 ms for the temporal stride variables and below 6.4° for the joint angle range of motion. In general, the results indicated an acceptable day-to-day reliability of the gait pattern parameters observed in rats during treadmill walking. The results of the present study may serve as a reference material that can help future intervention studies on rat gait characteristics both with respect to the selection of outcome measures and in the interpretation of the results. Copyright © 2018 Elsevier Ltd. All rights reserved.
Factorial Validity, Reliability, and Measurement Equivalence of the Noctcaelador Inventory across Three Ethnic Groups

ERIC Educational Resources Information Center

Kelly, William E.

2008-01-01

This study examined the reliability, factorial validity, and measurement equivalence of the Noctcaelador Inventory (NI) among three ethnic groups of college students. Participants included 200 Whites, 200 African Americans, and 200 Latino/Hispanics. The results indicated that although the African American sample scored slightly lower than the…
Reliability of upper and lower extremity anthropometric measurements and the effect on tissue mass predictions.

PubMed

Burkhart, Timothy A; Arthurs, Katherine L; Andrews, David M

2008-01-01

Accurate modeling of soft tissue motion effects relative to bone during impact requires knowledge of the mass of soft and rigid tissues in living people. Holmes et al., [2005. Predicting in vivo soft tissue masses of the lower extremity using segment anthropometric measures and DXA. Journal of Applied Biomechanics, 21, 371-382] developed and validated regression equations to predict the individual tissue masses of lower extremity segments of young healthy adults, based on simple anthropometric measurements. However, the reliability of these measurements and the effect on predicted tissue mass estimates from the equations has yet to be determined. In the current study, two measurers were responsible for collecting two sets of unilateral measurements (25 male and 25 female subjects) for the right upper and lower extremities. These included 6 lengths, 6 circumferences, 8 breadths, and 4 skinfold thicknesses. Significant differences were found between measurers and between sexes, but these differences were relatively small in general (75-80% of between-measurer differences were <1cm). Within-measurer measurement differences were smaller and more consistent than those between measurers in most cases. Good to excellent reliability was demonstrated for all measurement types, with intra-class correlation coefficients of 0.79, 0.86, 0.85 and 0.86 for lengths, circumferences, breadth and skinfolds, respectively. Predicted tissue mass magnitudes were moderately affected by the measurement differences. The maximum mean errors between measurers ranged from 3.2% to 24.2% for bone mineral content and fat mass, for the leg and foot, and the leg segments, respectively.
The validity and reliability of the Functional Strength Measurement (FSM) in children with intellectual disabilities.

PubMed

Aertssen, W F M; Steenbergen, B; Smits-Engelsman, B C M

2018-06-07

There is lack of valid and reliable field-based tests for assessing functional strength in young children with mild intellectual disabilities (IDs). The aim of this study was to investigate the test-retest reliability and construct validity of the Functional Strength Measurement in children with ID (FSM-ID). Fifty-two children with mild ID (40 boys and 12 girls, mean age 8.48 years, SD = 1.48) were tested with the FSM. Test-retest reliability (n = 32) was examined by a two-way interclass correlation coefficient for agreement (ICC 2.1A). Standard error of measurement and smallest detectable change were calculated. Construct validity was determined by calculating correlations between the FSM-ID and handheld dynamometry (HHD) (convergent validity), FSM-ID, FSM-ID and subtest strength of the Bruininks-Oseretsky test of motor proficiency - second edition (BOT-2) (convergent validity) and the FSM-ID and balance subtest of the BOT-2 (discriminant validity). Test-retest reliability ICC ranged 0.89-0.98. Correlation between the items of the FSM-ID and HHD ranged 0.39-0.79 and between FSM-ID and BOT-2 (strength items) 0.41-0.80. Correlation between items of the FSM-ID and BOT-2 (balance items) ranged 0.41-0.70. The FSM-ID showed good test-retest reliability and good convergent validity with the HHD and BOT-2 subtest strength. The correlations assessing discriminant validity were higher than expected. Poor levels of postural control and core stability in children with mild IDs may be the underlying factor of those higher correlations. © 2018 MENCAP and International Association of the Scientific Study of Intellectual and Developmental Disabilities and John Wiley & Sons Ltd.
Validity and Reliability of Dynamic Visual Acuity (DVA) Measurement During Walking

NASA Technical Reports Server (NTRS)

Deshpande, Nandini; Peters, Brian T.; Bloomberg, Jacob J.

2014-01-01

DVA is primarily subserved by the vestibulo-ocular reflex mechanism. Individuals with vestibular hypofunction commonly experience highly debilitating illusory movement or blurring of visual images during daily activities possibly, due to impaired DVA. Even without pathologies, gradual age-related morphological deterioration is evident in all components of the vestibular system. We examined the construct validity to detect age-related differences and test-retest reliability of DVA measurements performed during walking. METHODS: Healthy adults were recruited into 3 groups: 1. young (20-39years, n=18), 2. middle-aged (40-59years, n=14), and 3. older adults (60-80years, n=15). Randomly selected seven participants from each group (n=21) participated in retesting. Participants were excluded if they had a history of vestibular or neuromuscular pathologies, dizziness/vertigo or >1 falls in the past year. Older persons with MMSE scores <29/30 were excluded to minimize cognitive errors. Participants' age, height, weight and normal walking speed were recorded. The binocular DVA was measured while walking on a treadmill at 0.8 m/s, 1.0 m/s and 1.2 m/s speeds. The walking speeds chosen represent a range of slow to moderate walking speeds for adult life span in participants who have no current mobility problems. The monitor that displayed Landolt 'C' optotypes was placed at 50 cm from the eyes for nearDVA (primary compensation by otolith organs) and at 3.0 m for farDVA (primary compensation by semicircular canals). A mixed factor ANOVA (age group x speed) was performed separately for the Near and FarDVA for detecting group differences. Intraclass correlation coefficients (ICCs) were calculated for each condition to determine test-retest reliability. RESULTS: The three age groups were not different in their height, weight and normal walking speed (p>0.05). The post hoc analyses for DVA measurements demonstrated that each group was significantly different from the other two groups
Measuring the validity and reliability of the Apple Watch as a physical activity monitor.

PubMed

Zhang, Peng; Godin, Steven D; Owens, Matthew V

2018-04-04

This study aimed to investigate the validity and reliability of the energy expenditure (EE) estimation of Apple Watch among college students. Thirty college students completed two sets of three 10-minute treadmill walking and running trials while wearing three Apple Watches and being connected to indirect calorimetry. The walking trials were at speeds of 54, 80, and 107 m•min-1 while the running trials were at 134, 161, 188m•min-1. Energy expenditure comparisons were made using Two-way ANOVA with repeatedmeasures. Reliability was analyzed by Intraclass Correlation. There was no significant device x speed interactions (F (15, 696) = 1.113, p = 0.341) between the indirect calorimetry (criterion) and Apple Watch. The lowest Inter-Class Correlation (ICC) scores were 0.49 (95%CI) at 54 while the highest were 0.72 (95%CI) at 107 and 134 m•min-1. Apple Watch demonstrated a low to moderate validity and reliability on measuring EE.
Whole Arm Water Displacement Volumetry Is a Reliable and Sensitive Measure: A Pilot to Assess Acute Postburn Volume Change.

PubMed

Edgar, Dale W; Briffa, N Kathy; Wood, Fiona M

Water displacement volumetry (WDV) is a reliable method for measurement of wrist and hand volume in lymphedema patients. However, within session WDV reliability for the whole upper limb (UL) lacks comprehensive investigation, particularly in acute edema populations. This study aimed to confirm the reliability and investigate the impact of time between repeated trials on the sensitivity of WDV as a measure of whole UL volume change in an uninjured cohort and a burn injured pilot group. Within session, duplicate measures of whole UL WDV were recorded in two groups of noninjured volunteers and a group of burn patients. Each noninjured group differed only in the time between WDV repeats. The reliability trials were performed <10 minutes apart (T10) and 20 to 30 minutes apart (T20). The time between repetitions for burn patients was 20 to 30 minutes, based on the results of the noninjured participant trials. All trial groups demonstrated excellent correlation between trials (ICCT10 = 0.999, ICCT20 = 0.997). The minimum detectable difference calculated for WDV when measuring whole UL volume change of >50 ml for noninjured and >100 ml for burn patients. Despite this, a systematic bias was demonstrated between the T10 group means. The T20 group trials did not indicate such error on statistical testing (P = .297). The study confirms that WDV measurement of whole ULs is reliable and sensitive, if used at least 20 minutes apart. However, a significant and clinically relevant subject-by-method interaction was demonstrated. Researchers and clinicians are reminded to be aware of the performance of the technique when designing investigations in patient populations.
Diluted thrombin time reliably measures low to intermediate plasma dabigatran concentrations.

PubMed

Božič-Mijovski, Mojca; Malmström, Rickard E; Malovrh, Petra; Antovic, Jovan P; Vene, Nina; Šinigoj, Petra; Mavri, Alenka

2016-07-01

Direct oral anticoagulant dabigatran was first introduced as a fixed-dose drug without routine coagulation monitoring, but current recommendations suggest that diluted thrombin time can be used to estimate plasma drug level. The aim of this study was to assess a diluted thrombin time assay based on the same thrombin reagent already used for traditional thrombin time measurements that reliably measure low to intermediate plasma dabigatran levels. We included 44 patients with atrial fibrillation who started treatment with dabigatran 150 mg (23 patients) or 110 mg (21 patients) twice a day. Blood samples were collected at baseline (no dabigatran) and 2-4 weeks after the beginning of dabigatran therapy at trough and at peak. Plasma dabigatran levels were measured with diluted thrombin time and compared to liquid chromatography with tandem mass spectrometry as the reference method. The performance of the diluted thrombin time was compared to Hemoclot® Thrombin Inhibitor and Ecarin Chromogenic Assay. In ex vivo plasma samples, diluted thrombin time highly correlated with the liquid chromatography with tandem mass spectrometry (Pearson's R = 0.9799). In the low to intermediate range (dabigatran concentration ≤ 100 µg/L) diluted thrombin time correlated significantly more closely to the liquid chromatography with tandem mass spectrometry (R = 0.964) than Hemoclot® Thrombin Inhibitor (R = 0.935, p = 0.05) or Ecarin Chromogenic Assay (R = 0.915, p < 0.01). It was also the only functional assay without any significant bias in the low to intermediate range. Both trough and peak diluted thrombin time values were similar to liquid chromatography with tandem mass spectrometry. We conclude that the diluted thrombin time assay presented in this study reliably detects dabigatran and that it is superior to the Hemoclot® Thrombin Inhibitor assay in the low to intermediate range. © The Author(s) 2015.
The Influence of Resistance Training Experience on the Between-Day Reliability of Commonly Used Strength Measures in Male Youth Athletes.

PubMed

Weakley, Jonathon J S; Till, Kevin; Darrall-Jones, Joshua; Roe, Gregory A B; Phibbs, Padraic J; Read, Dale B; Jones, Ben L

2017-07-01

Weakley, JJS, Till, K, Darrall-Jones, J, Roe, GAB, Phibbs, PJ, Read, DB, and Jones, BL. The influence of resistance training experience on the between-day reliability of commonly used strength measures in male youth athletes. J Strength Cond Res 31(7): 2005-2010, 2017-The purpose of this study was to determine the between-day reliability of commonly used strength measures in male youth athletes while considering resistance training experience. Data were collected on 25 male athletes over 2 testing sessions, with 72 hours rest between, for the 3 repetition maximum (3RM) front squat, chin-up, and bench press. Subjects were initially categorized by resistance training experience (inexperienced; 6-12 months, experienced; >2 years). The assessment of the between-day reliability (coefficient of variation [CV%]) showed that the front squat (experienced: 2.90%; inexperienced: 1.90%), chin-up (experienced: 1.70%; inexperienced: 1.90%), and bench press (experienced: 4.50%; inexperienced: 2.40%) were all reliable measures of strength in both groups. Comparison between groups for the error of measurement for each exercise showed trivial differences. When both groups were combined, the CV% for the front squat, bench press, and chin-up were 2.50, 1.80, and 3.70%, respectively. This study provides scientists and practitioners with the between-day reliability reference data to determine real and practical changes for strength in male youth athletes with different resistance training experience. Furthermore, this study demonstrates that 3RM front squat, chin-up, and bench press are reliable exercises to quantify strength in male youth athletes.
The Children's Play Therapy Instrument (CPTI): Description, Development, and Reliability Studies

PubMed Central

Kernberg, Paulina F.; Chazan, Saralea E.; Normandin, Lina

1998-01-01

The Children's Play Therapy Instrument (CPTI), its development, and reliability studies are described. The CPTI is a new instrument to examine a child's play activity in individual psychotherapy. Three independent raters used the CPTI to rate eight videotaped play therapy vignettes. Results were compared with the authors' consensual scores from a preliminary study. Generally good to excellent levels of interrater reliability were obtained for the independent raters on intraclass correlation coefficients for ordinal categories of the CPTI. Likewise, kappa levels were acceptable to excellent for nominal categories of the scale. The CPTI holds promise to become a reliable measure of play activity in child psychotherapy. Further research is needed to assess discriminant validity of the CPTI for use as a diagnostic tool and as a measure of process and outcome.(The Journal of Psychotherapy Practice and Research 1998; 7:196–207) PMID:9631341
Test Reliability at the Individual Level

PubMed Central

Hu, Yueqin; Nesselroade, John R.; Erbacher, Monica K.; Boker, Steven M.; Burt, S. Alexandra; Keel, Pamela K.; Neale, Michael C.; Sisk, Cheryl L.; Klump, Kelly

2016-01-01

Reliability has a long history as one of the key psychometric properties of a test. However, a given test might not measure people equally reliably. Test scores from some individuals may have considerably greater error than others. This study proposed two approaches using intraindividual variation to estimate test reliability for each person. A simulation study suggested that the parallel tests approach and the structural equation modeling approach recovered the simulated reliability coefficients. Then in an empirical study, where forty-five females were measured daily on the Positive and Negative Affect Schedule (PANAS) for 45 consecutive days, separate estimates of reliability were generated for each person. Results showed that reliability estimates of the PANAS varied substantially from person to person. The methods provided in this article apply to tests measuring changeable attributes and require repeated measures across time on each individual. This article also provides a set of parallel forms of PANAS. PMID:28936107

Some links on this page may take you to non-federal websites. Their policies may differ from this site.