correlation coefficients iccs: Topics by Science.gov

Sample records for correlation coefficients iccs

Choosing the best index for the average score intraclass correlation coefficient.

PubMed

Shieh, Gwowen

2016-09-01

The intraclass correlation coefficient (ICC)(2) index from a one-way random effects model is widely used to describe the reliability of mean ratings in behavioral, educational, and psychological research. Despite its apparent utility, the essential property of ICC(2) as a point estimator of the average score intraclass correlation coefficient is seldom mentioned. This article considers several potential measures and compares their performance with ICC(2). Analytical derivations and numerical examinations are presented to assess the bias and mean square error of the alternative estimators. The results suggest that more advantageous indices can be recommended over ICC(2) for their theoretical implication and computational ease.
Confidence Intervals and "F" Tests for Intraclass Correlation Coefficients Based on Three-Way Mixed Effects Models

ERIC Educational Resources Information Center

Zhou, Hong; Muellerleile, Paige; Ingram, Debra; Wong, Seok P.

2011-01-01

Intraclass correlation coefficients (ICCs) are commonly used in behavioral measurement and psychometrics when a researcher is interested in the relationship among variables of a common class. The formulas for deriving ICCs, or generalizability coefficients, vary depending on which models are specified. This article gives the equations for…
A comparison of two indices for the intraclass correlation coefficient.

PubMed

Shieh, Gwowen

2012-12-01

In the present study, we examined the behavior of two indices for measuring the intraclass correlation in the one-way random effects model: the prevailing ICC(1) (Fisher, 1938) and the corrected eta-squared (Bliese & Halverson, 1998). These two procedures differ both in their methods of estimating the variance components that define the intraclass correlation coefficient and in their performance of bias and mean squared error in the estimation of the intraclass correlation coefficient. In contrast with the natural unbiased principle used to construct ICC(1), in the present study it was analytically shown that the corrected eta-squared estimator is identical to the maximum likelihood estimator and the pairwise estimator under equal group sizes. Moreover, the empirical results obtained from the present Monte Carlo simulation study across various group structures revealed the mutual dominance relationship between their truncated versions for negative values. The corrected eta-squared estimator performs better than the ICC(1) estimator when the underlying population intraclass correlation coefficient is small. Conversely, ICC(1) has a clear advantage over the corrected eta-squared for medium and large magnitudes of population intraclass correlation coefficient. The conceptual description and numerical investigation provide guidelines to help researchers choose between the two indices for more accurate reliability analysis in multilevel research.
Reliability of environmental sampling culture results using the negative binomial intraclass correlation coefficient.

PubMed

Aly, Sharif S; Zhao, Jianyang; Li, Ben; Jiang, Jiming

2014-01-01

The Intraclass Correlation Coefficient (ICC) is commonly used to estimate the similarity between quantitative measures obtained from different sources. Overdispersed data is traditionally transformed so that linear mixed model (LMM) based ICC can be estimated. A common transformation used is the natural logarithm. The reliability of environmental sampling of fecal slurry on freestall pens has been estimated for Mycobacterium avium subsp. paratuberculosis using the natural logarithm transformed culture results. Recently, the negative binomial ICC was defined based on a generalized linear mixed model for negative binomial distributed data. The current study reports on the negative binomial ICC estimate which includes fixed effects using culture results of environmental samples. Simulations using a wide variety of inputs and negative binomial distribution parameters (r; p) showed better performance of the new negative binomial ICC compared to the ICC based on LMM even when negative binomial data was logarithm, and square root transformed. A second comparison that targeted a wider range of ICC values showed that the mean of estimated ICC closely approximated the true ICC.
A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research.

PubMed

Koo, Terry K; Li, Mae Y

2016-06-01

Intraclass correlation coefficient (ICC) is a widely used reliability index in test-retest, intrarater, and interrater reliability analyses. This article introduces the basic concept of ICC in the content of reliability analysis. There are 10 forms of ICCs. Because each form involves distinct assumptions in their calculation and will lead to different interpretations, researchers should explicitly specify the ICC form they used in their calculation. A thorough review of the research design is needed in selecting the appropriate form of ICC to evaluate reliability. The best practice of reporting ICC should include software information, "model," "type," and "definition" selections. When coming across an article that includes ICC, readers should first check whether information about the ICC form has been reported and if an appropriate ICC form was used. Based on the 95% confident interval of the ICC estimate, values less than 0.5, between 0.5 and 0.75, between 0.75 and 0.9, and greater than 0.90 are indicative of poor, moderate, good, and excellent reliability, respectively. This article provides a practical guideline for clinical researchers to choose the correct form of ICC and suggests the best practice of reporting ICC parameters in scientific publications. This article also gives readers an appreciation for what to look for when coming across ICC while reading an article.
R package to estimate intracluster correlation coefficient with confidence interval for binary data.

PubMed

Chakraborty, Hrishikesh; Hossain, Akhtar

2018-03-01

The Intracluster Correlation Coefficient (ICC) is a major parameter of interest in cluster randomized trials that measures the degree to which responses within the same cluster are correlated. There are several types of ICC estimators and its confidence intervals (CI) suggested in the literature for binary data. Studies have compared relative weaknesses and advantages of ICC estimators as well as its CI for binary data and suggested situations where one is advantageous in practical research. The commonly used statistical computing systems currently facilitate estimation of only a very few variants of ICC and its CI. To address the limitations of current statistical packages, we developed an R package, ICCbin, to facilitate estimating ICC and its CI for binary responses using different methods. The ICCbin package is designed to provide estimates of ICC in 16 different ways including analysis of variance methods, moments based estimation, direct probabilistic methods, correlation based estimation, and resampling method. CI of ICC is estimated using 5 different methods. It also generates cluster binary data using exchangeable correlation structure. ICCbin package provides two functions for users. The function rcbin() generates cluster binary data and the function iccbin() estimates ICC and it's CI. The users can choose appropriate ICC and its CI estimate from the wide selection of estimates from the outputs. The R package ICCbin presents very flexible and easy to use ways to generate cluster binary data and to estimate ICC and it's CI for binary response using different methods. The package ICCbin is freely available for use with R from the CRAN repository (https://cran.r-project.org/package=ICCbin). We believe that this package can be a very useful tool for researchers to design cluster randomized trials with binary outcome. Copyright © 2017 Elsevier B.V. All rights reserved.
Tutorial on use of intraclass correlation coefficients for assessing intertest reliability and its application in functional near-infrared spectroscopy-based brain imaging

NASA Astrophysics Data System (ADS)

Li, Lin; Zeng, Li; Lin, Zi-Jing; Cazzell, Mary; Liu, Hanli

2015-05-01

Test-retest reliability of neuroimaging measurements is an important concern in the investigation of cognitive functions in the human brain. To date, intraclass correlation coefficients (ICCs), originally used in inter-rater reliability studies in behavioral sciences, have become commonly used metrics in reliability studies on neuroimaging and functional near-infrared spectroscopy (fNIRS). However, as there are six popular forms of ICC, the adequateness of the comprehensive understanding of ICCs will affect how one may appropriately select, use, and interpret ICCs toward a reliability study. We first offer a brief review and tutorial on the statistical rationale of ICCs, including their underlying analysis of variance models and technical definitions, in the context of assessment on intertest reliability. Second, we provide general guidelines on the selection and interpretation of ICCs. Third, we illustrate the proposed approach by using an actual research study to assess intertest reliability of fNIRS-based, volumetric diffuse optical tomography of brain activities stimulated by a risk decision-making protocol. Last, special issues that may arise in reliability assessment using ICCs are discussed and solutions are suggested.
Repeatability, interocular correlation and agreement of quantitative swept-source optical coherence tomography angiography macular metrics in healthy subjects.

PubMed

Fang, Danqi; Tang, Fang Yao; Huang, Haifan; Cheung, Carol Y; Chen, Haoyu

2018-05-29

To investigate the repeatability, interocular correlation and agreement of quantitative swept-source optical coherence tomography angiography (SS-OCTA) metrics in healthy subjects. Thirty-three healthy normal subjects were enrolled. The macula was scanned four times by an SS-OCTA system using the 3 mm×3 mm mode. The superficial capillary map images were analysed using a MATLAB program. A series of parameters were measured: foveal avascular zone (FAZ) area, FAZ perimeter, FAZ circularity, parafoveal vessel density, fractal dimension and vessel diameter index (VDI). The repeatability of four scans was determined by intraclass correlation coefficient (ICC). Then the averaged results were analysed for intereye difference, correlation and agreement using paired t-test, Pearson's correlation coefficient (r), ICC and Bland-Altman plot. The repeatability assessment of the macular metrics exported high ICC values (ranged from 0.853 to 0.996). There is no statistically significant difference in the OCTA metrics between the two eyes. FAZ area (ICC=0.961, r=0.929) and FAZ perimeter (ICC=0.884, r=0.802) showed excellent binocular correlation. Fractal dimension (ICC=0.732, r=0.578) and VDI (ICC=0.707, r=0.547) showed moderate binocular correlation, while parafoveal vessel density had poor binocular correlation. Bland-Altman plots showed the range of agreement was from -0.0763 to 0.0954 mm 2 for FAZ area and from -0.0491 to 0.1136 for parafoveal vessel density. The macular metrics obtained using SS-OCTA showed excellent repeatability in healthy subjects. We showed high intereye correlation in FAZ area and perimeter, moderate correlation in fractal dimension and VDI, while vessel density had poor correlation in normal healthy subjects. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Performance of intraclass correlation coefficient (ICC) as a reliability index under various distributions in scale reliability studies.

PubMed

Mehta, Shraddha; Bastero-Caballero, Rowena F; Sun, Yijun; Zhu, Ray; Murphy, Diane K; Hardas, Bhushan; Koch, Gary

2018-04-29

Many published scale validation studies determine inter-rater reliability using the intra-class correlation coefficient (ICC). However, the use of this statistic must consider its advantages, limitations, and applicability. This paper evaluates how interaction of subject distribution, sample size, and levels of rater disagreement affects ICC and provides an approach for obtaining relevant ICC estimates under suboptimal conditions. Simulation results suggest that for a fixed number of subjects, ICC from the convex distribution is smaller than ICC for the uniform distribution, which in turn is smaller than ICC for the concave distribution. The variance component estimates also show that the dissimilarity of ICC among distributions is attributed to the study design (ie, distribution of subjects) component of subject variability and not the scale quality component of rater error variability. The dependency of ICC on the distribution of subjects makes it difficult to compare results across reliability studies. Hence, it is proposed that reliability studies should be designed using a uniform distribution of subjects because of the standardization it provides for representing objective disagreement. In the absence of uniform distribution, a sampling method is proposed to reduce the non-uniformity. In addition, as expected, high levels of disagreement result in low ICC, and when the type of distribution is fixed, any increase in the number of subjects beyond a moderately large specification such as n = 80 does not have a major impact on ICC. Copyright © 2018 John Wiley & Sons, Ltd.
The validity and reliability of the my jump 2 app for measuring the reactive strength index and drop jump performance.

PubMed

Haynes, Tom; Bishop, Chris; Antrobus, Mark; Brazier, Jon

2018-03-27

This is the first study to independently assess the concurrent validity and reliability of the My Jump 2 app for measuring drop jump performance. It is also the first to evaluate the app's ability to measure the reactive strength index (RSI). Fourteen male sport science students (age: 29.5 ± 9.9 years) performed three drop jumps from 20 cm and 40 cm (totalling 84 jumps), assessed via a force platform and the My Jump 2 app. Reported metrics included reactive strength index, jump height, ground contact time, and mean power. Measurements from both devices were compared using the intraclass correlation coefficient (ICC), Pearson product moment correlation coefficient (r), Cronbach's alpha (α), coefficient of variation (CV) and BlandAltman plots. Near perfect agreement was seen between devices at 20 cm for RSI (ICC = 0.95) and contact time (ICC = 0.99) and at 40 cm for RSI (ICC = 0.98), jump height (ICC = 0.96) and contact time (ICC = 0.92); with very strong agreement seen at 20 cm for jump height (ICC = 0.80). In comparison with the force plate the app showed good validity for RSI (20 cm: r = 0.94; 40 cm; r = 0.97), jump height (20 cm: r = 0.80; 40 cm; r = 0.96) and contact time (20 cm = 0.96; 40 cm; r = 0.98). The results of the present study show that the My Jump 2 app is a valid and reliable tool for assessing drop jump performance.
Validity and reliability of International Physical Activity Questionnaire-Short Form in Chinese youth.

PubMed

Wang, Chao; Chen, Peijie; Zhuang, Jie

2013-12-01

The psychometric profiles of the widely used International Physical Activity Questionnaire-Short Form (IPAQ-SF) in Chinese youth have not been reported. The purpose of this study was to examine the validity and reliability of the IPAQ-SF using a sample of Chinese youth. One thousand and twenty-one youth (M(age) = 14.26 +/- 1.63 years, 52.8% boys) from 11 cities in China wore accelerometers for 7 consecutive days and completed the IPAQ-SF on the 8th day to recall their physical activity (PA) during accelerometer-wearing days. A subsample of 92 youth (M(age) = 15.90 +/- 1.35 years, 46.7% boys) completed the IPAQ-SF again a week later to recall their PA during accelerometer-wearing days. Differences in PA estimated by the IPAQ-SF and accelerometer were examined by paired-sample t test. Spearman correlation coefficients were used to examine the correlation between the IPAQ-SF and accelerometer. Test-retest reliability of the IPAQ-SF was determined by the intraclass correlation coefficient (ICC). Compared with accelerometer, the IPAQ-SF overestimated sedentary time, moderate PA (MPA), vigorous PA (VPA), and moderate-to-vigorous PA (MVPA). Correlations between PA (total PA, MPA, VPA, and MVPA) and sedentary time measured by 2 instruments ranged from "none" to "low" (p = .08-.31). Test-retest ICC of the IPAQ-SF ranged from "moderate" to "high" (ICC = .43-.83), except for sitting in boys (ICC = .06), sitting for the whole sample (ICC = .32), and VPA in girls (ICC = .35). The IPAQ-SF was not a valid instrument for measuring PA and sedentary behavior in Chinese youth.
On the estimation of intracluster correlation for time-to-event outcomes in cluster randomized trials.

PubMed

Kalia, Sumeet; Klar, Neil; Donner, Allan

2016-12-30

Cluster randomized trials (CRTs) involve the random assignment of intact social units rather than independent subjects to intervention groups. Time-to-event outcomes often are endpoints in CRTs. Analyses of such data need to account for the correlation among cluster members. The intracluster correlation coefficient (ICC) is used to assess the similarity among binary and continuous outcomes that belong to the same cluster. However, estimating the ICC in CRTs with time-to-event outcomes is a challenge because of the presence of censored observations. The literature suggests that the ICC may be estimated using either censoring indicators or observed event times. A simulation study explores the effect of administrative censoring on estimating the ICC. Results show that ICC estimators derived from censoring indicators or observed event times are negatively biased. Analytic work further supports these results. Observed event times are preferred to estimate the ICC under minimum frequency of administrative censoring. To our knowledge, the existing literature provides no practical guidance on the estimation of ICC when substantial amount of administrative censoring is present. The results from this study corroborate the need for further methodological research on estimating the ICC for correlated time-to-event outcomes. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
Pitfalls and important issues in testing reliability using intraclass correlation coefficients in orthopaedic research.

PubMed

Lee, Kyoung Min; Lee, Jaebong; Chung, Chin Youb; Ahn, Soyeon; Sung, Ki Hyuk; Kim, Tae Won; Lee, Hui Jong; Park, Moon Seok

2012-06-01

Intra-class correlation coefficients (ICCs) provide a statistical means of testing the reliability. However, their interpretation is not well documented in the orthopedic field. The purpose of this study was to investigate the use of ICCs in the orthopedic literature and to demonstrate pitfalls regarding their use. First, orthopedic articles that used ICCs were retrieved from the Pubmed database, and journal demography, ICC models and concurrent statistics used were evaluated. Second, reliability test was performed on three common physical examinations in cerebral palsy, namely, the Thomas test, the Staheli test, and popliteal angle measurement. Thirty patients were assessed by three orthopedic surgeons to explore the statistical methods testing reliability. Third, the factors affecting the ICC values were examined by simulating the data sets based on the physical examination data where the ranges, slopes, and interobserver variability were modified. Of the 92 orthopedic articles identified, 58 articles (63%) did not clarify the ICC model used, and only 5 articles (5%) described all models, types, and measures. In reliability testing, although the popliteal angle showed a larger mean absolute difference than the Thomas test and the Staheli test, the ICC of popliteal angle was higher, which was believed to be contrary to the context of measurement. In addition, the ICC values were affected by the model, type, and measures used. In simulated data sets, the ICC showed higher values when the range of data sets were larger, the slopes of the data sets were parallel, and the interobserver variability was smaller. Care should be taken when interpreting the absolute ICC values, i.e., a higher ICC does not necessarily mean less variability because the ICC values can also be affected by various factors. The authors recommend that researchers clarify ICC models used and ICC values are interpreted in the context of measurement.
Pulmonary disease in cystic fibrosis: assessment with chest CT at chest radiography dose levels.

PubMed

Ernst, Caroline W; Basten, Ines A; Ilsen, Bart; Buls, Nico; Van Gompel, Gert; De Wachter, Elke; Nieboer, Koenraad H; Verhelle, Filip; Malfroot, Anne; Coomans, Danny; De Maeseneer, Michel; de Mey, Johan

2014-11-01

To investigate a computed tomographic (CT) protocol with iterative reconstruction at conventional radiography dose levels for the assessment of structural lung abnormalities in patients with cystic fibrosis ( CF cystic fibrosis ). In this institutional review board-approved study, 38 patients with CF cystic fibrosis (age range, 6-58 years; 21 patients <18 years and 17 patients >18 years) underwent investigative CT (at minimal exposure settings combined with iterative reconstruction) as a replacement of yearly follow-up posteroanterior chest radiography. Verbal informed consent was obtained from all patients or their parents. CT images were randomized and rated independently by two radiologists with use of the Bhalla scoring system. In addition, mosaic perfusion was evaluated. As reference, the previous available conventional chest CT scan was used. Differences in Bhalla scores were assessed with the χ(2) test and intraclass correlation coefficients ( ICC intraclass correlation coefficient s). Radiation doses for CT and radiography were assessed for adults (>18 years) and children (<18 years) separately by using technical dose descriptors and estimated effective dose. Differences in dose were assessed with the Mann-Whitney U test. The median effective dose for the investigative protocol was 0.04 mSv (95% confidence interval [ CI confidence interval ]: 0.034 mSv, 0.10 mSv) for children and 0.05 mSv (95% CI confidence interval : 0.04 mSv, 0.08 mSv) for adults. These doses were much lower than those with conventional CT (median: 0.52 mSv [95% CI confidence interval : 0.31 mSv, 3.90 mSv] for children and 1.12 mSv [95% CI confidence interval : 0.57 mSv, 3.15 mSv] for adults) and of the same order of magnitude as those for conventional radiography (median: 0.012 mSv [95% CI confidence interval : 0.006 mSv, 0.022 mSv] for children and 0.012 mSv [95% CI confidence interval : 0.005 mSv, 0.031 mSv] for adults). All images were rated at least as diagnostically acceptable. Very good agreement was found in overall Bhalla score ( ICC intraclass correlation coefficient , 0.96) with regard to the severity of bronchiectasis ( ICC intraclass correlation coefficient , 0.87) and sacculations and abscesses ( ICC intraclass correlation coefficient , 0.84). Interobserver agreement was excellent ( ICC intraclass correlation coefficient , 0.86-1). For patients with CF cystic fibrosis , a dedicated chest CT protocol can replace the two yearly follow-up chest radiographic examinations without major dose penalty and with similar diagnostic quality compared with conventional CT.
Intraherd correlation coefficients and design effects for bovine viral diarrhoea, infectious bovine rhinotracheitis, leptospirosis and neosporosis in cow-calf system herds in North-eastern Mexico.

PubMed

Segura-Correa, J C; Domínguez-Díaz, D; Avalos-Ramírez, R; Argaez-Sosa, J

2010-09-01

Knowledge of the intraherd correlation coefficient (ICC) and design (D) effect for infectious diseases could be of interest in sample size calculation and to provide the correct standard errors of prevalence estimates in cluster or two-stage samplings surveys. Information on 813 animals from 48 non-vaccinated cow-calf herds from North-eastern Mexico was used. The ICC for the bovine viral diarrhoea (BVD), infectious bovine rhinotracheitis (IBR), leptospirosis and neosporosis diseases were calculated using a Bayesian approach adjusting for the sensitivity and specificity of the diagnostic tests. The ICC and D values for BVD, IBR, leptospirosis and neosporosis were 0.31 and 5.91, 0.18 and 3.88, 0.22 and 4.53, and 0.11 and 2.68, respectively. The ICC and D values were different from 0 and D greater than 1, therefore large sample sizes are required to obtain the same precision in prevalence estimates than for a random simple sampling design. The report of ICC and D values is of great help in planning and designing two-stage sampling studies. 2010 Elsevier B.V. All rights reserved.
Translation, cultural adaptation and validation into portuguese (Brazil) in Systemic Sclerosis Questionnaire (SySQ).

PubMed

Machado, Roberta Ismael Lacerda; Souto, Lais Medeiros; Freire, Eutilia Andrade Medeiros

2014-01-01

Systemic sclerosis (SSc) is a multisystem disease, autoimmune disorder characterized by a fibroblastic disfunction, with significant impact on quality of life (QoL), measured by instruments or questionnaires that usually were formulated in other languages and in different cultural contexts. Translate into Brazilian Portuguese, cross cultural adaptation and assess the reliability and validity of the Systemic Sclerosis Questionnaire (SySQ). Translation and adaptation: into Portuguese and cross-cultural adaptation was performed in accordance with studies on questionnaire translation methodology into other languages. Reliability: it was analyzed using three interviews with different interviewers, two on the same day (interobserver) and the third within 14 days of the first assessment (intraobserver).Validity was assessed by correlating clinical and quality of life parameters with the domain scores of Sysc. a descriptive analysis of the study sample. Reproducibility was assessed using an intraclass correlation coefficient (ICC). Internal consistency was assessed using Cronbach's alpha coefficient. To assess validity we used Spearman correlation coefficient. Five percent was the level of significance adopted for all statistical tests. In the evaluation of the questionnaires, the results were similar to the original questionnaire, the internal consistency ranging between 0.73 and 0.93 for each item. The interobserver reproducibility was very good for all domains (α = 0.786 to 0.983) and intraobserver agreement was considered very good for general symptoms domain (ICC = 0.916), good for musculoskeletal symptoms domain (ICC = 0.897) and cardiopulmonary domain (ICC = 0.842) and reasonable for gastrointestinal symptoms domain (ICC = 0.686). The Brazilian Portuguese version of SySQ proved to be reproducible and valid for our population, using a recognized methodology for translation and cultural adaptation of questionnaires, as well as to assess the reproducibility and validity.
The Scarbase Duo(®): Intra-rater and inter-rater reliability and validity of a compact dual scar assessment tool.

PubMed

Fell, Matthew; Meirte, Jill; Anthonissen, Mieke; Maertens, Koen; Pleat, Jonathon; Moortgat, Peter

2016-03-01

Objective scar assessment tools were designed to help identify problematic scars and direct clinical management. Their use has been restricted by their measurement of a single scar property and the bulky size of equipment. The Scarbase Duo(®) was designed to assess both trans-epidermal water loss (TEWL) and colour of a burn scar whilst being compact and easy to use. Twenty patients with a burn scar were recruited and measurements taken using the Scarbase Duo(®) by two observers. The Scarbase Duo(®) measures TEWL via an open-chamber system and undertakes colorimetry via narrow-band spectrophotometry, producing values for relative erythema and melanin pigmentation. Validity was assessed by comparing the Scarbase Duo(®) against the Dermalab(®) and the Minolta Chromameter(®) respectively for TEWL and colorimetry measurements. The intra-class correlation coefficient (ICC) was used to assess reliability with standard error of measurement (SEM) used to assess reproducibility of measurements. The Pearson correlation coefficient (r) was used to assess the convergent validity. The Scarbase Duo(®) TEWL mode had excellent reliability when used on scars for both intra- (ICC=0.95) and inter-rater (ICC=0.96) measurements with moderate SEM values. The erythema component of the colorimetry mode showed good reliability for use on scars for both intra-(ICC=0.81) and inter-rater (ICC=0.83) measurements with low SEM values. Pigmentation values showed excellent reliability on scar tissue for both intra- (ICC=0.97) and inter-rater (ICC=0.97) with moderate SEM values. The Scarbase Duo(®) TEWL function had excellent correlation with the Dermalab(®) (r=0.93) whilst the colorimetry erythema value had moderate correlation with the Minolta Chromameter (r=0.72). The Scarbase Duo(®) is a reliable and objective scar assessment tool, which is specifically designed for burn scars. However, for clinical use, standardised measurement conditions are recommended. Copyright © 2015 Elsevier Ltd and ISBI. All rights reserved.
Agreement in functional assessment: graphic approaches to displaying respondent effects.

PubMed

Haley, Stephen M; Ni, Pengsheng; Coster, Wendy J; Black-Schaffer, Randie; Siebens, Hilary; Tao, Wei

2006-09-01

The objective of this study was to examine the agreement between respondents of summary scores from items representing three functional content areas (physical and mobility, personal care and instrumental, applied cognition) within the Activity Measure for Postacute Care (AM-PAC). We compare proxy vs. patient report in both hospital and community settings as represented by intraclass correlation coefficients and two graphic approaches. The authors conducted a prospective, cohort study of a convenience sample of adults (n = 47) receiving rehabilitation services either in hospital (n = 31) or community (n = 16) settings. In addition to using intraclass correlation coefficients (ICC) as indices of agreement, we applied two graphic approaches to serve as complements to help interpret the direction and magnitude of respondent disagreements. We created a "mountain plot" based on a cumulative distribution curve and a "survival-agreement plot" with step functions used in the analysis of survival data. ICCs on summary scores between patient and proxy report were physical and mobility ICC = 0.92, personal care and instrumental ICC = 0.93, and applied cognition ICC = 0.77. Although combined respondent agreement was acceptable, graphic approaches helped interpret differences in separate analyses of clinician and family agreement. Graphic analyses allow for a simple interpretation of agreement data and may be useful in determining the meaningfulness of the amount and direction of interrespondent variation.
Dealing with Dependence (Part I): Understanding the Effects of Clustered Data

ERIC Educational Resources Information Center

McCoach, D. Betsy; Adelson, Jill L.

2010-01-01

This article provides a conceptual introduction to the issues surrounding the analysis of clustered (nested) data. We define the intraclass correlation coefficient (ICC) and the design effect, and we explain their effect on the standard error. When the ICC is greater than 0, then the design effect is greater than 1. In such a scenario, the…
Lumbar lordosis and sacral slope in lumbar spinal stenosis: standard values and measurement accuracy.

PubMed

Bredow, J; Oppermann, J; Scheyerer, M J; Gundlfinger, K; Neiss, W F; Budde, S; Floerkemeier, T; Eysel, P; Beyer, F

2015-05-01

Radiological study. To asses standard values, intra- and interobserver reliability and reproducibility of sacral slope (SS) and lumbar lordosis (LL) and the correlation of these parameters in patients with lumbar spinal stenosis (LSS). Anteroposterior and lateral X-rays of the lumbar spine of 102 patients with LSS were included in this retrospective, radiologic study. Measurements of SS and LL were carried out by five examiners. Intraobserver correlation and correlation between LL and SS were calculated with Pearson's r linear correlation coefficient and intraclass correlation coefficients (ICC) were calculated for inter- and intraobserver reliability. In addition, patients were examined in subgroups with respect to previous surgery and the current therapy. Lumbar lordosis averaged 45.6° (range 2.5°-74.9°; SD 14.2°), intraobserver correlation was between Pearson r = 0.93 and 0.98. The measurement of SS averaged 35.3° (range 13.8°-66.9°; SD 9.6°), intraobserver correlation was between Pearson r = 0.89 and 0.96. Intraobserver reliability ranged from 0.966 to 0.992 ICC in LL measurements and 0.944-0.983 ICC in SS measurements. There was an interobserver reliability ICC of 0.944 in LL and 0.990 in SS. Correlation between LL and SS averaged r = 0.79. No statistically significant differences were observed between the analyzed subgroups. Manual measurement of LL and SS in patients with LSS on lateral radiographs is easily performed with excellent intra- and interobserver reliability. Correlation between LL and SS is very high. Differences between patients with and without previous decompression were not statistically significant.

Measuring the Cobb angle with the iPhone in kyphoses: a reliability study.

PubMed

Jacquot, Frederic; Charpentier, Axelle; Khelifi, Sofiane; Gastambide, Daniel; Rigal, Regis; Sautet, Alain

2012-08-01

Smartphones have gained widespread use in the healthcare field to fulfill a variety of tasks. We developed a small iPhone application to take advantage of the built-in position sensor to measure angles in a variety of spinal deformities. We present a reliability study of this tool in measuring kyphotic angles. Radiographs taken from 20 different patients' charts were presented to a panel of six operators at two different times. Radiographs were measured with the protractor and the iPhone application and statistical analysis was applied to measure intraclass correlation coefficients between both measurement methods, and to measure intra- and interobserver reliability The intraclass correlation coefficient calculated between methods (i.e. CobbMeter application on the iPhone versus standard method with the protractor) was 0.963 for all measures, indicating excellent correlation was obtained between the CobbMeter application and the standard method. The interobserver correlation coefficient was 0.965. The intraobserver ICC was 0.977, indicating excellent reproductibility of measurements at different times for all operators. The interobserver ICC between fellowship trained senior surgeons and general orthopaedic residents was 0.989. Consistently, the ICC for intraobserver and interobserver correlations was higher with the CobbMeter application than with the regular protractor method. This difference was not statistically significant. Measuring kyphotic angles with the iPhone application appears to be a valid procedure and is in no way inferior to the standard way of measuring the Cobb angle in kyphotic deformities.
Differences between genders in colorectal morphology on CT colonography using a quantitative approach: a pilot study.

PubMed

Weber, Charles N; Poff, Jason A; Lev-Toaff, Anna S; Levine, Marc S; Zafar, Hanna M

To explore quantitative differences between genders in morphologic colonic metrics and determine metric reproducibility. Quantitative colonic metrics from 20 male and 20 female CTC datasets were evaluated twice by two readers; all exams were performed after incomplete optical colonoscopy. Intra-/inter-reader reliability was measured with intraclass correlation coefficient (ICC) and concordance correlation coefficient (CCC). Women had overall decreased colonic volume, increased tortuosity and compactness and lower sigmoid apex height on CTC compared to men (p<0.0001,all). Quantitative measurements in colonic metrics were highly reproducible (ICC=0.9989 and 0.9970; CCC=0.9945). Quantitative morphologic differences between genders can be reproducibility measured. Copyright © 2017 Elsevier Inc. All rights reserved.
Stability and reproducibility of proteomic profiles measured with an aptamer-based platform.

PubMed

Kim, Claire H; Tworoger, Shelley S; Stampfer, Meir J; Dillon, Simon T; Gu, Xuesong; Sawyer, Sherilyn J; Chan, Andrew T; Libermann, Towia A; Eliassen, A Heather

2018-05-30

The feasibility of SOMAscan, a multiplex, high sensitivity proteomics platform, for use in studies using archived plasma samples has not yet been assessed. We quantified 1,305 proteins from plasma samples donated by 16 Nurses' Health Study (NHS) participants, 40 NHSII participants, and 12 local volunteers. We assessed assay reproducibility using coefficients of variation (CV) from duplicate samples and intra-class correlation coefficients (ICC) and Spearman correlation coefficients (r) of samples processed (i.e., centrifuged and aliquoted into separate components) immediately, 24, and 48 hours after collection, as well as those of samples collected from the same individuals 1 year apart. CVs were <20% for 99% of proteins overall and <10% for 92% of proteins in heparin samples compared to 66% for EDTA samples. We observed ICC or Spearman r (comparing immediate vs. 24-hour delayed processing) ≥0.75 for 61% of proteins, with some variation by anticoagulant (56% for heparin and 70% for EDTA) and protein class (ranging from 49% among kinases to 83% among hormones). Within-person stability over 1 year was good (ICC or Spearman r ≥ 0.4) for 91% of proteins. These results demonstrate the feasibility of SOMAscan for analyses of archived plasma samples.
Health Service Quality Scale: Brazilian Portuguese translation, reliability and validity.

PubMed

Rocha, Luiz Roberto Martins; Veiga, Daniela Francescato; e Oliveira, Paulo Rocha; Song, Elaine Horibe; Ferreira, Lydia Masako

2013-01-17

The Health Service Quality Scale is a multidimensional hierarchical scale that is based on interdisciplinary approach. This instrument was specifically created for measuring health service quality based on marketing and health care concepts. The aim of this study was to translate and culturally adapt the Health Service Quality Scale into Brazilian Portuguese and to assess the validity and reliability of the Brazilian Portuguese version of the instrument. We conducted a cross-sectional, observational study, with public health system patients in a Brazilian university hospital. Validity was assessed using Pearson's correlation coefficient to measure the strength of the association between the Brazilian Portuguese version of the instrument and the SERVQUAL scale. Internal consistency was evaluated using Cronbach's alpha coefficient; the intraclass (ICC) and Pearson's correlation coefficients were used for test-retest reliability. One hundred and sixteen consecutive postoperative patients completed the questionnaire. Pearson's correlation coefficient for validity was 0.20. Cronbach's alpha for the first and second administrations of the final version of the instrument were 0.982 and 0.986, respectively. For test-retest reliability, Pearson's correlation coefficient was 0.89 and ICC was 0.90. The culturally adapted, Brazilian Portuguese version of the Health Service Quality Scale is a valid and reliable instrument to measure health service quality.
The feasibility of measuring joint angular velocity with a gyro-sensor.

PubMed

Arai, Takeshi; Obuchi, Shuichi; Shiba, Yoshitaka; Omuro, Kazuya; Nakano, Chika; Higashi, Takuya

2008-01-01

To determine the reliability of an assessment of joint angular velocity using a gyro-sensor and to examine the relationship between ankle angular velocity and physical functions. Cross-sectional. Kinesiology laboratory. Twenty healthy young adults (mean age, 22.5 y) and 113 community-dwelling older adults (mean age, 75.1 y). Not applicable. Maximal ankle joint velocity was measured using a gyro-sensor during heel-rising and jumping with knee extended. The intraclass correlation coefficient (ICC) was used to determine the intertester and intratester reliability. The Pearson correlation coefficient was used to examine the relationships between maximal ankle joint velocity and isometric muscle strength and isokinetic muscle power in young adults and also to examine the relationships between maximal ankle joint velocity and functional performance measurements such as walking time in older adults. High reliability was found for intertester (ICC=.96) and intratester reliability (ICC=.96). The data from the gyro-sensor highly correlated with muscle strength (r range, .62-.68; P<.01) and muscle power (r range, .45-.79; P range, .01-.05). In older subjects, mobility functions significantly correlated with the angular velocity of ankle plantarflexion. Measurement of ankle angular velocity using a gyro-sensor is both reliable and feasible, with the results representing a significant correlation to muscle power and performance measurements.
Gait consistency over a 7-day interval in people with Parkinson's disease.

PubMed

Urquhart, D M; Morris, M E; Iansek, R

1999-06-01

To evaluate the consistency of temporal and spatial parameters of the walking pattern in subjects with idiopathic Parkinson's disease (PD) over a 7-day interval during the "on" phase of the levodopa medication cycle. Walking patterns were measured on a 12-meter walkway at the Kingston Gait Laboratory, Cheltenham, using a computerized stride analyzer. Sixteen subjects (7 women, 9 men) with PD recruited from the Movement Disorders Clinic at Kingston Centre. Speed of walking, stride length, cadence, and the percentage of the walking cycle spent in the double limb support phase of gait were measured, together with the level of disability as indexed by the modified Webster scale. Product-moment correlation coefficients and intraclass correlation coefficients (ICC 2,1) for repeat measures over a 7-day interval were high for speed (r = .90; ICC = .93), cadence (r = .90; ICC = .86), and stride length (r = 1.00; ICC = .97) and moderate for double limb support duration after removal of outliers (r = .75; ICC = .73); 95% confidence intervals for the change scores were within clinically acceptable limits for all variables. The mean modified Webster score was 11.4 on the first day and 10.1 7 days later. The gait pattern and level of disability in subjects with PD without severe motor fluctuations remained stable over a 1-week period when optimal medication prevailed.
Reliability and concurrent validity of a Smartphone, bubble inclinometer and motion analysis system for measurement of hip joint range of motion.

PubMed

Charlton, Paula C; Mentiplay, Benjamin F; Pua, Yong-Hao; Clark, Ross A

2015-05-01

Traditional methods of assessing joint range of motion (ROM) involve specialized tools that may not be widely available to clinicians. This study assesses the reliability and validity of a custom Smartphone application for assessing hip joint range of motion. Intra-tester reliability with concurrent validity. Passive hip joint range of motion was recorded for seven different movements in 20 males on two separate occasions. Data from a Smartphone, bubble inclinometer and a three dimensional motion analysis (3DMA) system were collected simultaneously. Intraclass correlation coefficients (ICCs), coefficients of variation (CV) and standard error of measurement (SEM) were used to assess reliability. To assess validity of the Smartphone application and the bubble inclinometer against the three dimensional motion analysis system, intraclass correlation coefficients and fixed and proportional biases were used. The Smartphone demonstrated good to excellent reliability (ICCs>0.75) for four out of the seven movements, and moderate to good reliability for the remaining three movements (ICC=0.63-0.68). Additionally, the Smartphone application displayed comparable reliability to the bubble inclinometer. The Smartphone application displayed excellent validity when compared to the three dimensional motion analysis system for all movements (ICCs>0.88) except one, which displayed moderate to good validity (ICC=0.71). Smartphones are portable and widely available tools that are mostly reliable and valid for assessing passive hip range of motion, with potential for large-scale use when a bubble inclinometer is not available. However, caution must be taken in its implementation as some movement axes demonstrated only moderate reliability. Copyright © 2014 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Stability of physical activity, fitness components and diet quality indices.

PubMed

Mertens, E; Clarys, P; Mullie, P; Lefevre, J; Charlier, R; Knaeps, S; Huybrechts, I; Deforche, B

2017-04-01

Regular physical activity (PA), a high level of fitness and a high diet quality are positively associated with health. However, information about stability of fitness components and diet quality indices is limited. This study aimed to evaluate stability of those parameters. This study includes 652 adults (men=57.56 (10.28) years; women=55.90 (8.34) years at follow-up) who participated in 2002-2004 and returned for follow-up at the Policy Research Centre Leuven in 2012-2014. Minutes sport per day and Physical activity level (PAL) were calculated from the Flemish Physical Activity Computerized Questionnaire. Cardiorespiratory fitness (CRF), morphological fitness (MORF; body mass index and waist circumference) and metabolic fitness (METF) (blood cholesterol and triglycerides) were used as fitness components. Diet quality indices (Healthy Eating Index-2010 (HEI), Diet Quality Index (DQI), Mediterranean Diet Score (MDS)) were calculated from a diet record. Tracking coefficients were calculated using Pearson/Spearman correlation coefficients (r Pearson ) and intra-class correlation coefficients (r ICC ). In both men (r Pearson&ICC =0.51) and women (r Pearson =0.62 and r ICC =0.60) PAL showed good stability, while minutes sport remained stable in women (r Pearson&ICC =0.57) but less in men (r Pearson&ICC =0.45). Most fitness components remained stable (r⩾0.50) except some METF components in women. In general the diet quality indices and their components were unstable (r<0.50). PAL and the majority of the fitness components remained stable, while diet quality was unstable over 10 years. For unstable parameters such as diet quality measurements are needed at both time points in prospective research.
Different hip and knee priority score systems: are they good for the same thing?

PubMed

Escobar, Antonio; Quintana, Jose Maria; Espallargues, Mireia; Allepuz, Alejandro; Ibañez, Berta

2010-10-01

The aim of the present study was to compare two priority tools used for joint replacement for patients on waiting lists, which use two different methods. Two prioritization tools developed and validated by different methodologies were used on the same cohort of patients. The first, an IRYSS hip and knee priority score (IHKPS) developed by RAND method, was applied while patients were on the waiting list. The other, a Catalonia hip-knee priority score (CHKPS) developed by conjoint analysis, was adapted and applied retrospectively. In addition, all patients fulfilled pre-intervention the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC). Correlation between them was studied by Pearson correlation coefficient (r). Agreement was analysed by means of intra-class correlation coefficient (ICC), Kendall coefficient and Cohern kappa. The relationship between IHKPS, CHKPS and baseline WOMAC scores by r coefficient was studied. The sample consisted of 774 consecutive patients. Pearson correlation coefficient between IHKPS and CHKPS was 0.79. The agreement study showed that ICC was 0.74, Kendall coefficient 0.86 and kappa 0.66. Finally, correlation between CHKPS and baseline WOMAC ranged from 0.43 to 0.64. The results according to the relationship between IHKPS and WOMAC ranged from 0.50 to 0.74. Results support the hypothesis that if the final objective of the prioritization tools is to organize and sort patients on the waiting list, although they use different methodologies, the results are similar. © 2010 Blackwell Publishing Ltd.
Reliability of shear wave ultrasound elastography for neck lesions identified in routine clinical practice.

PubMed

Bhatia, K; Tong, C S L; Cho, C C M; Yuen, E H Y; Lee, J; Ahuja, A T

2012-10-01

To evaluate the reliability of shear wave ultrasound elastography (SWE) in the neck. 176 neck lesions (40 thyroid, 56 lymph nodes, 46 salivary, 34 miscellaneous) identified in a routine US clinic underwent SWE by one or two blinded radiologists. For this study, SWE required the operator to acquire three 10 second dynamic colour-coded SWE cineloops per lesion, select one static image per cineloop, and place circular regions-of-interest within the entire lesion and stiffest part to generate 3 SWE measurements per static image. For logistical reasons, one radiologist evaluated all 176 lesions and the other evaluated 58 lesions. Both radiologists also reviewed 27 archived cineloops independently to assess SWE excluding practical technique. Reliability was assessed using intraclass correlation coefficients (ICCs) concordance correlation coefficients (CCCs) and coefficients of repeatability (CORs). Test-retest ICCs for the radiologist evaluating 176 lesions were 0.78 - 0.85 (fair-excellent agreement), CCCs were 0.85 - 0.88 (substantial agreement), and CORs were 14.9 - 36.1 kPa. For both radiologists evaluating 58 lesions, intra-rater and inter-rater ICCs were 0.65 - 0.78 and 0.72 - 0.77 respectively. For SWE excluding practical technique, inter-rater ICCs were 0.97 - 0.98 (excellent agreement). ICCs differed according to tissue, being higher in thyroid lesions than lymph nodes (p < 0.001), and higher in benign than malignant lesions (p values < 0.001). Intra- and inter-rater reliability of SWE is fair to excellent according to ICCs. SWE reliability is influenced appreciably by acquisition technique. Nevertheless, CORs for SWE are not negligible. To determine whether these results are acceptable clinically, further research is required to establish SWE stiffness values of normal and pathological tissues in the neck. © Georg Thieme Verlag KG Stuttgart · New York.
Reliability of the Balance Evaluation Systems Test (BESTest) and BESTest sections for adults with hemiparesis

PubMed Central

Rodrigues, Letícia C.; Marques, Aline P.; Barros, Paula B.; Michaelsen, Stella M.

2014-01-01

BACKGROUND: The Balance Evaluation Systems Test (BESTest) was recently created to allow the development of treatments according to the specific balance system affected in each patient. The Brazilian version of the BESTest has not been specifically tested after stroke. OBJECTIVE: To evaluate the intra- and inter-rater reliability and concurrent and convergent validity of the total score of the BESTest and BESTest sections for adults with hemiparesis after stroke. METHOD: The study included 16 subjects (61.1±7.5 years) with chronic hemiparesis (54.5±43.5 months after stroke). The BESTest was administered by two raters in the same week and one of the raters repeated the test after a one-week interval. Intraclass correlation coefficient (ICC) was calculated to assess intra- and interrater reliability. Concurrent validity with the Berg Balance Scale (BBS) and convergent validity with the Activities-specific Balance Confidence scale (ABC-Brazil) were assessed using Pearson's correlation coefficient. RESULTS: Both the BESTest total score (ICC=0.98) and the BESTest sections (ICC between 0.85 and 0.96) have excellent intrarater reliability. Interrater reliability for the total score was excellent (ICC=0.93) and, for the sections, it ranged between 0.71 and 0.94. The correlation coefficient between the BESTest and the BBS and ABC-Brazil were 0.78 and 0.59, respectively. CONCLUSIONS: The Brazilian version of the BESTest demonstrated adequate reliability when measured by sections and could identify what balance system was affected in patients after stroke. Concurrent validity was excellent with the BBS total score and good to excellent with the sections. The total scores but not the sections present adequate convergent validity with the ABC-Brazil. However, other psychometric properties should be further investigated. PMID:25003281
Correlation of skeletal maturation stages determined by cervical vertebrae and hand-wrist evaluations.

PubMed

Flores-Mir, Carlos; Burgess, Corr A; Champney, Mitchell; Jensen, Robert J; Pitcher, Micheal R; Major, Paul W

2006-01-01

The aim of this study was to assess the correlation between the Fishman maturation prediction method (FMP) and the cervical vertebral maturation (CVM) method for skeletal maturation stage determination. Hand-wrist and lateral cephalograms from 79 subjects (52 females and 27 males) were used. Hand-wrist radiographs were analyzed using the FMP to determine skeletal maturation level (advanced, average, or delayed) and stage (relative position of the individual in the pubertal growth curve). Cervical vertebrae (C2, C3, and C4) outlines obtained from lateral cephalograms were analyzed using the CVM to determine skeletal maturation stage. Intraexaminer reliability (Intraclass correlation coefficient [ICC]) for both methods was calculated from 10 triplicate hand-wrist and lateral cephalograms from the same patients. An ICC coefficient of 0.985 for FMP and an ICC of 0.889 for CVM were obtained. A Spearman correlation value of 0.72 (P < .001) was found between the skeletal maturation stages of both methods. When the sample was subgrouped according to skeletal maturation level, the following correlation values were found: for early mature adolescents 0.73, for average mature adolescents 0.70, and for late mature adolescents 0.87. All these correlation values were statistically different from zero (P < .024). Correlation values between both skeletal maturation methods were moderately high. This may be high enough to use either of the methods indistinctively for research purposes but not for the assessment of individual patients. Skeletal level influences the correlation values and, therefore, it should be considered whenever possible.
Feasibility of a Respiratory Movement Evaluation Tool to Quantify Thoracoabdominal Movement for Neuromuscular Diseases.

PubMed

Liu, Fumio; Kawakami, Michiyuki; Tamura, Kimimasa; Taki, Yoshihito; Shimizu, Katsumi; Otsuka, Tomoyoshi; Tsuji, Tetsuya; Miyata, Chieko; Tashiro, Syoichi; Wada, Ayako; Mizuno, Katsuhiro; Aoki, Yoshimitsu; Liu, Meigen

2017-04-01

An objective method to evaluate thoracoabdominal movement is needed in daily clinical practice to detect patients at risk of hypoventilation and to allow for timely interventions in neuromuscular diseases. The clinical feasibility, reliability, and validity of a newly developed method for quantifying respiratory movement using fiber grating sensors, called the Respiratory Movement Evaluation Tool (RMET), was evaluated. The time needed to measure respiratory movement and the usability of the measurement were determined by 5 clinicians using the Quebec User Evaluation of Satisfaction with Assistive Technology (QUEST) 2.0 questionnaire. Thoracoabdominal movement was measured using RMET 3 times in 10 healthy subjects to evaluate intraclass correlation coefficients (ICC). The subjects were encouraged to breathe 10 times while voluntarily changing the amount of air during ventilation simultaneously with the RMET and a spirometer, and their correlations were evaluated to test validity using Pearson's product-moment correlation coefficients. The same measurements were also performed in 10 subjects with Duchenne muscular dystrophy. Real-time recordings of thoracoabdominal movements were obtained over a mean time of 374 ± 23.9 s. With QUEST 2.0, the median score of each item exceeded 3 (more or less satisfied). In healthy subjects, ICC(1,1) ranged from 0.82 to 0.99, and ICC(2,1) ranged from 0.83 to 0.97. Significant correlations were observed between the respiratory amplitudes measured with RMET, and the amount of air during ventilation was measured with a spirometer (r = 0.995, P < .001). In subjects with Duchenne muscular dystrophy, ICC(1,1) ranged from 0.87 to 0.97, and ICC(2,1) ranged from 0.84 to 0.99. The respiratory amplitudes measured with RMET correlated significantly with the amount of air during ventilation with a spirometer (r = 0.957, P < .001). We developed a novel method of quantifying respiratory movement called RMET that was feasible to use in daily clinical practice. Copyright © 2017 by Daedalus Enterprises.
[Reliability and validity of Parkinson's disease sleep scale-Chinese version in the south west of China].

PubMed

Zhang, J H; Peng, R; Du, Y; Mou, Y; Li, N N; Cheng, L

2016-11-08

Objective: To evaluate the reliability and validity of Parkinson's disease sleep scale-Chinese version (CPDSS) through a study of a large PD population in southwest China, and to explore the prevalence and characteristics of sleep disorders in Parkinson's disease (PD) patients from southwest China. Methods: A total of 544 PD patients and 220 control subjects were enrolled in our study. Demographic data, CPDSS, ESS, PDQ39, HAMD and H-Y stage were assessed in all subjects. Statistical description, Cronbach's alpha coefficient, intra-class correlation coefficient ( ICC ), Spearman rank correlation coefficient and Mann-Whitney U test were used for statistical analyses. Result: The Cronbach's alpha coefficient for CPDSS was 0.79, ICC of the total scale was 0.94 and ICC of each item ranged from 0.73 to 0.97. The factor analysis yielded a five-factor solution, which explained 63.4% of the total variance. Total and each item scores of CPDSS in PD patients were lower than those in healthy controls. 69.3% of PD patients had sleep disorder, while prevalence in the control group was only 29.6%. Negative correlation was found between CPDSS and ESS. Daytime sleepiness was the most common factor (35.9%) leading to sleep disorders. The sleep disorders of PD patients in Southwest China were significantly related with the course of disease, the severity of disease, the quality of life, depression, cognitive level and motor symptoms. Conclusion: CPDSS has good feasibility, reliability and validity in PD population from southwest China. CPDSS is considered as an effective tool for the assessment of sleep disorder in PD patients.
Reliability and concurrent validity of postural asymmetry measurement in adolescent idiopathic scoliosis.

PubMed

Prowse, Ashleigh; Aslaksen, Berit; Kierkegaard, Marie; Furness, James; Gerdhem, Paul; Abbott, Allan

2017-01-18

To investigate the reliability and concurrent validity of the Baseline ® Body Level/Scoliosis meter for adolescent idiopathic scoliosis postural assessment in three anatomical planes. This is an observational reliability and concurrent validity study of adolescent referrals to the Orthopaedic department for scoliosis screening at Karolinska University Hospital, Stockholm, Sweden between March-May 2012. A total of 31 adolescents with idiopathic scoliosis (13.6 ± 0.6 years old) of mild-moderate curvatures (25° ± 12°) were consecutively recruited. Measurement of cervical, thoracic and lumbar curvatures, pelvic and shoulder tilt, and axial thoracic rotation (ATR) were performed by two trained physiotherapists in one day. The intraclass correlation coefficient (ICC) was used to determine the inter-examiner reliability (ICC2,1) and the intra-rater reliability (ICC3,3) of the Baseline ® Body Level/Scoliosis meter. Spearman's correlation analyses were used to estimate concurrent validity between the Baseline ® Body Level/Scoliosis meter and Gold Standard Cobb angles from radiographs and the Orthopaedic Systems Inc. Scoliometer. There was excellent reliability between examiners for thoracic kyphosis (ICC2,1 = 0.94), ATR (ICC2,1 = 0.92) and lumbar lordosis (ICC2,1 = 0.79). There was adequate reliability between examiners for cervical lordosis (ICC2,1 = 0.51), however poor reliability for pelvic and shoulder tilt. Both devices were reproducible in the measurement of ATR when repeated by one examiner (ICC3,3 0.98-1.00). The device had a good correlation with the Scoliometer (rho = 0.78). When compared with Cobb angle from radiographs, there was a moderate correlation for ATR (rho = 0.627). The Baseline ® Body Level/Scoliosis meter provides reliable transverse and sagittal cervical, thoracic and lumbar measurements and valid transverse plan measurements of mild-moderate scoliosis deformity.
Validity and cross-cultural adaptation of the persian version of the oxford elbow score.

PubMed

Ebrahimzadeh, Mohammad H; Kachooei, Amir Reza; Vahedi, Ehsan; Moradi, Ali; Mashayekhi, Zeinab; Hallaj-Moghaddam, Mohammad; Azami, Mehran; Birjandinejad, Ali

2014-01-01

Oxford Elbow Score (OES) is a patient-reported questionnaire used to assess outcomes after elbow surgery. The aim of this study was to validate and adapt the OES into Persian language. After forward-backward translation of the OES into Persian, a total number of 92 patients after elbow surgeries completed the Persian OES along with the Persian DASH and SF-36. To assess test-retest reliability, 31 randomly selected patients (34%) completed the Persian OES again after three days while abstaining from all forms of therapeutic regimens. Reliability of the Persian OES was assessed by measuring intraclass correlation coefficient (ICC) for test-retest reliability and Cronbach's alpha for internal consistency. Spearman's correlation coefficient was used to test the construct validity. Cronbach's alpha coefficient was 0.92 showing excellent reliability. Cronbach's alpha for function, pain, and social-psychological subscales was 0.95, 0.86, and 0.85, respectively. Intraclass correlation coefficient (ICC) was 0.85 for the overall questionnaire and 0.90, 0.76, and 0.75 for function, pain, and social-psychological subscales, respectively. Construct validity was confirmed as the Spearman correlation between OES and DASH was 0.80. Persian OES is a valid and reliable patient-reported outcome measure to assess postsurgical elbow status in Persian speaking population.
Interobserver reliability of the 'Welfare Quality(®) Animal Welfare Assessment Protocol for Growing Pigs'.

PubMed

Czycholl, I; Kniese, C; Büttner, K; Beilage, E Grosse; Schrader, L; Krieter, J

2016-01-01

The present paper focuses on evaluating the interobserver reliability of the 'Welfare Quality(®) Animal Welfare Assessment Protocol for Growing Pigs'. The protocol for growing pigs mainly consists of a Qualitative Behaviour Assessment (QBA), direct behaviour observations (BO) carried out by instantaneous scan sampling and checks for different individual parameters (IP), e.g. presence of tail biting, wounds and bursitis. Three trained observers collected the data by performing 29 combined assessments, which were done at the same time and on the same animals; but they were carried out completely independent of each other. The findings were compared by the calculation of Spearman Rank Correlation Coefficients (RS), Intraclass Correlation Coefficients (ICC), Smallest Detectable Changes (SDC) and Limits of Agreements (LoA). There was no agreement found concerning the adjectives belonging to the QBA (e.g. active: RS: 0.50, ICC: 0.30, SDC: 0.38, LoA: -0.05 to 0.45; fearful: RS: 0.06, ICC: 0.0, SDC: 0.26, LoA: -0.20 to 0.30). In contrast, the BO showed good agreement (e.g. social behaviour: RS: 0.45, ICC: 0.50, SDC: 0.09, LoA: -0.09 to 0.03 use of enrichment material: RS: 0.75, ICC: 0.68, SDC: 0.06, LoA: -0.03 to 0.03). Overall, observers agreed well in the IP, e.g. tail biting (RS: 0.52, ICC: 0.88; SDC: 0.05, LoA: -0.01 to 0.02) and wounds (RS: 0.43, ICC: 0.59, SDC: 0.10, LoA: -0.09 to 0.10). The parameter bursitis showed great differences (RS: 0.10, ICC: 0.0, SDC: 0.35, LoA: -0.37 to 0.40), which can be explained by difficulties in the assessment when the animals moved around quickly or their legs were soiled. In conclusion, the interobserver reliability was good in the BO and most IP, but not for the parameter bursitis and the QBA.
Reliability of Leg and Vertical Stiffness During High Speed Treadmill Running.

PubMed

Pappas, Panagiotis; Dallas, Giorgos; Paradisis, Giorgos

2017-04-01

In research, the accurate and reliable measurement of leg and vertical stiffness could contribute to valid interpretations. The current study aimed at determining the intraparticipant variability (ie, intraday and interday reliabilities) of leg and vertical stiffness, as well as related parameters, during high speed treadmill running, using the "sine-wave" method. Thirty-one males ran on a treadmill at 6.67 m∙s -1 , and the contact and flight times were measured. To determine the intraday reliability, three 10-s running bouts with 10-min recovery were performed. In addition, to examine the interday reliability, three 10-s running bouts on 3 separate days with 48-h interbout intervals were performed. The reliability statistics included repeated-measure analysis of variance, average intertrial correlations, intraclass correlation coefficients (ICCs), Cronbach's α reliability coefficient, and the coefficient of variation (CV%). Both intraday and interday reliabilities were high for leg and vertical stiffness (ICC > 0.939 and CV < 4.3%), as well as related variables (ICC > 0.934 and CV < 3.9%). It was thus inferred that the measurements of leg and vertical stiffness, as well as the related parameters obtained using the "sine-wave" method during treadmill running at 6.67 m∙s -1 , were highly reliable, both within and across days.
Reliability of fully automated versus visually controlled pre- and post-processing of resting-state EEG.

PubMed

Hatz, F; Hardmeier, M; Bousleiman, H; Rüegg, S; Schindler, C; Fuhr, P

2015-02-01

To compare the reliability of a newly developed Matlab® toolbox for the fully automated, pre- and post-processing of resting state EEG (automated analysis, AA) with the reliability of analysis involving visually controlled pre- and post-processing (VA). 34 healthy volunteers (age: median 38.2 (20-49), 82% female) had three consecutive 256-channel resting-state EEG at one year intervals. Results of frequency analysis of AA and VA were compared with Pearson correlation coefficients, and reliability over time was assessed with intraclass correlation coefficients (ICC). Mean correlation coefficient between AA and VA was 0.94±0.07, mean ICC for AA 0.83±0.05 and for VA 0.84±0.07. AA and VA yield very similar results for spectral EEG analysis and are equally reliable. AA is less time-consuming, completely standardized, and independent of raters and their training. Automated processing of EEG facilitates workflow in quantitative EEG analysis. Copyright © 2014 International Federation of Clinical Neurophysiology. Published by Elsevier Ireland Ltd. All rights reserved.
Cross-cultural adaptation and validation of the Turkish version of Oxford hip score.

PubMed

Tuğay, Baki Umut; Tuğay, Nazan; Güney, Hande; Hazar, Zeynep; Yüksel, İnci; Atilla, Bülent

2015-06-01

The purpose of this study was to translate the Oxford hip score (OHS) into Turkish and to evaluate the psychometric properties by testing the internal consistency, reproducibility, construct validity, and responsiveness in patients with hip osteoarthritis (OA). Oxford hip score was translated and culturally adapted according to the guidelines in the literature. Seventy patients (mean age 61.45 ± 9.29 years) with hip osteoarthritis participated in the study. Patients completed the Turkish Oxford hip score (OHS-TR), the Short-Form 36 (SF-36), and Western Ontario and McMaster Universities Index (WOMAC). Internal consistency was tested using Cronbach's α coefficient. Patients completed OHS-TR questionnaire twice in 7 days for determining the reproducibility. Correlation between the total results of both tests was determined by the Pearson correlation coefficient and intraclass correlation coefficient (ICC). Validity was assessed by calculating the Pearson correlation coefficient between the OHS-TR and WOMAC and SF-36 scores. Floor and ceiling effects were analyzed. The internal consistency was high (Cronbach's α 0.93). The construct validity showed a significant correlation between the OHS-TR and WOMAC and related SF-36 domains (p < 0.001). The ICC's ranged between 0.80 and 0.99. There was no floor or ceiling effect in total OHS-TR score. The OHS-TR questionnaire is valid, reliable, and responsive for the Turkish-speaking patients with hip OA.

Test-Retest Reliability of the Salutogenic Wellness Promotion Scale (SWPS)

ERIC Educational Resources Information Center

Anderson, L. M.; Moore, J. B.; Hayden, B. M.; Becker, C. M.

2014-01-01

Objective: This study examined the temporal stability (i.e. test-retest reliability) of the Salutogenic Wellness Promotion Scale (SWPS) using intraclass correlation coefficients (ICC). Current intraclass results were also compared to previously published interclass correlations to support the use of the intraclass method for test-retest…
Reliability of Entire Corneal Thickness Mapping in Normal Post-Laser in situ Keratomileusis and Keratoconus Eyes Using Long Scan Depth Spectral Domain Optical Coherence Tomography.

PubMed

Xu, Zhe; Chen, Sisi; Yang, Chun; Huang, Shenghai; Shen, Meixiao; Wang, Yuanyuan

2018-01-01

To investigate the repeatability and reproducibility of mapping the entire corneal thickness using spectral domain optical coherence tomography (SD-OCT). Thirty normal eyes, 30 post-laser in situ keratomileusis (LASIK) surgery eyes, and 30 keratoconus eyes were analyzed. A custom-built long scan depth SD-OCT device was used to obtain entire corneal images. Ten-millimeter-diameter corneal thickness maps were generated by an automated segmentation algorithm. Intraclass correlation coefficients of repeatability (ICC1) and reproducibility (ICC2), and coefficients of repeatability (CoR1) and reproducibility (CoR2), were calculated to quantify the precision and accuracy of corneal pachymetry measurements using the Bland-Altman method. For SD-OCT measurements in healthy subjects, CoR1 and CoR2 were less than 5.00 and 5.53 μm. ICC1 and ICC2 were more than 0.997 and 0.996. For SD-OCT measurements in LASIK patients, CoR1 and CoR2 were less than 5.09 and 5.34 μm. ICC1 and ICC2 were more than 0.997 and 0.996. For SD-OCT measurements in keratoconus patients, CoR1 and CoR2 were less than 11.57 and 10.92 μm. ICC1 and ICC2 were more than 0.995 and 0.996. The measurements of corneal pachymetric mapping by long scan depth SD-OCT can be assessed over the entire corneal area with good repeatability and reproducibility. © 2017 S. Karger AG, Basel.
Evaluating the Consistency of Current Mainstream Wearable Devices in Health Monitoring: A Comparison Under Free-Living Conditions

PubMed Central

Wen, Dong; Zhang, Xingting; Liu, Xingyu

2017-01-01

Background Wearable devices are gaining increasing market attention; however, the monitoring accuracy and consistency of the devices remains unknown. Objective The purpose of this study was to assess the consistency of the monitoring measurements of the latest wearable devices in the state of normal activities to provide advice to the industry and support to consumers in making purchasing choices. Methods Ten pieces of representative wearable devices (2 smart watches, 4 smart bracelets of Chinese brands or foreign brands, and 4 mobile phone apps) were selected, and 5 subjects were employed to simultaneously use all the devices and the apps. From these devices, intact health monitoring data were acquired for 5 consecutive days and analyzed on the degree of differences and the relationships of the monitoring measurements by the different devices. Results The daily measurements by the different devices fluctuated greatly, and the coefficient of variation (CV) fluctuated in the range of 2-38% for the number of steps, 5-30% for distance, 19-112% for activity duration, .1-17% for total energy expenditure (EE), 22-100% for activity EE, 2-44% for sleep duration, and 35-117% for deep sleep duration. After integrating the measurement data of 25 days among the devices, the measurements of the number of steps (intraclass correlation coefficient, ICC=.89) and distance (ICC=.84) displayed excellent consistencies, followed by those of activity duration (ICC=.59) and the total EE (ICC=.59) and activity EE (ICC=.57). However, the measurements for sleep duration (ICC=.30) and deep sleep duration (ICC=.27) were poor. For most devices, there was a strong correlation between the number of steps and distance measurements (R2>.95), and for some devices, there was a strong correlation between activity duration measurements and EE measurements (R2>.7). A strong correlation was observed in the measurements of steps, distance and EE from smart watches and mobile phones of the same brand, Apple or Samsung (r>.88). Conclusions Although wearable devices are developing rapidly, the current mainstream devices are only reliable in measuring the number of steps and distance, which can be used as health assessment indicators. However, the measurement consistencies of activity duration, EE, sleep quality, and so on, are still inadequate, which require further investigation and improved algorithms. PMID:28270382
A simple method of measuring tibial tubercle to trochlear groove distance on MRI: description of a novel and reliable technique.

PubMed

Camp, Christopher L; Heidenreich, Mark J; Dahm, Diane L; Bond, Jeffrey R; Collins, Mark S; Krych, Aaron J

2016-03-01

Tibial tubercle-trochlear groove (TT-TG) distance is a variable that helps guide surgical decision-making in patients with patellar instability. The purpose of this study was to compare the accuracy and reliability of an MRI TT-TG measuring technique using a simple external alignment method to a previously validated gold standard technique that requires advanced software read by radiologists. TT-TG was calculated by MRI on 59 knees with a clinical diagnosis of patellar instability in a blinded and randomized fashion by two musculoskeletal radiologists using advanced software and by two orthopaedists using the study technique which utilizes measurements taken on a simple electronic imaging platform. Interrater reliability between the two radiologists and the two orthopaedists and intermethods reliability between the two techniques were calculated using interclass correlation coefficients (ICC) and concordance correlation coefficients (CCC). ICC and CCC values greater than 0.75 were considered to represent excellent agreement. The mean TT-TG distance was 14.7 mm (Standard Deviation (SD) 4.87 mm) and 15.4 mm (SD 5.41) as measured by the radiologists and orthopaedists, respectively. Excellent interobserver agreement was noted between the radiologists (ICC 0.941; CCC 0.941), the orthopaedists (ICC 0.978; CCC 0.976), and the two techniques (ICC 0.941; CCC 0.933). The simple TT-TG distance measurement technique analysed in this study resulted in excellent agreement and reliability as compared to the gold standard technique. This method can predictably be performed by orthopaedic surgeons without advanced radiologic software. II.
American Orthopaedic Foot and Ankle Society ankle-hindfoot scale: A cross-cultural adaptation and validation study from Iran.

PubMed

Vosoughi, Amir Reza; Roustaei, Narges; Mahdaviazad, Hamideh

2018-06-01

The use of valid and reliable outcome rating scales is essential for evaluating the result of different treatments and interventions. The purposes of this study were to translate and culturally adapt the American Orthopaedic Foot and Ankle Society ankle-hindfoot scale (AOFAS-AHFS) into Persian languages and evaluate its psychometric properties. Forward-backward translation and cultural adaptation method were used to develop Persian version of AOFAS-AHFS. From March to July 2016, one hundred consecutive patients with ankle and hindfoot injuries were included. Internal consistency and reproducibility were evaluated using Cronbach's alpha, Spearman's rank correlation coefficient and Intraclass correlation coefficient (ICC) respectively. Construct validity reported which compare the outcome rating scale measurements with Short Form-36 (SF-36), also convergent and discriminant validity evaluated using Spearman's rank correlation coefficient. Mean age (SD) of the patients was 41.95±13.45years. Cronbach's α coefficient, Spearman's rho and ICC values were 0.71, 0.89 and 0.90 respectively. Total score of AOFAS-AHFS and SF-36 domains has a correlation ranged between 0.17-0.55. Spearman's rank correlation coefficient of 0.4 was exceeded by all items with the exception of stability. The Spearman's rank correlation between each item in functional subscales with its own subscales was higher than the correlation between these items and other subscales. Persian version of AOFAS-AHFS provides additional reliable and valid instrument which can be used to assess broad range of patients with foot and ankle disorders that speaking in Persian. However, it seems that the original version of AOFAS-AHFS needs some revisions. Copyright © 2017 European Foot and Ankle Society. Published by Elsevier Ltd. All rights reserved.
The Reliability of a Novel Mobile 3-dimensional Wound Measurement Device.

PubMed

Anghel, Ersilia L; Kumar, Anagha; Bigham, Thomas E; Maselli, Kathryn M; Steinberg, John S; Evans, Karen K; Kim, Paul J; Attinger, Christopher E

2016-11-01

Objective assessment of wound dimensions is essential for tracking progression and determining treatment effectiveness. A reliability study was designed to establish intrarater and interrater reliability of a novel mobile 3-dimensional wound measurement (3DWM) device. Forty-five wounds were assessed by 2 raters using a 3DWM device to obtain length, width, area, depth, and volume measurements. Wounds were also measured manually, using a disposable ruler and digital planimetry. The intraclass correlation coefficient (ICC) was used to establish intrarater and interrater reliability. High levels of intrarater and interrater agreement were observed for area, length, and width; ICC = 0.998, 0.977, 0.955 and 0.999, 0.997, 0.995, respectively. Moderate levels of intrarater (ICC = 0.888) and interrater (ICC = 0.696) agreement were observed for volume. Lastly, depth yielded an intrarater ICC of 0.360 and an interrater ICC of 0.649. Measures from the 3DWM device were highly correlated with those obtained from scaled photography for length, width, and area (ρ = 0.997, 0.988, 0.997, P < 0.001). The 3DWM device yielded correlations of ρ = 0.990, 0.987, 0.996 with P < 0.001 for length, width, and area when compared to manual measurements. The 3DWM device was found to be highly reliable for measuring wound areas for a range of wound sizes and types as compared to manual measurement and digital planimetry. The depth and therefore volume measurement using the 3DWM device was found to have a lower ICC, but volume ICC alone was moderate. Overall, this device offers a mobile option for objective wound measurement in the clinical setting.
Intra-class correlation estimates for assessment of vitamin A intake in children.

PubMed

Agarwal, Girdhar G; Awasthi, Shally; Walter, Stephen D

2005-03-01

In many community-based surveys, multi-level sampling is inherent in the design. In the design of these studies, especially to calculate the appropriate sample size, investigators need good estimates of intra-class correlation coefficient (ICC), along with the cluster size, to adjust for variation inflation due to clustering at each level. The present study used data on the assessment of clinical vitamin A deficiency and intake of vitamin A-rich food in children in a district in India. For the survey, 16 households were sampled from 200 villages nested within eight randomly-selected blocks of the district. ICCs and components of variances were estimated from a three-level hierarchical random effects analysis of variance model. Estimates of ICCs and variance components were obtained at village and block levels. Between-cluster variation was evident at each level of clustering. In these estimates, ICCs were inversely related to cluster size, but the design effect could be substantial for large clusters. At the block level, most ICC estimates were below 0.07. At the village level, many ICC estimates ranged from 0.014 to 0.45. These estimates may provide useful information for the design of epidemiological studies in which the sampled (or allocated) units range in size from households to large administrative zones.
Reliability and validity of Web-SPAN, a web-based method for assessing weight status, diet and physical activity in youth.

PubMed

Storey, K E; McCargar, L J

2012-02-01

Web-based surveys are becoming increasing popular. The present study aimed to assess the reliability and validity of the Web-Survey of Physical Activity and Nutrition (Web-SPAN) for self-report of height and weight, diet and physical activity by youth. School children aged 11-15years (grades 7-9; n=459) participated in the school-based research (boys, n=225; girls, n=233; mean age, 12.8years). Students completed Web-SPAN (self-administered) twice and participated in on-site school assessments [height, weight, 3-day food/pedometer record, Physical Activity Questionnaire for Older Children (PAQ-C), shuttle run]. Intraclass (ICC) and Pearson's correlation coefficients and paired samples t-tests were used to assess the test-retest reliability of Web-SPAN and to compare Web-SPAN with the on-site assessments. Test-retest reliability for height (ICC=0.90), weight (ICC=0.98) and the PAQ-C (ICC=0.79) were highly correlated, whereas correlations for nutrients were not as strong (ICC=0.37-0.64). There were no differences between Web-SPAN times 1 and 2 for height and weight, although there were differences for the PAQ-C and most nutrients. Web-SPAN was strongly correlated with the on-site assessments, including height (ICC=0.88), weight (ICC=0.93) and the PAQ-C (ICC=0.70). Mean differences for height and the PAQ-C were not significant, whereas mean differences for weight were significant resulting in an underestimation of being overweight/obesity prevalence (84% agreement). Correlations for nutrients were in the range 0.24-0.40; mean differences were small but generally significantly different. Correlations were weak between the web-based PAQ-C and 3-day pedometer record (r=0.28) and 20-m shuttle run (r=0.28). Web-SPAN is a time- and cost-effective method that can be used to assess the diet and physical activity status of youth in large cross-sectional studies and to assess group trends (weight status). © 2011 The Authors. Journal of Human Nutrition and Dietetics © 2011 The British Dietetic Association Ltd.
Reliability and concurrent validity of postural asymmetry measurement in adolescent idiopathic scoliosis

PubMed Central

Prowse, Ashleigh; Aslaksen, Berit; Kierkegaard, Marie; Furness, James; Gerdhem, Paul; Abbott, Allan

2017-01-01

AIM To investigate the reliability and concurrent validity of the Baseline® Body Level/Scoliosis meter for adolescent idiopathic scoliosis postural assessment in three anatomical planes. METHODS This is an observational reliability and concurrent validity study of adolescent referrals to the Orthopaedic department for scoliosis screening at Karolinska University Hospital, Stockholm, Sweden between March-May 2012. A total of 31 adolescents with idiopathic scoliosis (13.6 ± 0.6 years old) of mild-moderate curvatures (25° ± 12°) were consecutively recruited. Measurement of cervical, thoracic and lumbar curvatures, pelvic and shoulder tilt, and axial thoracic rotation (ATR) were performed by two trained physiotherapists in one day. The intraclass correlation coefficient (ICC) was used to determine the inter-examiner reliability (ICC2,1) and the intra-rater reliability (ICC3,3) of the Baseline® Body Level/Scoliosis meter. Spearman’s correlation analyses were used to estimate concurrent validity between the Baseline® Body Level/Scoliosis meter and Gold Standard Cobb angles from radiographs and the Orthopaedic Systems Inc. Scoliometer. RESULTS There was excellent reliability between examiners for thoracic kyphosis (ICC2,1 = 0.94), ATR (ICC2,1 = 0.92) and lumbar lordosis (ICC2,1 = 0.79). There was adequate reliability between examiners for cervical lordosis (ICC2,1 = 0.51), however poor reliability for pelvic and shoulder tilt. Both devices were reproducible in the measurement of ATR when repeated by one examiner (ICC3,3 0.98-1.00). The device had a good correlation with the Scoliometer (rho = 0.78). When compared with Cobb angle from radiographs, there was a moderate correlation for ATR (rho = 0.627). CONCLUSION The Baseline® Body Level/Scoliosis meter provides reliable transverse and sagittal cervical, thoracic and lumbar measurements and valid transverse plan measurements of mild-moderate scoliosis deformity. PMID:28144582
Prospective Study Validating Inter- and Intraobserver Variability of Tissue Compliance Meter in Breast Tissue of Healthy Volunteers: Potential Implications for Patients With Radiation-Induced Fibrosis of the Breast

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wernicke, A. Gabriella, E-mail: gaw9008@med.cornell.ed; Parashar, Bhupesh; Kulidzhanov, Fridon

2011-05-01

Purpose: Accurate detection of radiation-induced fibrosis (RIF) is crucial in management of breast cancer survivors. Tissue compliance meter (TCM) has been validated in musculature. We validate TCM in healthy breast tissue with respect to interobserver and intraobserver variability before applying it in RIF. Methods and Materials: Three medical professionals obtained three consecutive TCM measurements in each of the four quadrants of the right and left breasts of 40 women with no breast disease or surgical intervention. The intraclass correlation coefficient (ICC) assessed interobserver variability. The paired t test and Pearson correlation coefficient (r) were used to assess intraobserver variability withinmore » each rater. Results: The median age was 45 years (range, 24-68 years). The median bra size was 35C (range, 32A-40DD). Of the participants, 27 were white (67%), 4 black (10%), 5 Asian (13%), and 4 Hispanic (10%). ICCs indicated excellent interrater reliability (low interobserver variability) among the three raters, by breast and quadrant (all ICC {>=}0.99). The paired t test and Pearson correlation coefficient both indicated low intraobserver variability within each rater (right vs. left breast), stratified by quadrant (all r{>=} 0.94, p < 0.0001). Conclusions: The interobserver and intraobserver variability is small using TCM in healthy mammary tissue. We are now embarking on a prospective study using TCM in women with breast cancer at risk of developing RIF that may guide early detection, timely therapeutic intervention, and assessment of success of therapy for RIF.« less
The Reliability and Validity of the Computerized Double Inclinometer in Measuring Lumbar Mobility

PubMed Central

MacDermid, Joy Christine; Arumugam, Vanitha; Vincent, Joshua Israel; Carroll, Krista L

2014-01-01

Study Design : Repeated measures reliability/validity study. Objectives : To determine the concurrent validity, test-retest, inter-rater and intra-rater reliability of lumbar flexion and extension measurements using the Tracker M.E. computerized dual inclinometer (CDI) in comparison to the modified-modified Schober (MMS) Summary of Background : Numerous studies have evaluated the reliability and validity of the various methods of measuring spinal motion, but the results are inconsistent. Differences in equipment and techniques make it difficult to correlate results. Methods : Twenty subjects with back pain and twenty without back pain were selected through convenience sampling. Two examiners measured sagittal plane lumbar range of motion for each subject. Two separate tests with the CDI and one test with the MMS were conducted. Each test consisted of three trials. Instrument and examiner order was randomly assigned. Intra-class correlations (ICCs 2, 2 and 2, 2) and Pearson correlation coefficients (r) were used to calculate reliability and concurrent validity respectively. Results : Intra-trial reliability was high to very high for both the CDI (ICCs 0.85 - 0.96) and MMS (ICCs 0.84 - 0.98). However, the reliability was poor to moderate, when the CDI unit had to be repositioned either by the same rate (ICCs 0.16 - 0.59) or a different rater (ICCs 0.45 - 0.52). Inter-rater reliability for the MMS was moderate to high (ICCs 0.75 - 0.82) which bettered the moderate correlation obtained for the CDI (ICCs 0.45 - 0.52). Correlations between the CDI and MMS were poor for flexion (0.32; p<0.05) and poor to moderate (-0.42 - -0.51; p<0.05) for extension measurements. Conclusion : When using the CDI, an average of subsequent tests is required to obtain moderate reliability. The MMS was highly reliable than the CDI. The MMS and the CDI measure lumbar movement on a different metric that are not highly related to each other. PMID:25352928
Sciatic neurosteatosis: Relationship with age, gender, obesity and height.

PubMed

Ratner, Shayna; Khwaja, Raamis; Zhang, Lihua; Xi, Yin; Dessouky, Riham; Rubin, Craig; Chhabra, Avneesh

2018-04-01

To evaluate inter-reader performance for cross-sectional area and fat quantification of bilateral sciatic nerves on MRI and assess correlations with anthropometrics. In this IRB-approved, HIPPA-compliant study, three readers performed a cross-sectional analysis of 3T lumbosacral plexus MRIs over an 18-month period. Image slices were evaluated at two levels (A and B). The sciatic nerve was outlined using a free hand region of interest tool on PACS. Proton-density fat fraction (FF) and cross-sectional areas were recorded. Inter-reader agreement was assessed using intra-class correlation coefficient (ICC). Spearman correlation coefficients were used for correlations with age, BMI and height and Wilcoxon rank sum test was used to assess gender differences. A total of 67 patients were included in this study with male to female ratio of 1:1. Inter-reader agreement was good to excellent for FF measurements at both levels (ICC=0.71-0.90) and poor for sciatic nerve areas (ICC=0.08-0.27). Positive correlations of sciatic FF and area were seen with age (p value<0.05). Males had significantly higher sciatic intraneural fat than females (p<0.05). Fat quantification MRI is highly reproducible with significant positive correlations of sciatic FF and area with age, which may have implications for MRI diagnosis of sciatic neuropathy. • MR proton density fat fraction is highly reproducible at multiple levels. • Sciatic intraneural fat is positively correlated with increasing age (p < 0.05). • Positive correlations exist between bilateral sciatic nerve areas and age (p < 0.05). • Males had significantly higher sciatic intraneural fat than females (p < 0.05).
Diffusion-weighted MR imaging of upper abdominal organs at different time points: Apparent diffusion coefficient normalization using a reference organ.

PubMed

Song, Ji Soo; Kwak, Hyo Sung; Byon, Jung Hee; Jin, Gong Yong

2017-05-01

To compare the apparent diffusion coefficient (ADC) of upper abdominal organs acquired at different time points, and to investigate the usefulness of normalization. We retrospectively evaluated 58 patients who underwent three rounds of magnetic resonance (MR) imaging including diffusion-weighted imaging of the upper abdomen. MR examinations were performed using three different 3.0 Tesla (T) and one 1.5T systems, with variable b value combinations and respiratory motion compensation techniques. The ADC values of the upper abdominal organs from three different time points were analyzed, using the ADC values of the paraspinal muscle (ADC psm ) and spleen (ADC spleen ) for normalization. Intraclass correlation coefficients (ICC) and comparison of dependent ICCs were used for statistical analysis. The ICCs of the original ADC and ADC psm showed fair to substantial agreement, while ADC spleen showed substantial to almost perfect agreement. The ICC of ADC spleen of all anatomical regions showed less variability compared with that of the original ADC (P < 0.005). Normalized ADC using the spleen as a reference organ significantly decreased variability in measurement of the upper abdominal organs in different MR systems at different time points and could be regarded as an imaging biomarker for future multicenter, longitudinal studies. 5 J. MAGN. RESON. IMAGING 2017;45:1494-1501. © 2016 International Society for Magnetic Resonance in Medicine.
The reliability of a simplified water displacement instrument: a method for measuring arm volume.

PubMed

Sagen, Ase; Kåresen, Rolf; Risberg, May Arna

2005-01-01

To present a new water displacement measurement, the Simplified Water Displacement Instrument (SWDI), and to evaluate its intra- and intertester reliability. Reliability design. Hospital setting. Fifty-six healthy people were studied. Intratester reliability was evaluated once a week for 4 weeks in 20 women and 10 men. Intertester reliability was assessed by 2 physical therapists in 26 people. Not applicable. Coefficients of variation (CVs) and intraclass correlation coefficients (ICCs). The intratester reliability showed a CV range of 2.2% to 2.6% and an ICC range of .98 to .99. The intertester reliability showed a CV of 1.3% and an ICC of .99. There was a significant increase in arm volume in men compared with women. There were no significant differences in changes in volume over the 4 weeks. There was a significant greater right arm volume (3.3%) among the right-handed subjects (P<.001). Both intra- and intertester reliability were satisfactory for the SWDI.
Reliability of instruments in a cooperative, multisite study: employment intervention demonstration program.

PubMed

Salyers, M P; McHugo, G J; Cook, J A; Razzano, L A; Drake, R E; Mueser, K T

2001-09-01

Reliability of well-known instruments was examined in 202 people with severe mental illness participating in a multisite vocational study. We examined interrater reliability of the Positive and Negative Syndrome Scale (PANSS) and the internal consistency and test-retest reliability of the PANSS, the Rosenberg Self-Esteem Scale, the Medical Outcomes Study Short Form-36 (SF-36), and the Quality of Life Interview. Most scales had good levels of reliability, with intraclass correlation coefficients (ICCs) and coefficient alphas above .70. However, the SF-36 scales were generally less stable over time, particularly Social Functioning (ICC = .55). Test-retest reliability was lower among less educated respondents and among ethnic minorities. We recommend close monitoring of psychometric issues in future multisite studies.
Validation of Attitude and Heading Reference System and Microsoft Kinect for Continuous Measurement of Cervical Range of Motion Compared to the Optical Motion Capture System.

PubMed

Song, Young Seop; Yang, Kyung Yong; Youn, Kibum; Yoon, Chiyul; Yeom, Jiwoon; Hwang, Hyeoncheol; Lee, Jehee; Kim, Keewon

2016-08-01

To compare optical motion capture system (MoCap), attitude and heading reference system (AHRS) sensor, and Microsoft Kinect for the continuous measurement of cervical range of motion (ROM). Fifteen healthy adult subjects were asked to sit in front of the Kinect camera with optical markers and AHRS sensors attached to the body in a room equipped with optical motion capture camera. Subjects were instructed to independently perform axial rotation followed by flexion/extension and lateral bending. Each movement was repeated 5 times while being measured simultaneously with 3 devices. Using the MoCap system as the gold standard, the validity of AHRS and Kinect for measurement of cervical ROM was assessed by calculating correlation coefficient and Bland-Altman plot with 95% limits of agreement (LoA). MoCap and ARHS showed fair agreement (95% LoA<10°), while MoCap and Kinect showed less favorable agreement (95% LoA>10°) for measuring ROM in all directions. Intraclass correlation coefficient (ICC) values between MoCap and AHRS in -40° to 40° range were excellent for flexion/extension and lateral bending (ICC>0.9). ICC values were also fair for axial rotation (ICC>0.8). ICC values between MoCap and Kinect system in -40° to 40° range were fair for all motions. Our study showed feasibility of using AHRS to measure cervical ROM during continuous motion with an acceptable range of error. AHRS and Kinect system can also be used for continuous monitoring of flexion/extension and lateral bending in ordinary range.
Estimating lifetime risk from spot biomarker data and intra‐class correlation coefficients (ICC)

EPA Science Inventory

Human biomarker measurements in tissues including blood, breath, and urine can serve as efficient surrogates for environmental monitoring because a single biological sample integrates personal exposure across all environmental media and uptake pathways. However, biomarkers repres...
Within-person reproducibility and sensitivity to dietary change of C15:0 and C17:0 levels in dried blood spots: Data from the European Food4Me Study.

PubMed

Albani, Viviana; Celis-Morales, Carlos; O'Donovan, Clare B; Walsh, Marianne C; Woolhead, Clara; Forster, Hannah; Fallaize, Rosalind; Macready, Anna L; Marsaux, Cyril F M; Navas-Carretero, Santiago; San-Cristobal, Rodrigo; Kolossa, Silvia; Mavrogianni, Christina; Lambrinou, Christina P; Moschonis, George; Godlewska, Magdalena; Surwillo, Agnieszka; Traczyk, Iwona; Gundersen, Thomas E; Drevon, Christian A; Daniel, Hannelore; Manios, Yannis; Martinez, J Alfredo; Saris, Wim H M; Lovegrove, Julie A; Gibney, Michael J; Gibney, Eileen R; Mathers, John C; Adamson, Ashley J; Brennan, Lorraine

2017-10-01

Previous work highlighted the potential of odd-chain length saturated fatty acids as potential markers of dairy intake. The aim of this study was to assess the reproducibility of these biomarkers and their sensitivity to changes in dairy intake. Fatty acid profiles and dietary intakes from food frequency questionnaires (FFQs) were measured three times over six months in the Food4Me Study. Reproducibility was explored through intra-class correlation coefficients (ICCs) and within-subject coefficients of variation (WCV). Sensitivity to changes in diet was examined using regression analysis. C15:0 blood levels showed high correlation over time (ICC: 0.62, 95% CI: 0.57, 0.68), however, the ICC for C17:0 was much lower (ICC: 0.32, 95% CI: 0.28, 0.46). The WCV for C15:0 was 16.6% and that for C17:0 was 14.6%. There were significant associations between changes in intakes of total dairy, high-fat dairy, cheese and butter and C15:0; and change in intakes of high-fat dairy and cream and C17:0. Results provide evidence of reproducibility of C15:0 levels over time and sensitivity to change in intake of high-fat dairy products with results comparable to the well-established biomarker of fish intake (EPA+DHA). © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
The correlation between pulsatile intracranial pressure and indices of intracranial pressure-volume reserve capacity: results from ventricular infusion testing.

PubMed

Eide, Per Kristian

2016-12-01

OBJECTIVE The objective of this study was to examine how pulsatile and static intracranial pressure (ICP) scores correlate with indices of intracranial pressure-volume reserve capacity, i.e., intracranial elastance (ICE) and intracranial compliance (ICC), as determined during ventricular infusion testing. METHODS All patients undergoing ventricular infusion testing and overnight ICP monitoring during the 6-year period from 2007 to 2012 were included in the study. Clinical data were retrieved from a quality registry, and the ventricular infusion pressure data and ICP scores were retrieved from a pressure database. The ICE and ICC (= 1/ICE) were computed during the infusion phase of the infusion test. RESULTS During the period from 2007 to 2012, 82 patients with possible treatment-dependent hydrocephalus underwent ventricular infusion testing within the department of neurosurgery. The infusion tests revealed a highly significant positive correlation between ICE and the pulsatile ICP scores mean wave amplitude (MWA) and rise-time coefficient (RTC), and the static ICP score mean ICP. The ICE was negatively associated with linear measures of ventricular size. The overnight ICP recordings revealed significantly increased MWA (> 4 mm Hg) and RTC (> 20 mm Hg/sec) values in patients with impaired ICC (< 0.5 ml/mm Hg). CONCLUSIONS In this study cohort, there was a significant positive correlation between pulsatile ICP and ICE measured during ventricular infusion testing. In patients with impaired ICC during infusion testing (ICC < 0.5 ml/mm Hg), overnight ICP recordings showed increased pulsatile ICP (MWA > 4 mm Hg, RTC > 20 mm Hg/sec), but not increased mean ICP (< 10-15 mm Hg). The present data support the assumption that pulsatile ICP (MWA and RTC) may serve as substitute markers of pressure-volume reserve capacity, i.e., ICE and ICC.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Galavis, P; Friedman, K; Chandarana, H

Purpose: Radiomics involves the extraction of texture features from different imaging modalities with the purpose of developing models to predict patient treatment outcomes. The purpose of this study is to investigate texture feature reproducibility across [18F]FDG PET/CT and [18F]FDG PET/MR imaging in patients with primary malignancies. Methods: Twenty five prospective patients with solid tumors underwent clinical [18F]FDG PET/CT scan followed by [18F]FDG PET/MR scans. In all patients the lesions were identified using nuclear medicine reports. The images were co-registered and segmented using an in-house auto-segmentation method. Fifty features, based on the intensity histogram, second and high order matrices, were extractedmore » from the segmented regions from both image data sets. One-way random-effects ANOVA model of the intra-class correlation coefficient (ICC) was used to establish texture feature correlations between both data sets. Results: Fifty features were classified based on their ICC values, which were found in the range from 0.1 to 0.86, in three categories: high, intermediate, and low. Ten features extracted from second and high-order matrices showed large ICC ≥ 0.70. Seventeen features presented intermediate 0.5 ≤ ICC ≤ 0.65 and the remaining twenty three presented low ICC ≤ 0.45. Conclusion: Features with large ICC values could be reliable candidates for quantification as they lead to similar results from both imaging modalities. Features with small ICC indicates a lack of correlation. Therefore, the use of these features as a quantitative measure will lead to different assessments of the same lesion depending on the imaging modality from where they are extracted. This study shows the importance of the need for further investigation and standardization of features across multiple imaging modalities.« less

Assessing the burden of childhood asthma: validation of electronic versions of the Mini Pediatric and Pediatric Asthma Caregiver's Quality of Life Questionnaires.

PubMed

Minard, Janice P; Thomas, Nicola J; Olajos-Clow, Jennifer G; Wasilewski, Nastasia V; Jenkins, Blaine; Taite, Ann K; Day, Andrew G; Lougheed, M Diane

2016-01-01

To validate electronic versions of the Mini Pediatric and Pediatric Asthma Caregiver's Quality of Life Questionnaires (MiniPAQLQ and PACQLQ, respectively), determine completion times and correlate QOL of children and caregivers. A total of 63 children and 64 caregivers completed the paper and electronic MiniPAQLQ or PACQLQ. Agreement between versions of each questionnaire was summarized by intraclass correlation coefficients (ICC). The correlation between MiniPAQLQ and PACQLQ scores from child-caregiver pairs was assessed using Pearson's correlation coefficient. There was no significant difference (mean difference = 0.1, 95% CI -0.1, 0.2) in MiniPAQLQ Overall Scores between paper (5.9 ± 1.0, mean ± SD) and electronic (5.8 ± 1.0) versions, or any of the domains. ICCs ranged from 0.89 (Overall) to 0.86 (Emotional Function). Overall PACQLQ scores for both versions were comparable (5.9 ± 0.9 and 5.8 ± 1.0; mean difference = 0.0; 95% CI -0.1, 0.2). ICCs ranged from 0.81 (Activity Limitation) to 0.88 (Emotional Function). The electronic PACQLQ took 26 s longer (95% CI 11, 41; p < 0.001). Few participants (3-11%) preferred the paper format. MiniPAQLQ and PACQLQ scores were significantly correlated (all p < 0.05) for Overall (r paper = 0.33, r electronic = 0.27) and Emotional Function domains (r paper = 0.34, r electronic = 0.29). These electronic QOL questionnaires are valid, and asthma-related QOL of children and caregivers is related.
Palliative sedation: reliability and validity of sedation scales.

PubMed

Arevalo, Jimmy J; Brinkkemper, Tijn; van der Heide, Agnes; Rietjens, Judith A; Ribbe, Miel; Deliens, Luc; Loer, Stephan A; Zuurmond, Wouter W A; Perez, Roberto S G M

2012-11-01

Observer-based sedation scales have been used to provide a measurable estimate of the comfort of nonalert patients in palliative sedation. However, their usefulness and appropriateness in this setting has not been demonstrated. To study the reliability and validity of observer-based sedation scales in palliative sedation. A prospective evaluation of 54 patients under intermittent or continuous sedation with four sedation scales was performed by 52 nurses. Included scales were the Minnesota Sedation Assessment Tool (MSAT), Richmond Agitation-Sedation Scale (RASS), Vancouver Interaction and Calmness Scale (VICS), and a sedation score proposed in the Guideline for Palliative Sedation of the Royal Dutch Medical Association (KNMG). Inter-rater reliability was tested with the intraclass correlation coefficient (ICC) and Cohen's kappa coefficient. Correlations between the scales using Spearman's rho tested concurrent validity. We also examined construct, discriminative, and evaluative validity. In addition, nurses completed a user-friendliness survey. Overall moderate to high inter-rater reliability was found for the VICS interaction subscale (ICC = 0.85), RASS (ICC = 0.73), and KNMG (ICC = 0.71). The largest correlation between scales was found for the RASS and KNMG (rho = 0.836). All scales showed discriminative and evaluative validity, except for the MSAT motor subscale and VICS calmness subscale. Finally, the RASS was less time consuming, clearer, and easier to use than the MSAT and VICS. The RASS and KNMG scales stand as the most reliable and valid among the evaluated scales. In addition, the RASS was less time consuming, clearer, and easier to use than the MSAT and VICS. Further research is needed to evaluate the impact of the scales on better symptom control and patient comfort. Copyright © 2012 U.S. Cancer Pain Relief Committee. Published by Elsevier Inc. All rights reserved.
Intraclass Correlation Coefficients for Obesity Indicators and Energy Balance-Related Behaviors Among New York City Public Elementary Schools.

PubMed

Gray, Heewon Lee; Burgermaster, Marissa; Tipton, Elizabeth; Contento, Isobel R; Koch, Pamela A; Di Noia, Jennifer

2016-04-01

Sample size and statistical power calculation should consider clustering effects when schools are the unit of randomization in intervention studies. The objective of the current study was to investigate how student outcomes are clustered within schools in an obesity prevention trial. Baseline data from the Food, Health & Choices project were used. Participants were 9- to 13-year-old students enrolled in 20 New York City public schools (n= 1,387). Body mass index (BMI) was calculated based on measures of height and weight, and body fat percentage was measured with a Tanita® body composition analyzer (Model SC-331s). Energy balance-related behaviors were self-reported with a frequency questionnaire. To examine the cluster effects, intraclass correlation coefficients (ICCs) were calculated as school variance over total variance for outcome variables. School-level covariates, percentage students eligible for free and reduced-price lunch, percentage Black or Hispanic, and English language learners were added in the model to examine ICC changes. The ICCs for obesity indicators are: .026 for BMI-percentile, .031 for BMIz-score, .035 for percentage of overweight students, .037 for body fat percentage, and .041 for absolute BMI. The ICC range for the six energy balance-related behaviors are .008 to .044 for fruit and vegetables, .013 to .055 for physical activity, .031 to .052 for recreational screen time, .013 to .091 for sweetened beverages, .033 to .121 for processed packaged snacks, and .020 to .083 for fast food. When school-level covariates were included in the model, ICC changes varied from -95% to 85%. This is the first study reporting ICCs for obesity-related anthropometric and behavioral outcomes among New York City public schools. The results of the study may aid sample size estimation for future school-based cluster randomized controlled trials in similar urban setting and population. Additionally, identifying school-level covariates that can reduce cluster effects is important when analyzing data. © 2015 Society for Public Health Education.
Evaluating the Consistency of Current Mainstream Wearable Devices in Health Monitoring: A Comparison Under Free-Living Conditions.

PubMed

Wen, Dong; Zhang, Xingting; Liu, Xingyu; Lei, Jianbo

2017-03-07

Wearable devices are gaining increasing market attention; however, the monitoring accuracy and consistency of the devices remains unknown. The purpose of this study was to assess the consistency of the monitoring measurements of the latest wearable devices in the state of normal activities to provide advice to the industry and support to consumers in making purchasing choices. Ten pieces of representative wearable devices (2 smart watches, 4 smart bracelets of Chinese brands or foreign brands, and 4 mobile phone apps) were selected, and 5 subjects were employed to simultaneously use all the devices and the apps. From these devices, intact health monitoring data were acquired for 5 consecutive days and analyzed on the degree of differences and the relationships of the monitoring measurements by the different devices. The daily measurements by the different devices fluctuated greatly, and the coefficient of variation (CV) fluctuated in the range of 2-38% for the number of steps, 5-30% for distance, 19-112% for activity duration, .1-17% for total energy expenditure (EE), 22-100% for activity EE, 2-44% for sleep duration, and 35-117% for deep sleep duration. After integrating the measurement data of 25 days among the devices, the measurements of the number of steps (intraclass correlation coefficient, ICC=.89) and distance (ICC=.84) displayed excellent consistencies, followed by those of activity duration (ICC=.59) and the total EE (ICC=.59) and activity EE (ICC=.57). However, the measurements for sleep duration (ICC=.30) and deep sleep duration (ICC=.27) were poor. For most devices, there was a strong correlation between the number of steps and distance measurements (R 2 >.95), and for some devices, there was a strong correlation between activity duration measurements and EE measurements (R 2 >.7). A strong correlation was observed in the measurements of steps, distance and EE from smart watches and mobile phones of the same brand, Apple or Samsung (r>.88). Although wearable devices are developing rapidly, the current mainstream devices are only reliable in measuring the number of steps and distance, which can be used as health assessment indicators. However, the measurement consistencies of activity duration, EE, sleep quality, and so on, are still inadequate, which require further investigation and improved algorithms. ©Dong Wen, Xingting Zhang, Xingyu Liu, Jianbo Lei. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 07.03.2017.
Cross-cultural adaptation and validation of the Patient-Rated Tennis Elbow Evaluation Questionnaire on lateral elbow tendinopathy for French-speaking patients.

PubMed

Kaux, Jean-François; Delvaux, François; Schaus, Jean; Demoulin, Christophe; Locquet, Médéa; Buckinx, Fanny; Beaudart, Charlotte; Dardenne, Nadia; Van Beveren, Julien; Croisier, Jean-Louis; Forthomme, Bénédicte; Bruyère, Olivier

Translation and validation of algo-functional questionnaire. The lateral elbow tendinopathy is a common injury in tennis players and physical workers. The Patient-Rated Tennis Elbow Evaluation (PRTEE) Questionnaire was specifically designed to measure pain and functional limitations in patients with lateral epicondylitis (tennis elbow). First developed in English, this questionnaire has since been translated into several languages. The aims of the study were to translate and cross-culturally adapt the PRTEE questionnaire into French and to evaluate the reliability and validity of this translated version of the questionnaire (PRTEE-F). The PRTEE was translated and cross-culturally adapted into French according to international guidelines. To assess the reliability and validity of the PRTEE-F, 115 participants were asked twice to fill in the PRTEE-F, and once the Disabilities of Arm, Shoulder and Hand Questionnaire (DASH) and the Short Form Health Survey (SF-36). Internal consistency (using Cronbach's alpha), test-retest reliability (using intraclass correlation coefficient (ICC), standard error of measurement and minimal detectable change), and convergent and divergent validity (using the Spearman's correlation coefficients respectively with the DASH and with some subscales of the SF-36) were assessed. The PRTEE was translated into French without any problems. PRTEE-F showed a good test-retest reliability for the overall score (ICC 0.86) and for each item (ICC 0.8-0.96) and a high internal consistency (Cronbach's alpha = 0.98). The correlation analyses revealed high correlation coefficients between PRTEE-F and DASH (convergent validity) and, as expected, a low or moderate correlation with the divergent subscales of the SF-36 (discriminant validity). There was no floor or ceiling effect. The PRTEE questionnaire was successfully cross-culturally adapted into French. The PRTEE-F is reliable and valid for evaluating French-speaking patients with lateral elbow tendinopathy. Copyright Â© 2016 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
Estimating the intra-cluster correlation coefficient for evaluating an educational intervention program to improve rabies awareness and dog bite prevention among children in Sikkim, India: A pilot study.

PubMed

Auplish, Aashima; Clarke, Alison S; Van Zanten, Trent; Abel, Kate; Tham, Charmaine; Bhutia, Thinlay N; Wilks, Colin R; Stevenson, Mark A; Firestone, Simon M

2017-05-01

Educational initiatives targeting at-risk populations have long been recognized as a mainstay of ongoing rabies control efforts. Cluster-based studies are often utilized to assess levels of knowledge, attitudes and practices of a population in response to education campaigns. The design of cluster-based studies requires estimates of intra-cluster correlation coefficients obtained from previous studies. This study estimates the school-level intra-cluster correlation coefficient (ICC) for rabies knowledge change following an educational intervention program. A cross-sectional survey was conducted with 226 students from 7 schools in Sikkim, India, using cluster sampling. In order to assess knowledge uptake, rabies education sessions with pre- and post-session questionnaires were administered. Paired differences of proportions were estimated for questions answered correctly. A mixed effects logistic regression model was developed to estimate school-level and student-level ICCs and to test for associations between gender, age, school location and educational level. The school- and student-level ICCs for rabies knowledge and awareness were 0.04 (95% CI: 0.01, 0.19) and 0.05 (95% CI: 0.2, 0.09), respectively. These ICCs suggest design effect multipliers of 5.45 schools and 1.05 students per school, will be required when estimating sample sizes and designing future cluster randomized trials. There was a good baseline level of rabies knowledge (mean pre-session score 71%), however, key knowledge gaps were identified in understanding appropriate behavior around scared dogs, potential sources of rabies and how to correctly order post rabies exposure precaution steps. After adjusting for the effect of gender, age, school location and education level, school and individual post-session test scores improved by 19%, with similar performance amongst boys and girls attending schools in urban and rural regions. The proportion of participants that were able to correctly order post-exposure precautionary steps following educational intervention increased by 87%. The ICC estimates presented in this study will aid in designing cluster-based studies evaluating educational interventions as part of disease control programs. This study demonstrates the likely benefits of educational intervention incorporating bite prevention and rabies education. Copyright © 2017 Elsevier B.V. All rights reserved.
Rating scale for psychogenic nonepileptic seizures: scale development and clinimetric testing.

PubMed

Cianci, Vittoria; Ferlazzo, Edoardo; Condino, Francesca; Mauvais, Hélène Somma; Farnarier, Guy; Labate, Angelo; Latella, Maria Adele; Gasparini, Sara; Branca, Damiano; Pucci, Franco; Vazzana, Francesco; Gambardella, Antonio; Aguglia, Umberto

2011-06-01

Our aim was to develop a clinimetric scale evaluating motor phenomena, associated features, and severity of psychogenic nonepileptic seizures (PNES). Sixty video/EEG-recorded PNES induced by suggestion maneuvers were evaluated. We examined the relationship between results from this scale and results from the Clinical Global Impression (CGI) scale to validate this technique. Interrater reliabilities of the PNES scale for three raters were analyzed using the AC1 statistic, Kendall's coefficient of concordance (KCC), and intraclass correlation coefficients (ICCs). The relationship between the CGI and PNES scales was evaluated with Spearman correlations. The AC1 statistic demonstrated good interrater reliability for each phenomenon analyzed (tremor/oscillation, tonic; clonic/jerking, hypermotor/agitation, atonic/akinetic, automatisms, associated features). KCC and the ICC showed moderate interrater agreement for phenomenology, associated phenomena, and total PNES scores. Spearman's correlation of mean CGI score with mean total PNES score was 0.69 (P<0.001). The scale described here accurately evaluates the phenomenology of PNES and could be used to assess and compare subgroups of patients with PNES. Copyright © 2011 Elsevier Inc. All rights reserved.
[Concordance of glomerular filtration rate with creatinine clearance in 24-hour urine and Schwartz and Schwartz updated].

PubMed

Salazar-Gutiérrez, María Luisa; Ochoa-Ponce, Cristina; Lona-Reyes, Juan Carlos; Gutiérrez-Íñiguez, Sara Ivonne

Reference methods for the quantification of the glomerular filtration rate (GFR) are difficult to use in clinical practice; formulas for evaluating GFR based on serum creatinine (SCr) and/or creatinine clearance are used. The aim of this study was to quantify the correlation and concordance of GFR with creatinine clearance in 24-hour urine (GFR24) and Schwartz and Schwartz updated formulas. Cross-sectional study involving healthy pediatric patients and with chronic kidney disease (CKD) from 5 to 16.9 years. Linear correlation between GFR 24 and two formulas was evaluated with the Pearson correlation coefficient (r) and intraclass correlation coefficient (ICC). We studied 134 patients, of which 59.7% were male. Mean age was 10.8 years. The average GFR24 was 140.34ml/min/1.73m 2 ; 34.3% (n=46) had GFR <90ml/min/1.73m 2 . Moderate linear correlation between GFR24 and Schwartz (r= 0.63) and Schwartz updated (r= 0.65) formulas was observed. There was good concordance between the GFR24 and Schwartz (ICC= 0.77) and updated Schwartz (ICC= 0.77) formulas. Schwartz classical formula in patients with GFR24 ≥ 90ml/min/1.73m 2 estimated higher values, while Schwartz updated underestimated values. There is moderate correlation and good concordance between the GFR24 and Schwartz and Schwartz updated formulas. The concordance was better in patients with obesity and lower in women, patients with hyperfiltration and normal weight. Copyright © 2016 Hospital Infantil de México Federico Gómez. Publicado por Masson Doyma México S.A. All rights reserved.
A systematic review of statistical methods used to test for reliability of medical instruments measuring continuous variables.

PubMed

Zaki, Rafdzah; Bulgiba, Awang; Nordin, Noorhaire; Azina Ismail, Noor

2013-06-01

Reliability measures precision or the extent to which test results can be replicated. This is the first ever systematic review to identify statistical methods used to measure reliability of equipment measuring continuous variables. This studyalso aims to highlight the inappropriate statistical method used in the reliability analysis and its implication in the medical practice. In 2010, five electronic databases were searched between 2007 and 2009 to look for reliability studies. A total of 5,795 titles were initially identified. Only 282 titles were potentially related, and finally 42 fitted the inclusion criteria. The Intra-class Correlation Coefficient (ICC) is the most popular method with 25 (60%) studies having used this method followed by the comparing means (8 or 19%). Out of 25 studies using the ICC, only 7 (28%) reported the confidence intervals and types of ICC used. Most studies (71%) also tested the agreement of instruments. This study finds that the Intra-class Correlation Coefficient is the most popular method used to assess the reliability of medical instruments measuring continuous outcomes. There are also inappropriate applications and interpretations of statistical methods in some studies. It is important for medical researchers to be aware of this issue, and be able to correctly perform analysis in reliability studies.
Portuguese community pharmacists' attitudes to and knowledge of antibiotic misuse: questionnaire development and reliability.

PubMed

Roque, Fátima; Soares, Sara; Breitenfeld, Luiza; Gonzalez-Gonzalez, Cristian; Figueiras, Adolfo; Herdeiro, Maria Teresa

2014-01-01

To develop and evaluate the reliability of a self-administered questionnaire designed to assess the attitudes and knowledge of community pharmacists in Portugal about microbial resistance and the antibiotic dispensing process. This study was divided into the following three stages: (1) design of the questionnaire, which included a literature review and a qualitative study with focus-group sessions; (2) assessment of face and content validity, using a panel of experts and a pre-test of community pharmacists; and, (3) pilot study and reliability analysis, which included a test-retest study covering fifty practising pharmacists based at community pharmacies in five districts situated in Northern Portugal. Questionnaire reproducibility was quantified using the intraclass correlation coefficient (ICC; 95% confidence interval) computed by means of one-way analysis of variance (ANOVA). Internal consistency was evaluated using Cronbach's alpha. The correlation coefficients were fair to good (ICC>0.4) for all statements (scale-items) regarding knowledge of and attitudes to antibiotic resistance, and ranged from fair to good to excellent for statements about situations in which pharmacists acknowledged that antibiotics were sometimes dispensed without a medical prescription (ICC>0.8). Cronbach's alpha for this section was 0.716. The questionnaire designed in this study is valid and reliable in terms of content validity, face validity and reproducibility.
Validity and Reliability of the PUSH Wearable Device to Measure Movement Velocity During the Back Squat Exercise.

PubMed

Balsalobre-Fernández, Carlos; Kuzdub, Matt; Poveda-Ortiz, Pedro; Campo-Vecino, Juan Del

2016-07-01

Balsalobre-Fernández, C, Kuzdub, M, Poveda-Ortiz, P, and Campo-Vecino, Jd. Validity and reliability of the PUSH wearable device to measure movement velocity during the back squat exercise. J Strength Cond Res 30(7): 1968-1974, 2016-The purpose of this study was to analyze the validity and reliability of a wearable device to measure movement velocity during the back squat exercise. To do this, 10 recreationally active healthy men (age = 23.4 ± 5.2 years; back squat 1 repetition maximum [1RM] = 83 ± 8.2 kg) performed 3 repetitions of the back squat exercise with 5 different loads ranging from 25 to 85% 1RM on a Smith Machine. Movement velocity for each of the total 150 repetitions was simultaneously recorded using the T-Force linear transducer (LT) and the PUSH wearable band. Results showed a high correlation between the LT and the wearable device mean (r = 0.85; standard error of estimate [SEE] = 0.08 m·s) and peak velocity (r = 0.91, SEE = 0.1 m·s). Moreover, there was a very high agreement between these 2 devices for the measurement of mean (intraclass correlation coefficient [ICC] = 0.907) and peak velocity (ICC = 0.944), although a systematic bias between devices was observed (PUSH peak velocity being -0.07 ± 0.1 m·s lower, p ≤ 0.05). When measuring the 3 repetitions with each load, both devices displayed almost equal reliability (Test-retest reliability: LT [r = 0.98], PUSH [r = 0.956]; ICC: LT [ICC = 0.989], PUSH [ICC = 0.981]; coefficient of variation [CV]: LT [CV = 4.2%], PUSH [CV = 5.0%]). Finally, individual load-velocity relationships measured with both the LT (R = 0.96) and the PUSH wearable device (R = 0.94) showed similar, very high coefficients of determination. In conclusion, these results support the use of an affordable wearable device to track velocity during back squat training. Wearable devices, such as the one in this study, could have valuable practical applications for strength and conditioning coaches.
The reliability, precision and clinically meaningful change of walking assessments in multiple sclerosis.

PubMed

Learmonth, Yvonne C; Dlugonski, Deirdre D; Pilutti, Lara A; Sandroff, Brian M; Motl, Robert W

2013-11-01

Assessing walking impairment in those with multiple sclerosis (MS) is common, however little is known about the reliability, precision and clinically important change of walking outcomes. The purpose of this study was to determine the reliability, precision and clinically important change of the Timed 25-Foot Walk (T25FW), Six-Minute Walk (6MW), Multiple Sclerosis Walking Scale-12 (MSWS-12) and accelerometry. Data were collected from 82 persons with MS at two time points, six months apart. Analyses were undertaken for the whole sample and stratified based on disability level and usage of walking aids. Intraclass correlation coefficient (ICC) analyses established reliability: standard error of measurement (SEM) and coefficient of variation (CV) determined precision; and minimal detectable change (MDC) defined clinically important change. All outcome measures were reliable with precision and MDC varying between measures in the whole sample: T25FW: ICC=0.991; SEM=1 s; CV=6.2%; MDC=2.7 s (36%), 6MW: ICC=0.959; SEM=32 m; CV=6.2%; MDC=88 m (20%), MSWS-12: ICC=0.927; SEM=8; CV=27%; MDC=22 (53%), accelerometry counts/day: ICC=0.883; SEM=28450; CV=17%; MDC=78860 (52%), accelerometry steps/day: ICC=0.907; SEM=726; CV=16%; MDC=2011 (45%). Variation in these estimates was seen based on disability level and walking aid. The reliability of these outcomes is good and falls within acceptable ranges. Precision and clinically important change estimates provide guidelines for interpreting these outcomes in clinical and research settings.
Translation, cross-cultural adaptation and validation of the Bulgarian version of the Dizziness Handicap Inventory.

PubMed

Georgieva-Zhostova, Spaska; Kolev, Ognyan I; Stambolieva, Katerina

2014-09-01

The aim of the present study was the translation, cross-cultural adaptation and validation of the Dizziness Handicap Inventory in Bulgarian language (DHI-BG). Ninety-seven vestibular patients (19 men and 78 women, mean age 45.08 ± 13.85 years) took part in the investigation. All participants were asked to fill in the DHI-BG. Internal consistency was estimated using Cronbach's alpha and item-total correlation, reproducibility by calculating Bland-Altman's limits of agreement and intraclass correlation coefficients (ICCs). Associations were estimated by Spearman's correlation coefficients. The Cronbach's alpha for the total score, functional, physical and emotional subscales of DHI-BG were 0.88, 0.75, 0.72 and 0.81. The floor and ceiling effects of the DHI-BG total scale were evaluated with respect to the limits of agreement which were ±9.4-14.53 points. Intraclass correlation coefficients (ICCs) for all scale and subscales were higher than the recommended value of 0.75 and determined good test-retest reliability. The range of items correlation for DHI-BG was from 0.27 (item 12) to 0.72 (item 3). No significant differences were observed in the Cronbach's alpha coefficients between the DHI-BG and the original version, the German and Italian versions of the questionnaire. The most significant difference was observed in comparison with the German version of DHI. Construct validity presented a moderate correlation between Romberg coefficients and DHI-BG scores and strong correlation between all scores of DHI and the self-perceived disability. The results suggest that DHI-BG scores show a good discriminative validity between groups with different levels of self-assessed disability. The Bulgarian version of the DHI is a reliable and valid tool in assessing the impact of dizziness on the quality of life in Bulgarian vestibular patients.
The Contributions of Near Work and Outdoor Activity to the Correlation Between Siblings in the Collaborative Longitudinal Evaluation of Ethnicity and Refractive Error (CLEERE) Study

PubMed Central

Jones-Jordan, Lisa A.; Sinnott, Loraine T.; Graham, Nicholas D.; Cotter, Susan A.; Kleinstein, Robert N.; Manny, Ruth E.; Mutti, Donald O.; Twelker, J. Daniel; Zadnik, Karla

2014-01-01

Purpose. We determined the correlation between sibling refractive errors adjusted for shared and unique environmental factors using data from the Collaborative Longitudinal Evaluation of Ethnicity and Refractive Error (CLEERE) Study. Methods. Refractive error from subjects' last study visits was used to estimate the intraclass correlation coefficient (ICC) between siblings. The correlation models used environmental factors (diopter-hours and outdoor/sports activity) assessed annually from parents by survey to adjust for shared and unique environmental exposures when estimating the heritability of refractive error (2*ICC). Results. Data from 700 families contributed to the between-sibling correlation for spherical equivalent refractive error. The mean age of the children at the last visit was 13.3 ± 0.90 years. Siblings engaged in similar amounts of near and outdoor activities (correlations ranged from 0.40–0.76). The ICC for spherical equivalent, controlling for age, sex, ethnicity, and site was 0.367 (95% confidence interval [CI] = 0.304, 0.420), with an estimated heritability of no more than 0.733. After controlling for these variables, and near and outdoor/sports activities, the resulting ICC was 0.364 (95% CI = 0.304, 0.420; estimated heritability no more than 0.728, 95% CI = 0.608, 0.850). The ICCs did not differ significantly between male–female and single sex pairs. Conclusions. Adjusting for shared family and unique, child-specific environmental factors only reduced the estimate of refractive error correlation between siblings by 0.5%. Consistent with a lack of association between myopia progression and either near work or outdoor/sports activity, substantial common environmental exposures had little effect on this correlation. Genetic effects appear to have the major role in determining the similarity of refractive error between siblings. PMID:25205866
Reproducibility of a peripheral quantitative computed tomography scan protocol to measure the material properties of the second metatarsal.

PubMed

Chaplais, Elodie; Greene, David; Hood, Anita; Telfer, Scott; du Toit, Verona; Singh-Grewal, Davinder; Burns, Joshua; Rome, Keith; Schiferl, Daniel J; Hendry, Gordon J

2014-07-19

Peripheral quantitative computed tomography (pQCT) is an established technology that allows for the measurement of the material properties of bone. Alterations to bone architecture are associated with an increased risk of fracture. Further pQCT research is necessary to identify regions of interest that are prone to fracture risk in people with chronic diseases. The second metatarsal is a common site for the development of insufficiency fractures, and as such the aim of this study was to assess the reproducibility of a novel scanning protocol of the second metatarsal using pQCT. Eleven embalmed cadaveric leg specimens were scanned six times; three times with and without repositioning. Each foot was positioned on a custom-designed acrylic foot plate to permit unimpeded scans of the region of interest. Sixty-six scans were obtained at 15% (distal) and 50% (mid shaft) of the second metatarsal. Voxel size and scan speed were reduced to 0.40 mm and 25 mm.sec(-1). The reference line was positioned at the most distal portion of the 2(nd) metatarsal. Repeated measurements of six key variables related to bone properties were subject to reproducibility testing. Data were log transformed and reproducibility of scans were assessed using intraclass correlation coefficients (ICC) and coefficients of variation (CV%). Reproducibility of the measurements without repositioning were estimated as: trabecular area (ICC 0.95; CV% 2.4), trabecular density (ICC 0.98; CV% 3.0), Strength Strain Index (SSI) - distal (ICC 0.99; CV% 5.6), cortical area (ICC 1.0; CV% 1.5), cortical density (ICC 0.99; CV% 0.1), SSI - mid shaft (ICC 1.0; CV% 2.4). Reproducibility of the measurements after repositioning were estimated as: trabecular area (ICC 0.96; CV% 2.4), trabecular density (ICC 0.98; CV% 2.8), SSI - distal (ICC 1.0; CV% 3.5), cortical area (ICC 0.99; CV%2.4), cortical density (ICC 0.98; CV% 0.8), SSI - mid shaft (ICC 0.99; CV% 3.2). The scanning protocol generated excellent reproducibility for key bone properties measured at the distal and mid-shaft regions of the 2(nd) metatarsal. This protocol extends the capabilities of pQCT to evaluate bone quality in people who may be at an increased risk of metatarsal insufficiency fractures.
Reproducibility of a peripheral quantitative computed tomography scan protocol to measure the material properties of the second metatarsal

PubMed Central

2014-01-01

Background Peripheral quantitative computed tomography (pQCT) is an established technology that allows for the measurement of the material properties of bone. Alterations to bone architecture are associated with an increased risk of fracture. Further pQCT research is necessary to identify regions of interest that are prone to fracture risk in people with chronic diseases. The second metatarsal is a common site for the development of insufficiency fractures, and as such the aim of this study was to assess the reproducibility of a novel scanning protocol of the second metatarsal using pQCT. Methods Eleven embalmed cadaveric leg specimens were scanned six times; three times with and without repositioning. Each foot was positioned on a custom-designed acrylic foot plate to permit unimpeded scans of the region of interest. Sixty-six scans were obtained at 15% (distal) and 50% (mid shaft) of the second metatarsal. Voxel size and scan speed were reduced to 0.40 mm and 25 mm.sec-1. The reference line was positioned at the most distal portion of the 2nd metatarsal. Repeated measurements of six key variables related to bone properties were subject to reproducibility testing. Data were log transformed and reproducibility of scans were assessed using intraclass correlation coefficients (ICC) and coefficients of variation (CV%). Results Reproducibility of the measurements without repositioning were estimated as: trabecular area (ICC 0.95; CV% 2.4), trabecular density (ICC 0.98; CV% 3.0), Strength Strain Index (SSI) - distal (ICC 0.99; CV% 5.6), cortical area (ICC 1.0; CV% 1.5), cortical density (ICC 0.99; CV% 0.1), SSI – mid shaft (ICC 1.0; CV% 2.4). Reproducibility of the measurements after repositioning were estimated as: trabecular area (ICC 0.96; CV% 2.4), trabecular density (ICC 0.98; CV% 2.8), SSI - distal (ICC 1.0; CV% 3.5), cortical area (ICC 0.99; CV%2.4), cortical density (ICC 0.98; CV% 0.8), SSI – mid shaft (ICC 0.99; CV% 3.2). Conclusions The scanning protocol generated excellent reproducibility for key bone properties measured at the distal and mid-shaft regions of the 2nd metatarsal. This protocol extends the capabilities of pQCT to evaluate bone quality in people who may be at an increased risk of metatarsal insufficiency fractures. PMID:25037451
Accounting for twin births in sample size calculations for randomised trials.

PubMed

Yelland, Lisa N; Sullivan, Thomas R; Collins, Carmel T; Price, David J; McPhee, Andrew J; Lee, Katherine J

2018-05-04

Including twins in randomised trials leads to non-independence or clustering in the data. Clustering has important implications for sample size calculations, yet few trials take this into account. Estimates of the intracluster correlation coefficient (ICC), or the correlation between outcomes of twins, are needed to assist with sample size planning. Our aims were to provide ICC estimates for infant outcomes, describe the information that must be specified in order to account for clustering due to twins in sample size calculations, and develop a simple tool for performing sample size calculations for trials including twins. ICCs were estimated for infant outcomes collected in four randomised trials that included twins. The information required to account for clustering due to twins in sample size calculations is described. A tool that calculates the sample size based on this information was developed in Microsoft Excel and in R as a Shiny web app. ICC estimates ranged between -0.12, indicating a weak negative relationship, and 0.98, indicating a strong positive relationship between outcomes of twins. Example calculations illustrate how the ICC estimates and sample size calculator can be used to determine the target sample size for trials including twins. Clustering among outcomes measured on twins should be taken into account in sample size calculations to obtain the desired power. Our ICC estimates and sample size calculator will be useful for designing future trials that include twins. Publication of additional ICCs is needed to further assist with sample size planning for future trials. © 2018 John Wiley & Sons Ltd.
Repeatability of knee impulsive loading measurements with skin-mounted accelerometers and lower limb surface electromyographic recordings during gait in knee osteoarthritic and asymptomatic individuals

PubMed Central

Lyytinen, T.; Bragge, T.; Hakkarainen, M.; Liikavainio, T.; Karjalainen, P.A.; Arokoski, J.P.

2016-01-01

Objectives: To determine the repeatability of knee joint impulsive loading measurements with skin-mounted accelerometers (SMAs) and lower limb surface electromyography (EMG) recordings during gait. Methods: Triaxial SMA and EMG from 4 muscles during level and stair walking in nine healthy and nine knee osteoarthritis (OA) subjects were used. The initial peak acceleration (IPA), root mean square (RMS), maximal acceleration transient rate (ATRmax) and mean EMG activity (EMGact) were calculated. The coefficient of variation (CV) and the intraclass correlation coefficient (ICC) were calculated to measure repeatability. Results: The CV and ICC of RMS accelerations ranged from 4.9% to 10.9% and from 0.69 to 0.96 in both study groups during level walking. The CV and ICC of IPA and ATRmax varied from 7.7% to 14.2% and from 0.85 to 0.99 during level and stairs up walking in healthy subjects. The CV and ICC of EMGact ranged from 8.3% to 31.7% and from 0.16 to 0.97 in both study groups. Conclusions: RMS accelerations exhibited good repeatability during walking in healthy and knee OA subjects. The repeatability of EMG measurements was acceptable in healthy subjects depending on the measured muscles. PMID:26944825
Regional oxygen saturation index (rSO2) in brachioradialis and deltoid muscle. Correlation and prognosis in patients with respiratory sepsis.

PubMed

Rodríguez, A; Claverias, L; Marín, J; Magret, M; Rosich, S; Bodí, M; Trefler, S; Pascual, S; Gea, J

2015-03-01

To compare oxygen saturation index (rSO2) obtained simultaneously in two different brachial muscles. Prospective and observational study. Intensive care unit. Critically ill patients with community-acquired pneumonia. Two probes of NIRS device (INVOS 5100) were simultaneously placed on the brachioradialis (BR) and deltoid (D) muscles. rSO2 measurements were recorded at baseline (ICU admission) and at 24h. Demographic and clinical variables were registered. Pearson's correlation coefficient was used to assess the association between continuous variables. The consistency of the correlation was assessed using the intraclass correlation coefficient (ICC) and Bland-Altman plot. The predictive value of the rSO2 for mortality was calculated by ROC curve. Nineteen patients were included with an ICU mortality of 21.1%. The rSO2 values at baseline and at 24h were significantly higher in D than in BR muscle. Values obtained simultaneously in both limbs showed a strong correlation and adequate consistency: BR (r=0.95; p<0.001; ICC=0.94; 95% CI: 0.90-0.96; p<0.001), D (r=0.88; p=0.01; ICC=0.88; 95% CI: 0.80-0.90; p>0.001) but a wide limit of agreement. Non-survivors had rSO2 values significantly lower than survivors at all times of the study. No patient with rSO2 >60% in BR died, and only 17.6% died with an rSO2 value >60% in D. Both muscles showed consistent discriminatory power for mortality. Both BR and D muscles were appropriate for measuring rSO2. Copyright © 2013 Elsevier España, S.L.U. and SEMICYUC. All rights reserved.
The reliability of three psoriasis assessment tools: Psoriasis area and severity index, body surface area and physician global assessment.

PubMed

Bożek, Agnieszka; Reich, Adam

2017-08-01

A wide variety of psoriasis assessment tools have been proposed to evaluate the severity of psoriasis in clinical trials and daily practice. The most frequently used clinical instrument is the psoriasis area and severity index (PASI); however, none of the currently published severity scores used for psoriasis meets all the validation criteria required for an ideal score. The aim of this study was to compare and assess the reliability of 3 commonly used assessment instruments for psoriasis severity: the psoriasis area and severity index (PASI), body surface area (BSA) and physician global assessment (PGA). On the scoring day, 10 trained dermatologists evaluated 9 adult patients with plaque-type psoriasis using the PASI, BSA and PGA. All the subjects were assessed twice by each physician. Correlations between the assessments were analyzed using the Pearson correlation coefficient. Intra-class correlation coefficient (ICC) was calculated to analyze intra-rater reliability, and the coefficient of variation (CV) was used to assess inter-rater variability. Significant correlations were observed among the 3 scales in both assessments. In all 3 scales the ICCs were > 0.75, indicating high intra-rater reliability. The highest ICC was for the BSA (0.96) and the lowest one for the PGA (0.87). The CV for the PGA and PASI were 29.3 and 36.9, respectively, indicating moderate inter-rater variability. The CV for the BSA was 57.1, indicating high inter-rater variability. Comparing the PASI, PGA and BSA, it was shown that the PGA had the highest inter-rater reliability, whereas the BSA had the highest intra-rater reliability. The PASI showed intermediate values in terms of interand intra-rater reliability. None of the 3 assessment instruments showed a significant advantage over the other. A reliable assessment of psoriasis severity requires the use of several independent evaluations simultaneously.

Comparison of biometric measurements obtained by the Verion Image-Guided System versus the auto-refracto-keratometer.

PubMed

Velasco-Barona, Cecilio; Cervantes-Coste, Guadalupe; Mendoza-Schuster, Erick; Corredor-Ortega, Claudia; Casillas-Chavarín, Nadia L; Silva-Moreno, Alejandro; Garza-León, Manuel; Gonzalez-Salinas, Roberto

2018-06-01

To compare the biometric measurements obtained from the Verion Image-Guided System to those obtained by auto-refracto-keratometer in normal eyes. This is a prospective, observational, comparative study conducted at the Asociación para Evitar la Ceguera en México I.A.P., Mexico. Three sets of keratometry measurements were obtained using the image-guided system to assess the coefficient of variation, the within-subject standard deviation and intraclass correlation coefficient (ICC). A paired Student t test was used to assess statistical significance between the Verion and the auto-refracto-keratometer. A Pearson's correlation coefficient (r) was obtained for all measurements, and the level of agreement was verified using Bland-Altman plots. The right eyes of 73 patients were evaluated by each platform. The Verion coefficient of variation was 0.3% for the flat and steep keratometry, with the ICC being greater than 0.9 for all parameters measured. Paired t test showed statistically significant differences between groups (P = 0.0001). A good correlation was evidenced for keratometry values between platforms (r = 0.903, P = 0.0001 for K1, and r = 0.890, P = 0.0001). Bland-Altman plots showed a wide data spread for all variables. The image-guided system provided highly repeatable corneal power and keratometry measurements. However, significant differences were evidenced between the two platforms, and although values were highly correlated, they showed a wide data spread for all analysed variables; therefore, their interchangeable use for biometry assessment is not advisable.
Elder abuse and socioeconomic inequalities: a multilevel study in 7 European countries.

PubMed

Fraga, Sílvia; Lindert, Jutta; Barros, Henrique; Torres-González, Francisco; Ioannidi-Kapolou, Elisabeth; Melchiorre, Maria Gabriella; Stankunas, Mindaugas; Soares, Joaquim F

2014-04-01

To compare the prevalence of elder abuse using a multilevel approach that takes into account the characteristics of participants as well as socioeconomic indicators at city and country level. In 2009, the project on abuse of elderly in Europe (ABUEL) was conducted in seven cities (Stuttgart, Germany; Ancona, Italy; Kaunas, Lithuania, Stockholm, Sweden; Porto, Portugal; Granada, Spain; Athens, Greece) comprising 4467 individuals aged 60-84 years. We used a 3-level hierarchical structure of data: 1) characteristics of participants; 2) mean of tertiary education of each city; and 3) country inequality indicator (Gini coefficient). Multilevel logistic regression was used and proportional changes in Intraclass Correlation Coefficient (ICC) were inspected to assert explained variance between models. The prevalence of elder abuse showed large variations across sites. Adding tertiary education to the regression model reduced the country level variance for psychological abuse (ICC=3.4%), with no significant decrease in the explained variance for the other types of abuse. When the Gini coefficient was considered, the highest drop in ICC was observed for financial abuse (from 9.5% to 4.3%). There is a societal and community level dimension that adds information to individual variability in explaining country differences in elder abuse, highlighting underlying socioeconomic inequalities leading to such behavior. Copyright © 2014 Elsevier Inc. All rights reserved.
Estimates of Intraclass Correlation for Variables Related to Behavioral HIV/STD Prevention in a Predominantly African American and Hispanic Sample of Young Women

ERIC Educational Resources Information Center

Pals, Sherri L.; Beaty, Brenda L.; Posner, Samuel F.; Bull, Sheana S.

2009-01-01

Studies designed to evaluate HIV and STD prevention interventions often involve random assignment of groups such as neighborhoods or communities to study conditions (e.g., to intervention or control). Investigators who design group-randomized trials (GRTs) must take the expected intraclass correlation coefficient (ICC) into account in sample size…
Spatiotemporal image correlation-derived volumetric Doppler impedance indices from spherical samples of the placenta: intraobserver reliability and correlation with conventional umbilical artery Doppler indices.

PubMed

Welsh, A W; Hou, M; Meriki, N; Martins, W P

2012-10-01

Volumetric impedance indices derived from spatiotemporal image correlation (STIC) power Doppler ultrasound (PDU) might overcome the influence of machine settings and attenuation. We examined the feasibility of obtaining these indices from spherical samples of anterior placentas in healthy pregnancies, and assessed intraobserver reliability and correlation with conventional umbilical artery (UA) impedance indices. Uncomplicated singleton pregnancies with anterior placenta were included in the study. A single observer evaluated UA pulsatility index (PI), resistance index (RI) and systolic/diastolic ratio (S/D) and acquired three STIC-PDU datasets from the placenta just above the placental cord insertion. Another observer analyzed the STIC-PDU datasets using Virtual Organ Computer-aided AnaLysis (VOCAL) spherical samples from every frame to determine the vascularization index (VI) and vascularization flow index (VFI); maximum, minimum and average values were used to determine the three volumetric impedance indices (vPI, vRI, vS/D). Intraobserver reliability was examined by intraclass correlation coefficients (ICC) and association between volumetric indices from placenta, and UA Doppler indices were assessed by Pearson's correlation coefficient. A total of 25 pregnant women were evaluated but five were excluded because of artifacts observed during analysis. The reliability of measurement of volumetric indices of both VI and VFI from three STIC-PDU datasets was similar, with all ICCs ≥ 0.78. Pearson's r values showed a weak and non-significant correlation between UA pulsed-wave Doppler indices and their respective volumetric indices from spherical samples of placenta (all r ≥ 0.23). VOCAL indices from specific phases of the cardiac cycle showed good repeatability (ICC ≥ 0.92). Volumetric impedance indices determined from spherical samples of placenta are sufficiently reliable but do not correlate with UA Doppler indices in healthy pregnancies. Copyright © 2012 ISUOG. Published by John Wiley & Sons, Ltd.
Within-Family Variability in Representations of Past Relationships With Parents

PubMed Central

Tucker, Corinna Jenkins; Fingerman, Karen; Savla, Jyoti

2009-01-01

Background We examined within-family variation in siblings’ memories of experiences with parents and their associations with current positive and negative affect. Methods Participants were 1,369 adults with at least 1 sibling, aged 26–74 years from 498 families in the MacArthur Study of Midlife in the United States (Mage = 47 years, 59% women, 94% White). Results There was considerable variability in recalled maternal and paternal treatment across the dimensions of affection (intraclass correlation coefficients [ICCs] 0.33 and 0.41, respectively), discipline (ICCs 0.39 and 0.43), and conflict (ICCs 0.24 and 0.26). In turn, recalled parental treatment, particularly affection, made unique contributions to current positive (ICC 0.12) and negative affect (ICC 0.08) over and above individual and familial level characteristics such as offspring demographic characteristics, extraversion and neuroticism, family structure, recalled early family environment, and parents' current status. Conclusions Results link adults' memories of experiences with their parents in childhood to their current well-being and highlight the importance of considering within-family models for family theory. PMID:19176488
Inter- and intra-rater reliability of calliper-based lymph node measurement in dogs with peripheral nodal lymphomas.

PubMed

Childress, M O; Fulkerson, C M; Lahrman, S A; Weng, H-Y

2016-08-01

The purpose of this study was to assess reliability of lymph node measurements between and within raters in dogs with nodal lymphomas. Three raters measured lymph nodes from 20 dogs twice prior to and once after administering chemotherapy. Sum tumour volume (TV) and sum longest diameter (LD) of all lymph nodes at each time point, and the percent change in measurements following chemotherapy, were calculated for each dog. Inter- and intra-rater reliability were assessed with the intraclass correlation coefficient (ICC). ICC for inter-rater sum TV and sum LD prior to chemotherapy were 0.86 and 0.80, respectively. ICC for inter-rater sum TV and sum LD after chemotherapy were 0.95 and 0.91, respectively. ICC for percent change in sum TV and sum LD were 0.96 and 0.94, respectively. ICC for intra-rater reliability ranged from 0.90 to 0.98 for each rater. Inter- and intra-rater reliability in measurements among the three raters was good to excellent. © 2014 John Wiley & Sons Ltd.
Unidirectional Expiratory Valve Method to Assess Maximal Inspiratory Pressure in Individuals without Artificial Airway.

PubMed

Grams, Samantha Torres; Kimoto, Karen Yumi Mota; Azevedo, Elen Moda de Oliveira; Lança, Marina; Albuquerque, André Luis Pereira de; Brito, Christina May Moran de; Yamaguti, Wellington Pereira

2015-01-01

Maximal Inspiratory Pressure (MIP) is considered an effective method to estimate strength of inspiratory muscles, but still leads to false positive diagnosis. Although MIP assessment with unidirectional expiratory valve method has been used in patients undergoing mechanical ventilation, no previous studies investigated the application of this method in subjects without artificial airway. This study aimed to compare the MIP values assessed by standard method (MIPsta) and by unidirectional expiratory valve method (MIPuni) in subjects with spontaneous breathing without artificial airway. MIPuni reproducibility was also evaluated. This was a crossover design study, and 31 subjects performed MIPsta and MIPuni in a random order. MIPsta measured MIP maintaining negative pressure for at least one second after forceful expiration. MIPuni evaluated MIP using a unidirectional expiratory valve attached to a face mask and was conducted by two evaluators (A and B) at two moments (Tests 1 and 2) to determine interobserver and intraobserver reproducibility of MIP values. Intraclass correlation coefficient (ICC[2,1]) was used to determine intraobserver and interobserver reproducibility. The mean values for MIPuni were 14.3% higher (-117.3 ± 24.8 cmH2O) than the mean values for MIPsta (-102.5 ± 23.9 cmH2O) (p<0.001). Interobserver reproducibility assessment showed very high correlation for Test 1 (ICC[2,1] = 0.91), and high correlation for Test 2 (ICC[2,1] = 0.88). The assessment of the intraobserver reproducibility showed high correlation for evaluator A (ICC[2,1] = 0.86) and evaluator B (ICC[2,1] = 0.77). MIPuni presented higher values when compared with MIPsta and proved to be reproducible in subjects with spontaneous breathing without artificial airway.
The Factor Structure of the Spiritual Well-Being Scale in Veterans Experienced Chemical Weapon Exposure.

PubMed

Sharif Nia, Hamid; Pahlevan Sharif, Saeed; Boyle, Christopher; Yaghoobzadeh, Ameneh; Tahmasbi, Bahram; Rassool, G Hussein; Taebei, Mozhgan; Soleimani, Mohammad Ali

2018-04-01

This study aimed to determine the factor structure of the spiritual well-being among a sample of the Iranian veterans. In this methodological research, 211 male veterans of Iran-Iraq warfare completed the Paloutzian and Ellison spiritual well-being scale. Maximum likelihood (ML) with oblique rotation was used to assess domain structure of the spiritual well-being. The construct validity of the scale was assessed using confirmatory factor analysis (CFA), convergent validity, and discriminant validity. Reliability was evaluated with Cronbach's alpha, Theta (θ), and McDonald Omega (Ω) coefficients, intra-class correlation coefficient (ICC), and construct reliability (CR). Results of ML and CFA suggested three factors which were labeled "relationship with God," "belief in fate and destiny," and "life optimism." The ICC, coefficients of the internal consistency, and CR were >.7 for the factors of the scale. Convergent validity and discriminant validity did not fulfill the requirements. The Persian version of spiritual well-being scale demonstrated suitable validity and reliability among the veterans of Iran-Iraq warfare.
Reliability and Concurrent Validity of the Narrow Path Walking Test in Persons With Multiple Sclerosis.

PubMed

Rosenblum, Uri; Melzer, Itshak

2017-01-01

About 90% of people with multiple sclerosis (PwMS) have gait instability and 50% fall. Reliable and clinically feasible methods of gait instability assessment are needed. The study investigated the reliability and validity of the Narrow Path Walking Test (NPWT) under single-task (ST) and dual-task (DT) conditions for PwMS. Thirty PwMS performed the NPWT on 2 different occasions, a week apart. Number of Steps, Trial Time, Trial Velocity, Step Length, Number of Step Errors, Number of Cognitive Task Errors, and Number of Balance Losses were measured. Intraclass correlation coefficients (ICC2,1) were calculated from the average values of NPWT parameters. Absolute reliability was quantified from standard error of measurement (SEM) and smallest real difference (SRD). Concurrent validity of NPWT with Functional Reach Test, Four Square Step Test (FSST), 12-item Multiple Sclerosis Walking Scale (MSWS-12), and 2 Minute Walking Test (2MWT) was determined using partial correlations. Intraclass correlation coefficients (ICCs) for most NPWT parameters during ST and DT ranged from 0.46-0.94 and 0.55-0.95, respectively. The highest relative reliability was found for Number of Step Errors (ICC = 0.94 and 0.93, for ST and DT, respectively) and Trial Velocity (ICC = 0.83 and 0.86, for ST and DT, respectively). Absolute reliability was high for Number of Step Errors in ST (SEM % = 19.53%) and DT (SEM % = 18.14%) and low for Trial Velocity in ST (SEM % = 6.88%) and DT (SEM % = 7.29%). Significant correlations for Number of Step Errors and Trial Velocity were found with FSST, MSWS-12, and 2MWT. In persons with PwMS performing the NPWT, Number of Step Errors and Trial Velocity were highly reliable parameters. Based on correlations with other measures of gait instability, Number of Step Errors was the most valid parameter of dynamic balance under the conditions of our test.Video Abstract available for more insights from the authors (see Supplemental Digital Content 1, available at: http://links.lww.com/JNPT/A159).
Reproducibility and validity of the Shanghai Women's Health Study physical activity questionnaire.

PubMed

Matthews, Charles E; Shu, Xiao-Ou; Yang, Gong; Jin, Fan; Ainsworth, Barbara E; Liu, Dake; Gao, Yu-Tang; Zheng, Wei

2003-12-01

In this investigation, the authors evaluated the reproducibility and validity of the Shanghai Women's Health Study (SWHS) physical activity questionnaire (PAQ), which was administered in a cohort study of approximately 75,000 Chinese women aged 40-70 years. Reproducibility (2-year test-retest) was evaluated using kappa statistics and intraclass correlation coefficients (ICCs). Validity was evaluated by comparing Spearman correlations (r) for the SWHS PAQ with two criterion measures administered over a period of 12 months: four 7-day physical activity logs and up to 28 7-day PAQs. Women were recruited from the SWHS cohort (n = 200). Results indicated that the reproducibility of adolescent and adult exercise participation (kappa = 0.85 and kappa = 0.64, respectively) and years of adolescent exercise and adult exercise energy expenditure (ICC = 0.83 and ICC = 0.70, respectively) was reasonable. Reproducibility values for adult lifestyle activities were lower (ICC = 0.14-0.54). Significant correlations between the PAQ and criterion measures of adult exercise were observed for the first PAQ administration (physical activity log, r = 0.50; 7-day PAQ, r = 0.62) and the second PAQ administration (physical activity log, r = 0.74; 7-day PAQ, r = 0.80). Significant correlations between PAQ lifestyle activities and the 7-day PAQ were also noted (r = 0.33-0.88). These data indicate that the SWHS PAQ is a reproducible and valid measure of exercise behaviors and that it demonstrates utility in stratifying women by levels of important lifestyle activities (e.g., housework, walking, cycling).
Periorbital Biometric Measurements using ImageJ Software: Standardisation of Technique and Assessment Of Intra- and Interobserver Variability

PubMed Central

Rajyalakshmi, R.; Prakash, Winston D.; Ali, Mohammad Javed; Naik, Milind N.

2017-01-01

Purpose: To assess the reliability and repeatability of periorbital biometric measurements using ImageJ software and to assess if the horizontal visible iris diameter (HVID) serves as a reliable scale for facial measurements. Methods: This study was a prospective, single-blind, comparative study. Two clinicians performed 12 periorbital measurements on 100 standardised face photographs. Each individual’s HVID was determined by Orbscan IIz and used as a scale for measurements using ImageJ software. All measurements were repeated using the ‘average’ HVID of the study population as a measurement scale. Intraclass correlation coefficient (ICC) and Pearson product-moment coefficient were used as statistical tests to analyse the data. Results: The range of ICC for intra- and interobserver variability was 0.79–0.99 and 0.86–0.99, respectively. Test-retest reliability ranged from 0.66–1.0 to 0.77–0.98, respectively. When average HVID of the study population was used as scale, ICC ranged from 0.83 to 0.99, and the test-retest reliability ranged from 0.83 to 0.96 and the measurements correlated well with recordings done with individual Orbscan HVID measurements. Conclusion: Periorbital biometric measurements using ImageJ software are reproducible and repeatable. Average HVID of the population as measured by Orbscan is a reliable scale for facial measurements. PMID:29403183
Comparison of Automated Brain Volume Measures obtained with NeuroQuant and FreeSurfer.

PubMed

Ochs, Alfred L; Ross, David E; Zannoni, Megan D; Abildskov, Tracy J; Bigler, Erin D

2015-01-01

To examine intermethod reliabilities and differences between FreeSurfer and the FDA-cleared congener, NeuroQuant, both fully automated methods for structural brain MRI measurements. MRI scans from 20 normal control subjects, 20 Alzheimer's disease patients, and 20 mild traumatically brain-injured patients were analyzed with NeuroQuant and with FreeSurfer. Intermethod reliability was evaluated. Pairwise correlation coefficients, intraclass correlation coefficients, and effect size differences were computed. NeuroQuant versus FreeSurfer measures showed excellent to good intermethod reliability for the 21 regions evaluated (r: .63 to .99/ICC: .62 to .99/ES: -.33 to 2.08) except for the pallidum (r/ICC/ES = .31/.29/-2.2) and cerebellar white matter (r/ICC/ES = .31/.31/.08). Volumes reported by NeuroQuant were generally larger than those reported by FreeSurfer with the whole brain parenchyma volume reported by NeuroQuant 6.50% larger than the volume reported by FreeSurfer. There was no systematic difference in results between the 3 subgroups. NeuroQuant and FreeSurfer showed good to excellent intermethod reliability in volumetric measurements for all brain regions examined with the only exceptions being the pallidum and cerebellar white matter. This finding was robust for normal individuals, patients with Alzheimer's disease, and patients with mild traumatic brain injury. Copyright © 2015 by the American Society of Neuroimaging.
Education Research: Bias and poor interrater reliability in evaluating the neurology clinical skills examination

PubMed Central

Schuh, L A.; London, Z; Neel, R; Brock, C; Kissela, B M.; Schultz, L; Gelb, D J.

2009-01-01

Objective: The American Board of Psychiatry and Neurology (ABPN) has recently replaced the traditional, centralized oral examination with the locally administered Neurology Clinical Skills Examination (NEX). The ABPN postulated the experience with the NEX would be similar to the Mini-Clinical Evaluation Exercise, a reliable and valid assessment tool. The reliability and validity of the NEX has not been established. Methods: NEX encounters were videotaped at 4 neurology programs. Local faculty and ABPN examiners graded the encounters using 2 different evaluation forms: an ABPN form and one with a contracted rating scale. Some NEX encounters were purposely failed by residents. Cohen’s kappa and intraclass correlation coefficients (ICC) were calculated for local vs ABPN examiners. Results: Ninety-eight videotaped NEX encounters of 32 residents were evaluated by 20 local faculty evaluators and 18 ABPN examiners. The interrater reliability for a determination of pass vs fail for each encounter was poor (kappa 0.32; 95% confidence interval [CI] = 0.11, 0.53). ICC between local faculty and ABPN examiners for each performance rating on the ABPN NEX form was poor to moderate (ICC range 0.14-0.44), and did not improve with the contracted rating form (ICC range 0.09-0.36). ABPN examiners were more likely than local examiners to fail residents. Conclusions: There is poor interrater reliability between local faculty and American Board of Psychiatry and Neurology examiners. A bias was detected for favorable assessment locally, which is concerning for the validity of the examination. Further study is needed to assess whether training can improve interrater reliability and offset bias. GLOSSARY ABIM = American Board of Internal Medicine; ABPN = American Board of Psychiatry and Neurology; CI = confidence interval; HFH = Henry Ford Hospital; ICC = intraclass correlation coefficients; IM = internal medicine; mini-CEX = Mini-Clinical Evaluation Exercise; NEX = Neurology Clinical Skills Examination; RITE = residency inservice training examination; UC = University of Cincinnati; UM = University of Michigan; USF = University of South Florida. PMID:19605769
Validity and reliability of a pilot scale for assessment of multiple system atrophy symptoms.

PubMed

Matsushima, Masaaki; Yabe, Ichiro; Takahashi, Ikuko; Hirotani, Makoto; Kano, Takahiro; Horiuchi, Kazuhiro; Houzen, Hideki; Sasaki, Hidenao

2017-01-01

Multiple system atrophy (MSA) is a rare progressive neurodegenerative disorder for which brief yet sensitive scale is required in order for use in clinical trials and general screening. We previously compared several scales for the assessment of MSA symptoms and devised an eight-item pilot scale with large standardized response mean [handwriting, finger taps, transfers, standing with feet together, turning trunk, turning 360°, gait, body sway]. The aim of the present study is to investigate the validity and reliability of a simple pilot scale for assessment of multiple system atrophy symptoms. Thirty-two patients with MSA (15 male/17 female; 20 cerebellar subtype [MSA-C]/12 parkinsonian subtype [MSA-P]) were prospectively registered between January 1, 2014 and February 28, 2015. Patients were evaluated by two independent raters using the Unified MSA Rating Scale (UMSARS), Scale for Assessment and Rating of Ataxia (SARA), and the pilot scale. Correlations between UMSARS, SARA, pilot scale scores, intraclass correlation coefficients (ICCs), and Cronbach's alpha coefficients were calculated. Pilot scale scores significantly correlated with scores for UMSARS Parts I, II, and IV as well as with SARA scores. Intra-rater and inter-rater ICCs and Cronbach's alpha coefficients remained high (> 0.94) for all measures. The results of the present study indicate the validity and reliability of the eight-item pilot scale, particularly for the assessment of symptoms in patients with early state multiple system atrophy.
Validity and reliability of isometric muscle strength measurements of hip abduction and abduction with external hip rotation in a bent-hip position using a handheld dynamometer with a belt.

PubMed

Aramaki, Hidefumi; Katoh, Munenori; Hiiragi, Yukinobu; Kawasaki, Tsubasa; Kurihara, Tomohisa; Ohmi, Yorikatsu

2016-07-01

[Purpose] This study aimed to investigate the relatedness, reliability, and validity of isometric muscle strength measurements of hip abduction and abduction with an external hip rotation in a bent-hip position using a handheld dynamometer with a belt. [Subjects and Methods] Twenty healthy young adults, with a mean age of 21.5 ± 0.6 years were included. Isometric hip muscle strength in the subjects' right legs was measured under two posture positions using two devices: a handheld dynamometer with a belt and an isokinetic dynamometer. Reliability was evaluated using an intra-class correlation coefficient (ICC); relatedness and validity were evaluated using Pearson's product moment correlation coefficient. Differences in measurements of devices were assessed by two-way ANOVA. [Results] ICC (1, 1) was ≥0.9; significant positive correlations in measurements were found between the two devices under both conditions. No main effect was found between the measurement values. [Conclusion] Our findings revealed that there was relatedness, reliability, and validity of this method for isometric muscle strength measurements using a handheld dynamometer with a belt.
Inter-vender and test-retest reliabilities of resting-state functional magnetic resonance imaging: Implications for multi-center imaging studies.

PubMed

An, Hyeong Su; Moon, Won-Jin; Ryu, Jae-Kyun; Park, Ju Yeon; Yun, Won Sung; Choi, Jin Woo; Jahng, Geon-Ho; Park, Jang-Yeon

2017-12-01

This prospective multi-center study aimed to evaluate the inter-vendor and test-retest reliabilities of resting-state functional magnetic resonance imaging (RS-fMRI) by assessing the temporal signal-to-noise ratio (tSNR) and functional connectivity. Study included 10 healthy subjects and each subject was scanned using three 3T MR scanners (GE Signa HDxt, Siemens Skyra, and Philips Achieva) in two sessions. The tSNR was calculated from the time course data. Inter-vendor and test-retest reliabilities were assessed with intra-class correlation coefficients (ICCs) derived from variant component analysis. Independent component analysis was performed to identify the connectivity of the default-mode network (DMN). In result, the tSNR for the DMN was not significantly different among the GE, Philips, and Siemens scanners (P=0.638). In terms of vendor differences, the inter-vendor reliability was good (ICC=0.774). Regarding the test-retest reliability, the GE scanner showed excellent correlation (ICC=0.961), while the Philips (ICC=0.671) and Siemens (ICC=0.726) scanners showed relatively good correlation. The DMN pattern of the subjects between the two sessions for each scanner and between three scanners showed the identical patterns of functional connectivity. The inter-vendor and test-retest reliabilities of RS-fMRI using different 3T MR scanners are good. Thus, we suggest that RS-fMRI could be used in multicenter imaging studies as a reliable imaging marker. Copyright © 2017 Elsevier Inc. All rights reserved.
Practical utility of thermodilution versus doppler ultrasound to measure hemodialysis blood access flow.

PubMed

Fontseré, Néstor; Mestres, Gaspar; Barrufet, Marta; Burrel, Marta; Vera, Manel; Arias, Marta; Masso, Elisabeth; Cases, Aleix; Maduell, Francisco; Campistol, Josep M

2013-01-01

The current clinical guidelines recommend indirect access blood flow (Qa) measurement as one of the most important components in vascular access maintenance programs. The best-know methods are doppler ultrasound (DU) and saline dilution method. This study evaluates the efficiency of Qa measurement with thermodilution method (TD) in comparison with the DU. Transversal study in 64 patients in hemodialysis (41 men); mean age 59.9 years with 54 AVFs and 10 PTFE. Qa reference value was obtained with DU in brachial artery (AVFs) or at the zone of arterial puncture (AVGs). Bland-Altman and interclass correlation coefficient (ICC) were used to study accuracy. Mean values obtained with DU-Qa were 1426 ± 753 mL/min AVFs and 1186 ± 789 mL/min AVGs. The mean Qa with TD was 1372 ± 770 AVFs (bias 54.6; ICC 0.923) and 1176 ± 758 AVGs (bias 10.2; ICC 0.992). In the subgroup of 28 patients with radiocephalic latero-terminal AVFs the DU-Qa was 1232 ± 767 mL/min. The Qa was in radial artery 942 (ICC 0.805); radial-ulnar artery 1103 (ICC 0.973); cephalic vein 788 (ICC 0.772) and TD 1026 (ICC 0.971). We detected 5 cases of significant stenosis. After endovascular treatment the Kt was 79 liters (61; p=0.043) and TD-Qa 895 mL/min (663; p=0.043). TD represents a good indirect method of Qa measurement. In the subgroup of patients with radiocephalic AVFs, Qa measurements in the radial and ulnar artery are more accurate. Therefore, in this situation the TD method obtained an excellent correlation in comparison to brachial artery.
Reliability and validity of the Turkish version of the Berg Balance Scale.

PubMed

Sahin, Fusun; Yilmaz, Figen; Ozmaden, Asli; Kotevolu, Nurdan; Sahin, Tulay; Kuran, Banu

2008-01-01

The purpose of this study was to develop a Turkish version of the Berg Balance Scale (BBS) and assess its reliability and validity. Sixty healthy volunteers older than 65 years were included in to the study. Subjects who had lower extremity amputation, or were armchair or bedridden were excluded. After translation process, the Turkish version of the scale was administered to each participant twice with an interval of 2 weeks. The intraclass correlation coefficient (ICC) was calculated to assess intra- and inter-observer reliability. Chronbach alpha was calculated to evaluate internal consistency of the total BBS score. Interclass correlation coefficient was calcuated to examine test-retest reliability. Convergent validity was assessed by correlating the scale with Modified Barthel Index (MBI) and Timed Up and Go Test (TUG). Construct validity was assessed with factor analysis. The mean age in years of the participants were 77.00+/-5.67 (range: 67-92 yrs). The ICC for intra- and inter- observer reliability was 0.98 (p<0.0001) and 0.97 (p<0.0001), respectively. Chronbach alpha of the Turkish version of the BBS was 0.98. The test-retest reliability (ICC) of the Turkish version of the BBS was determined as 0.98 for the total score, and ranged from 0.86-0.99 for individual items. In terms of validity, the Turkish version of the BBS was correlated with the MBI (in positive direction) and TUG (in negative direction) (r=0.67 p<0.0001; r=-0.75 p<0.0001, respectively). The Turkish version of the BBS is a reliable and valid scale to be used in balance assessment of Turkish older adults.
Can we have an overall osteoarthritis severity score for the patellofemoral joint using magnetic resonance imaging? Reliability and validity.

PubMed

Kobayashi, Sarah; Peduto, Anthony; Simic, Milena; Fransen, Marlene; Refshauge, Kathryn; Mah, Jean; Pappas, Evangelos

2018-04-01

This work aimed to assess inter-rater reliability and agreement of a magnetic resonance imaging (MRI)-based Kellgren and Lawrence (K&L) grading for patellofemoral joint osteoarthritis (OA) and to validate it against the MRI Osteoarthritis Knee Score (MOAKS). MRI scans from people aged 45 to 75 years with chronic knee pain participating in a randomised clinical trial evaluating dietary supplements were utilised. Fifty participants were randomly selected and scored using the MRI-based K&L grading using axial and sagittal MRI scans. Raters conducted inter-rater reliability, blinded to clinical information, radiology reports and other rater results. Intra- and inter-rater reliability and agreement were evaluated using the intra-class correlation coefficient (ICC) and Cohen's weighted kappa. There was a 2-week interval between the first and second readings for intra-rater reliability. Validity was assessed using the MOAKS and evaluated using Spearman's correlation coefficient. Intra-rater reliability of the K&L system was excellent: ICC 0.91 (95% CI 0.82-0.95); weighted kappa (ĸ = 0.69). Inter-rater reliability was high (ICC 0.88; 95% CI 0.79-0.93), while agreement between raters was moderate (ĸ = 0.49-0.57). Validity analysis demonstrated a strong correlation between the total MOAKS features score and the K&L grading system (ρ = 0.62-0.67) but weak correlations when compared with individual MOAKS features (ρ = 0.19-0.61). The high reliability and good agreement show consistency in grading the severity of patellofemoral OA with the MRI-based K&L score. Our validity results suggest that the scale may be useful, particularly in the clinical environment. Future research should validate this method against clinical findings.
Validity of a physical activity questionnaire in Shanghai.

PubMed

Peters, Tricia M; Shu, Xiao-Ou; Moore, Steven C; Xiang, Yong Bing; Yang, Gong; Ekelund, Ulf; Liu, Da-Ke; Tan, Yu-Ting; Ji, Bu-Tian; Schatzkin, Arthur S; Zheng, Wei; Chow, Wong Ho; Matthews, Charles E; Leitzmann, Michael F

2010-12-01

In large epidemiologic studies, physical activity (PA) is often assessed using PA questionnaires (PAQ). Because available PAQ may not capture the full range of PA in which urban Chinese adults engage, a PAQ was developed for this purpose. We examined the validity of this PAQ and the 1-yr stability of PA in 545 urban Shanghai adults. The PAQ was interview-administered twice, approximately 1 yr apart, and participants also wore an accelerometer and completed a PA-log for seven consecutive days every 3 months during the same year. The intraclass correlation coefficient (ICC) was used to evaluate the stability of PA across questionnaire administrations, and Spearman correlation coefficients (ρ) and mean differences and 95% limits of agreement were used to examine the validity of the questionnaire compared against accelerometry and the PA-log. When measured by accelerometry, estimates of time spent in moderate-to-vigorous PA were lower and estimates of time spent sedentary were higher than when self-reported on the PAQ (P < 0.001). Total PA (ICC = 0.65) and PA domains (ICC = 0.45-0.85) showed moderate to high stability across PAQ administrations. Total PA (ρ = 0.30), moderate-to-vigorous activity (ρ = 0.17), light activity (ρ = 0.36), and sedentary behavior (ρ = 0.16) assessed by PAQ and by accelerometry were significantly and positively correlated, and correlations of the PAQ with the PA-log (ρ = 0.36-0.85) were stronger than those observed with accelerometry. The PAQ significantly overestimated time spent in moderate-to-vigorous activity and underestimated time spent in light activity and sedentary behavior compared with accelerometry, but it performed well at ranking participants according to PA level.

Intra-observer reproducibility and diagnostic performance of breast shear-wave elastography in Asian women.

PubMed

Park, Hye Young; Han, Kyung Hwa; Yoon, Jung Hyun; Moon, Hee Jung; Kim, Min Jung; Kim, Eun-Kyung

2014-06-01

Our aim was to evaluate intra-observer reproducibility of shear-wave elastography (SWE) in Asian women. Sixty-four breast masses (24 malignant, 40 benign) were examined with SWE in 53 consecutive Asian women (mean age, 44.9 y old). Two SWE images were obtained for each of the lesions. The intra-observer reproducibility was assessed by intra-class correlation coefficients (ICC). We also evaluated various clinicoradiologic factors that can influence reproducibility in SWE. The ICC of intra-observer reproducibility was 0.789. In clinicoradiologic factor evaluation, masses surrounded by mixed fatty and glandular tissue (ICC: 0.619) showed lower intra-observer reproducibility compared with lesions that were surrounded by glandular tissue alone (ICC: 0.937; p < 0.05). Overall, the intra-observer reproducibility of breast SWE was excellent in Asian women. However, it may decrease when breast tissue is in a heterogeneous background. Therefore, SWE should be performed carefully in these cases. Copyright © 2014 World Federation for Ultrasound in Medicine & Biology. Published by Elsevier Inc. All rights reserved.
Clinical Usefulness of the Pendulum Test Using a NK Table to Measure the Spasticity of Patients with Brain Lesions

PubMed Central

Kim, Yong-Wook

2013-01-01

. [Purpose] The purpose of the present study was to investigate the clinical usefulness (reliability and validity) of the pendulum test using a Noland-Kuckhoff (NK) table with an attached electrogoniometer to measure the spasticity of patients with brain lesions. [Subjects] The subjects were 31 patients with stroke or traumatic brain injury. [Methods] The intraclass correlation coefficient (ICC) was used to verify the test–retest reliability of spasticity measures obtained using the pendulum test. Pearson's product correlation coefficient was used to examine the validity of the pendulum test using the amplitude of the patellar tendon reflex (PTR) test, an objective and quantitative measure of spasticity. [Results] The test–retest reliability was high, reflecting a significant correlation between the test and the retest (ICCs = 0.95–0.97). A significant negative correlation was found between the amplitude of the PTR test and the four variables measured in the pendulum test (r = −0.77– −0.85). [Conclusion] The pendulum test using a NK table is an objective measure of spasticity and can be used in the clinical setting in place of more expensive and complicated equipment. Further studies are needed to investigate the therapeutic effect of this method on spasticity. PMID:24259775
Clinical usefulness of the pendulum test using a NK table to measure the spasticity of patients with brain lesions.

PubMed

Kim, Yong-Wook

2013-10-01

. [Purpose] The purpose of the present study was to investigate the clinical usefulness (reliability and validity) of the pendulum test using a Noland-Kuckhoff (NK) table with an attached electrogoniometer to measure the spasticity of patients with brain lesions. [Subjects] The subjects were 31 patients with stroke or traumatic brain injury. [Methods] The intraclass correlation coefficient (ICC) was used to verify the test-retest reliability of spasticity measures obtained using the pendulum test. Pearson's product correlation coefficient was used to examine the validity of the pendulum test using the amplitude of the patellar tendon reflex (PTR) test, an objective and quantitative measure of spasticity. [Results] The test-retest reliability was high, reflecting a significant correlation between the test and the retest (ICCs = 0.95-0.97). A significant negative correlation was found between the amplitude of the PTR test and the four variables measured in the pendulum test (r = -0.77- -0.85). [Conclusion] The pendulum test using a NK table is an objective measure of spasticity and can be used in the clinical setting in place of more expensive and complicated equipment. Further studies are needed to investigate the therapeutic effect of this method on spasticity.
Comparison of Collection Methods for Fecal Samples for Discovery Metabolomics in Epidemiologic Studies.

PubMed

Loftfield, Erikka; Vogtmann, Emily; Sampson, Joshua N; Moore, Steven C; Nelson, Heidi; Knight, Rob; Chia, Nicholas; Sinha, Rashmi

2016-11-01

The gut metabolome may be associated with the incidence and progression of numerous diseases. The composition of the gut metabolome can be captured by measuring metabolite levels in the feces. However, there are little data describing the effect of fecal sample collection methods on metabolomic measures. We collected fecal samples from 18 volunteers using four methods: no solution, 95% ethanol, fecal occult blood test (FOBT) cards, and fecal immunochemical test (FIT). One set of samples was frozen after collection (day 0), and for 95% ethanol, FOBT, and FIT, a second set was frozen after 96 hours at room temperature. We evaluated (i) technical reproducibility within sample replicates, (ii) stability after 96 hours at room temperature for 95% ethanol, FOBT, and FIT, and (iii) concordance of metabolite measures with the putative "gold standard," day 0 samples without solution. Intraclass correlation coefficients (ICC) estimating technical reproducibility were high for replicate samples for each collection method. ICCs estimating stability at room temperature were high for 95% ethanol and FOBT (median ICC > 0.87) but not FIT (median ICC = 0.52). Similarly, Spearman correlation coefficients (r s ) estimating metabolite concordance with the "gold standard" were higher for 95% ethanol (median r s = 0.82) and FOBT (median r s = 0.70) than for FIT (median r s = 0.40). Metabolomic measurements appear reproducible and stable in fecal samples collected with 95% ethanol or FOBT. Concordance with the "gold standard" is highest with 95% ethanol and acceptable with FOBT. Future epidemiologic studies should collect feces using 95% ethanol or FOBT if interested in studying fecal metabolomics. Cancer Epidemiol Biomarkers Prev; 25(11); 1483-90. ©2016 AACR. ©2016 American Association for Cancer Research.
Validity and reliability of an iPhone App to assess time, velocity and leg power during a sit-to-stand functional performance test.

PubMed

Ruiz-Cárdenas, Juan Diego; Rodríguez-Juan, Juan José; Smart, Rowan R; Jakobi, Jennifer M; Jones, Gareth R

2018-01-01

The purposes of this study were: (i) Analyze the concurrent validity and reliability of an iPhone App for measuring time, velocity and power during a single sit-to-stand (STS) test compared with measurements recorded from a force plate; and (ii) Evaluate the relationship between the iPhone App measures with age and functional performance. Forty-eight healthy individuals (age range: 26-81 years) were recruited. All participants completed a STS test on a force plate with the movement recorded on an iPhone 6 at 240 frames-per-second. Functional ability was also measured using isometric handgrip strength and self-paced walking time tests. Intraclass correlation coefficients (ICC), Pearson's correlation coefficient, Cronbach's alpha (α) and Bland-Altman plots with 95% confidence intervals (CI) were used to test validity and reliability between instruments. The results showed a good agreement between all STS measurement variables; time (ICC=0.864, 95%CI=0.77-0.92; α=0.926), velocity (ICC=0.912, 95%CI=0.85-0.95; α=0.953) and power (ICC=0.846, 95%CI=0.74-0.91; α=0.917) with no systematic bias between instruments for any variable analyzed. STS time, velocity and power derived from the iPhone App show moderate to strong associations with age (|r|=0.63-0.83) and handgrip strength (|r|=0.4-0.64) but not the walking test. The results of this study identify that this iPhone App is reliable for measuring STS and the derived values of time, velocity and power shows strong associations with age and handgrip strength. Copyright © 2017 Elsevier B.V. All rights reserved.
Reliability and validity analysis of the transfer assessment instrument.

PubMed

McClure, Laura A; Boninger, Michael L; Ozawa, Haishin; Koontz, Alicia

2011-03-01

To describe the development and evaluate the reliability and validity of a newly created outcome measure, the Transfer Assessment Instrument (TAI), to assess the quality of transfers performed by full-time wheelchair users. Repeated measures. 2009 National Veterans Wheelchair Games in Spokane, WA. A convenience sample of full-time wheelchair users (N=40) who perform sitting pivot or standing pivot transfers. Not applicable. Intraclass correlation coefficients (ICCs) for reliability and Spearman correlation coefficients for concurrent validity between the TAI and a global assessment scale (0-100 visual analog scale [VAS]). No adverse events occurred during testing. Intrarater ICCs for 3 raters ranged between .35 and .89, and the interrater ICC was .642. Correlations between the TAI and a global assessment VAS ranged between .19 (P=.285) and .69 (P>.000). Item analyses of the tool found a wide range of results, from weak to good reliability. Evaluators found the TAI to be safe and able to be completed in a short time. The TAI is a safe, quick outcome measure that uses equipment typically found in a clinical setting and does not ask participants to perform new skills. Reliability and validity testing found the TAI to have acceptable interrater and a wide range of intrarater reliability. Future work indicates the need for continued refinement including removal or modification of items found to have low reliability, improved education for clinicians, and further reliability and validity analysis with a more diverse subject population. The TAI has the potential to fill a void in assessment of transfers. Copyright © 2011 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Alcohol Drinking Onset: A Reliability Study

ERIC Educational Resources Information Center

Prause, JoAnn; Dooley, David; Ham-Rowbottom, Kathleen A.; Emptage, Nicholas

2007-01-01

Early alcohol drinking onset (ADO) is associated with adult alcohol misuse, but the accuracy of ADO is unclear. Reliability of self-reported ADO was studied in two panels of the National Longitudinal Survey of Youth. For the Adult sample (n = 6,215), the intraclass correlation coefficient (ICC) was 0.36. Older respondents had higher reliabilities…
[Transcultural adaptation of the Antifat Attitudes Test to Brazilian Portuguese].

PubMed

Obara, Angélica Almeida; Alvarenga, Marle Dos Santos

2018-05-01

Obese individuals are often blamed for their own condition and the targets of discrimination and prejudice. The scope of this study is to describe the cross-cultural adaptation to Brazilian Portuguese and the validation of the Antifat Attitudes Test - specifically developed for evaluation of negative attitudes toward the obese individual. The scale has 34 statements distributed in three subscales - Social/Character Disparagement (15 items), Physical/Romantic Unattractiveness (10 items) and Weight Control/Blame (9 items). The method involved the translation of the scale; evaluation of the conceptual, operational and item equivalence; evaluation of the semantic equivalence using the paired t test, the Pearson correlation coefficient and the intraclass correlation coefficient (ICC); internal consistency evaluation (Cronbach's alpha) and test-retest reliability (ICC) and Confirmatory Factor Analysis - after application in 340 college students in the area of health. The results showed good global internal consistency and reliability (α 0.85; CCI 0.83), and factor analysis showed that the original subscales can be kept in the adaptation, and therefore the scale adapted to the Brazilian-Portuguese version is valid and useful in studies to explore negative attitudes toward obese individuals.
A portable x-ray fluorescence instrument for analyzing dust wipe samples for lead: evaluation with field samples.

PubMed

Sterling, D A; Lewis, R D; Luke, D A; Shadel, B N

2000-06-01

Dust wipe samples collected in the field were tested by nondestructive X-ray fluorescence (XRF) followed by laboratory analysis with flame atomic absorption spectrophotometry (FAAS). Data were analyzed for precision and accuracy of measurement. Replicate samples with the XRF show high precision with an intraclass correlation coefficient (ICC) of 0.97 (P<0.0001) and an overall coefficient of variation of 11.6%. Paired comparison indicates no statistical difference (P=0.272) between XRF and FAAS analysis. Paired samples are highly correlated with an R(2) ranging between 0.89 for samples that contain paint chips and 0.93 for samples that do not contain paint chips. The ICC for absolute agreement between XRF and laboratory results was 0.95 (P<0.0001). The relative error over the concentration range of 25 to 14,200 microgram Pb is -12% (95% CI, -18 to -5). The XRF appears to be an excellent method for rapid on-site evaluation of dust wipes for clearance and risk assessment purposes, although there are indications of some confounding when paint chips are present. Copyright 2000 Academic Press.
Reliability of heart rate measures during walking before and after running maximal efforts.

PubMed

Boullosa, D A; Barros, E S; del Rosso, S; Nakamura, F Y; Leicht, A S

2014-11-01

Previous studies on HR recovery (HRR) measures have utilized the supine and the seated postures. However, the most common recovery mode in sport and clinical settings after running exercise is active walking. The aim of the current study was to examine the reliability of HR measures during walking (4 km · h(-1)) before and following a maximal test. Twelve endurance athletes performed an incremental running test on 2 days separated by 48 h. Absolute (coefficient of variation, CV, %) and relative [Intraclass correlation coefficient, (ICC)] reliability of time domain and non-linear measures of HR variability (HRV) from 3 min recordings, and HRR parameters over 5 min were assessed. Moderate to very high reliability was identified for most HRV indices with short-term components of time domain and non-linear HRV measures demonstrating the greatest reliability before (CV: 12-22%; ICC: 0.73-0.92) and after exercise (CV: 14-32%; ICC: 0.78-0.91). Most HRR indices and parameters of HRR kinetics demonstrated high to very high reliability with HR values at a given point and the asymptotic value of HR being the most reliable (CV: 2.5-10.6%; ICC: 0.81-0.97). These findings demonstrate these measures as reliable tools for the assessment of autonomic control of HR during walking before and after maximal efforts. © Georg Thieme Verlag KG Stuttgart · New York.
Central Corneal Thickness Reproducibility among Ten Different Instruments.

PubMed

Pierro, Luisa; Iuliano, Lorenzo; Gagliardi, Marco; Ambrosi, Alessandro; Rama, Paolo; Bandello, Francesco

2016-11-01

To assess agreement between one ultrasonic (US) and nine optical instruments for the measurement of central corneal thickness (CCT), and to evaluate intra- and inter-operator reproducibility. In this observational cross-sectional study, two masked operators measured CCT thickness twice in 28 healthy eyes. We used seven spectral-domain optical coherence tomography (SD-OCT) devices, one time-domain OCT, one Scheimpflug camera, and one US-based instrument. Inter- and intra-operator reproducibility was evaluated by intraclass correlation coefficient (ICC), coefficient of variation (CV), and Bland-Altman test analysis. Instrument-to-instrument reproducibility was determined by ANOVA for repeated measurements. We also tested how the devices disagreed regarding systemic bias and random error using a structural equation model. Mean CCT of all instruments ranged from 536 ± 42 μm to 577 ± 40 μm. An instrument-to-instrument correlation test showed high values among the 10 investigated devices (correlation coefficient range 0.852-0.995; p values <0.0001 in all cases). The highest correlation coefficient values were registered between 3D OCT-2000 Topcon-Spectral OCT/SLO Opko (0.995) and Cirrus HD-OCT Zeiss-RS-3000 Nidek (0.995), whereas the lowest were seen between SS-1000 CASIA and Spectral OCT/SLO Opko (0.852). ICC and CV showed excellent inter- and intra-operator reproducibility for all optic-based devices, except for the US-based device. Bland-Altman analysis demonstrated low mean biases between operators. Despite highlighting good intra- and inter-operator reproducibility, we found that a scale bias between instruments might interfere with thorough CCT monitoring. We suggest that optimal monitoring is achieved with the same operator and the same device.
Cross-Cultural Adaptation, Validation, and Reliability Testing of the Modified Oswestry Disability Questionnaire in Persian Population with Low Back Pain.

PubMed

Baradaran, Aslan; Ebrahimzadeh, Mohammad H; Birjandinejad, Ali; Kachooei, Amir Reza

2016-04-01

Prospective study. We aimed to validate the Persian version of the modified Oswestry disability questionnaire (MODQ) in patients with low back pain. Modified Oswestry low back pain disability questionnaire is a well-known condition-specific outcome measure that helps quantify disability in patients with lumbar syndromes. To test the validity in a pilot study, the Persian MODQ was administered to 25 individuals with low back pain. We then enrolled 200 consecutive patients with low back pain to fill the Persian MODQ as well as the short form 36 (SF-36) questionnaire. Convergent validity of the MODQ was tested using the Spearman's correlation coefficient between the MODQ and SF-36 subscales. Intraclass correlation coefficient (ICC) and Cronbach's α coefficient were measured to test the reliability between test and retest and internal consistency of all items, respectively. ICC for individual items ranged from 0.43 to 0.80 showing good reliability and reproducibility of each individual item. Cronbach's α coefficient was 0.69 showing good internal consistency across all 10 items of the Persian MODQ. Total MODQ score showed moderate to strong correlation with the eight subscales and the two domains of the SF-36. The highest correlation was between the MODQ and the physical functioning subscale of the SF-36 (r=-0.54, p<0.001) and the physical component domain of the SF-36 (r=-0.55, p<0.001) showing that MODQ is measuring what it is supposed to measure in terms of disability and physical function. Persian version of the MODQ is a valid and reliable tool for the assessment of the disability following low back pain.
Simplified Radiographic Damage Index for Affected Joints in Chronic Gouty Arthritis

PubMed Central

2016-01-01

The aim of this study was to develop and validate a new radiographic damage scoring method (DAmagE index of GoUt; DAEGU) in chronic gout using plain radiography. Two independent observers scored foot x-rays from 15 patients with chronic gout according to the DAEGU method and the modified Sharp/van der Heijde (SvdH) method. The 10 metatarsophalangeal (MTP) and 2 interphalangeal (IP) joints of the first toes of both feet were scored to assess the degrees of erosion and joint space narrowing (JSN). The intraobserver and interobserver reliabilities were analyzed by calculating the intraclass correlation coefficient (ICC) and minimal detectable change (MDC). The correlation between the DAEGU and SvdH methods was analyzed by calculating the Spearman's rho correlation coefficients and Kappa coefficients. The DAEGU method was found to be highly reproducible (0.945–0.987 for the intraobserver and 0.993–0.996 for the interobserver ICC values). The erosion, JSN, and total scores exhibited strong positive correlations between the DAEGU and SvdH methods and also within each method (r = 0.860–0.969, P < 0.001 for all parameters). The DAEGU and SvdH methods were in very good agreement as determined by Kappa coefficient analysis [0.732 (0.387–1.000) for erosion and 1.000 (1.000–1.000) for JSN]. In conclusion, this study revealed that DAEGU method was a reliable and feasible tool in the assessment of radiographic damage in chronic gout. The DAEGU method may provide a more easy assessment of structural damage in chronic gout in the real clinical practice. PMID:26955246
The Test–Retest Reliability of the Photopic Negative Response (PhNR)

PubMed Central

Tang, Jessica; Edwards, Thomas; Crowston, Jonathan G.; Sarossy, Marc

2014-01-01

Purpose The photopic negative response (PhNR) may be useful as a tool to monitor longitudinal change in retinal ganglion cell (RGC) function. The goal was to assess PhNR test–retest reliability, and to estimate the amount of change between tests that is likely to be statistically significant for an individual test subject. Methods Photopic electroretinograms (ERGs) were recorded from 49 visually normal subjects (mean age, 38.9 years; range, 21–72 years). Signals were acquired using Dawson-Trick-Litzkow (DTL) electrodes in response to red stimulus at four flash energies (0.5, 1, 2.25, 3 cd·s/m2) on a blue background (10 cd/m2). The PhNR amplitude was recorded from prestimulus baseline to trough (BT), prestimulus baseline to fixed time point (BF), and b-wave peak to trough (PT). The ratio of baseline PhNR to b-wave amplitude (BT/b-wave) was calculated. Reliability was assessed using the intraclass correlation coefficient (ICC2,1) and coefficient of repeatability (CoR). Results Flash energy of 1.00 cd·s/m2 produced reliable, well-defined traces. At this stimulus, the a- and b-wave amplitudes were reproduced with moderate reliability (ICC, 0.62; CoR%, 90.0%; and ICC, 0.74; CoR%, 54.3%; respectively). For PhNR, the order from most to least reliable measurement was: PT (ICC, 0.64; CoR%, 59.1%), BT (ICC, 0.40; CoR%, 148.3%), and BF (ICC, 0.22; CoR%, 166.1%). The BT/b-wave did not improve reliability (ICC, 0.37; CoR%, 181.5). Conclusion The b-wave peak-to-PhNR trough amplitude produced the most reliable measurement. Translational Relevance A relatively large magnitude of change in PhNR amplitude is required to make clinical inferences about changes in RGC function. Refinement to the technique of acquisition and/or processing of the PhNR is recommended to improve reliability. PMID:25374770
[Cross-cultural adaptation and validation of the Health and Taste Attitude Scale (HTAS) in Portuguese].

PubMed

Koritar, Priscila; Philippi, Sonia Tucunduva; Alvarenga, Marle dos Santos; Santos, Bernardo dos

2014-08-01

The scope of this study was to show the cross-cultural adaptation and validation of the Health and Taste Attitude Scale in Portuguese. The methodology included translation of the scale; evaluation of conceptual, operational and item-based equivalence by 14 experts and 51 female undergraduates; semantic equivalence and measurement assessment by 12 bilingual women by the paired t-test, the Pearson correlation coefficient and the coefficient intraclass correlation; internal consistency and test-retest reliability by Cronbach's alpha and intraclass correlation coefficient, respectively, after application on 216 female undergraduates; assessment of discriminant and concurrent validity via the t-test and Spearman's correlation coefficient, respectively, in addition to Confirmatory Factor and Exploratory Factor Analysis. The scale was considered adequate and easily understood by the experts and university students and presented good internal consistency and reliability (µ 0.86, ICC 0.84). The results show that the scale is valid and can be used in studies with women to better understand attitudes related to taste.
Measuring the food and built environments in urban centres: reliability and validity of the EURO-PREVOB Community Questionnaire.

PubMed

Pomerleau, J; Knai, C; Foster, C; Rutter, H; Darmon, N; Derflerova Brazdova, Z; Hadziomeragic, A F; Pekcan, G; Pudule, I; Robertson, A; Brunner, E; Suhrcke, M; Gabrijelcic Blenkus, M; Lhotska, L; Maiani, G; Mistura, L; Lobstein, T; Martin, B W; Elinder, L S; Logstrup, S; Racioppi, F; McKee, M

2013-03-01

The authors designed an instrument to measure objectively aspects of the built and food environments in urban areas, the EURO-PREVOB Community Questionnaire, within the EU-funded project 'Tackling the social and economic determinants of nutrition and physical activity for the prevention of obesity across Europe' (EURO-PREVOB). This paper describes its development, reliability, validity, feasibility and relevance to public health and obesity research. The Community Questionnaire is designed to measure key aspects of the food and built environments in urban areas of varying levels of affluence or deprivation, within different countries. The questionnaire assesses (1) the food environment and (2) the built environment. Pilot tests of the EURO-PREVOB Community Questionnaire were conducted in five to 10 purposively sampled urban areas of different socio-economic status in each of Ankara, Brno, Marseille, Riga, and Sarajevo. Inter-rater reliability was compared between two pairs of fieldworkers in each city centre using three methods: inter-observer agreement (IOA), kappa statistics, and intraclass correlation coefficients (ICCs). Data were collected successfully in all five cities. Overall reliability of the EURO-PREVOB Community Questionnaire was excellent (inter-observer agreement (IOA) > 0.87; intraclass correlation coefficients (ICC)s > 0.91 and kappa statistics > 0.7. However, assessment of certain aspects of the quality of the built environment yielded slightly lower IOA coefficients than the quantitative aspects. The EURO-PREVOB Community Questionnaire was found to be a reliable and practical observational tool for measuring differences in community-level data on environmental factors that can impact on dietary intake and physical activity. The next step is to evaluate its predictive power by collecting behavioural and anthropometric data relevant to obesity and its determinants. Copyright © 2013 The Royal Society for Public Health. Published by Elsevier Ltd. All rights reserved.
A comparative study of software programmes for cross-sectional skeletal muscle and adipose tissue measurements on abdominal computed tomography scans of rectal cancer patients.

PubMed

van Vugt, Jeroen L A; Levolger, Stef; Gharbharan, Arvind; Koek, Marcel; Niessen, Wiro J; Burger, Jacobus W A; Willemsen, Sten P; de Bruin, Ron W F; IJzermans, Jan N M

2017-04-01

The association between body composition (e.g. sarcopenia or visceral obesity) and treatment outcomes, such as survival, using single-slice computed tomography (CT)-based measurements has recently been studied in various patient groups. These studies have been conducted with different software programmes, each with their specific characteristics, of which the inter-observer, intra-observer, and inter-software correlation are unknown. Therefore, a comparative study was performed. Fifty abdominal CT scans were randomly selected from 50 different patients and independently assessed by two observers. Cross-sectional muscle area (CSMA, i.e. rectus abdominis, oblique and transverse abdominal muscles, paraspinal muscles, and the psoas muscle), visceral adipose tissue area (VAT), and subcutaneous adipose tissue area (SAT) were segmented by using standard Hounsfield unit ranges and computed for regions of interest. The inter-software, intra-observer, and inter-observer agreement for CSMA, VAT, and SAT measurements using FatSeg, OsiriX, ImageJ, and sliceOmatic were calculated using intra-class correlation coefficients (ICCs) and Bland-Altman analyses. Cohen's κ was calculated for the agreement of sarcopenia and visceral obesity assessment. The Jaccard similarity coefficient was used to compare the similarity and diversity of measurements. Bland-Altman analyses and ICC indicated that the CSMA, VAT, and SAT measurements between the different software programmes were highly comparable (ICC 0.979-1.000, P < 0.001). All programmes adequately distinguished between the presence or absence of sarcopenia (κ = 0.88-0.96 for one observer and all κ = 1.00 for all comparisons of the other observer) and visceral obesity (all κ = 1.00). Furthermore, excellent intra-observer (ICC 0.999-1.000, P < 0.001) and inter-observer (ICC 0.998-0.999, P < 0.001) agreement for all software programmes were found. Accordingly, excellent Jaccard similarity coefficients were found for all comparisons (mean ≥ 0.964). FatSeg, OsiriX, ImageJ, and sliceOmatic showed an excellent agreement for CSMA, VAT, and SAT measurements on abdominal CT scans. Furthermore, excellent inter-observer and intra-observer agreement were achieved. Therefore, results of studies using these different software programmes can reliably be compared. © 2016 The Authors. Journal of Cachexia, Sarcopenia and Muscle published by John Wiley & Sons Ltd on behalf of the Society on Sarcopenia, Cachexia and Wasting Disorders.
A comparative study of software programmes for cross‐sectional skeletal muscle and adipose tissue measurements on abdominal computed tomography scans of rectal cancer patients

PubMed Central

Levolger, Stef; Gharbharan, Arvind; Koek, Marcel; Niessen, Wiro J.; Burger, Jacobus W.A.; Willemsen, Sten P.; de Bruin, Ron W.F.

2016-01-01

Abstract Background The association between body composition (e.g. sarcopenia or visceral obesity) and treatment outcomes, such as survival, using single‐slice computed tomography (CT)‐based measurements has recently been studied in various patient groups. These studies have been conducted with different software programmes, each with their specific characteristics, of which the inter‐observer, intra‐observer, and inter‐software correlation are unknown. Therefore, a comparative study was performed. Methods Fifty abdominal CT scans were randomly selected from 50 different patients and independently assessed by two observers. Cross‐sectional muscle area (CSMA, i.e. rectus abdominis, oblique and transverse abdominal muscles, paraspinal muscles, and the psoas muscle), visceral adipose tissue area (VAT), and subcutaneous adipose tissue area (SAT) were segmented by using standard Hounsfield unit ranges and computed for regions of interest. The inter‐software, intra‐observer, and inter‐observer agreement for CSMA, VAT, and SAT measurements using FatSeg, OsiriX, ImageJ, and sliceOmatic were calculated using intra‐class correlation coefficients (ICCs) and Bland–Altman analyses. Cohen's κ was calculated for the agreement of sarcopenia and visceral obesity assessment. The Jaccard similarity coefficient was used to compare the similarity and diversity of measurements. Results Bland–Altman analyses and ICC indicated that the CSMA, VAT, and SAT measurements between the different software programmes were highly comparable (ICC 0.979–1.000, P < 0.001). All programmes adequately distinguished between the presence or absence of sarcopenia (κ = 0.88–0.96 for one observer and all κ = 1.00 for all comparisons of the other observer) and visceral obesity (all κ = 1.00). Furthermore, excellent intra‐observer (ICC 0.999–1.000, P < 0.001) and inter‐observer (ICC 0.998–0.999, P < 0.001) agreement for all software programmes were found. Accordingly, excellent Jaccard similarity coefficients were found for all comparisons (mean ≥ 0.964). Conclusions FatSeg, OsiriX, ImageJ, and sliceOmatic showed an excellent agreement for CSMA, VAT, and SAT measurements on abdominal CT scans. Furthermore, excellent inter‐observer and intra‐observer agreement were achieved. Therefore, results of studies using these different software programmes can reliably be compared. PMID:27897414
Normalization of ADC does not improve correlation with overall survival in patients with high-grade glioma (HGG).

PubMed

Qin, Lei; Li, Angie; Qu, Jinrong; Reinshagen, Katherine; Li, Xiang; Cheng, Su-Chun; Bryant, Annie; Young, Geoffrey S

2018-04-01

Mixed reports leave uncertainty about whether normalization of apparent diffusion coefficient (ADC) to a within-subject white matter reference is necessary for assessment of tumor cellularity. We tested whether normalization improves the previously reported correlation of resection margin ADC with 15-month overall survival (OS) in HGG patients. Spin-echo echo-planar DWI was retrieved from 3 T MRI acquired between maximal resection and radiation in 37 adults with new-onset HGG (25 glioblastoma; 12 anaplastic astrocytoma). ADC maps were produced with the FSL DTIFIT tool (Oxford Centre for Functional MRI). 3 neuroradiologists manually selected regions of interest (ROI) in normal appearing white matter (NAWM) and in non-enhancing tumor (NT) < 2 cm from the margin of residual enhancing tumor or resection cavity. Normalized ADC (nADC) was computed as the ratio of absolute NT ADC to NAWM ADC. Reproducibility of nADC and absolute ADC among the readers' ROI was assessed using intra-class correlation coefficient (ICC) and within-subject coefficient of variation (wCV). Correlations of ADC and nADC with OS were compared using receiver operating characteristics (ROC) analysis. A p value 0.05 was considered statistically significant. Both mean ADC and nADC differed significantly between patients subgrouped by 15-month OS (p = 0.0014 and 0.0073 respectively). wCV and ICC among the readers were similar for absolute and normalized ADC. In ROC analysis of correlation with OS, nADC did not perform significantly better than absolute ADC. Normalization does not significantly improve the correlation of absolute ADC with OS in HGG, suggesting that normalization is not necessary for clinical or research ADC analysis in HGG patients.
Intra- and inter-rater reliability of digital image analysis for skin color measurement

PubMed Central

Sommers, Marilyn; Beacham, Barbara; Baker, Rachel; Fargo, Jamison

2013-01-01

Background We determined the intra- and inter-rater reliability of data from digital image color analysis between an expert and novice analyst. Methods Following training, the expert and novice independently analyzed 210 randomly ordered images. Both analysts used Adobe® Photoshop lasso or color sampler tools based on the type of image file. After color correction with Pictocolor® in camera software, they recorded L*a*b* (L*=light/dark; a*=red/green; b*=yellow/blue) color values for all skin sites. We computed intra-rater and inter-rater agreement within anatomical region, color value (L*, a*, b*), and technique (lasso, color sampler) using a series of one-way intra-class correlation coefficients (ICCs). Results Results of ICCs for intra-rater agreement showed high levels of internal consistency reliability within each rater for the lasso technique (ICC ≥ 0.99) and somewhat lower, yet acceptable, level of agreement for the color sampler technique (ICC = 0.91 for expert, ICC = 0.81 for novice). Skin L*, skin b*, and labia L* values reached the highest level of agreement (ICC ≥ 0.92) and skin a*, labia b*, and vaginal wall b* were the lowest (ICC ≥ 0.64). Conclusion Data from novice analysts can achieve high levels of agreement with data from expert analysts with training and the use of a detailed, standard protocol. PMID:23551208

Intra- and inter-rater reliability of digital image analysis for skin color measurement.

PubMed

Sommers, Marilyn; Beacham, Barbara; Baker, Rachel; Fargo, Jamison

2013-11-01

We determined the intra- and inter-rater reliability of data from digital image color analysis between an expert and novice analyst. Following training, the expert and novice independently analyzed 210 randomly ordered images. Both analysts used Adobe(®) Photoshop lasso or color sampler tools based on the type of image file. After color correction with Pictocolor(®) in camera software, they recorded L*a*b* (L*=light/dark; a*=red/green; b*=yellow/blue) color values for all skin sites. We computed intra-rater and inter-rater agreement within anatomical region, color value (L*, a*, b*), and technique (lasso, color sampler) using a series of one-way intra-class correlation coefficients (ICCs). Results of ICCs for intra-rater agreement showed high levels of internal consistency reliability within each rater for the lasso technique (ICC ≥ 0.99) and somewhat lower, yet acceptable, level of agreement for the color sampler technique (ICC = 0.91 for expert, ICC = 0.81 for novice). Skin L*, skin b*, and labia L* values reached the highest level of agreement (ICC ≥ 0.92) and skin a*, labia b*, and vaginal wall b* were the lowest (ICC ≥ 0.64). Data from novice analysts can achieve high levels of agreement with data from expert analysts with training and the use of a detailed, standard protocol. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Reliability of tristimulus colourimetry in the assessment of cutaneous bruise colour.

PubMed

Scafide, Katherine N; Sheridan, Daniel J; Taylor, Laura A; Hayat, Matthew J

2016-06-01

Bruising is one of the most common types of injury clinicians observe among victims of violence and other trauma patients. However, research has shown commonly used qualitative description of cutaneous bruise colour via the naked eye is subjective and unreliable. No published work has formally evaluated the reliability of tristimulus colourimetry as an alternative for assessing bruise colour, despite its clinical and research applications in accurately assessing skin colour. The purpose of this study was to systematically evaluate the test-retest and inter-observer reliability of tristimulus colourimetry in the assessment of cutaneous bruise colour. Two researchers obtained repeated tristimulus colourimetry measures of cutaneous bruises with participants of diverse skin colour. Measures were obtained using the Minolta CR-400 Chomameter. Commission Internationale d'Eclairage (CIE) L*a*b* colour space was used. Data was analysed using intraclass correlation coefficients (ICC), Cronbach's alpha, and minimal detectable change (MDC) on all three L*a*b* values. The colorimeter demonstrated excellent test-retest or intra-rater reliability (L* ICC=0.999; a* ICC=0.973; b* ICC=0.892) and inter-rater reliability (L* ICC=0.997; a* ICC=0.976; b* ICC=0.982). With consistent placement, the tristimulus colourimetry is reliable for the objective assessment and documentation of cutaneous bruise colour for purposes of clinical practice and research. Recommendations for use in practice/research are provided. Copyright © 2016 Elsevier Ltd. All rights reserved.
Manual muscle testing and hand-held dynamometry in people with inflammatory myopathy: An intra- and interrater reliability and validity study

PubMed Central

Baschung Pfister, Pierrette; Sterkele, Iris; Maurer, Britta; de Bie, Rob A.; Knols, Ruud H.

2018-01-01

Manual muscle testing (MMT) and hand-held dynamometry (HHD) are commonly used in people with inflammatory myopathy (IM), but their clinimetric properties have not yet been sufficiently studied. To evaluate the reliability and validity of MMT and HHD, maximum isometric strength was measured in eight muscle groups across three measurement events. To evaluate reliability of HHD, intra-class correlation coefficients (ICC), the standard error of measurements (SEM) and smallest detectable changes (SDC) were calculated. To measure reliability of MMT linear Cohen`s Kappa was computed for single muscle groups and ICC for total score. Additionally, correlations between MMT8 and HHD were evaluated with Spearman Correlation Coefficients. Fifty people with myositis (56±14 years, 76% female) were included in the study. Intra-and interrater reliability of HHD yielded excellent ICCs (0.75–0.97) for all muscle groups, except for interrater reliability of ankle extension (0.61). The corresponding SEMs% ranged from 8 to 28% and the SDCs% from 23 to 65%. MMT8 total score revealed excellent intra-and interrater reliability (ICC>0.9). Intrarater reliability of single muscle groups was substantial for shoulder and hip abduction, elbow and neck flexion, and hip extension (0.64–0.69); moderate for wrist (0.53) and knee extension (0.49) and fair for ankle extension (0.35). Interrater reliability was moderate for neck flexion (0.54) and hip abduction (0.44); fair for shoulder abduction, elbow flexion, wrist and ankle extension (0.20–0.33); and slight for knee extension (0.08). Correlations between the two tests were low for wrist, knee, ankle, and hip extension; moderate for elbow flexion, neck flexion and hip abduction; and good for shoulder abduction. In conclusion, the MMT8 total score is a reliable assessment to consider general muscle weakness in people with myositis but not for single muscle groups. In contrast, our results confirm that HHD can be recommended to evaluate strength of single muscle groups. PMID:29596450
Reliability of a new test battery for fitness assessment of the European Astronaut corps.

PubMed

Petersen, Nora; Thieschäfer, Lutz; Ploutz-Snyder, Lori; Damann, Volker; Mester, Joachim

2015-01-01

To optimise health for space missions, European astronauts follow specific conditioning programs before, during and after their flights. To evaluate the effectiveness of these programs, the European Space Agency conducts an Astronaut Fitness Assessment (AFA), but the test-retest reliability of elements within it remains unexamined. The reliability study described here presents a scientific basis for implementing the AFA, but also highlights challenges faced by operational teams supporting humans in such unique environments, especially with respect to health and fitness monitoring of crew members travelling not only into space, but also across the world. The AFA tests assessed parameters known to be affected by prolonged exposure to microgravity: aerobic capacity (VO2max), muscular strength (one repetition max, 1 RM) and power (vertical jumps), core stability, flexibility and balance. Intraclass correlation coefficients (ICC3.1), standard error of measurement and coefficient of variation were used to assess relative and absolute test-retest reliability. Squat and bench 1 RM (ICC3.1 = 0.94-0.99), hip flexion (ICC3.1 = 0.99) and left and right handgrip strength (ICC3.1 = 0.95 and 0.97), showed the highest test-retest reliability, followed by VO2max (ICC3.1 = 0.91), core strength (ICC3.1 = 0.78-0.89), hip extension (ICC3.1 = 0.63), the countermeasure (ICC3.1 = 0.76) and squat (ICC3.1 = 0.63) jumps, and single right- and left-leg jump height (ICC3.1 = 0.51 and 0.14). For balance, relative reliability ranged from ICC3.1 = 0.78 for path length (two legs, head tilted back, eyes open) to ICC3.1 = 0.04 for average rotation velocity (one leg, eyes closed). In a small sample (n = 8) of young, healthy individuals, the AFA battery of tests demonstrated acceptable test-retest reliability for most parameters except some balance and single-leg jump tasks. These findings suggest that, for the application with astronauts, most AFA tests appear appropriate to be maintained in the test battery, but that some elements may be unreliable, and require either modification (duration, selection of task) or removal (single-leg jump, balance test on sphere) from the battery. The test battery is mobile and universally applicable for occupational and general fitness assessment by its comprehensive composition of tests covering many systems involved in whole body movement.
Reproducibility of urinary biomarkers in multiple 24-h urine samples.

PubMed

Sun, Qi; Bertrand, Kimberly A; Franke, Adrian A; Rosner, Bernard; Curhan, Gary C; Willett, Walter C

2017-01-01

Limited knowledge regarding the reproducibility of biomarkers in 24-h urine samples has hindered the collection and use of the samples in epidemiologic studies. We aimed to evaluate the reproducibility of various markers in repeat 24-h urine samples. We calculated intraclass correlation coefficients (ICCs) of biomarkers measured in 24-h urine samples that were collected in 3168 participants in the NHS (Nurses' Health Study), NHSII (Nurses' Health Study II), and Health Professionals Follow-Up Study. In 742 women with 4 samples each collected over the course of 1 y, ICCs for sodium were 0.32 in the NHS and 0.34 in the NHSII. In 2439 men and women with 2 samples each collected over 1 wk to ≥1 mo, the ICCs ranged from 0.33 to 0.68 for sodium at various intervals between collections. The urinary excretion of potassium, calcium, magnesium, phosphate, sulfate, and other urinary markers showed generally higher reproducibility (ICCs >0.4). In 47 women with two 24-h urine samples, ICCs ranged from 0.15 (catechin) to 0.75 (enterolactone) for polyphenol metabolites. For phthalates, ICCs were generally ≤0.26 except for monobenzyl phthalate (ICC: 0.55), whereas the ICC was 0.39 for bisphenol A (BPA). We further estimated that, for the large majority of the biomarkers, the mean of three 24-h urine samples could provide a correlation of ≥0.8 with true long-term urinary excretion. These data suggest that the urinary excretion of various biomarkers, such as minerals, electrolytes, most polyphenols, and BPA, is reasonably reproducible in 24-h urine samples that are collected within a few days or ≤1 y. Our findings show that three 24-h samples are sufficient for the measurement of long-term exposure status in epidemiologic studies. © 2017 American Society for Nutrition.
Reproducibility of urinary biomarkers in multiple 24-h urine samples123

PubMed Central

Sun, Qi; Bertrand, Kimberly A; Franke, Adrian A; Rosner, Bernard; Curhan, Gary C; Willett, Walter C

2017-01-01

Background: Limited knowledge regarding the reproducibility of biomarkers in 24-h urine samples has hindered the collection and use of the samples in epidemiologic studies. Objective: We aimed to evaluate the reproducibility of various markers in repeat 24-h urine samples. Design: We calculated intraclass correlation coefficients (ICCs) of biomarkers measured in 24-h urine samples that were collected in 3168 participants in the NHS (Nurses’ Health Study), NHSII (Nurses’ Health Study II), and Health Professionals Follow-Up Study. Results: In 742 women with 4 samples each collected over the course of 1 y, ICCs for sodium were 0.32 in the NHS and 0.34 in the NHSII. In 2439 men and women with 2 samples each collected over 1 wk to ≥1 mo, the ICCs ranged from 0.33 to 0.68 for sodium at various intervals between collections. The urinary excretion of potassium, calcium, magnesium, phosphate, sulfate, and other urinary markers showed generally higher reproducibility (ICCs >0.4). In 47 women with two 24-h urine samples, ICCs ranged from 0.15 (catechin) to 0.75 (enterolactone) for polyphenol metabolites. For phthalates, ICCs were generally ≤0.26 except for monobenzyl phthalate (ICC: 0.55), whereas the ICC was 0.39 for bisphenol A (BPA). We further estimated that, for the large majority of the biomarkers, the mean of three 24-h urine samples could provide a correlation of ≥0.8 with true long-term urinary excretion. Conclusions: These data suggest that the urinary excretion of various biomarkers, such as minerals, electrolytes, most polyphenols, and BPA, is reasonably reproducible in 24-h urine samples that are collected within a few days or ≤1 y. Our findings show that three 24-h samples are sufficient for the measurement of long-term exposure status in epidemiologic studies. PMID:28049663
Collinear Latent Variables in Multilevel Confirmatory Factor Analysis

PubMed Central

van de Schoot, Rens; Hox, Joop

2014-01-01

Because variables may be correlated in the social and behavioral sciences, multicollinearity might be problematic. This study investigates the effect of collinearity manipulated in within and between levels of a two-level confirmatory factor analysis by Monte Carlo simulation. Furthermore, the influence of the size of the intraclass correlation coefficient (ICC) and estimation method; maximum likelihood estimation with robust chi-squares and standard errors and Bayesian estimation, on the convergence rate are investigated. The other variables of interest were rate of inadmissible solutions and the relative parameter and standard error bias on the between level. The results showed that inadmissible solutions were obtained when there was between level collinearity and the estimation method was maximum likelihood. In the within level multicollinearity condition, all of the solutions were admissible but the bias values were higher compared with the between level collinearity condition. Bayesian estimation appeared to be robust in obtaining admissible parameters but the relative bias was higher than for maximum likelihood estimation. Finally, as expected, high ICC produced less biased results compared to medium ICC conditions. PMID:29795827
Collinear Latent Variables in Multilevel Confirmatory Factor Analysis: A Comparison of Maximum Likelihood and Bayesian Estimations.

PubMed

Can, Seda; van de Schoot, Rens; Hox, Joop

2015-06-01

Because variables may be correlated in the social and behavioral sciences, multicollinearity might be problematic. This study investigates the effect of collinearity manipulated in within and between levels of a two-level confirmatory factor analysis by Monte Carlo simulation. Furthermore, the influence of the size of the intraclass correlation coefficient (ICC) and estimation method; maximum likelihood estimation with robust chi-squares and standard errors and Bayesian estimation, on the convergence rate are investigated. The other variables of interest were rate of inadmissible solutions and the relative parameter and standard error bias on the between level. The results showed that inadmissible solutions were obtained when there was between level collinearity and the estimation method was maximum likelihood. In the within level multicollinearity condition, all of the solutions were admissible but the bias values were higher compared with the between level collinearity condition. Bayesian estimation appeared to be robust in obtaining admissible parameters but the relative bias was higher than for maximum likelihood estimation. Finally, as expected, high ICC produced less biased results compared to medium ICC conditions.
Reliability of Doppler and stethoscope methods of determining systolic blood pressures: considerations for calculating an ankle-brachial index.

PubMed

Chesbro, Steven B; Asongwed, Elmira T; Brown, Jamesha; John, Emmanuel B

2011-01-01

The purposes of this study were to: (1) identify the interrater and intrarater reliability of systolic blood pressures using a stethoscope and Doppler to determine an ankle-brachial index (ABI), and (2) to determine the correlation between the 2 methods. Peripheral arterial disease (PAD) affects approximately 8 to 12 million people in the United States, and nearly half of those with this disease are asymptomatic. Early detection and prompt treatment of PAD will improve health outcomes. It is important that clinicians perform tests that determine the presence of PAD. Two individual raters trained in ABI procedure measured the systolic blood pressures of 20 individuals' upper and lower extremities. Standard ABI measurement protocols were observed. Raters individually recorded the systolic blood pressures of each extremity using a stethoscope and a Doppler, for a total of 640 independent measures. Interrater reliability of Doppler measurements to determine SBP at the ankle was very strong (intraclass correlation coefficient [ICC], 0.93-0.99) compared to moderate to strong reliability using a stethoscope (ICC, 0.64-0.87). Agreement between the 2 devices to determine SBP was moderate to very weak (ICC, 0.13-0.61). Comparisons of the use of Doppler and stethoscope to determine ABI showed weak to very weak intrarater correlation (ICC, 0.17-0.35). Linear regression analysis of the 2 methods to determine ABI showed positive but weak to very weak correlations (r2 = .013, P = .184). A Doppler ultrasound is recommended over a stethoscope for accuracy in systolic pressure readings for ABI measurements.
Utility of computer-assisted approaches for population surveillance of physical activity.

PubMed

Creamer, MeLisa; Bowles, Heather R; von Hofe, Belinda; Pettee Gabriel, Kelley; Kohl, Harold W; Bauman, Adrian

2014-08-01

Computer-assisted techniques may be a useful way to enhance physical activity surveillance and increase accuracy of reported behaviors. Evaluate the reliability and validity of a physical activity (PA) self-report instrument administered by telephone and internet. The telephone-administered Active Australia Survey was adapted into 2 forms for internet self-administration: survey questions only (internet-text) and with videos demonstrating intensity (internet-video). Data were collected from 158 adults (20-69 years, 61% female) assigned to telephone (telephone-interview) (n = 56), internet-text (n = 51), or internet-video (n = 51). Participants wore an accelerometer and completed a logbook for 7 days. Test-retest reliability was assessed using intraclass correlation coefficients (ICC). Convergent validity was assessed using Spearman correlations. Strong test-retest reliability was observed for PA variables in the internet-text (ICC = 0.69 to 0.88), internet-video (ICC = 0.66 to 0.79), and telephone-interview (ICC = 0.69 to 0.92) groups (P-values < 0.001). For total PA, correlations (ρ) between the survey and Actigraph+logbook were ρ = 0.47 for the internet-text group, ρ = 0.57 for the internet-video group, and ρ = 0.65 for the telephone-interview group. For vigorous-intensity activity, the correlations between the survey and Actigraph+logbook were 0.52 for internet-text, 0.57 for internet-video, and 0.65 for telephone-interview (P < .05). Internet-video of the survey had similar test-retest reliability and convergent validity when compared with the telephone-interview, and should continue to be developed.
Monitoring sedentary patterns in office employees: validity of an m-health tool (Walk@Work-App) for occupational health.

PubMed

Bort-Roig, Judit; Puig-Ribera, Anna; Contreras, Ruth S; Chirveches-Pérez, Emilia; Martori, Joan C; Gilson, Nicholas D; McKenna, Jim

2017-09-15

This study validated the Walk@Work-Application (W@W-App) for measuring occupational sitting and stepping. The W@W-App was installed on the smartphones of office-based employees (n=17; 10 women; 26±3 years). A prescribed 1-hour laboratory protocol plus two continuous hours of occupational free-living activities were performed. Intra-class correlation coefficients (ICC) compared mean differences of sitting time and step count measurements between the W@W-App and criterion measures (ActivPAL3TM and SW200Yamax Digi-Walker). During the protocol, agreement between self-paced walking (ICC=0.85) and active working tasks step counts (ICC=0.80) was good. The smallest median difference was for sitting time (1.5seconds). During free-living conditions, sitting time (ICC=0.99) and stepping (ICC=0.92) showed excellent agreement, with a difference of 0.5minutes and 18 steps respectively. The W@W-App provided valid measures for monitoring occupational sedentary patterns in real life conditions; a key issue for increasing awareness and changing occupational sedentariness. Copyright © 2017 SESPAS. Publicado por Elsevier España, S.L.U. All rights reserved.
Is there a systematic bias of apparent diffusion coefficient (ADC) measurements of the breast if measured on different workstations? An inter- and intra-reader agreement study.

PubMed

Clauser, Paola; Marcon, Magda; Maieron, Marta; Zuiani, Chiara; Bazzocchi, Massimo; Baltzer, Pascal A T

2016-07-01

To evaluate the influence of post-processing systems, intra- and inter-reader agreement on the variability of apparent diffusion coefficient (ADC) measurements in breast lesions. Forty-one patients with 41 biopsy-proven breast lesions gave their informed consent and were included in this prospective IRB-approved study. Magnetic resonance imaging (MRI) examinations were performed at 1.5 T using an EPI-DWI sequence, with b-values of 0 and 1000 s/mm(2). Two radiologists (R1, R2) reviewed the images in separate sessions and measured the ADC for lesion, using MRI-workstation (S-WS), PACS-workstation (P-WS) and a commercial DICOM viewer (O-SW). Agreement was evaluated using the intraclass correlation coefficient (ICC), Bland-Altman plots and coefficient of variation (CV). Thirty-one malignant, two high-risk and eight benign mass-like lesions were analysed. Intra-reader agreement was almost perfect (ICC-R1 = 0.974; ICC-R2 = 0.990) while inter-reader agreement was substantial (ICC from 0.615 to 0.682). Bland-Altman plots revealed a significant bias in ADC values measured between O-SW and S-WS (P = 0.025), no further systematic differences were identified. CV varied from 6.8 % to 7.9 %. Post-processing systems may have a significant, although minor, impact on ADC measurements in breast lesions. While intra-reader agreement is high, the main source of ADC variability seems to be caused by inter-reader variation. • ADC provides quantitative information on breast lesions independent from the system used. • ADC measurement using different workstations and software systems is generally reliable. • Systematic, but minor, differences may occur between different post-processing systems. • Inter-reader agreement of ADC measurements exceeded intra-reader agreement.
Reliability and concurrent validity of the iPhone® Compass application to measure thoracic rotation range of motion (ROM) in healthy participants.

PubMed

Furness, James; Schram, Ben; Cox, Alistair J; Anderson, Sarah L; Keogh, Justin

2018-01-01

Several water-based sports (swimming, surfing and stand up paddle boarding) require adequate thoracic mobility (specifically rotation) in order to perform the appropriate activity requirements. The measurement of thoracic spine rotation is problematic for clinicians due to a lack of convenient and reliable measurement techniques. More recently, smartphones have been used to quantify movement in various joints in the body; however, there appears to be a paucity of research using smartphones to assess thoracic spine movement. Therefore, the aim of this study is to determine the reliability (intra and inter rater) and validity of the iPhone ® app (Compass) when assessing thoracic spine rotation ROM in healthy individuals. A total of thirty participants were recruited for this study. Thoracic spine rotation ROM was measured using both the current clinical gold standard, a universal goniometer (UG) and the Smart Phone Compass app. Intra-rater and inter-rater reliability was determined with a Intraclass Correlation Coefficient (ICC) and associated 95% confidence intervals (CI). Validation of the Compass app in comparison to the UG was measured using Pearson's correlation coefficient and levels of agreement were identified with Bland-Altman plots and 95% limits of agreement. Both the UG and Compass app measurements both had excellent reproducibility for intra-rater (ICC 0.94-0.98) and inter-rater reliability (ICC 0.72-0.89). However, the Compass app measurements had higher intra-rater reliability ( ICC = 0.96 - 0.98; 95% CI [0.93-0.99]; vs. ICC = 0.94 - 0.98; 95% CI [0.88-0.99]) and inter-rater reliability ( ICC = 0.87 - 0.89; 95% CI [0.74-0.95] vs. ICC = 0.72 - 0.82; 95% CI [0.21-0.94]). A strong and significant correlation was found between the UG and the Compass app, demonstrating good concurrent validity ( r = 0.835, p < 0.001). Levels of agreement between the two devices were 24.8° (LoA -9.5°, +15.3°). The UG was found to consistently measure higher values than the compass app (mean difference 2.8°, P < 0.001). This study reveals that the iPhone ® app (Compass) is a reliable tool for measuring thoracic spine rotation which produces greater reproducibility of measurements both within and between raters than a UG. As a significant positive correlation exists between the Compass app and UG, this supports the use of either device in clinical practice as a reliable and valid tool to measure thoracic rotation. Considering the levels of agreement are clinically unacceptable, the devices should not be used interchangeably for initial and follow up measurements.
Reliability analysis of a sensitive and independent stabilometry parameter set

PubMed Central

Nagymáté, Gergely; Orlovits, Zsanett

2018-01-01

Recent studies have suggested reduced independent and sensitive parameter sets for stabilometry measurements based on correlation and variance analyses. However, the reliability of these recommended parameter sets has not been studied in the literature or not in every stance type used in stabilometry assessments, for example, single leg stances. The goal of this study is to evaluate the test-retest reliability of different time-based and frequency-based parameters that are calculated from the center of pressure (CoP) during bipedal and single leg stance for 30- and 60-second measurement intervals. Thirty healthy subjects performed repeated standing trials in a bipedal stance with eyes open and eyes closed conditions and in a single leg stance with eyes open for 60 seconds. A force distribution measuring plate was used to record the CoP. The reliability of the CoP parameters was characterized by using the intraclass correlation coefficient (ICC), standard error of measurement (SEM), minimal detectable change (MDC), coefficient of variation (CV) and CV compliance rate (CVCR). Based on the ICC, SEM and MDC results, many parameters yielded fair to good reliability values, while the CoP path length yielded the highest reliability (smallest ICC > 0.67 (0.54–0.79), largest SEM% = 19.2%). Usually, frequency type parameters and extreme value parameters yielded poor reliability values. There were differences in the reliability of the maximum CoP velocity (better with 30 seconds) and mean power frequency (better with 60 seconds) parameters between the different sampling intervals. PMID:29664938
Reliability analysis of a sensitive and independent stabilometry parameter set.

PubMed

Nagymáté, Gergely; Orlovits, Zsanett; Kiss, Rita M

2018-01-01

Recent studies have suggested reduced independent and sensitive parameter sets for stabilometry measurements based on correlation and variance analyses. However, the reliability of these recommended parameter sets has not been studied in the literature or not in every stance type used in stabilometry assessments, for example, single leg stances. The goal of this study is to evaluate the test-retest reliability of different time-based and frequency-based parameters that are calculated from the center of pressure (CoP) during bipedal and single leg stance for 30- and 60-second measurement intervals. Thirty healthy subjects performed repeated standing trials in a bipedal stance with eyes open and eyes closed conditions and in a single leg stance with eyes open for 60 seconds. A force distribution measuring plate was used to record the CoP. The reliability of the CoP parameters was characterized by using the intraclass correlation coefficient (ICC), standard error of measurement (SEM), minimal detectable change (MDC), coefficient of variation (CV) and CV compliance rate (CVCR). Based on the ICC, SEM and MDC results, many parameters yielded fair to good reliability values, while the CoP path length yielded the highest reliability (smallest ICC > 0.67 (0.54-0.79), largest SEM% = 19.2%). Usually, frequency type parameters and extreme value parameters yielded poor reliability values. There were differences in the reliability of the maximum CoP velocity (better with 30 seconds) and mean power frequency (better with 60 seconds) parameters between the different sampling intervals.
Construction of the Mandarin version of the International Prostate Symptom Score inventory in assessing lower urinary tract symptoms in a Malaysian population.

PubMed

Quek, Kia Fatt; Chua, Chong Beng; Razack, Azad Hassan; Low, Wah Yun; Loh, Chit Sin

2005-01-01

The purpose of the present study was to validate the Mandarin version of the International Prostate Symptom Score (Mand-IPSS) in a Malaysian population. The validity and reliability were studied in patients with lower urinary tract symptoms (LUTS; benign prostatic hyperplasia [BPH] group) and without LUTS (control group). Test-retest methodology was used to assess the reliability while Cronbach alpha was used to assess the internal consistency. Sensitivity to change was used to express the effect size index in the preintervention versus post-intervention score in patients with LUTS who underwent transurethral resection of the prostate. For the control group and BPH group, the internal consistency was excellent and a high degree of internal consistency was observed for all seven items (Cronbach alpha = 0.86-0.98 and 0.90-0.98, respectively). Test-retest correlation coefficients for all items were highly significant. Intraclass correlation coefficient (ICC) was high for the control (ICC = 0.93-0.99) and BPH group (ICC = 0.91-0.99). The sensitivity and specificity showed a high degree of sensitivity and specificity to the effects of treatment. A high degree of significance between baseline and post-treatment scores was observed across all seven items in the BPH group but not in the control group. The Mand-IPSS is a suitable, reliable, valid and sensitive instrument to measure clinical change in the Malaysian population.
Reproducibility of the time to peak torque and the joint angle at peak torque on knee of young sportsmen on the isokinetic dynamometer.

PubMed

Bernard, P-L; Amato, M; Degache, F; Edouard, P; Ramdani, S; Blain, H; Calmels, P; Codine, P

2012-05-01

Although peak torque has shown acceptable reproducibility, this may not be the case with two other often used parameters: time to peak torque (TPT) and the angle of peak torque (APT). Those two parameters should be used for the characterization of muscular adaptations in athletes. The isokinetic performance of the knee extensors and flexors in both limbs was measured in 29 male athletes. The experimental protocol consisted of three consecutive identical paradigms separated by 45 min breaks. Each test consisted of four maximal concentric efforts performed at 60 and 180°/s. Reproducibility was quantified by the standard error measurement (SEM), the coefficient of variation (CV) and by means of intra-class correlation coefficients (ICCs) with the calculation of 6 forms of ICCs. Using ICC as the indicator of reproducibility, the correlations for TPT of both limbs showed a range of 0.51-0.65 in extension and 0.50-0.63 in flexion. For APT, the values were 0.46-0.60 and 0.51-0.81, respectively. In addition, the calculated standard error of measurement (SEM) and CV scores confirmed the low level of absolute reproducibility. Due to their low reproducibility, neither TPT nor APT can serve as independent isokinetic parameters of knee flexor and extensor performance. So, given its reproducibility level, TPT and APT should not be used for the characterization of muscular adaptations in athletes. Copyright © 2012 Elsevier Masson SAS. All rights reserved.
Adolescent Alcohol Use Self-Report Stability: A Decade of Panel Study Data

ERIC Educational Resources Information Center

Shillington, Audrey M.; Clapp, John D.; Reed, Mark B.; Woodruff, Susan I.

2011-01-01

This study analyzed six waves of panel data from the National Longitudinal Survey of Youth (NLSY). These analyses were conducted to test the stability of self-reported lifetime use and age of onset. Intraclass correlation coefficients (ICCs) indicated that the stability of age of onset reports decreased with longer time frames between follow-ups.…
Intra and inter-rater reliability of infrared image analysis of masticatory and upper trapezius muscles in women with and without temporomandibular disorder.

PubMed

Costa, Ana C S; Dibai Filho, Almir V; Packer, Amanda C; Rodrigues-Bigaton, Delaine

2013-01-01

Infrared thermography is an aid tool that can be used to evaluate several pathologies given its efficiency in analyzing the distribution of skin surface temperature. To propose two forms of infrared image analysis of the masticatory and upper trapezius muscles, and to determine the intra and inter-rater reliability of both forms of analysis. Infrared images of masticatory and upper trapezius muscles of 64 female volunteers with and without temporomandibular disorder (TMD) were collected. Two raters performed the infrared image analysis, which occurred in two ways: temperature measurement of the muscle length and in central portion of the muscle. The Intraclass Correlation Coefficient (ICC) was used to determine the intra and inter-rater reliability. The ICC showed excellent intra and inter-rater values for both measurements: temperature measurement of the muscle length (TMD group, intra-rater, ICC ranged from 0.996 to 0.999, inter-rater, ICC ranged from 0.992 to 0.999; control group, intra-rater, ICC ranged from 0.993 to 0.998, inter-rater, ICC ranged from 0.990 to 0.998), and temperature measurement of the central portion of the muscle (TMD group, intra-rater, ICC ranged from 0.981 to 0.998, inter-rater, ICC ranged from 0.971 to 0.998; control group, intra-rater, ICC ranged from 0.887 to 0.996, inter-rater, ICC ranged from 0.852 to 0.996). The results indicated that temperature measurements of the masticatory and upper trapezius muscles carried out by the analysis of the muscle length and central portion yielded excellent intra and inter-rater reliability.
Reliability and validity of an audio signal modified shuttle walk test.

PubMed

Singla, Rupak; Rai, Richa; Faye, Abhishek Anil; Jain, Anil Kumar; Chowdhury, Ranadip; Bandyopadhyay, Debdutta

2017-01-01

The audio signal in the conventionally accepted protocol of shuttle walk test (SWT) is not well-understood by the patients and modification of the audio signal may improve the performance of the test. The aim of this study is to study the validity and reliability of an audio signal modified SWT, called the Singla-Richa modified SWT (SWTSR), in healthy normal adults. In SWTSR, the audio signal was modified with the addition of reverse counting to it. A total of 54 healthy normal adults underwent conventional SWT (CSWT) at one instance and two times SWTSRon the same day. The validity was assessed by comparing outcomes of the SWTSRto outcomes of CSWT using the Pearson correlation coefficient and Bland-Altman plot. Test-retest reliability of SWTSRwas assessed using the intraclass correlation coefficient (ICC). The acceptability of the modified test in comparison to the conventional test was assessed using Likert scale. The distance walked (mean ± standard deviation) in the CSWT and SWTSRtest was 853.33 ± 217.33 m and 857.22 ± 219.56 m, respectively (Pearson correlation coefficient - 0.98; P < 0.001) indicating SWTSRto be a valid test. The SWTSRwas found to be a reliable test with ICC of 0.98 (95% confidence interval: 0.97-0.99). The acceptability of SWTSRwas significantly higher than CSWT. The SWTSRwith modified audio signal with reverse counting is a reliable as well as a valid test when compared with CSWT in healthy normal adults. It better understood by subjects compared to CSWT.

Validity and reliability of head posture measurement using Microsoft Kinect.

PubMed

Oh, Baek-Lok; Kim, Jongmin; Kim, Jongshin; Hwang, Jeong-Min; Lee, Jehee

2014-11-01

To investigate the validity and reliability of Microsoft Kinect-based head tracker (KHT) for measuring head posture. Considering the cervical range of motion (CROM) as a reference, one-dimensional and three-dimensional (1D and 3D) head postures of 12 normal subjects (28-58 years of age; 6 women and 6 men) were obtained using the KHT. The KHT was validated by Pearson's correlation coefficient and intraclass correlation (ICC) coefficient. Test-retest reliability of the KHT was determined by its 95% limit of agreement (LoA) with the Bland-Altman plot. Face recognition success rate was evaluated for each head posture. Measurements of 1D and 3D head posture performed using the KHT were very close to those of the CROM with correlation coefficients of 0.99 and 0.97 (p<0.05), respectively, as well as with an ICC of >0.99 and 0.98, respectively. The reliability tests of the KHT in terms of 1D and 3D head postures had 95% LoA angles of approximately ±2.5° and ±6.5°, respectively. The KHT showed good agreement with the CROM and relatively favourable test-retest reliability. Considering its high performance, convenience and low cost, KHT could be clinically used as a head posture-measuring system. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
[Translation and validation in italian of the Moral Distress Scale for psychiatric nurses (MDS-P)].

PubMed

Canciani, Eleonora; Spotti, Daniela; Bonetti, Loris

2016-01-01

Moral distress (MD) is a painful feeling and/or psychological disequilibrium, which may lead to negative consequences into the wellness of a nurse's working life. Nurses who work in psychiatry are more likely to experience a different type of MD compared with nurses of other contexts. In Italy a tool to evaluate MD in nurses who work in psychiatry doesn't exist. The aim of this study is to validate the Moral Distress Scale for Psychiatric Nurses (MDS-P) in Italian language. For translation the forward and back-translation has been used; the effectiveness regarding content and face validity of the translated scale has been analyzed through a focus group with experts of the field. In order to check the reliability of the scale the test-retest method has been used, by means of the determination of Spearman's correlation coefficient, Intraclass Correlation Coefficient (ICC) and Cronbach's alpha. The forward and back-translation process was successful. During the focus group analysis, 8 items were added to the 15 items of the original scale, due to experts suggestions. 32 nurses took part in the test-retest phase. Spearman's correlation coefficient resulted to be 0,91, ICC > 0,9, Cronbach's alpha calculated on test and retest, was always >0,9. The Italian version of the MDS-P proves to be an effective, appropriate and reliable instrument to measure the MD phenomenon within the population of nurses who work in the psychia- tric field in Italy.
Development and preliminary reliability of a multitasking assessment for executive functioning after concussion.

PubMed

Smith, Laurel B; Radomski, Mary Vining; Davidson, Leslie Freeman; Finkelstein, Marsha; Weightman, Margaret M; McCulloch, Karen L; Scherer, Matthew R

2014-01-01

OBJECTIVES. Executive functioning deficits may result from concussion. The Charge of Quarters (CQ) Duty Task is a multitask assessment designed to assess executive functioning in servicemembers after concussion. In this article, we discuss the rationale and process used in the development of the CQ Duty Task and present pilot data from the preliminary evaluation of interrater reliability (IRR). METHOD. Three evaluators observed as 12 healthy participants performed the CQ Duty Task and measured performance using various metrics. Intraclass correlation coefficient (ICC) quantified IRR. RESULTS. The ICC for task completion was .94. ICCs for other assessment metrics were variable. CONCLUSION. Preliminary IRR data for the CQ Duty Task are encouraging, but further investigation is needed to improve IRR in some domains. Lessons learned in the development of the CQ Duty Task could benefit future test development efforts with populations other than the military. Copyright © 2014 by the American Occupational Therapy Association, Inc.
Development and Preliminary Reliability of a Multitasking Assessment for Executive Functioning After Concussion

PubMed Central

Radomski, Mary Vining; Davidson, Leslie Freeman; Finkelstein, Marsha; Weightman, Margaret M.; McCulloch, Karen L.; Scherer, Matthew R.

2014-01-01

OBJECTIVES. Executive functioning deficits may result from concussion. The Charge of Quarters (CQ) Duty Task is a multitask assessment designed to assess executive functioning in servicemembers after concussion. In this article, we discuss the rationale and process used in the development of the CQ Duty Task and present pilot data from the preliminary evaluation of interrater reliability (IRR). METHOD. Three evaluators observed as 12 healthy participants performed the CQ Duty Task and measured performance using various metrics. Intraclass correlation coefficient (ICC) quantified IRR. RESULTS. The ICC for task completion was .94. ICCs for other assessment metrics were variable. CONCLUSION. Preliminary IRR data for the CQ Duty Task are encouraging, but further investigation is needed to improve IRR in some domains. Lessons learned in the development of the CQ Duty Task could benefit future test development efforts with populations other than the military. PMID:25005507
Clinimetric properties of the alberta infant motor scale in infants born preterm.

PubMed

Pin, Tamis W; de Valle, Katy; Eldridge, Bev; Galea, Mary P

2010-01-01

The Alberta Infant Motor Scale (AIMS) is a standardized motor assessment for young infants. This study aimed to examine the reliability of the AIMS in a group of infants born at or before 29 weeks of gestation. Fifty-nine infants born preterm were recruited. Two experienced pediatric physical therapists participated in this reliability study. Infants were assessed at 4, 8, 12, and 18 months corrected age (CA). Intrarater reliability was high (intraclass correlation coefficient [ICC] > or =0.99). The ICC for interrater reliability varied from 0.85 to 0.97. The ICC was low at 4 and 18 months CA. The AIMS is reliable in evaluating motor development in infants born preterm. Clinicians should be cautious about using the AIMS in infants at very young ages and those approaching independent ambulation. Accurate placement of the window on a movement repertoire is crucial. Attention is required when using the AIMS in infants developing atypically.
Reliability of self-reported weight and height among state bank employees.

PubMed

Chor, D; Coutinho, E da S; Laurenti, R

1999-02-01

Self-reported weight and height were compared with direct measurements in order to evaluate the agreement between the two sources. Data were obtained from a cross-sectional study on health status from a probabilistic sample of 1,183 employees of a bank, in Rio de Janeiro State, Brazil. Direct measurements were made of 322 employees. Differences between the two sources were evaluated using mean differences, limits of agreement and intraclass correlation coefficient (ICC). Men and women tended to underestimate their weight while differences between self-reported and measured height were insignificant. Body mass index (BMI) mean differences were smaller than those observed for weight. ICC was over 0.98 for weight and 0.95 for BMI, expressing close agreement. Combining a graphical method with ICC may be useful in pilot studies to detect populational groups capable of providing reliable information on weight and height, thus minimizing resources needed for field work.
Validation and cultural adaptation of a German version of the Physicians' Reactions to Uncertainty scales

PubMed Central

Schneider, Antonius; Szecsenyi, Joachim; Barie, Stefan; Joest, Katharina; Rosemann, Thomas

2007-01-01

Background The aim of the study was to examine the validity of a translated and culturally adapted version of the Physicians' Reaction to Uncertainty scales (PRU) in primary care physicians. Methods In a structured process, the original questionnaire was translated, culturally adapted and assessed after administering it to 93 GPs. Test-retest reliability was tested by sending the questionnaire to the GPs again after two weeks. Results The principal factor analysis confirmed the postulated four-factor structure underlying the 15 items. In contrast to the original version, item 5 achieved a higher loading on the 'concern about bad outcomes' scale. Consequently, we rearranged the scales. Good item-scale correlations were obtained, with Pearson's correlation coefficient ranging from 0.56–0.84. As regards the item-discriminant validity between the scales 'anxiety due to uncertainty' and 'concern about bad outcomes', partially high correlations (Pearson's correlation coefficient 0.02–0.69; p < 0.001) were found, indicating an overlap between both constructs. The assessment of internal consistency revealed satisfactory values; Cronbach's alpha of the rearranged version was 0.86 or higher for all scales. Test-retest-reliability, assessed by means of the intraclass-correlation-coefficient (ICC), exceeded 0.84, except for the 'reluctance to disclose mistakes to physicians' scale (ICC = 0.66). In this scale, some substantial floor effects occurred, with 29.3% of answers showing the lowest possible value. Conclusion Dealing with uncertainty is an important issue in daily practice. The psychometric properties of the rearranged German version of the PRU are satisfying. The revealed floor effects do not limit the significance of the questionnaire. Thus, the German version of the PRU could contribute to the further evaluation of the impact of uncertainty in primary care physicians. PMID:17562018
Movement-related beta oscillations show high intra-individual reliability.

PubMed

Espenhahn, Svenja; de Berker, Archy O; van Wijk, Bernadette C M; Rossiter, Holly E; Ward, Nick S

2017-02-15

Oscillatory activity in the beta frequency range (15-30Hz) recorded from human sensorimotor cortex is of increasing interest as a putative biomarker of motor system function and dysfunction. Despite its increasing use in basic and clinical research, surprisingly little is known about the test-retest reliability of spectral power and peak frequency measures of beta oscillatory signals from sensorimotor cortex. Establishing that these beta measures are stable over time in healthy populations is a necessary precursor to their use in the clinic. Here, we used scalp electroencephalography (EEG) to evaluate intra-individual reliability of beta-band oscillations over six sessions, focusing on changes in beta activity during movement (Movement-Related Beta Desynchronization, MRBD) and after movement termination (Post-Movement Beta Rebound, PMBR). Subjects performed visually-cued unimanual wrist flexion and extension. We assessed Intraclass Correlation Coefficients (ICC) and between-session correlations for spectral power and peak frequency measures of movement-related and resting beta activity. Movement-related and resting beta power from both sensorimotor cortices was highly reliable across sessions. Resting beta power yielded highest reliability (average ICC=0.903), followed by MRBD (average ICC=0.886) and PMBR (average ICC=0.663). Notably, peak frequency measures yielded lower ICC values compared to the assessment of spectral power, particularly for movement-related beta activity (ICC=0.386-0.402). Our data highlight that power measures of movement-related beta oscillations are highly reliable, while corresponding peak frequency measures show greater intra-individual variability across sessions. Importantly, our finding that beta power estimates show high intra-individual reliability over time serves to validate the notion that these measures reflect meaningful individual differences that can be utilised in basic research and clinical studies. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
A Culture-Specific Nutrient Intake Assessment Instrument in Patients with Pulmonary Tuberculosis

PubMed Central

Frediani, Jennifer K.; Tukvadze, Nestani; Sanikidze, Ekaterina; Kipiani, Maia; Hebbar, Gautam; Easley, Kirk A.; Shenvi, Neeta; Ramakrishnan, Usha; Tangpricha, Vin; Blumberg, Henry M.; Ziegler, Thomas R.

2013-01-01

Background and Aim To develop and evaluate a culture-specific nutrient intake assessment tool for use in adults with pulmonary tuberculosis (TB) in Tbilisi, Georgia. Methods We developed an instrument to measure food intake over 3 consecutive days using a questionnaire format. The tool was then compared to 24 hour food recalls. Food intake data from 31 subjects with TB were analyzed using the Nutrient Database System for Research (NDS-R) dietary analysis program. Paired t-tests, Pearson correlations and intraclass correlation coefficients (ICC) were used to assess the agreement between the two methods of dietary intake for calculated nutrient intakes. Results The Pearson correlation coefficient for mean daily caloric intake between the 2 methods was 0.37 (P = 0.04) with a mean difference of 171 kcals/day (p = 0.34). The ICC was 0.38 (95% CI: 0.03 to 0.64) suggesting the within-patient variability may be larger than between-patient variability. Results for mean daily intake of total fat, total carbohydrate, total protein, retinol, vitamins D and E, thiamine, calcium, sodium, iron, selenium, copper, and zinc between the two assessment methods were also similar. Conclusions This novel nutrient intake assessment tool provided quantitative nutrient intake data from TB patients. These pilot data can inform larger studies in similar populations. PMID:23541173
Using pedometers to estimate ambulatory physical activity in Vietnam.

PubMed

Thuy, Au Bich; Blizzard, Leigh; Schmidt, Michael; Magnussen, Costan; Hansen, Emily; Dwyer, Terence

2011-01-01

Pedometer measurement of physical activity (PA) has been shown to be reliable and valid in industrialized populations, but its applicability in economically developing Vietnam remains untested. This study assessed the feasibility, stability and validity of pedometer estimates of PA in Vietnam. 250 adults from a population-based survey were randomly selected to wear Yamax pedometers and record activities for 7 consecutive days. Stability and concurrent validity were assessed using intraclass correlation coefficients (ICC) and Spearman correlation coefficients. Overall, 97.6% of participants provided at least 1 day of usable recordings, and 76.2% wore pedometers for all 7 days. Only 5.2% of the sample participants were involved in work activities not measurable by pedometer. The number of steps increased with hours of wear. There was no significant difference between weekday and weekend in number of steps, and at least 3 days of recordings were required (ICC of the 3 days of recordings: men 0.96, women 0.97). Steps per hour were moderately correlated (men r = .42, women r = .26) with record estimates of total PA. It is feasible to use pedometers to estimate PA in Vietnam. The measure should involve at least 3 days of recording irrespective of day of the week. ©2011 Human Kinetics, Inc.
Two-colour chewing gum mixing ability: digitalisation and spatial heterogeneity analysis.

PubMed

Weijenberg, R A F; Scherder, E J A; Visscher, C M; Gorissen, T; Yoshida, E; Lobbezoo, F

2013-10-01

Many techniques are available to assess masticatory performance, but not all are appropriate for every population. A proxy suitable for elderly persons suffering from dementia was lacking, and a two-colour chewing gum mixing ability test was investigated for this purpose. A fully automated digital analysis algorithm was applied to a mixing ability test using two-coloured gum samples in a stepwise increased number of chewing cycles protocol (Experiment 1: n = 14; seven men, 19-63 years), a test-retest assessment (Experiment 2: n = 10; four men, 20-49 years) and compared to an established wax cubes mixing ability test (Experiment 3: n = 13; 0 men, 21-31 years). Data were analysed with repeated measures anova (Experiment 1), the calculation of the intraclass correlation coefficient (ICC; Experiment 2) and Spearman's rho correlation coefficient (Experiment 3). The method was sensitive to increasing numbers of chewing cycles (F5,65 = 57·270, P = 0·000) and reliable in the test-retest (ICC value of 0·714, P = 0·004). There was no significant correlation between the two-coloured gum test and the wax cubes test. The two-coloured gum mixing ability test was able to adequately assess masticatory function and is recommended for use in a population of elderly persons with dementia. © 2013 John Wiley & Sons Ltd.
Psychometric testing of the modified Care Dependency Scale (Neuro-CDS).

PubMed

Piredda, Michela; Biagioli, Valentina; Gambale, Giulia; Porcelli, Elisa; Barbaranelli, Claudio; Palese, Alvisa; De Marinis, Maria Grazia

2016-01-01

Effective measures of nursing care dependency in neurorehabilitation are warranted to plan nursing interventions to help patients avoid increasing dependency. The Care Dependency Scale (CDS) is a theory-based, comprehensive tool to evaluate functional disability. This study aimed to modify the CDS for neurological and neurorehabilitation patients (Neuro-CDS) and to test its psychometric properties in adult neurorehabilitation inpatients. Exploratory factor analysis (EFA) was performed using a Maximum Likelihood robust (MLR) estimator. The Barthel Index (BI) was used to evaluate concurrent validity. Stability was measured using the Intra-class Correlation Coefficient (ICC). The sample included 124 patients (mean age = 69.7 years, 54% male). The EFA revealed a two-factor structure with good fit indexes, Factor 1 (Physical care dependence) loaded by 11 items and Factor 2 (Psycho-social care dependence) loaded by 4 items. The correlation between factors was 0.61. Correlations between Factor 1 and the BI and between Factor 2 and the BI were r = 0.843 and r = 0.677, respectively (p < 0.001). The Cronbach's alpha coefficients were 0.99 and 0.88 (Factor 1 and 2). The ICC was 0.98. The Neuro-CDS is multidimensional, valid, reliable, straightforward, and able to measure care dependence in neurorehabilitation patients as a basis for individualized and holistic care.
Agreement between the Facial Nerve Grading System 2.0 and the House-Brackmann Grading System in Patients with Bell Palsy.

PubMed

Lee, Ho Yun; Park, Moon Suh; Byun, Jae Yong; Chung, Ji Hyun; Na, Se Young; Yeo, Seung Geun

2013-09-01

We have analyzed the correlation between the House-Brackmann (HB) scale and Facial Nerve Grading System 2.0 (FNGS 2.0) in patients with Bell palsy, and evaluated the usefulness of the new grading system. Sixty patients diagnosed with Bell palsy from May 2009 to December 2010 were evaluated using the HB scale and FNGS 2.0 scale during their initial visit, and after 3 and 6 weeks and 3 months. The overall intraclass correlation coefficient (ICC) was 0.908 (P=0.000) and the Spearman correlation coefficient (SCC) was 0.912 (P<0.05). ICC and SCC displayed differences over time, being 0.604 and 0.626, respectively, at first visit; 0.834 and 0.843, respectively, after 3 weeks; 0.844 and 0.848, respectively, after 6 weeks; and 0.808 and 0.793, respectively, after 3 months. There was a significant difference in full recovery, depending on the scale used (HB, P=0.000; FNGS 2.0, P<0.05). The exact agreements between regional assessment and FNGS 2.0 for the mouth, eyes, and brow were 72%, 63%, and 52%, respectively. FNGS 2.0 shows moderate agreement with HB grading. Regional assessment, rather than HB grading, yields stricter evaluation, resulting in better prognosis and determination of grade.
Examining the reliability and validity of a modified version of the International Physical Activity Questionnaire, long form (IPAQ-LF) in Nigeria: a cross-sectional study.

PubMed

Oyeyemi, Adewale L; Bello, Umar M; Philemon, Saratu T; Aliyu, Habeeb N; Majidadi, Rebecca W; Oyeyemi, Adetoyeje Y

2014-12-01

To investigate the reliability and an aspect of validity of a modified version of the long International Physical Activity Questionnaire (Hausa IPAQ-LF) in Nigeria. Cross-sectional study, examining the reliability and construct validity of the Hausa IPAQ-LF compared with anthropometric and biological variables. Metropolitan Maiduguri, the capital city of Borno State in Nigeria. 180 Nigerian adults (50% women) with a mean age of 35.6 (SD=10.3) years, recruited from neighbourhoods with diverse socioeconomic status and walkability. Domains (domestic physical activity (PA), occupational PA, leisure-time PA, active transportation and sitting time) and intensities of PA (vigorous, moderate and walking) were measured with the Hausa IPAQ-LF on two different occasions, 8 days apart. Outcomes for construct validity were measured body mass index (BMI), systolic blood pressure (SBP) and diastolic blood pressure (DBP). The Hausa IPAQ-LF demonstrated good test-retest reliability (intraclass correlation coefficient, ICC>75) for total PA (ICC=0.79, 95% CI 0.65 to 0.82), occupational PA (ICC=0.77, 95% CI 0.68 to 0.82), active transportation (ICC=0.82, 95% CI 0.75 to 0.87) and vigorous intensity activities (ICC=0.82, 95% CI 0.76 to 0.87). Reliability was substantially higher for total PA (ICC=0.80), occupational PA (ICC=0.78), leisure-time PA (ICC=0.75) and active transportation (ICC=0.80) in men than in women, but domestic PA (ICC=0.38) and sitting time (ICC=0.71) demonstrated more substantial reliability coefficients in women than in men. For the construct validity, domestic PA was significantly related mainly with SBP (r=-0.27) and DBP (r=-0.17), and leisure-time PA and total PA were significantly related only with SBP (r=-0.16) and BMI (r=-0.29), respectively. Similarly, moderate-intensity PA was mainly related with SBP (r=-0.16, p<0.05) and DBP (r=-0.21, p<0.01), but vigorous-intensity PA was only related with BMI (r=-0.11, p<0.05). The modified Hausa IPAQ-LF demonstrated sufficient evidence of test-retest reliability and may be valid for assessing context specific PA behaviours of adults in Nigeria. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
The concurrent validity and reliability of a low-cost, high-speed camera-based method for measuring the flight time of vertical jumps.

PubMed

Balsalobre-Fernández, Carlos; Tejero-González, Carlos M; del Campo-Vecino, Juan; Bavaresco, Nicolás

2014-02-01

Flight time is the most accurate and frequently used variable when assessing the height of vertical jumps. The purpose of this study was to analyze the validity and reliability of an alternative method (i.e., the HSC-Kinovea method) for measuring the flight time and height of vertical jumping using a low-cost high-speed Casio Exilim FH-25 camera (HSC). To this end, 25 subjects performed a total of 125 vertical jumps on an infrared (IR) platform while simultaneously being recorded with a HSC at 240 fps. Subsequently, 2 observers with no experience in video analysis analyzed the 125 videos independently using the open-license Kinovea 0.8.15 software. The flight times obtained were then converted into vertical jump heights, and the intraclass correlation coefficient (ICC), Bland-Altman plot, and Pearson correlation coefficient were calculated for those variables. The results showed a perfect correlation agreement (ICC = 1, p < 0.0001) between both observers' measurements of flight time and jump height and a highly reliable agreement (ICC = 0.997, p < 0.0001) between the observers' measurements of flight time and jump height using the HSC-Kinovea method and those obtained using the IR system, thus explaining 99.5% (p < 0.0001) of the differences (shared variance) obtained using the IR platform. As a result, besides requiring no previous experience in the use of this technology, the HSC-Kinovea method can be considered to provide similarly valid and reliable measurements of flight time and vertical jump height as more expensive equipment (i.e., IR). As such, coaches from many sports could use the HSC-Kinovea method to measure the flight time and height of their athlete's vertical jumps.
Study to determine the criterion validity of the SenseWear Armband as a measure of physical activity in people with rheumatoid arthritis.

PubMed

Tierney, Marie; Fraser, Alexander; Purtill, Helen; Kennedy, Norelee

2013-06-01

Measuring physical activity in people with rheumatoid arthritis (RA) is of great importance in light of the increased mortality in this population due to cardiovascular disease. Validation of activity monitors in specific populations is recommended to ensure the accuracy of physical activity measurement. Thus, the purpose of this study was to determine the validity of the SenseWear Pro3 Armband (SWA) as a measure of physical activity during activities of daily living (ADL) in people with RA. Fourteen subjects (8 men and 6 women) with a diagnosis of RA were recruited from rheumatology clinics at the Mid-Western Regional Hospitals, Limerick, Ireland. Participants undertook a series of ADL of varying intensities. The SWA was compared to the criterion measures of the Oxycon Mobile indirect calorimetry system (energy expenditure in kJ) and of manual video observation (step count). Bland and Altman, intraclass correlation coefficient (ICC), and correlation analyses were done using SPSS, version 19.0. The SWA showed substantial agreement (ICC 0.717, P < 0.001) and a strong relationship (Pearson's correlation coefficient = 0.852) compared with the criterion measure when estimating energy expenditure during ADL. However, it was found that the SWA overestimated energy expenditure, particularly at higher intensity levels. The ability of the SWA to estimate step counts during ADL was poor (ICC 0.304, P = 0.038). The SWA can be considered a valid tool to estimate energy expenditure during ADL in the RA population; however, attention should be paid to its tendency to overestimate energy expenditure. Copyright © 2013 by the American College of Rheumatology.
Translation and Cross-cultural Adaptation of the Hip Disability and Osteoarthritis Score into Persian Language: Reassessment of Validity and Reliability

PubMed Central

Mousavian, Alireza; Kachooie, Amir Reza; Birjandinejad, Ali; Khoshsaligheh, Masood; Ebrahimzadeh, Mohammad Hosein

2018-01-01

Background: This study aimed Persian translation and validation of the hip disability and osteoarthritis outcome score (HOOS) questionnaire. Methods: The study was carried out in two phases. First, we translated the HOOS according to acceptable guidelines. We assessed HOOS content convergent validity on 203 hip osteoarthritis patients using SF-36. Internal consistency was tested using Cronbach's alpha coefficient if each item removed and intraclass correlation coefficient (ICC) for the assessment of test-retest reproducibility. Results: Patients had mean (standard deviation) age of 39 (17). Test-retest ICC in whole was 0.95 (P = 0.014) showing excellent reliability. ICC was 0.92 for the “pain” subscale (P = 0.02), 0.81 for the “symptom” subscale (P = 0.002), 0.81 for the “function of daily living (FDL)” (P = 0.022), 0.88 for the “function of sports and recreational activities” (P = 0.006), but it was 0.62 (P = 0.1) for the “quality of life (QOL).” Cronbach's alpha was 0.92, 0.73, 0.97, 0.86, 0.80, and 0.80 for the pain, symptom, FDL, function of sports, QOL, and stiffness, respectively, showing good to excellent internal consistancy. Having SF-36 for the assessment of convergent validity, there was a strong correlation between total HOOS score and the physical component summary domain of SF-36 (r = 0.64, P = 0.0001), whereas the t correlation with the mental component summary domain was weak (r = 0.16, P = 0.04). Conclusions: The Persian version of the HOOS questionnaire is a valid (regarding physical not mental aspects) and reliable assessment tool in patients with hip osteoarthritis. PMID:29619147
The admissions process of a bachelor of science in nursing program: initial reliability and validity of the personal interview.

PubMed

Carpio, B; Brown, B

1993-01-01

The undergraduate nursing degree program (B.Sc.N.) at McMaster University School of Nursing uses small groups, and is learner-centered and problem-based. A study was conducted during the 1991 admissions cycle to determine the initial reliability and validity of the semi-structured personal interview which constitutes the final component of candidate selection for this program. During the interview, three-member teams assess applicant suitability to the program based on six dimensions: applicant motivation, awareness of the program, problem-solving abilities, ability to relate to others, self-appraisal skills, and career goals. Each interviewer assigns the applicant a global rating using a seven-point scale. For the purposes of this study four interviewer teams were randomly selected from the pool of 31 teams to interview four simulated (preprogrammed) applicants. Using two-factor repeated-measures ANOVA to analyze interview ratings, inter-rater and inter-team intraclass correlation coefficients (ICC) were calculated. Inter-team reliability ranged from .64 to .97 for the individual dimensions, and .66 to .89 on global ratings. Inter-rater ICC for the six dimensions ranged from .81 to .99, and .96 to .99 for the global ratings. The item-to-total correlation coefficients between individual dimensions and global ratings ranged from .8 to 1.0. Pearson correlations between items ranged from .77 to 1.0. The ICC were then calculated for the interview scores of 108 actual applicants to the program. Inter-rater reliability based on global ratings was .79 for the single (1 rater) observation, and .91 for the multiple (3 rater) observation. These findings support the continued use of the interview as a reliable instrument with face validity. Studies of predictive validity will be undertaken.
Psychometric properties of the OARSI/OMERACT osteoarthritis pain and functional impairment scales: ICOAP, KOOS-PS and HOOS-PS.

PubMed

Ruyssen-Witrand, A; Fernandez-Lopez, C J; Gossec, L; Anract, P; Courpied, J P; Dougados, M

2011-01-01

To evaluate the psychometric properties of the OARSI-OMERACT questionnaires in comparison to the existing validated scales. Consecutive hip or knee osteoarthritis patients consulting in an orthopedic department were enrolled in the study. Data collected were pain using the Intermittent and Constant Osteoarthritis Pain (ICOAP), a Numeric Rating Scale (NRS), the Western Ontario McMaster Universities' Osteoarthritis Index (WOMAC) pain subscale, the Lequesne pain subscale; functional impairment using the Knee disability and Osteoarthritis Outcome Score-Physical Function Shortform (KOOS-PS), the Hip disability and Osteoarthritis Outcome Score-Physical Function Shortform (HOOS-PS), a NRS, the WOMAC function sub-scale, the Lequesne function subscale. Validity was assessed by calculating the Spearman's correlation coefficient between all the scales. Reliability was assessed in out-patients with stable disease comparing the data collected within 2 weeks using the intra-class correlation coefficient (ICC). Responsiveness was assessed on the data from hospitalised patients prior to and 12 weeks after a total joint replacement (TJR) using the standardised response mean. Three hundred patients (mean age=68 years, females=62%, hip OA=57%) were included. There was a moderate to good correlation between ICOAP, KOOS-PS, HOOS-PS and the WOMAC, NRS and Lequesne scales. Reliability of the ICOAP hip OA HOOS-PS and KOOS-PS was good (ICC range 0.80-0.81) whereas it was moderate for knee ICOAP (ICC=0.65). Responsiveness of the ICOAP, KOOS-PS and HOOS-PS 12 weeks after TJR was comparable to responsiveness of other scales (SRM range: 0.54-1.82). The psychometric properties of the ICOAP, KOOS-PS and HOOS-PS were comparable to those of the WOMAC, Lequesne and NRS.
Variability in baseline laboratory measurements of the Brazilian Longitudinal Study of Adult Health (ELSA-Brasil).

PubMed

Ladwig, R; Vigo, A; Fedeli, L M G; Chambless, L E; Bensenor, I; Schmidt, M I; Vidigal, P G; Castilhos, C D; Duncan, B B

2016-08-01

Multi-center epidemiological studies must ascertain that their measurements are accurate and reliable. For laboratory measurements, reliability can be assessed through investigation of reproducibility of measurements in the same individual. In this paper, we present results from the quality control analysis of the baseline laboratory measurements from the ELSA-Brasil study. The study enrolled 15,105 civil servants at 6 research centers in 3 regions of Brazil between 2008-2010, with multiple biochemical analytes being measured at a central laboratory. Quality control was ascertained through standard laboratory evaluation of intra- and inter-assay variability and test-retest analysis in a subset of randomly chosen participants. An additional sample of urine or blood was collected from these participants, and these samples were handled in the same manner as the original ones, locally and at the central laboratory. Reliability was assessed with the intraclass correlation coefficient (ICC), estimated through a random effects model. Coefficients of variation (CV) and Bland-Altman plots were additionally used to assess measurement variability. Laboratory intra and inter-assay CVs varied from 0.86% to 7.77%. From test-retest analyses, the ICCs were high for the majority of the analytes. Notably lower ICCs were observed for serum sodium (ICC=0.50; 95%CI=0.31-0.65) and serum potassium (ICC=0.73; 95%CI=0.60-0.83), due to the small biological range of these analytes. The CVs ranged from 1 to 14%. The Bland-Altman plots confirmed these results. The quality control analyses showed that the collection, processing and measurement protocols utilized in the ELSA-Brasil produced reliable biochemical measurements.

Intraclass correlation and design effect in BMI, physical activity and diet: a cross-sectional study of 56 countries.

PubMed

Masood, Mohd; Reidpath, Daniel D

2016-01-07

Measuring the intraclass correlation coefficient (ICC) and design effect (DE) may help to modify the public health interventions for body mass index (BMI), physical activity and diet according to geographic targeting of interventions in different countries. The purpose of this study was to quantify the level of clustering and DE in BMI, physical activity and diet in 56 low-income, middle-income and high-income countries. Cross-sectional study design. Multicountry national survey data. The World Health Survey (WHS), 2003, data were used to examine clustering in BMI, physical activity in metabolic equivalent of task (MET) and diet in fruits and vegetables intake (FVI) from low-income, middle-income and high-income countries. Multistage sampling in the WHS used geographical clusters as primary sampling units (PSU). These PSUs were used as a clustering or grouping variable in this analysis. Multilevel intercept only regression models were used to calculate the ICC and DE for each country. The median ICC (0.039) and median DE (1.82) for BMI were low; however, FVI had a higher median ICC (0.189) and median DE (4.16). For MET, the median ICC was 0.141 and median DE was 4.59. In some countries, however, the ICC and DE for BMI were large. For instance, South Africa had the highest ICC (0.39) and DE (11.9) for BMI, whereas Uruguay had the highest ICC (0.434) for MET and Ethiopia had the highest ICC (0.471) for FVI. This study shows that across a wide range of countries, there was low area level clustering for BMI, whereas MET and FVI showed high area level clustering. These results suggested that the country level clustering effect should be considered in developing preventive approaches for BMI, as well as improving physical activity and healthy diets for each country. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/
The minimal clinically important difference of the control of allergic rhinitis and asthma test (CARAT): cross-cultural validation and relation with pollen counts

PubMed Central

van der Leeuw, Sander; van der Molen, Thys; Dekhuijzen, PN Richard; Fonseca, Joao A; van Gemert, Frederik A; Gerth van Wijk, Roy; Kocks, Janwillem WH; Oosterom, Helma; Riemersma, Roland A; Tsiligianni, Ioanna G; de Weger, Letty A; Oude Elberink, Joanne NG; Flokstra-de Blok, Bertine MJ

2015-01-01

Background: The Control of Allergic Rhinitis and Asthma Test (CARAT) monitors control of asthma and allergic rhinitis. Aims: To determine the CARAT’s minimal clinically important difference (MCID) and to evaluate the psychometric properties of the Dutch CARAT. Methods: CARAT was applied in three measurements at 1-month intervals. Patients diagnosed with asthma and/or rhinitis were approached. MCID was evaluated using Global Rating of Change (GRC) and standard error of measurement (s.e.m.). Cronbach’s alpha was used to evaluate internal consistency. Spearman’s correlation coefficients were calculated between CARAT, the Asthma Control Questionnaire (ACQ5) and the Visual Analog Scale (VAS) on airway symptoms to determine construct and longitudinal validity. Test–retest reliability was evaluated with intra-class correlation coefficient (ICC). Changes in pollen counts were compared with delta CARAT and ACQ5 scores. Results: A total of 92 patients were included. The MCID of the CARAT was 3.50 based on GRC scores; the s.e.m. was 2.83. Cronbach’s alpha was 0.82. Correlation coefficients between CARAT and ACQ5 and VAS questions ranged from 0.64 to 0.76 (P<0.01). Longitudinally, correlation coefficients between delta CARAT scores and delta ACQ5 and VAS scores ranged from 0.41 to 0.67 (P<0.01). Test–retest reliability showed an ICC of 0.81 (P<0.01) and 0.80 (P<0.01). Correlations with pollen counts were higher for CARAT than for ACQ5. Conclusions: This is the first investigation of the MCID of the CARAT. The CARAT uses a whole-point scale, which suggests that the MCID is 4 points. The CARAT is a valid and reliable tool that is also applicable in the Dutch population. PMID:25569880
The reliability of dual-energy X-ray absorptiometry measurements of bone mineral density in the metatarsals.

PubMed

Fuller, Joel T; Archer, Jane; Buckley, Jonathan D; Tsiros, Margarita D; Thewlis, Dominic

2016-01-01

To investigate the reliability of a simple, efficient technique for measuring bone mineral density (BMD) in the metatarsals using dual-energy X-ray absorptiometry (DXA). BMD of the right foot of 32 trained male distance runners was measured using a DXA scanner with the foot in the plantar position. Separate regions of interest (ROI) were used to assess the BMD of each metatarsal shaft (1st-5th) for each participant. ROI analysis was repeated by the same investigator to determine within-scan intra-rater reliability and by a different investigator to determine within-scan inter-rater reliability. Repeat DXA scans were undertaken for ten participants to assess between-scan intra-rater reliability. Assessment of BMD was consistently most reliable for the first metatarsal across all domains of reliability assessed (intra-class correlation coefficient [ICC] ≥0.97; coefficient of variation [CV] ≤1.5%; limits of agreement [LOA] ≤4.2%). Reasonable levels of intra-rater reliability were also achieved for the second and fifth metatarsals (ICC ≥0.90; CV ≤4.2%; LOA ≤11.9%). Poorer levels of reliability were demonstrated for the third (ICC ≥0.64; CV ≤8.2%; LOA ≤23.6%) and fourth metatarsals (ICC ≥0.67; CV ≤9.6%; LOA ≤27.5%). BMD was greatest in the first and second metatarsals (P < 0.01). Reliable measurements of BMD were achieved for the first, second and fifth metatarsals.
Electronic working length determination in primary teeth by ProPex and Digital Signal Processing.

PubMed

Nelson-Filho, Paulo; Lucisano, Marcela Pacífico; Leonardo, Mário Roberto; da Silva, Raquel Assed Bezerra; da Silva, Léa Assed Bezerra

2010-12-01

The purpose of this study was to evaluate the accuracy of electronic apex locators Digital Signal Processing (DSP) and ProPex, for root canal length determination in primary teeth. Fifteen primary molars (a total of 34 root canals) were divided into two groups: Group I - without physiological resorption (n = 16); and Group II - with physiological resorption (n = 18). The length of each canal was measured by introducing a file until its tip was visible and then it was retracted 1 mm. For electronic measurement, the devices were set to 1 mm short of the apical resorption. The data were analysed statistically using the intraclass correlation coefficient (ICC). Results showed that the ICC was high for both electronic apex locators in all situations - with (ICC: DSP = 0.82 and Propex = 0.89) or without resorption (ICC: DSP = 0.92 and Propex = 0.90). Both apex locators were extremely accurate in determining the working length in primary teeth, both with or without physiological resorption. © 2010 The Authors. Australian Endodontic Journal © 2010 Australian Society of Endodontology.
Reproducibility and repeatability of semi-quantitative 18F-fluorodihydrotestosterone (FDHT) uptake metrics in castration-resistant prostate cancer metastases: a prospective multi-center study.

PubMed

Vargas, Hebert Alberto; Kramer, Gem M; Scott, Andrew M; Weickhardt, Andrew; Meier, Andreas A; Parada, Nicole; Beattie, Bradley J; Humm, John L; Staton, Kevin D; Zanzonico, Pat B; Lyashchenko, Serge K; Lewis, Jason S; Yaqub, Maqsood; Sosa, Ramon E; van den Eertwegh, Alfons J; Davis, Ian D; Ackermann, Uwe; Pathmaraj, Kunthi; Schuit, Robert C; Windhorst, Albert D; Chua, Sue; Weber, Wolfgang A; Larson, Steven M; Scher, Howard I; Lammertsma, Adriaan A; Hoekstra, Otto; Morris, Michael J

2018-04-06

18 F-fluorodihydrotestosterone ( 18 F-FDHT) is a radiolabeled analogue of the androgen receptor's primary ligand that is currently being credentialed as a biomarker for prognosis, response, and pharmacodynamic effects of new therapeutics. As part of the biomarker qualification process, we prospectively assessed its reproducibility and repeatability in men with metastatic castration-resistant prostate cancer (mCRPC). Methods: We conducted a prospective multi-institutional study of mCRPC patients undergoing two (test/re-test) 18 F-FDHT PET/CT scans on two consecutive days. Two independent readers evaluated all examinations and recorded standardized uptake values (SUVs), androgen receptor-positive tumor volumes (ARTV), and total lesion uptake (TLU) for the most avid lesion detected in each of 32 pre-defined anatomical regions. The relative absolute difference and reproducibility coefficient (RC) of each metric were calculated between the test and re-test scans. Linear regression analyses, intra-class correlation coefficients (ICC), and Bland-Altman plots were used to evaluate repeatability of 18 F-FDHT metrics. The coefficient of variation (COV) and ICC were used to assess inter-observer reproducibility. Results: Twenty-seven patients with 140 18 F-FDHT-avid regions were included. The best repeatability among 18 F-FDHT uptake metrics was found for SUV metrics (SUV max , SUVmean, and SUVpeak), with no significant differences in repeatability found among them. Correlations between the test and re-test scans were strong for all SUV metrics (R2 ≥ 0.92; ICC ≥ 0.97). The RCs of the SUV metrics ranged from 21.3% for SUVpeak to 24.6% for SUV max The test and re-test ARTV and TLU, respectively, were highly correlated (R2 and ICC ≥ 0.97), although variability was significantly higher than that for SUV (RCs > 46.4%). The PSA levels, Gleason score, weight, and age did not affect repeatability, nor did total injected activity, uptake measurement time, or differences in uptake time between the two scans. Including the single most avid lesion per patient, the five most avid lesions per patient, only lesions ≥ 4.2 mL, only lesions with an SUV ≥ 4 g/mL, or normalizing of SUV to area under the parent plasma activity concentration-time curve did not significantly affect repeatability. All metrics showed high inter-observer reproducibility (ICC > 0.98; COV < 0.2-10.8%). Conclusion: 18 F-FDHT is a highly reproducible means of imaging mCRPC. Amongst 18 F-FDHT uptake metrics, SUV had the highest repeatability among the measures assessed. These performance characteristics lend themselves to further biomarker development and clinical qualification of the tracer. Copyright © 2018 by the Society of Nuclear Medicine and Molecular Imaging, Inc.
Validity and reliability of the patient assessment of constipation quality of life questionnaire for the Turkish population.

PubMed

Bengi, Göksel; Yalçın, Mustafa; Akpınar, Hale; Keskinoğlu, Pembe; Ellidokuz, Hülya

2015-07-01

There are few specific evaluation forms for evaluating the quality of life among patients with chronic constipation. Our study aimed to determine the validity and reliability of the translated Patient Assessment of Constipation Quality of Life (PAC-QOL) questionnaire for the Turkish population because evidence of its reliability and validity is required to justify its use in other studies and clinical practice. This study included 154 patients with constipation who were treated at the Department of Gastroenterology, Dokuz Eylül University Hospital between January and June 2012. The translated PAC-QOL questionnaire was completed by patients at the clinic and also at a 2-week follow-up to test its reliability. Cronbach's alpha coefficient (internal consistency) was 0.91 (good) for the translated PAC-QOL questionnaire. Time validity was evaluated using the intraclass correlation coefficient (ICC) method, and the ICC value for all questions was confirmed as 0.68 at the 2-week follow-up. The validity of the tool in the study group was evaluated using factor analysis, and the results were highly significant (Kaiser-Meyer-Olkin value: 0.857; Bartlett's test: p=0.001). Questions were categorized according to six factors based on the factor analysis, and these factors explained 65.1% of the total variation. For hypothesis verification of the tool, the correlation coefficient for PAC-QOL and PAC Symptoms (PAC-SYM) was r=0.577 (p<0.001), whereas the correlation coefficient for PAC-QOL and constipation severity score was r=0.457 (p<0.001). The PAC-QOL questionnaire was reliable, although not valid because of the limited sample group.
The Reliability of Pharyngeal High Resolution Manometry with Impedance for Derivation of Measures of Swallowing Function in Healthy Volunteers

PubMed Central

Omari, Taher I.; Savilampi, Johanna; Kokkinn, Karmen; Schar, Mistyka; Lamvik, Kristin; Doeltgen, Sebastian; Cock, Charles

2016-01-01

Purpose. We evaluated the intra- and interrater agreement and test-retest reliability of analyst derivation of swallow function variables based on repeated high resolution manometry with impedance measurements. Methods. Five subjects swallowed 10 × 10 mL saline on two occasions one week apart producing a database of 100 swallows. Swallows were repeat-analysed by six observers using software. Swallow variables were indicative of contractility, intrabolus pressure, and flow timing. Results. The average intraclass correlation coefficients (ICC) for intra- and interrater comparisons of all variable means showed substantial to excellent agreement (intrarater ICC 0.85–1.00; mean interrater ICC 0.77–1.00). Test-retest results were less reliable. ICC for test-retest comparisons ranged from slight to excellent depending on the class of variable. Contractility variables differed most in terms of test-retest reliability. Amongst contractility variables, UES basal pressure showed excellent test-retest agreement (mean ICC 0.94), measures of UES postrelaxation contractile pressure showed moderate to substantial test-retest agreement (mean Interrater ICC 0.47–0.67), and test-retest agreement of pharyngeal contractile pressure ranged from slight to substantial (mean Interrater ICC 0.15–0.61). Conclusions. Test-retest reliability of HRIM measures depends on the class of variable. Measures of bolus distension pressure and flow timing appear to be more test-retest reliable than measures of contractility. PMID:27190520
Improving Teacher Selection: The Effect of Inter-Rater Reliability in the Screening Process. CEDR Working Paper. WP #2015-7

ERIC Educational Resources Information Center

Martinkova, Patricia; Goldhaber, Dan

2015-01-01

Inter-rater reliability, commonly assessed by intra-class correlation coefficient ICC, is an important index for describing the extent to which there is consistency amongst two or more raters in assigned measures. In organizational research, the data structure is often hierarchical and designs deviate substantially from the ideal of a balanced…
Reliability of the dynavision™ d2 for assessing reaction time performance.

PubMed

Wells, Adam J; Hoffman, Jay R; Beyer, Kyle S; Jajtner, Adam R; Gonzalez, Adam M; Townsend, Jeremy R; Mangine, Gerald T; Robinson, Edward H; McCormack, William P; Fragala, Maren S; Stout, Jeffrey R

2014-01-01

Recently, the Dynavision™ D2 Visuomotor Training Device (D2) has emerged as a tool in the assessment of reaction time (RT); however, information regarding the reliability of the D2 have been limited, and to date, reliability data have been limited to non- generalizable samples. Therefore, the purpose of this study was to establish intraclass correlation coefficients (ICC2,1) for the D2 that are generalizable across a population of recreationally active young adults. Forty-two recreationally active men and women (age: 23.41 ± 4.84 years; height: 1.72 ± 0.11 m; mass: 76.62 ± 18.26 Kg) completed 6 trials for three RT tasks of increasing complexity. Each trial was separated by at least 48-hours. A repeated measures ANOVA was used to detect differences in performance across the six trials. Intraclass correlation coefficients (ICC2,1) standard error of measurement (SEM), and minimal differences (MD) were used to determine the reliability of the D2 from the two sessions with the least significant difference score. Moderate to strong reliability was demonstrated for visual RT (ICC2,1: 0.84, SEM: 0.033), and reactive ability in both Mode A and Mode B tasks (Mode A hits: ICC2,1: 0.75, SEM: 5.44; Mode B hits: ICC2,1: 0.73, SEM: 8.57). Motor RT (ICC2,1: 0.63, SEM: 0.035s) showed fair reliability, while average RT per hit for Modes A and B showed moderate reliability (ICC2,1: 0.68, SEM: 0.43 s and ICC2,1: 0.72, SEM: 0.03 s respectively). It appears that one familiarization trial is necessary for the choice reaction time (CRT) task while three familiarization trials are necessary for reactive RT tasks. In conclusion, results indicate that the Dynavision™ D2 is a reliable device to assess neuromuscular reactivity given that an adequate practice is provided. The data presented are generalizable to a population of recreationally active young adults. Key PointsThe Dynavision™ D2 is a light-training reaction device, developed to train sensory motor integration through the visual system, offering the ability to assess visual and motor reaction to both central and peripheral stimuli, with a capacity to integrate increasing levels of cognitive challenge.The Dynavision™ D2 is a reliable instrument for assessing reaction time in recreationally active young adults.It is recommended that one familiarization trial is necessary for the choice reaction time task assessment to learn the test protocol, while three familiarization trials are needed for reactive ability in Mode A and Mode B before a subsequent reliable baseline score can be established.Significant training effects were observed for all reaction time tests and should be taken into account with continuous trials.
A comparison of the simplified olecranon and digital methods of assessment of skeletal maturity during the pubertal growth spurt.

PubMed

Canavese, F; Charles, Y P; Dimeglio, A; Schuller, S; Rousset, M; Samba, A; Pereira, B; Steib, J-P

2014-11-01

Assessment of skeletal age is important in children's orthopaedics. We compared two simplified methods used in the assessment of skeletal age. Both methods have been described previously with one based on the appearance of the epiphysis at the olecranon and the other on the digital epiphyses. We also investigated the influence of assessor experience on applying these two methods. Our investigation was based on the anteroposterior left hand and lateral elbow radiographs of 44 boys (mean: 14.4; 12.4 to 16.1 ) and 78 girls (mean: 13.0; 11.1 to14.9) obtained during the pubertal growth spurt. A total of nine observers examined the radiographs with the observers assigned to three groups based on their experience (experienced, intermediate and novice). These raters were required to determined skeletal ages twice at six-week intervals. The correlation between the two methods was determined per assessment and per observer groups. Interclass correlation coefficients (ICC) evaluated the reproducibility of the two methods. The overall correlation between the two methods was r = 0.83 for boys and r = 0.84 for girls. The correlation was equal between first and second assessment, and between the observer groups (r ≥ 0.82). There was an equally strong ICC for the assessment effect (ICC ≤ 0.4%) and observer effect (ICC ≤ 3%) for each method. There was no significant (p < 0.05) difference between the levels of experience. The two methods are equally reliable in assessing skeletal maturity. The olecranon method offers detailed information during the pubertal growth spurt, while the digital method is as accurate but less detailed, making it more useful after the pubertal growth spurt once the olecranon has ossified. ©2014 The British Editorial Society of Bone & Joint Surgery.
ASSOCIATIONS BETWEEN THREE CLINICAL ASSESSMENT TOOLS FOR POSTURAL STABILITY

PubMed Central

Saxion, Casie E.; Cameron, Kenneth L.; Gerber, J. Parry

2010-01-01

Study Design: Clinical Measurement, Correlation, Reliability Objectives: To assess the relationship between the Single Leg Balance (SLB), modified Balance Error Scoring System (mBESS), and modified Star Excursion Balance (mSEBT) tests and secondarily to assess inter-rater and test-retest reliability of these tests. Background: Ankle sprains often result in chronic instability and dysfunction. Several clinical tests assess postural deficits as a potential cause of this dysfunction; however, limited information exists pertaining to the relationship that these tests have with one another. Methods: Two independent examiners measured the performance of 34 healthy participants completing the SLB Test, mBESS test, and mSEBT at two different time periods. The relationship between tests was assessed using the Pearson Correlation and Fisher's Exact Tests. Inter-rater and test-retest reliability were assessed using the intraclass correlation coefficient (ICC) and Kappa statistics. Results: A significant correlation (r = -0.35) was observed between the mSEBT and the mBESS. Fisher's Exact Test showed a significant association between the SLB Test and mBESS (P = .048), but no association between the SLB and mSEBT (P = 1.000). Inter-rater reliability was excellent for the mSEBT and fair for the mBESS (ICCs of .91 and .61 respectively). Excellent agreement was observed between raters for the SLB test (k = 1.00). Test-retest reliability was excellent for the mSEBT (ICC = 0.98) and fair for the mBESS (ICC = 0.74). There was poor test-retest agreement for the SLB test (k = .211). Conclusion: There was a significant relationship observed between the SLB Test, mBESS test, and mSEBT: however; strength of association measures showed limited overlap between these tests. This suggests that these tests are interrelated but may not assess equal components of postural stability. PMID:21589668
Volumetric computed tomography analysis of the olfactory cleft in patients with chronic rhinosinusitis.

PubMed

Soler, Zachary M; Pallanch, John F; Sansoni, Eugene Ritter; Jones, Cameron S; Lawrence, Lauren A; Schlosser, Rodney J; Mace, Jess C; Smith, Timothy L

2015-09-01

Commonly used computed tomography (CT) staging systems for chronic rhinosinusitis (CRS) focus on the sinuses and do not quantify disease in the olfactory cleft. The goal of the current study was to determine whether precise measurements of olfactory cleft opacification better correlate with olfaction in patients with CRS. Olfaction was assessed using the 40-item Smell Identification Test (SIT-40) before and after sinus surgery in adult patients. Olfactory cleft opacification was quantified precisely using three-dimensional (3D), computerized volumetric analysis, as well as via semiquantitative Likert scale estimations at predetermined anatomic sites. Sinus opacification was also quantified using the Lund-Mackay staging system. The overall cohort (n = 199) included 89 (44.7%) patients with CRS with nasal polyposis (CRSwNP) and 110 (55.3%) with CRS without nasal polyposis (CRSsNP). The olfactory cleft opacified volume correlated with objective olfaction as determined by the SIT-40 (Spearman's rank correlation coefficient [Rs ] = -0.461; p < 0.001). The correlation was significantly stronger in the CRSwNP subgroup (Rs = -0.573; p < 0.001), whereas no appreciable correlation was found in the CRSsNP group (Rs = -0.141; p = 0.141). Correlations between sinus-specific Lund-Mackay CT scoring and SIT-40 scores were weaker in the CRSwNP (Rs = -0.377; p < 0.001) subgroup but stronger in the CRSsNP (Rs = -0.225; p = 0.018) group when compared to olfactory cleft correlations. Greater intraclass correlations (ICCs) were found between quantitative volumetric measures of olfactory cleft opacification (ICC = 0.844; p < 0.001) as compared with semiquantitative Likert grading (ICC = 0.627; p < 0.001). Quantitative measures of olfactory cleft opacification correlate with objective olfaction, with the strongest correlations seen in patients with nasal polyps. © 2015 ARS-AAOA, LLC.
Comparing audio and video data for rating communication.

PubMed

Williams, Kristine; Herman, Ruth; Bontempo, Daniel

2013-09-01

Video recording has become increasingly popular in nursing research, adding rich nonverbal, contextual, and behavioral information. However, benefits of video over audio data have not been well established. We compared communication ratings of audio versus video data using the Emotional Tone Rating Scale. Twenty raters watched video clips of nursing care and rated staff communication on 12 descriptors that reflect dimensions of person-centered and controlling communication. Another group rated audio-only versions of the same clips. Interrater consistency was high within each group with Interclass Correlation Coefficient (ICC) (2,1) for audio .91, and video = .94. Interrater consistency for both groups combined was also high with ICC (2,1) for audio and video = .95. Communication ratings using audio and video data were highly correlated. The value of video being superior to audio-recorded data should be evaluated in designing studies evaluating nursing care.
Test-Retest Reliability and Minimal Detectable Change of Randomized Dichotic Digits in Learning-Disabled Children: Implications for Dichotic Listening Training.

PubMed

Mahdavi, Mohammad Ebrahim; Pourbakht, Akram; Parand, Akram; Jalaie, Shohreh

2018-03-01

Evaluation of dichotic listening to digits is a common part of many studies for diagnosis and managing auditory processing disorders in children. Previous researchers have verified test-retest relative reliability of dichotic digits results in normal children and adults. However, detecting intervention-related changes in the ear scores after dichotic listening training requires information regarding trial-to-trial typical variation of individual ear scores that is estimated using indices of absolute reliability. Previous studies have not addressed absolute reliability of dichotic listening results. To compare the results of the Persian randomized dichotic digits test (PRDDT) and its relative and absolute indices of reliability between typical achieving (TA) and learning-disabled (LD) children. A repeated measures observational study. Fifteen LD children were recruited from a previously performed study with age range of 7-12 yr. The control group consisted of 15 TA schoolchildren with age range of 8-11 yr. The Persian randomized dichotic digits test was administered on the children under free recall condition in two test sessions 7-12 days apart. We compared the average of the ear scores and ear advantage between TA and LD children. Relative indices of reliability included Pearson's correlation and intraclass correlation (ICC 2,1 ) coefficients and absolute reliability was evaluated by calculation of standard error of measurement (SEM) and minimal detectable change (MDC) using the raw ear scores. The Pearson correlation coefficient indicated that in both groups of children the ear scores of test and retest sessions were strongly and positively (greater than +0.8) correlated. The ear scores showed excellent ICC coefficient of consistency (0.78-0.82) and fair to excellent ICC coefficient of absolute agreement (0.62-0.74) in TA children and excellent ICC coefficients of consistency and absolute agreement in LD children (0.76-0.87). SEM and SEM% of the ear scores in TA children were 1.46 and 1.44% for the right ear and 4.68 and 5.47% for the left ear. SEM and SEM% of the ear scores in LD children were 4.55 and 5.88% for the right ear to 7.56 and 12.81% for the left ear. MDC and MDC% of the ear scores in TA children varied from 4.03 and 3.99% for the right ear to 12.93 and 15.13% for the left ear. MDC and MDC% of the ear scores in LD children varied from 12.57 and 16.25% for the right ear to 20.89 and 35.39% for the left ear. The LD children indicated test-retest relative reliability as high as TA children in the ear scores measured by PRDDT. However, within-subject variations of the ear scores calculated by indices of absolute reliability were considerably higher in LD children versus TA children. The results of the current study could have implications for detecting real training-related changes in the ear scores. American Academy of Audiology
Regional reliability of quantitative signal targeting with alternating radiofrequency (STAR) labeling of arterial regions (QUASAR).

PubMed

Tatewaki, Yasuko; Higano, Shuichi; Taki, Yasuyuki; Thyreau, Benjamin; Murata, Takaki; Mugikura, Shunji; Ito, Daisuke; Takase, Kei; Takahashi, Shoki

2014-01-01

Quantitative signal targeting with alternating radiofrequency labeling of arterial regions (QUASAR) is a recent spin labeling technique that could improve the reliability of brain perfusion measurements. Although it is considered reliable for measuring gray matter as a whole, it has never been evaluated regionally. Here we assessed this regional reliability. Using a 3-Tesla Philips Achieva whole-body system, we scanned four times 10 healthy volunteers, in two sessions 2 weeks apart, to obtain QUASAR images. We computed perfusion images and ran a voxel-based analysis within all brain structures. We also calculated mean regional cerebral blood flow (rCBF) within regions of interest configured for each arterial territory distribution. The mean CBF over whole gray matter was 37.74 with intraclass correlation coefficient (ICC) of .70. In white matter, it was 13.94 with an ICC of .30. Voxel-wise ICC and coefficient-of-variation maps showed relatively lower reliability in watershed areas and white matter especially in deeper white matter. The absolute mean rCBF values were consistent with the ones reported from PET, as was the relatively low variability in different feeding arteries. Thus, QUASAR reliability for regional perfusion is high within gray matter, but uncertain within white matter. © 2014 The Authors. Journal of Neuroimaging published by the American Society of Neuroimaging.
Regional Reliability of Quantitative Signal Targeting with Alternating Radiofrequency (STAR) Labeling of Arterial Regions (QUASAR)

PubMed Central

Tatewaki, Yasuko; Higano, Shuichi; Taki, Yasuyuki; Thyreau, Benjamin; Murata, Takaki; Mugikura, Shunji; Ito, Daisuke; Takase, Kei; Takahashi, Shoki

2014-01-01

BACKGROUND AND PURPOSE Quantitative signal targeting with alternating radiofrequency labeling of arterial regions (QUASAR) is a recent spin labeling technique that could improve the reliability of brain perfusion measurements. Although it is considered reliable for measuring gray matter as a whole, it has never been evaluated regionally. Here we assessed this regional reliability. METHODS Using a 3-Tesla Philips Achieva whole-body system, we scanned four times 10 healthy volunteers, in two sessions 2 weeks apart, to obtain QUASAR images. We computed perfusion images and ran a voxel-based analysis within all brain structures. We also calculated mean regional cerebral blood flow (rCBF) within regions of interest configured for each arterial territory distribution. RESULTS The mean CBF over whole gray matter was 37.74 with intraclass correlation coefficient (ICC) of .70. In white matter, it was 13.94 with an ICC of .30. Voxel-wise ICC and coefficient-of-variation maps showed relatively lower reliability in watershed areas and white matter especially in deeper white matter. The absolute mean rCBF values were consistent with the ones reported from PET, as was the relatively low variability in different feeding arteries. CONCLUSIONS Thus, QUASAR reliability for regional perfusion is high within gray matter, but uncertain within white matter. PMID:25370338
Reliability of Heart Rate Variability in Children: Influence of Sex and Body Position During Data Collection.

PubMed

Silva, Carla Cristiane; Bertollo, Maurizio; Reichert, Felipe Fossati; Boullosa, Daniel Alexandre; Nakamura, Fábio Yuzo

2017-05-01

To examine which body position and indices present better reliability of heart rate variability (HRV) measures in children and to compare the HRV analyzed in different body positions between sexes. Twenty eutrophic prepubertal children of each sex participated in the study. The RR intervals were recorded using a portable heart rate monitor twice a day for 7 min in the supine, sitting, and standing positions. The reproducibility was analyzed using the intraclass correlation coefficient (ICC; two way mixed) and within-subject coefficient of variation (CV).Two-way ANOVA with repeated measures was used to compare the sexes. High levels of reproducibility were indicated by higher ICC in the root-mean-square difference of successive normal RR intervals (RMSSD: 0.93 and 0.94) and Poincaré plot of the short-term RR interval variability (SD1: 0.92 and 0.94) parameters for boys and girls, respectively, in the supine position. The ICCs were lower in the sitting and standing positions for all HRV indices. In addition, the girls presented significantly higher values than the boys for SDNN and absolute high frequency (HF; p < .05) in the supine position. The supine position is the most reproducible for the HRV indices in both sexes, especially the vagal related indices.
Intraindividual stability of cortisol and cortisone and the ratio of cortisol to cortisone in saliva, urine and hair.

PubMed

Zhang, Quan; Chen, Zheng; Chen, Shenghuo; Xu, Youyun; Deng, Huihua

2017-02-01

Cortisol, cortisone and the ratio of cortisol to cortisone in saliva, urine and hair are acute, short-term and long-term biomarkers to reliably assess the activity of hypothalamic-pituitary-adrenal (HPA) axis and 11β-hydroxysteroid dehydrogenase (11β-HSD). One key issue is whether these biomarkers have intraindividual relative stability. Salivary, urinary and hair cortisol was proven to show considerable long-term intraindividual relative stability. However, currently unknown is whether cortisone and the ratio in saliva, urine and hair show intraindividual relative stability. The present study utilized a longitudinal design to validate long-term stability within two weeks of three biomarkers in saliva and urine, and long-term stability within twelve months of three hair biomarkers. Salivary, urinary and hair steroids were measured with high performance liquid chromatography tandem mass spectrometry. Three biomarkers in urine and hair showed moderate test-retest correlations with coefficient (r) ranging between 0.22 and 0.56 and good multiple-test consistencies with coefficient of intraclass correlation (ICC) ranging between 0.42 and 0.67. Three single-point salivary biomarkers showed weak to moderate test-retest correlations (r's between 0.01 and 0.38) and poor to fair multiple-test consistencies (ICC's between 0.29 and 0.53) within two weeks. Three single-day salivary biomarkers showed moderate test-retest correlations (r's between 0.23 and 0.53) and good multiple-test consistencies (ICC's between 0.56 and 0.66) within two weeks. Three biomarkers in urine and hair showed moderate long-term intraindividual relative stability. Three single-point salivary biomarkers showed weak to moderate short-term and long-term intraindividual relative stability, but three single-day salivary biomarkers showed moderate short-term and long-term intraindividual relative stability. Copyright © 2016 Elsevier Inc. All rights reserved.
Criterion validity of the Physical Activity Questionnaire for Schoolchildren (PAQ-S) in assessing physical activity levels: the Healthy Growth Study.

PubMed

Manios, Y; Androutsos, O; Moschonis, G; Birbilis, M; Maragkopoulou, K; Giannopoulou, A; Argyri, E; Kourlaba, G

2013-10-01

The aim of this paper was to evaluate the criterion validity of the Physical Activity Questionnaire for Schoolchildren (PAQ-S). The current study is a subcohort of the Healthy Growth Study, a large-scale cross-sectional study. 202 schoolchildren aged 9-13 years from Greece completed the PAQ-S and wore an accelerometer for 4 consecutive days. Time spent moderate (MPA), moderate to vigorous (MVPA) and vigorous (VPA) physical activity was calculated based on PAQ-S and accelerometer data. The average time spent on MPA and MVPA as derived from PAQ-S and from accelerometers were significantly moderately correlated (r=0.462, P<0.001 and r=0.483, P<0.001, respectively). No significant correlation was detected between PAQ-S and accelerometer-measured time spent performing VPA (rho=0.150, P=0.057). Intraclass Correlation Coefficient (ICC) indicated a moderate agreement between PAQ-S and accelerometer in estimating MPA (ICC=0.592, P<0.001) and MVPA (ICC=0.581, P<0.001). Bland-Altman analysis revealed a small mean difference (the "bias"), between the two methods, in estimating MPA, although this difference was found to be significantly higher than zero ("bias"=27.4% of the accelerometer-measured mean score, P=0.006). On the other hand, Bland-Altman analysis revealed a large mean difference in estimating MVPA and VPA ("bias"=84.2% and 357% of the accelerometer-measured mean score for MVPA and VPA, respectively and P<0.001). The high correlation coefficient between the average and difference values between all physical activity scores derived from accelerometers and PAQ-S, indicate a systematic overestimation of physical activity time with increasing physical activity for PAQ-S. The validity of PAQ-S for the estimation of MPA and MVPA was found to be slightly similar self-reported measures for schoolchildren. Therefore, this questionnaire could be used as a tool for physical activity assessment in large population studies.
Within- and between-session reliability of the maximal voluntary knee extension torque and activation.

PubMed

Park, Jihong; Hopkins, J Ty

2013-01-01

A ratio between the torque generated by maximal voluntary isometric contraction (MVIC) and exogenous electrical stimulus, central activation ratio (CAR), has been widely used to assess quadriceps function. To date, no data exist regarding between-session reliability of this measurement. Thirteen neurologically sound volunteers underwent three testing sessions (three trials per session) with 48 hours between-session. Subjects performed MVICs of the quadriceps with the knee locked at 90° flexion and the hip at 85°. Once the MVIC reached a plateau, an electrical stimulation from superimposed burst technique (SIB: 125 V with peak output current 450 mA) was manually delivered and transmitted directly to the quadriceps via stimulating electrodes. CAR was calculated by using the following equation: CAR = MVIC torque/MVIC + SIB torque. Intraclass correlation coefficients (ICC) were calculated within- (ICC((2,1))) and between-session (ICC((2,k))) for MVIC torques and CAR values. Our data show that quadriceps MVIC and CAR are very reliable both within- (ICC((2,1)) = 0.99 for MVIC; 0.94 for CAR) and between-measurement sessions (ICC((2,k)) = 0.92 for MVIC; 0.86 for CAR) in healthy young adults. For clinical research, more data of the patients with pathological conditions are required to ensure reproducibility of calculation of CAR.

Validation of a buffet meal design in an experimental restaurant.

PubMed

Allirot, Xavier; Saulais, Laure; Disse, Emmanuel; Roth, Hubert; Cazal, Camille; Laville, Martine

2012-06-01

We assessed the reproducibility of intakes and meal mechanics parameters (cumulative energy intake (CEI), number of bites, bite rate, mean energy content per bite) during a buffet meal designed in a natural setting, and their sensitivity to food deprivation. Fourteen men were invited to three lunch sessions in an experimental restaurant. Subjects ate their regular breakfast before sessions A and B. They skipped breakfast before session FAST. The same ad libitum buffet was offered each time. Energy intakes and meal mechanics were assessed by foods weighing and video recording. Intrasubject reproducibility was evaluated by determining intraclass correlation coefficients (ICC). Mixed-models were used to assess the effects of the sessions on CEI. We found a good reproducibility between A and B for total energy (ICC=0.82), carbohydrate (ICC=0.83), lipid (ICC=0.81) and protein intake (ICC=0.79) and for meal mechanics parameters. Total energy, lipid and carbohydrate intake were higher in FAST than in A and B. CEI were found sensitive to differences in hunger level while the other meal mechanics parameters were stable between sessions. In conclusion, a buffet meal in a normal eating environment is a valid tool for assessing the effects of interventions on intakes. Copyright © 2012 Elsevier Ltd. All rights reserved.
Development, reproducibility and validity of the food frequency questionnaire in the Poland arm of the Prospective Urban and Rural Epidemiological (PURE) study.

PubMed

Dehghan, M; Ilow, R; Zatonska, K; Szuba, A; Zhang, X; Mente, A; Regulska-Ilow, B

2012-06-01

A food frequency questionnaire (FFQ) is the most commonly used method in large epidemiological studies. The validation of an FFQ is essential for specific populations because foods are culture-dependent. The present study aimed to develop an FFQ and evaluate its validity and reproducibility in estimating the intake of nutrients in urban and rural areas of Poland. Adult participants (n = 146) in the Polish arm of the ongoing Prospective Urban and Rural Epidemiological (PURE) study completed FFQs on two occasions, as well as four 24-h dietary recalls (DRs) during a 12-month period. Correlation coefficients (r) and de-attenuated correlation coefficients between dietary recalls and both FFQs were calculated for selected macro- and micronutrients. Agreement between the two methods was evaluated by classification into quartiles and the Bland-Altman method. Reproducibility was assessed by the intra-class correlation coefficient (ICC). The final food list contained 134 food items. For urban participants, FFQ2 generally underestimated energy, protein and fat compared to the FFQ1 and mean of DRs. In rural areas, compared to DRs, both FFQs overestimated energy and macronutrients. For both urban and rural settings, de-attenuated correlation exceeded 0.4 for almost all nutrients and the exact agreement in quartile categorisation was >66%. When assessing repeatability, ICC varied from 0.39-0.63 in an urban setting and 0.19-0.45 in a rural setting. This 134-item FFQ has good validity and reproducibility in relation to the reference method and can be used to rank individuals based on their macro- and micronutrient intake. © 2012 The Authors. Journal of Human Nutrition and Dietetics © 2012 The British Dietetic Association Ltd.
Validity and Reliability of Persian Version of HIV/AIDS Related Stigma Scale for People Living With HIV/AIDS in Iran.

PubMed

Pourmarzi, Davoud; Khoramirad, Ashraf; Ahmari Tehran, Hoda; Abedini, Zahra

2015-11-01

To assess the perceived HIV/AIDS related stigma a comprehensive and well developed stigma instrument is necessary. This study aimed to assess validity and reliability of the Persian version of HIV/AIDS related stigma scale which was developed by Kang et al for people living with HIV/AIDS in Iran. Thescale was forward translatedby two bilingual academic members then both translations were discussed by expert team. Back-translation was done by two other bilingual translators then we carried out discussion with both of them. To evaluate understandability the scale was administered to 10 Persons Living with HIV/AIDS (PLWHA). Final Persian version was administered to 80 PLWHA in Qom, Iran in 2014. Test-retest reliability was assessed in a sample of 20 PLWHA after a week by intra-class correlation coefficient (ICC). Cronbach's alpha coefficient for overall scale was 0.85. Also Cronbach's alpha coefficients for the five subscales were as follows: social rejection (9 items, α = 0.84), negative self-worth (4 items, α = 0.70), perceived interpersonal insecurity (2 items, α = 0.57), financial insecurity (3 items, α = 0.70), discretionary disclosure (2 items, α = 0.83). Test-retest reliability was also approved with ICC = 0.78. Correlation between items and their hypothesized subscale is greater than 0.5. Correlation between an item and its own subscale was significantly higher than its correlation with other subscales. This study demonstrate that the Persian version of HIV/AIDS related stigma scale is valid and reliable to assess HIV/AIDS related stigma perceived by people living whit HIV/AIDS in Iran.
Validity and Reliability of Persian Version of HIV/AIDS Related Stigma Scale for People Living With HIV/AIDS in Iran

PubMed Central

Pourmarzi, Davoud; Khoramirad, Ashraf; Ahmari Tehran, Hoda; Abedini, Zahra

2015-01-01

Objective: To assess the perceived HIV/AIDS related stigma a comprehensive and well developed stigma instrument is necessary. This study aimed to assess validity and reliability of the Persian version of HIV/AIDS related stigma scale which was developed by Kang et al for people living with HIV/AIDS in Iran. Materials and methods: Thescale was forward translatedby two bilingual academic members then both translations were discussed by expert team. Back-translation was done by two other bilingual translators then we carried out discussion with both of them. To evaluate understandability the scale was administered to 10 Persons Living with HIV/AIDS (PLWHA). Final Persian version was administered to 80 PLWHA in Qom, Iran in 2014. Test–retest reliability was assessed in a sample of 20 PLWHA after a week by intra-class correlation coefficient (ICC). Results: Cronbach’s alpha coefficient for overall scale was 0.85. Also Cronbach’s alpha coefficients for the five subscales were as follows: social rejection (9 items, α = 0.84), negative self-worth (4 items, α = 0.70), perceived interpersonal insecurity (2 items, α = 0.57), financial insecurity (3 items, α = 0.70), discretionary disclosure (2 items, α = 0.83). Test–retest reliability was also approved with ICC = 0.78. Correlation between items and their hypothesized subscale is greater than 0.5. Correlation between an item and its own subscale was significantly higher than its correlation with other subscales. Conclusion: This study demonstrate that the Persian version of HIV/AIDS related stigma scale is valid and reliable to assess HIV/AIDS related stigma perceived by people living whit HIV/AIDS in Iran. PMID:27047562
Validity and reproducibility of the Physical Activity Scale for the Elderly (PASE) questionnaire for the measurement of the physical activity level in patients after total knee arthroplasty.

PubMed

Bolszak, Sylvain; Casartelli, Nicola C; Impellizzeri, Franco M; Maffiuletti, Nicola A

2014-02-20

The need for valid and reproducible questionnaires to routinely assess the physical activity level of patients after total knee arthroplasty (TKA) is of particular concern in clinical settings. Aims of this study were to evaluate the validity and reproducibility of the physical activity scale for the elderly (PASE) questionnaire in TKA patients, with a particular view on gender differences. A total of 50 elderly patients (25 women and 25 men aged 70 ± 6 years) following primary unilateral TKA were recruited. The reproducibility was evaluated by administering the PASE questionnaire during two occasions separated by 7 days. The construct (criterion) validity was investigated by comparing the physical activity level reported by patients in the PASE questionnaire to that measured by accelerometry. Reproducibility was evaluated using intraclass correlation coefficients (ICC3,1) for reliability and standard error of measurement (SEM) and smallest detectable change (SDC) for agreement, while validity was investigated with Pearson correlation coefficients. Reliability of the PASE total score was acceptable for men (ICC = 0.77) but not for women (ICC = 0.58). Its agreement was low for both men and women, as witnessed by high SEM (32% and 35%, respectively) and SDC (89% and 97%, respectively). Construct validity of the PASE total score was low in both men (r = 0.45) and women (r = 0.06). The PASE questionnaire has several validity and reproducibility shortcomings, therefore its use is not recommended for the assessment of physical activity level in patients after TKA, particularly in women.
The reliability, minimal detectable change and concurrent validity of a gravity-based bubble inclinometer and iphone application for measuring standing lumbar lordosis.

PubMed

Salamh, Paul A; Kolber, Morey

2014-01-01

To investigate the reliability, minimal detectable change (MDC90) and concurrent validity of a gravity-based bubble inclinometer (inclinometer) and iPhone® application for measuring standing lumbar lordosis. Two investigators used both an inclinometer and an iPhone® with an inclinometer application to measure lumbar lordosis of 30 asymptomatic participants. ICC models 3,k and 2,k were used for the intrarater and interrater analysis, respectively. Good interrater and intrarater reliability was present for the inclinometer with Intraclass Correlation Coefficients (ICC) of 0.90 and 0.85, respectively and the iPhone® application with ICC values of 0.96 and 0.81. The minimal detectable change (MDC90) indicates that a change greater than or equal to 7° and 6° is needed to exceed the threshold of error using the iPhone® and inclinometer, respectively. The concurrent validity between the two instruments was good with a Pearson product-moment coefficient of correlation (r) of 0.86 for both raters. Ninety-five percent limits of agreement identified differences ranging from 9° greater in regards to the iPhone® to 8° less regarding the inclinometer. Both the inclinometer and iPhone® application possess good interrater reliability, intrarater reliability and concurrent validity for measuring standing lumbar lordosis. This investigation provides preliminary evidence to suggest that smart phone applications may offer clinical utility comparable to inclinometry for quantifying standing lumbar lordosis. Clinicians should recognize potential individual differences when using these devices interchangeably.
Validity and Reliability of a Wearable Inertial Sensor to Measure Velocity and Power in the Back Squat and Bench Press.

PubMed

Orange, Samuel T; Metcalfe, James W; Liefeith, Andreas; Marshall, Phil; Madden, Leigh A; Fewster, Connor R; Vince, Rebecca V

2018-05-08

Orange, ST, Metcalfe, JW, Liefeith, A, Marshall, P, Madden, LA, Fewster, CR, and Vince, RV. Validity and reliability of a wearable inertial sensor to measure velocity and power in the back squat and bench press. J Strength Cond Res XX(X): 000-000, 2018-This study examined the validity and reliability of a wearable inertial sensor to measure velocity and power in the free-weight back squat and bench press. Twenty-nine youth rugby league players (18 ± 1 years) completed 2 test-retest sessions for the back squat followed by 2 test-retest sessions for the bench press. Repetitions were performed at 20, 40, 60, 80, and 90% of 1 repetition maximum (1RM) with mean velocity, peak velocity, mean power (MP), and peak power (PP) simultaneously measured using an inertial sensor (PUSH) and a linear position transducer (GymAware PowerTool). The PUSH demonstrated good validity (Pearson's product-moment correlation coefficient [r]) and reliability (intraclass correlation coefficient [ICC]) only for measurements of MP (r = 0.91; ICC = 0.83) and PP (r = 0.90; ICC = 0.80) at 20% of 1RM in the back squat. However, it may be more appropriate for athletes to jump off the ground with this load to optimize power output. Further research should therefore evaluate the usability of inertial sensors in the jump squat exercise. In the bench press, good validity and reliability were evident only for the measurement of MP at 40% of 1RM (r = 0.89; ICC = 0.83). The PUSH was unable to provide a valid and reliable estimate of any other criterion variable in either exercise. Practitioners must be cognizant of the measurement error when using inertial sensor technology to quantify velocity and power during resistance training, particularly with loads other than 20% of 1RM in the back squat and 40% of 1RM in the bench press.
Validity and reliability of Nintendo Wii Fit balance scores.

PubMed

Wikstrom, Erik A

2012-01-01

Interactive gaming systems have the potential to help rehabilitate patients with musculoskeletal conditions. The Nintendo Wii Balance Board, which is part of the Wii Fit game, could be an effective tool to monitor progress during rehabilitation because the board and game can provide objective measures of balance. However, the validity and reliability of Wii Fit balance scores remain unknown. To determine the concurrent validity of balance scores produced by the Wii Fit game and the intrasession and intersession reliability of Wii Fit balance scores. Descriptive laboratory study. Sports medicine research laboratory. Forty-five recreationally active participants (age = 27.0 ± 9.8 years, height = 170.9 ± 9.2 cm, mass = 72.4 ± 11.8 kg) with a heterogeneous history of lower extremity injury. Participants completed a single-limb-stance task on a force plate and the Star Excursion Balance Test (SEBT) during the first test session. Twelve Wii Fit balance activities were completed during 2 test sessions separated by 1 week. Postural sway in the anteroposterior (AP) and mediolateral (ML) directions and the AP, ML, and resultant center-of-pressure (COP) excursions were calculated from the single-limb stance. The normalized reach distance was recorded for the anterior, posteromedial, and posterolateral directions of the SEBT. Wii Fit balance scores that the game software generated also were recorded. All 96 of the calculated correlation coefficients among Wii Fit activity outcomes and established balance outcomes were interpreted as poor (r < 0.50). Intrasession reliability for Wii Fit balance activity scores ranged from good (intraclass correlation coefficient [ICC] = 0.80) to poor (ICC = 0.39), with 8 activities having poor intrasession reliability. Similarly, 11 of the 12 Wii Fit balance activity scores demonstrated poor intersession reliability, with scores ranging from fair (ICC = 0.74) to poor (ICC = 0.29). Wii Fit balance activity scores had poor concurrent validity relative to COP outcomes and SEBT reach distances. In addition, the included Wii Fit balance activity scores generally had poor intrasession and intersession reliability.
An International Ki67 Reproducibility Study

PubMed Central

2013-01-01

Background In breast cancer, immunohistochemical assessment of proliferation using the marker Ki67 has potential use in both research and clinical management. However, lack of consistency across laboratories has limited Ki67’s value. A working group was assembled to devise a strategy to harmonize Ki67 analysis and increase scoring concordance. Toward that goal, we conducted a Ki67 reproducibility study. Methods Eight laboratories received 100 breast cancer cases arranged into 1-mm core tissue microarrays—one set stained by the participating laboratory and one set stained by the central laboratory, both using antibody MIB-1. Each laboratory scored Ki67 as percentage of positively stained invasive tumor cells using its own method. Six laboratories repeated scoring of 50 locally stained cases on 3 different days. Sources of variation were analyzed using random effects models with log2-transformed measurements. Reproducibility was quantified by intraclass correlation coefficient (ICC), and the approximate two-sided 95% confidence intervals (CIs) for the true intraclass correlation coefficients in these experiments were provided. Results Intralaboratory reproducibility was high (ICC = 0.94; 95% CI = 0.93 to 0.97). Interlaboratory reproducibility was only moderate (central staining: ICC = 0.71, 95% CI = 0.47 to 0.78; local staining: ICC = 0.59, 95% CI = 0.37 to 0.68). Geometric mean of Ki67 values for each laboratory across the 100 cases ranged 7.1% to 23.9% with central staining and 6.1% to 30.1% with local staining. Factors contributing to interlaboratory discordance included tumor region selection, counting method, and subjective assessment of staining positivity. Formal counting methods gave more consistent results than visual estimation. Conclusions Substantial variability in Ki67 scoring was observed among some of the world’s most experienced laboratories. Ki67 values and cutoffs for clinical decision-making cannot be transferred between laboratories without standardizing scoring methodology because analytical validity is limited. PMID:24203987
Effect of knee angle on neuromuscular assessment of plantar flexor muscles: A reliability study

PubMed Central

Cornu, Christophe; Jubeau, Marc

2018-01-01

Introduction This study aimed to determine the intra- and inter-session reliability of neuromuscular assessment of plantar flexor (PF) muscles at three knee angles. Methods Twelve young adults were tested for three knee angles (90°, 30° and 0°) and at three time points separated by 1 hour (intra-session) and 7 days (inter-session). Electrical (H reflex, M wave) and mechanical (evoked and maximal voluntary torque, activation level) parameters were measured on the PF muscles. Intraclass correlation coefficients (ICC) and coefficients of variation were calculated to determine intra- and inter-session reliability. Results The mechanical measurements presented excellent (ICC>0.75) intra- and inter-session reliabilities regardless of the knee angle considered. The reliability of electrical measurements was better for the 90° knee angle compared to the 0° and 30° angles. Conclusions Changes in the knee angle may influence the reliability of neuromuscular assessments, which indicates the importance of considering the knee angle to collect consistent outcomes on the PF muscles. PMID:29596480
Reliability and Reproducibility of Advanced ECG Parameters in Month-to-Month and Year-to-Year Recordings in Healthy Subjects

NASA Technical Reports Server (NTRS)

Starc, Vito; Abughazaleh, Ahmed S.; Schlegel, Todd T.

2014-01-01

Advanced resting ECG parameters such the spatial mean QRS-T angle and the QT variability index (QTVI) have important diagnostic and prognostic utility, but their reliability and reproducibility (R&R) are not well characterized. We hypothesized that the spatial QRS-T angle would have relatively higher R&R than parameters such as QTVI that are more responsive to transient changes in the autonomic nervous system. The R&R of several conventional and advanced ECG para-meters were studied via intraclass correlation coefficients (ICCs) and coefficients of variation (CVs) in: (1) 15 supine healthy subjects from month-to-month; (2) 27 supine healthy subjects from year-to-year; and (3) 25 subjects after transition from the supine to the seated posture. As hypothesized, for the spatial mean QRS-T angle and many conventional ECG parameters, ICCs we-re higher, and CVs lower than QTVI, suggesting that the former parameters are more reliable and reproducible.
Assessment of the reliability and consistency of the "malnutrition inflammation score" (MIS) in Mexican adults with chronic kidney disease for diagnosis of protein-energy wasting syndrome (PEW).

PubMed

González-Ortiz, Ailema Janeth; Arce-Santander, Celene Viridiana; Vega-Vega, Olynka; Correa-Rotter, Ricardo; Espinosa-Cuevas, María de Los Angeles

2014-10-04

The protein-energy wasting syndrome (PEW) is a condition of malnutrition, inflammation, anorexia and wasting of body reserves resulting from inflammatory and non-inflammatory conditions in patients with chronic kidney disease (CKD).One way of assessing PEW, extensively described in the literature, is using the Malnutrition Inflammation Score (MIS). To assess the reliability and consistency of MIS for diagnosis of PEW in Mexican adults with CKD on hemodialysis (HD). Study of diagnostic tests. A sample of 45 adults with CKD on HD were analyzed during the period June-July 2014.The instrument was applied on 2 occasions; the test-retest reliability was calculated using the Intraclass Correlation Coefficient (ICC); the internal consistency of the questionnaire was analyzed using Cronbach's αcoefficient. A weighted Kappa test was used to estimate the validity of the instrument; the result was subsequently compared with the Bilbrey nutritional index (BNI). The reliability of the questionnaires, evaluated in the patient sample, was ICC=0.829.The agreement between MIS observations was considered adequate, k= 0.585 (p <0.001); when comparing it with BNI, a value of k = 0.114 was obtained (p <0.001).In order to estimate the tendency, a correlation test was performed. The r² correlation coefficient was 0.488 (P <0.001). MIS has adequate reliability and validity for diagnosing PEW in the population with chronic kidney disease on HD. Copyright AULA MEDICA EDICIONES 2014. Published by AULA MEDICA. All rights reserved.
Reliability and construct validity of the Spanish version of the 6-item CTS symptoms scale for outcomes assessment in carpal tunnel syndrome.

PubMed

Rosales, Roberto S; Martin-Hidalgo, Yolanda; Reboso-Morales, Luis; Atroshi, Isam

2016-03-03

The purpose of this study was to assess the reliability and construct validity of the Spanish version of the 6-item carpal tunnel syndrome (CTS) symptoms scale (CTS-6). In this cross-sectional study 40 patients diagnosed with CTS based on clinical and neurophysiologic criteria, completed the standard Spanish versions of the CTS-6 and the disabilities of the arm, shoulder and hand (QuickDASH) scales on two occasions with a 1-week interval. Internal-consistency reliability was assessed with the Cronbach alpha coefficient and test-retest reliability with the intraclass correlation coefficient, two way random effect model and absolute agreement definition (ICC2,1). Cross-sectional precision was analyzed with the Standard Error of the Measurement (SEM). Longitudinal precision for test-retest reliability coefficient was assessed with the Standard Error of the Measurement difference (SEMdiff) and the Minimal Detectable Change at 95 % confidence level (MDC95). For assessing construct validity it was hypothesized that the CTS-6 would have a strong positive correlation with the QuickDASH, analyzed with the Pearson correlation coefficient (r). The standard Spanish version of the CTS-6 presented a Cronbach alpha of 0.81 with a SEM of 0.3. Test-retest reliability showed an ICC of 0.85 with a SRMdiff of 0.36 and a MDC95 of 0.7. The correlation between CTS-6 and the QuickDASH was concordant with the a priori formulated construct hypothesis (r 0.69) CONCLUSIONS: The standard Spanish version of the 6-item CTS symptoms scale showed good internal consistency, test-retest reliability and construct validity for outcomes assessment in CTS. The CTS-6 will be useful to clinicians and researchers in Spanish speaking parts of the world. The use of standardized outcome measures across countries also will facilitate comparison of research results in carpal tunnel syndrome.
Age Band 1 of the Movement Assessment Battery for Children-Second Edition: Exploring Its Usefulness in Mainland China

ERIC Educational Resources Information Center

Hua, Jing; Gu, Guixiong; Meng, Wei; Wu, Zhuochun

2013-01-01

The aim of this paper was to examine the validity and reliability of age band 1 of the Movement Assessment Battery for Children-Second Edition (MABC-2) in preparation for its standardization in mainland China. Interrater and test-retest reliability of the MABC-2 was estimated using Intraclass Correlation Coefficient (ICC). Cronbach's alpha for…
Web-Based Assessment of Mental Well-Being in Early Adolescence: A Reliability Study.

PubMed

Hamann, Christoph; Schultze-Lutter, Frauke; Tarokh, Leila

2016-06-15

The ever-increasing use of the Internet among adolescents represents an emerging opportunity for researchers to gain access to larger samples, which can be queried over several years longitudinally. Among adolescents, young adolescents (ages 11 to 13 years) are of particular interest to clinicians as this is a transitional stage, during which depressive and anxiety symptoms often emerge. However, it remains unclear whether these youngest adolescents can accurately answer questions about their mental well-being using a Web-based platform. The aim of the study was to examine the accuracy of responses obtained from Web-based questionnaires by comparing Web-based with paper-and-pencil versions of depression and anxiety questionnaires. The primary outcome was the score on the depression and anxiety questionnaires under two conditions: (1) paper-and-pencil and (2) Web-based versions. Twenty-eight adolescents (aged 11-13 years, mean age 12.78 years and SD 0.78; 18 females, 64%) were randomly assigned to complete either the paper-and-pencil or the Web-based questionnaire first. Intraclass correlation coefficients (ICCs) were calculated to measure intrarater reliability. Intraclass correlation coefficients were calculated separately for depression (Children's Depression Inventory, CDI) and anxiety (Spence Children's Anxiety Scale, SCAS) questionnaires. On average, it took participants 17 minutes (SD 6) to answer 116 questions online. Intraclass correlation coefficient analysis revealed high intrarater reliability when comparing Web-based with paper-and-pencil responses for both CDI (ICC=.88; P<.001) and the SCAS (ICC=.95; P<.001). According to published criteria, both of these values are in the "almost perfect" category indicating the highest degree of reliability. The results of the study show an excellent reliability of Web-based assessment in 11- to 13-year-old children as compared with the standard paper-pencil assessment. Furthermore, we found that Web-based assessments with young adolescents are highly feasible, with all enrolled participants completing the Web-based form. As early adolescence is a time of remarkable social and behavioral changes, these findings open up new avenues for researchers from diverse fields who are interested in studying large samples of young adolescents over time.
The Reliability of Anthropometric Measurements Used Preoperatively in Aesthetic Breast Surgery.

PubMed

Isaac, Kathryn V; Murphy, Blake D; Beber, Brett; Brown, Mitchell

2016-04-01

Patient outcomes in aesthetic breast surgery are highly dependent on breast measurements used in preoperative planning. The purpose of this study is to determine the reliability of anthropometric breast measurements. Four raters measured 28 women using 7 measurements: sternal notch to nipple distance (Sn-N), nipple to midline (N-M), nipple to inframammary-fold distance under maximal stretch (N-IMF), breast base width (BW), soft tissue pinch thickness of the upper pole (STPT:UP), STPT at the inframammary fold (STPT:IMF), and anterior pull skin stretch (APSS). Reliability was assessed using intra-class correlation coefficients (ICCs). Inter-rater reliability was excellent for Sn-N, N-M, and BW (ICC = 0.94, 0.90, and 0.76, respectively) and was good for N-IMF (ICC = 0.70). The STPT:UP, STPT:IMF, and APSS measurements were not reliable between raters (ICC < 0.2). Intra-rater reliability was excellent for Sn-N, N-M, and BW for all raters (all ICC > 0.75). The N-IMF intra-rater reliability was excellent in senior raters (ICC > 0.75) and good in junior raters (ICC > 0.6). The STPT:UP, STPT:IMF, and APSS measurements showed fair or poor reliability for most raters (ICC < 0.6). The Sn-N, N-M, and BW measurements are very reliable. Dynamic measurements including APSS, STPT:UP, and STUP:IMF are unreliable. N-IMF is the only reliable dynamic measurement, and its reliability improves with increasing clinical experience. The variable reliability of preoperative measurements must be considered in the planning of aesthetic breast surgery. 4 Diagnostic. © 2015 The American Society for Aesthetic Plastic Surgery, Inc. Reprints and permission: journals.permissions@oup.com.
Wavefront Derived Refraction and Full Eye Biometry in Pseudophakic Eyes

PubMed Central

Mao, Xinjie; Banta, James T.; Ke, Bilian; Jiang, Hong; He, Jichang; Liu, Che; Wang, Jianhua

2016-01-01

Purpose To assess wavefront derived refraction and full eye biometry including ciliary muscle dimension and full eye axial geometry in pseudophakic eyes using spectral domain OCT equipped with a Shack-Hartmann wavefront sensor. Methods Twenty-eight adult subjects (32 pseudophakic eyes) having recently undergone cataract surgery were enrolled in this study. A custom system combining two optical coherence tomography systems with a Shack-Hartmann wavefront sensor was constructed to image and monitor changes in whole eye biometry, the ciliary muscle and ocular aberration in the pseudophakic eye. A Badal optical channel and a visual target aligning with the wavefront sensor were incorporated into the system for measuring the wavefront-derived refraction. The imaging acquisition was performed twice. The coefficients of repeatability (CoR) and intraclass correlation coefficient (ICC) were calculated. Results Images were acquired and processed successfully in all patients. No significant difference was detected between repeated measurements of ciliary muscle dimension, full-eye biometry or defocus aberration. The CoR of full-eye biometry ranged from 0.36% to 3.04% and the ICC ranged from 0.981 to 0.999. The CoR for ciliary muscle dimensions ranged from 12.2% to 41.6% and the ICC ranged from 0.767 to 0.919. The defocus aberrations of the two measurements were 0.443 ± 0.534 D and 0.447 ± 0.586 D and the ICC was 0.951. Conclusions The combined system is capable of measuring full eye biometry and refraction with good repeatability. The system is suitable for future investigation of pseudoaccommodation in the pseudophakic eye. PMID:27010674
Wavefront Derived Refraction and Full Eye Biometry in Pseudophakic Eyes.

PubMed

Mao, Xinjie; Banta, James T; Ke, Bilian; Jiang, Hong; He, Jichang; Liu, Che; Wang, Jianhua

2016-01-01

To assess wavefront derived refraction and full eye biometry including ciliary muscle dimension and full eye axial geometry in pseudophakic eyes using spectral domain OCT equipped with a Shack-Hartmann wavefront sensor. Twenty-eight adult subjects (32 pseudophakic eyes) having recently undergone cataract surgery were enrolled in this study. A custom system combining two optical coherence tomography systems with a Shack-Hartmann wavefront sensor was constructed to image and monitor changes in whole eye biometry, the ciliary muscle and ocular aberration in the pseudophakic eye. A Badal optical channel and a visual target aligning with the wavefront sensor were incorporated into the system for measuring the wavefront-derived refraction. The imaging acquisition was performed twice. The coefficients of repeatability (CoR) and intraclass correlation coefficient (ICC) were calculated. Images were acquired and processed successfully in all patients. No significant difference was detected between repeated measurements of ciliary muscle dimension, full-eye biometry or defocus aberration. The CoR of full-eye biometry ranged from 0.36% to 3.04% and the ICC ranged from 0.981 to 0.999. The CoR for ciliary muscle dimensions ranged from 12.2% to 41.6% and the ICC ranged from 0.767 to 0.919. The defocus aberrations of the two measurements were 0.443 ± 0.534 D and 0.447 ± 0.586 D and the ICC was 0.951. The combined system is capable of measuring full eye biometry and refraction with good repeatability. The system is suitable for future investigation of pseudoaccommodation in the pseudophakic eye.
Modeling parameters that characterize pacing of elite female 800-m freestyle swimmers.

PubMed

Lipińska, Patrycja; Allen, Sian V; Hopkins, Will G

2016-01-01

Pacing offers a potential avenue for enhancement of endurance performance. We report here a novel method for characterizing pacing in 800-m freestyle swimming. Websites provided 50-m lap and race times for 192 swims of 20 elite female swimmers between 2000 and 2013. Pacing for each swim was characterized with five parameters derived from a linear model: linear and quadratic coefficients for effect of lap number, reductions from predicted time for first and last laps, and lap-time variability (standard error of the estimate). Race-to-race consistency of the parameters was expressed as intraclass correlation coefficients (ICCs). The average swim was a shallow negative quadratic with slowest time in the eleventh lap. First and last laps were faster by 6.4% and 3.6%, and lap-time variability was ±0.64%. Consistency between swimmers ranged from low-moderate for the linear and quadratic parameters (ICC = 0.29 and 0.36) to high for the last-lap parameter (ICC = 0.62), while consistency for race time was very high (ICC = 0.80). Only ~15% of swimmers had enough swims (~15 or more) to provide reasonable evidence of optimum parameter values in plots of race time vs. each parameter. The modest consistency of most of the pacing parameters and lack of relationships between parameters and performance suggest that swimmers usually compensated for changes in one parameter with changes in another. In conclusion, pacing in 800-m elite female swimmers can be characterized with five parameters, but identifying an optimal pacing profile is generally impractical.
Apparent diffusion coefficient measurement in glioma: Influence of region-of-interest determination methods on apparent diffusion coefficient values, interobserver variability, time efficiency, and diagnostic ability.

PubMed

Han, Xu; Suo, Shiteng; Sun, Yawen; Zu, Jinyan; Qu, Jianxun; Zhou, Yan; Chen, Zengai; Xu, Jianrong

2017-03-01

To compare four methods of region-of-interest (ROI) placement for apparent diffusion coefficient (ADC) measurements in distinguishing low-grade gliomas (LGGs) from high-grade gliomas (HGGs). Two independent readers measured ADC parameters using four ROI methods (single-slice [single-round, five-round and freehand] and whole-volume) on 43 patients (20 LGGs, 23 HGGs) who had undergone 3.0 Tesla diffusion-weighted imaging and time required for each method of ADC measurements was recorded. Intraclass correlation coefficients (ICCs) were used to assess interobserver variability of ADC measurements. Mean and minimum ADC values and time required were compared using paired Student's t-tests. All ADC parameters (mean/minimum ADC values of three single-slice methods, mean/minimum/standard deviation/skewness/kurtosis/the10 th and 25 th percentiles/median/maximum of whole-volume method) were correlated with tumor grade (low versus high) by unpaired Student's t-tests. Discriminative ability was determined by receiver operating characteristic curves. All ADC measurements except minimum, skewness, and kurtosis of whole-volume ROI differed significantly between LGGs and HGGs (all P < 0.05). Mean ADC value of single-round ROI had the highest effect size (0.72) and the greatest areas under the curve (0.872). Three single-slice methods had good to excellent ICCs (0.67-0.89) and the whole-volume method fair to excellent ICCs (0.32-0.96). Minimum ADC values differed significantly between whole-volume and single-round ROI (P = 0.003) and, between whole-volume and five-round ROI (P = 0.001). The whole-volume method took significantly longer than all single-slice methods (all P < 0.001). ADC measurements are influenced by ROI determination methods. Whole-volume histogram analysis did not yield better results than single-slice methods and took longer. Mean ADC value derived from single-round ROI is the most optimal parameter for differentiating LGGs from HGGs. 3 J. Magn. Reson. Imaging 2017;45:722-730. © 2016 International Society for Magnetic Resonance in Medicine.

TU-AB-BRA-05: Repeatability of [F-18]-NaF PET Imaging Biomarkers for Bone Lesions: A Multicenter Study

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lin, C; Bradshaw, T; Perk, T

2015-06-15

Purpose: Quantifying the repeatability of imaging biomarkers is critical for assessing therapeutic response. While therapeutic efficacy has been traditionally quantified by SUV metrics, imaging texture features have shown potential for use as quantitative biomarkers. In this study we evaluated the repeatability of quantitative {sup 18}F-NaF PET-derived SUV metrics and texture features in bone lesions from patients in a multicenter study. Methods: Twenty-nine metastatic castrate-resistant prostate cancer patients received whole-body test-retest NaF PET/CT scans from one of three harmonized imaging centers. Bone lesions of volume greater than 1.5 cm{sup 3} were identified and automatically segmented using a SUV>15 threshold. From eachmore » lesion, 55 NaF PET-derived texture features (including first-order, co-occurrence, grey-level run-length, neighbor gray-level, and neighbor gray-tone difference matrix) were extracted. The test-retest repeatability of each SUV metric and texture feature was assessed with Bland-Altman analysis. Results: A total of 315 bone lesions were evaluated. Of the traditional SUV metrics, the repeatability coefficient (RC) was 12.6 SUV for SUVmax, 2.5 SUV for SUVmean, and 4.3 cm{sup 3} for volume. Their respective intralesion coefficients of variation (COVs) were 12%, 17%, and 6%. Of the texture features, COV was lowest for entropy (0.03%) and highest for kurtosis (105%). Lesion intraclass correlation coefficient (ICC) was lowest for maximum correlation coefficient (ICC=0.848), and highest for entropy (ICC=0.985). Across imaging centers, repeatability of texture features and SUV varied. For example, across imaging centers, COV for SUVmax ranged between 11–23%. Conclusion: Many NaF PET-derived SUV metrics and texture features for bone lesions demonstrated high repeatability, such as SUVmax, entropy, and volume. Several imaging texture features demonstrated poor repeatability, such as SUVtotal and SUVstd. These results can be used to establish response criteria for NaF PET-based treatment response assessment. Prostate Cancer Foundation (PCF)« less
Variability of a "force signature" during windmill softball pitching and relationship between discrete force variables and pitch velocity.

PubMed

Nimphius, Sophia; McGuigan, Michael R; Suchomel, Timothy J; Newton, Robert U

2016-06-01

This study assessed reliability of discrete ground reaction force (GRF) variables over multiple pitching trials, investigated the relationships between discrete GRF variables and pitch velocity (PV) and assessed the variability of the "force signature" or continuous force-time curve during the pitching motion of windmill softball pitchers. Intraclass correlation coefficient (ICC) for all discrete variables was high (0.86-0.99) while the coefficient of variance (CV) was low (1.4-5.2%). Two discrete variables were significantly correlated to PV; second vertical peak force (r(5)=0.81, p=0.03) and time between peak forces (r(5)=-0.79; p=0.03). High ICCs and low CVs support the reliability of discrete GRF and PV variables over multiple trials and significant correlations indicate there is a relationship between the ability to produce force and the timing of this force production with PV. The mean of all pitchers' curve-average standard deviation of their continuous force-time curves demonstrated low variability (CV=4.4%) indicating a repeatable and identifiable "force signature" pattern during this motion. As such, the continuous force-time curve in addition to discrete GRF variables should be examined in future research as a potential method to monitor or explain changes in pitching performance. Copyright © 2016 Elsevier B.V. All rights reserved.
[Reliability and validity of the PAQ-A questionnaire to assess physical activity in Spanish adolescents].

PubMed

Martínez-Gómez, David; Martínez-de-Haro, Vicente; Pozo, Tamara; Welk, Gregory J; Villagra, Ariel; Calle, Marisa E; Marcos, Ascensión; Veiga, Oscar L

2009-01-01

Questionnaires are feasible instruments to assess physical activity (PA) in large samples. The aim of the current study was to evaluate the reliability and validity of the PAQ-A questionnaire in Spanish adolescents using the measurement of PA by accelerometer as criterion. In a sample of 82 adolescents, aged 12 to 17 years, 1-week PAQ-A test-retest was administered. Reliability was analyzed by the Intraclass Correlation Coefficient (ICC) and the internal consistency by the Cronbach's alpha Coefficient. Two hundred thirty-two adolescents, aged 13-17 years, completed the PAQ-A and wore the ActiGraph GT1M accelerometer during 7-days. The PAQ-A was compared against total PA and moderate to vigorous PA (MVPA) obtained by the accelerometer. Test-retest reliability showed ICC = 0.71 for the final score of PAQ-A. Internal consistency was alpha = 0.65 in the first self-report, alpha = 0.67 in the retest in 82 adolescents sample, and alpha = 0.74 in the 232 adolescents sample. The PAQ-A was moderately correlated with total PA (rho = 0.39) and MVPA (rho= 0.34) assessed by the accelerometer. The PAQ-A obtained significantly moderate correlations in boys but not in girls against the accelerometer. The PAQ-A questionnaire shows an adequate reliability and a reasonable validity for assessing PA in Spanish adolescents.
Preliminary appraisal of the reliability and validity of the Colorado State University Feline Acute Pain Scale.

PubMed

Shipley, Hilary; Guedes, Alonso; Graham, Lynelle; Goudie-DeAngelis, Elizabeth; Wendt-Hornickle, Erin

2018-05-01

Objectives The objective of this study was to determine the inter-rater reliability and convergent validity of the Colorado State University Feline Acute Pain Scale (CSU-FAPS) in a preliminary appraisal of its performance in a clinical teaching setting. Methods Sixty-eight female cats were assessed for pain after ovariohysterectomy. A cohort of 21 cats was examined independently by four raters (two board-certified anesthesiologists and two anesthesia residents) with the CSU-FAPS, and intra-class correlation coefficient (ICC) was used to determine inter-rater reliability. Weighted Cohen's kappa was used to determine inter-rater reliability centered on the 'need to reassess analgesic plan' (dichotomous scale). A separate cohort of 47 cats was evaluated independently by two raters (one board-certified anesthesiologist and one veterinary small animal rotating intern) using the CSU-FAPS and the Glasgow Composite Measure Pain Scale (CMPS-Feline), and Spearman rank-order correlation was determined to assess convergent validity. Reliability was interpreted using Altman's classification as very good, good, moderate, fair and poor. Validity was considered adequate if correlation coefficients were between 0.4 and 0.8. Results The ICC was 0.61 for anesthesiologists and 0.67 for residents, indicating good reliability. Weighted Cohen's kappa was 0.79 for anesthesiologists and 0.44 for residents, indicating moderate to good reliability. The Spearman rank correlation indicated a statistically significant ( P = 0.0003) positive correlation (0.31; 95% confidence interval 0.14-0.46) between the CSU-FAPS and the CMPS-Feline. Conclusions and relevance The CSU-FAPS showed moderate-to-good inter-rater reliability when used by veterinarians to assess pain level or need to reassess analgesic plan after ovariohysterectomy in cats. The validity fell short of current guidelines for correlation coefficients and further refinement and testing are warranted to improve its performance.
Five times sit-to-stand test in subjects with total knee replacement: Reliability and relationship with functional mobility tests.

PubMed

Medina-Mirapeix, Francesc; Vivo-Fernández, Iván; López-Cañizares, Juan; García-Vidal, José A; Benítez-Martínez, Josep Carles; Del Baño-Aledo, María Elena

2018-01-01

The objective was to determine the inter-observer and test/retest reliability of the "Five-repetition sit-to-stand" (5STS) test in patients with total knee replacement (TKR). To explore correlation between 5STS and two mobility tests. A reliability study was conducted among 24 (mean age 72.13, S.D. 10.67; 50% were women) outpatients with TKR. They were recruited from a traumatology unit of a public hospital via convenience sampling. A physiotherapist and trauma physician assessed each patient at the same time. The same physiotherapist realized a 5STS second measurement 45-60min after the first one. Reliability was assessed with intraclass correlation coefficients (ICCs) and Bland-Altman plots. Pearson coefficient was calculated to assess the correlation between 5STS, time up to go test (TUG) and four meters gait speed (4MGS). ICC for inter-observer and test-retest reliability of the 5STS were 0.998 (95% confidence interval [CI], 0.995-0.999) and 0.982 (95% CI, 0.959-0.992). Bland-Altman plot inter-observer showed limits between -0.82 and 1.06 with a mean of 0.11 and no heteroscedasticity within the data. Bland-Altman plot for test-retest showed the limits between 1.76 and 4.16, a mean of 1.20 and heteroscedasticity within the data. Pearson correlation coefficient revealed significant correlation between 5STS and TUG (r=0.7, p<0.001) and 4MGS (r=-0.583, p=0.003). This study demonstrates excellent inter-observer and test-retest reliability when it is used in people with TKR, and also significant correlation with other functional mobility tests. These findings support the use of 5STS as outcome measure in TKR population. Copyright © 2017 Elsevier B.V. All rights reserved.
Validity and reliability of a new tool to evaluate handwriting difficulties in Parkinson's disease.

PubMed

Nackaerts, Evelien; Heremans, Elke; Smits-Engelsman, Bouwien C M; Broeder, Sanne; Vandenberghe, Wim; Bergmans, Bruno; Nieuwboer, Alice

2017-01-01

Handwriting in Parkinson's disease (PD) features specific abnormalities which are difficult to assess in clinical practice since no specific tool for evaluation of spontaneous movement is currently available. This study aims to validate the 'Systematic Screening of Handwriting Difficulties' (SOS-test) in patients with PD. Handwriting performance of 87 patients and 26 healthy age-matched controls was examined using the SOS-test. Sixty-seven patients were tested a second time within a period of one month. Participants were asked to copy as much as possible of a text within 5 minutes with the instruction to write as neatly and quickly as in daily life. Writing speed (letters in 5 minutes), size (mm) and quality of handwriting were compared. Correlation analysis was performed between SOS outcomes and other fine motor skill measurements and disease characteristics. Intrarater, interrater and test-retest reliability were assessed using the intraclass correlation coefficient (ICC) and Spearman correlation coefficient. Patients with PD had a smaller (p = 0.043) and slower (p<0.001) handwriting and showed worse writing quality (p = 0.031) compared to controls. The outcomes of the SOS-test significantly correlated with fine motor skill performance and disease duration and severity. Furthermore, the test showed excellent intrarater, interrater and test-retest reliability (ICC > 0.769 for both groups). The SOS-test is a short and effective tool to detect handwriting problems in PD with excellent reliability. It can therefore be recommended as a clinical instrument for standardized screening of handwriting deficits in PD.
Psychometric Properties of the Persian Version of the Simple Shoulder Test (SST) Questionnaire.

PubMed

Ebrahimzadeh, Mohammad H; Vahedi, Ehsan; Baradaran, Aslan; Birjandinejad, Ali; Seyyed-Hoseinian, Seyyed-Hadi; Bagheri, Farshid; Kachooei, Amir Reza

2016-10-01

To validate the Persian version of the simple shoulder test in patients with shoulder joint problems. Following Beaton`s guideline, translation and back translation was conducted. We reached to a consensus on the Persian version of SST. To test the face validity in a pilot study, the Persian SST was administered to 20 individuals with shoulder joint conditions. We enrolled 148 consecutive patients with shoulder problem to fill the Persian SST, shoulder specific measure including Oxford shoulder score (OSS) and two general measures including DASH and SF-36. To measure the test-retest reliability, 42 patients were randomly asked to fill the Persian-SST for the second time after one week. Cronbach's alpha coefficient was used to demonstrate internal consistency over the 12 items of Persian-SST. ICC for the total questionnaire was 0.61 showing good and acceptable test-retest reliability. ICC for individual items ranged from 0.32 to 0.79. The total Cronbach's alpha was 0.84 showing good internal consistency over the 12 items of the Persian-SST. Validity testing showed strong correlation between SST and OSS and DASH. The correlation with OSS was positive while with DASH scores was negative. The correlation was also good to strong with all physical and most mental subscales of the SF-36. Correlation coefficient was higher with DASH and OSS in compare to SF-36. Persian version of SST found to be valid and reliable instrument for shoulder joint pain and function assessment in Iranian population.
Dynamic gadolinium-enhanced magnetic resonance imaging allows accurate assessment of the synovial inflammatory activity in rheumatoid arthritis knee joints: a comparison with synovial histology.

PubMed

Axelsen, M B; Stoltenberg, M; Poggenborg, R P; Kubassova, O; Boesen, M; Bliddal, H; Hørslev-Petersen, K; Hanson, L G; Østergaard, M

2012-03-01

To determine whether dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) evaluated using semi-automatic image processing software can accurately assess synovial inflammation in rheumatoid arthritis (RA) knee joints. In 17 RA patients undergoing knee surgery, the average grade of histological synovial inflammation was determined from four biopsies obtained during surgery. A preoperative series of T(1)-weighted dynamic fast low-angle shot (FLASH) MR images was obtained. Parameters characterizing contrast uptake dynamics, including the initial rate of enhancement (IRE), were generated by the software in three different areas: (I) the entire slice (Whole slice); (II) a manually outlined region of interest (ROI) drawn quickly around the joint, omitting large artefacts such as blood vessels (Quick ROI); and (III) a manually outlined ROI following the synovial capsule of the knee joint (Precise ROI). Intra- and inter-reader agreement was assessed using the intra-class correlation coefficient (ICC). The IRE from the Quick ROI and the Precise ROI revealed high correlations to the grade of histological inflammation (Spearman's correlation coefficient (rho) = 0.70, p = 0.001 and rho = 0.74, p = 0.001, respectively). Intra- and inter-reader ICCs were very high (0.93-1.00). No Whole slice parameters were correlated to histology. DCE-MRI provides fast and accurate assessment of synovial inflammation in RA patients. Manual outlining of the joint to omit large artefacts is necessary.
Repeatability of two-dimensional chemical shift imaging multivoxel proton magnetic resonance spectroscopy for measuring human cerebral choline-containing compounds.

PubMed

Puri, Basant K; Egan, Mary; Wallis, Fintan; Jakeman, Philip

2018-03-22

To investigate the repeatability of proton magnetic resonance spectroscopy in the in vivo measurement of human cerebral levels of choline-containing compounds (Cho). Two consecutive scans were carried out in six healthy resting subjects at a magnetic field strength of 1.5 T. On each occasion, neurospectroscopy data were collected from 64 voxels using the same 2D chemical shift imaging (CSI) sequence. The data were analyzed in the same way, using the same software, to obtain the values for each voxel of the ratio of Cho to creatine. The Wilcoxon related-samples signed-rank test, coefficient of variation (CV), repeatability coefficient (RC), and intraclass correlation coefficient (ICC) were used to assess the repeatability. The CV ranged from 2.75% to 33.99%, while the minimum RC was 5.68%. There was excellent reproducibility, as judged by significant ICC values, in 26 voxels. Just three voxels showed significant differences according to the Wilcoxon related-samples signed-rank test. It is therefore concluded that when CSI multivoxel proton neurospectroscopy is used to measure cerebral choline-containing compounds at 1.5 T, the reproducibility is highly acceptable.
Diagnosing Femoroacetabular Impingement From Plain Radiographs

PubMed Central

Ayeni, Olufemi R.; Chan, Kevin; Whelan, Daniel B.; Gandhi, Rajiv; Williams, Dale; Harish, Srinivasan; Choudur, Hema; Chiavaras, Mary M.; Karlsson, Jon; Bhandari, Mohit

2014-01-01

Background: A diagnosis of femoroacetabular impingement (FAI) requires careful history and physical examination, as well as an accurate and reliable radiologic evaluation using plain radiographs as a screening modality. Radiographic markers in the diagnosis of FAI are numerous and not fully validated. In particular, reliability in their assessment across health care providers is unclear. Purpose: To determine inter- and intraobserver reliability between orthopaedic surgeons and musculoskeletal radiologists. Study Design: Cohort study (diagnosis); Level of evidence, 3. Methods: Six physicians (3 orthopaedic surgeons, 3 musculoskeletal radiologists) independently evaluated a broad spectrum of FAI pathologies across 51 hip radiographs on 2 occasions separated by at least 4 weeks. Reviewers used 8 common criteria to diagnose FAI, including (1) pistol-grip deformity, (2) size of alpha angle, (3) femoral head-neck offset, (4) posterior wall sign abnormality, (5) ischial spine sign abnormality, (6) coxa profunda abnormality, (7) crossover sign abnormality, and (8) acetabular protrusion. Agreement was calculated using the intraclass correlation coefficient (ICC). Results: When establishing an FAI diagnosis, there was poor interobserver reliability between the surgeons and radiologists (ICC batch 1 = 0.33; ICC batch 2 = 0.15). In contrast, there was higher interobserver reliability within each specialty, ranging from fair to good (surgeons: ICC batch 1 = 0.72; ICC batch 2 = 0.70 vs radiologists: ICC batch 1 = 0.59; ICC batch 2 = 0.74). Orthopaedic surgeons had the highest interobserver reliability when identifying pistol-grip deformities (ICC = 0.81) or abnormal alpha angles (ICC = 0.81). Similarly, radiologists had the highest agreement for detecting pistol-grip deformities (ICC = 0.75). Conclusion: These results suggest that surgeons and radiologists agree among themselves, but there is a need to improve the reliability of radiographic interpretations for FAI between the 2 specialties. The observed degree of low reliability may ultimately lead to missed, delayed, or inappropriate treatments for patients with symptomatic FAI. PMID:26535344
Validity and Reliability of a Digital Inclinometer to Assess Knee Joint Position Sense in an Open Kinetic Chain.

PubMed

Romero-Franco, Natalia; Montaño-Munuera, Juan Antonio; Fernández-Domínguez, Juan Carlos; Jiménez-Reyes, Pedro

2017-12-18

New methods are being validated to easily evaluate the knee joint position sense (JPS) due to its role in sports movement and the risk of injury. However, no studies to date have considered the open kinetic chain (OKC) technique, despite the biomechanical differences compared to closed kinetic chain movements. To analyze the validity and reliability of a digital inclinometer to measure the knee JPS in the OKC movement. The validity, inter-tester and intra-tester reliability of a digital inclinometer for measuring knee JPS were evaluated. Sports research laboratory. Eighteen athletes (11 males and 7 females; 28.4 ± 6.6 years; 71.9 ± 14.0 kg; 1.77 ± 0.09 m; 22.8 ± 3.2 kg/m 2 ) voluntary participated in this study. Absolute angular error (AAE), relative angular error (RAE) and variable angular error (VAE) of knee JPS in an OKC. Intraclass correlation coefficient (ICC) and standard error of the mean (SEM) were calculated to determine the validity and reliability of the inclinometer. Data showed excellent validity of the inclinometer to obtain proprioceptive errors compared to the video analysis in JPS tasks (AAE: ICC = 0.981, SEM = 0.08; RAE: ICC = 0.974, SEM = 0.12; VAE: ICC = 0.973, SEM = 0.07). Inter-tester reliability was also excellent for all the proprioceptive errors (AAE: ICC = 0.967, SEM = 0.04; RAE: ICC = 0.974, SEM = 0.03; VAE: ICC = 0.939, SEM = 0.08). Similar results were obtained for intra-tester reliability (AAE: ICC = 0.861, SEM = 0.1; RAE: ICC = 0.894, SEM = 0.1; VAE: ICC = 0.700, SEM = 0.2). The digital inclinometer is a valid and reliable method to assess the knee JPS in OKC. Sport professionals may evaluate the knee JPS to monitor its deterioration during training or improvements throughout the rehabilitation process.
Determination of repeatability of kinect sensor.

PubMed

Bonnechère, Bruno; Sholukha, Victor; Jansen, Bart; Omelina, Lubos; Rooze, Marcel; Van Sint Jan, Serge

2014-05-01

The Kinect™ (Microsoft™, Redmond, WA) sensor, originally developed for gaming purposes, may have interesting possibilities for other fields such as posture and motion assessment. The ability of the Kinect sensor to perform biomechanical measurements has previously been studied and shows promising results. However, interday repeatability of the device is still not known. This study assessed the intra- and interday repeatability of the Kinect sensor compared with a standard stereophotogrammetric device during posture assessment for measuring segment lengths. Forty subjects took part in the study. Five motionless captures were performed in one session to assess posture. Data were simultaneously recorded with both devices. Similar intraclass correlations coefficient (ICC) values were found for intraday (ICC=0.94 for the Kinect device and 0.98 for the stereophotogrammetric device) and interday (ICC=0.88 and 0.87, respectively) repeatability. Results of this study suggest that a cost-effective, easy-to-use, and portable single markerless camera offers the same repeatability during posture assessment as an expensive, time-consuming, and nontransportable marker-based device.
Psychometric Properties of a Standardized Observation Protocol to Quantify Pediatric Physical Therapy Actions.

PubMed

Sonderer, Patrizia; Akhbari Ziegler, Schirin; Gressbach Oertle, Barbara; Meichtry, André; Hadders-Algra, Mijna

2017-07-01

Pediatric physical therapy (PPT) is characterized by heterogeneity. This blurs the evaluation of effective components of PPT. The Groningen Observation Protocol (GOP) was developed to quantify contents of PPT. This study assesses the reliability and completeness of the GOP. Sixty infant PPT sessions were video-taped. Two random samples of 10 videos were used to determine interrater and intrarater reliability using interclass correlation coefficients (ICCs) with 95% confidence intervals. Completeness of GOP 2.0 was based on 60 videos. Interrater reliability of quantifying PPT actions was excellent (ICC, 0.75-1.0) in 71% and sufficient to good (ICC, 0.4-0.74) in 24% of PPT actions. Intrarater reliability was excellent in 94% and sufficient to good in 6% of PPT actions. Completeness was good for greater than 90% of PPT actions. GOP 2.0 has good reliability and completeness. After appropriate training, it is a useful tool to quantify PPT for children with developmental disorders.
Validation of Spanish versions of the Pelvic Floor Distress Inventory (PFDI) and Pelvic Floor Impact Questionnaire (PFIQ): a multicenter validation randomized study.

PubMed

Omotosho, Tola B; Hardart, Anne; Rogers, Rebecca G; Schaffer, Joseph I; Kobak, William H; Romero, Audrey A

2009-06-01

The purpose of this study is to validate Spanish versions of the Pelvic Floor Distress Inventory (PFDI) and Pelvic Floor Impact Questionnaire (PFIQ). Spanish versions were developed using back translation and validation was performed by randomizing bilingual women to complete the Spanish or English versions of the questionnaires first. Weighted kappa statistics assessed agreement for individual questions; interclass correlation coefficients (ICC) compared primary and subscale scores. Cronbach's alpha assessed internal consistency of Spanish versions. To detect a 2.7 point difference in scores with 80% power and alpha of 0.05, 44 bilingual subjects were required. Individual questions showed good to excellent agreement (kappa > 0.6) for all but eight questions on the PFIQ. ICCs of primary and subscale scores for both questionnaires showed excellent agreement. (All ICC > 0.79). All Cronbach's alpha values were excellent (>0.84) for the primary scales of both questionnaires. Valid and reliable Spanish versions of the PFIQ and PFDI have been developed.
Validation of equations for pleural effusion volume estimation by ultrasonography.

PubMed

Hassan, Maged; Rizk, Rana; Essam, Hatem; Abouelnour, Ahmed

2017-12-01

To validate the accuracy of previously published equations that estimate pleural effusion volume using ultrasonography. Only equations using simple measurements were tested. Three measurements were taken at the posterior axillary line for each case with effusion: lateral height of effusion ( H ), distance between collapsed lung and chest wall ( C ) and distance between lung and diaphragm ( D ). Cases whose effusion was aspirated to dryness were included and drained volume was recorded. Intra-class correlation coefficient (ICC) was used to determine the predictive accuracy of five equations against the actual volume of aspirated effusion. 46 cases with effusion were included. The most accurate equation in predicting effusion volume was ( H + D ) × 70 (ICC 0.83). The simplest and yet accurate equation was H × 100 (ICC 0.79). Pleural effusion height measured by ultrasonography gives a reasonable estimate of effusion volume. Incorporating distance between lung base and diaphragm into estimation improves accuracy from 79% with the first method to 83% with the latter.
Carotid stenosis assessment with multi-detector CT angiography: comparison between manual and automatic segmentation methods.

PubMed

Zhu, Chengcheng; Patterson, Andrew J; Thomas, Owen M; Sadat, Umar; Graves, Martin J; Gillard, Jonathan H

2013-04-01

Luminal stenosis is used for selecting the optimal management strategy for patients with carotid artery disease. The aim of this study is to evaluate the reproducibility of carotid stenosis quantification using manual and automated segmentation methods using submillimeter through-plane resolution Multi-Detector CT angiography (MDCTA). 35 patients having carotid artery disease with >30 % luminal stenosis as identified by carotid duplex imaging underwent contrast enhanced MDCTA. Two experienced CT readers quantified carotid stenosis from axial source images, reconstructed maximum intensity projection (MIP) and 3D-carotid geometry which was automatically segmented by an open-source toolkit (Vascular Modelling Toolkit, VMTK) using NASCET criteria. Good agreement among the measurement using axial images, MIP and automatic segmentation was observed. Automatic segmentation methods show better inter-observer agreement between the readers (intra-class correlation coefficient (ICC): 0.99 for diameter stenosis measurement) than manual measurement of axial (ICC = 0.82) and MIP (ICC = 0.86) images. Carotid stenosis quantification using an automatic segmentation method has higher reproducibility compared with manual methods.
Goniometric reliability in a clinical setting. Shoulder measurements.

PubMed

Riddle, D L; Rothstein, J M; Lamb, R L

1987-05-01

The purpose of this study was to examine the intratester and intertester reliabilities for clinical goniometric measurements of shoulder passive range of motion (PROM) using two different sizes of universal goniometers. Patients were measured without controlling therapist goniometric placement technique or patient position during measurements. Repeated PROM measurements of shoulder flexion, extension, abduction, shoulder horizontal abduction, horizontal adduction, lateral (external) rotation, and medial (internal) rotation were taken of two groups of 50 subjects each. The intratester intraclass correlation coefficients (ICCs) for all motions ranged from .87 to .99. The ICCs for the intertester reliability of PROM measurements of horizontal abduction, horizontal adduction, extension, and medial rotation ranged from .26 to .55. The intertester ICCs for PROM measurements of flexion, abduction, and lateral rotation ranged from .84 to .90. Goniometric PROM measurements for the shoulder appear to be highly reliable when taken by the same physical therapist, regardless of the size of the goniometer used. The degree of intertester reliability for these measurements appears to be range-of-motion specific.
Dynamic footprint measurement collection technique and intrarater reliability: ink mat, paper pedography, and electronic pedography.

PubMed

Fascione, Jeanna M; Crews, Ryan T; Wrobel, James S

2012-01-01

Identifying the variability of footprint measurement collection techniques and the reliability of footprint measurements would assist with appropriate clinical foot posture appraisal. We sought to identify relationships between these measures in a healthy population. On 30 healthy participants, midgait dynamic footprint measurements were collected using an ink mat, paper pedography, and electronic pedography. The footprints were then digitized, and the following footprint indices were calculated with photo digital planimetry software: footprint index, arch index, truncated arch index, Chippaux-Smirak Index, and Staheli Index. Differences between techniques were identified with repeated-measures analysis of variance with post hoc test of Scheffe. In addition, to assess practical similarities between the different methods, intraclass correlation coefficients (ICCs) were calculated. To assess intrarater reliability, footprint indices were calculated twice on 10 randomly selected ink mat footprint measurements, and the ICC was calculated. Dynamic footprint measurements collected with an ink mat significantly differed from those collected with paper pedography (ICC, 0.85-0.96) and electronic pedography (ICC, 0.29-0.79), regardless of the practical similarities noted with ICC values (P = .00). Intrarater reliability for dynamic ink mat footprint measurements was high for the footprint index, arch index, truncated arch index, Chippaux-Smirak Index, and Staheli Index (ICC, 0.74-0.99). Footprint measurements collected with various techniques demonstrate differences. Interchangeable use of exact values without adjustment is not advised. Intrarater reliability of a single method (ink mat) was found to be high.
Handcrafted cuff manometers do not accurately measure endotracheal tube cuff pressure

PubMed Central

Annoni, Raquel; de Almeida, Antonio Evanir

2015-01-01

Objective To test the agreement between two handcrafted devices and a cuff-specific manometer. Methods The agreement between two handcrafted devices adapted to measure tracheal tube cuff pressure and a cuff-specific manometer was tested on 79 subjects. The cuff pressure was measured with a commercial manometer and with two handcrafted devices (HD) assembled with aneroid sphygmomanometers (HD1 and HD2). The data were compared using Wilcoxon and Spearman tests, the intraclass correlation coefficient (ICC) and limit-of-agreement analysis. Results Cuff pressures assessed with handcrafted devices were significantly different from commercial device measurements (pressures were higher when measured with HD1 and lower with HD2). The ICCs between the commercial device and HD1 and HD2 were excellent (ICC = 0.8 p < 0.001) and good (ICC = 0.66, p < 0.001), respectively. However, the Bland- Altman plots showed wide limits of agreement between HD1 and HD2 and the commercial device. Conclusion The handcrafted manometers do not provide accurate cuff pressure measurements when compared to a cuff-specific device and should not be used to replace the commercial cuff manometers in mechanically ventilated patients. PMID:26376160
Reliability of Neurobehavioral Assessments from Birth to Term Equivalent Age in Preterm and Term Born Infants.

PubMed

Eeles, Abbey L; Olsen, Joy E; Walsh, Jennifer M; McInnes, Emma K; Molesworth, Charlotte M L; Cheong, Jeanie L Y; Doyle, Lex W; Spittle, Alicia J

2017-02-01

Neurobehavioral assessments provide insight into the functional integrity of the developing brain and help guide early intervention for preterm (<37 weeks' gestation) infants. In the context of shorter hospital stays, clinicians often need to assess preterm infants prior to term equivalent age. Few neurobehavioral assessments used in the preterm period have established interrater reliability. To evaluate the interrater reliability of the Hammersmith Neonatal Neurological Examination (HNNE) and the NICU Network Neurobehavioral Scale (NNNS), when used both preterm and at term (>36 weeks). Thirty-five preterm infants and 11 term controls were recruited. Five assessors double-scored the HNNE and NNNS administered either preterm or at term. A one-way random effects, absolute, single-measures interclass correlation coefficient (ICC) was calculated to determine interrater reliability. Interrater reliability for the HNNE was excellent (ICC > 0.74) for optimality scores, and good (ICC 0.60-0.74) to excellent for subtotal scores, except for 'Tone Patterns' (ICC 0.54). On the NNNS, interrater reliability was predominantly excellent for all items. Interrater agreement was generally excellent at both time points. Overall, the HNNE and NNNS neurobehavioral assessments demonstrated mostly excellent interrater reliability when used prior to term and at term.

The reliability of eyetracking to assess attentional bias to threatening words in healthy individuals.

PubMed

Skinner, Ian W; Hübscher, Markus; Moseley, G Lorimer; Lee, Hopin; Wand, Benedict M; Traeger, Adrian C; Gustin, Sylvia M; McAuley, James H

2017-08-15

Eyetracking is commonly used to investigate attentional bias. Although some studies have investigated the internal consistency of eyetracking, data are scarce on the test-retest reliability and agreement of eyetracking to investigate attentional bias. This study reports the test-retest reliability, measurement error, and internal consistency of 12 commonly used outcome measures thought to reflect the different components of attentional bias: overall attention, early attention, and late attention. Healthy participants completed a preferential-looking eyetracking task that involved the presentation of threatening (sensory words, general threat words, and affective words) and nonthreatening words. We used intraclass correlation coefficients (ICCs) to measure test-retest reliability (ICC > .70 indicates adequate reliability). The ICCs(2, 1) ranged from -.31 to .71. Reliability varied according to the outcome measure and threat word category. Sensory words had a lower mean ICC (.08) than either affective words (.32) or general threat words (.29). A longer exposure time was associated with higher test-retest reliability. All of the outcome measures, except second-run dwell time, demonstrated low measurement error (<6%). Most of the outcome measures reported high internal consistency (α > .93). Recommendations are discussed for improving the reliability of eyetracking tasks in future research.
Concordance and interchangeability of biometric measurements of ocular axial length in patients awaiting cataract surgery.

PubMed

Martín-Serrano, María José; Roman-Ortiz, Carmen; Villa-Sáez, M Luz; Labrador-Castellanos, M Purificación; Blanco-Carrasco, Rosario; Lozano-Ballesteros, Felicidad; Pedraza-Martín, Carmen; José-Herrero, M Teresa San; López-Ropero, Ana M; Tenías Burillo, José María

2014-01-01

To estimate in patients awaiting cataract surgery the concordance and interchangeability of axial eye length measurements performed with the aid of various biometric methods (optical or ultrasonic) by different operators (nurses) at different times during the period prior to surgery. We selected 182 consecutive eyes from 91 patients.Ocular axial length was measured with the aid of 2 methods (IOLMaster® and Ocuscan®) by 9 randomly allocated technicians at 2 different times during the waiting period. The concordance between measurements was evaluated by means of the intraclass correlation coefficient (ICC); the interchangeability of the results was assessed with Bland Altman plots and Passing and Bablok regression. The measurements were consistent between biometric methods (ICC 0.975, 95% confidence interval [CI] 0.968 to 0.980) and measurement dates (ICC 0.996, 95% CI 0.995 to 0.997). Interobserver agreement was more heterogeneous (ICC range 0.844 to 0.998). No systematic errors were observed among the various biometric methods and measurement dates. Because measurement of axial length in phakic patients may be technician-dependent, the technician's experience should be noted in the protocols of ophthalmology services.
Design and Evaluation of a Training Protocol for a Photographic Method of Visual Estimation of Fruit and Vegetable Intake among Kindergarten Through Second-Grade Students.

PubMed

Masis, Natalie; McCaffrey, Jennifer; Johnson, Susan L; Chapman-Novakofski, Karen

2017-04-01

To design a replicable training protocol for visual estimation of fruit and vegetable (FV) intake of kindergarten through second-grade students through digital photography of lunch trays that results in reliable data for FV served and consumed. Protocol development through literature and researcher input was followed by 3 laboratory-based trainings of 3 trainees. Lunchroom data collection sessions were done at 2 elementary schools for kindergarten through second-graders. Intraclass correlation coefficients (ICCs) were used. By training 3, ICC was substantial for amount of FV served and consumed (0.86 and 0.95, respectively; P < .05). The ICC was moderate for percentage of fruits consumed (0.67; P = .06). In-school estimates for ICCs were all significant for amounts served at school 1 and percentage of FV consumed at both schools. The protocol resulted in reliable estimation of combined FV served and consumed using digital photography. The ability to estimate FV intake accurately will benefit intervention development and evaluation. Copyright © 2017 Society for Nutrition Education and Behavior. Published by Elsevier Inc. All rights reserved.
First quality score for referral letters in gastroenterology—a validation study

PubMed Central

Eskeland, Sigrun Losada; Brunborg, Cathrine; Seip, Birgitte; Wiencke, Kristine; Hovde, Øistein; Owen, Tanja; Skogestad, Erik; Huppertz-Hauss, Gert; Halvorsen, Fred-Arne; Garborg, Kjetil; Aabakken, Lars; de Lange, Thomas

2016-01-01

Objective To create and validate an objective and reliable score to assess referral quality in gastroenterology. Design An observational multicentre study. Setting and participants 25 gastroenterologists participated in selecting variables for a Thirty Point Score (TPS) for quality assessment of referrals to gastroenterology specialist healthcare for 9 common indications. From May to September 2014, 7 hospitals from the South-Eastern Norway Regional Health Authority participated in collecting and scoring 327 referrals to a gastroenterologist. Main outcome measure Correlation between the TPS and a visual analogue scale (VAS) for referral quality. Results The 327 referrals had an average TPS of 13.2 (range 1–25) and an average VAS of 4.7 (range 0.2–9.5). The reliability of the score was excellent, with an intra-rater intraclass correlation coefficient (ICC) of 0.87 and inter-rater ICC of 0.91. The overall correlation between the TPS and the VAS was moderate (r=0.42), and ranged from fair to substantial for the various indications. Mean agreement was good (ICC=0.47, 95% CI (0.34 to 0.57)), ranging from poor to good. Conclusions The TPS is reliable, objective and shows good agreement with the subjective VAS. The score may be a useful tool for assessing referral quality in gastroenterology, particularly important when evaluating the effect of interventions to improve referral quality. PMID:27855107
The Unilateral Below Elbow Test: a function test for children with unilateral congenital below elbow deficiency.

PubMed

Bagley, Anita M; Molitor, Fred; Wagner, Lisa V; Tomhave, Wendy; James, Michelle A

2006-07-01

The Unilateral Below Elbow Test (UBET) was developed to evaluate function in bimanual activities for both the prosthesis wearer and non-wearer. Nine tasks were chosen for each of four age-specific categories defined by development stages of hand function (2-4y, 5-7y, 8-10y, and 11-21y). Two scales, Completion of Task and Method of Use, were designed to rate performance. To measure reliability, four occupational therapists scored samples of videotaped UBET performances. For Completion of Task, an interval scale, agreement in scoring was measured with interclass correlation coefficients (ICC; n=9; five females, four males). For Method of Use, a nominal scale, chance-adjusted association was calculated with Cohen's kappa coefficients (interobserver n=198; 111 females, 87 males; intraobserver n=93; 56 females, 37 males). For Completion of Task, the average ICC was 0.87 for the prosthesis-on condition, and 0.85 for the prosthesis-off condition. ICCs exceeded 0.80 for eight out of nine tasks for the two older age groups, but for only five out of nine tasks in the younger age groups. Higher inter- and intraobserver kappa coefficients for Method of Use resulted when scoring children with their prostheses on versus off. The oldest age group had lower kappa values than the other three groups. The UBET is recommended for the functional evaluation of Completion of Task in children with unilateral congenital below elbow deficiency with and without their prostheses. Method of Use scoring can evaluate individuals for directed therapy interventions or prosthetic training.
Short Mood and Feelings Questionnaire for screening children and adolescents for plastic surgery: cross-cultural validation study.

PubMed

Sucupira, Eduardo; Sabino, Miguel; Lima, Edson Luiz de; Dini, Gal Moreira; Brito, Maria José Azevedo de; Ferreira, Lydia Masako

2017-01-01

Patient-reported outcome measurements assessing the emotional state of children and adolescents who seek plastic surgery are important for determining whether the intervention is indicated or not. The aim of this study was to cross-culturally adapt and validate the Short Mood and Feelings Questionnaire (child/adolescent and parent versions) for Brazilian Portuguese, test its psychometric properties and assess the emotional state of children and adolescents who seek plastic surgery. DESIGN AND SETTING: Cross-cultural validation study conducted in a plastic surgery outpatient clinic at a public university hospital. A total of 124 consecutive patients of both sexes were selected between September 2013 and February 2014. Forty-seven patients participated in the cultural adaptation of the questionnaire. The final version was tested for reliability on 20 patients. Construct validity was tested on 57 patients by correlating the Short Mood and Feelings Questionnaire (child/adolescent and parent versions) with the Strengths and Difficulties Questionnaire and the Rosenberg Self-Esteem scale. The child/adolescent and parent versions of the Short Mood and Feelings Questionnaire showed Cronbach's alpha of 0.768 and 0.874, respectively, and had good inter-rater reliability (intraclass correlation coefficient, ICC = 0.757 and ICC = 0.853, respectively) and intra-rater reliability (ICC = 0.738 and ICC = 0.796, respectively). The Brazilian-Portuguese version of the Short Mood and Feelings Questionnaire is a reproducible instrument with face, content and construct validity.The mood state and feelings among children and adolescents seeking cosmetic surgery were healthy.
Assessing the inter-observer variability of Computer-Aided Nodule Assessment and Risk Yield (CANARY) to characterize lung adenocarcinomas.

PubMed

Nakajima, Erica C; Frankland, Michael P; Johnson, Tucker F; Antic, Sanja L; Chen, Heidi; Chen, Sheau-Chiann; Karwoski, Ronald A; Walker, Ronald; Landman, Bennett A; Clay, Ryan D; Bartholmai, Brian J; Rajagopalan, Srinivasan; Peikert, Tobias; Massion, Pierre P; Maldonado, Fabien

2018-01-01

Lung adenocarcinoma (ADC), the most common lung cancer type, is recognized increasingly as a disease spectrum. To guide individualized patient care, a non-invasive means of distinguishing indolent from aggressive ADC subtypes is needed urgently. Computer-Aided Nodule Assessment and Risk Yield (CANARY) is a novel computed tomography (CT) tool that characterizes early ADCs by detecting nine distinct CT voxel classes, representing a spectrum of lepidic to invasive growth, within an ADC. CANARY characterization has been shown to correlate with ADC histology and patient outcomes. This study evaluated the inter-observer variability of CANARY analysis. Three novice observers segmented and analyzed independently 95 biopsy-confirmed lung ADCs from Vanderbilt University Medical Center/Nashville Veterans Administration Tennessee Valley Healthcare system (VUMC/TVHS) and the Mayo Clinic (Mayo). Inter-observer variability was measured using intra-class correlation coefficient (ICC). The average ICC for all CANARY classes was 0.828 (95% CI 0.76, 0.895) for the VUMC/TVHS cohort, and 0.852 (95% CI 0.804, 0.901) for the Mayo cohort. The most invasive voxel classes had the highest ICC values. To determine whether nodule size influenced inter-observer variability, an additional cohort of 49 sub-centimeter nodules from Mayo were also segmented by three observers, with similar ICC results. Our study demonstrates that CANARY ADC classification between novice CANARY users has an acceptably low degree of variability, and supports the further development of CANARY for clinical application.
Development and Reliability Evaluation of the Movement Rating Instrument for Virtual Reality Video Game Play.

PubMed

Levac, Danielle; Nawrotek, Joanna; Deschenes, Emilie; Giguere, Tia; Serafin, Julie; Bilodeau, Martin; Sveistrup, Heidi

2016-06-01

Virtual reality active video games are increasingly popular physical therapy interventions for children with cerebral palsy. However, physical therapists require educational resources to support decision making about game selection to match individual patient goals. Quantifying the movements elicited during virtual reality active video game play can inform individualized game selection in pediatric rehabilitation. The objectives of this study were to develop and evaluate the feasibility and reliability of the Movement Rating Instrument for Virtual Reality Game Play (MRI-VRGP). Item generation occurred through an iterative process of literature review and sample videotape viewing. The MRI-VRGP includes 25 items quantifying upper extremity, lower extremity, and total body movements. A total of 176 videotaped 90-second game play sessions involving 7 typically developing children and 4 children with cerebral palsy were rated by 3 raters trained in MRI-VRGP use. Children played 8 games on 2 virtual reality and active video game systems. Intraclass correlation coefficients (ICCs) determined intra-rater and interrater reliability. Excellent intrarater reliability was evidenced by ICCs of >0.75 for 17 of the 25 items across the 3 raters. Interrater reliability estimates were less precise. Excellent interrater reliability was achieved for far reach upper extremity movements (ICC=0.92 [for right and ICC=0.90 for left) and for squat (ICC=0.80) and jump items (ICC=0.99), with 9 items achieving ICCs of >0.70, 12 items achieving ICCs of between 0.40 and 0.70, and 4 items achieving poor reliability (close-reach upper extremity-ICC=0.14 for right and ICC=0.07 for left) and single-leg stance (ICC=0.55 for right and ICC=0.27 for left). Poor video quality, differing item interpretations between raters, and difficulty quantifying the high-speed movements involved in game play affected reliability. With item definition clarification and further psychometric property evaluation, the MRI-VRGP could inform the content of educational resources for therapists by ranking games according to frequency and type of elicited body movements.
Development and Reliability Evaluation of the Movement Rating Instrument for Virtual Reality Video Game Play

PubMed Central

Nawrotek, Joanna; Deschenes, Emilie; Giguere, Tia; Serafin, Julie; Bilodeau, Martin; Sveistrup, Heidi

2016-01-01

Background Virtual reality active video games are increasingly popular physical therapy interventions for children with cerebral palsy. However, physical therapists require educational resources to support decision making about game selection to match individual patient goals. Quantifying the movements elicited during virtual reality active video game play can inform individualized game selection in pediatric rehabilitation. Objective The objectives of this study were to develop and evaluate the feasibility and reliability of the Movement Rating Instrument for Virtual Reality Game Play (MRI-VRGP). Methods Item generation occurred through an iterative process of literature review and sample videotape viewing. The MRI-VRGP includes 25 items quantifying upper extremity, lower extremity, and total body movements. A total of 176 videotaped 90-second game play sessions involving 7 typically developing children and 4 children with cerebral palsy were rated by 3 raters trained in MRI-VRGP use. Children played 8 games on 2 virtual reality and active video game systems. Intraclass correlation coefficients (ICCs) determined intra-rater and interrater reliability. Results Excellent intrarater reliability was evidenced by ICCs of >0.75 for 17 of the 25 items across the 3 raters. Interrater reliability estimates were less precise. Excellent interrater reliability was achieved for far reach upper extremity movements (ICC=0.92 [for right and ICC=0.90 for left) and for squat (ICC=0.80) and jump items (ICC=0.99), with 9 items achieving ICCs of >0.70, 12 items achieving ICCs of between 0.40 and 0.70, and 4 items achieving poor reliability (close-reach upper extremity-ICC=0.14 for right and ICC=0.07 for left) and single-leg stance (ICC=0.55 for right and ICC=0.27 for left). Conclusions Poor video quality, differing item interpretations between raters, and difficulty quantifying the high-speed movements involved in game play affected reliability. With item definition clarification and further psychometric property evaluation, the MRI-VRGP could inform the content of educational resources for therapists by ranking games according to frequency and type of elicited body movements. PMID:27251029
Quantitative 4D Transcatheter Intraarterial Perfusion MR Imaging as a Method to Standardize Angiographic Chemoembolization Endpoints

PubMed Central

Jin, Brian; Wang, Dingxin; Lewandowski, Robert J.; Ryu, Robert K.; Sato, Kent T.; Larson, Andrew C.; Salem, Riad; Omary, Reed A.

2011-01-01

PURPOSE We aimed to test the hypothesis that subjective angiographic endpoints during transarterial chemoembolization (TACE) of hepatocellular carcinoma (HCC) exhibit consistency and correlate with objective intraprocedural reductions in tumor perfusion as determined by quantitative four dimensional (4D) transcatheter intraarterial perfusion (TRIP) magnetic resonance (MR) imaging. MATERIALS AND METHODS This prospective study was approved by the institutional review board. Eighteen consecutive patients underwent TACE in a combined MR/interventional radiology (MR-IR) suite. Three board-certified interventional radiologists independently graded the angiographic endpoint of each procedure based on a previously described subjective angiographic chemoembolization endpoint (SACE) scale. A consensus SACE rating was established for each patient. Patients underwent quantitative 4D TRIP-MR imaging immediately before and after TACE, from which mean whole tumor perfusion (Fρ) was calculated. Consistency of SACE ratings between observers was evaluated using the intraclass correlation coefficient (ICC). The relationship between SACE ratings and intraprocedural TRIP-MR imaging perfusion changes was evaluated using Spearman’s rank correlation coefficient. RESULTS The SACE rating scale demonstrated very good consistency among all observers (ICC = 0.80). The consensus SACE rating was significantly correlated with both absolute (r = 0.54, P = 0.022) and percent (r = 0.85, P < 0.001) intraprocedural perfusion reduction. CONCLUSION The SACE rating scale demonstrates very good consistency between raters, and significantly correlates with objectively measured intraprocedural perfusion reductions during TACE. These results support the use of the SACE scale as a standardized alternative method to quantitative 4D TRIP-MR imaging to classify patients based on embolic endpoints of TACE. PMID:22021520
Cross-cultural adaptation and validation of the Persian version of the Intermittent and Constant Osteoarthritis Pain Measure for the knee.

PubMed

Panah, Sara Hojat; Baharlouie, Hamze; Rezaeian, Zahra Sadat; Hawker, Gilian

2016-01-01

The present study aimed to translate and evaluate the reliability and validity of the Persian version of the 11-item Intermittent and Constant Osteoarthritis Pain (ICOAP) measure in Iranian subjects with Knee Osteoarthritis (KOA). The ICOAP questionnaire was translated according to the Manufacturers Alliance for Productivity and Innovation (MAPI) protocol. The procedure consisted of forward and backward translation, as well as the assessment of the psychometric properties of the Persian version of the questionnaire. A sample of 230 subjects with KOA was asked to complete the Persian versions of ICOAP and Knee injury and Osteoarthritis Outcome Score (KOOS). The ICOAP was readministered to forty subjects five days after the first visit. Test-retest reliability was assessed using Intraclass Correlation Coefficient (ICC), and internal consistency was assessed by Cronbach's alpha and item-total correlation. The correlation between ICOAP and KOOS was determined using Spearman's correlation coefficient. Subjects found the Persian-version of the ICOAP to be clear, simple, and unambiguous, confirming its face validity. Spearman correlations between ICOAP total and subscale scores with KOOS scores were between 0.5 and 0.7, confirming construct validity. Cronbach's alpha, used to assess internal consistency, was 0.89, 0.93, and 0.92 for constant pain, intermittent pain, and total pain scores, respectively. The ICC was 0.90 for constant pain and 0.91 for the intermittent pain and total pain score. The Persian version of the ICOAP is a reliable and valid outcome measure that can be used in Iranian subjects with KOA.
Dynamic contrast-enhanced MR imaging of the rectum: Correlations between single-section and whole-tumor histogram analyses.

PubMed

Choi, M H; Oh, S N; Park, G E; Yeo, D-M; Jung, S E

2018-05-10

To evaluate the interobserver and intermethod correlations of histogram metrics of dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) parameters acquired by multiple readers using the single-section and whole-tumor volume methods. Four DCE parameters (K trans , K ep , V e , V p ) were evaluated in 45 patients (31 men and 14 women; mean age, 61±11 years [range, 29-83 years]) with locally advanced rectal cancer using pre-chemoradiotherapy (CRT) MRI. Ten histogram metrics were extracted using two methods of lesion selection performed by three radiologists: the whole-tumor volume method for the whole tumor on axial section-by-section images and the single-section method for the entire area of the tumor on one axial image. The interobserver and intermethod correlations were evaluated using the intraclass correlation coefficients (ICCs). The ICCs showed excellent interobserver and intermethod correlations in most of histogram metrics of the DCE parameters. The ICCs among the three readers were > 0.7 (P<0.001) for all histogram metrics, except for the minimum and maximum. The intermethod correlations for most of the histogram metrics were excellent for each radiologist, regardless of the differences in the radiologists' experience. The interobserver and intermethod correlations for most of the histogram metrics of the DCE parameters are excellent in rectal cancer. Therefore, the single-section method may be a potential alternative to the whole-tumor volume method using pre-CRT MRI, despite the fact that the high agreement between the two methods cannot be extrapolated to post-CRT MRI. Copyright © 2018 Société française de radiologie. Published by Elsevier Masson SAS. All rights reserved.
Repeatability of Ophtha Top topography and comparison with IOL-Master and LenstarLS900 in cataract patients

PubMed Central

Yu, Sha-Sha; Song, Hui; Tang, Xin

2017-01-01

AIM To determine the repeatability of Ophtha Top topography and assess the consistency with intraocular lens (IOL)-Master and LenstarLS900 (Lenstar) in measuring corneal parameters among cataract patients. METHODS Totally 125 eyes were enrolled. Corneas were successively measured with Ophtha Top, IOL-Master and Lenstar at least three times. The flattest meridian power (Kf), the steepest meridian power (Ks), mean power (Km), J0 and J45 were recorded. Intra-class correlation coefficients (ICCs), the coefficient of variance (COV), within subject standard deviation (Sw), and test-retest repeatability (2.77Sw) were adopted to determine the repeatability. The 95% limit of agreement (95%LOA) and Bland-Altman plots were used to assess comparability. RESULTS Repeatability of Ophtha Top topography for measuring corneal parameters showed the ICCs were all above 0.93, 2.77Sw was lower than 0.31, and the COV of the Kf and Ks was lower than 0.25. The keratometric readings with Ophtha Top topography were flatter than with the IOL-Master and Lenstar devices, while the Pearson correlation coefficients were over 0.97. The J0 and J45 with Ophtha Top topography were smaller compared with Lenstar and IOL-Master, while was comparable between Lenstar and IOL-Master. CONCLUSION Ophtha Top topography shows excellent repeatability for measuring corneal parameters. However, differences between the Ophtha TOP topography and Lenstar, IOL-Master both in cornea curvature and the astigmatism should be noted clinically. PMID:29181314
Repeatability of Ophtha Top topography and comparison with IOL-Master and LenstarLS900 in cataract patients.

PubMed

Yu, Sha-Sha; Song, Hui; Tang, Xin

2017-01-01

To determine the repeatability of Ophtha Top topography and assess the consistency with intraocular lens (IOL)-Master and LenstarLS900 (Lenstar) in measuring corneal parameters among cataract patients. Totally 125 eyes were enrolled. Corneas were successively measured with Ophtha Top, IOL-Master and Lenstar at least three times. The flattest meridian power (Kf), the steepest meridian power (Ks), mean power (Km), J0 and J45 were recorded. Intra-class correlation coefficients (ICCs), the coefficient of variance (COV), within subject standard deviation (Sw), and test-retest repeatability (2.77Sw) were adopted to determine the repeatability. The 95% limit of agreement (95%LOA) and Bland-Altman plots were used to assess comparability. Repeatability of Ophtha Top topography for measuring corneal parameters showed the ICCs were all above 0.93, 2.77Sw was lower than 0.31, and the COV of the Kf and Ks was lower than 0.25. The keratometric readings with Ophtha Top topography were flatter than with the IOL-Master and Lenstar devices, while the Pearson correlation coefficients were over 0.97. The J0 and J45 with Ophtha Top topography were smaller compared with Lenstar and IOL-Master, while was comparable between Lenstar and IOL-Master. Ophtha Top topography shows excellent repeatability for measuring corneal parameters. However, differences between the Ophtha TOP topography and Lenstar, IOL-Master both in cornea curvature and the astigmatism should be noted clinically.
Explicating the Conditions Under Which Multilevel Multiple Imputation Mitigates Bias Resulting from Random Coefficient-Dependent Missing Longitudinal Data.

PubMed

Gottfredson, Nisha C; Sterba, Sonya K; Jackson, Kristina M

2017-01-01

Random coefficient-dependent (RCD) missingness is a non-ignorable mechanism through which missing data can arise in longitudinal designs. RCD, for which we cannot test, is a problematic form of missingness that occurs if subject-specific random effects correlate with propensity for missingness or dropout. Particularly when covariate missingness is a problem, investigators typically handle missing longitudinal data by using single-level multiple imputation procedures implemented with long-format data, which ignores within-person dependency entirely, or implemented with wide-format (i.e., multivariate) data, which ignores some aspects of within-person dependency. When either of these standard approaches to handling missing longitudinal data is used, RCD missingness leads to parameter bias and incorrect inference. We explain why multilevel multiple imputation (MMI) should alleviate bias induced by a RCD missing data mechanism under conditions that contribute to stronger determinacy of random coefficients. We evaluate our hypothesis with a simulation study. Three design factors are considered: intraclass correlation (ICC; ranging from .25 to .75), number of waves (ranging from 4 to 8), and percent of missing data (ranging from 20 to 50%). We find that MMI greatly outperforms the single-level wide-format (multivariate) method for imputation under a RCD mechanism. For the MMI analyses, bias was most alleviated when the ICC is high, there were more waves of data, and when there was less missing data. Practical recommendations for handling longitudinal missing data are suggested.
Isometric abdominal wall muscle strength assessment in individuals with incisional hernia: a prospective reliability study.

PubMed

Jensen, K K; Kjaer, M; Jorgensen, L N

2016-12-01

To determine the reliability of measurements obtained by the Good Strength dynamometer, determining isometric abdominal wall and back muscle strength in patients with ventral incisional hernia (VIH) and healthy volunteers with an intact abdominal wall. Ten patients with VIH and ten healthy volunteers with an intact abdominal wall were each examined twice with a 1 week interval. Examination included the assessment of truncal flexion and extension as measured with the Good Strength dynamometer, the completion of the International Physical Activity Questionnaire (IPAQ) and the self-assessment of truncal strength on a visual analogue scale (SATS). The test-retest reliability of truncal flexion and extension was assessed by interclass correlation coefficient (ICC), and Bland and Altman graphs. Finally, correlations between truncal strength, and IPAQ and SATS were examined. Truncal flexion and extension showed excellent test-retest reliability for both patients with VIH (ICC 0.91 and 0.99) and healthy controls (ICC 0.97 and 0.96). Bland and Altman plots showed that no systematic bias was present for neither truncal flexion nor extension when assessing reliability. For patients with VIH, no significant correlations between objective measures of truncal strength and IPAQ or SATS were found. For healthy controls, both truncal flexion (τ 0.58, p = 0.025) and extension (τ 0.58, p = 0.025) correlated significantly with SATS, while no other significant correlation between truncal strength measures and IPAQ was found. The Good Strength dynamometer provided a reliable, low-cost measure of truncal flexion and extension in patients with VIH.
Isometric hand grip strength measured by the Nintendo Wii Balance Board - a reliable new method.

PubMed

Blomkvist, A W; Andersen, S; de Bruin, E D; Jorgensen, M G

2016-02-03

Low hand grip strength is a strong predictor for both long-term and short-term disability and mortality. The Nintendo Wii Balance Board (WBB) is an inexpensive, portable, wide-spread instrument with the potential for multiple purposes in assessing clinically relevant measures including muscle strength. The purpose of the study was to explore intrarater reliability and concurrent validity of the WBB by comparing it to the Jamar hand dynamometer. Intra-rater test-retest cohort design with randomized validity testing on the first session. Using custom WBB software, thirty old adults (69.0 ± 4.2 years of age) were studied for reproducibility and concurrent validity compared to the Jamar hand dynamometer. Reproducibility was tested for dominant and non-dominant hands during the same time-of-day, one week apart. Intraclass correlation coefficient (ICC) and standard error of measurement (SEM) and limits of agreement (LOA) were calculated to describe relative and absolute reproducibility respectively. To describe concurrent validity, Pearson's product-moment correlation and ICC was calculated. Reproducibility was high with ICC values of >0.948 across all measures. Both SEM and LOA were low (0.2-0.5 kg and 2.7-4.2 kg, respectively) in both the dominant and non-dominant hand. For validity, Pearson correlations were high (0.80-0.88) and ICC values were fair to good (0.763-0.803). Reproducibility for WBB was high for relative measures and acceptable for absolute measures. In addition, concurrent validity between the Jamar hand dynamometer and the WBB was acceptable. Thus, the WBB may be a valid instrument to assess hand grip strength in older adults.
Ultrasound is a reproducible and valid tool for measuring scar height in children with burn scars: A cross-sectional study of the psychometric properties and utility of the ultrasound and 3D camera.

PubMed

Simons, M; Kee, E Gee; Kimble, R; Tyack, Z

2017-08-01

The aim of this study was to investigate the reproducibility and validity of measuring scar height in children using ultrasound and 3D camera. Using a cross-sectional design, children with discrete burn scars were included. Reproducibility was tested using Intraclass Correlation Coefficient (ICC) for reliability, and percentage agreement within 1mm between test and re-test, standard error of measurement (SEM), smallest detectable change (SDC) and Bland Altman limits of agreement for agreement. Concurrent validity was tested using Spearman's rho for support of pre-specified hypotheses. Forty-nine participants (55 scars) were included. For ultrasound, test-retest and inter-rater reproducibility of scar thickness was acceptable for scarred skin (ICC=0.95, SDC=0.06cm and ICC=0.82, SDC=0.14cm). The ultrasound picked up changes of <1mm. Inter-rater reproducibility of maximal scar height using the 3D camera was acceptable (ICC=0.73, SDC=0.55cm). Construct validity of the ultrasound was supported with a strong correlation between the measure of scar thickness and observer ratings of thickness using the POSAS (ρ=0.61). Construct validity of the 3D camera was also supported with a moderate correlation (ρ=0.37) with the same measure using maximal scar height. The ultrasound is capable of detecting smaller changes or differences in scar thickness than the 3D camera, in children with burn scars. However agreement as part of reproducibility was lower than expected between raters for the ultrasound. Improving the accuracy of scar relocation may go some way to address agreement. Crown Copyright © 2017. Published by Elsevier Ltd. All rights reserved.
Reliability and Agreement Between Metrics of Cone Spacing in Adaptive Optics Images of the Human Retinal Photoreceptor Mosaic.

PubMed

Giannini, Daniela; Lombardo, Giuseppe; Mariotti, Letizia; Devaney, Nicholas; Serrao, Sebastiano; Lombardo, Marco

2017-06-01

To assess reliability and agreement among three metrics used to evaluate the distribution of cell distances in adaptive optics (AO) images of the cone mosaic. Using an AO flood illumination retinal camera, we acquired images of the cone mosaic in 20 healthy subjects and 12 patients with retinal diseases. The three spacing metrics studied were the center-to-center spacing (Scc), the local cone spacing (LCS), and the density recovery profile distance (DRPD). Each metric was calculated in sampling areas of different sizes (64 × 64 μm and 204 × 204 μm) across the parafovea. Both Scc and LCS were able to discriminate between healthy subjects and patients with retinal diseases; DRPD did not reliably detect any abnormality in the distribution of cell distances in patients with retinal diseases. The agreement between Scc and LCS was high in healthy subjects (intraclass correlation coefficient [ICC] ≥ 0.79) and moderate in patients with retinal diseases (ICC ≤ 0.51). The DRPD had poor agreement with Scc (ICC ≤ 0.47) and LCS (ICC ≤ 0.37). The correlation between the spacing metrics of the two sampling areas was greater in healthy subjects than in patients with retinal diseases. The Scc and LCS provided interchangeable estimates of cone distance in AO retinal images of healthy subjects but could not be used interchangeably when investigating retinal diseases with significant cell reflectivity loss (≥30%). The DRPD was unreliable for describing cell distance in a human retinal cone mosaic and did not correlate with Scc and LCS. Caution is needed when comparing spacing metrics evaluated in sampling areas of different sizes.
Psychometric properties of the Malay Version of the hospital anxiety and depression scale: a study of husbands of breast cancer patients in Kuala Lumpur, Malaysia.

PubMed

Yusoff, Nasir; Low, Wah Yun; Yip, Cheng-Har

2011-01-01

The main objective of this paper is to examine the psychometric properties of the Malay Version of the Hospital Anxiety and Depression Scale (HADS), tested on 67 husbands of the women who were diagnosed with breast cancer. The eligible husbands were retrieved from the Clinical Oncology Clinic at three hospitals in Kuala Lumpur, Malaysia. Data was collected at three weeks and ten weeks following surgery for breast cancer of their wives. The psychometric properties of the HADS were reported based on Cronbach' alpha, Intraclass Correlation Coefficients (ICC), Effect Size Index (ESI), sensitivity and discriminity of the scale. Internal consistency of the scale is excellent, with Cronbach's alpha of 0.88 for Anxiety subscale and 0.79 for Depression subscale. Test-retest Intraclass Correlation Coefficient (ICC) is 0.35 and 0.42 for Anxiety and Depression Subscale, respectively. Small mean differences were observed at test-retest measurement with ESI of 0.21 for Anxiety and 0.19 for Depression. Non-significant result was revealed for the discriminant validity (mastectomy vs lumpectomy). The Malay Version of the HADS is appropriate to measure the anxiety and depression among the husbands of the women with breast cancer in Malaysia.

Excellent Intra and Inter-Observer Reproducibility of Wrist Circumference Measurements in Obese Children and Adolescents

PubMed Central

Campagna, Giuseppe; Zampetti, Simona; Gallozzi, Alessia; Giansanti, Sara; Chiesa, Claudio; Pacifico, Lucia; Buzzetti, Raffaella

2016-01-01

In a previous study, we found that wrist circumference, in particular its bone component, was associated with insulin resistance in a population of overweight/obese children. The aim of the present study was to evaluate the intra- and inter-operator variability in wrist circumference measurement in a population of obese children and adolescents. One hundred and two (54 male and 48 female) obese children and adolescents were consecutively enrolled. In all subjects wrist circumferences were measured by two different operators two times to assess intra- and inter-operator variability. Statistical analysis was performed using SAS v.9.4 and JMP v.12. Measurements of wrist circumference showed excellent inter-operator reliability with Intra class Correlation Coefficients (ICC) of 0.96 and ICC of 0.97 for the first and the second measurement, respectively. The intra-operator reliability was, also, very strong with a Concordance Correlation Coefficient (CCC) of 0.98 for both operators. The high reproducibility demonstrated in our results suggests that wrist circumference measurement, being safe, non-invasive and repeatable can be easily used in out-patient settings to identify youths with increased risk of insulin-resistance. This can avoid testing the entire population of overweight/obese children for insulin resistance parameters. PMID:27294398
Development and psychometric testing of a trans-professional evidence-based practice profile questionnaire.

PubMed

McEvoy, Maureen Patricia; Williams, Marie T; Olds, Timothy Stephen

2010-01-01

Previous survey tools operationalising knowledge, attitudes or beliefs about evidence-based practice (EBP) have shortcomings in content, psychometric properties and target audience. This study developed and psychometrically assessed a self-report trans-professional questionnaire to describe an EBP profile. Sixty-six items were collated from existing EBP questionnaires and administered to 526 academics and students from health and non-health backgrounds. Principal component factor analysis revealed the presence of five factors (Relevance, Terminology, Confidence, Practice and Sympathy). Following expert panel review and pilot testing, the 58-item final questionnaire was disseminated to 105 subjects on two occasions. Test-retest and internal reliability were quantified using intra-class correlation coefficients (ICCs) and Cronbach's alpha, convergent validity against a commonly used EBP questionnaire by Pearson's correlation coefficient and discriminative validity via analysis of variance (ANOVA) based on exposure to EBP training. The final questionnaire demonstrated acceptable internal consistency (Cronbach's alpha 0.96), test-retest reliability (ICCs range 0.77-0.94) and convergent validity (Practice 0.66, Confidence 0.80 and Sympathy 0.54). Three factors (Relevance, Terminology and Confidence) distinguished EBP exposure groups (ANOVA p < 0.001-0.004). The evidence-based practice profile (EBP(2)) questionnaire is a reliable instrument with the ability to discriminate for three factors, between respondents with differing EBP exposures.
Cross-cultural adaptation and validation of the Italian Psychosocial Impact of Dental Aesthetics Questionnaire (PIDAQ).

PubMed

Bucci, Rosaria; Rongo, Roberto; Zito, Eugenio; Galeotti, Angela; Valletta, Rosa; D'Antò, Vincenzo

2015-03-01

To validate and cross-culturally adapt the Italian version of the Psychological Impact of Dental Aesthetics Questionnaire (PIDAQ) among Italian young adults. After translation, back translation, and cross-cultural adaptation of the English PIDAQ, a first version of the Italian questionnaire was pretested. The final Italian PIDAQ was administered to 598 subjects aged 18-30 years, along with two other instruments: the aesthetic component of the index of orthodontic treatment need (IOTN-AC) and the perception of occlusion scale (POS), which identified the self-reporting grade of malocclusion. Structural validity was assessed by means of factorial analysis, internal consistency was measured with Cronbach's alpha coefficient (α), convergent validity was assessed by means of Spearman correlation, and test-retest reliability was calculated with intra-class correlation coefficient (ICC) and standard measurement error. Criterion validity was evaluated by multivariate and univariate analysis of variance with Bonferroni post hoc tests. The α of the Italian PIDAQ domains ranged between 0.79 and 0.92. The ICC was between 0.81 and 0.90. The mean scores of each PIDAQ domain showed a statistically significant difference when analysed according to the IOTN-AC and POS scores. The satisfactory psychometric properties make PIDAQ a usable tool for future studies on oral health-related quality of life among Italian young adults.
Comparison of Collection Methods for Fecal Samples in Microbiome Studies

PubMed Central

Vogtmann, Emily; Chen, Jun; Amir, Amnon; Shi, Jianxin; Abnet, Christian C.; Nelson, Heidi; Knight, Rob; Chia, Nicholas; Sinha, Rashmi

2017-01-01

Prospective cohort studies are needed to assess the relationship between the fecal microbiome and human health and disease. To evaluate fecal collection methods, we determined technical reproducibility, stability at ambient temperature, and accuracy of 5 fecal collection methods (no additive, 95% ethanol, RNAlater Stabilization Solution, fecal occult blood test cards, and fecal immunochemical test tubes). Fifty-two healthy volunteers provided fecal samples at the Mayo Clinic in Rochester, Minnesota, in 2014. One set from each sample collection method was frozen immediately, and a second set was incubated at room temperature for 96 hours and then frozen. Intraclass correlation coefficients (ICCs) were calculated for the relative abundance of 3 phyla, 2 alpha diversity metrics, and 4 beta diversity metrics. Technical reproducibility was high, with ICCs for duplicate fecal samples between 0.64 and 1.00. Stability for most methods was generally high, although the ICCs were below 0.60 for 95% ethanol in metrics that were more sensitive to relative abundance. When compared with fecal samples that were frozen immediately, the ICCs were below 0.60 for the metrics that were sensitive to relative abundance; however, the remaining 2 alpha diversity and 3 beta diversity metrics were all relatively accurate, with ICCs above 0.60. In conclusion, all fecal sample collection methods appear relatively reproducible, stable, and accurate. Future studies could use these collection methods for microbiome analyses. PMID:27986704
Automated lobar quantification of emphysema in patients with severe COPD.

PubMed

Revel, Marie-Pierre; Faivre, Jean-Baptiste; Remy-Jardin, Martine; Deken, Valérie; Duhamel, Alain; Marquette, Charles-Hugo; Tacelli, Nunzia; Bakai, Anne-Marie; Remy, Jacques

2008-12-01

Automated lobar quantification of emphysema has not yet been evaluated. Unenhanced 64-slice MDCT was performed in 47 patients evaluated before bronchoscopic lung-volume reduction. CT images reconstructed with a standard (B20) and high-frequency (B50) kernel were analyzed using a dedicated prototype software (MevisPULMO) allowing lobar quantification of emphysema extent. Lobar quantification was obtained following (a) a fully automatic delineation of the lobar limits by the software and (b) a semiautomatic delineation with manual correction of the lobar limits when necessary and was compared with the visual scoring of emphysema severity per lobe. No statistically significant difference existed between automated and semiautomated lobar quantification (p > 0.05 in the five lobes), with differences ranging from 0.4 to 3.9%. The agreement between the two methods (intraclass correlation coefficient, ICC) was excellent for left upper lobe (ICC = 0.94), left lower lobe (ICC = 0.98), and right lower lobe (ICC = 0.80). The agreement was good for right upper lobe (ICC = 0.68) and moderate for middle lobe (IC = 0.53). The Bland and Altman plots confirmed these results. A good agreement was observed between the software and visually assessed lobar predominance of emphysema (kappa 0.78; 95% CI 0.64-0.92). Automated and semiautomated lobar quantifications of emphysema are concordant and show good agreement with visual scoring.
Reliability of the Superimposed-Burst Technique in Patients With Patellofemoral Pain: A Technical Report.

PubMed

Norte, Grant E; Frye, Jamie L; Hart, Joseph M

2015-11-01

The superimposed-burst (SIB) technique is commonly used to quantify central activation failure after knee-joint injury, but its reliability has not been established in pathologic cohorts. To assess within-session and between-sessions reliability of the SIB technique in patients with patellofemoral pain. Descriptive laboratory study. University laboratory. A total of 10 patients with self-reported patellofemoral pain (1 man, 9 women; age = 24.1 ± 3.8 years, height = 167.8 ± 15.2 cm, mass = 71.6 ± 17.5 kg) and 10 healthy control participants (3 men, 7 women; age = 27.4 ± 5.0 years, height = 173.5 ± 9.9 cm, mass = 78.2 ± 16.5 kg) volunteered. Participants were assessed at 6 intervals spanning 21 days. Intraclass correlation coefficients (ICCs [3,3]) were used to assess reliability. Quadriceps central activation ratio, knee-extension maximal voluntary isometric contraction force, and SIB force. The quadriceps central activation ratio was highly reliable within session (ICC [3,3] = 0.97) and between sessions through day 21 (ICC [3,3] = 0.90-0.95). Acceptable reliability of knee extension (ICC [3,3] = 0.75-0.91) and SIB force (ICC [3,3] = 0.77-0.89) was observed through day 21. The SIB technique was reliable for clinical research up to 21 days in patients with patellofemoral pain.
Repeatability and Reproducibility of Retinal Nerve Fiber Layer Parameters Measured by Scanning Laser Polarimetry with Enhanced Corneal Compensation in Normal and Glaucomatous Eyes.

PubMed

Ara, Mirian; Ferreras, Antonio; Pajarin, Ana B; Calvo, Pilar; Figus, Michele; Frezzotti, Paolo

2015-01-01

To assess the intrasession repeatability and intersession reproducibility of peripapillary retinal nerve fiber layer (RNFL) thickness parameters measured by scanning laser polarimetry (SLP) with enhanced corneal compensation (ECC) in healthy and glaucomatous eyes. One randomly selected eye of 82 healthy individuals and 60 glaucoma subjects was evaluated. Three scans were acquired during the first visit to evaluate intravisit repeatability. A different operator obtained two additional scans within 2 months after the first session to determine intervisit reproducibility. The intraclass correlation coefficient (ICC), coefficient of variation (COV), and test-retest variability (TRT) were calculated for all SLP parameters in both groups. ICCs ranged from 0.920 to 0.982 for intravisit measurements and from 0.910 to 0.978 for intervisit measurements. The temporal-superior-nasal-inferior-temporal (TSNIT) average was the highest (0.967 and 0.946) in normal eyes, while nerve fiber indicator (NFI; 0.982) and inferior average (0.978) yielded the best ICC in glaucomatous eyes for intravisit and intervisit measurements, respectively. All COVs were under 10% in both groups, except NFI. TSNIT average had the lowest COV (2.43%) in either type of measurement. Intervisit TRT ranged from 6.48 to 12.84. The reproducibility of peripapillary RNFL measurements obtained with SLP-ECC was excellent, indicating that SLP-ECC is sufficiently accurate for monitoring glaucoma progression.
Test-retest reliability of the Clinical Learning Environment, Supervision and Nurse Teacher (CLES + T) scale.

PubMed

Gustafsson, Margareta; Blomberg, Karin; Holmefur, Marie

2015-07-01

The Clinical Learning Environment, Supervision and Nurse Teacher (CLES + T) scale evaluates the student nurses' perception of the learning environment and supervision within the clinical placement. It has never been tested in a replication study. The aim of the present study was to evaluate the test-retest reliability of the CLES + T scale. The CLES + T scale was administered twice to a group of 42 student nurses, with a one-week interval. Test-retest reliability was determined by calculations of Intraclass Correlation Coefficients (ICCs) and weighted Kappa coefficients. Standard Error of Measurements (SEM) and Smallest Detectable Difference (SDD) determined the precision of individual scores. Bland-Altman plots were created for analyses of systematic differences between the test occasions. The results of the study showed that the stability over time was good to excellent (ICC 0.88-0.96) in the sub-dimensions "Supervisory relationship", "Pedagogical atmosphere on the ward" and "Role of the nurse teacher". Measurements of "Premises of nursing on the ward" and "Leadership style of the manager" had lower but still acceptable stability (ICC 0.70-0.75). No systematic differences occurred between the test occasions. This study supports the usefulness of the CLES + T scale as a reliable measure of the student nurses' perception of the learning environment within the clinical placement at a hospital. Copyright © 2015 Elsevier Ltd. All rights reserved.
Between-day reliability of a method for non-invasive estimation of muscle composition.

PubMed

Simunič, Boštjan

2012-08-01

Tensiomyography is a method for valid and non-invasive estimation of skeletal muscle fibre type composition. The validity of selected temporal tensiomyographic measures has been well established recently; there is, however, no evidence regarding the method's between-day reliability. Therefore it is the aim of this paper to establish the between-day repeatability of tensiomyographic measures in three skeletal muscles. For three consecutive days, 10 healthy male volunteers (mean±SD: age 24.6 ± 3.0 years; height 177.9 ± 3.9 cm; weight 72.4 ± 5.2 kg) were examined in a supine position. Four temporal measures (delay, contraction, sustain, and half-relaxation time) and maximal amplitude were extracted from the displacement-time tensiomyogram. A reliability analysis was performed with calculations of bias, random error, coefficient of variation (CV), standard error of measurement, and intra-class correlation coefficient (ICC) with a 95% confidence interval. An analysis of ICC demonstrated excellent agreement (ICC were over 0.94 in 14 out of 15 tested parameters). However, lower CV was observed in half-relaxation time, presumably because of the specifics of the parameter definition itself. These data indicate that for the three muscles tested, tensiomyographic measurements were reproducible across consecutive test days. Furthermore, we indicated the most possible origin of the lowest reliability detected in half-relaxation time. Copyright © 2012 Elsevier Ltd. All rights reserved.
Reliability of Lactation Assessment Tools Applied to Overweight and Obese Women.

PubMed

Chapman, Donna J; Doughty, Katherine; Mullin, Elizabeth M; Pérez-Escamilla, Rafael

2016-05-01

The interrater reliability of lactation assessment tools has not been evaluated in overweight/obese women. This study aimed to compare the interrater reliability of 4 lactation assessment tools in this population. A convenience sample of 45 women (body mass index > 27.0) was videotaped while breastfeeding (twice daily on days 2, 4, and 7 postpartum). Three International Board Certified Lactation Consultants independently rated each videotaped session using 4 tools (Infant Breastfeeding Assessment Tool [IBFAT], modified LATCH [mLATCH], modified Via Christi [mVC], and Riordan's Tool [RT]). For each day and tool, we evaluated interrater reliability with 1-way repeated-measures analyses of variance, intraclass correlation coefficients (ICCs), and percentage absolute agreement between raters. Analyses of variance showed significant differences between raters' scores on day 2 (all scales) and day 7 (RT). Intraclass correlation coefficient values reflected good (mLATCH) to excellent reliability (IBFAT, mVC, and RT) on days 2 and 7. All day 4 ICCs reflected good reliability. The ICC for mLATCH was significantly lower than all others on day 2 and was significantly lower than IBFAT (day 7). Percentage absolute interrater agreement for scale components ranged from 31% (day 2: observable swallowing, RT) to 92% (day 7: IBFAT, fixing; and mVC, latch time). Swallowing scores on all scales had the lowest levels of interrater agreement (31%-64%). We demonstrated differences in the interrater reliability of 4 lactation assessment tools when applied to overweight/obese women, with the lowest values observed on day 4. Swallowing assessment was particularly unreliable. Researchers and clinicians using these scales should be aware of the differences in their psychometric behavior. © The Author(s) 2015.
Breast MRI radiomics: comparison of computer- and human-extracted imaging phenotypes.

PubMed

Sutton, Elizabeth J; Huang, Erich P; Drukker, Karen; Burnside, Elizabeth S; Li, Hui; Net, Jose M; Rao, Arvind; Whitman, Gary J; Zuley, Margarita; Ganott, Marie; Bonaccio, Ermelinda; Giger, Maryellen L; Morris, Elizabeth A

2017-01-01

In this study, we sought to investigate if computer-extracted magnetic resonance imaging (MRI) phenotypes of breast cancer could replicate human-extracted size and Breast Imaging-Reporting and Data System (BI-RADS) imaging phenotypes using MRI data from The Cancer Genome Atlas (TCGA) project of the National Cancer Institute. Our retrospective interpretation study involved analysis of Health Insurance Portability and Accountability Act-compliant breast MRI data from The Cancer Imaging Archive, an open-source database from the TCGA project. This study was exempt from institutional review board approval at Memorial Sloan Kettering Cancer Center and the need for informed consent was waived. Ninety-one pre-operative breast MRIs with verified invasive breast cancers were analysed. Three fellowship-trained breast radiologists evaluated the index cancer in each case according to size and the BI-RADS lexicon for shape, margin, and enhancement (human-extracted image phenotypes [HEIP]). Human inter-observer agreement was analysed by the intra-class correlation coefficient (ICC) for size and Krippendorff's α for other measurements. Quantitative MRI radiomics of computerised three-dimensional segmentations of each cancer generated computer-extracted image phenotypes (CEIP). Spearman's rank correlation coefficients were used to compare HEIP and CEIP. Inter-observer agreement for HEIP varied, with the highest agreement seen for size (ICC 0.679) and shape (ICC 0.527). The computer-extracted maximum linear size replicated the human measurement with p < 10 -12 . CEIP of shape, specifically sphericity and irregularity, replicated HEIP with both p values < 0.001. CEIP did not demonstrate agreement with HEIP of tumour margin or internal enhancement. Quantitative radiomics of breast cancer may replicate human-extracted tumour size and BI-RADS imaging phenotypes, thus enabling precision medicine.
Reliability and validity of the range of motion scale (ROMS) in patients with abnormal postures.

PubMed

van Rooijen, Diana E; Lalli, Stefania; Marinus, Johan; Maihöfner, Christian; McCabe, Candida S; Munts, Alex G; van der Plas, Anton A; Tijssen, Marina A J; van de Warrenburg, Bart P; Albanese, Alberto; van Hilten, Jacobus J

2015-03-01

Sustained abnormal postures (i.e., fixed dystonia) are the most frequently reported motor abnormalities in complex regional pain syndrome (CRPS), but these symptoms may also develop after peripheral trauma without CRPS. Currently, there is no valid and reliable measurement instrument available to measure the severity and distribution of these postures. The range of motion scale (ROMS) was therefore developed to assess the severity based on the possible active range of motion of all joints (arms, legs, trunk, and neck), and the present study evaluates its reliability and validity. Inter- and intra-rater reliability of the ROMS was determined in 16 patients with abnormal sustained postures, who were videotaped following a standard video protocol in a university hospital. The recordings were rated by a panel of international experts. In addition, 30 patients were clinically tested with both the Burke-Fahn-Marsden (BFM) scale as well as the ROMS to assess construct validity. Inter-rater reliability for total ROMS scores showed an intra-class correlation coefficient (ICC) of 0.85. The majority of the scores for the separate joints (13 out of 18) demonstrated an almost perfect agreement with ICCs ranging from 0.81 to 0.94; of the other items, one showed fair, one moderate, and three substantial agreement. The ICCs for the intra-rater reliability ranged from moderate to almost perfect (0.68-0.98). Spearman's correlation coefficients between corresponding body areas as measured with the ROMS or BFM were all above 0.82. The ROMS is a reliable and valid instrument to evaluate the severity and distribution of sustained abnormal postures. Wiley Periodicals, Inc.
The Reliability of Individualized Load-Velocity Profiles.

PubMed

Banyard, Harry G; Nosaka, K; Vernon, Alex D; Haff, G Gregory

2017-11-15

This study examined the reliability of peak velocity (PV), mean propulsive velocity (MPV), and mean velocity (MV) in the development of load-velocity profiles (LVP) in the full depth free-weight back squat performed with maximal concentric effort. Eighteen resistance-trained men performed a baseline one-repetition maximum (1RM) back squat trial and three subsequent 1RM trials used for reliability analyses, with 48-hours interval between trials. 1RM trials comprised lifts from six relative loads including 20, 40, 60, 80, 90, and 100% 1RM. Individualized LVPs for PV, MPV, or MV were derived from loads that were highly reliable based on the following criteria: intra-class correlation coefficient (ICC) >0.70, coefficient of variation (CV) ≤10%, and Cohen's d effect size (ES) <0.60. PV was highly reliable at all six loads. Importantly, MPV and MV were highly reliable at 20, 40, 60, 80 and 90% but not 100% 1RM (MPV: ICC=0.66, CV=18.0%, ES=0.10, standard error of the estimate [SEM]=0.04m·s -1 ; MV: ICC=0.55, CV=19.4%, ES=0.08, SEM=0.04m·s -1 ). When considering the reliable ranges, almost perfect correlations were observed for LVPs derived from PV 20-100% (r=0.91-0.93), MPV 20-90% (r=0.92-0.94) and MV 20-90% (r=0.94-0.95). Furthermore, the LVPs were not significantly different (p>0.05) between trials, movement velocities, or between linear regression versus second order polynomial fits. PV 20-100% , MPV 20-90% , and MV 20-90% are reliable and can be utilized to develop LVPs using linear regression. Conceptually, LVPs can be used to monitor changes in movement velocity and employed as a method for adjusting sessional training loads according to daily readiness.
Potential reliability and validity of a modified version of the Unified Parkinson’s Disease Rating Scale that could be administered remotely

PubMed Central

Abdolahi, Amir; Scoglio, Nicholas; Killoran, Annie; Dorsey, Ray; Biglan, Kevin M.

2013-01-01

Background By permitting remote assessments of patients and research participants, telemedicine has the potential to reshape clinical care and clinical trials for Parkinson disease. While the majority of the motor Unified Parkinson’s Disease Rating Scale (UPDRS) items can be conducted visually, rigidity and retropulsion pull testing require hands-on assessment by the rater and are less feasible to perform remotely in patients' homes. Methods In a secondary data analysis of the Comparison of the Agonist pramipexole vs. Levodopa on Motor complications in Parkinson’s Disease (CALM-PD) study, a randomized clinical trial, we assessed the cross-sectional (baseline and 2 years) and longitudinal (change from baseline to 2 years) reliability of a modified motor UPDRS (removing rigidity and retropulsion items) compared to the standard motor UPDRS (all items) using intraclass correlation coefficients (ICC), stratified by treatment group. Internal consistency of the modified UPDRS (mUPDRS) was measured using Cronbach’s alpha, and concurrent validity was assessed using Pearson’s correlation coefficient (r) between the standard motor UPDRS and mUPDRS. Results The mUPDRS versus standard motor UPDRS is cross-sectionally (ICC ≥ 0.92) and longitudinally (ICC ≥ 0.92) reliable for both treatment groups. High internal consistencies were also observed (α ≥ 0.96). The mUPDRS had high concurrent validity with the standard UPDRS at both time points and longitudinally (r ≥ 0.93, p < 0.0001). Conclusions A modified version of the motor UPDRS without rigidity and retropulsion pull testing is reliable and valid and may lay the foundation for its use in remote assessments of patients and research participants. PMID:23102808
Potential reliability and validity of a modified version of the Unified Parkinson's Disease Rating Scale that could be administered remotely.

PubMed

Abdolahi, Amir; Scoglio, Nicholas; Killoran, Annie; Dorsey, E Ray; Biglan, Kevin M

2013-02-01

By permitting remote assessments of patients and research participants, telemedicine has the potential to reshape clinical care and clinical trials for Parkinson disease. While the majority of the motor Unified Parkinson's Disease Rating Scale (UPDRS) items can be conducted visually, rigidity and retropulsion pull testing require hands-on assessment by the rater and are less feasible to perform remotely in patients' homes. In a secondary data analysis of the Comparison of the Agonist pramipexole vs. Levodopa on Motor complications in Parkinson's Disease (CALM-PD) study, a randomized clinical trial, we assessed the cross-sectional (baseline and 2 years) and longitudinal (change from baseline to 2 years) reliability of a modified motor UPDRS (removing rigidity and retropulsion items) compared to the standard motor UPDRS (all items) using intraclass correlation coefficients (ICC), stratified by treatment group. Internal consistency of the modified UPDRS (mUPDRS) was measured using Cronbach's alpha, and concurrent validity was assessed using Pearson's correlation coefficient (r) between the standard motor UPDRS and mUPDRS. The mUPDRS versus standard motor UPDRS is cross-sectionally (ICC ≥ 0.92) and longitudinally (ICC ≥ 0.92) reliable for both treatment groups. High internal consistencies were also observed (α ≥ 0.96). The mUPDRS had high concurrent validity with the standard UPDRS at both time points and longitudinally (r ≥ 0.93, p < 0.0001). A modified version of the motor UPDRS without rigidity and retropulsion pull testing is reliable and valid and may lay the foundation for its use in remote assessments of patients and research participants. Copyright © 2012 Elsevier Ltd. All rights reserved.
Concurrent validity and interrater reliability of a new smartphone application to assess 3D active cervical range of motion in patients with neck pain.

PubMed

Stenneberg, Martijn S; Busstra, Harm; Eskes, Michel; van Trijffel, Emiel; Cattrysse, Erik; Scholten-Peeters, Gwendolijne G M; de Bie, Rob A

2018-04-01

There is a lack of valid, reliable, and feasible instruments for measuring planar active cervical range of motion (aCROM) and associated 3D coupling motions in patients with neck pain. Smartphones have advanced sensors and appear to be suitable for these measurements. To estimate the concurrent validity and interrater reliability of a new iPhone application for assessing planar aCROM and associated 3D coupling motions in patients with neck pain, using an electromagnetic tracking device as a reference test. Cross-sectional study. Two samples of neck pain patients were recruited; 30 patients for the validity study and 26 patients for the reliability study. Validity was estimated using intraclass correlation coefficients (ICCs), and by calculating 95% limits of agreement (LoA). To estimate interrater reliability, ICCs were calculated. Cervical 3D coupling motions were analyzed by calculating the cross-correlation coefficients and ratio between the main motions and coupled motions for both instruments. ICCs for concurrent validity and interrater reliability ranged from 0.90 to 0.99. The width of the 95% LoA ranged from about 5° for right lateral bending to 11° for total rotation. No significant differences were found between both devices for associated coupling motion analysis. The iPhone application appears to be a useful discriminative tool for the measurement of planar aCROM and associated coupling motions in patients with neck pain. It fulfills the need for a valid, reliable, and feasible instrument in clinical practice and research. Therapists and researchers should consider measurement error when interpreting scores. Copyright © 2017 Elsevier Ltd. All rights reserved.
Paraspinal skin temperature patterns: an interexaminer and intraexaminer reliability study.

PubMed

Owens, Edward F; Hart, John F; Donofrio, Joseph J; Haralambous, Jason; Mierzejewski, Eric

2004-01-01

Paraspinal thermography is used by chiropractors as an aid in assessing the presence of vertebral subluxation. Few reliability studies have been carried out, with mixed results. Digital infrared scanning equipment is now available with location tracking that may enhance reproducibility. Digitized scans enable a computer-aided interpretation of thermographic patterns. To assess the ability of examiners to reproduce thermal patterns. Repeated measures with 2 examiners assessing the same patient on 2 occasions. Thirty asymptomatic students served as subjects. A TyTron C-3000 handheld thermographic scanner interfaced to a Microsoft Windows compatible personal computer was used for all recordings. Each examiner recorded 2 scans on each patient. It took an average of 3 minutes to complete all 4 scans. Data were exported to a spreadsheet for initial analysis, then SPSS was used for calculation of intraclass correlation coefficients (ICC). Since the starting and stopping points of scans were not always the same, care was taken to align scans visually, using well-distinguished peaks on the charts as guides. Scans were cropped to remove artifacts that might have occurred at the beginning and end of the scans. Intraexaminer and interexaminer ICCs were calculated. Skin temperatures ranged from 35.4 degrees C to 30.0 degrees C over all scans. The average temperatures changed little from the first to the last scans, indicating that subjects' overall skin temperatures were stable during the scanning procedure. Intraexaminer ICCs ranged from 0.953 to 0.984. The left and right channel data show slightly higher congruence than the Delta channel. The interexaminer reliability coefficients ranged from 0.918 to 0.975. Again, the Delta channel shows slightly less reliability, although the ICCs were quite high for all channels. Intraexaminer and interexaminer reliability of paraspinal thermal scans using the TyTron C-3000 were found to be very high, with ICC values between 0.91 and 0.98. Changes seen in thermal scans when properly done are most likely due to actual physiological changes rather than equipment error.
Reproducibility of DCE-MRI time-intensity curve-shape analysis in patients with knee arthritis: A comparison with qualitative and pharmacokinetic analyses.

PubMed

van der Leij, Christiaan; Lavini, Cristina; van de Sande, Marleen G H; de Hair, Marjolein J H; Wijffels, Christophe; Maas, Mario

2015-12-01

To compare the between-session reproducibility of dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) combined with time-intensity curve (TIC)-shape analysis in arthritis patients, within one scanner and between two different scanners, and to compare this method with qualitative analysis and pharmacokinetic modeling (PKM). Fifteen knee joint arthritis patients were included and scanned twice on a closed-bore 1.5T scanner (n = 9, group 1), or on a closed-bore 1.5T and on an open-bore 1.0T scanner (n = 6, group 2). DCE-MRI data were postprocessed using in-house developed software ("Dynamo"). Disease activity was assessed. Disease activity was comparable between the two visits. In group 1 qualitative analysis showed the highest reproducibility with intraclass correlation coefficients (ICCs) between 0.78 and 0.98 and root mean square-coefficients of variation (RMS-CoV) of 8.0%-14.9%. TIC-shape analysis showed a slightly lower reproducibility with similar ICCs (0.78-0.97) but higher RMS-CoV (18.3%-42.9%). The PKM analysis showed the lowest reproducibility with ICCs between 0.39 and 0.64 (RMS-CoV 21.5%-51.9%). In group 2 TIC-shape analysis of the two most important TIC-shape types showed the highest reproducibility with ICCs of 0.78 and 0.71 (RMS-CoV 29.8% and 59.4%) and outperformed the reproducibility of the most important qualitative parameter (ICC 0.31, RMS-CoV 45.1%) and the within-scanner reproducibility of PKM analysis. TIC-shape analysis is a robust postprocessing method within one scanner, almost as reproducible as the qualitative analysis. Between scanners, the reproducibility of the most important TIC-shapes outperform that of the most important qualitative parameter and the within-scanner reproducibility of PKM analysis. © 2015 Wiley Periodicals, Inc.
Global longitudinal strain software upgrade: Implications for intervendor consistency and longitudinal imaging studies.

PubMed

Castel, Anne-Laure; Menet, Aymeric; Ennezat, Pierre-Vladimir; Delelis, François; Le Goffic, Caroline; Binda, Camille; Guerbaai, Raphaëlle-Ashley; Levy, Franck; Graux, Pierre; Tribouilloy, Christophe; Maréchaux, Sylvestre

2016-01-01

Speckle tracking can be used to measure left ventricular global longitudinal strain (GLS). To study the effect of speckle tracking software product upgrades on GLS values and intervendor consistency. Subjects (patients or healthy volunteers) underwent systematic echocardiography with equipment from Philips and GE, without a change in their position. Off-line post-processing for GLS assessment was performed with the former and most recent upgrades from these two vendors (Philips QLAB 9.0 and 10.2; GE EchoPAC 12.1 and 13.1.1). GLS was obtained in three myocardial layers with EchoPAC 13.1.1. Intersoftware and intervendor consistency was assessed. Interobserver variability was tested in a subset of patients. Among 73 subjects (65 patients and 8 healthy volunteers), absolute values of GLS were higher with QLAB 10.2 compared with 9.0 (intraclass correlation coefficient [ICC]: 0.88; bias: 2.2%). Agreement between EchoPAC 13.1.1 and 12.1 varied by myocardial layer (13.1.1 only): midwall (ICC: 0.95; bias: -1.1%), endocardium (ICC: 0.93; bias: 1.6%) and epicardial (ICC: 0.80; bias: -3.3%). Although GLS was comparable for QLAB 9.0 versus EchoPAC 12.1 (ICC: 0.95; bias: 0.5%), the agreement was lower between QLAB 10.2 and EchoPAC 13.1.1 endocardial (ICC: 0.91; bias: 1.1%), midwall (ICC: 0.73; bias: 3.9%) and epicardial (ICC: 0.54; bias: 6.0%). Interobserver variability of all software products in a subset of 20 patients was excellent (ICC: 0.97-0.99; bias: -0.8 to 1.0%). Upgrades of speckle tracking software may be associated with significant changes in GLS values, which could affect intersoftware and intervendor consistency. This finding has important clinical implications for the longitudinal follow-up of patients with speckle tracking echocardiography. Copyright © 2015 Elsevier Masson SAS. All rights reserved.
Quantification of atrial dynamics using cardiovascular magnetic resonance: inter-study reproducibility.

PubMed

Kowallick, Johannes T; Morton, Geraint; Lamata, Pablo; Jogiya, Roy; Kutty, Shelby; Hasenfuß, Gerd; Lotz, Joachim; Nagel, Eike; Chiribiri, Amedeo; Schuster, Andreas

2015-05-17

Cardiovascular magnetic resonance (CMR) offers quantification of phasic atrial functions based on volumetric assessment and more recently, on CMR feature tracking (CMR-FT) quantitative strain and strain rate (SR) deformation imaging. Inter-study reproducibility is a key requirement for longitudinal studies but has not been defined for CMR-based quantification of left atrial (LA) and right atrial (RA) dynamics. Long-axis 2- and 4-chamber cine images were acquired at 9:00 (Exam A), 9:30 (Exam B) and 14:00 (Exam C) in 16 healthy volunteers. LA and RA reservoir, conduit and contractile booster pump functions were quantified by volumetric indexes as derived from fractional volume changes and by strain and SR as derived from CMR-FT. Exam A and B were compared to assess the inter-study reproducibility. Morning and afternoon scans were compared to address possible diurnal variation of atrial function. Inter-study reproducibility was within acceptable limits for all LA and RA volumetric, strain and SR parameters. Inter-study reproducibility was better for volumetric indexes and strain than for SR parameters and better for LA than for RA dynamics. For the LA, reservoir function showed the best reproducibility (intraclass correlation coefficient (ICC) 0.94-0.97, coefficient of variation (CoV) 4.5-8.2%), followed by conduit (ICC 0.78-0.97, CoV 8.2-18.5%) and booster pump function (ICC 0.71-0.95, CoV 18.3-22.7). Similarly, for the RA, reproducibility was best for reservoir function (ICC 0.76-0.96, CoV 7.5-24.0%) followed by conduit (ICC 0.67-0.91, CoV 13.9-35.9) and booster pump function (ICC 0.73-0.90, CoV 19.4-32.3). Atrial dynamics were not measurably affected by diurnal variation between morning and afternoon scans. Inter-study reproducibility for CMR-based derivation of LA and RA functions is acceptable using either volumetric, strain or SR parameters with LA function showing higher reproducibility than RA function assessment. Amongst the different functional components, reservoir function is most reproducibly assessed by either technique followed by conduit and booster pump function, which needs to be considered in future longitudinal research studies.

Validation of a simplified food frequency questionnaire for the assessment of dietary habits in Iranian adults: Isfahan Healthy Heart Program, Iran.

PubMed

Mohammadifard, Noushin; Sajjadi, Firouzeh; Maghroun, Maryam; Alikhasi, Hassan; Nilforoushzadeh, Farzaneh; Sarrafzadegan, Nizal

2015-03-01

Dietary assessment is the first step of dietary modification in community-based interventional programs. This study was performed to validate a simple food frequency questionnaire (SFFQ) for assessment of selected food items in epidemiological studies with a large sample size as well as community trails. This validation study was carried out on 264 healthy adults aged ≥ 41 years old living in 3 district central of Iran, including Isfahan, Najafabad, and Arak. Selected food intakes were assessed using a 48-item food frequency questionnaire (FFQ). The FFQ was interviewer-administered, which was completed twice; at the beginning of the study and 2 weeks thereafter. The validity of this SFFQ was examined compared to estimated amount by single 24 h dietary recall and 2 days dietary record. Validation of the FFQ was determined using Spearman correlation coefficients between daily frequency consumption of food groups as assessed by the FFQ and the qualitative amount of daily food groups intake accessed by dietary reference method was applied to evaluate validity. Intraclass correlation coefficients (ICC) were used to determine the reproducibility. Spearman correlation coefficient between the estimated amount of food groups intake by examined and reference methods ranged from 0.105 (P = 0.378) in pickles to 0.48 (P < 0.001) in plant protein. ICC for reproducibility of FFQ were between 0.47-0.69 in different food groups (P < 0.001). The designed SFFQ has a good relative validity and reproducibility for assessment of selected food groups intake. Thus, it can serve as a valid tool in epidemiological studies and clinical trial with large participants.
The test-retest reliability and criterion validity of a high-intensity, netball-specific circuit test: The Net-Test.

PubMed

Mungovan, Sean F; Peralta, Paula J; Gass, Gregory C; Scanlan, Aaron T

2018-04-12

To examine the test-retest reliability and criterion validity of a high-intensity, netball-specific fitness test. Repeated measures, within-subject design. Eighteen female netball players competing in an international competition completed a trial of the Net-Test, which consists of 14 timed netball-specific movements. Players also completed a series of netball-relevant criterion fitness tests. Ten players completed an additional Net-Test trial one week later to assess test-retest reliability using intraclass correlation coefficient (ICC), typical error of measurement (TEM), and coefficient of variation (CV). The typical error of estimate expressed as CV and Pearson correlations were calculated between each criterion test and Net-Test performance to assess criterion validity. Five movements during the Net-Test displayed moderate ICC (0.84-0.90) and two movements displayed high ICC (0.91-0.93). Seven movements and heart rate taken during the Net-Test held low CV (<5%) with values ranging from 1.7 to 9.5% across measures. Total time (41.63±2.05s) during the Net-Test possessed low CV and significant (p<0.05) correlations with 10m sprint time (1.98±0.12s; CV=4.4%, r=0.72), 20m sprint time (3.38±0.19s; CV=3.9%, r=0.79), 505 Change-of-Direction time (2.47±0.08s; CV=2.0%, r=0.80); and maximum oxygen uptake (46.59±2.58 mLkg -1 min -1 ; CV=4.5%, r=-0.66). The Net-Test possesses acceptable reliability for the assessment of netball fitness. Further, the high criterion validity for the Net-Test suggests a range of important netball-specific fitness elements are assessed in combination. Copyright © 2018 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Cross-cultural adaptation and reliability and validity of the Dutch Patient-Rated Tennis Elbow Evaluation (PRTEE-D).

PubMed

van Ark, Mathijs; Zwerver, Johannes; Diercks, Ronald L; van den Akker-Scheek, Inge

2014-08-11

Lateral Epicondylalgia (LE) is a common injury for which no reliable and valid measure exists to determine severity in the Dutch language. The Patient-Rated Tennis Elbow Evaluation (PRTEE) is the first questionnaire specifically designed for LE but in English. The aim of this study was to translate into Dutch and cross-culturally adapt the PRTEE and determine reliability and validity of the PRTEE-D (Dutch version). The PRTEE was cross-culturally adapted according to international guidelines. Participants (n = 122) were asked to fill out the PRTEE-D twice with a one week interval to assess test-retest reliability. Internal consistency of the PRTEE-D was determined by calculating Crohnbach's alphas for the questionnaire and subscales. Intraclass Correlation Coefficients (ICC) were calculated for the overall PRTEE-D score, pain and function subscale and individual questions to determine test-retest reliability. Additionally, the Disabilities for the Arm, Shoulder and Hand questionnaire (DASH) and Visual Analogue Scale (VAS) pain scores were obtained from 30 patients to assess construct validity; Spearman's correlation coefficients were calculated between the PRTEE-D (subscales) and DASH and VAS-pain scores. The PRTEE was successfully cross-culturally adapted into Dutch (PRTEE-D). Crohnbach's alpha for the first assessment of the PRTEE-D was 0.98; Crohnbach's alpha was 0.93 for the pain subscale and 0.97 for the function subscale. ICC for the PRTEE-D was 0.98; subscales also showed excellent ICC values (pain scale 0.97 and function scale 0.97). A significant moderate correlation exists between PRTEE-D and DASH (0.65) and PRTEE-D and VAS pain (0.68). The PRTEE was successfully cross-culturally adapted and this study showed that the PRTEE-D is reliable and valid to obtain an indication of severity of LE. An easy-to-use instrument for practitioners is now available and this facilitates comparing Dutch and international research data.
Assessment of Lower Limb Muscle Strength and Power Using Hand-Held and Fixed Dynamometry: A Reliability and Validity Study

PubMed Central

Perraton, Luke G.; Bower, Kelly J.; Adair, Brooke; Pua, Yong-Hao; Williams, Gavin P.; McGaw, Rebekah

2015-01-01

Introduction Hand-held dynamometry (HHD) has never previously been used to examine isometric muscle power. Rate of force development (RFD) is often used for muscle power assessment, however no consensus currently exists on the most appropriate method of calculation. The aim of this study was to examine the reliability of different algorithms for RFD calculation and to examine the intra-rater, inter-rater, and inter-device reliability of HHD as well as the concurrent validity of HHD for the assessment of isometric lower limb muscle strength and power. Methods 30 healthy young adults (age: 23±5yrs, male: 15) were assessed on two sessions. Isometric muscle strength and power were measured using peak force and RFD respectively using two HHDs (Lafayette Model-01165 and Hoggan microFET2) and a criterion-reference KinCom dynamometer. Statistical analysis of reliability and validity comprised intraclass correlation coefficients (ICC), Pearson correlations, concordance correlations, standard error of measurement, and minimal detectable change. Results Comparison of RFD methods revealed that a peak 200ms moving window algorithm provided optimal reliability results. Intra-rater, inter-rater, and inter-device reliability analysis of peak force and RFD revealed mostly good to excellent reliability (coefficients ≥ 0.70) for all muscle groups. Concurrent validity analysis showed moderate to excellent relationships between HHD and fixed dynamometry for the hip and knee (ICCs ≥ 0.70) for both peak force and RFD, with mostly poor to good results shown for the ankle muscles (ICCs = 0.31–0.79). Conclusions Hand-held dynamometry has good to excellent reliability and validity for most measures of isometric lower limb strength and power in a healthy population, particularly for proximal muscle groups. To aid implementation we have created freely available software to extract these variables from data stored on the Lafayette device. Future research should examine the reliability and validity of these variables in clinical populations. PMID:26509265
Reliability and validity of a dual-task test for skill proficiency in roundhouse kicks in elite taekwondo athletes.

PubMed

Chen, Chung-Yu; Dai, Jing; Chen, I-Fan; Chou, Kuei-Ming; Chang, Chen-Kang

2015-01-01

The dual-task methodology, conducting two tasks simultaneously, may provide better validity than the traditional single-task tests in the environment that is closely related to real sport competitions. The purpose of this study is to determine the reliability and validity of a dual-task test that aims to measure the reaction time and skill proficiency in roundhouse kicks in elite and sub-elite taekwondo athletes. The dual-task results were compared to those in the single-task movements with various levels of complexity. The single-task movements A, B, and C were composed of one, three, and five roundhouse kicks, respectively. The dual-task movement D was composed of movement C and a push of a button in response to a light stimulus as the secondary task. The subjects were 12 elite and 12 sub-elite male taekwondo athletes. The test included four movements with five repeats of each movement in a randomized order. Each subject conducted the same test on two consecutive days. The intraclass correlation coefficient (ICC) showed moderate-to-high correlation in the premotor time (ICC =0.439-0.634 in elite and ICC =0.681-0.824 in sub-elite), motor time (ICC =0.861-0.956 in elite and ICC =0.721-0.931 in sub-elite), and reaction time (ICC =0.692 in elite and ICC =0.676 in sub-elite) in the secondary task in both groups. The elite athletes had significantly faster premotor time than their sub-elite counterparts in all the four movements (all P<0.05). The largest difference lies in the reaction time in the secondary task, in which the elite group (0.248±0.026 seconds) was 33.0% faster than the sub-elite group (0.370±0.081 seconds) (P<0.001). This study shows that the test developed in this study has reasonable reliability and validity in both single- and dual-task methods. In addition, the dual-task method may be a more appropriate way to assess the reaction time and skill proficiency in taekwondo athletes.
Reliability and validity of a dual-task test for skill proficiency in roundhouse kicks in elite taekwondo athletes

PubMed Central

Chen, Chung-Yu; Dai, Jing; Chen, I-Fan; Chou, Kuei-Ming; Chang, Chen-Kang

2015-01-01

The dual-task methodology, conducting two tasks simultaneously, may provide better validity than the traditional single-task tests in the environment that is closely related to real sport competitions. The purpose of this study is to determine the reliability and validity of a dual-task test that aims to measure the reaction time and skill proficiency in roundhouse kicks in elite and sub-elite taekwondo athletes. The dual-task results were compared to those in the single-task movements with various levels of complexity. The single-task movements A, B, and C were composed of one, three, and five roundhouse kicks, respectively. The dual-task movement D was composed of movement C and a push of a button in response to a light stimulus as the secondary task. The subjects were 12 elite and 12 sub-elite male taekwondo athletes. The test included four movements with five repeats of each movement in a randomized order. Each subject conducted the same test on two consecutive days. The intraclass correlation coefficient (ICC) showed moderate-to-high correlation in the premotor time (ICC =0.439–0.634 in elite and ICC =0.681–0.824 in sub-elite), motor time (ICC =0.861–0.956 in elite and ICC =0.721–0.931 in sub-elite), and reaction time (ICC =0.692 in elite and ICC =0.676 in sub-elite) in the secondary task in both groups. The elite athletes had significantly faster premotor time than their sub-elite counterparts in all the four movements (all P<0.05). The largest difference lies in the reaction time in the secondary task, in which the elite group (0.248±0.026 seconds) was 33.0% faster than the sub-elite group (0.370±0.081 seconds) (P<0.001). This study shows that the test developed in this study has reasonable reliability and validity in both single- and dual-task methods. In addition, the dual-task method may be a more appropriate way to assess the reaction time and skill proficiency in taekwondo athletes. PMID:26150736
Dietary quality varies according to data collection instrument: a comparison between a food frequency questionnaire and 24-hour recall.

PubMed

Rodrigues, Paulo Rogério Melo; de Souza, Rita Adriana Gomes; De Cnop, Mara Lima; Monteiro, Luana Silva; Coura, Camila Pinheiro; Brito, Alessandra Page; Pereira, Rosangela Alves

2016-02-01

The objective of this study was to assess the agreement between the Brazilian Healthy Eating Index - Revised (BHEI-R), estimated by a food frequency questionnaire (FFQ) and multiple 24-hour recalls (24h-R). The Wilcoxon paired test, partial correlations (PC), intraclass correlation coefficient (ICC), and Bland-Altman method were used. The total BHEI-R scores and its components ("total fruits", "whole fruits", "total vegetables", "integral cereals", "saturated fat", "sodium", and "energy intake derived from solid fat, added sugar, and alcoholic beverages") were statistically different, with the ICC and PC indicating poor concordance and correlation. The mean concordance estimated for the total BHEI-R and its components varied from 68% for "integral cereals" to 147% for "whole fruits". The suitable concordance limits were violated for most of the components of the BHEI-R. Poor concordance was observed between the BHEI-R estimated by the FFQ and by multiple 24h-R, which indicated a strong reliability of the BHEI-R on the instrument used to collect information on food consumption.
Arabic cross cultural adaptation and validation of the National Institutes of Health Stroke Scale.

PubMed

Hussein, Haitham M; Abdel Moneim, Amr; Emara, Tamer; Abd-Elhamid, Yousry A; Salem, Haitham H; Abd-Allah, Foad; Farrag, Mohammad A; Tork, M Amir; Shalash, Ali S; Ezz El Dein, Khaled H; Osman, Gamaleldin; Georgy, Shady S; Ghali, Peter G; Lyden, Patrick D; Moustafa, Ramez R

2015-10-15

The National Institutes of Health Stroke Scale (NIHSS), the most commonly used tool to quantify neurological deficit in acute stroke, was initially developed in English. We present our experience in developing and validating an Arabic version of the NIHSS (arNIHSS). In 6months, 137 patients were recruited (mean age±standard deviation 62±12years; 48 women). For interrater agreement, weighted kappa value ranged from 0.36 to 0.66 and intraclass correlation coefficient (ICC) for the whole scale was excellent at 0.95 (95% confidence interval [CI] 0.94-0.97). For intrarater agreement, weighted kappa ranged from 0.52 to 1.0 and the ICC was 0.94 (95% CI 0.87-0.98). The construct validity of the arNIHSS is demonstrated by its correlation with the DWI-ASPECT and the 3months mRS score (Spearman correlation -0.46 and 0.58 respectively; P<0.001 for both). We developed and validated a culturally adapted Arabic version of the NIHSS. Further validation in other Arab countries is recommended. Copyright © 2015 Elsevier B.V. All rights reserved.
Validation of the Walking Impairment Questionnaire for Spanish patients.

PubMed

Lozano, Francisco S; March, José R; González-Porras, José R; Carrasco, Eduardo; Lobos, José M; Areitio-Aurtena, Alix

2013-09-01

The Walking Impairment Questionnaire (WIQ) is a short, easy to complete, disease-specific questionnaire to assess intermittent claudication. A Spanish version of the WIQ for Hispanic Americans has recently been validated in Texas, but it needs to be validated for European Spanish people. After translation and cultural adaptation of the WIQ, 920 patients with intermittent claudication (ankle brachial index < 0.9) completed two questionnaires (Spanish version of the WIQ and European Quality of Life 5 Dimension [EQ-5D]). The validity of the WIQ was determined by correlating WIQ and EQ-5D. Test-retest reliability and internal consistency were determined using the intra-class correlation coefficient (ICC) and Cronbach's alpha, respectively. The three domains of the WIQ were moderately correlated with the EQ-5D health outcome (r = 0.54 to 0.60; p < 0.001). Test-retest reliabilities ranged from ICC = 0.89 to 0.91 and internal consistency (Cronbach's alpha = 0.92) was high. The Spanish version of the WIQ for European Spanish patients was valid and reproducible, suggesting that it could be used in Spanish patients with intermittent claudication.
Validity of real-time ultrasound imaging to measure anterior hip muscle size: a comparison with magnetic resonance imaging.

PubMed

Mendis, M Dilani; Wilson, Stephen J; Stanton, Warren; Hides, Julie A

2010-09-01

Clinical measurement, criterion standard. To investigate the validity of real-time ultrasound imaging (USI) to measure individual anterior hip muscle cross-sectional area. The hip flexor muscles are important for hip joint function and could be affected by joint pathology or injury. Objectively documenting individual anterior hip muscle size can be useful in identifying muscle size asymmetry and monitoring treatment efficacy for patients with hip problems. USI offers a novel method of measuring individual muscle size in the clinic, but its validity in measuring the anterior hip muscles has not been investigated. Nine healthy participants (5 males, 4 females) underwent imaging of their iliopsoas, sartorius, and rectus femoris muscles with USI and magnetic resonance imaging. Bilateral muscle cross-sectional areas were measured on images from both modalities. There was no significant difference (P>.05) in mean cross-sectional area measurements from USI and magnetic resonance imaging for each muscle. Agreement between measurements was high for the iliopsoas (left: intraclass correlation coefficient [ICC3,1] = 0.86; 95% confidence interval [CI]: 0.51, 0.97; right: ICC3,1 = 0.88; 95% CI: 0.57, 0.97), sartorius (left: ICC3,1 = 0.82; 95% CI: 0.41, 0.96; right: ICC3,1 = 0.81; 95% CI: 0.39, 0.95), and rectus femoris (left: ICC3,1 = 0.85; 95% CI: 0.49, 0.96; right: ICC3,1 = 0.89; 95% CI: 0.61, 0.97). Reliability of measuring each muscle with USI was high between 2 trials (ICCs3,1 = 0.84 to 0.94). USI is a valid measure of iliopsoas, sartorius, and rectus femoris muscle size in healthy people, as long as a strict measurement protocol is followed.
Is liver perfusion CT reproducible? A study on intra- and interobserver agreement of normal hepatic haemodynamic parameters obtained with two different software packages.

PubMed

Bretas, Elisa Almeida Sathler; Torres, Ulysses S; Torres, Lucas Rios; Bekhor, Daniel; Saito Filho, Celso Fernando; Racy, Douglas Jorge; Faggioni, Lorenzo; D'Ippolito, Giuseppe

2017-10-01

To evaluate the agreement between the measurements of perfusion CT parameters in normal livers by using two different software packages. This retrospective study was based on 78 liver perfusion CT examinations acquired for detecting suspected liver metastasis. Patients with any morphological or functional hepatic abnormalities were excluded. The final analysis included 37 patients (59.7 ± 14.9 y). Two readers (1 and 2) independently measured perfusion parameters using different software packages from two major manufacturers (A and B). Arterial perfusion (AP) and portal perfusion (PP) were determined using the dual-input vascular one-compartmental model. Inter-reader agreement for each package and intrareader agreement between both packages were assessed with intraclass correlation coefficients (ICC) and Bland-Altman statistics. Inter-reader agreement was substantial for AP using software A (ICC = 0.82) and B (ICC = 0.85-0.86), fair for PP using software A (ICC = 0.44) and fair to moderate for PP using software B (ICC = 0.56-0.77). Intrareader agreement between software A and B ranged from slight to moderate (ICC = 0.32-0.62) for readers 1 and 2 considering the AP parameters, and from fair to moderate (ICC = 0.40-0.69) for readers 1 and 2 considering the PP parameters. At best there was only moderate agreement between both software packages, resulting in some uncertainty and suboptimal reproducibility. Advances in knowledge: Software-dependent factors may contribute to variance in perfusion measurements, demanding further technical improvements. AP measurements seem to be the most reproducible parameter to be adopted when evaluating liver perfusion CT.
Direct comparison of PI-RADS version 2 and version 1 regarding interreader agreement and diagnostic accuracy for the detection of clinically significant prostate cancer.

PubMed

Becker, Anton S; Cornelius, Alexander; Reiner, Cäcilia S; Stocker, Daniel; Ulbrich, Erika J; Barth, Borna K; Mortezavi, Ashkan; Eberli, Daniel; Donati, Olivio F

2017-09-01

to simultaneously evaluate interreader agreement and diagnostic accuracy in the of PI-RADS v2 and compare it to v1. A total of 67 patients (median age 65.3 y, range 51.2-78.2 y; PSA 6.8μg/L, 0.2-33μg/L) undergoing MRI of the prostate and subsequent transperineal template biopsy within ≤6 months from MRI were included. Four readers from two institutions evaluated the likelihood of prostate cancer using PI-RADS v1 and v2 in two separate reading sessions ≥3 months apart. Interreader agreement was assessed for each pulse-sequence and for total PI-RADS scores using the intraclass correlation coefficient (ICC). Differences were considered significant for non-overlapping 95%-confidence intervals. Diagnostic accuracy was assessed with the area under the receiver operating characteristic curve (A Z ). A p-value <0.05 was considered statistically significant. Interreader agreement for DCE-scores was good in v2 (ICC 2 =0.70; 95% CI: 0.66-0.74) and slightly lower in v1 (ICC 1 =0.64, 0.59-0.69). Agreement for DWI scores (ICC 1 =0.77, ICC 2 =0.76) as well as final PI-RADS scores per quadrant were nearly identical (ICC 1 =ICC 2 =0.71). Diagnostic accuracy showed no significant differences (p=0.09-0.93) between v1 and v2 in any of the readers (range: A Z =0.78-0.88). PI-RADS scores show similar interreader agreement in v2 and v1 at comparable diagnostic performance. The simplification of the DCE interpretation in v2 might slightly improve agreement while not negatively affecting diagnostic performance. Copyright © 2017 Elsevier B.V. All rights reserved.
Intra- and interobserver agreement for fetal cerebral measurements in 3D-ultrasonography.

PubMed

Albers, Maria E W A; Buisman, Erato T I A; Kahn, René S; Franx, Arie; Onland-Moret, N Charlotte; de Heus, Roel

2018-04-10

The aim of this study is to evaluate intra- and interobserver agreement for measurement of intracranial, cerebellar, and thalamic volume with the Virtual Organ Computer-aided AnaLysis (VOCAL) technique in three-dimensional ultrasound images, in comparison to two-dimensional measurements of these brain structures. Three-dimensional ultrasound images of the brains of 80 fetuses at 20-24 weeks' gestational age were obtained from YOUth, a Dutch prospective cohort study. Two observers performed offline measurement of the occipitofrontal diameter, intracranial volume, transcerebellar diameter, cerebellar volume, and thalamic width, area, and volume, independently. VOCAL was used for calculation of the volumes. The two-way random, single measures intraclass correlation coefficient (ICC) was used for analysis of agreement and Bland-Altman plots were configured. Intra- and interobserver agreement was almost perfect for occipitofrontal diameter (intra ICC 0.88, 95% CI 0.82-0.92; inter ICC 0.91, 95% CI 0.85-0.94), intracranial volume (intra ICC 0.96, 95% CI 0.91-0.98; inter ICC 0.97, 95% CI 0.96-0.98) and transcerebellar diameter (intra ICC 0.91, 95% CI 0.86-0.94; inter ICC 0.86, 95% CI 0.78-0.910). For cerebellar volume, the intraobserver agreement was almost perfect (0.85, 95% CI 0.76-0.90), whereas the interobserver agreement was substantial (0.75, 95% CI 0.44-0.88). Agreement was only moderate for thalamic measurements. Bland-Altman plots for the volume measurements are normally distributed with acceptable mean differences and 95% limits of agreement. The intra- and interobserver agreement of the measurement of intracranial and cerebellar volume with VOCAL was almost perfect. These measurements are therefore reliable, and can be used to investigate fetal brain development. Thalamic measurements are not reliable enough. © 2018 Wiley Periodicals, Inc.
Reliability of videotaped observational gait analysis in patients with orthopedic impairments

PubMed Central

Brunnekreef, Jaap J; van Uden, Caro JT; van Moorsel, Steven; Kooloos, Jan GM

2005-01-01

Background In clinical practice, visual gait observation is often used to determine gait disorders and to evaluate treatment. Several reliability studies on observational gait analysis have been described in the literature and generally showed moderate reliability. However, patients with orthopedic disorders have received little attention. The objective of this study is to determine the reliability levels of visual observation of gait in patients with orthopedic disorders. Methods The gait of thirty patients referred to a physical therapist for gait treatment was videotaped. Ten raters, 4 experienced, 4 inexperienced and 2 experts, individually evaluated these videotaped gait patterns of the patients twice, by using a structured gait analysis form. Reliability levels were established by calculating the Intraclass Correlation Coefficient (ICC), using a two-way random design and based on absolute agreement. Results The inter-rater reliability among experienced raters (ICC = 0.42; 95%CI: 0.38–0.46) was comparable to that of the inexperienced raters (ICC = 0.40; 95%CI: 0.36–0.44). The expert raters reached a higher inter-rater reliability level (ICC = 0.54; 95%CI: 0.48–0.60). The average intra-rater reliability of the experienced raters was 0.63 (ICCs ranging from 0.57 to 0.70). The inexperienced raters reached an average intra-rater reliability of 0.57 (ICCs ranging from 0.52 to 0.62). The two expert raters attained ICC values of 0.70 and 0.74 respectively. Conclusion Structured visual gait observation by use of a gait analysis form as described in this study was found to be moderately reliable. Clinical experience appears to increase the reliability of visual gait analysis. PMID:15774012
Reliability of Serum Metabolites over a Two-Year Period: A Targeted Metabolomic Approach in Fasting and Non-Fasting Samples from EPIC

PubMed Central

Achaintre, David; Sacerdote, Carlotta; Vineis, Paolo; Key, Timothy J.; Onland Moret, N. Charlotte; Scalbert, Augustin; Rinaldi, Sabina; Ferrari, Pietro

2015-01-01

Objective Although metabolic profiles have been associated with chronic disease risk, lack of temporal stability of metabolite levels could limit their use in epidemiological investigations. The present study aims to evaluate the reliability over a two-year period of 158 metabolites and compare reliability over time in fasting and non-fasting serum samples. Methods Metabolites were measured with the AbsolueIDQp180 kit (Biocrates, Innsbruck, Austria) by mass spectrometry and included acylcarnitines, amino acids, biogenic amines, hexoses, phosphatidylcholines and sphingomyelins. Measurements were performed on repeat serum samples collected two years apart in 27 fasting men from Turin, Italy, and 39 non-fasting women from Utrecht, The Netherlands, all participating in the European Prospective Investigation into Cancer and Nutrition (EPIC) study. Reproducibility was assessed by estimating intraclass correlation coefficients (ICCs) in multivariable mixed models. Results In fasting samples, a median ICC of 0.70 was observed. ICC values were <0.50 for 48% of amino acids, 27% of acylcarnitines, 18% of lysophosphatidylcholines and 4% of phosphatidylcholines. In non-fasting samples, the median ICC was 0.54. ICC values were <0.50 for 71% of acylcarnitines, 48% of amino acids, 44% of biogenic amines, 36% of sphingomyelins, 34% of phosphatidylcholines and 33% of lysophosphatidylcholines. Overall, reproducibility was lower in non-fasting as compared to fasting samples, with a statistically significant difference for 19–36% of acylcarnitines, phosphatidylcholines and sphingomyelins. Conclusion A single measurement per individual may be sufficient for the study of 73% and 52% of the metabolites showing ICCs >0.50 in fasting and non-fasting samples, respectively. ICCs were higher in fasting samples that are preferable to non-fasting. PMID:26274920
Impact of training on concordance among rheumatologists and dermatologists in the assessment of patients with psoriasis and psoriatic arthritis.

PubMed

Salvarani, Carlo; Girolomoni, Giampiero; Di Lernia, Vito; Gisondi, Paolo; Tripepi, Giovanni; Egan, Colin Gerard; Marchesoni, Antonio

2016-12-01

To evaluate the impact of training on the reliability among dermatologists and rheumatologists in the assessment of psoriatic arthritis (PsA) patients. Overall, 9 hospital-based rheumatologists and 8 hospital-based dermatologists met in Reggio Emilia, Italy on October 2015 to assess 17 PsA patients. After 1 month, physicians underwent a 3-h training session by 4 recognized experts and then assessed 19 different PsA patients according to a modified Latin square design. Measures included tender (TJC) and swollen joint count (SJC), dactylitis, enthesitis, Schober test, psoriasis body surface area (BSA), Psoriasis Area and Severity Index (PASI), Nail Psoriasis Severity Index (NAPSI), and static physician's global assessment of PsA disease activity (sPGA). Variance components analyses were performed to estimate the intraclass correlation coefficient (ICC). TJC and enthesitis-measured pre-training by dermatologists or rheumatologists revealed moderate-substantial agreement (ICC: 0.4-0.8). In contrast, SJC and Schober test showed fair (ICC: 0.2-0.4) and moderate agreement, respectively (ICC: 0.4-0.6), while poor agreement (ICC: 0-0.2) was represented by dactylitis. Moderate-substantial (ICC: 0.4-0.8) agreement was observed for most skin measures by dermatologists and rheumatologists, apart from BSA, where fair agreement (ICC: 0.2-0.4) was observed. Agreement levels were similar before and after training for arthritis measures. In contrast, levels of agreement after training for 3 of the 4 skin measures were increased for dermatologists and all 4 skin measures were increased for rheumatologists. Substantial to excellent agreement was observed for TJC, enthesitis, PASI, and sPGA. Rheumatologists benefited from training to a greater extent. Copyright © 2016 Elsevier Inc. All rights reserved.
Development of a Peer Teaching-Assessment Program and a Peer Observation and Evaluation Tool

PubMed Central

Trujillo, Jennifer M.; Barr, Judith; Gonyeau, Michael; Van Amburgh, Jenny A.; Matthews, S. James; Qualters, Donna

2008-01-01

Objectives To develop a formalized, comprehensive, peer-driven teaching assessment program and a valid and reliable assessment tool. Methods A volunteer taskforce was formed and a peer-assessment program was developed using a multistep, sequential approach and the Peer Observation and Evaluation Tool (POET). A pilot study was conducted to evaluate the efficiency and practicality of the process and to establish interrater reliability of the tool. Intra-class correlation coefficients (ICC) were calculated. Results ICCs for 8 separate lectures evaluated by 2-3 observers ranged from 0.66 to 0.97, indicating good interrater reliability of the tool. Conclusion Our peer assessment program for large classroom teaching, which includes a valid and reliable evaluation tool, is comprehensive, feasible, and can be adopted by other schools of pharmacy. PMID:19325963
Intrarater Reliability of Muscle Strength and Hamstring to Quadriceps Strength Imbalance Ratios During Concentric, Isometric, and Eccentric Maximal Voluntary Contractions Using the Isoforce Dynamometer.

PubMed

Mau-Moeller, Anett; Gube, Martin; Felser, Sabine; Feldhege, Frank; Weippert, Matthias; Husmann, Florian; Tischer, Thomas; Bader, Rainer; Bruhn, Sven; Behrens, Martin

2017-08-17

To determine intrasession and intersession reliability of strength measurements and hamstrings to quadriceps strength imbalance ratios (H/Q ratios) using the new isoforce dynamometer. Repeated measures. Exercise science laboratory. Thirty healthy subjects (15 females, 15 males, 27.8 years). Coefficient of variation (CV) and intraclass correlation coefficients (ICC) were calculated for (1) strength parameters, that is peak torque, mean work, and mean power for concentric and eccentric maximal voluntary contractions; isometric maximal voluntary torque (IMVT); rate of torque development (RTD), and (2) H/Q ratios, that is conventional concentric, eccentric, and isometric H/Q ratios (Hcon/Qcon at 60 deg/s, 120 deg/s, and 180 deg/s, Hecc/Qecc at -60 deg/s and Hiso/Qiso) and functional eccentric antagonist to concentric agonist H/Q ratios (Hecc/Qcon and Hcon/Qecc). High reliability: CV <10%, ICC >0.90; moderate reliability: CV between 10% and 20%, ICC between 0.80 and 0.90; low reliability: CV >20%, ICC <0.80. (1) Strength parameters: (a) high intrasession reliability for concentric, eccentric, and isometric measurements, (b) moderate-to-high intersession reliability for concentric and eccentric measurements and IMVT, and (c) moderate-to-high intrasession reliability but low intersession reliability for RTD. (2) H/Q ratios: (a) moderate-to-high intrasession reliability for conventional ratios, (b) high intrasession reliability for functional ratios, (c) higher intersession reliability for Hcon/Qcon and Hiso/Qiso (moderate to high) than Hecc/Qecc (low to moderate), and (d) higher intersession reliability for conventional H/Q ratios (low to high) than functional H/Q ratios (low to moderate). The results have confirmed the reliability of strength parameters and the most frequently used H/Q ratios.
Cross-cultural adaptation and reproducibility of the Brazilian-Portuguese version of the modified FRESNO Test to evaluate the competence in evidence based practice by physical therapists

PubMed Central

Silva, Anderson M.; Costa, Lucíola C. M.; Comper, Maria L.; Padula, Rosimeire S.

2016-01-01

BACKGROUND: The Modified Fresno Test was developed to assess knowledge and skills of both physical therapy (PT) professionals and students to use evidence-based practice (EBP). OBJECTIVES: To translate the Modified Fresno Test into Brazilian-Portuguese and to evaluate the test's reproducibility. METHOD: The first step consisted of adapting the instrument into the Brazilian-Portuguese language. Then, a total of 57 participants, including PT students, PT professors and PT practitioners, completed the translated instrument. The responses from the participants were used to evaluate reproducibility of the translated instrument. Internal consistency was calculated using the Cronbach's alpha. Reliability was calculated using the intraclass correlation coefficient (ICC) for continuous variables, and the Kappa coefficient (K) for categorical variables. The agreement was assessed using the standard error of the measurement (SEM). RESULTS: The cross-cultural adaptation process was appropriate, providing an adequate Brazilian-Portuguese version of the instrument. The internal consistency was good (α=0.769). The reliability for inter- and intra-rater assessment were ICC=0.89 (95% CI 0.82 to 0.93); for evaluator 1 was ICC=0.85 (95% CI 0.57 to 0.93); and for evaluator 2 was ICC=0.98 (95% CI 0.97 to 0.99). The SEM was 13.04 points for inter-rater assessment, 12.57 points for rater 1 and 4.59 points for rater 2. CONCLUSION: The Brazilian-Portuguese language version of the Modified Fresno Test showed satisfactory results in terms of reproducibility. The Modified Fresno Test will allow physical therapy professionals and students to be evaluated on the use of understanding EBP. PMID:26786079
Cross-cultural adaptation and reproducibility of the Brazilian-Portuguese version of the modified FRESNO Test to evaluate the competence in evidence based practice by physical therapists.

PubMed

Silva, Anderson M; Costa, Lucíola C M; Comper, Maria L; Padula, Rosimeire S

2016-01-01

The Modified Fresno Test was developed to assess knowledge and skills of both physical therapy (PT) professionals and students to use evidence-based practice (EBP). To translate the Modified Fresno Test into Brazilian-Portuguese and to evaluate the test's reproducibility. The first step consisted of adapting the instrument into the Brazilian-Portuguese language. Then, a total of 57 participants, including PT students, PT professors and PT practitioners, completed the translated instrument. The responses from the participants were used to evaluate reproducibility of the translated instrument. Internal consistency was calculated using the Cronbach's alpha. Reliability was calculated using the intraclass correlation coefficient (ICC) for continuous variables, and the Kappa coefficient (K) for categorical variables. The agreement was assessed using the standard error of the measurement (SEM). The cross-cultural adaptation process was appropriate, providing an adequate Brazilian-Portuguese version of the instrument. The internal consistency was good (α=0.769). The reliability for inter- and intra-rater assessment were ICC=0.89 (95% CI 0.82 to 0.93); for evaluator 1 was ICC=0.85 (95% CI 0.57 to 0.93); and for evaluator 2 was ICC=0.98 (95% CI 0.97 to 0.99). The SEM was 13.04 points for inter-rater assessment, 12.57 points for rater 1 and 4.59 points for rater 2. The Brazilian-Portuguese language version of the Modified Fresno Test showed satisfactory results in terms of reproducibility. The Modified Fresno Test will allow physical therapy professionals and students to be evaluated on the use of understanding EBP.

[Reliability of a questionnaire for measuring physical activity and sedentary behavior in children from preschool to fourth grade].

PubMed

Camargo, Diana Marina; Santisteban, Stefany; Paredes, Erika; Flórez, Mary Ann; Bueno, Diego

2015-01-01

International recommendations for physical activity and time spent in sedentary behaviors for children in their early years require the availability of measuring instruments with psychometric properties that allow for the assessment of population dynamics and interventions to improve health. To evaluate the reliability of a questionnaire to measure physical activity and sedentary behaviors in children from preschool to fourth grade. One hundred and eight parents answered the questionnaire. The instrument included socio-demographic variables, as well as those associated with physical activity, including time walking to school, organized sports and playtime activities. Sedentary behaviors included motorized transport to school, reading and "screen time", sleeping and extracurricular courses. Internal consistency, reproducibility and agreement were evaluated using Cronbach's alpha coefficient, the Intraclass Correlation Coefficient (ICC) and the Bland and Altman limits of agreement method, respectively. Internal consistency for physical activity ranged from 0.59 to 0.64, and for sedentary behaviors between 0.22 and 0.34. The highest reproducibility was found for walking to school and time spent on this (kappa=0.79, ICC 0.69), and organized sports, and time on this activity (kappa=0.72, ICC 0.76). Among sedentary behaviors, motorized transport to school and computer use showed kappas of 0.82 and 0.71, respectively; additionally, the time spent on these behaviors showed an ICC of 0.8 and 0.59, respectively. We found limits of agreement between moderate and good for reading time, napping, extracurricular courses, computer and console use. The questionnaire provided reliable information on the physical activity and sedentary behaviors in children under 10 years of age and could be used in other Latin American countries.
Assessment of loaded squat jump height with a free-weight barbell and Smith machine: comparison of the take-off velocity and flight time procedures.

PubMed

Pérez-Castilla, Alejandro; McMahon, John J; Comfort, Paul; García-Ramos, Amador

2017-07-31

The aims of this study were to compare the reliability and magnitude of jump height between the two standard procedures of analysing force platform data to estimate jump height (take-off velocity [TOV] and flight time [FT]) in the loaded squat jump (SJ) exercise performed with a free-weight barbell and in a Smith machine. Twenty-three collegiate men (age 23.1 ± 3.2 years, body mass 74.7 ± 7.3 kg, height 177.1 ± 7.0 cm) were tested twice for each SJ type (free-weight barbell and Smith machine) with 17, 30, 45, 60, and 75 kg loads. No substantial differences in reliability were observed between the TOV (Coefficient of variation [CV]: 9.88%; Intraclass correlation coefficient [ICC]: 0.82) and FT (CV: 8.68%; ICC: 0.88) procedures (CV ratio: 1.14), while the Smith SJ (CV: 7.74%; ICC: 0.87) revealed a higher reliability than the free-weight SJ (CV: 9.88%; ICC: 0.81) (CV ratio: 1.28). The TOV procedure provided higher magnitudes of jump height than the FT procedure for the loaded Smith machine SJ (systematic bias: 2.64 cm; P<0.05), while no significant differences between the TOV and FT procedures were observed in the free-weight SJ exercise (systematic bias: 0.26 cm; P>0.05). Heteroscedasticity of the errors was observed for the Smith machine SJ (r: 0.177) with increasing differences in favour of the TOV procedure for the trials with lower jump height (i.e. higher external loads). Based on these results the use of a Smith machine in conjunction with the FT more accurately determine jump height during the loaded SJ.
The reliability of three devices used for measuring vertical jump height.

PubMed

Nuzzo, James L; Anning, Jonathan H; Scharfenberg, Jessica M

2011-09-01

The purpose of this investigation was to assess the intrasession and intersession reliability of the Vertec, Just Jump System, and Myotest for measuring countermovement vertical jump (CMJ) height. Forty male and 39 female university students completed 3 maximal-effort CMJs during 2 testing sessions, which were separated by 24-48 hours. The height of the CMJ was measured from all 3 devices simultaneously. Systematic error, relative reliability, absolute reliability, and heteroscedasticity were assessed for each device. Systematic error across the 3 CMJ trials was observed within both sessions for males and females, and this was most frequently observed when the CMJ height was measured by the Vertec. No systematic error was discovered across the 2 testing sessions when the maximum CMJ heights from the 2 sessions were compared. In males, the Myotest demonstrated the best intrasession reliability (intraclass correlation coefficient [ICC] = 0.95; SEM = 1.5 cm; coefficient of variation [CV] = 3.3%) and intersession reliability (ICC = 0.88; SEM = 2.4 cm; CV = 5.3%; limits of agreement = -0.08 ± 4.06 cm). Similarly, in females, the Myotest demonstrated the best intrasession reliability (ICC = 0.91; SEM = 1.4 cm; CV = 4.5%) and intersession reliability (ICC = 0.92; SEM = 1.3 cm; CV = 4.1%; limits of agreement = 0.33 ± 3.53 cm). Additional analysis revealed that heteroscedasticity was present in the CMJ when measured from all 3 devices, indicating that better jumpers demonstrate greater fluctuations in CMJ scores across testing sessions. To attain reliable CMJ height measurements, practitioners are encouraged to familiarize athletes with the CMJ technique and then allow the athletes to complete numerous repetitions until performance plateaus, particularly if the Vertec is being used.
Sex Estimation from Human Cranium: Forensic and Anthropological Interest of Maxillary Sinus Volumes.

PubMed

Radulesco, Thomas; Michel, Justin; Mancini, Julien; Dessi, Patrick; Adalian, Pascal

2018-05-01

Sex estimation is a key objective of forensic science. We aimed to establish whether maxillary sinus volumes (MSV) could assist in estimating an individual's sex. One hundred and three CT scans were included. MSV were determined using three-dimensional reconstructions. Two observers performed three-dimensional MSV reconstructions using the same methods. Intra- and interobserver reproducibility were statistically compared using the intraclass correlation coefficient (ICC) (α = 5%). Both intra- and interobserver reproducibility were perfect regarding MSV; both ICCs were 100%. There were no significant differences between right and left MSV (p = 0.083). No correlation was found between age and MSV (p > 0.05). We demonstrated the existence of sexual dimorphism in MSV (p < 0.001) and showed that MSV measurements gave a 68% rate of correct allocations to sex group. MSV measurements could be useful to support sex estimation in forensic medicine. © 2017 American Academy of Forensic Sciences.
Validation and calibration of HeadCount, a self-report measure for quantifying heading exposure in soccer players.

PubMed

Catenaccio, E; Caccese, J; Wakschlag, N; Fleysher, R; Kim, N; Kim, M; Buckley, T A; Stewart, W F; Lipton, R B; Kaminski, T; Lipton, M L

2016-01-01

The long-term effects of repetitive head impacts due to heading are an area of increasing concern, and exposure must be accurately measured; however, the validity of self-report of cumulative soccer heading is not known. In order to validate HeadCount, a 2-week recall questionnaire, the number of player-reported headers was compared to the number of headers observed by trained raters for a men's and a women's collegiate soccer teams during an entire season of competitive play using Spearman's correlations and intraclass correlation coefficients (ICCs), and calibrated using a generalized estimating equation. The average Spearman's rho was 0.85 for men and 0.79 for women. The average ICC was 0.75 in men and 0.38 in women. The calibration analysis demonstrated that men tend to report heading accurately while women tend to overestimate. HeadCount is a valid instrument for tracking heading behaviour, but may have to be calibrated in women.
Evaluation of the psychometric properties of the Nighttime Symptoms of COPD Instrument.

PubMed

Mocarski, Michelle; Zaiser, Erica; Trundell, Dylan; Make, Barry J; Hareendran, Asha

2015-01-01

Nighttime symptoms can negatively impact the quality of life of patients with chronic obstructive pulmonary disease (COPD). The Nighttime Symptoms of COPD Instrument (NiSCI) was designed to measure the occurrence and severity of nighttime symptoms in patients with COPD, the impact of symptoms on nighttime awakenings, and rescue medication use. The objective of this study was to explore item reduction, inform scoring recommendations, and evaluate the psychometric properties of the NiSCI. COPD patients participating in a Phase III clinical trial completed the NiSCI daily. Item analyses were conducted using weekly mean and single day scores. Descriptive statistics (including percentage of respondents at floor/ceiling and inter-item correlations), factor analyses, and Rasch model analyses were conducted to examine item performance and scoring. Test-retest reliability was assessed for the final instrument using the intraclass correlation coefficient (ICC). Correlations with assessments conducted during study visits were used to evaluate convergent and known-groups validity. Data from 1,663 COPD patients aged 40-93 years were analyzed. Item analyses supported the generation of four scores. A one-factor structure was confirmed with factor analysis and Rasch analysis for the symptom severity score. Test-retest reliability was confirmed for the six-item symptom severity (ICC, 0.85), number of nighttime awakenings (ICC, 0.82), and rescue medication (ICC, 0.68) scores. Convergent validity was supported by significant correlations between the NiSCI, St George's Respiratory Questionnaire, and Exacerbations of Chronic Obstructive Pulmonary Disease Tool-Respiratory Symptoms scores. The results suggest that the NiSCI can be used to determine the severity of nighttime COPD symptoms, the number of nighttime awakenings due to COPD symptoms, and the nighttime use of rescue medication. The NiSCI is a reliable and valid instrument to evaluate these concepts in COPD patients in clinical trials and clinical practice. Scoring recommendations and steps for further research are discussed.
Accuracy and reproducibility of novel echocardiographic three-dimensional automated software for the assessment of the aortic root in candidates for thanscatheter aortic valve replacement.

PubMed

García-Martín, Ana; Lázaro-Rivera, Carla; Fernández-Golfín, Covadonga; Salido-Tahoces, Luisa; Moya-Mur, Jose-Luis; Jiménez-Nacher, Jose-Julio; Casas-Rojo, Eduardo; Aquila, Iolanda; González-Gómez, Ariana; Hernández-Antolín, Rosana; Zamorano, José Luis

2016-07-01

A specialized three-dimensional transoesophageal echocardiography (3D-TOE) reconstruction tool has recently been introduced; the system automatically configures a geometric model of the aortic root from the images obtained by 3D-TOE and performs quantitative analysis of these structures. The aim of this study was to compare the measurements of the aortic annulus (AA) obtained by the new model to that obtained by 3D-TOE and multidetector computed tomography (MDCT) in candidates to transcatheter aortic valve implantation (TAVI) and to assess the reproducibility of this new method. We included 31 patients who underwent TAVI. The AA diameters and area were evaluated by the manual 3D-TOE method and by the automatic software. We showed an excellent correlation between the measurements obtained by both methods: intra-class correlation coefficient (ICC): 0.731 (0.508-0.862), r: 0.742 for AA diameter and ICC: 0.723 (0.662-0.923), r: 0.723 for the AA area, with no significant differences regardless of the method used. The interobserver variability was superior for the automatic measurements than for the manual ones. In a subgroup of 10 patients, we also found an excellent correlation between the automatic measurements and those obtained by MDCT, ICC: 0.941 (0.761-0.985), r: 0.901 for AA diameter and ICC: 0.853 (0.409-0.964), r: 0.744 for the AA area. The new automatic 3D-TOE software allows modelling and quantifying the aortic root from 3D-TOE data with high reproducibility. There is good correlation between the automated measurements and other 3D validated techniques. Our results support its use in clinical practice as an alternative to MDCT previous to TAVI. Published on behalf of the European Society of Cardiology. All rights reserved. © The Author 2015. For permissions please email: journals.permissions@oup.com.
Multicenter reliability of semiautomatic retinal layer segmentation using OCT

PubMed Central

Oberwahrenbrock, Timm; Traber, Ghislaine L.; Lukas, Sebastian; Gabilondo, Iñigo; Nolan, Rachel; Songster, Christopher; Balk, Lisanne; Petzold, Axel; Paul, Friedemann; Villoslada, Pablo; Brandt, Alexander U.; Green, Ari J.

2018-01-01

Objective To evaluate the inter-rater reliability of semiautomated segmentation of spectral domain optical coherence tomography (OCT) macular volume scans. Methods Macular OCT volume scans of left eyes from 17 subjects (8 patients with MS and 9 healthy controls) were automatically segmented by Heidelberg Eye Explorer (v1.9.3.0) beta-software (Spectralis Viewing Module v6.0.0.7), followed by manual correction by 5 experienced operators from 5 different academic centers. The mean thicknesses within a 6-mm area around the fovea were computed for the retinal nerve fiber layer, ganglion cell layer (GCL), inner plexiform layer (IPL), inner nuclear layer, outer plexiform layer (OPL), and outer nuclear layer (ONL). Intraclass correlation coefficients (ICCs) were calculated for mean layer thickness values. Spatial distribution of ICC values for the segmented volume scans was investigated using heat maps. Results Agreement between raters was good (ICC > 0.84) for all retinal layers, particularly inner retinal layers showed excellent agreement across raters (ICC > 0.96). Spatial distribution of ICC showed highest values in the perimacular area, whereas the ICCs were poorer for the foveola and the more peripheral macular area. The automated segmentation of the OPL and ONL required the most correction and showed the least agreement, whereas differences were less prominent for the remaining layers. Conclusions Automated segmentation with manual correction of macular OCT scans is highly reliable when performed by experienced raters and can thus be applied in multicenter settings. Reliability can be improved by restricting analysis to the perimacular area and compound segmentation of GCL and IPL. PMID:29552598
Validation of the OMERACT Psoriatic Arthritis Magnetic Resonance Imaging Score (PsAMRIS) for the Hand and Foot in a Randomized Placebo-controlled Trial.

PubMed

Glinatsi, Daniel; Bird, Paul; Gandjbakhch, Frederique; Mease, Philip J; Bøyesen, Pernille; Peterfy, Charles G; Conaghan, Philip G; Østergaard, Mikkel

2015-12-01

To assess changes following treatment and the reliability and responsiveness to change of the Outcome Measures in Rheumatology (OMERACT) Psoriatic Arthritis Magnetic Resonance Imaging Score (PsAMRIS) in a randomized controlled trial. Forty patients with PsA randomized to either placebo or abatacept (ABA) had MRI of either 1 hand (n = 20) or 1 foot (n = 20) at baseline and after 6 months. Images were scored blindly twice by 3 independent readers according to the PsAMRIS (for synovitis, tenosynovitis, periarticular inflammation, bone edema, bone erosion, and bone proliferation). Inflammatory features improved numerically but statistically nonsignificantly in the ABA group but not the placebo group. Baseline intrareader intraclass correlation coefficients (ICC) were good (≥ 0.50) to very good (≥ 0.80) for all features in both hand and foot. Baseline interreader ICC were good (ICC 0.72-0.96) for all features, except periarticular inflammation and bone proliferation in the hand and tenosynovitis in the foot (ICC 0.25-0.44). Intrareader and interreader ICC for change scores varied. Guyatt's responsiveness index (GRI) was high for inflammatory features in the hand and metatarsophalangeal joints (GRI -0.67 to -3.13; bone edema not calculable). Minimal change and low prevalence resulted in low ICC and GRI for bone damage. PsAMRIS showed overall good intrareader agreement in the hand and foot, and inflammatory feature scores were responsive to change, suggesting that PsAMRIS may be a valid tool for MRI assessment of hands and feet in PsA clinical trials.
Periodontal repair in dogs: examiner reproducibility in the supraalveolar periodontal defect model.

PubMed

Koo, Ki-Tae; Polimeni, Giuseppe; Albandar, Jasim M; Wikesjö, Ulf M E

2004-06-01

Histometric assessments are routinely used to evaluate biologic events ascertained in histologic sections acquired from animal and human studies. The objective of this study was to evaluate the intra- and inter-examiner reproducibility of histometric assessments in the supraalveolar periodontal defect model. Histometric analysis using incandescent and polarized light microscopy, an attached digital camera system, and a PC-based image analysis system including a custom program for the supraalveolar periodontal defect model was performed on histologic sections acquired from one jaw quadrant in each of 12 dogs. The animals had received an experimental protocol including implantation of a coral biomaterial and guided tissue regeneration (GTR) barrier devices, and were evaluated following a 4-week healing interval. Histometric parameters were recorded and repeated within a 3-month interval by two examiners following brief training. Intra- and inter-examiner reproducibility was assessed using the intra-class correlation coefficient (ICC). Most parameters showed high intra-examiner ICCs. Parameters including defect height, connective tissue repair, bone regeneration (height/area), formation of a junctional epithelium, positioning of the GTR device, ankylosis, root resorption, and defect area yielded an ICC> or 0..9. The ICCs for bone density and biomaterial density were somewhat lower (0.8 and 0.7, respectively). The inter-examiner reproducibility was somewhat lower compared to the intra-examiner reproducibility. Nevertheless, the ICCs were generally high (ICC range: 0.6-0.9). Histometric evaluations in the supraalveolar periodontal defect model yield highly reproducible results, in particular when a single examiner performs the histometric measurements, even when the examiner was exposed to limited training.
Reliability of a rapid hematology stain for sputum cytology*

PubMed Central

Gonçalves, Jéssica; Pizzichini, Emilio; Pizzichini, Marcia Margaret Menezes; Steidle, Leila John Marques; Rocha, Cristiane Cinara; Ferreira, Samira Cardoso; Zimmermann, Célia Tânia

2014-01-01

Objective: To determine the reliability of a rapid hematology stain for the cytological analysis of induced sputum samples. Methods: This was a cross-sectional study comparing the standard technique (May-Grünwald-Giemsa stain) with a rapid hematology stain (Diff-Quik). Of the 50 subjects included in the study, 21 had asthma, 19 had COPD, and 10 were healthy (controls). From the induced sputum samples collected, we prepared four slides: two were stained with May-Grünwald-Giemsa, and two were stained with Diff-Quik. The slides were read independently by two trained researchers blinded to the identification of the slides. The reliability for cell counting using the two techniques was evaluated by determining the intraclass correlation coefficients (ICCs) for intraobserver and interobserver agreement. Agreement in the identification of neutrophilic and eosinophilic sputum between the observers and between the stains was evaluated with kappa statistics. Results: In our comparison of the two staining techniques, the ICCs indicated almost perfect interobserver agreement for neutrophil, eosinophil, and macrophage counts (ICC: 0.98-1.00), as well as substantial agreement for lymphocyte counts (ICC: 0.76-0.83). Intraobserver agreement was almost perfect for neutrophil, eosinophil, and macrophage counts (ICC: 0.96-0.99), whereas it was moderate to substantial for lymphocyte counts (ICC = 0.65 and 0.75 for the two observers, respectively). Interobserver agreement for the identification of eosinophilic and neutrophilic sputum using the two techniques ranged from substantial to almost perfect (kappa range: 0.91-1.00). Conclusions: The use of Diff-Quik can be considered a reliable alternative for the processing of sputum samples. PMID:25029648
Validity and reliability of a new tool to evaluate handwriting difficulties in Parkinson’s disease

PubMed Central

Nackaerts, Evelien; Heremans, Elke; Smits-Engelsman, Bouwien C. M.; Broeder, Sanne; Vandenberghe, Wim; Bergmans, Bruno; Nieuwboer, Alice

2017-01-01

Background Handwriting in Parkinson’s disease (PD) features specific abnormalities which are difficult to assess in clinical practice since no specific tool for evaluation of spontaneous movement is currently available. Objective This study aims to validate the ‘Systematic Screening of Handwriting Difficulties’ (SOS-test) in patients with PD. Methods Handwriting performance of 87 patients and 26 healthy age-matched controls was examined using the SOS-test. Sixty-seven patients were tested a second time within a period of one month. Participants were asked to copy as much as possible of a text within 5 minutes with the instruction to write as neatly and quickly as in daily life. Writing speed (letters in 5 minutes), size (mm) and quality of handwriting were compared. Correlation analysis was performed between SOS outcomes and other fine motor skill measurements and disease characteristics. Intrarater, interrater and test-retest reliability were assessed using the intraclass correlation coefficient (ICC) and Spearman correlation coefficient. Results Patients with PD had a smaller (p = 0.043) and slower (p<0.001) handwriting and showed worse writing quality (p = 0.031) compared to controls. The outcomes of the SOS-test significantly correlated with fine motor skill performance and disease duration and severity. Furthermore, the test showed excellent intrarater, interrater and test-retest reliability (ICC > 0.769 for both groups). Conclusion The SOS-test is a short and effective tool to detect handwriting problems in PD with excellent reliability. It can therefore be recommended as a clinical instrument for standardized screening of handwriting deficits in PD. PMID:28253374
Biomechanical factors associated with time to complete a change of direction cutting maneuver.

PubMed

Marshall, Brendan M; Franklyn-Miller, Andrew D; King, Enda A; Moran, Kieran A; Strike, Siobhán C; Falvey, Éanna C

2014-10-01

Cutting ability is an important aspect of many team sports, however, the biomechanical determinants of cutting performance are not well understood. This study aimed to address this issue by identifying the kinetic and kinematic factors correlated with the time to complete a cutting maneuver. In addition, an analysis of the test-retest reliability of all biomechanical measures was performed. Fifteen (n = 15) elite multidirectional sports players (Gaelic hurling) were recruited, and a 3-dimensional motion capture analysis of a 75° cut was undertaken. The factors associated with cutting time were determined using bivariate Pearson's correlations. Intraclass correlation coefficients (ICCs) were used to examine the test-retest reliability of biomechanical measures. Five biomechanical factors were associated with cutting time (2.28 ± 0.11 seconds): peak ankle power (r = 0.77), peak ankle plantar flexor moment (r = 0.65), range of pelvis lateral tilt (r = -0.54), maximum thorax lateral rotation angle (r = 0.51), and total ground contact time (r = -0.48). Intraclass correlation coefficient scores for these 5 factors, and indeed for the majority of the other biomechanical measures, ranged from good to excellent (ICC >0.60). Explosive force production about the ankle, pelvic control during single-limb support, and torso rotation toward the desired direction of travel were all key factors associated with cutting time. These findings should assist in the development of more effective training programs aimed at improving similar cutting performances. In addition, test-retest reliability scores were generally strong, therefore, motion capture techniques seem well placed to further investigate the determinants of cutting ability.
Reliability and validity of the 12-item WHODAS 2.0 in patients with Kashin-Beck disease.

PubMed

Younus, Mohammad Imran; Wang, Di-Miao; Yu, Fang-Fang; Fang, Hua; Guo, Xiong

2017-09-01

The purpose of this study was to check the reliability and validity of the 12-item Chinese version of the World Health Organization Disability Assessment Schedule 2.0 (WHODAS 2.0) for the assessment of disability in patients with Kashin-Beck disease (KBD). We recruited 219 patients with KBD from the high-risk KBD area in the Shaanxi province, using stratified multistage random sampling. We assessed each patient using the Chinese version of the 12-item WHODAS 2.0 and the Western Ontario and McMaster Universities Index of Osteoarthritis (WOMAC). Statistical evaluations of the instruments consisted of Cronbach's alpha, intraclass correlation coefficient (ICC), confirmatory factor analysis (CFA), and Pearson's correlation coefficient. Cronbach's alpha and ICC for the six domains ranged from 0.704 to 0.906 and 0.690 to 0.852, respectively. A six-factor structure fits the data well (CFI = 0.967, TLI = 0.944, RMSEA = 0.08). Regarding convergent validity, the four domains of the 12-item WHODAS 2.0 (getting around, self-care, life activity, and participation) showed moderate-to-strong correlation for all three domains of the WOMAC (0.428 < |r| < 0.804). Regarding divergent validity, the two domains of the 12-item WHODAS 2.0 (understanding and communication, and getting along with people) showed weak correlation for the three domains of WOMAC (0.182 < |r| < 0.295). The Chinese version of 12-item WHODAS 2.0 questionnaire is a reliable and valid instrument when administered to KBD patients.
Three-dimensional ultrasonography of the breast; An adequate replacement for MRI in neoadjuvant chemotherapy tumour response evaluation? - RESPONDER trial.

PubMed

van Egdom, L S E; Lagendijk, M; Heijkoop, E H M; Koning, A H J; van Deurzen, C H M; Jager, A; van Lankeren, W; Koppert, L B

2018-07-01

Accurate measurement of tumour response during and after neoadjuvant chemotherapy (NAC) is important and may influence treatment decisions in invasive breast cancer patients. Breast MRI forms the gold standard but is more burdensome, time consuming and costly. In this study response measurement was done with 3-D ultrasound by Automated Breast Volume Scanner (ABVS) and compared to breast MRI. Moreover, patient satisfaction with both techniques was compared. A single-institution, prospective observational pilot study evaluating tumour response by ABVS in addition to breast MRI (standard care) was performed in 25 invasive breast cancer patients receiving NAC. Tumour response was evaluated comparing longest tumour diameters as well as tumour volumes at predefined time points using Bland-Altman analysis. Volume measurements for breast MRI were obtained using a fully immersive virtual reality system (a Barco I-Space) and V-Scope software. Same software was used to obtain ABVS volume measurements using an in-house developed desktop VR system. Inter- and intra-observer agreement was evaluated by Intraclass Correlation Coefficient (ICC). Twenty-five patients were eligible for baseline measurement, 20 for a mid-NAC response evaluation, and five for a post-NAC response evaluation. MRI and ABVS showed absolute concordance in 73% of patients for the mid-NAC evaluation, with a 'good' correlation for the difference in longest diameter measurement (ICC 0.73, p < 0.01) as compared to baseline assessment. Concerning difference in volume measurement in the mid-NAC response evaluation showed a 'fair' correlation (ICC 0.52, p < 0.01) and in the post-NAC response evaluation an 'excellent' correlation (ICC 0.98, p < 0.01). 'Excellent' inter- and intra-observer agreement was found (ICC 0.88, p < 0.01) with comparable limits of agreement (LOA) for observer 1 and 2 in both diameter and volume measurement. Patient satisfaction was higher for ABVS compared to breast MRI, 93% versus 12% respectively. ABVS showed 'good' correlation with MRI tumour response evaluation in breast cancer patients during NAC with 'excellent' inter- and intra-observer agreement. ABVS has patients' preference over breast MRI and could be considered as alternative to breast MRI, in case results on an on-going prospective trial confirm these results (NTR6799). Copyright © 2018 Elsevier B.V. All rights reserved.
Reliability of the Star Excursion Balance Test and Two New Similar Protocols to Measure Trunk Postural Control.

PubMed

López-Plaza, Diego; Juan-Recio, Casto; Barbado, David; Ruiz-Pérez, Iñaki; Vera-Garcia, Francisco J

2018-05-18

Although the Star Excursion Balance test (SEBT) has shown a good intrasession reliability, the intersession reliability of this test has not been deeply studied. Furthermore, there is an evident high influence of the lower limbs in the performance of the SEBT, so even if it has been used to measure core stability, it is possibly not the most suitable measurement. The aims of this study were to (1) to assess the absolute and relative between-session reliability of the SEBT and 2 novel variations of this test to assess trunk postural control while sitting, ie, the Star Excursion Sitting Test (SEST) and the Star Excursion Timing Test (SETT); and (2) to analyze the relationships between these 3 test scores. Correlational and reliability test-retest study. Controlled laboratory environment. Twenty-seven physically active men (age: 24.54 ± 3.05 years). Relative and absolute reliability of the SEBT, SEST, and SETT were calculated through the intraclass correlation coefficient (ICC) and standard error of measurement (SEM), respectively. A Pearson correlation analysis was carried out between the variables of the 3 tests. Maximum normalized reach distances were assessed for different SEBT and SEST directions. In addition, composite indexes were calculated for SEBT, SEST, and SETT. The SEBT (dominant leg: ICC = 0.87 [0.73-0.94], SEM = 2.12 [1.66-2.93]; nondominant leg: ICC = 0.74 [0.50-0.87], SEM = 3.23 [2.54-4.45]), SEST (ICC = 0.85 [0.68-0.92], SEM = 1.27 [1.03-1.80]), and SETT (ICC = 0.61 [0.30-0.80], SEM = 2.31 [1.82-3.17]) composite indexes showed moderate-to-high 1-month reliability. A learning effect was detected for some SEBT and SEST directions and for SEST and SETT composite indexes. No significant correlations were found between SEBT and its 2 variations (r ≤ .366; P > .05). A significant correlation was found between the SEST and SETT composite indexes (r = .520; P > .01). SEBT, SEST, and SETT are reliable field protocols to measure postural control. However, whereas the SEBT assesses postural control in single-leg stance, SEST and SETT provide trunk postural control measures with lower influence of the lower-limbs. To be determined. Copyright © 2018 American Academy of Physical Medicine and Rehabilitation. Published by Elsevier Inc. All rights reserved.
Relative and absolute reliability of measures of linoleic acid-derived oxylipins in human plasma.

PubMed

Gouveia-Figueira, Sandra; Bosson, Jenny A; Unosson, Jon; Behndig, Annelie F; Nording, Malin L; Fowler, Christopher J

2015-09-01

Modern analytical techniques allow for the measurement of oxylipins derived from linoleic acid in biological samples. Most validatory work has concerned extraction techniques, repeated analysis of aliquots from the same biological sample, and the influence of external factors such as diet and heparin treatment upon their levels, whereas less is known about the relative and absolute reliability of measurements undertaken on different days. A cohort of nineteen healthy males were used, where samples were taken at the same time of day on two occasions, at least 7 days apart. Relative reliability was assessed using Lin's concordance correlation coefficients (CCC) and intraclass correlation coefficients (ICC). Absolute reliability was assessed by Bland-Altman analyses. Nine linoleic acid oxylipins were investigated. ICC and CCC values ranged from acceptable (0.56 [13-HODE]) to poor (near zero [9(10)- and 12(13)-EpOME]). Bland-Altman limits of agreement were in general quite wide, ranging from ±0.5 (12,13-DiHOME) to ±2 (9(10)-EpOME; log10 scale). It is concluded that relative reliability of linoleic acid-derived oxylipins varies between lipids with compounds such as the HODEs showing better relative reliability than compounds such as the EpOMEs. These differences should be kept in mind when designing and interpreting experiments correlating plasma levels of these lipids with factors such as age, body mass index, rating scales etc. Copyright © 2015 Elsevier Inc. All rights reserved.
Caliper Method Versus Digital Photogrammetry for Assessing Arch Height Index in Pregnant Women.

PubMed

Harrison, Kathryn D; McCrory, Jean L

2016-11-01

Foot anthropometry may be altered during pregnancy. Pregnant women often report lower-extremity pain that may be related to these alterations. The Arch Height Index Measurement System is a common method of foot arch assessment; however, the required calipers are costly and are not widely available. Thus, we compared the reliability of a digital photogrammetry method of arch height index (AHI) assessment with that of the Arch Height Index Measurement System. Ten pregnant women (mean ± SD: age, 29 ± 4 years; height, 166.9 ± 6.8 cm; weight, 63.3 ± 8.8 kg) in their second trimester were recruited to participate, along with a control group of 10 nulliparous weight-matched women (mean ± SD: age, 22 ± 2 years; height, 164.6 ± 4.8 cm; weight, 61.5 ± 8.1 kg). During the second and third trimesters, and once postpartum, AHI was assessed using calipers and using digital photogrammetry. Mixed model absolute agreement type intraclass correlation coefficient (ICC) was used to determine correlation between the two methods for sitting and standing AHI. The ICC results for sitting AHI only (0.819-0.968) were reasonable for clinical measures; ICC values for standing AHI (0.674-0.789) did not reach values deemed reasonable for clinical use. Caliper and digital photogrammetry methods of AHI assessment are correlated in pregnant women; however, for standing AHI, the correlation is not sufficient for clinical use. Photogrammetry may still be appropriate for clinical use, as long as values from this method are not substituted directly for results obtained from calipers.
A new approach to the measurement of pelvic asymmetry: proposed methods and reliability.

PubMed

Gnat, Rafael; Biały, Maciej

2015-05-01

This is a methodological study presenting a novel method of pelvic asymmetry (PA) measurement for use in the research laboratory setting. The purpose of the study is (1) to establish intrarater and interrater reliability of the proposed measures of PA, (2) to verify the influence of repeated measurements on the reliability, and (3) to assess correlation between the proposed measures of PA. Twelve healthy volunteers participated, and 2 teams of raters were involved. Registration of anatomic landmarks' positions in the optical motion capture system was repeated 3 times. Two asymmetry indexes were calculated: for pelvic torsion and for lateral pelvic tilt. Interclass correlation coefficients (ICCs), standard errors of measurement, and smallest detectable differences were used to describe the intrarater and interrater reliability of the 2 indexes. After 2 repeated registrations of pelvic landmarks' positions, the reliability of our asymmetry indexes was good and excellent. The ICCs for intrarater reliability ranged from 0.96 to 0.97; the ICCs for interrater reliability ranged 0.81 to 0.90. There was moderate, nonsignificant correlation between asymmetry indexes for pelvis torsion and for lateral pelvic tilt (r = 0.45, P = .14). The 2 proposed asymmetry indexes showed good and excellent intrarater and interrater reliability after 2 repeated registrations of pelvic landmarks' positions and thus may be useful in the research laboratory setting. However, these indexes are not strongly correlated, which suggests that the 2 types of PA may constitute different clinical entities. Copyright © 2015 National University of Health Sciences. Published by Elsevier Inc. All rights reserved.
Reliability of a visual scoring system with fluorescent tracers to assess dermal pesticide exposure.

PubMed

Aragon, Aurora; Blanco, Luis; Lopez, Lylliam; Liden, Carola; Nise, Gun; Wesseling, Catharina

2004-10-01

We modified Fenske's semi-quantitative 'visual scoring system' of fluorescent tracer deposited on the skin of pesticide applicators and evaluated its reproducibility in the Nicaraguan setting. The body surface of 33 farmers, divided into 31 segments, was videotaped in the field after spraying with a pesticide solution containing a fluorescent tracer. A portable UV lamp was used for illumination in a foldaway dark room. The videos of five farmers were randomly selected. The scoring was based on a matrix with extension of fluorescent patterns (scale 0-5) on the ordinate and intensity (scale 0-5) on the abscissa, with the product of these two ranks as the final score for each body segment (0-25). Five medical students rated and evaluated the quality of 155 video images having undergone 4 h of training. Cronbach alpha coefficients and two-way random effects intraclass correlation coefficients (ICC) with absolute agreement were computed to assess inter-rater reliability. Consistency was high (Cronbach alpha = 0.96), but the scores differed substantially between raters. The overall ICC was satisfactory [0.75; 95% confidence interval (CI) = 0.62-0.83], but it was lower for intensity (0.54; 95% CI = 0.40-0.66) and higher for extension (0.80; 95% CI = 0.71-0.86). ICCs were lowest for images with low scores and evaluated as low quality, and highest for images with high scores and high quality. Inter-rater reliability coefficients indicate repeatability of the scoring system. However, field conditions for recording fluorescence should be improved to achieve higher quality images, and training should emphasize a better mechanism for the reading of body areas with low contamination.

Performance characteristics of the Kin-Com dynamometer.

PubMed

Mayhew, T P; Rothstein, J M; Finucane, S D; Lamb, R L

1994-11-01

The purpose of this study was to assess the performance characteristics of a Kin-Com dynamometer (model #500-11) under controlled conditions. Comparisons were made between measurements of force, angle, and velocity obtained from the Kin-Com and measurements acquired from an external recording system of known weights, angles, and user-set velocities. The strength of the linear relationships between measurements obtained with the different recording systems was analyzed using a coefficient of determination (r2). An intraclass correlation coefficient (ICC[2,1]) was used to examine the reliability of the force, angle, and velocity measurements obtained with each recording system on 2 different days. In all conditions, the coefficient of determination for the force, angle, and velocity comparisons was above .99. The ICC for between-day comparisons for all force, angle, and velocity measurements was above .99. Our results indicate that the static measurements of force and angle that are necessary for use in the gravity-correction procedure and isometric testing are accurate and replicable between days. The Kin-Com dynamometer's control system regulating lever arm velocity is also accurate and replicable under a no-load condition. It was ascertained during the velocity testing that the use of any acceleration and deceleration mode other than "high" resulted in a loss of excursion of the lever arm.
Development and Reliability Testing of a Fast-Food Restaurant Observation Form.

PubMed

Rimkus, Leah; Ohri-Vachaspati, Punam; Powell, Lisa M; Zenk, Shannon N; Quinn, Christopher M; Barker, Dianne C; Pugach, Oksana; Resnick, Elissa A; Chaloupka, Frank J

2015-01-01

To develop a reliable observational data collection instrument to measure characteristics of the fast-food restaurant environment likely to influence consumer behaviors, including product availability, pricing, and promotion. The study used observational data collection. Restaurants were in the Chicago Metropolitan Statistical Area. A total of 131 chain fast-food restaurant outlets were included. Interrater reliability was measured for product availability, pricing, and promotion measures on a fast-food restaurant observational data collection instrument. Analysis was done with Cohen's κ coefficient and proportion of overall agreement for categorical variables and intraclass correlation coefficient (ICC) for continuous variables. Interrater reliability, as measured by average κ coefficient, was .79 for menu characteristics, .84 for kids' menu characteristics, .92 for food availability and sizes, .85 for beverage availability and sizes, .78 for measures on the availability of nutrition information,.75 for characteristics of exterior advertisements, and .62 and .90 for exterior and interior characteristics measures, respectively. For continuous measures, average ICC was .88 for food pricing measures, .83 for beverage prices, and .65 for counts of exterior advertisements. Over 85% of measures demonstrated substantial or almost perfect agreement. Although some measures required revision or protocol clarification, results from this study suggest that the instrument may be used to reliably measure the fast-food restaurant environment.
Reliability, Validity, and Ability to Identify Fall Status of the Balance Evaluation Systems Test, Mini-Balance Evaluation Systems Test, and Brief-Balance Evaluation Systems Test in Older People Living in the Community.

PubMed

Marques, Alda; Almeida, Sara; Carvalho, Joana; Cruz, Joana; Oliveira, Ana; Jácome, Cristina

2016-12-01

To assess the reliability, validity, and ability to identify fall status of the Balance Evaluation Systems Test (BESTest), Mini-BESTest, and Brief-BESTest, compared with the Berg Balance Scale (BBS), in older people living in the community. Cross-sectional. Community centers. Older adults (N=122; mean age ± SD, 76±9y). Not applicable. Participants reported on falls history in the preceding year and completed the Activities-Specific Balance Confidence (ABC) Scale. The BBS, BESTest, and the Five Times Sit-To-Stand Test were administered. Interrater (2 physiotherapists) and test-retest relative (48-72h) and absolute reliabilities were explored with the intraclass correlation coefficient (ICC) equation (2,1) and the Bland and Altman method. Minimal detectable changes at the 95% confidence level (MDC 95 ) were established. Validity was assessed by correlating the balance tests with each other and with the ABC Scale (Spearman correlation coefficients-ρ). Receiver operating characteristics assessed the ability of each balance test to differentiate between people with and without a history of falls. All balance tests presented good to excellent interrater (ICC=.71-.93) and test-retest (ICC=.50-.82) relative reliability, with no evidence of bias. MDC 95 values were 4.6, 9, 3.8, and 4.1 points for the BBS, BESTest, Mini-BESTest, and Brief-BESTest, respectively. All tests were significantly correlated with each other (ρ=.83-.96) and with the ABC Scale (ρ=.46-.61). Acceptable ability to identify fall status (areas under the curve, .71-.78) was found for all tests. Cutoff points were 48.5, 82, 19.5, and 12.5 points for the BBS, BESTest, Mini-BESTest, and Brief-BESTest, respectively. All balance tests are reliable, valid, and able to identify fall status in older people living in the community. Therefore, the choice of which test to use will depend on the level of balance impairment, purpose, and time availability. Copyright Â© 2016. Published by Elsevier Inc.
Reliability and scientific use of a surgical planning software for anterior cervical discectomy and fusion (ACDF).

PubMed

Barth, Martin; Weiß, Christel; Brenke, Christopher; Schmieder, Kirsten

2017-04-01

Software-based planning of a spinal implant inheres in the promise of precision and superior results. The purpose of the study was to analyze the measurement reliability, prognostic value, and scientific use of a surgical planning software in patients receiving anterior cervical discectomy and fusion (ACDF). Lateral neutral, flexion, and extension radiographs of patients receiving tailored cages as suggested by the planning software were available for analysis. Differences of vertebral wedging angles and segmental height of all cervical segments were determined at different timepoints using intraclass correlation coefficients (ICC). Cervical lordosis (C2/C7), segmental heights, global, and segmental range of motion (ROM) were determined at different timepoints. Clinical and radiological variables were correlated 12 months after surgery. 282 radiographs of 35 patients with a mean age of 53.1 ± 12.0 years were analyzed. Measurement of segmental height was highly accurate with an ICC near to 1, but angle measurements showed low ICC values. Likewise, the ICCs of the prognosticated values were low. Postoperatively, there was a significant decrease of segmental height (p < 0.0001) and loss of C2/C7 ROM (p = 0.036). ROM of unfused segments also significantly decreased (p = 0.016). High NDI was associated with low subsidence rates. The surgical planning software showed high accuracy in the measurement of height differences and lower accuracy values with angle measurements. Both the prognosticated height and angle values were arbitrary. Global ROM, ROM of the fused and intact segments, is restricted after ACDF.
Validation of the iPhone app using the force platform to estimate vertical jump height.

PubMed

Carlos-Vivas, Jorge; Martin-Martinez, Juan P; Hernandez-Mocholi, Miguel A; Perez-Gomez, Jorge

2018-03-01

Vertical jump performance has been evaluated with several devices: force platforms, contact mats, Vertec, accelerometers, infrared cameras and high-velocity cameras; however, the force platform is considered the gold standard for measuring vertical jump height. The purpose of this study was to validate an iPhone app called My Jump, that measures vertical jump height by comparing it with other methods that use the force platform to estimate vertical jump height, namely, vertical velocity at take-off and time in the air. A total of 40 sport sciences students (age 21.4±1.9 years) completed five countermovement jumps (CMJs) over a force platform. Thus, 200 CMJ heights were evaluated from the vertical velocity at take-off and the time in the air using the force platform, and from the time in the air with the My Jump mobile application. The height obtained was compared using the intraclass correlation coefficient (ICC). Correlation between APP and force platform using the time in the air was perfect (ICC=1.000, P<0.001). Correlation between APP and force platform using the vertical velocity at take-off was also very high (ICC=0.996, P<0.001), with an error margin of 0.78%. Therefore, these results showed that application, My Jump, is an appropriate method to evaluate the vertical jump performance; however, vertical jump height is slightly overestimated compared with that of the force platform.
Determination of heart rate variability with an electronic stethoscope.

PubMed

Kamran, Haroon; Naggar, Isaac; Oniyuke, Francisca; Palomeque, Mercy; Chokshi, Priya; Salciccioli, Louis; Stewart, Mark; Lazar, Jason M

2013-02-01

Heart rate variability (HRV) is widely used to characterize cardiac autonomic function by measuring beat-to-beat alterations in heart rate. Decreased HRV has been found predictive of worse cardiovascular (CV) outcomes. HRV is determined from time intervals between QRS complexes recorded by electrocardiography (ECG) for several minutes to 24 h. Although cardiac auscultation with a stethoscope is performed routinely on patients, the human ear cannot detect heart sound time intervals. The electronic stethoscope digitally processes heart sounds, from which cardiac time intervals can be obtained. Accordingly, the objective of this study was to determine the feasibility of obtaining HRV from electronically recorded heart sounds. We prospectively studied 50 subjects with and without CV risk factors/disease and simultaneously recorded single lead ECG and heart sounds for 2 min. Time and frequency measures of HRV were calculated from R-R and S1-S1 intervals and were compared using intra-class correlation coefficients (ICC). The majority of the indices were strongly correlated (ICC 0.73-1.0), while the remaining indices were moderately correlated (ICC 0.56-0.63). In conclusion, we found HRV measures determined from S1-S1 are in agreement with those determined by single lead ECG, and we demonstrate and discuss differences in the measures in detail. In addition to characterizing cardiac murmurs and time intervals, the electronic stethoscope holds promise as a convenient low-cost tool to determine HRV in the hospital and outpatient settings as a practical extension of the physical examination.
First quality score for referral letters in gastroenterology-a validation study.

PubMed

Eskeland, Sigrun Losada; Brunborg, Cathrine; Seip, Birgitte; Wiencke, Kristine; Hovde, Øistein; Owen, Tanja; Skogestad, Erik; Huppertz-Hauss, Gert; Halvorsen, Fred-Arne; Garborg, Kjetil; Aabakken, Lars; de Lange, Thomas

2016-10-08

To create and validate an objective and reliable score to assess referral quality in gastroenterology. An observational multicentre study. 25 gastroenterologists participated in selecting variables for a Thirty Point Score (TPS) for quality assessment of referrals to gastroenterology specialist healthcare for 9 common indications. From May to September 2014, 7 hospitals from the South-Eastern Norway Regional Health Authority participated in collecting and scoring 327 referrals to a gastroenterologist. Correlation between the TPS and a visual analogue scale (VAS) for referral quality. The 327 referrals had an average TPS of 13.2 (range 1-25) and an average VAS of 4.7 (range 0.2-9.5). The reliability of the score was excellent, with an intra-rater intraclass correlation coefficient (ICC) of 0.87 and inter-rater ICC of 0.91. The overall correlation between the TPS and the VAS was moderate (r=0.42), and ranged from fair to substantial for the various indications. Mean agreement was good (ICC=0.47, 95% CI (0.34 to 0.57)), ranging from poor to good. The TPS is reliable, objective and shows good agreement with the subjective VAS. The score may be a useful tool for assessing referral quality in gastroenterology, particularly important when evaluating the effect of interventions to improve referral quality. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/.
Reliability of levator scapulae index in subjects with and without scapular downward rotation syndrome.

PubMed

Lee, Ji-Hyun; Cynn, Heon-Seock; Choi, Woo-Jeong; Jeong, Hyo-Jung; Yoon, Tae-Lim

2016-05-01

The objective of this study was to introduce levator scapulae (LS) measurement using a caliper and the levator scapulae index (LSI) and to investigate intra- and interrater reliability of the LSI in subjects with and without scapular downward rotation syndrome (SDRS). Two raters measured LS length twice in 38 subjects (19 with SDRS and 19 without SDRS). For reliability testing, intraclass correlation coefficients (ICCs), standard error of measurement (SEM), and minimal detectable change (MDC) were calculated. Intrarater reliability analysis resulted with ICCs ranging from 0.94 to 0.98 in subjects with SDRS and 0.96 to 0.98 in subjects without SDRS. These results represented that intrarater reliability in both groups were excellent for measuring LS length with the LSI. Interrater reliability was good (ICC: 0.82) in subjects with SDRS; however, interrater reliability was moderate (ICC: 0.75) in subjects without SDRS. Additionally, SEM and MDC were 0.13% and 0.36% in subjects with SDRS and 0.35% and 0.97% in subjects without SDRS. In subjects with SDRS, low dispersion of the measurement errors and MDC were shown. This study suggested that the LSI is a reliable method to measure LS length and is more reliable for subjects with SDRS. Copyright © 2015 Elsevier Ltd. All rights reserved.
The reliability of an instrumented start block analysis system.

PubMed

Tor, Elaine; Pease, David L; Ball, Kevin A

2015-02-01

The swimming start is highly influential to overall competition performance. Therefore, it is paramount to develop reliable methods to perform accurate biomechanical analysis of start performance for training and research. The Wetplate Analysis System is a custom-made force plate system developed by the Australian Institute of Sport--Aquatic Testing, Training and Research Unit (AIS ATTRU). This sophisticated system combines both force data and 2D digitization to measure a number of kinetic and kinematic parameter values in an attempt to evaluate start performance. Fourteen elite swimmers performed two maximal effort dives (performance was defined as time from start signal to 15 m) over two separate testing sessions. Intraclass correlation coefficients (ICC) were used to determine each parameter's reliability. The kinetic parameters all had ICC greater than 0.9 except the time of peak vertical force (0.742). This may have been due to variations in movement initiation after the starting signal between trials. The kinematic and time parameters also had ICC greater than 0.9 apart from for the time of maximum depth (0.719). This parameter was lower due to the swimmers varying their depth between trials. Based on the high ICC scores for all parameters, the Wetplate Analysis System is suitable for biomechanical analysis of swimming starts.
[Inter-rater agreement on self-reported exposure to ergonomic risk factors for the upper extremities among mechanic assemblers in an automotive industry].

PubMed

d'Errico, Angelo; Fontana, Dario; Merogno, Angela

2016-01-01

to assess reproducibility of self-reported exposure to ergonomic hazards for the upper limbs, measured through a questionnaire based on a diffused checklist for the assessment of ergonomic risk (OCRA) in a sample of mechanical assemblers of an automotive industry. cross-sectional study; reproducibility was assessed as interrater agreement of a composite index of ergonomic risk, estimated through the intraclass correlation coefficient (ICC). 58 mechanical assemblers, working in 29 twin areas, characterised by same work stations and tasks. composite index of ergonomic risk for the upper limbs. reproducibility of the ergonomic index was high in the overall sample (ICC: 0.81) and it was higher for the twin areas employing same-gender workers (ICC: 0.96), compared to those with workers of the opposite gender (ICC: 0.66). these results indicate that a questionnaire measuring with a great detail the exposure to the main ergonomic risk factors for the upper limbs, as the one based on the OCRA checklist used for this study, would allow to obtain a highly reproducible ergonomic index. If its validity against the corresponding observational checklist will be found elevated by future studies, this questionnaire may represent a useful tool for a preliminary assessment of workers' exposure to ergonomic hazards for the upper limbs.
Simultaneous Validation of Seven Physical Activity Questionnaires Used in Japanese Cohorts for Estimating Energy Expenditure: A Doubly Labeled Water Study.

PubMed

Sasai, Hiroyuki; Nakata, Yoshio; Murakami, Haruka; Kawakami, Ryoko; Nakae, Satoshi; Tanaka, Shigeho; Ishikawa-Takata, Kazuko; Yamada, Yosuke; Miyachi, Motohiko

2018-04-28

Physical activity questionnaires (PAQs) used in large-scale Japanese cohorts have rarely been simultaneously validated against the gold standard doubly labeled water (DLW) method. This study examined the validity of seven PAQs used in Japan for estimating energy expenditure against the DLW method. Twenty healthy Japanese adults (9 men; mean age, 32.4 [standard deviation {SD}, 9.4] years, mainly researchers and students) participated in this study. Fifteen-day daily total energy expenditure (TEE) and basal metabolic rate (BMR) were measured using the DLW method and a metabolic chamber, respectively. Activity energy expenditure (AEE) was calculated as TEE - BMR - 0.1 × TEE. Seven PAQs were self-administered to estimate TEE and AEE. The mean measured values of TEE and AEE were 2,294 (SD, 318) kcal/day and 721 (SD, 161) kcal/day, respectively. All of the PAQs indicated moderate-to-strong correlations with the DLW method in TEE (rho = 0.57-0.84). Two PAQs (Japan Public Health Center Study [JPHC]-PAQ Short and JPHC-PAQ Long) showed significant equivalence in TEE and moderate intra-class correlation coefficients (ICC). None of the PAQs showed significantly equivalent AEE estimates, with differences ranging from -547 to 77 kcal/day. Correlations and ICCs in AEE were mostly weak or fair (rho = 0.02-0.54, and ICC = 0.00-0.44). Only JPHC-PAQ Short provided significant and fair agreement with the DLW method. TEE estimated by the PAQs showed moderate or strong correlations with the results of DLW. Two PAQs showed equivalent TEE and moderate agreement. None of the PAQs showed equivalent AEE estimation to the gold standard, with weak-to-fair correlations and agreements. Further studies with larger sample sizes are needed to confirm these findings.
Within-person reproducibility of red blood cell mercury over a 10- to 15-year period among women in the Nurses' Health Study II.

PubMed

Kioumourtzoglou, Marianthi-Anna; Roberts, Andrea L; Nielsen, Flemming; Tworoger, Shelley S; Grandjean, Philippe; Weisskopf, Marc G

2016-01-01

Most epidemiologic studies of methylmercury (MeHg) health effects rely on a single measurement of a MeHg biomarker to assess long-term exposures. Long-term reproducibility data are, therefore, needed to assess the reliability of a single measure to reflect long-term exposures. In this study, we assessed within-person reproducibility of red blood cell (RBC) mercury (Hg), a marker of methyl-mercury, over 10-15 years in a sample of 57 women. Fifty-seven women from the Nurses' Health Study II provided two blood samples 10-15-years apart (median: 12 years), which were analyzed for mercury levels in the red blood cells (B-Hg*). To characterize within-person reproducibility, we estimated correlation and intraclass correlation coefficients (r and ICC) across the two samples. Further, we compared different prediction models, including variables on fish and seafood consumption, for B-Hg* at the first sample, using leave-one-out cross-validation to assess predictive ability. Overall, we observed strong correlations over 10-15 years (r=0.69), as well as a high ICC (0.67; 95% CI: 0.49, 0.79). Fish and seafood consumption reported concurrently with the first B-Hg* sample accounted for 26.8% of the variability in that B-Hg*, giving a correlation of r=0.52. Despite decreasing B-Hg* levels over time, we observed strong correlations and high ICC estimates across B-Hg* measured 10-15 years apart, suggesting good relative within-person stability over time. Our results indicate that a single measurement of B-Hg* likely is adequate to represent long-term exposures.
Feasibility and reproducibility of feature-tracking-based strain and strain rate measures of the left ventricle in different diseases and genders.

PubMed

Maceira, Alicia M; Tuset-Sanchis, Luis; López-Garrido, Miguel; San Andres, Marta; López-Lereu, M Pilar; Monmeneu, Jose V; García-González, M Pilar; Higueras, Laura

2018-05-01

The measurement of myocardial deformation by strain analysis is an evolving tool to quantify regional and global myocardial function. To assess the feasibility and reproducibility of myocardial strain/strain rate measurements with magnetic resonance feature tracking (MR-FT) in healthy subjects and in patient groups. Prospective study. Sixty patients (20 hypertensives with left ventricular (LV) hypertrophy (H); 20 nonischemic dilated cardiomyopathy (D); 20 ischemic heart disease (I); as well as 20 controls (C) were included, 10 men and 10 women in each group. A 1.5T MR protocol including steady-state free precession (SSFP) cine sequences in the standard views and late enhancement sequences. LV volumes, mass, global and regional radial, circumferential, and longitudinal strain/strain rate were measured using CVI42 software. The analysis time was recorded. Intraobserver and interobserver agreement and intraclass correlation coefficients (ICC) were obtained for reproducibility assessment as well as differences according to gender and group of pertinence. Strain/strain rate analysis could be achieved in all subjects. The average analysis time was 14 ± 3 minutes. The average intraobserver ICC was excellent (ICC >0.90) for strain and good (ICC >0.75) for strain rate. Reproducibility of strain measurements was good to excellent (ICC >0.75) for all groups of subjects and both genders. Reproducibility of strain measurements was good for basal segments (ICC >0.75) and excellent for middle and apical segments (ICC >0.90). Reproducibility of strain rate measurements was moderate for basal segments (ICC >0.50) and good for middle and apical segments. MR-FT for strain/strain rate analysis is a feasible and highly reproducible technique. CVI42 FT analysis was equally feasible and reproducible in various pathologies and between genders. Better reproducibility was seen globally for middle and apical segments, which needs further clarification. 3 Technical Efficacy Stage 2 J. Magn. Reson. Imaging 2018;47:1415-1425. © 2017 International Society for Magnetic Resonance in Medicine.
Precision analysis of a quantitative CT liver surface nodularity score.

PubMed

Smith, Andrew; Varney, Elliot; Zand, Kevin; Lewis, Tara; Sirous, Reza; York, James; Florez, Edward; Abou Elkassem, Asser; Howard-Claudio, Candace M; Roda, Manohar; Parker, Ellen; Scortegagna, Eduardo; Joyner, David; Sandlin, David; Newsome, Ashley; Brewster, Parker; Lirette, Seth T; Griswold, Michael

2018-04-26

To evaluate precision of a software-based liver surface nodularity (LSN) score derived from CT images. An anthropomorphic CT phantom was constructed with simulated liver containing smooth and nodular segments at the surface and simulated visceral and subcutaneous fat components. The phantom was scanned multiple times on a single CT scanner with adjustment of image acquisition and reconstruction parameters (N = 34) and on 22 different CT scanners from 4 manufacturers at 12 imaging centers. LSN scores were obtained using a software-based method. Repeatability and reproducibility were evaluated by intraclass correlation (ICC) and coefficient of variation. Using abdominal CT images from 68 patients with various stages of chronic liver disease, inter-observer agreement and test-retest repeatability among 12 readers assessing LSN by software- vs. visual-based scoring methods were evaluated by ICC. There was excellent repeatability of LSN scores (ICC:0.79-0.99) using the CT phantom and routine image acquisition and reconstruction parameters (kVp 100-140, mA 200-400, and auto-mA, section thickness 1.25-5.0 mm, field of view 35-50 cm, and smooth or standard kernels). There was excellent reproducibility (smooth ICC: 0.97; 95% CI 0.95, 0.99; CV: 7%; nodular ICC: 0.94; 95% CI 0.89, 0.97; CV: 8%) for LSN scores derived from CT images from 22 different scanners. Inter-observer agreement for the software-based LSN scoring method was excellent (ICC: 0.84; 95% CI 0.79, 0.88; CV: 28%) vs. good for the visual-based method (ICC: 0.61; 95% CI 0.51, 0.69; CV: 43%). Test-retest repeatability for the software-based LSN scoring method was excellent (ICC: 0.82; 95% CI 0.79, 0.84; CV: 12%). The software-based LSN score is a quantitative CT imaging biomarker with excellent repeatability, reproducibility, inter-observer agreement, and test-retest repeatability.
Method to assess the temporal persistence of potential biometric features: Application to oculomotor, gait, face and brain structure databases

PubMed Central

Nixon, Mark S.; Komogortsev, Oleg V.

2017-01-01

We introduce the intraclass correlation coefficient (ICC) to the biometric community as an index of the temporal persistence, or stability, of a single biometric feature. It requires, as input, a feature on an interval or ratio scale, and which is reasonably normally distributed, and it can only be calculated if each subject is tested on 2 or more occasions. For a biometric system, with multiple features available for selection, the ICC can be used to measure the relative stability of each feature. We show, for 14 distinct data sets (1 synthetic, 8 eye-movement-related, 2 gait-related, and 2 face-recognition-related, and one brain-structure-related), that selecting the most stable features, based on the ICC, resulted in the best biometric performance generally. Analyses based on using only the most stable features produced superior Rank-1-Identification Rate (Rank-1-IR) performance in 12 of 14 databases (p = 0.0065, one-tailed), when compared to other sets of features, including the set of all features. For Equal Error Rate (EER), using a subset of only high-ICC features also produced superior performance in 12 of 14 databases (p = 0. 0065, one-tailed). In general, then, for our databases, prescreening potential biometric features, and choosing only highly reliable features yields better performance than choosing lower ICC features or than choosing all features combined. We also determined that, as the ICC of a group of features increases, the median of the genuine similarity score distribution increases and the spread of this distribution decreases. There was no statistically significant similar relationships for the impostor distributions. We believe that the ICC will find many uses in biometric research. In case of the eye movement-driven biometrics, the use of reliable features, as measured by ICC, allowed to us achieve the authentication performance with EER = 2.01%, which was not possible before. PMID:28575030
Method to assess the temporal persistence of potential biometric features: Application to oculomotor, gait, face and brain structure databases.

PubMed

Friedman, Lee; Nixon, Mark S; Komogortsev, Oleg V

2017-01-01

We introduce the intraclass correlation coefficient (ICC) to the biometric community as an index of the temporal persistence, or stability, of a single biometric feature. It requires, as input, a feature on an interval or ratio scale, and which is reasonably normally distributed, and it can only be calculated if each subject is tested on 2 or more occasions. For a biometric system, with multiple features available for selection, the ICC can be used to measure the relative stability of each feature. We show, for 14 distinct data sets (1 synthetic, 8 eye-movement-related, 2 gait-related, and 2 face-recognition-related, and one brain-structure-related), that selecting the most stable features, based on the ICC, resulted in the best biometric performance generally. Analyses based on using only the most stable features produced superior Rank-1-Identification Rate (Rank-1-IR) performance in 12 of 14 databases (p = 0.0065, one-tailed), when compared to other sets of features, including the set of all features. For Equal Error Rate (EER), using a subset of only high-ICC features also produced superior performance in 12 of 14 databases (p = 0. 0065, one-tailed). In general, then, for our databases, prescreening potential biometric features, and choosing only highly reliable features yields better performance than choosing lower ICC features or than choosing all features combined. We also determined that, as the ICC of a group of features increases, the median of the genuine similarity score distribution increases and the spread of this distribution decreases. There was no statistically significant similar relationships for the impostor distributions. We believe that the ICC will find many uses in biometric research. In case of the eye movement-driven biometrics, the use of reliable features, as measured by ICC, allowed to us achieve the authentication performance with EER = 2.01%, which was not possible before.
Reliability and validity of Arabic translation of Medication Adherence Report Scale (MARS) and Beliefs about Medication Questionnaire (BMQ)–specific for use in children and their parents

PubMed Central

Alsous, Mervat; Alhalaiqa, Fadwa; Abu Farha, Rana; Abdel Jalil, Mariam; McElnay, James; Horne, Robert

2017-01-01

Objectives to evaluate the reliability and discriminant validity of Arabic translation of the Medication Adherence Report Scale (MARS) and the Beliefs about Medication Questionnaire-specific (BMQ-specific). Methods Having developed Arabic translations of the study instruments, a cross-sectional study was carried out between March and October 2015 in two multidisciplinary governmental hospitals in Jordan. An expert panel monitored the forward and backward translation of the MARS and BMQ. Standard Arabic was used (with no specific dialect inclusion) to allow greater generalisability across Arabic speaking countries. Once the Arabic translations of the questionnaires were developed they were tested for consistency, validity and reliability on a group of children with chronic diseases and their parents. Results A total of 258 parents and 208 children were included in the study. The median age of participated children and parents was 15 years and 42 years respectively. Principle component analysis of all questionnaires indicated that all had good construct validity as they clearly measured one construct. The questionnaires were deemed reliable based on the results of Cronbach alpha coefficient. Furthermore, reliability of the questionnaires was demonstrated by test-retest intraclass correlation coefficients (ICC) which ranged from good to excellent for all scales (ICC>0.706). The Pearson correlation coefficient ranged from 0.546–0.805 for the entire sample which indicated a significant moderate to strong positive correlation between MARS and BMQ items at time 1 and 2. Reported adherence was greater than 59% using MARS-children and MARS-parents scales, and was correlated with beliefs in necessity and independent of the concerns regarding medications. Conclusion The Arabic translations of both BMQ and MARS for use in children and their parents have good internal consistency and proved to be valid and reliable tools that can be used by researchers in clinical practice to measure adherence and beliefs about medications in Arabic speaking patient populations. PMID:28192467
Reliability and validity of Arabic translation of Medication Adherence Report Scale (MARS) and Beliefs about Medication Questionnaire (BMQ)-specific for use in children and their parents.

PubMed

Alsous, Mervat; Alhalaiqa, Fadwa; Abu Farha, Rana; Abdel Jalil, Mariam; McElnay, James; Horne, Robert

2017-01-01

to evaluate the reliability and discriminant validity of Arabic translation of the Medication Adherence Report Scale (MARS) and the Beliefs about Medication Questionnaire-specific (BMQ-specific). Having developed Arabic translations of the study instruments, a cross-sectional study was carried out between March and October 2015 in two multidisciplinary governmental hospitals in Jordan. An expert panel monitored the forward and backward translation of the MARS and BMQ. Standard Arabic was used (with no specific dialect inclusion) to allow greater generalisability across Arabic speaking countries. Once the Arabic translations of the questionnaires were developed they were tested for consistency, validity and reliability on a group of children with chronic diseases and their parents. A total of 258 parents and 208 children were included in the study. The median age of participated children and parents was 15 years and 42 years respectively. Principle component analysis of all questionnaires indicated that all had good construct validity as they clearly measured one construct. The questionnaires were deemed reliable based on the results of Cronbach alpha coefficient. Furthermore, reliability of the questionnaires was demonstrated by test-retest intraclass correlation coefficients (ICC) which ranged from good to excellent for all scales (ICC>0.706). The Pearson correlation coefficient ranged from 0.546-0.805 for the entire sample which indicated a significant moderate to strong positive correlation between MARS and BMQ items at time 1 and 2. Reported adherence was greater than 59% using MARS-children and MARS-parents scales, and was correlated with beliefs in necessity and independent of the concerns regarding medications. The Arabic translations of both BMQ and MARS for use in children and their parents have good internal consistency and proved to be valid and reliable tools that can be used by researchers in clinical practice to measure adherence and beliefs about medications in Arabic speaking patient populations.
Psychometric evaluation of the Arabic version of the multidimensional assessment of fatigue scale (MAF) for use in patients with ankylosing spondylitis.

PubMed

Bahouq, Hanane; Rostom, Samira; Bahiri, Rachid; Hakkou, Jinane; Aissaoui, Nawal; Hajjaj-Hassouni, Najia

2012-12-01

Fatigue is a frequent symptom during ankylosing spondylitis (AS) often under estimated which needs to be measured properly with respect to its intensity by appropriate measures, such as the multidimensional assessment of fatigue (MAF). The aims of this study were to translate into the classic Arabic version of the MAF questionnaire and to validate its use for assessing fatigue in Moroccan patients with AS. The MAF contains 16 items with a global fatigue index (IGF). The MAF was translated and back-translated to arabic, pretested and reviewed by a committee following the Guillemin criteria (J Clin Epidemiol 46:1417-1432, 1993). It was then validate on 110 Moroccan patients with AS. Reliability for the 3-day test-retest was assessed using internal consistency by Cronbach's alpha coefficient and the intra-class correlation coefficient (ICC). External construct validity was assessed by correlation with pain, activity of disease and other keys variable. The reproducibility of the 15 items was satisfactory with a kappa statistics of agreement superior to 0.6. The ICC for IGF score reproducibility was good and reached 0.98 (IC 95%, 0.96-0.99). The internal consistency was at 0.991 with Cronbach's alpha coefficient. The construct validity showed a positive correlation between MAF and the axial (r = 0.34) and peripheral (r = 0.32) visual analogical scale, the Bath ankylosing spondylitis disease activity index (BASDAI) (r = 0.77), the first item of BASDAI (r = 0.85), the functional disability by the Bath ankylosing spondylitis functional index (r = 0.64), the erythrocyte sedimentation rate (r = 0.43) and the C reactive protein (r = 0.30) (for all P < 0.001). There was no statistical correlation between MAF and the other variables. The Arabic version of the MAF has good comprehensibility, internal consistency, reliability and validity for the evaluation of Arabic speaking patients with AS.
Development of a valid Simplified Chinese version of the International Hip Outcome Tool (SC-iHOT-33) in young patients having total hip arthroplasty.

PubMed

Li, D H; Wang, W; Li, X; Gao, Y L; Liu, D H; Liu, D L; Xu, W D

2017-01-01

The International Hip Outcome Tool (iHOT-33) is a questionnaire designed for young, active patients with hip disorders. It has proven to be a highly reliable and valid questionnaire. The main purpose of our study was to adapt the iHOT-33 questionnaire into simplified Chinese and to assess its psychometric properties in Chinese patients. The iHOT-33 was cross culturally adapted into Chinese and 138 patients completed the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC), the EuroQol-5D (EQ-5D), and the Chinese version of the iHOT-33(SC-iHOT-33) pre- or postoperatively within 6 months' follow-up. The Cronbach's alpha, intraclass correlation coefficient (ICC), Pearson's correlation coefficient (r), effect size (ES), and standardized response mean (SRM) were calculated to assess the reliability, validity, and responsiveness of the SC-iHOT-33, respectively. Total Cronbach's alpha was 0.965, which represented excellent internal consistency of the SC-iHOT-33. The ICC ranges from 0.866 to 0.929, which shows excellent test-retest reliability. The subscales of SC-iHOT-33 had the highest correlation coefficient (r = 0.812) with the physical function subscales of the WOMAC, as well as good correlation between the social/emotional subscale of the SC-iHOT-33 and the EQ-5D (r = 0.740, r = 0.743). No floor or ceiling effects were found. The ES and SRM values indicated good responsiveness of 2.44 and 2.67, respectively. The SC-iHOT-33 questionnaire is reliable, valid, and responsive for the evaluation of young, Chinese, active patients with hip disorders. Copyright © 2016 Osteoarthritis Research Society International. Published by Elsevier Ltd. All rights reserved.

Reliability and relationships among handgrip strength, leg extensor strength and power, and balance in older men.

PubMed

Jenkins, Nathaniel D M; Buckner, Samuel L; Bergstrom, Haley C; Cochrane, Kristen C; Goldsmith, Jacob A; Housh, Terry J; Johnson, Glen O; Schmidt, Richard J; Cramer, Joel T

2014-10-01

To quantify the reliability of isometric leg extension torque (LEMVC), rate of torque development (LERTD), isometric handgrip force (HGMVC) and RFD (HGRFD), isokinetic leg extension torque and power at 1.05rad·s(-1) and 3.14rad·s(-1); and explore relationships among strength, power, and balance in older men. Sixteen older men completed 3 isometric handgrips, 3 isometric leg extensions, and 3 isokinetic leg extensions at 1.05rad·s(-1) and 3.14rad·s(-1) during two visits. Intraclass correlation coefficients (ICCs), ICC confidence intervals (95% CI), coefficients of variation (CVs), and Pearson correlation coefficients were calculated. LERTD demonstrated no reliability. The CVs for LERTD and HGRFD were ≤23.26%. HGMVC wasn't related to leg extension torque or power, or balance (r=0.14-0.47; p>0.05). However, moderate to strong relationships were found among isokinetic leg extension torque at 1.05rad·s(-1) and 3.14rad·s(-1), leg extension mean power at 1.05rad·s(-1), and functional reach (r=0.51-0.95; p≤0.05). LERTD and HGRFD weren't reliable and shouldn't be used as outcome variables in older men. Handgrip strength may not be an appropriate surrogate for lower body strength, power, or balance. Instead, perhaps handgrip strength should only be used to describe upper body strength or functionality, which may compliment isokinetic assessments of lower body strength, which were reliable and related to balance. Copyright © 2014 Elsevier Inc. All rights reserved.
Rater reliability and concurrent validity of the Keyboard Personal Computer Style instrument (K-PeCS).

PubMed

Baker, Nancy A; Cook, James R; Redfern, Mark S

2009-01-01

This paper describes the inter-rater and intra-rater reliability, and the concurrent validity of an observational instrument, the Keyboard Personal Computer Style instrument (K-PeCS), which assesses stereotypical postures and movements associated with computer keyboard use. Three trained raters independently rated the video clips of 45 computer keyboard users to ascertain inter-rater reliability, and then re-rated a sub-sample of 15 video clips to ascertain intra-rater reliability. Concurrent validity was assessed by comparing the ratings obtained using the K-PeCS to scores developed from a 3D motion analysis system. The overall K-PeCS had excellent reliability [inter-rater: intra-class correlation coefficients (ICC)=.90; intra-rater: ICC=.92]. Most individual items on the K-PeCS had from good to excellent reliability, although six items fell below ICC=.75. Those K-PeCS items that were assessed for concurrent validity compared favorably to the motion analysis data for all but two items. These results suggest that most items on the K-PeCS can be used to reliably document computer keyboarding style.
Task performance in virtual environments used for cognitive rehabilitation after traumatic brain injury.

PubMed

Christiansen, C; Abreu, B; Ottenbacher, K; Huffman, K; Masel, B; Culpepper, R

1998-08-01

This report describes a reliability study using a prototype computer-simulated virtual environment to assess basic daily living skills in a sample of persons with traumatic brain injury (TBI). The benefits of using virtual reality in training for situations where safety is a factor have been established in defense and industry, but have not been demonstrated in rehabilitation. Thirty subjects with TBI receiving comprehensive rehabilitation services at a residential facility. An immersive virtual kitchen was developed in which a meal preparation task involving multiple steps could be performed. The prototype was tested using subjects who completed the task twice within 7 days. The stability of performance was estimated using intraclass correlation coefficients (ICCs). The ICC value for total performance based on all steps involved in the meal preparation task was .73. When three items with low variance were removed the ICC improved to .81. Little evidence of vestibular optical side-effects was noted in the subjects tested. Adequate initial reliability exists to continue development of the environment as an assessment and training prototype for persons with brain injury.
[Reliability of the PRISCUS-PAQ. Questionnaire to assess physical activity of persons aged 70 years and older].

PubMed

Trampisch, U; Platen, P; Burghaus, I; Moschny, A; Wilm, S; Thiem, U; Hinrichs, T

2010-12-01

A questionnaire (Q) to measure physical activity (PA) of persons ≥70 years for epidemiological research is lacking. The aim was to develop the PRISCUS-PAQ and test the reliability in community-dwelling people (≥70 years). Validated PA questionnaires were translated and adapted to design the PRISCUS-PAQ. Its test-retest reliability for 91 randomly selected people (36% men) aged 70-98 (76±5) years ranged from 0.47 (walking) to 0.82 (riding a bicycle). The overall activity score was 0.59 as determined by the intraclass correlation coefficient (ICC). Recording of general activities, e.g., housework (ICC=0.59), was in general less reliable than athletic activities, e.g., gymnastics (ICC=0.76). The PRISCUS-PAQ, which is a short instrument with acceptable reliability to collect the physical activity of the elderly in a telephone interview, will be used to collect data in a large cohort of older people in the German research consortium PRISCUS.
Validation of the Japanese version of the Pediatric Quality of Life Inventory (PedsQL) Cancer Module.

PubMed

Tsuji, Naoko; Kakee, Naoko; Ishida, Yasushi; Asami, Keiko; Tabuchi, Ken; Nakadate, Hisaya; Iwai, Tsuyako; Maeda, Miho; Okamura, Jun; Kazama, Takuro; Terao, Yoko; Ohyama, Wataru; Yuza, Yuki; Kaneko, Takashi; Manabe, Atsushi; Kobayashi, Kyoko; Kamibeppu, Kiyoko; Matsushima, Eisuke

2011-04-10

The PedsQL 3.0 Cancer Module is a widely used instrument to measure pediatric cancer specific health-related quality of life (HRQOL) for children aged 2 to 18 years. We developed the Japanese version of the PedsQL Cancer Module and investigated its reliability and validity among Japanese children and their parents. Participants were 212 children with cancer and 253 of their parents. Reliability was determined by internal consistency using Cronbach's coefficient alpha and test-retest reliability using intra-class correlation coefficient (ICC). Validity was assessed through factor validity, convergent and discriminant validity, concurrent validity, and clinical validity. Factor validity was examined by exploratory factor analysis. Convergent and discriminant validity were examined by multitrait scaling analysis. Concurrent validity was assessed using Spearman's correlation coefficients between the Cancer Module and Generic Core Scales, and the comparison of the scores of child self-reports with those of other self-rating depression scales for children. Clinical validity was assessed by comparing the on- and off- treatment scores using Kruskal-Wallis and Mann-Whitney U tests. Cronbach's coefficient alpha was over 0.70 for the total scale and over 0.60 for each subscale by age except for the 'pain and hurt' subscale for children aged 5 to 7 years. For test-retest reliability, the ICC exceeded 0.70 for the total scale for each age. Exploratory factor analysis demonstrated sufficient factorial validity. Multitrait scaling analysis showed high success rates. Strong correlations were found between the reports by children and their parents, and the scores of the Cancer Module and the Generic Core Scales except for 'treatment anxiety' subscales for child reports. The Depression Self-Rating Scale for Children (DSRS-C) scores were significantly correlated with emotional domains and the total score of the cancer module. Children who had been off treatment over 12 months demonstrated significantly higher scores than those on treatment. The results demonstrate the reliability and validity of the Japanese version of the PedsQL Cancer Module among Japanese children.
Self-administered physical activity questionnaires for the elderly: a systematic review of measurement properties.

PubMed

Forsén, Lisa; Loland, Nina Waaler; Vuillemin, Anne; Chinapaw, Mai J M; van Poppel, Mireille N M; Mokkink, Lidwine B; van Mechelen, Willem; Terwee, Caroline B

2010-07-01

To systematically review and appraise studies examining self-administered physical activity questionnaires (PAQ) for the elderly. This article is one of a group of four articles in Sports Medicine on the content and measurement properties of PAQs. LITERATURE SEARCH METHODOLOGY: Searches in PubMed, EMBASE and SportDiscu (until May 2009) on self-administered PAQ. Inclusion criteria were as follows: (i) the study examined (at least one of) the measurement properties of a self-administered PAQ; (ii) the questionnaire aimed to measure physical activity (PA) in older people; (iii) the average age of the study population was >55 years; (iv) the article was written in English. We excluded PA interviews, diaries and studies that evaluated the measurement properties of a self-administered PAQ in a specific population, such as patients. We used a standard checklist (qualitative attributes and measurement properties of PA questionnaires [QAPAQ]) for appraising the measurement properties of PAQs. Eighteen articles on 13 PAQs were reviewed, including 16 reliability analyses and 25 validity analyses (of which 15 were on construct validity, seven on health/functioning associations, two on known-groups validity and one on responsiveness). Many studies suffered from methodological flaws, e.g. too small sample size or inadequate time interval between test and retest. Three PAQs received a positive rating on reliability: IPAQ-C (International Physical Activity Questionnaire-Chinese), intraclass correlation coefficient (ICC) > or = 0.81; WHI-PAQ (Women's Health Initiative-PAQ), ICC = 0.76; and PASE (Physical Activity Scale for the Elderly), Pearson correlation coefficient (r) = 0.84. However, PASE was negatively rated on reliability in another study (ICC = 0.65). One PAQ received a positive rating on construct validity: PASE against Mini-Logger (r > 0.52), but PASE was negatively rated in another study against accelerometer and another PAQ, Spearman correlation coefficient = 0.17 and 0.48, respectively. Three of the 13 PAQs were tested for health/functioning associations and all three were positively rated in some categories of PA in many studies (r > 0.30). Even though several studies showed an association between the tested PAQ and health/functioning variables, the knowledge about reliability and construct validity of self-administrated PAQs for older adults is still scarce and more high-quality validation studies are needed.
Validity of an activity monitor in young people with cerebral palsy gross motor function classification system level I.

PubMed

O' Donoghue, Deirdre; Kennedy, Norelee

2014-11-01

The activPAL™ activity monitor has potential for use in youth with Cerebral Palsy (CP) as it has demonstrated acceptable validity for the assessment of sedentary and physical activity in other populations. This study determined the validity of the activPAL™ activity monitor for the measurement of sitting, standing, walking time, transitions and step count for both legs in young people with hemiplegic and asymmetric diplegic CP. Seventeen participants with CP Gross Motor Function Classification System level I completed two video recorded test protocols that involved wearing an activPAL™ activity monitor on alternate legs. Agreement between observed video recorded data and activPAL™ activity monitor data was assessed using the Bland and Altman (BA) method and intraclass correlation coefficients (ICC 3,1). There was perfect agreement for transitions and high agreement for sitting (BA mean differences (MD): -1.8 and -1.8 s; ICCs: 0.49 and 0.95) standing (MD: 0.8 and 0.1 s; ICCs: 0.59 and 0.98) walking (MD: 1 and 1.1 s; ICCs: 0.99 and 0.94) timings and low agreement for step count (MD: 4.1 and 2.8 steps; ICCs: 0.96 and 0.95) for both legs. This study found clinically acceptable agreement with direct observation for all activPAL™ activity monitor functions, except for step count measurement with respect to the range of measurement values obtained for both legs in this study population.
Focused physician-performed echocardiography in sports medicine: a potential screening tool for detecting aortic root dilatation in athletes.

PubMed

Yim, Eugene S; Kao, Daniel; Gillis, Edward F; Basilico, Frederick C; Corrado, Gianmichael D

2013-12-01

The purpose of this study was to investigate whether sports medicine physicians can obtain accurate measurements of the aortic root in young athletes. Twenty male collegiate athletes, aged 18 to 21 years, were prospectively enrolled. Focused echocardiography was performed by a board-certified sports medicine physician and a medical student, followed by comprehensive echocardiography within 2 weeks by a cardiac sonographer. A left parasternal long-axis view was acquired to measure the aortic root diameter at the sinuses of Valsalva. Intraclass correlation coefficients (ICCs) were used to assess inter-rater reliability compared to a reference standard and intra-rater reliability of repeated measurements obtained by the sports medicine physician and medical student. The ICCs between the sports medicine physician and cardiac sonographer and between the medical student and cardiac sonographer were strong: 0.80 and 0.76, respectively. Across all 3 readers, the ICC was 0.89, indicating strong inter-rater reliability and concordance. The ICC for the 2 measurements taken by the sports medicine physician for each athlete was 0.75, indicating strong intra-rater reliability. The medical student had moderate intra-rater reliability, with an ICC of 0.59. Sports medicine physicians are able to obtain measurements of the aortic root by focused echocardiography that are consistent with those obtained by a cardiac sonographer. Focused physician-performed echocardiography may serve as a promising technique for detecting aortic root dilatation and may contribute in this manner to preparticipation cardiovascular screening for athletes.
Web-based questionnaires to assess perinatal outcome proved to be valid.

PubMed

van Gelder, Marleen M H J; Vorstenbosch, Saskia; Derks, Lineke; Te Winkel, Bernke; van Puijenbroek, Eugène P; Roeleveld, Nel

2017-10-01

The objective of this study was to validate a Web-based questionnaire completed by the mother to assess perinatal outcome used in a prospective cohort study. For 882 women with an estimated date of delivery between February 2012 and February 2015 who participated in the PRegnancy and Infant DEvelopment (PRIDE) Study, we compared data on pregnancy outcome, including mode of delivery, plurality, gestational age, birth weight and length, head circumference, birth defects, and infant sex, from Web-based questionnaires administered to the mothers 2 months after delivery with data from obstetric records. For continuous variables, we calculated intraclass correlation coefficients (ICCs) with 95% confidence intervals (CIs), whereas sensitivity and specificity were determined for categorical variables. We observed only very small differences between the two methods of data collection for gestational age (ICC, 0.91; 95% CI, 0.90-0.92), birth weight (ICC, 0.96; 95% CI, 0.95-0.96), birth length (ICC, 0.90; 95% CI, 0.87-0.92), and head circumference (ICC, 0.88; 95% CI, 0.80-0.93). Agreement between the Web-based questionnaire and obstetric records was high as well, with sensitivity ranging between 0.86 (termination of pregnancy) and 1.00 (four outcomes) and specificity between 0.96 (term birth) and 1.00 (nine outcomes). Our study provides evidence that Web-based questionnaires could be considered as a valid complementary or alternative method of data collection. Copyright © 2017 Elsevier Inc. All rights reserved.
Repeatability and Reproducibility of Retinal Nerve Fiber Layer Parameters Measured by Scanning Laser Polarimetry with Enhanced Corneal Compensation in Normal and Glaucomatous Eyes

PubMed Central

Ara, Mirian; Pajarin, Ana B.

2015-01-01

Objective. To assess the intrasession repeatability and intersession reproducibility of peripapillary retinal nerve fiber layer (RNFL) thickness parameters measured by scanning laser polarimetry (SLP) with enhanced corneal compensation (ECC) in healthy and glaucomatous eyes. Methods. One randomly selected eye of 82 healthy individuals and 60 glaucoma subjects was evaluated. Three scans were acquired during the first visit to evaluate intravisit repeatability. A different operator obtained two additional scans within 2 months after the first session to determine intervisit reproducibility. The intraclass correlation coefficient (ICC), coefficient of variation (COV), and test-retest variability (TRT) were calculated for all SLP parameters in both groups. Results. ICCs ranged from 0.920 to 0.982 for intravisit measurements and from 0.910 to 0.978 for intervisit measurements. The temporal-superior-nasal-inferior-temporal (TSNIT) average was the highest (0.967 and 0.946) in normal eyes, while nerve fiber indicator (NFI; 0.982) and inferior average (0.978) yielded the best ICC in glaucomatous eyes for intravisit and intervisit measurements, respectively. All COVs were under 10% in both groups, except NFI. TSNIT average had the lowest COV (2.43%) in either type of measurement. Intervisit TRT ranged from 6.48 to 12.84. Conclusions. The reproducibility of peripapillary RNFL measurements obtained with SLP-ECC was excellent, indicating that SLP-ECC is sufficiently accurate for monitoring glaucoma progression. PMID:26185762
Reproducibility of peripapillary retinal nerve fiber layer thickness with spectral domain cirrus high-definition optical coherence tomography in normal eyes.

PubMed

Hong, Samin; Kim, Chan Yun; Lee, Won Seok; Seong, Gong Je

2010-01-01

To assess the reproducibility of the new spectral domain Cirrus high-definition optical coherence tomography (HD-OCT; Carl Zeiss Meditec, Dublin, CA, USA) for analysis of peripapillary retinal nerve fiber layer (RNFL) thickness in healthy eyes. Thirty healthy Korean volunteers were enrolled. Three optic disc cube 200 x 200 Cirrus HD-OCT scans were taken on the same day in discontinuous sessions by the same operator without using the repeat scan function. The reproducibility of the calculated RNFL thickness and probability code were determined by the intraclass correlation coefficient (ICC), coefficient of variation (CV), test-retest variability, and Fleiss' generalized kappa (kappa). Thirty-six eyes were analyzed. For average RNFL thickness, the ICC was 0.970, CV was 2.38%, and test-retest variability was 4.5 microm. For all quadrants except the nasal, ICCs were 0.972 or higher and CVs were 4.26% or less. Overall test-retest variability ranged from 5.8 to 8.1 microm. The kappa value of probability codes for average RNFL thickness was 0.690. The kappa values of quadrants and clock-hour sectors were lower in the nasal areas than in other areas. The reproducibility of Cirrus HD-OCT to analyze peripapillary RNFL thickness in healthy eyes was excellent compared with the previous reports for time domain Stratus OCT. For the calculated RNFL thickness and probability code, variability was relatively higher in the nasal area, and more careful analyses are needed.
A Correction Equation for Jump Height Measured Using the Just Jump System.

PubMed

McMahon, John J; Jones, Paul A; Comfort, Paul

2016-05-01

To determine the concurrent validity and reliability of the popular Just Jump system (JJS) for determining jump height and, if necessary, provide a correction equation for future reference. Eighteen male college athletes performed 3 bilateral countermovement jumps (CMJs) on 2 JJSs (alternative method) that were placed on top of a force platform (criterion method). Two JJSs were used to establish consistency between systems. Jump height was calculated from flight time obtained from the JJS and force platform. Intraclass correlation coefficients (ICCs) demonstrated excellent within-session reliability of the CMJ height measurement derived from both the JJS (ICC = .96, P < .001) and the force platform (ICC = .96, P < .001). Dependent t tests revealed that the JJS yielded a significantly greater CMJ jump height (0.46 ± 0.09 m vs 0.33 ± 0.08 m) than the force platform (P < .001, Cohen d = 1.39, power = 1.00). There was, however, an excellent relationship between CMJ heights derived from the JJS and force platform (r = .998, P < .001, power = 1.00), with a coefficient of determination (R2) of .995. Therefore, the following correction equation was produced: Criterion jump height = (0.8747 × alternative jump height) - 0.0666. The JJS provides a reliable but overestimated measure of jump height. It is suggested, therefore, that practitioners who use the JJS as part of future work apply the correction equation presented in this study to resultant jump-height values.
A simple video-based timing system for on-ice team testing in ice hockey: a technical report.

PubMed

Larson, David P; Noonan, Benjamin C

2014-09-01

The purpose of this study was to describe and evaluate a newly developed on-ice timing system for team evaluation in the sport of ice hockey. We hypothesized that this new, simple, inexpensive, timing system would prove to be highly accurate and reliable. Six adult subjects (age 30.4 ± 6.2 years) performed on ice tests of acceleration and conditioning. The performance times of the subjects were recorded using a handheld stopwatch, photocell, and high-speed (240 frames per second) video. These results were then compared to allow for accuracy calculations of the stopwatch and video as compared with filtered photocell timing that was used as the "gold standard." Accuracy was evaluated using maximal differences, typical error/coefficient of variation (CV), and intraclass correlation coefficients (ICCs) between the timing methods. The reliability of the video method was evaluated using the same variables in a test-retest analysis both within and between evaluators. The video timing method proved to be both highly accurate (ICC: 0.96-0.99 and CV: 0.1-0.6% as compared with the photocell method) and reliable (ICC and CV within and between evaluators: 0.99 and 0.08%, respectively). This video-based timing method provides a very rapid means of collecting a high volume of very accurate and reliable on-ice measures of skating speed and conditioning, and can easily be adapted to other testing surfaces and parameters.
Intraday and Interday Reliability of Ultra-Short-Term Heart Rate Variability in Rugby Union Players.

PubMed

Nakamura, Fábio Y; Pereira, Lucas A; Esco, Michael R; Flatt, Andrew A; Moraes, José E; Cal Abad, Cesar C; Loturco, Irineu

2017-02-01

Nakamura, FY, Pereira, LA, Esco, MR, Flatt, AA, Moraes, JE, Cal Abad, CC, and Loturco, I. Intraday and interday reliability of ultra-short-term heart rate variability in rugby union players. J Strength Cond Res 31(2): 548-551, 2017-The aim of this study was to examine the intraday and interday reliability of ultra-short-term vagal-related heart rate variability (HRV) in elite rugby union players. Forty players from the Brazilian National Rugby Team volunteered to participate in this study. The natural log of the root mean square of successive RR interval differences (lnRMSSD) assessments were performed on 4 different days. The HRV was assessed twice (intraday reliability) on the first day and once per day on the following 3 days (interday reliability). The RR interval recordings were obtained from 2-minute recordings using a portable heart rate monitor. The relative reliability of intraday and interday lnRMSSD measures was analyzed using the intraclass correlation coefficient (ICC). The typical error of measurement (absolute reliability) of intraday and interday lnRMSSD assessments was analyzed using the coefficient of variation (CV). Both intraday (ICC = 0.96; CV = 3.99%) and interday (ICC = 0.90; CV = 7.65%) measures were highly reliable. The ultra-short-term lnRMSSD is a consistent measure for evaluating elite rugby union players, in both intraday and interday settings. This study provides further validity to using this shortened method in practical field conditions with highly trained team sports athletes.
Design and Reproducibility of a Mini-Survey to Evaluate the Quality of Food Intake (Mini-ECCA) in a Mexican Population

PubMed Central

González-Gómez, Montserrat; Orozco-Gutiérrez, Jaime Fernando; Prado-Arriaga, Ruth Jackelyne; Márquez-Sandoval, Fabiola; Altamirano-Martínez, Martha Betzaida

2018-01-01

Evaluating food intake quality may contribute to the development of nutrition programs. In Mexico, there are no screening tools that can be administered quickly for the evaluation of this variable. The aim was to determine the reproducibility of a mini-survey designed to evaluate the quality of food intake (Mini-ECCA) in a Mexican population. Mini-ECCA consists of 12 questions that are based on Mexican and international recommendations for food and non-alcoholic beverage intake, with the support of photographs for food quantity estimation. Each question scores as 0 (unhealthy) or 1 (healthy), and the final score undergoes a classification procedure. Through the framework of a nutritional study, 152 employees of the municipal water company in Guadalajara, Mexico (April–August 2016), were invited to participate. The survey was administered in two rounds (test and retest) with a 15-day interval between them. We calculated the Spearman correlation coefficient, the intra-class correlation coefficient (ICC), and weighted kappa for score classification agreement (SPSS versus 14 p < 0.05 was considered statistically significant). The survey obtained a “good” reproducibility (ρ = 0.713, p < 0.001), and an excellent concordance (ICC = 0.841 Confidence Interval 95% 0.779, 0.885). It can thus be said that the Mini-ECCA displayed acceptable reproducibility and is suitable for the purpose of dietary assessment and guidance. PMID:29690618
Design and Reproducibility of a Mini-Survey to Evaluate the Quality of Food Intake (Mini-ECCA) in a Mexican Population.

PubMed

Bernal-Orozco, María Fernanda; Badillo-Camacho, Nayeli; Macedo-Ojeda, Gabriela; González-Gómez, Montserrat; Orozco-Gutiérrez, Jaime Fernando; Prado-Arriaga, Ruth Jackelyne; Márquez-Sandoval, Fabiola; Altamirano-Martínez, Martha Betzaida; Vizmanos, Barbara

2018-04-23

Evaluating food intake quality may contribute to the development of nutrition programs. In Mexico, there are no screening tools that can be administered quickly for the evaluation of this variable. The aim was to determine the reproducibility of a mini-survey designed to evaluate the quality of food intake (Mini-ECCA) in a Mexican population. Mini-ECCA consists of 12 questions that are based on Mexican and international recommendations for food and non-alcoholic beverage intake, with the support of photographs for food quantity estimation. Each question scores as 0 (unhealthy) or 1 (healthy), and the final score undergoes a classification procedure. Through the framework of a nutritional study, 152 employees of the municipal water company in Guadalajara, Mexico (April⁻August 2016), were invited to participate. The survey was administered in two rounds (test and retest) with a 15-day interval between them. We calculated the Spearman correlation coefficient, the intra-class correlation coefficient (ICC), and weighted kappa for score classification agreement (SPSS versus 14 p < 0.05 was considered statistically significant). The survey obtained a “good” reproducibility (ρ = 0.713, p < 0.001), and an excellent concordance (ICC = 0.841 Confidence Interval 95% 0.779, 0.885). It can thus be said that the Mini-ECCA displayed acceptable reproducibility and is suitable for the purpose of dietary assessment and guidance.
Accuracy of the HumaSensplus point-of-care uric acid meter using capillary blood obtained by fingertip puncture.

PubMed

Fabre, Stéphanie; Clerson, Pierre; Launay, Jean-Marie; Gautier, Jean-François; Vidal-Trecan, Tiphaine; Riveline, Jean-Pierre; Platt, Adam; Abrahamsson, Anna; Miner, Jeffrey N; Hughes, Glen; Richette, Pascal; Bardin, Thomas

2018-05-02

The uric acid (UA) level in patients with gout is a key factor in disease management and is typically measured in the laboratory using plasma samples obtained after venous puncture. This study aimed to assess the reliability of immediate UA measurement with capillary blood samples obtained by fingertip puncture with the HumaSens plus point-of-care meter. UA levels were measured using both the HumaSens plus meter in the clinic and the routine plasma UA method in the biochemistry laboratory of 238 consenting diabetic patients. HumaSens plus capillary and routine plasma UA measurements were compared by linear regression, Bland-Altman plots, intraclass correlation coefficient (ICC), and Lin's concordance coefficient. Values outside the dynamic range of the meter, low (LO) or high (HI), were analyzed separately. The best capillary UA thresholds for detecting hyperuricemia were determined by receiver operating characteristic (ROC) curves. The impact of potential confounding factors (demographic and biological parameters/treatments) was assessed. Capillary and routine plasma UA levels were compared to reference plasma UA measurements by liquid chromatography-mass spectrometry (LC-MS) for a subgroup of 67 patients. In total, 205 patients had capillary and routine plasma UA measurements available. ICC was 0.90 (95% confidence interval (CI) 0.87-0.92), Lin's coefficient was 0.91 (0.88-0.93), and the Bland-Altman plot showed good agreement over all tested values. Overall, 17 patients showed values outside the dynamic range. LO values were concordant with plasma values, but HI values were considered uninterpretable. Capillary UA thresholds of 299 and 340 μmol/l gave the best results for detecting hyperuricemia (corresponding to routine plasma UA thresholds of 300 and 360 μmol/l, respectively). No significant confounding factor was found among those tested, except for hematocrit; however, this had a negligible influence on the assay reliability. When capillary and routine plasma results were discordant, comparison with LC-MS measurements showed that plasma measurements had better concordance: capillary UA, ICC 0.84 (95% CI 0.75-0.90), Lin's coefficient 0.84 (0.77-0.91); plasma UA, ICC 0.96 (0.94-0.98), Lin's coefficient 0.96 (0.94-0.98). UA measurements with the HumaSens plus meter were reasonably comparable with those of the laboratory assay. The meter is easy to use and may be useful in the clinic and in epidemiologic studies.
A creatinine biosensor based on admittance measurement

NASA Astrophysics Data System (ADS)

Ching, Congo Tak-Shing; Sun, Tai-Ping; Jheng, Deng-Yun; Tsai, Hou-Wei; Shieh, Hsiu-Li

2015-08-01

Regular check of blood creatinine level is very important as it is a measurement of renal function. Therefore, the objective of this study is to develop a simple and reliable creatinine biosensor based on admittance measurement for precise determination of creatinine. The creatinine biosensor was fabricated with creatinine deiminase immobilized on screen-printed carbon electrodes. Admittance measurement at a specific frequency ranges (22.80 - 84.71 Hz) showed that the biosensor has an excellent linear (r2 > 0.95) response range (50 - 250 uM), which covers the normal physiological and pathological ranges of blood creatinine levels. Intraclass correlation coefficient (ICC) showed that the biosensor has excellent reliability and validity (ICC = 0.98). In conclusion, a simple and reliable creatinine biosensor was developed and it is capable of precisely determining blood creatinine levels in both the normal physiological and pathological ranges.
Improved estimation of subject-level functional connectivity using full and partial correlation with empirical Bayes shrinkage.

PubMed

Mejia, Amanda F; Nebel, Mary Beth; Barber, Anita D; Choe, Ann S; Pekar, James J; Caffo, Brian S; Lindquist, Martin A

2018-05-15

Reliability of subject-level resting-state functional connectivity (FC) is determined in part by the statistical techniques employed in its estimation. Methods that pool information across subjects to inform estimation of subject-level effects (e.g., Bayesian approaches) have been shown to enhance reliability of subject-level FC. However, fully Bayesian approaches are computationally demanding, while empirical Bayesian approaches typically rely on using repeated measures to estimate the variance components in the model. Here, we avoid the need for repeated measures by proposing a novel measurement error model for FC describing the different sources of variance and error, which we use to perform empirical Bayes shrinkage of subject-level FC towards the group average. In addition, since the traditional intra-class correlation coefficient (ICC) is inappropriate for biased estimates, we propose a new reliability measure denoted the mean squared error intra-class correlation coefficient (ICC MSE ) to properly assess the reliability of the resulting (biased) estimates. We apply the proposed techniques to test-retest resting-state fMRI data on 461 subjects from the Human Connectome Project to estimate connectivity between 100 regions identified through independent components analysis (ICA). We consider both correlation and partial correlation as the measure of FC and assess the benefit of shrinkage for each measure, as well as the effects of scan duration. We find that shrinkage estimates of subject-level FC exhibit substantially greater reliability than traditional estimates across various scan durations, even for the most reliable connections and regardless of connectivity measure. Additionally, we find partial correlation reliability to be highly sensitive to the choice of penalty term, and to be generally worse than that of full correlations except for certain connections and a narrow range of penalty values. This suggests that the penalty needs to be chosen carefully when using partial correlations. Copyright © 2018. Published by Elsevier Inc.
Short-term test-retest-reliability of conditioned pain modulation using the cold-heat-pain method in healthy subjects and its correlation to parameters of standardized quantitative sensory testing.

PubMed

Gehling, Julia; Mainka, Tina; Vollert, Jan; Pogatzki-Zahn, Esther M; Maier, Christoph; Enax-Krumova, Elena K

2016-08-05

Conditioned Pain Modulation (CPM) is often used to assess human descending pain inhibition. Nine different studies on the test-retest-reliability of different CPM paradigms have been published, but none of them has investigated the commonly used heat-cold-pain method. The results vary widely and therefore, reliability measures cannot be extrapolated from one CPM paradigm to another. Aim of the present study was to analyse the test-retest-reliability of the common heat-cold-pain method and its correlation to pain thresholds. We tested the short-term test-retest-reliability within 40 ± 19.9 h using a cold-water immersion (10 °C, left hand) as conditioning stimulus (CS) and heat pain (43-49 °C, pain intensity 60 ± 5 on the 101-point numeric rating scale, right forearm) as test stimulus (TS) in 25 healthy right-handed subjects (12females, 31.6 ± 14.1 years). The TS was applied 30s before (TSbefore), during (TSduring) and after (TSafter) the 60s CS. The difference between the pain ratings for TSbefore and TSduring represents the early CPM-effect, between TSbefore and TSafter the late CPM-effect. Quantitative sensory testing (QST, DFNS protocol) was performed on both sessions before the CPM assessment. paired t-tests, Intraclass correlation coefficient (ICC), standard error of measurement (SEM), smallest real difference (SRD), Pearson's correlation, Bland-Altman analysis, significance level p < 0.05 with Bonferroni correction for multiple comparisons, when necessary. Pain ratings during CPM correlated significantly (ICC: 0.411…0.962) between both days, though ratings for TSafter were lower on day 2 (p < 0.005). The early (day 1: 16.7 ± 11.7; day 2: 19.5 ± 11.9; ICC: 0.618, SRD: 20.2) and late (day 1: 1.7 ± 9.2; day 2: 7.6 ± 11.5; ICC: 0.178, SRD: 27.0) CPM effect did not differ significantly between both days. Both early and late CPM-effects did not correlate with the pain thresholds. The short-term test-retest-reliability of the early CPM-effect using the heat-cold-pain method in healthy subjects achieved satisfying results in terms of the ICC. The SRD of the early CPM effect showed that an individual change of > 20 NRS can be attributed to a real change rather than chance. The late CPM-effect was weaker and not reliable.

Acetate templating on digital images is more accurate than computer-based templating for total hip arthroplasty.

PubMed

Petretta, Robert; Strelzow, Jason; Ohly, Nicholas E; Misur, Peter; Masri, Bassam A

2015-12-01

Templating is an important aspect of preoperative planning for total hip arthroplasty and can help determine the size and positioning of the prosthesis. Historically, templating has been performed using acetate templates over printed radiographs. As a result of the increasing use of digital imaging, surgeons now either obtain additional printed radiographs solely for templating purposes or use specialized digital templating software, both of which carry additional cost. The purposes of this study was to compare acetate templating of digitally calibrated images on an LCD monitor to digital templating in terms of (1) accuracy; (2) reproducibility; and (3) time efficiency. Acetate onlay templating was performed directly over digital radiographs on an LCD monitor and was compared with digital templating. Five separate observers participated in this study templating on 52 total hip arthroplasties. For the acetate templating, the digital images were magnified to the scaled reference on the templates provided by the manufacturer (ratio 1.2:1) before templating using a 25-mm marker as a reference. Both the acetate and digital templating results were then compared with the actual implanted components to determine accuracy. Interobserver and intraobserver variability was determined by an intraclass correlation coefficient. Observers recorded time to complete templating from the time of complete upload of patients' imaging onto the system to completion of templating. Both acetate and digital templates demonstrated moderate accuracy in predicting within one size of the eventual implanted acetabular cup (77% [199 of 260]; 70% [181 of 260], respectively; p = 0.050; 95% confidence interval [CI], 0.058-0.32), whereas acetate templating was better at predicting the femoral stem compared to digital templating (75% [195 of 260]; 60% [155 of 260], respectively; p < 0.001; 95% CI, 0.084-0.32). Acetate templating showed moderate to substantial interobserver agreement (cup intraclass correlation coefficient [ICC] = 0.55; 95% CI, 0.14-0.86; femoral ICC = 0.75; 95% CI, 0.39-0.95) and both methods showed almost perfect intraobserver agreement in reproducibility (acetate cup ICC = 0.82; 95% CI, 0.66-0.97; acetate femoral ICC = 0.86; 95% CI, 0.74-0.97; digital cup ICC = 0.82; 95% CI, 0.68-0.97; digital femoral ICC = 0.88; 95% CI, 0.77-1.0). Acetate templating could be performed more quickly (acetate mean 119 seconds; range, 37-220 seconds versus 154 seconds; range, 73-343 seconds; p < 0.001). Acetate onlay templating on digitally calibrated images can be a reliable substitute for digital templating using specialized software. It is quicker to perform and much less expensive. Hospitals and practices need not purchase expensive software, particularly at lower volume centers. Level III, diagnostic study.
Household and familial resemblance in risk factors for type 2 diabetes and related cardiometabolic diseases in rural Uganda: a cross-sectional community sample.

PubMed

Nielsen, Jannie; Bahendeka, Silver K; Whyte, Susan R; Meyrowitsch, Dan W; Bygbjerg, Ib C; Witte, Daniel R

2017-09-21

Prevention of type 2 diabetes (T2D) has been successfully established in randomised clinical trials. However, the best methods for the translation of this evidence into effective population-wide interventions remain unclear. To assess whether households could be a target for T2D prevention and screening, we investigated the resemblance of T2D risk factors at household level and by type of familial dyadic relationship in a rural Ugandan community. This cross-sectional household-based study included 437 individuals ≥13 years of age from 90 rural households in south-western Uganda. Resemblance in glycosylated haemoglobin (HbA1c), anthropometry, blood pressure, fitness status and sitting time were analysed using a general mixed model with random effects (by household or dyad) to calculate household intraclass correlation coefficients (ICCs) and dyadic regression coefficients. Logistic regression with household as a random effect was used to calculate the ORs for individuals having a condition or risk factor if another household member had the same condition. The strongest degree of household member resemblances in T2D risk factors was seen in relation to fitness status (ICC=0.24), HbA1c (ICC=0.18) and systolic blood pressure (ICC=0.11). Regarding dyadic resemblance, the highest standardised regression coefficient was seen in fitness status for spouses (0.54, 95% CI 0.32 to 0.76), parent-offspring (0.41, 95% CI 0.28 0.54) and siblings (0.41, 95% CI 0.25 to 0.57). Overall, parent-offspring and sibling pairs were the dyads with strongest resemblance, followed by spouses. The marked degree of resemblance in T2D risk factors at household level and between spouses, parent-offspring and sibling dyads suggest that shared behavioural and environmental factors may influence risk factor levels among cohabiting individuals, which point to the potential of the household setting for screening and prevention of T2D. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Wip1 is associated with tumorigenity and metastasis through MMP-2 in human intrahepatic cholangiocarcinoma

PubMed Central

Liu, Sulai; Jiang, Bo; Li, Hao; He, Zili; Lv, Pin; Peng, Chuang; Wang, Yonggang; Cheng, Wei; Xu, Zhengquan; Chen, Wei; Liu, Zhengkai; Zhang, Bao; Shen, Shengqian; Xiang, Shuanglin

2017-01-01

Wip1 has been shown to correlate with the metastasis/invasion of several tumors. This study was designed to investigate the clinical significance and biological function of Wip1 in intrahepatic cholangiocarcinoma (ICC). The expression of Wip1 was investigated in sixty human ICC biopsy samples by immunohistochemistry. Transient and stable knockdown of Wip1 in two human ICC cells (ICC-9810 and SSP25) were established using short hairpin RNA expression vector. Immunohistochemistry revealed that Wip1 was up-regulated in human ICC tissues (47/60, 78.3%). High levels of Wip1 in human ICC correlated with metastasis to the lymph metastasis (P=0.022). Genetic depletion of Wip1 in ICC cells resulted in significantly inhibited proliferation and invasion compared with controls. Most importantly, Wip1 down-regulation impaired tumor migration capacity of ICC cells in vivo. Subsequent investigations revealed that matrix metalloproteinase-2 (MMP-2) is an important target of Wip1. Consistently, in human ICC tissues, Wip1 level was positively correlated with MMP-2 expression. Taken together, our founding indicates that Wip1 may be a crucial regulator in the tumorigenicity and invasion of human ICC, Wip1 exerts its pro-invasion function at least in part through the MMP-2 signaling pathway, suggesting Wip1 as a potential therapeutic target for ICC. PMID:28915621
Comparability and repeatability of different methods of corneal astigmatism assessment.

PubMed

Ferreira, Tiago B; Ribeiro, Filomena J

2018-01-01

To assess the comparability and repeatability of keratometric and astigmatism values measured by four techniques: Orbscan IIz ® (Bausch and Lomb), Lenstar LS 900 ® (Haag-Streit), Cassini ® (i-Optics), and Total Cassini (anterior + posterior surface), in healthy volunteers. Fifteen healthy volunteers (30 eyes) were assessed by the four techniques. In each eye, three consecutive measures were performed by the same operator. Keratometric and astigmatism values were recorded. The intraclass correlation coefficient (ICC) was used to assess comparability and repeatability. Agreement between measurement techniques was evaluated with Bland-Altman plots. Comparability was high between all measurement techniques for minimum keratometry (K1), maximum keratometry (K2), astigmatism magnitude, and astigmatism axis, with ICC >0.900, except for astigmatism magnitude measured by Cassini compared to Lenstar (ICC =0.798) and Orbscan compared to Lenstar (ICC =0.810). However, there were some differences in the median values of K1 and K2 between measurement techniques, and the Bland-Altman plots showed a wide data spread for all variables, except for astigmatism magnitude measured by Cassini and Total Cassini. For J0 and J45, comparability was only high for J0 between Cassini and Orbscan. Repeatability was also high for all measurement techniques except for K2 (ICC =0.814) and J45 (ICC =0.621) measured by Cassini. All measurement techniques showed high comparability regarding K1, K2, and astigmatism axis. Although posterior corneal surface is known to influence these measurements, comparability was high between Cassini and Total Cassini regarding astigmatism magnitude and axis. However, the wide data spread suggests that none of these devices should be used interchangeably.
Reliability and validity of the Performance Recorder 1 for measuring isometric knee flexor and extensor strength.

PubMed

Neil, Sarah E; Myring, Alec; Peeters, Mon Jef; Pirie, Ian; Jacobs, Rachel; Hunt, Michael A; Garland, S Jayne; Campbell, Kristin L

2013-11-01

Muscular strength is a key parameter of rehabilitation programs and a strong predictor of functional capacity. Traditional methods to measure strength, such as manual muscle testing (MMT) and hand-held dynamometry (HHD), are limited by the strength and experience of the tester. The Performance Recorder 1 (PR1) is a strength assessment tool attached to resistance training equipment and may be a time- and cost-effective tool to measure strength in clinical practice that overcomes some limitations of MMT and HHD. However, reliability and validity of the PR1 have not been reported. Test-retest and inter-rater reliability was assessed using the PR1 in healthy adults (n = 15) during isometric knee flexion and extension. Criterion-related validity was assessed through comparison of values obtained from the PR1 and Biodex® isokinetic dynamometer. Test-retest reliability was excellent for peak knee flexion (intra-class correlation coefficient [ICC] of 0.96, 95% CI: 0.85, 0.99) and knee extension (ICC = 0.96, 95% CI: 0.87, 0.99). Inter-rater reliability was also excellent for peak knee flexion (ICC = 0.95, 95% CI: 0.85, 0.99) and peak knee extension (ICC = 0.97, 95% CI: 0.91, 0.99). Validity was moderate for peak knee flexion (ICC = 0.75, 95% CI: 0.38, 0.92) but poor for peak knee extension (ICC = 0.37, 95% CI: 0, 0.73). The PR1 provides a reliable measure of isometric knee flexor and extensor strength in healthy adults that could be used in the clinical setting, but absolute values may not be comparable to strength assessment by gold-standard measures.
Prospective patients rate practice factors: development of a questionnaire.

PubMed

St Louis, Brian Lingg; Firestone, Allen R; Johnston, William; Shanker, Shiva; Vig, Katherine W L

2011-02-01

The importance that prospective patients place on practice characteristics when choosing an orthodontic practice has not been extensively reported. The objective of this research was to develop a valid and reliable questionnaire to address the relative importance of orthodontic office and doctor characteristics for prospective patients or parents of child patients during the initial orthodontic office consultation. An initial questionnaire, based on published literature, was field-tested on 16 subjects to assess its validity. Based on the field test, the questionnaire was modified and tested for reliability by using a test-retest method. The questionnaire covered the following areas: doctor, office, staff, and finances. The reliability study included 2 groups of subjects: 12 consecutive prospective adult patients and 41 consecutive parents of prospective child patients. The questionnaires consisted of 43 and 50 questions for the adult patients and the parents of patients, respectively. The subjects rated the importance of practice characteristics in their selection of an orthodontic practice using a 100-mm visual analog scale anchored at "not important at all" and "most important." Reliability was analyzed by using the intraclass correlation coefficient (ICC). Summary scores of all 53 subjects showed excellent reliability (ICC, 0.88; range, 0.61-1.0). Summary scores of all 50 questions showed acceptable reliability (ICC, 0.70; range, 0.45-0.88). Twenty-one questions had excellent reliability (ICC, >.75), and 29 questions had fair-to-good reliability (ICC, 0.41-0.75). No questions showed poor reliability (ICC, <0.4). The pilot study data indicated that the overall reliability of the questionnaire is acceptable. Copyright © 2011 American Association of Orthodontists. Published by Mosby, Inc. All rights reserved.
Validity of a commercial wearable sleep tracker in adult insomnia disorder patients and good sleepers.

PubMed

Kang, Seung-Gul; Kang, Jae Myeong; Ko, Kwang-Pil; Park, Seon-Cheol; Mariani, Sara; Weng, Jia

2017-06-01

To compare the accuracy of the commercial Fitbit Flex device (FF) with polysomnography (PSG; the gold-standard method) in insomnia disorder patients and good sleepers. Participants wore an FF and actigraph while undergoing overnight PSG. Primary outcomes were intraclass correlation coefficients (ICCs) of the total sleep time (TST) and sleep efficiency (SE), and the frequency of clinically acceptable agreement between the FF in normal mode (FFN) and PSG. The sensitivity, specificity, and accuracy of detecting sleep epochs were compared among FFN, actigraphy, and PSG. The ICCs of the TST between FFN and PSG in the insomnia (ICC=0.886) and good-sleepers (ICC=0.974) groups were excellent, but the ICC of SE was only fair in both groups. The TST and SE were overestimated for FFN by 6.5min and 1.75%, respectively, in good sleepers, and by 32.9min and 7.9% in the insomnia group with respect to PSG. The frequency of acceptable agreement of FFN and PSG was significantly lower (p=0.006) for the insomnia group (39.4%) than for the good-sleepers group (82.4%). The sensitivity and accuracy of FFN in an epoch-by-epoch comparison with PSG was good and comparable to those of actigraphy, but the specificity was poor in both groups. The ICC of TST in the FFN-PSG comparison was excellent in both groups, and the frequency of agreement was high in good sleepers but significantly lower in insomnia patients. These limitations need to be considered when applying commercial sleep trackers for clinical and research purposes in insomnia. Copyright © 2017 Elsevier Inc. All rights reserved.
The Localized Scleroderma Skin Severity Index and Physician Global Assessment of Disease Activity: A Work in Progress Toward Development of Localized Scleroderma Outcome Measures

PubMed Central

ARKACHAISRI, THASCHAWEE; VILAIYUK, SOAMARAT; LI, SUZANNE; O’NEIL, KATHLEEN M.; POPE, ELENA; HIGGINS, GLORIA C.; PUNARO, MARILYNN; RABINOVICH, EGLA C.; ROSENKRANZ, MARGALIT; KIETZ, DANIEL A.; ROSEN, PAUL; SPALDING, STEVEN J.; HENNON, TERESA R.; TOROK, KATHRYN S.; CASSIDY, ELAINE; MEDSGER, THOMAS A.

2013-01-01

Objective To develop and evaluate a Localized Scleroderma (LS) Skin Severity Index (LoSSI) and global assessments’ clinimetric property and effect on quality of life (QOL). Methods A 3-phase study was conducted. The first phase involved 15 patients with LS and 14 examiners who assessed LoSSI [surface area (SA), erythema (ER), skin thickness (ST), and new lesion/extension (N/E)] twice for inter/intrarater reliability. Patient global assessment of disease severity (PtGA-S) and Children’s Dermatology Life Quality Index (CDLQI) were collected for intrarater reliability evaluation. The second phase was aimed to develop clinical determinants for physician global assessment of disease activity (PhysGA-A) and to assess its content validity. The third phase involved 2 examiners assessing LoSSI and PhysGA-A on 27 patients. Effect of training on improving reliability/validity and sensitivity to change of the LoSSI and PhysGA-A was determined. Results Interrater reliability was excellent for ER [intraclass correlation coefficient (ICC) 0.71], ST (ICC 0.70), LoSSI (ICC 0.80), and PhysGA-A (ICC 0.90) but poor for SA (ICC 0.35); thus, LoSSI was modified to mLoSSI. Examiners’ experience did not affect the scores, but training/practice improved reliability. Intrarater reliability was excellent for ER, ST, and LoSSI (Spearman’s rho = 0.71–0.89) and moderate for SA. PtGA-S and CDLQI showed good intrarater agreement (ICC 0.63 and 0.80). mLoSSI correlated moderately with PhysGA-A and PtGA-S. Both mLoSSI and PhysGA-A were sensitive to change following therapy. Conclusion mLoSSI and PhysGA-A are reliable and valid tools for assessing LS disease severity and show high sensitivity to detect change over time. These tools are feasible for use in routine clinical practice. They should be considered for inclusion in a core set of LS outcome measures for clinical trials. PMID:19833758
Interrater reliability of the mind map assessment rubric in a cohort of medical students.

PubMed

D'Antoni, Anthony V; Zipp, Genevieve Pinto; Olson, Valerie G

2009-04-28

Learning strategies are thinking tools that students can use to actively acquire information. Examples of learning strategies include mnemonics, charts, and maps. One strategy that may help students master the tsunami of information presented in medical school is the mind map learning strategy. Currently, there is no valid and reliable rubric to grade mind maps and this may contribute to their underutilization in medicine. Because concept maps and mind maps engage learners similarly at a metacognitive level, a valid and reliable concept map assessment scoring system was adapted to form the mind map assessment rubric (MMAR). The MMAR can assess mind map depth based upon concept-links, cross-links, hierarchies, examples, pictures, and colors. The purpose of this study was to examine interrater reliability of the MMAR. This exploratory study was conducted at a US medical school as part of a larger investigation on learning strategies. Sixty-six (N = 66) first-year medical students were given a 394-word text passage followed by a 30-minute presentation on mind mapping. After the presentation, subjects were again given the text passage and instructed to create mind maps based upon the passage. The mind maps were collected and independently scored using the MMAR by 3 examiners. Interrater reliability was measured using the intraclass correlation coefficient (ICC) statistic. Statistics were calculated using SPSS version 12.0 (Chicago, IL). Analysis of the mind maps revealed the following: concept-links ICC = .05 (95% CI, -.42 to .38), cross-links ICC = .58 (95% CI, .37 to .73), hierarchies ICC = .23 (95% CI, -.15 to .50), examples ICC = .53 (95% CI, .29 to .69), pictures ICC = .86 (95% CI, .79 to .91), colors ICC = .73 (95% CI, .59 to .82), and total score ICC = .86 (95% CI, .79 to .91). The high ICC value for total mind map score indicates strong MMAR interrater reliability. Pictures and colors demonstrated moderate to strong interrater reliability. We conclude that the MMAR may be a valid and reliable tool to assess mind maps in medicine. However, further research on the validity and reliability of the MMAR is necessary.
Interrater reliability of the mind map assessment rubric in a cohort of medical students

PubMed Central

D'Antoni, Anthony V; Zipp, Genevieve Pinto; Olson, Valerie G

2009-01-01

Background Learning strategies are thinking tools that students can use to actively acquire information. Examples of learning strategies include mnemonics, charts, and maps. One strategy that may help students master the tsunami of information presented in medical school is the mind map learning strategy. Currently, there is no valid and reliable rubric to grade mind maps and this may contribute to their underutilization in medicine. Because concept maps and mind maps engage learners similarly at a metacognitive level, a valid and reliable concept map assessment scoring system was adapted to form the mind map assessment rubric (MMAR). The MMAR can assess mind map depth based upon concept-links, cross-links, hierarchies, examples, pictures, and colors. The purpose of this study was to examine interrater reliability of the MMAR. Methods This exploratory study was conducted at a US medical school as part of a larger investigation on learning strategies. Sixty-six (N = 66) first-year medical students were given a 394-word text passage followed by a 30-minute presentation on mind mapping. After the presentation, subjects were again given the text passage and instructed to create mind maps based upon the passage. The mind maps were collected and independently scored using the MMAR by 3 examiners. Interrater reliability was measured using the intraclass correlation coefficient (ICC) statistic. Statistics were calculated using SPSS version 12.0 (Chicago, IL). Results Analysis of the mind maps revealed the following: concept-links ICC = .05 (95% CI, -.42 to .38), cross-links ICC = .58 (95% CI, .37 to .73), hierarchies ICC = .23 (95% CI, -.15 to .50), examples ICC = .53 (95% CI, .29 to .69), pictures ICC = .86 (95% CI, .79 to .91), colors ICC = .73 (95% CI, .59 to .82), and total score ICC = .86 (95% CI, .79 to .91). Conclusion The high ICC value for total mind map score indicates strong MMAR interrater reliability. Pictures and colors demonstrated moderate to strong interrater reliability. We conclude that the MMAR may be a valid and reliable tool to assess mind maps in medicine. However, further research on the validity and reliability of the MMAR is necessary. PMID:19400964
Accuracy and reliability of observational gait analysis data: judgments of push-off in gait after stroke.

PubMed

McGinley, Jennifer L; Goldie, Patricia A; Greenwood, Kenneth M; Olney, Sandra J

2003-02-01

Physical therapists routinely observe gait in clinical practice. The purpose of this study was to determine the accuracy and reliability of observational assessments of push-off in gait after stroke. Eighteen physical therapists and 11 subjects with hemiplegia following a stroke participated in the study. Measurements of ankle power generation were obtained from subjects following stroke using a gait analysis system. Concurrent videotaped gait performances were observed by the physical therapists on 2 occasions. Ankle power generation at push-off was scored as either normal or abnormal using two 11-point rating scales. These observational ratings were correlated with the measurements of peak ankle power generation. A high correlation was obtained between the observational ratings and the measurements of ankle power generation (mean Pearson r=.84). Interobserver reliability was moderately high (mean intraclass correlation coefficient [ICC (2,1)]=.76). Intraobserver reliability also was high, with a mean ICC (2,1) of.89 obtained. Physical therapists were able to make accurate and reliable judgments of push-off in videotaped gait of subjects following stroke using observational assessment. Further research is indicated to explore the accuracy and reliability of data obtained with observational gait analysis as it occurs in clinical practice.
Assessing physical activity during youth sport: the Observational System for Recording Activity in Children: Youth Sports.

PubMed

Cohen, Alysia; McDonald, Samantha; McIver, Kerry; Pate, Russell; Trost, Stewart

2014-05-01

The purpose of this study was to evaluate the validity and interrater reliability of the Observational System for Recording Activity in Children: Youth Sports (OSRAC:YS). Children (N = 29) participating in a parks and recreation soccer program were observed during regularly scheduled practices. Physical activity (PA) intensity and contextual factors were recorded by momentary time-sampling procedures (10-second observe, 20-second record). Two observers simultaneously observed and recorded children's PA intensity, practice context, social context, coach behavior, and coach proximity. Interrater reliability was based on agreement (Kappa) between the observer's coding for each category, and the Intraclass Correlation Coefficient (ICC) for percent of time spent in MVPA. Validity was assessed by calculating the correlation between OSRAC:YS estimated and objectively measured MVPA. Kappa statistics for each category demonstrated substantial to almost perfect interobserver agreement (Kappa = 0.67-0.93). The ICC for percent time in MVPA was 0.76 (95% C.I. = 0.49-0.90). A significant correlation (r = .73) was observed for MVPA recorded by observation and MVPA measured via accelerometry. The results indicate the OSRAC:YS is a reliable and valid tool for measuring children's PA and contextual factors during a youth soccer practice.
Measuring occupational balance and its relationship to perceived stress and health: Mesurer l'équilibre occupationnel et sa relation avec le stress perçus et la santé.

PubMed

Yu, Yu; Manku, Mandeep; Backman, Catherine L

2018-04-01

There is an assumption that occupational balance is integrally related to health and well-being. This study aimed to investigate test-retest reliability of the English-translated Occupational Balance Questionnaire (OBQ), its relationship to measures of health (Short Form Health Survey-36 Version 2.0 [SF-36v2]) and stress (Perceived Stress Scale-10; PSS-10), and demographic differences in OBQ scores in Canadian adults. Test-retest reliability (2 weeks) was assessed using intraclass correlation (ICC) coefficients. Online surveys from 86 adults were analyzed using descriptive, correlational, and t test statistics. OBQ test-retest reliability was ICC = 0.74 (95% CI [0.34, 0.90]; p = .003) when excluding an influential case ( n = 20). OBQ correlations with PSS-10 were r = -.72; with SF-36v2 Mental Component Score, r = .65; and with Physical Component Score, r = .31; all p < .001. Age and gender had no impact on OBQ scores. Findings help elucidate relationships among health, stress, and occupational balance; however, further psychometric testing is warranted before using OBQ for clinical purposes.
Cross-cultural adaptation, reliability and validity of the Turkish version of the Hospital for Special Surgery (HSS) Knee Score.

PubMed

Narin, Selnur; Unver, Bayram; Bakırhan, Serkan; Bozan, Ozgür; Karatosun, Vasfi

2014-01-01

The purpose of this study was to adapt the English version of the Hospital for Special Surgery (HSS) knee score for use in a Turkish population and to evaluate its validity, reliability and cultural adaptation. Standard forward-back translation of the HSS knee score was performed and the Turkish version was applied in 73 patients. The Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC), Mini-Mental State Examination and sit-to-stand test were also performed and analyzed. Internal consistency reliability was tested using Cronbach's alpha. The intraclass correlation coefficient (ICC) was used to calculate the test-retest reliability at one-week intervals. Validity was assessed by calculating the Pearson correlation between the HSS, WOMAC and sit-to-stand test scores. The ICC ranged from 0.98 to 0.99 with high internal consistency (Cronbach's alpha: 0.87). The WOMAC score correlated with total HSS score (r: -0.80, p<0.001) and sit-to-stand score (r: 0.12, p: 0.312). The Turkish version of the HSS knee score is reliable and valid in evaluating the total knee arthroplasty in Turkish patients.
Intraoperative specimen radiography in patients with nonpalpable malignant breast lesions.

PubMed

Schmachtenberg, C; Engelken, F; Fischer, T; Bick, U; Poellinger, A; Fallenberg, E M

2012-07-01

Specimen mammography of nonpalpable wire-localized breast lesions is the standard in breast-conserving surgery. The aim of this study was to evaluate the reliability of intraoperative 2-view specimen mammography in different cancer types. After ethics approval, 3 readers retrospectively evaluated margins on 266 2-view specimen radiographs. They determined the closest margin and the orientation. The results were correlated with the histopathology (intra-class correlation coefficient [ICC] and contingency coefficient [CC]) and compared (Wilcoxon test). Invasive ductal carcinoma (IDC) with ductal carcinoma in situ (DCIS) was present in 115 (43 %), IDC in 75 (28 %), invasive lobular carcinoma (ILC) in 57 (22 %) and rare cancers (CA) in 19 specimens (7 %). The sensitivity/specificity and positive/negative predictive value (P/NPV) of specimen mammography were 0.50/0.86 and 0.86/0.50 for CA, 0.42/0.68 and 0.48/0.63 for IDC, 0.36/0.81 and 0.69/0.51 for ILC, and 0.22/0.78 and 0.68/0.32 for IDC+DCIS. Readers correctly identified the orientation of the closest margin in at least one view in an average of 149 specimens (56 %). CCs were between 0.680 (IDC) and 0.912 (CA), suggesting a moderate correlation between radiographic and histological orientation. The correlations were worse for the radiographic and histological distances, with ICC ranging from 0.238 (ILC) to 0.475 (CA). The Wilcoxon test revealed overestimation of the radiographic margins compared to the histological ones for DCIS. Our results suggest that specimen radiography has relatively good overall specificity and good PPV, while the sensitivity and NPV are low for DCIS. A negative result on specimen radiography does not rule out histologically involved margins. © Georg Thieme Verlag KG Stuttgart · New York.
Translation, cross-cultural adaptation, and validation of the Turkish version of the Harris Hip Score.

PubMed

Çelik, Derya; Can, Canan; Aslan, Yasemin; Ceylan, Hasan Huseyin; Bilsel, Kerem; Ozdincler, Arzu Razak

2014-01-01

The Harris Hip Score (HHS) developed to assess function and pain from the perspective of patients hip pathologies. The purpose of this study was to translate and culturally adapt the HHS into Turkish, and thereby determine the reliability and validity of the translated version. The HHS was translated into Turkish in accordance with the stages recommended by Beaton. The measurement properties of the HHS were tested in 80 patients; 52 males, mean age 51 years (range 21-75 years) suffering from different hip pathologies. The test-retest reliability was tested in 58 patients; 28 males mean age, 52 years (range 30-73 years) after an interval of seven days. The Cronbach's Alpha was used to assess internal consistency and the intra-class correlation coefficient (ICC) was used to estimate the test-retest reliability. Patients were asked to answer the Oxford Hip Score (OHS), the Western Ontario and McMaster Universities Arthritis Index (WOMAC), the VAS and the Short Form-36 (SF-36) for the validity of the estimation. The Turkish version of the HHS showed sufficient internal consistency (Cronbach's alpha,0.70) and test-retest reliability (ICC = 0.91). The correlation coefficients between the HHS, the WOMAC and the OHS were 0.64 and 0.89 respectively. The highest correlations between the HHS and SF-36 were with the physical function scale (r = 0.72), and the lowest correlations were with the mental function scale (r = 0.10). We observed no floor or ceiling effects. The Turkish version of the HHS has sufficient reliability and validity to measure patient-reported outcome for Turkish-speaking individuals with a variety of hip disorders.
Value of three-dimensional volume rendering images in the assessment of the centrality index for preoperative planning in patients with renal masses.

PubMed

Sofia, C; Magno, C; Silipigni, S; Cantisani, V; Mucciardi, G; Sottile, F; Inferrera, A; Mazziotti, S; Ascenti, G

2017-01-01

To evaluate the precision of the centrality index (CI) measurement on three-dimensional (3D) volume rendering technique (VRT) images in patients with renal masses, compared to its standard measurement on axial images. Sixty-five patients with renal lesions underwent contrast-enhanced multidetector (MD) computed tomography (CT) for preoperative imaging. Two readers calculated the CI on two-dimensional axial images and on VRT images, measuring it in the plane that the tumour and centre of the kidney were lying in. Correlation and agreement of interobserver measurements and inter-method results were calculated using intraclass correlation (ICC) coefficients and the Bland-Altman method. Time saving was also calculated. The correlation coefficients were r=0.99 (p<0.05) and r=0.99 (p<0.05) for both the CI on axial and VRT images, with an ICC of 0.99, and 0.99, respectively. Correlation between the two methods of measuring the CI on VRT and axial CT images was r=0.99 (p<0.05). The two methods showed a mean difference of -0.03 (SD 0.13). Mean time saving per each examination with VRT was 45.5%. The present study showed that VRT and axial images produce almost identical values of CI, with the advantages of greater ease of execution and a time saving of almost 50% for 3D VRT images. In addition, VRT provides an integrated perspective that can better assist surgeons in clinical decision making and in operative planning, suggesting this technique as a possible standard method for CI measurement. Copyright © 2016 The Royal College of Radiologists. Published by Elsevier Ltd. All rights reserved.
Psychometric properties of the Spanish version of the Mindful Attention Awareness Scale (MAAS) in patients with fibromyalgia.

PubMed

Cebolla, Ausias; Luciano, Juan V; DeMarzo, Marcelo Piva; Navarro-Gil, Mayte; Campayo, Javier Garcia

2013-01-14

Mindful-based interventions improve functioning and quality of life in fibromyalgia (FM) patients. The aim of the study is to perform a psychometric analysis of the Spanish version of the Mindful Attention Awareness Scale (MAAS) in a sample of patients diagnosed with FM. The following measures were administered to 251 Spanish patients with FM: the Spanish version of MAAS, the Chronic Pain Acceptance Questionnaire, the Pain Catastrophising Scale, the Injustice Experience Questionnaire, the Psychological Inflexibility in Pain Scale, the Fibromyalgia Impact Questionnaire and the Euroqol. Factorial structure was analysed using Confirmatory Factor Analyses (CFA). Cronbach's α coefficient was calculated to examine internal consistency, and the intraclass correlation coefficient (ICC) was calculated to assess the test-retest reliability of the measures. Pearson's correlation tests were run to evaluate univariate relationships between scores on the MAAS and criterion variables. The MAAS scores in our sample were low (M = 56.7; SD = 17.5). CFA confirmed a two-factor structure, with the following fit indices [sbX2 = 172.34 (p < 0.001), CFI = 0.95, GFI = 0.90, SRMR = 0.05, RMSEA = 0.06. MAAS was found to have high internal consistency (Cronbach's α = 0.90) and adequate test-retest reliability at a 1-2 week interval (ICC = 0.90). It showed significant and expected correlations with the criterion measures with the exception of the Euroqol (Pearson = 0.15). Psychometric properties of the Spanish version of the MAAS in patients with FM are adequate. The dimensionality of the MAAS found in this sample and directions for future research are discussed.
[Validity and Reliability of the Attitudes Toward Sexuality in the Elderly Questionnaire in Cartagena, Colombia].

PubMed

Melguizo-Herrera, Estela; Álvarez-Romero, Yuleysi; Cabarcas-Mendoza, Mayerlin Vanessa; Calvo-Rodríguez, Rossy Stefanie; Flórez-Almanza, Jeomaidis; Moadie-Contreras, Olga Patricia; Campo-Arias, Adalberto

2015-01-01

There are many stereotypes and prejudices about the sexual lives of the elderly. However, there are no validated and reliable tools for measuring these in the Latin-American context. To determine the internal consistency, dimensionality, differential item functioning (DIF) by gender and stability of the Attitudes towards Sexuality in the Elderly Questionnaire (ASEQ) in adults over 60 years-old in Cartagena, Colombia. A validation study was designed that included a sample of 130 participants without cognitive impairment attending a Life Center. The ages ranged between 60 and 90 years (mean, 73.7±8.0), and there were 61.5% females. Internal consistency was calculated using Cronbach alpha and McDonald omega, exploratory factor analysis (EFA) (dimensionality), DIF by gender (item response theory) with Kendall correlation, and stability (reproducibility) with Pearson correlation and intraclass correlation coefficient (ICC). The ASEQ showed high internal consistency on the first application (α=.83 and ω=.87) and in the second one (α=.85 and ω=.89). AFE showed two salient factors (prejudices and limitations) that explained 42.6% of the total variance. The IDF presented appropriate coefficients, with the exception of item 14 that showed a high value (τ=.37). ASEQ showed high stability (r=.82 and ICC=.89; 95% confidence interval, 0.83- 0.92; P<.001). ASEQ is a two-dimensional and reliable scale in older adults attending a Life Center in Cartagena, Colombia. New studies are required to evaluate the performance in a representative sample. Copyright © 2014 Asociación Colombiana de Psiquiatría. Publicado por Elsevier España. All rights reserved.
Quantifying frontal plane knee motion during single limb squats: reliability and validity of 2-dimensional measures.

PubMed

Gwynne, Craig R; Curran, Sarah A

2014-12-01

Clinical assessment of lower limb kinematics during dynamic tasks may identify individuals who demonstrate abnormal movement patterns that may lead to etiology of exacerbation of knee conditions such as patellofemoral joint (PFJt) pain. The purpose of this study was to determine the reliability, validity and associated measurement error of a clinically appropriate two-dimensional (2-D) procedure of quantifying frontal plane knee alignment during single limb squats. Nine female and nine male recreationally active subjects with no history of PFJt pain had frontal plane limb alignment assessed using three-dimensional (3-D) motion analysis and digital video cameras (2-D analysis) while performing single limb squats. The association between 2-D and 3-D measures was quantified using Pearson's product correlation coefficients. Intraclass correlation coefficients (ICCs) were determined for within- and between-session reliability of 2-D data and standard error of measurement (SEM) was used to establish measurement error. Frontal plane limb alignment assessed with 2-D analysis demonstrated good correlation compared with 3-D methods (r = 0.64 to 0.78, p < 0.001). Within-session (0.86) and between-session ICCs (0.74) demonstrated good reliability for 2-D measures and SEM scores ranged from 2° to 4°. 2-D measures have good consistency and may provide a valid measure of lower limb alignment when compared to existing 3-D methods. Assessment of lower limb kinematics using 2-D methods may be an accurate and clinically useful alternative to 3-D motion analysis when identifying individuals who demonstrate abnormal movement patterns associated with PFJt pain. 2b.

Psychometric properties of the Mayo Elbow Performance Score.

PubMed

Celik, Derya

2015-06-01

To translate and culturally adapt the Mayo Elbow Performance Score (MEPS), a widely used instrument for evaluating disability associated with elbow injuries, into Turkish (MEPS-T) and to determine psychometric properties of the translated version. The MEPS was translated into Turkish using published methodological guidelines. The measurement properties of the MEPS-T (construct validity and floor and ceiling effects) were tested in 91 patients with elbow pathology. The reproducibility of the MEPS-T was tested in 59 patients over 7-14 days. The responsiveness of the MEPS-T was tested in a subgroup of 46 patients diagnosed with lateral epicondylitis and who received conservative treatment for 6 weeks. The interclass correlation coefficient (ICC) was used to estimate the test-retest reliability. The construct validity was analyzed with the disabilities of the arm, shoulder and hand (DASH), Visual Analog Scale (VAS) and the Short Form 36 (SF-36). Effect size (ES) was used to assess the responsiveness. The distribution of floor and ceiling effects was determined. The MEPS-T showed very good test-retest reliability (ICC 0.89). The correlation coefficients between the MEPS-T and DASH and VAS were -0.61 and -0.53, respectively (p < 0.001). The highest correlations were between the MEPS-T and the mental component summary (r = 0.47, p = 0.001) and role emotional (r = 0.45, p = 0.001). The MEPS-T ES, 0.50, was moderate (95% CI 0.33-0.62). We observed no ceiling or floor effects. The MEPS-T represents a valid, reliable and moderately responsive instrument for evaluating patients with elbow disease.
The Reliability, Validity, and Normative Data of Interpupillary Distance and Pupil Diameter Using Eye-Tracking Technology

PubMed Central

Murray, Nicholas P.; Hunfalvay, Melissa; Bolte, Takumi

2017-01-01

Purpose The purpose of this study was to determine the reliability of interpupillary distance (IPD) and pupil diameter (PD) measures using an infrared eye tracker and central point stimuli. Validity of the test compared to known clinical tools was determined, and normative data was established against which individuals can measure themselves. Methods Participants (416) across various demographics were examined for normative data. Of these, 50 were examined for reliability and validity. Validity for IPD measured the test (RightEye IPD/PD) against the PL850 Pupilometer and the Essilor Digital CRP. For PD, the test was measured against the Rosenbaum Pocket Vision Screener (RPVS). Reliability was analyzed with intraclass correlation coefficients (ICC) between trials with Cronbach's alpha (CA) and the standard error of measurement for each ICC. Convergent validity was investigated by calculating the bivariate correlation coefficient. Results Reliability results were strong (CA > 0.7) for all measures. High positive significant correlations were found between the RightEye IPD test and the PL850 Pupilometer (P < 0.001) and Essilor Digital CRP (P < 0.001) and for the RightEye PD test and the RPVS (P < 0.001). Conclusions Using infrared eye tracking and the RightEye IPD/PD test stimuli, reliable and accurate measures of IPD and PD were found. Results from normative data showed an adequate comparison for people with normal vision development. Translational Relevance Results revealed a central point of fixation may remove variability in examining PD reliably using infrared eye tracking when consistent environmental and experimental procedures are conducted. PMID:28685104
Reliability of air displacement plethysmography in a large, heterogeneous sample.

PubMed

Noreen, Eric E; Lemon, Peter W R

2006-08-01

Several studies have assessed the validity of air displacement plethysmography (ADP), but few have assessed the reliability of ADP using a large, heterogeneous sample. This study was conducted to determine the reliability of ADP using the Bod Pod in a large, heterogeneous sample. A total of 980 healthy men and women (30 +/- 15 yr, mean +/- SD) completed two body composition assessments separated by 15-30 min. All testing was done in accordance with the manufacturer's instructions. A significant correlation (r = 0.992, P = 0.001) was found between body density (BD) 1 (1.046 +/- 0.001 kg.L(-1); mean +/- SEM) and BD 2 (1.046 +/- 0.001 kg.L(-1). A paired t-test revealed no significant difference between BD 1 and 2 (P = 0.935). The coefficient of variation (CV) for BD was 0.15%. A significant intraclass correlation coefficient (ICC) was found for BD (ICC = 0.996, P = 0.001), and the standard error of measurement (SEM) was 0.001 kg.L(-1). Body mass (BM) 1 and 2 were correlated significantly (r = 0.999, P = 0.001); however, a significant (P = 0.001) decrease was seen from BM 1 (75.510 +/- 0.461 kg) to BM 2 (75.497 +/- 0.461 kg). Body volume (BV) tended to decrease (P = 0.08) from BV 1 (69.900 +/- 0.449 L) to BV 2 (69.884 +/- 0.449 L). ADP using the Bod Pod appears to assess BD reliably; however, the observed CV suggests that multiple trials are necessary to detect small treatment effects.
Fetal frontomaxillary facial angle between 11 and 13 + 6 weeks of gestation in a Brazilian population: influence of different races.

PubMed

Panigassi, Ana Paula Nascimento; Araujo Júnior, Edward; Nardozza, Luciano Marcondes Machado; Moron, Antonio Fernandes; Pares, David Baptista da Silva

2013-07-01

To evaluate the influence of different races over the measurement of the frontomaxillary facial angle between 11 and 13 + 6 weeks of pregnancy in a Brazilian population. A cross-sectional study was conducted with 332 healthy pregnant women, with a crown-rump length (CRL) between 47 and 84 mm. Such measurements were taken abdominally, using the mid-sagittal plane, and the angle was measured by tracing a line over the palate and a line from the anterosuperior maxillary angle all the way to the external part of the forehead. As for the reference intervals, a simple linear regression between the frontomaxillary facial angle and the CRL was used, as well as Pearson's correlation coefficient (r). To evaluate the difference between races, a variance analysis was used (ANOVA). To calculate reproducibility, the intraclass correlation coefficient (ICC) was used. The means for the fetal frontomaxillary facial angle in white, black and mixed races were 81.8 ± 6.6; 82.2 ± 6.1 and 81.4 ± 6.2 mm, respectively. There was no statistical difference between races (p = 0.713). A decreasing correlation between the frontomaxillary facial angle and the CRL was observed for the black (r = -0.450) and mixed (r = -0.212) races. Excellent intraobserver reproducibility was observed, as well as a satisfactory interobserver reproducibility, with ICC of 0.858 and 0.605, respectively. There were no significative statistical differences in the measurement of the fetal frontomaxillary facial angle between 11 and 13 + 6 weeks of pregnancy in the different races in a Brazilian population.
Micro-scale Spatial Clustering of Cholera Risk Factors in Urban Bangladesh.

PubMed

Bi, Qifang; Azman, Andrew S; Satter, Syed Moinuddin; Khan, Azharul Islam; Ahmed, Dilruba; Riaj, Altaf Ahmed; Gurley, Emily S; Lessler, Justin

2016-02-01

Close interpersonal contact likely drives spatial clustering of cases of cholera and diarrhea, but spatial clustering of risk factors may also drive this pattern. Few studies have focused specifically on how exposures for disease cluster at small spatial scales. Improving our understanding of the micro-scale clustering of risk factors for cholera may help to target interventions and power studies with cluster designs. We selected sets of spatially matched households (matched-sets) near cholera case households between April and October 2013 in a cholera endemic urban neighborhood of Tongi Township in Bangladesh. We collected data on exposures to suspected cholera risk factors at the household and individual level. We used intra-class correlation coefficients (ICCs) to characterize clustering of exposures within matched-sets and households, and assessed if clustering depended on the geographical extent of the matched-sets. Clustering over larger spatial scales was explored by assessing the relationship between matched-sets. We also explored whether different exposures tended to appear together in individuals, households, and matched-sets. Household level exposures, including: drinking municipal supplied water (ICC = 0.97, 95%CI = 0.96, 0.98), type of latrine (ICC = 0.88, 95%CI = 0.71, 1.00), and intermittent access to drinking water (ICC = 0.96, 95%CI = 0.87, 1.00) exhibited strong clustering within matched-sets. As the geographic extent of matched-sets increased, the concordance of exposures within matched-sets decreased. Concordance between matched-sets of exposures related to water supply was elevated at distances of up to approximately 400 meters. Household level hygiene practices were correlated with infrastructure shown to increase cholera risk. Co-occurrence of different individual level exposures appeared to mostly reflect the differing domestic roles of study participants. Strong spatial clustering of exposures at a small spatial scale in a cholera endemic population suggests a possible role for highly targeted interventions. Studies with cluster designs in areas with strong spatial clustering of exposures should increase sample size to account for the correlation of these exposures.
Validity and Reliability of Spine Rasterstereography in Patients With Adolescent Idiopathic Scoliosis.

PubMed

Tabard-Fougère, Anne; Bonnefoy-Mazure, Alice; Hanquinet, Sylviane; Lascombes, Pierre; Armand, Stéphane; Dayer, Romain

2017-01-15

Test-retest study. This study aimed to evaluate the validity and reliability of rasterstereography in patients with adolescent idiopathic scoliosis (AIS) with a major curve Cobb angle (CA) between 10° and 40° for frontal, sagittal, and transverse parameters. Previous studies evaluating the validity and reliability of rasterstereography concluded that this technique had good accuracy compared with radiographs and a high intra- and interday reliability in healthy volunteers. To the best of our knowledge, the validity and reliability have not been assessed in AIS patients. Thirty-five adolescents with AIS (male = 13) aged 13.1 ± 2.0 years were included. To evaluate the validity of the scoliosis angle (SA) provided by rasterstereography, a comparison (t test, Pearson correlation) was performed with the CA obtained using 2D EOS® radiography (XR). Three rasterstereographic repeated measurements were independently performed by two operators on the same day (interrater reliability) and again by the first operator 1 week later (intrarater reliability). The variables of interest were the SA, lumbar lordosis, and thoracic kyphosis angle, trunk length, pelvic obliquity, and maximum, root mean square and amplitude of vertebral rotations. The data analyses used intraclass correlation coefficients (ICCs). The CA and SA were strongly correlated (R = 0.70) and were nonsignificantly different (P = 0.60). The intrarater reliability (same day: ICC [1, 1], n = 35; 1 week later: ICC [1, 3], n = 28) and interrater reliability (ICC [3, 3], n = 16) were globally excellent (ICC > 0.75) except for the assessment of pelvic obliquity. This study showed that the rasterstereographic system allows for the evaluation of AIS patients with a good validity compared with XR with an overall excellent intra- and interrater reliability. Based on these results, this automatic, fast, and noninvasive system can be used for monitoring the evolution of AIS in growing patients instead of repetitive radiographs, thereby reducing radiation exposure and decreasing costs. 4.
Physical activity monitoring in patients with peripheral arterial disease: validation of an activity monitor.

PubMed

Fokkenrood, H J P; Verhofstad, N; van den Houten, M M L; Lauret, G J; Wittens, C; Scheltinga, M R M; Teijink, J A W

2014-08-01

The daily life physical activity (PA) of patients with peripheral arterial disease (PAD) may be severely hampered by intermittent claudication (IC). From a therapeutic, as well as research, point of view, it may be more relevant to determine improvement in PA as an outcome measure in IC. The aim of this study was to validate daily activities using a novel type of tri-axial accelerometer (Dynaport MoveMonitor) in patients with IC. Patients with IC were studied during a hospital visit. Standard activities (locomotion, lying, sitting, standing, shuffling, number of steps and "not worn" detection) were video recorded and compared with activities scored by the MoveMonitor. Inter-rater reliability (expressed in intraclass correlation coefficients [ICC]), sensitivity, specificity, and positive predictive values (PPV) were calculated for each activity. Twenty-eight hours of video observation were analysed (n = 21). Our video annotation method (the gold standard method) appeared to be accurate for most postures (ICC > 0.97), except for shuffling (ICC = 0.38). The MoveMonitor showed a high sensitivity (>86%), specificity (>91%), and PPV (>88%) for locomotion, lying, sitting, and "not worn" detection. Moderate accuracy was found for standing (46%), while shuffling appeared to be undetectable (18%). A strong correlation was found between video recordings and the MoveMonitor with regard to the calculation of the "number of steps" (ICC = 0.90). The MoveMonitor provides accurate information on a diverse set of postures, daily activities, and number of steps in IC patients. However, the detection of low amplitude movements, such as shuffling and "sitting to standing" transfers, is a matter of concern. This tool is useful in assessing the role of PA as a novel, clinically relevant outcome parameter in IC. Copyright © 2014 European Society for Vascular Surgery. Published by Elsevier Ltd. All rights reserved.
Validation of a semi-quantitative food frequency questionnaire to assess food groups and nutrient intake.

PubMed

Macedo-Ojeda, Gabriela; Vizmanos-Lamotte, Barbara; Márquez-Sandoval, Yolanda Fabiola; Rodríguez-Rocha, Norma Patricia; López-Uriarte, Patricia Josefina; Fernández-Ballart, Joan D

2013-11-01

Semi-quantitative Food Frequency Questionnaires (FFQs) analyze average food and nutrient intake over extended periods to associate habitual dietary intake with health problems and chronic diseases. A tool of this nature applicable to both women and men is not presently available in Mexico. To validate a FFQ for adult men and women. The study was conducted on 97 participants, 61% were women. Two FFQs were administered (with a one-year interval) to measure reproducibility. To assess validity, the second FFQ was compared against dietary record (DR) covering nine days. Statistical analyses included Pearson correlations and Intraclass Correlation Coefficients (ICC). The de-attenuation of the ICC resulting from intraindividual variability was controlled. The validity analysis was complemented by comparing the classification ability of FFQ to that of DR through concordance between intake categories and Bland-Altman plots. Reproducibility: ICC values for food groups ranged 0.42-0.87; the range for energy and nutrients was between 0.34 and 0.82. ICC values for food groups ranged 0.35-0.84; the range for energy and nutrients was between 0.36 and 0.77. Most subjects (56.7-76.3%) classified in the same or adjacent quintile for energy and nutrients using both methods. Extreme misclassification was <6.3% for all items. Bland-Altman plots reveal high concordance between FFQ and DR. FFQ produced sufficient levels of reproducibility and validity to determine average daily intake over one year. These results will enable the analysis of possible associations with chronic diseases and dietary diagnoses in adult populations of men and women. Copyright AULA MEDICA EDICIONES 2013. Published by AULA MEDICA. All rights reserved.
Repeatability of a running heat tolerance test.

PubMed

Mee, Jessica A; Doust, Jo; Maxwell, Neil S

2015-01-01

At present there is no standardised heat tolerance test (HTT) procedure adopting a running mode of exercise. Current HTTs may misdiagnose a runner's susceptibility to a hyperthermic state due to differences in exercise intensity. The current study aimed to establish the repeatability of a practical running test to evaluate individual's ability to tolerate exercise heat stress. Sixteen (8M, 8F) participants performed the running HTT (RHTT) (30 min, 9 km h(-1), 2% elevation) on two separate occasions in a hot environment (40 °C and 40% relative humidity). There were no differences in peak rectal temperature (RHTT1: 38.82 ± 0.47 °C, RHTT2: 38.86 ± 0.49 °C, Intra-class correlation coefficient (ICC)=0.93, typical error of measure (TEM) = 0.13 °C), peak skin temperature (RHTT1: 38.12 ± 0.45, RHTT2: 38.11 ± 0.45 °C, ICC = 0.79, TEM = 0.30 °C), peak heart rate (RHTT1: 182 ± 15 beats min(-1), RHTT2: 183 ± 15 beats min(-1), ICC = 0.99, TEM = 2 beats min(-1)), nor sweat rate (1721 ± 675 g h(-1), 1716 ± 745 g h(-1), ICC = 0.95, TEM = 162 g h(-1)) between RHTT1 and RHTT2 (p>0.05). Results demonstrate good agreement, strong correlations and small differences between repeated trials, and the TEM values suggest low within-participant variability. The RHTT was effective in differentiating between individuals physiological responses; supporting a heat tolerance continuum. The findings suggest the RHTT is a repeatable measure of physiological strain in the heat and may be used to assess the effectiveness of acute and chronic heat alleviating procedures. Copyright © 2015 Elsevier Ltd. All rights reserved.
Predictors and Variability of Urinary Paraben Concentrations in Men and Women, Including before and during Pregnancy

PubMed Central

Smith, Kristen W.; Braun, Joe M.; Williams, Paige L.; Ehrlich, Shelley; Correia, Katharine F.; Calafat, Antonia M.; Ye, Xiaoyun; Ford, Jennifer; Keller, Myra; Meeker, John D.

2012-01-01

Background: Parabens are suspected endocrine disruptors and ubiquitous preservatives used in personal care products, pharmaceuticals, and foods. No studies have assessed the variability of parabens in women, including during pregnancy. Objective: We evaluated predictors and variability of urinary paraben concentrations. Methods: We measured urinary concentrations of methyl (MP), propyl (PP), and butyl paraben (BP) among couples from a fertility center. Mixed-effects regression models were fit to examine demographic predictors of paraben concentrations and to calculate intraclass correlation coefficients (ICCs). Results: Between 2005 and 2010, we collected 2,721 spot urine samples from 245 men and 408 women. The median concentrations were 112 µg/L (MP), 24.2 µg/L (PP), and 0.70 µg/L (BP). Urinary MP and PP concentrations were 4.6 and 7.8 times higher in women than men, respectively, and concentrations of both MP and PP were 3.8 times higher in African Americans than Caucasians. MP and PP concentrations we CI re slightly more variable in women (ICC = 0.42, 0.43) than men (ICC = 0.54, 0.51), and were weakly correlated between partners (r = 0.27–0.32). Among 129 pregnant women, urinary paraben concentrations were 25–45% lower during pregnancy than before pregnancy, and MP and PP concentrations were more variable (ICCs of 0.38 and 0.36 compared with 0.46 and 0.44, respectively). Conclusions: Urinary paraben concentrations were more variable in women compared with men, and during pregnancy compared with before pregnancy. However, results for this study population suggest that a single urine sample may reasonably represent an individual’s exposure over several months, and that a single sample collected during pregnancy may reasonably classify gestational exposure. PMID:22721761
Predictors and variability of urinary paraben concentrations in men and women, including before and during pregnancy.

PubMed

Smith, Kristen W; Braun, Joe M; Williams, Paige L; Ehrlich, Shelley; Correia, Katharine F; Calafat, Antonia M; Ye, Xiaoyun; Ford, Jennifer; Keller, Myra; Meeker, John D; Hauser, Russ

2012-11-01

Parabens are suspected endocrine disruptors and ubiquitous preservatives used in personal care products, pharmaceuticals, and foods. No studies have assessed the variability of parabens in women, including during pregnancy. We evaluated predictors and variability of urinary paraben concentrations. We measured urinary concentrations of methyl (MP), propyl (PP), and butyl paraben (BP) among couples from a fertility center. Mixed-effects regression models were fit to examine demographic predictors of paraben concentrations and to calculate intraclass correlation coefficients (ICCs). Between 2005 and 2010, we collected 2,721 spot urine samples from 245 men and 408 women. The median concentrations were 112 µg/L (MP), 24.2 µg/L (PP), and 0.70 µg/L (BP). Urinary MP and PP concentrations were 4.6 and 7.8 times higher in women than men, respectively, and concentrations of both MP and PP were 3.8 times higher in African Americans than Caucasians. MP and PP concentrations were slightly more variable in women (ICC = 0.42, 0.43) than men (ICC = 0.54, 0.51), and were weakly correlated between partners (r = 0.27-0.32). Among 129 pregnant women, urinary paraben concentrations were 25-45% lower during pregnancy than before pregnancy, and MP and PP concentrations were more variable (ICCs of 0.38 and 0.36 compared with 0.46 and 0.44, respectively). Urinary paraben concentrations were more variable in women compared with men, and during pregnancy compared with before pregnancy. However, results for this study population suggest that a single urine sample may reasonably represent an individual's exposure over several months, and that a single sample collected during pregnancy may reasonably classify gestational exposure.
Validity and reliability of the Fitbit Zip as a measure of preschool children’s step count

PubMed Central

Sharp, Catherine A; Mackintosh, Kelly A; Erjavec, Mihela; Pascoe, Duncan M; Horne, Pauline J

2017-01-01

Objectives Validation of physical activity measurement tools is essential to determine the relationship between physical activity and health in preschool children, but research to date has not focused on this priority. The aims of this study were to ascertain inter-rater reliability of observer step count, and interdevice reliability and validity of Fitbit Zip accelerometer step counts in preschool children. Methods Fifty-six children aged 3–4 years (29 girls) recruited from 10 nurseries in North Wales, UK, wore two Fitbit Zip accelerometers while performing a timed walking task in their childcare settings. Accelerometers were worn in secure pockets inside a custom-made tabard. Video recordings enabled two observers to independently code the number of steps performed in 3 min by each child during the walking task. Intraclass correlations (ICCs), concordance correlation coefficients, Bland-Altman plots and absolute per cent error were calculated to assess the reliability and validity of the consumer-grade device. Results An excellent ICC was found between the two observer codings (ICC=1.00) and the two Fitbit Zips (ICC=0.91). Concordance between the Fitbit Zips and observer counts was also high (r=0.77), with an acceptable absolute per cent error (6%–7%). Bland-Altman analyses identified a bias for Fitbit 1 of 22.8±19.1 steps with limits of agreement between −14.7 and 60.2 steps, and a bias for Fitbit 2 of 25.2±23.2 steps with limits of agreement between −20.2 and 70.5 steps. Conclusions Fitbit Zip accelerometers are a reliable and valid method of recording preschool children’s step count in a childcare setting. PMID:29081984
Reliability and Validity of 2 Self-Report Measures to Assess Sedentary Behavior in Older Adults.

PubMed

Gennuso, Keith P; Matthews, Charles E; Colbert, Lisa H

2015-05-01

The purpose of this study was to examine the reliability and validity of 2 currently available physical activity surveys for assessing time spent in sedentary behavior (SB) in older adults. Fifty-eight adults (≥65 years) completed the Yale Physical Activity Survey for Older Adults (YPAS) and Community Health Activities Model Program for Seniors (CHAMPS) before and after a 10-day period during which they wore an ActiGraph accelerometer (ACC). Intraclass correlation coefficients (ICC) examined test-retest reliability. Overall percent agreement and a kappa statistic examined YPAS validity. Lin's concordance correlation, Pearson correlation, and Bland-Altman analysis examined CHAMPS validity. Both surveys had moderate test-retest reliability (ICC: YPAS = 0.59 (P < .001), CHAMPS = 0.64 (P < .001)) and significantly underestimated SB time. Agreement between YPAS and ACC was low (κ = -0.0003); however, there was a linear increase (P < .01) in ACC-derived SB time across YPAS response categories. There was poor agreement between ACC-derived SB and CHAMPS (Lin's r = .005; 95% CI, -0.010 to 0.020), and no linear trend across CHAMPS quartiles (P = .53). Neither of the surveys should be used as the sole measure of SB in a study; though the YPAS has the ability to rank individuals, providing it with some merit for use in correlational SB research.
Validity and Reliability of Fitbit Flex for Step Count, Moderate to Vigorous Physical Activity and Activity Energy Expenditure

PubMed Central

Sushames, Ashleigh; Edwards, Andrew; Thompson, Fintan; McDermott, Robyn; Gebel, Klaus

2016-01-01

Objectives To examine the validity and reliability of the Fitbit Flex against direct observation for measuring steps in the laboratory and against the Actigraph for step counts in free-living conditions and for moderate-to-vigorous physical activity (MVPA) and activity energy expenditure (AEE) overall. Methods Twenty-five adults (12 females, 13 males) wore a Fitbit Flex and an Actigraph GT3X+ during a laboratory based protocol (including walking, incline walking, running and stepping) and free-living conditions during a single day period to examine measurement of steps, AEE and MVPA. Twenty-four of the participants attended a second session using the same protocol. Results Intraclass correlations (ICC) for test-retest reliability of the Fitbit Flex were strong for walking (ICC = 0.57), moderate for stair stepping (ICC = 0.34), and weak for incline walking (ICC = 0.22) and jogging (ICC = 0.26). The Fitbit significantly undercounted walking steps in the laboratory (absolute proportional difference: 21.2%, 95%CI 13.0–29.4%), but it was more accurate, despite slightly over counting, for both jogging (6.4%, 95%CI 3.7–9.0%) and stair stepping (15.5%, 95%CI 10.1–20.9%). The Fitbit had higher coefficients of variation (Cv) for step counts compared to direct observation and the Actigraph. In free-living conditions, the average MVPA minutes were lower in the Fitbit (35.4 minutes) compared to the Actigraph (54.6 minutes), but AEE was greater from the Fitbit (808.1 calories) versus the Actigraph (538.9 calories). The coefficients of variation were similar for AEE for the Actigraph (Cv = 36.0) and Fitbit (Cv = 35.0), but lower in the Actigraph (Cv = 25.5) for MVPA against the Fitbit (Cv = 32.7). Conclusion The Fitbit Flex has moderate validity for measuring physical activity relative to direct observation and the Actigraph. Test-rest reliability of the Fitbit was dependant on activity type and had greater variation between sessions compared to the Actigraph. Physical activity surveillance studies using the Fitbit Flex should consider the potential effect of measurement reactivity and undercounting of steps. PMID:27589592
Validity and reliability of an adapted arabic version of the long international physical activity questionnaire.

PubMed

Helou, Khalil; El Helou, Nour; Mahfouz, Maya; Mahfouz, Yara; Salameh, Pascale; Harmouche-Karaki, Mireille

2017-07-24

The International Physical Actvity Questionnaire (IPAQ) is a validated tool for physical activity assessment used in many countries however no Arabic version of the long-form of this questionnaire exists to this date. Hence, the aim of this study was to cross-culturally adapt and validate an Arabic version of the long International Physical Activity Questionnaire (AIPAQ) equivalent to the French version (F-IPAQ) in a Lebanese population. The guidelines for cross-cultural adaptation provided by the World Health Organization and the International Physical Activity Questionnaire committee were followed. One hundred fifty-nine students and staff members from Saint Joseph University of Beirut were randomly recruited to participate in the study. Items of the A-IPAQ were compared to those from the F-IPAQ for concurrent validity using Spearman's correlation coefficient. Content validity of the questionnaire was assessed using factor analysis for the A-IPAQ's items. The physical activity indicators derived from the A-IPAQ were compared with the body mass index (BMI) of the participants for construct validity. The instrument was also evaluated for internal consistency reliability using Cronbach's alpha and Intraclass Correlation Coefficient (ICC). Finally, thirty-one participants were asked to complete the A-IPAQ on two occasions three weeks apart to examine its test-retest reliability. Bland-Altman analyses were performed to evaluate the extent of agreement between the two versions of the questionnaire and its repeated administrations. A high correlation was observed between answers of the F-IPAQ and those of the A-IPAQ, with Spearman's correlation coefficients ranging from 0.91 to 1.00 (p < 0.05). Bland-Altman analysis showed a high level of agreement between the two versions with all values scattered around the mean for total physical activity (mean difference = 5.3 min/week, 95% limits of agreement = -145.2 to 155.8). Negative correlations were observed between MET values and BMI, independent of age, gender or university campus. The A-IPAQ showed a high internal consistency reliability with Cronbach's alpha ranging from 0.769-1.00 (p < 0.001) and intraclass correlation coefficient (ICC) ranging from 0.625-0.999 (p < 0.001), except for a moderate agreement with the moderate garden/yard activity (alpha = 0.682; ICC = 0.518; p < 0.001). The A-IPAQ had moderate-to-good test-retest reliability for most of its items (ICC ranging from 0.66-0.96; p < 0.001) and the Bland-Altman analysis showed a satisfactory agreement between the two administrations of the A-IPAQ for total physical activity (mean difference = 99.8 min/week, 95% limits of agreement = -1105.3; 1304.9) and total vigorous and moderate physical activity (mean difference = -29.7 min/week, 95% limits of agreement = -777.6; 718.2). The modified Arabic version of the IPAQ showed acceptable validity and reliability for the assessment of physical activity among Lebanese adults. More studies are necessary in the future to assess its validity compared to a gold-standard criterion measure.
Obtaining the mean relative weights of the cost of care in Catalonia (Spain): retrospective application of the adjusted clinical groups case-mix system in primary health care.

PubMed

Sicras-Mainar, Antoni; Velasco-Velasco, Soledad; Navarro-Artieda, Ruth; Aguado Jodar, Alba; Plana-Ripoll, Oleguer; Hermosilla-Pérez, Eduardo; Bolibar-Ribas, Bonaventura; Prados-Torres, Alejandra; Violan-Fors, Concepción

2013-04-01

The study aims to obtain the mean relative weights (MRWs) of the cost of care through the retrospective application of the adjusted clinical groups (ACGs) in several primary health care (PHC) centres in Catalonia (Spain) in routine clinical practice. This is a retrospective study based on computerized medical records. All patients attended by 13 PHC teams in 2008 were included. The principle measurements were: demographic variables (age and sex), dependent variables (number of diagnoses and total costs), and case-mix or co-morbidity variables (International Classification of Primary Care). The costs model for each patient was established by differentiating the fix costs from the variable costs. In the bivariate analysis, the Student's t, analysis of variance, chi-squared, Pearson's linear correlation and Mann-Whitney-Wilcoxon tests were used. In order to compare the MRW of the present study with those of the United States (US), the concordance [intraclass correlation coefficient (ICC) and concordance correlation coefficient (CCC)] and the correlation (coefficient of determination: R²) were measured. The total number of patients studied was 227,235, and the frequentation was 5.9 visits/habitant/year) and with a mean diagnoses number of 4.5 (3.2). The distribution of costs was €148.7 million, of which 29.1% were fixed costs. The mean total cost per patient/year was €654.2 (851.7), which was considered to be the reference MRW. Relationship between study-MRW and US-MRW: ICC was 0.40 [confidential interval (CI) 95%: 0.21-0.60] and the CCC was 0.42 (CI 95%: 0.35-0.49). The correlation between the US MRW and the MRW of the present study can be seen; the adjusted R² value is 0.691. The explanatory power of the ACG classification was 36.9% for the total costs. The R² of the total cost without considering outliers was 56.9%. The methodology has been shown appropriate for promoting the calculation of the MRW for each category of the classification. The results provide a possible practical application in PHC clinical management. © 2012 Blackwell Publishing Ltd.
Differences in serum thyroglobulin measurements by 3 commercial immunoradiometric assay kits and laboratory standardization using Certified Reference Material 457 (CRM-457).

PubMed

Lee, Ji In; Kim, Ji Young; Choi, Joon Young; Kim, Hee Kyung; Jang, Hye Won; Hur, Kyu Yeon; Kim, Jae Hyeon; Kim, Kwang-Won; Chung, Jae Hoon; Kim, Sun Wook

2010-09-01

Serum thyroglobulin (Tg) is essential in the follow-up of patients with differentiated thyroid carcinoma (DTC). However, interchangeability and standardization between Tg assays have not yet been achieved, even with the development of an international Tg standard (Certified Reference Material 457 [CRM-457]). Serum Tg from 30 DTC patients and serially diluted CRM-457 were measured using 3 different immunoradiometric assays (IRMA-1, IRMA-2, IRMA-3). The intraclass correlation coefficient (ICC) method was used to describe the concordance of each IRMA to CRM-457. The serum Tg measured by 3 different IRMAs correlated well (r > .85, p < .0001), but clinically relevant discrepancies were found in 13.3% of patients. IRMA-3, which claims to be standardized to CRM-457, showed the best ICC (p(1) = .98) for the CRM-457. Hospitals caring for patients with DTC should either set their own cutoffs for IRMAs for Tg based on their patient pools, or adopt IRMAs standardized to CRM-457 and calibrate their laboratory using CRM-457.
Intersession reliability of fMRI activation for heat pain and motor tasks

PubMed Central

Quiton, Raimi L.; Keaser, Michael L.; Zhuo, Jiachen; Gullapalli, Rao P.; Greenspan, Joel D.

2014-01-01

As the practice of conducting longitudinal fMRI studies to assess mechanisms of pain-reducing interventions becomes more common, there is a great need to assess the test–retest reliability of the pain-related BOLD fMRI signal across repeated sessions. This study quantitatively evaluated the reliability of heat pain-related BOLD fMRI brain responses in healthy volunteers across 3 sessions conducted on separate days using two measures: (1) intraclass correlation coefficients (ICC) calculated based on signal amplitude and (2) spatial overlap. The ICC analysis of pain-related BOLD fMRI responses showed fair-to-moderate intersession reliability in brain areas regarded as part of the cortical pain network. Areas with the highest intersession reliability based on the ICC analysis included the anterior midcingulate cortex, anterior insula, and second somatosensory cortex. Areas with the lowest intersession reliability based on the ICC analysis also showed low spatial reliability; these regions included pregenual anterior cingulate cortex, primary somatosensory cortex, and posterior insula. Thus, this study found regional differences in pain-related BOLD fMRI response reliability, which may provide useful information to guide longitudinal pain studies. A simple motor task (finger-thumb opposition) was performed by the same subjects in the same sessions as the painful heat stimuli were delivered. Intersession reliability of fMRI activation in cortical motor areas was comparable to previously published findings for both spatial overlap and ICC measures, providing support for the validity of the analytical approach used to assess intersession reliability of pain-related fMRI activation. A secondary finding of this study is that the use of standard ICC alone as a measure of reliability may not be sufficient, as the underlying variance structure of an fMRI dataset can result in inappropriately high ICC values; a method to eliminate these false positive results was used in this study and is recommended for future studies of test–retest reliability. PMID:25161897
Temporal Stability of the Ford Insomnia Response to Stress Test (FIRST).

PubMed

Jarrin, Denise C; Chen, Ivy Y; Ivers, Hans; Drake, Christopher L; Morin, Charles M

2016-10-15

The Ford Insomnia Response to Stress Test (FIRST) is a self-report tool that measures sleep reactivity (i.e., vulnerability to experience situational insomnia under stressful conditions). Sleep reactivity has been termed a "trait-like" vulnerability; however, evidence of its long-term stability is lacking. The main objective of the current psychometric study was to investigate the temporal stability of the FIRST over two 6-mo intervals in a population-based sample of adults with and without insomnia. The temporal stability of the FIRST was also compared with the temporal stability of other scales associated with insomnia (trait-anxiety, arousability). Participants included 1,122 adults (mean age = 49.9 y, standard deviation = 14.8; 38.8% male) presenting with an insomnia syndrome (n = 159), insomnia symptoms (n = 152), or good sleep (n = 811). Participants completed the FIRST, the State-Trait Anxiety Inventory (trait-anxiety), and the Arousal Predisposition Scale (arousability) on three different occasions: baseline and at 6- and 12-mo follow-up. Intraclass correlation coefficients (ICCs) were computed for all scales (baseline to 6 mo and 6 to 12 mo). The FIRST yielded strong temporal stability from baseline to 6 mo among those with insomnia syndrome (ICC = 0.81), symptoms (ICC = 0.78), and good sleep (ICC = 0.81). Similar results were observed for 6 to 12 mo among those with insomnia syndrome (ICC = 0.74), insomnia symptoms (ICC = 0.82), and good sleep (ICC = 0.84). The stability of the FIRST was not comparable with the stability of trait-anxiety, but was somewhat comparable with the stability of arousability. Overall, the FIRST is a temporally reliable stable scale over 6-mo intervals. Future research is needed to corroborate the stability and trait-like measures of sleep reactivity with physiological, behavioural and personality measures. © 2016 American Academy of Sleep Medicine
Validity and reliability of an instrumented leg-extension machine for measuring isometric muscle strength of the knee extensors.

PubMed

Ruschel, Caroline; Haupenthal, Alessandro; Jacomel, Gabriel Fernandes; Fontana, Heiliane de Brito; Santos, Daniela Pacheco dos; Scoz, Robson Dias; Roesler, Helio

2015-05-20

Isometric muscle strength of knee extensors has been assessed for estimating performance, evaluating progress during physical training, and investigating the relationship between isometric and dynamic/functional performance. To assess the validity and reliability of an adapted leg-extension machine for measuring isometric knee extensor force. Validity (concurrent approach) and reliability (test and test-retest approach) study. University laboratory. 70 healthy men and women aged between 20 and 30 y (39 in the validity study and 31 in the reliability study). Intraclass correlation coefficient (ICC) values calculated for the maximum voluntary isometric torque of knee extensors at 30°, 60°, and 90°, measured with the prototype and with an isokinetic dynamometer (ICC2,1, validity study) and measured with the prototype in test and retest sessions, scheduled from 48 h to 72 h apart (ICC1,1, reliability study). In the validity analysis, the prototype showed good agreement for measurements at 30° (ICC2,1 = .75, SEM = 18.2 Nm) and excellent agreement for measurements at 60° (ICC2,1 = .93, SEM = 9.6 Nm) and at 90° (ICC2,1 = .94, SEM = 8.9 Nm). Regarding the reliability analysis, between-days' ICC1,1 were good to excellent, ranging from .88 to .93. Standard error of measurement and minimal detectable difference based on test-retest ranged from 11.7 Nm to 18.1 Nm and 32.5 Nm to 50.1 Nm, respectively, for the 3 analyzed knee angles. The analysis of validity and repeatability of the prototype for measuring isometric muscle strength has shown to be good or excellent, depending on the knee joint angle analyzed. The new instrument, which presents a relative low cost and easiness of transportation when compared with an isokinetic dynamometer, is valid and provides consistent data concerning isometric strength of knee extensors and, for this reason, can be used for practical, clinical, and research purposes.

A novel method for identifying settings for well-motivated ecologic studies of cancer

PubMed Central

Stang, Andreas; Kowall, Bernd; Rusner, Carsten; Trabert, Britton; Bray, Freddie; Schüz, Joachim; McGlynn, Katherine A.; Kuss, Oliver

2016-01-01

A low within-country variability and a large between-country variability in cancer incidence may indicate that ecologic factors are involved in the etiology of the disease. The aim of this study is to explore the within- and between-country variability of cancer incidence to motivate high-quality ecologic studies. We extracted age-standardized incidence rate estimates (world standard population) from 135 regions for the 10 most frequent invasive cancers in Europe for non-Hispanic white populations from Cancer Incidence in Five Continents, Volume X. We fitted weighted multilevel Poisson regression models with random country effects for each cancer and sex. We estimated intraclass correlation coefficients (ICC) and 95% confidence intervals (95%CI). A high ICC indicates a low within and a high between-country variability of rates. The two cancer sites with the highest ICC among men were prostate cancer (0.96, 95%CI: 0.92–0.99) and skin melanoma (0.78, 0.64–0.93). Among women, high ICC were observed for lung cancer (0.84, 0.73–0.95) and breast cancer (0.80, 0.69–0.91). The two most prominent sex differences for ICC occurred for cancers of the head and neck (men: 0.70, 0.55–0.85, women: 0.19, 0.08–0.30) and breast cancer (men: 0.04, 0.01–0.07, women: 0.80, 0.69–0.91). ICCs were relatively low for pancreatic cancer (men: 0.23, 0.10–0.35; women: 0.13, 0.04–0.21) and leukemia (men: 0.12, 0.04–0.21; women: 0.08, 0.02–0.14). For cancers with high ICC for which systematic factors of the health care system, screening and diagnostic activities are not plausible explanations for between-country variations in incidence, cross-country sex-specific ecologic studies may be especially promising. PMID:26595447
A novel method for identifying settings for well-motivated ecologic studies of cancer.

PubMed

Stang, Andreas; Kowall, Bernd; Rusner, Carsten; Trabert, Britton; Bray, Freddie; Schüz, Joachim; McGlynn, Katherine A; Kuss, Oliver

2016-04-15

A low within-country variability and a large between-country variability in cancer incidence may indicate that ecologic factors are involved in the etiology of the disease. The aim of this study is to explore the within- and between-country variability of cancer incidence to motivate high-quality ecologic studies. We extracted age-standardized incidence rate estimates (world standard population) from 135 regions for the ten most frequent invasive cancers in Europe for non-Hispanic white populations from Cancer Incidence in Five Continents, Volume X. We fitted weighted multilevel Poisson regression models with random country effects for each cancer and sex. We estimated intraclass correlation coefficients (ICCs) and 95% confidence intervals (95% CIs). A high ICC indicates a low within- and a high between-country variability of rates. The two cancer sites with the highest ICC among men were prostate cancer (0.96, 95% CI: 0.92-0.99) and skin melanoma (0.78, 0.64-0.93). Among women, high ICCs were observed for lung cancer (0.84, 0.73-0.95) and breast cancer (0.80, 0.69-0.91). The two most prominent sex differences for ICC occurred for cancers of the head and neck (men: 0.70, 0.55-0.85, women: 0.19, 0.08-0.30) and breast cancer (men: 0.04, 0.01-0.07, women: 0.80, 0.69-0.91). ICCs were relatively low for pancreatic cancer (men: 0.23, 0.10-0.35; women: 0.13, 0.04-0.21) and leukemia (men: 0.12, 0.04-0.21; women: 0.08, 0.02-0.14). For cancers with high ICC for which systematic factors of the health care system, screening and diagnostic activities are not plausible explanations for between-country variations in incidence, cross-country sex-specific ecologic studies may be especially promising. © 2015 UICC.
Reliability of the Swedish version of the Exercise Self-Efficacy Scale (S-ESES): a test-retest study in adults with neurological disease.

PubMed

Ahlström, Isabell; Hellström, Karin; Emtner, Margareta; Anens, Elisabeth

2015-03-01

To examine the test-retest reliability of the Swedish translated version of the Exercise Self-Efficacy Scale (S-ESES) in people with neurological disease and to examine internal consistency. Test-retest study. A total of 30 adults with neurological diseases including: Parkinson's disease; Multiple Sclerosis; Cervical Dystonia; and Charcot-Marie-Tooth disease. The S-ESES was sent twice by surface mail. Completion interval mean was 16 days apart. Weighted kappa, intraclass correlation coefficient 2,1 [ICC (2,1)], standard error of measurement (SEM), also expressed as a percentage value (SEM%), and Cronbach's alpha were calculated. The relative reliability of the test-retest results showed substantial agreement measured using weighted kappa (MD = 0.62) and a very high-reliability ICC (2,1) (0.92). Absolute reliability measured using SEM was 5.3 and SEM% was 20.7. Excellent internal consistency was shown, with an alpha coefficient of 0.91 (test 1) and 0.93 (test 2). The S-ESES is recommended for use in research and in clinical work for people with neurological diseases. The low-absolute reliability, however, indicates a limited ability to measure changes on an individual level.
Estimation of Temporal Gait Parameters Using a Human Body Electrostatic Sensing-Based Method.

PubMed

Li, Mengxuan; Li, Pengfei; Tian, Shanshan; Tang, Kai; Chen, Xi

2018-05-28

Accurate estimation of gait parameters is essential for obtaining quantitative information on motor deficits in Parkinson's disease and other neurodegenerative diseases, which helps determine disease progression and therapeutic interventions. Due to the demand for high accuracy, unobtrusive measurement methods such as optical motion capture systems, foot pressure plates, and other systems have been commonly used in clinical environments. However, the high cost of existing lab-based methods greatly hinders their wider usage, especially in developing countries. In this study, we present a low-cost, noncontact, and an accurate temporal gait parameters estimation method by sensing and analyzing the electrostatic field generated from human foot stepping. The proposed method achieved an average 97% accuracy on gait phase detection and was further validated by comparison to the foot pressure system in 10 healthy subjects. Two results were compared using the Pearson coefficient r and obtained an excellent consistency ( r = 0.99, p < 0.05). The repeatability of the purposed method was calculated between days by intraclass correlation coefficients (ICC), and showed good test-retest reliability (ICC = 0.87, p < 0.01). The proposed method could be an affordable and accurate tool to measure temporal gait parameters in hospital laboratories and in patients' home environments.
The many ways sputum flows - Dealing with high within-subject variability in cystic fibrosis sputum rheology.

PubMed

Radtke, Thomas; Böni, Lukas; Bohnacker, Peter; Fischer, Peter; Benden, Christian; Dressel, Holger

2018-04-21

We evaluated test-retest reliability of sputum viscoelastic properties in clinically stable patients with cystic fibrosis (CF). Data from a prospective, randomized crossover study was used to determine within-subject variability of sputum viscoelasticity (G', storage modulus and G", loss modulus at 1 and 10 rad s -1 ) and solids content over three consecutive visits. Precision of sputum properties was quantified by within-subject standard deviation (SD ws ), coefficient of variation (CV) and intraclass correlation coefficients (ICC). Fifteen clinically stable adults with CF (FEV 1 range 24-94% predicted) were included. No differences between study visits (mean ± SD 8 ± 2 days) were observed for any sputum rheology measure. CV's for G', G" and solids content ranged between 40.3-45.3% and ICC's between 0.21-0.42 indicating poor to fair test-retest reliability. Short-term within-subject variability of sputum properties is high in clinically stable adults with CF. Investigators applying shear rheology experiments in future prospective studies should consider using multiple measurements aiming to increase precision of sputum rheological outcomes. Copyright © 2018 Elsevier B.V. All rights reserved.
The effect of image alignment on capillary blood flow measurement of the neuroretinal rim using the Heidelberg retina flowmeter

PubMed Central

Sehi, M; Flanagan, J G

2004-01-01

Aim: To examine the influence of image alignment on the repeatability of blood flow measurements of the optic nerve. Methods: 10 normal subjects were examined. Heidelberg retina tomograph imaging was performed to establish best location and focus for the temporal neuroretinal rim. Two high quality Heidelberg retina flowmeter (HRF) images were acquired for three methods of alignment: central, nasal, and temporal. A 10×10 pixel measurement window was selected and exactly reproduced on all images. The interquartile pixel values were used to calculate capillary flow. ANOVA, intraclass correlation coefficients (ICC) and the coefficient of repeatability (CoR) were used for analysis. Results: There was no difference between methods (p = 0.47) or between visits (p = 0.51). The ICCs were 0.83 for the central, 0.34 for the nasal, and 0.42 for the temporal alignment. The CoR was 31.5 for central (mean effect 235.1), 234.6 for nasal, and 256.7 for temporal alignment. Conclusion: Central alignment was the most repeatable method for the measurement of neuroretinal rim capillary blood flow using the HRF. PMID:14736775
A new scale for the assessment of performance and capacity of hand function in children with hemiplegic cerebral palsy: reliability and validity studies.

PubMed

Rosa-Rizzotto, M; Visonà Dalla Pozza, L; Corlatti, A; Luparia, A; Marchi, A; Molteni, F; Facchin, P; Pagliano, E; Fedrizzi, E

2014-10-01

In hemiplegic children, the recognition of the activity limitation pattern and the possibility of grading its severity are relevant for clinicians while planning interventions, monitoring results, predicting outcomes. Aim of the study is to examine the reliability and validity of Besta Scale, an instrument used to measure in hemiplegic children from 18 months to 12 years of age both grasp on request (capacity) and spontaneous use of upper limb (performance) in bimanual play activities and in ADL. Psychometric analysis of reliability and of validity of the Besta scale was performed. Outpatient study sample Reliability study: A sample of 39 patients was enrolled. The administration of Besta scale was video-recorded in a standardized manner. All videos were scored by 20 independent raters on subsequent viewing. 3 raters randomly selected from the 20-raters group rescored the same video two years later for intra-rater reliability. Intra and inter-rater reliability were calculated using Intraclass Correlation Coefficient (ICC) and Kendall's coefficient (K), respectively. Internal consistency reliability was assessed using Alpha's Chronbach coefficient. Validity study: a sample of 105 children was assessed 5 times (at t0 and 2, 3, 6 and 12 months later) by 20 independent raters. Each patient underwent at the same time to QUEST and Besta scale administration and assessment. Criterion validity was calculated using rho-Pearson coefficient. Reliability study: The inter-rater reliability calculated with Kendall's coefficient resulted moderate K=0.47. The intra-rater (or test-retest) reliability for 3 raters was excellent (ICC=0.927). The Cronbach's alpha for internal consistency was 0.972. Validity study: Besta scale showed a good criterion validity compared to QUEST increasing by age and severity of impairment. Rho Pearson's correlation coefficient r was 0.81 (P<0.0001). Limitations. Besta scales in infants finds hard to distinguish between mild to moderately impaired hand function. Besta scale scoring system is a valid and reliable tool, utilizable in a clinical setting to monitor evolution of unimanual and bimanual manipulation and to distinguish hand's capacity from performance.
Reliability of plasma lipopolysaccharide-binding protein (LBP) from repeated measures in healthy adults.

PubMed

Citronberg, Jessica S; Wilkens, Lynne R; Lim, Unhee; Hullar, Meredith A J; White, Emily; Newcomb, Polly A; Le Marchand, Loïc; Lampe, Johanna W

2016-09-01

Plasma lipopolysaccharide-binding protein (LBP), a measure of internal exposure to bacterial lipopolysaccharide, has been associated with several chronic conditions and may be a marker of chronic inflammation; however, no studies have examined the reliability of this biomarker in a healthy population. We examined the temporal reliability of LBP measured in archived samples from participants in two studies. In Study one, 60 healthy participants had blood drawn at two time points: baseline and follow-up (either three, six, or nine months). In Study two, 24 individuals had blood drawn three to four times over a seven-month period. We measured LBP in archived plasma by ELISA. Test-retest reliability was estimated by calculating the intraclass correlation coefficient (ICC). Plasma LBP concentrations showed moderate reliability in Study one (ICC 0.60, 95 % CI 0.43-0.75) and Study two (ICC 0.46, 95 % CI 0.26-0.69). Restricting the follow-up period improved reliability. In Study one, the reliability of LBP over a three-month period was 0.68 (95 % CI: 0.41-0.87). In Study two, the ICC of samples taken ≤seven days apart was 0.61 (95 % CI 0.29-0.86). Plasma LBP concentrations demonstrated moderate test-retest reliability in healthy individuals with reliability improving over a shorter follow-up period.
Parent-child agreement on the Behavior Rating Inventory of Executive Functioning (BRIEF) in a community sample of adolescents.

PubMed

Egan, Kaitlyn N; Cohen, L Adelyn; Limbers, Christine

2018-03-06

Despite its widespread use, a minimal amount is known regarding the agreement between parent and youth ratings of youth's executive functioning on the Behavior Rating Inventory of Executive Functioning (BRIEF) in typically developing youth. The present study examined parent-child agreement on the BRIEF with a community sample of adolescents and their parents. Ninety-seven parent-child dyads (M age = 13.91 years; SD = .52) completed the BRIEF self- and parent-report forms and a demographic questionnaire. Intraclass Correlation Coefficients (ICCs) and paired sample t-tests were used to evaluate agreement between self- and parent-reports on the BRIEF. Total sample ICCs indicated moderate to good parent-child agreement (0.46-0.68). Parents from the total sample reported significantly higher mean T-scores for their adolescents on Inhibit, Working Memory, Planning/Organization, Behavioral Regulation Index (BRI), Metacognition Index, and Global Executive Composite. Differences were found in regard to gender and race/ethnicity: ICCs were higher between parent-girl dyads on the scales that comprise the BRI than between parent-boy dyads. Parent-adolescent ICCs were also higher for adolescents who self-identified as White in comparison to those who identified as Non-White/Mixed Race on Emotional Control. These findings suggest gender and racial/ethnic differences should be considered when examining parent-child agreement on the BRIEF in typically developing adolescents.
Strength tests for elite rowers: low- or high-repetition?

PubMed

Lawton, Trent W; Cronin, John B; McGuigan, Michael R

2014-01-01

The purpose of this project was to evaluate the utility of low- and high-repetition maximum (RM) strength tests used to assess rowers. Twenty elite heavyweight males (age 23.7 ± 4.0 years) performed four tests (5 RM, 30 RM, 60 RM and 120 RM) using leg press and seated arm pulling exercise on a dynamometer. Each test was repeated on two further occasions; 3 and 7 days from the initial trial. Per cent typical error (within-participant variation) and intraclass correlation coefficients (ICCs) were calculated using log-transformed repeated-measures data. High-repetition tests (30 RM, 60 RM and 120 RM), involving seated arm pulling exercise are not recommended to be included in an assessment battery, as they had unsatisfactory measurement precision (per cent typical error > 5% or ICC < 0.9). Conversely, low-repetition tests (5 RM) involving leg press and seated arm pulling exercises could be used to assess elite rowers (per cent typical error ≤ 5% and ICC ≥ 0.9); however, only 5 RM leg pressing met criteria (per cent typical error = 2.7%, ICC = 0.98) for research involving small samples (n = 20). In summary, low-repetition 5 RM strength testing offers greater utility as assessments of rowers, as they can be used to measure upper- and lower-body strength; however, only the leg press exercise is recommended for research involving small squads of elite rowers.
Reliability of primary caregivers reports on lifestyle behaviours of European pre-school children: the ToyBox-study.

PubMed

González-Gil, E M; Mouratidou, T; Cardon, G; Androutsos, O; De Bourdeaudhuij, I; Góźdź, M; Usheva, N; Birnbaum, J; Manios, Y; Moreno, L A

2014-08-01

Reliable assessments of health-related behaviours are necessary for accurate evaluation on the efficiency of public health interventions. The aim of the current study was to examine the reliability of a self-administered primary caregivers questionnaire (PCQ) used in the ToyBox-intervention. The questionnaire consisted of six sections addressing sociodemographic and perinatal factors, water and beverages consumption, physical activity, snacking and sedentary behaviours. Parents/caregivers from six countries (Belgium, Bulgaria, Germany, Greece, Poland and Spain) were asked to complete the questionnaire twice within a 2-week interval. A total of 93 questionnaires were collected. Test-retest reliability was assessed using intra-class correlation coefficient (ICC). Reliability of the six questionnaire sections was assessed. A stronger agreement was observed in the questions addressing sociodemographic and perinatal factors as opposed to questions addressing behaviours. Findings showed that 92% of the ToyBox PCQ had a moderate-to-excellent test-retest reliability (defined as ICC values from 0.41 to 1) and less than 8% poor test-retest reliability (ICC < 0.40). Out of the total ICC values, 67% showed good-to-excellent reliability (ICC from 0.61 to 1). We conclude that the PCQ is a reliable tool to assess sociodemographic characteristics, perinatal factors and lifestyle behaviours of pre-school children and their families participating in the ToyBox-intervention. © 2014 World Obesity.
Reliability of scores between stroke patients and significant others on the Reintegration to Normal Living (RNL) Index.

PubMed

Tooth, Leigh R; McKenna, Kryss T; Smith, Melinda; O'Rourke, Peter K

2003-05-06

This study measured reliability between stroke patients' and significant others' scores on items on the Reintegration to Normal Living (RNL) Index and whether there were any scoring biases. The 11-item RNL Index was administered to 57 pairs of patients and significants six months after stroke rehabilitation. The index was scored using a 10-point visual analogue scale. Patient and significant other demographic information and data on patients' clinical, functional and cognitive status were collected. Reliability was measured using the intra-class correlation coefficient (ICC) and percent agreement. Overall poor reliability was found for the RNL Index total score (ICC=.36, 95% CI .07 to .59) and the daily functioning subscale (ICC=.24, 95% Cl -.003 to .46) and moderate reliability was found for the perception of self subscale (ICC= .55, 95% Cl .28 to .73). There was a moderate bias for patients to rate themselves as achieving better reintegration than was indicated by significant others, although no demographic or clinical factors were associated with this bias. Exact match agreement was best for the subjective items and worse for items reflecting mobility around the community and participation in a work activity. Caution is needed when interpreting patient information reported by significant others on the RNL Index. The use of a shorter scale to rate the RNL Index requires investigation.
Reliability and Agreement of Neck Functional Capacity Evaluation Tests in Patients With Chronic Multifactorial Neck Pain.

PubMed

Reneman, M F; Roelofs, M; Schiphorst Preuper, H R

2017-07-01

To analyze test-retest reliability and agreement, and to explore the safety of neck functional capacity evaluation (Neck-FCE) tests in patients with chronic multifactorial neck pain. Test-retest; 2 FCE sessions were held with a 2-week interval. University-based outpatient rehabilitation center. Individuals (N=18; 14 women) with a mean age of 34 years. Not applicable. The Neck-FCE protocol consists of 6 tests: lifting waist to overhead (kg), 2-handed carrying (kg), overhead working (s), bending and overhead reaching (s), and repetitive side reaching (left and right) (s). Intraclass correlation coefficients (ICCs) and limits of agreement (LoA) were calculated. ICC point estimates between .75 and .90 were considered as good, and >.90 were considered as excellent reliability. ICC point estimates ranged between .39 and .96. Ratios of the LoA ranged between 32.0% and 56.5%. Mean ± SD numeric rating scale pain scores in the neck and shoulder 24 hours after the test were 6.7±2.6 and 6.3±3.0, respectively. Based on ICC point estimates and 95% confidence intervals, 3 tests had excellent reliability and 3 had poor reliability. LoA were substantial in all 6 tests. Safety was confirmed. Copyright © 2016 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
[Reliability and validity of a generic job exposure matrix applied on a small-business].

PubMed

Haro-García, Luis; Celis-Quintal, Germán; López-Rojas, Pablo; Sánchez-Román, Francisco Raúl; Juárez-Pérez, Cuauhtémoc Arturo

2007-01-01

to evaluate the reliability and validity of a generic job exposure matrix (JEM) applied in a small business. procedures to evaluate a JEM integrated by six sections: the number of exposed workers per area, frequency of exposure, time of exposure time, level of exposure, safety controls, and proximity to source of exposure, was evaluated. The JEM also obtains information about possible health effects from exposure to occupational/environment agents. Two observers estimated the risk of exposure to epoxy resins on 31 workers of an epoxy resin facility in Mexico City. The rater agreements between the two observers were assessed through percent agreement (PA), weighted kappa (kappa(w)) and the intraclass correlation coefficient (ICC). disagreements were greater for the number of exposed workers (PA = 61.3, kappa(w) = 0.24, ICC = 0.33), level of exposure (PA= 66.7, kappa(w) = 0.25, ICC= 0.56), and safety controls (PA = 54.8, kappa(w) = 0.23, ICC = 0.69) sections. Percent agreement and kappa(w) were 64% and 0.58, respectively. In accordance with Landis and Koch, Altman, Fleiss, and Byrt classifications for the interpretation of kappa value, the weighted kappa (0.58) ranged from moderate to a fair good level. despite the discordance in some sections, the JEM proved to be useful to identify the risk of exposure in this type of small business.
Reliability of muscle strength assessment in chronic post-stroke hemiparesis: a systematic review and meta-analysis.

PubMed

Rabelo, Michelle; Nunes, Guilherme S; da Costa Amante, Natália Menezes; de Noronha, Marcos; Fachin-Martins, Emerson

2016-02-01

Muscle weakness is the main cause of motor impairment among stroke survivors and is associated with reduced peak muscle torque. To systematically investigate and organize the evidence of the reliability of muscle strength evaluation measures in post-stroke survivors with chronic hemiparesis. Two assessors independently searched four electronic databases in January 2014 (Medline, Scielo, CINAHL, Embase). Inclusion criteria comprised studies on reliability on muscle strength assessment in adult post-stroke patients with chronic hemiparesis. We extracted outcomes from included studies about reliability data, measured by intraclass correlation coefficient (ICC) and/or similar. The meta-analyses were conducted only with isokinetic data. Of 450 articles, eight articles were included for this review. After quality analysis, two studies were considered of high quality. Five different joints were analyzed within the included studies (knee, hip, ankle, shoulder, and elbow). Their reliability results varying from low to very high reliability (ICCs from 0.48 to 0.99). Results of meta-analysis for knee extension varying from high to very high reliability (pooled ICCs from 0.89 to 0.97), for knee flexion varying from high to very high reliability (pooled ICCs from 0.84 to 0.91) and for ankle plantar flexion showed high reliability (pooled ICC = 0.85). Objective muscle strength assessment can be reliably used in lower and upper extremities in post-stroke patients with chronic hemiparesis.
Linguistic validation of cystic fibrosis quality of life questionnaires.

PubMed

Rozov, Tatiana; Cunha, Maristela T; Nascimento, Oliver; Quittner, Alexandra L; Jardim, José R

2006-01-01

The purpose of this study was to validate the Portuguese translations of four cystic fibrosis quality of life questionnaires (CFQ). The first three were developed for patients with cystic fibrosis aged from 6 to 11 years, from 12 to 13 years and 14 years or more, while the fourth was developed for the parents of patients aged 6 to 13 years. The four CFQ translations contained from 35 to 50 questions covering nine domains and were validated as follows: translation from English to Portuguese, pilot application, back translation and then approval by the author of the English versions. The four translations were applied to 90 stable patients (30 from each age group) and the parents of patients aged 6-13 years (n = 60), on two occasions with a 13 to 17 day interval. Intraclass Correlation Coefficients (ICC) were used to measure reproducibility. This study was approved by the Commission for Ethics in Research at the institution. Reproducibility was good (ICC = 0.62 to 0.99) for the four translations in all domains, with the exceptions of the Digestion domain for the 6 to 11 and 12 to 13 years age groups with ICC = 0.59 and 0.47, respectively and the Social Role domain for the 14 and over age group (ICC = -0.19 ). The translation and cultural adaptation for Brazil resulted in four CFQ versions that are easy to understand and offer good reproducibility.
Inter-rater reliability of select physical examination procedures in patients with neck pain.

PubMed

Hanney, William J; George, Steven Z; Kolber, Morey J; Young, Ian; Salamh, Paul A; Cleland, Joshua A

2014-07-01

This study evaluated the inter-rater reliability of select examination procedures in patients with neck pain (NP) conducted over a 24- to 48-h period. Twenty-two patients with mechanical NP participated in a standardized examination. One examiner performed standardized examination procedures and a second blinded examiner repeated the procedures 24-48 h later with no treatment administered between examinations. Inter-rater reliability was calculated with the Cohen Kappa and weighted Kappa for ordinal data while continuous level data were calculated using an intraclass correlation coefficient model 2,1 (ICC2,1). Coefficients for categorical variables ranged from poor to moderate agreement (-0.22 to 0.70 Kappa) and coefficients for continuous data ranged from slight to moderate (ICC2,1 0.28-0.74). The standard error of measurement for cervical range of motion ranged from 5.3° to 9.9° while the minimal detectable change ranged from 12.5° to 23.1°. This study is the first to report inter-rater reliability values for select components of the cervical examination in those patients with NP performed 24-48 h after the initial examination. There was considerably less reliability when compared to previous studies, thus clinicians should consider how the passage of time may influence variability in examination findings over a 24- to 48-h period.
Reproducibility and responsiveness of quality of life assessment and six minute walk test in elderly heart failure patients.

PubMed

O'Keeffe, S T; Lye, M; Donnellan, C; Carmichael, D N

1998-10-01

To examine the reproducibility and responsiveness to change of a six minute walk test and a quality of life measure in elderly patients with heart failure. Longitudinal within patient study. 60 patients with heart failure (mean age 82 years) attending a geriatric outpatient clinic, 45 of whom underwent a repeat assessment three to eight weeks later. Subjects underwent a standardised six minute walk test and completed the chronic heart failure questionnaire (CHQ), a heart failure specific quality of life questionnaire. Intraclass correlation coefficients (ICC) were calculated using a random effects one way analysis of variance as a measure of reproducibility. Guyatt's responsiveness coefficient and effect sizes were calculated as measures of responsiveness to change. 24 patients reported no major change in cardiac status, while seven had deteriorated and 14 had improved between the two clinic visits. Reproducibility was satisfactory (ICC > 0.75) for the six minute walk test, for the total CHQ score, and for the dyspnoea, fatigue, and emotion domains of the CHQ. Effect sizes for all measures were large (> 0.8), and responsiveness coefficients were very satisfactory (> 0.7). Effect sizes for detecting deterioration were greater than those for detecting improvement. Quality of life assessment and a six minute walk test are reproducible and responsive measures of cardiac status in frail, very elderly patients with heart failure.
Reliability and Validity of the Early Years Physical Activity Questionnaire (EY-PAQ)

PubMed Central

Bingham, Daniel D.; Collings, Paul J.; Clemes, Stacy A.; Costa, Silvia; Santorelli, Gillian; Griffiths, Paula; Barber, Sally E.

2016-01-01

Measuring physical activity (PA) and sedentary time (ST) in young children (<5 years) is complex. Objective measures have high validity but require specialist expertise, are expensive, and can be burdensome for participants. A proxy-report instrument for young children that accurately measures PA and ST is needed. The aim of this study was to assess the reliability and validity of the Early Years Physical Activity Questionnaire (EY-PAQ). In a setting where English and Urdu are the predominant languages spoken by parents of young children, a sample of 196 parents and their young children (mean age 3.2 ± 0.8 years) from Bradford, UK took part in the study. A total of 156 (79.6%) questionnaires were completed in English and 40 (20.4%) were completed in transliterated Urdu. A total of 109 parents took part in the reliability aspect of the study, which involved completion of the EY-PAQ on two occasions (7.2 days apart; standard deviation (SD) = 1.1). All 196 participants took part in the validity aspect which involved comparison of EY-PAQ scores against accelerometry. Validty anaylsis used all data and data falling with specific MVPA and ST boundaries. Reliability was assessed using intra-class correlations (ICC) and validity by Bland–Altman plots and rank correlation coefficients. The test re-test reliability of the EY-PAQ was moderate for ST (ICC = 0.47) and fair for moderate-to-vigorous physical activity (MVPA)(ICC = 0.35). The EY-PAQ had poor agreement with accelerometer-determined ST (mean difference = −87.5 min·day−1) and good agreement for MVPA (mean difference = 7.1 min·day−1) limits of agreement were wide for all variables. The rank correlation coefficient was non-significant for ST (rho = 0.19) and significant for MVPA (rho = 0.30). The EY-PAQ has comparable validity and reliability to other PA self-report tools and is a promising population-based measure of young children’s habitual MVPA but not ST. In situations when objective methods are not possible for measurement of young children’s MVPA, the EY-PAQ may be a suitable alternative but only if boundaries are applied.
Quantitative 3D Ultrashort Time-to-Echo (UTE) MRI and Micro-CT (μCT) Evaluation of the Temporomandibular Joint (TMJ) Condylar Morphology

PubMed Central

Geiger, Daniel; Bae, Won C.; Statum, Sheronda; Du, Jiang; Chung, Christine B.

2014-01-01

Objective Temporomandibular dysfunction involves osteoarthritis of the TMJ, including degeneration and morphologic changes of the mandibular condyle. Purpose of this study was to determine accuracy of novel 3D-UTE MRI versus micro-CT (μCT) for quantitative evaluation of mandibular condyle morphology. Material & Methods Nine TMJ condyle specimens were harvested from cadavers (2M, 3F; Age 85 ± 10 yrs., mean±SD). 3D-UTE MRI (TR=50ms, TE=0.05 ms, 104 μm isotropic-voxel) was performed using a 3-T MR scanner and μCT (18 μm isotropic-voxel) was performed. MR datasets were spatially-registered with μCT dataset. Two observers segmented bony contours of the condyles. Fibrocartilage was segmented on MR dataset. Using a custom program, bone and fibrocartilage surface coordinates, Gaussian curvature, volume of segmented regions and fibrocartilage thickness were determined for quantitative evaluation of joint morphology. Agreement between techniques (MRI vs. μCT) and observers (MRI vs. MRI) for Gaussian curvature, mean curvature and segmented volume of the bone were determined using intraclass correlation correlation (ICC) analyses. Results Between MRI and μCT, the average deviation of surface coordinates was 0.19±0.15 mm, slightly higher than spatial resolution of MRI. Average deviation of the Gaussian curvature and volume of segmented regions, from MRI to μCT, was 5.7±6.5% and 6.6±6.2%, respectively. ICC coefficients (MRI vs. μCT) for Gaussian curvature, mean curvature and segmented volumes were respectively 0.892, 0.893 and 0.972. Between observers (MRI vs. MRI), the ICC coefficients were 0.998, 0.999 and 0.997 respectively. Fibrocartilage thickness was 0.55±0.11 mm, as previously described in literature for grossly normal TMJ samples. Conclusion 3D-UTE MR quantitative evaluation of TMJ condyle morphology ex-vivo, including surface, curvature and segmented volume, shows high correlation against μCT and between observers. In addition, UTE MRI allows quantitative evaluation of the fibrocartilaginous condylar component. PMID:24092237

Cross-cultural adaptation, reliability, and validity of the Turkish version of PedsQL 3.0 Arthritis Module: a quality-of-life measure for patients with juvenile idiopathic arthritis in Turkey.

PubMed

Tarakci, E; Baydogan, S N; Kasapcopur, O; Dirican, A

2013-04-01

The aim of this study was to describe the cultural adaptation, validity, and reliability of a Turkish version of the pediatric quality-of-life inventory (PedsQL) 3.0 Arthritis Module in a population with juvenile idiopathic arthritis (JIA). A total of 169 patients with JIA and their parents were enrolled in the study. The Turkish version of the childhood health assessment questionnaire (CHAQ) was used to evaluate the validity of related domains in the PedsQL 3.0 Arthritis Module. Both the PedsQL 3.0 Arthritis Module and CHAQ were filled out by children over 8 years of age and by the parents of children 2-7 years of age. Internal reliability was poor to excellent (Cronbach's alpha coefficients 0.56-0.84 for self-reporting and 0.63-0.82 for parent reporting), and interobserver reliability varied from good to excellent (intraclass correlation coefficient (ICC) 0.79-0.91 for self-reporting and 0.80-0.88 for parent reporting) for the total scores of the PedsQL 3.0 Arthritis Module. Parent-child concordance for all scores was moderate to excellent (ICC 0.42-0.92). The PedsQL 3.0 Arthritis Module and CHAQ were highly positively correlated, with coefficients from 0.21 to 0.76, indicating concurrent validity. We demonstrated the reliability and validity of quality-of-life measurement using the Turkish version of the PedsQL 3.0 Arthritis Module in our sociocultural context. The PedsQL 3.0 Arthritis Module can be utilized as a tool for the evaluation of quality of life in patients with JIA aged 2-18 years.
Comparative Analysis of 2-D Versus 3-D Ultrasound Estimation of the Fetal Adrenal Gland Volume and Prediction of Preterm Birth

PubMed Central

Turan, Ozhan M.; Turan, Sifa; Buhimschi, Irina A.; Funai, Edmund F.; Campbell, Katherine H.; Bahtiyar, Ozan M.; Harman, Chris R.; Copel, Joshua A.; Baschat, Ahmet A; Buhimschi, Catalin S.

2013-01-01

Objective We aim to test the hypothesis that 2D fetal AGV measurements offer similar volume estimates as volume calculations based on 3D technique Methods Fetal AGV was estimated by 3D ultrasound (VOCAL) in 93 women with signs/symptoms of preterm labor and 73 controls. Fetal AGV was calculated using an ellipsoid formula derived from 2D measurements of the same blocks (0.523× length × width × depth). Comparisons were performed by intra-class correlation coefficient (ICC), coefficient of repeatability, and Bland-Altman method. The cAGV (AGV/fetal weight) was calculated for both methods and compared for prediction of PTB within 7 days. Results Among 168 volumes, there was a significant correlation between 3D and 2D methods (ICC=0.979[95%CI: 0.971-0.984]). The coefficient of repeatability for the 3D was superior to the 2D method (Intra-observer 3D: 30.8, 2D:57.6; inter-observer 3D: 12.2, 2D: 15.6). Based on 2D calculations, a cAGV≥433mm3/kg, was best for prediction of PTB (sensitivity: 75%(95%CI=59-87); specificity: 89%(95%CI=82-94). Sensitivity and specificity for the 3D cAGV (cut-off ≥420mm3/kg) was 85%(95%CI=70-94) and 95%(95%CI=90-98), respectively. In receiver-operating-curve curve analysis, 3D cAGV was superior to 2D cAGV for prediction of PTB (z=1.99, p=0.047). Conclusion 2D volume estimation of fetal adrenal gland using ellipsoid formula cannot replace 3D AGV calculations for prediction of PTB. PMID:22644825
Reliability of intestinal temperature using an ingestible telemetry pill system during exercise in a hot environment.

PubMed

Ruddock, Alan D; Tew, Garry A; Purvis, Alison J

2014-03-01

Ingestible telemetry pill systems are being increasingly used to assess the intestinal temperature during exercise in hot environments. The purpose of this investigation was to assess the interday reliability of intestinal temperature during an exercise-heat challenge. Intestinal temperature was recorded as 12 physically active men (25 ± 4 years, stature 181.7 ± 7.0 cm, body mass 81.1 ± 10.6 kg) performed two 60-minute bouts of recumbent cycling (50% of peak aerobic power [watts]) in an environmental chamber set at 35° C 50% relative humidity 3-10 days apart. A range of statistics were used to calculate the reliability, including a paired t-test, 95% limits of agreement (LOA), coefficient of variation (CV), standard error of measurement (SEM), Pearson's correlation coefficient (r), intraclass correlation coefficient (ICC), and Cohen's d. Statistical significance was set at p ≤ 0.05. The method indicated a good overall reliability (LOA = ± 0.61° C, CV = 0.58%, SEM = 0.12° C, Cohen's d = 0.12, r = 0.84, ICC = 0.84). Analysis revealed a statistically significant (p = 0.02) mean systematic bias of -0.07 ± 0.31° C, and the investigation of the Bland-Altman plot suggested the presence of heteroscedasticity. Further analysis revealed the minimum "likely" change in intestinal temperature to be 0.34° C. Although the method demonstrates a good reliability, researchers should be aware of heteroscedasticity. Changes in intestinal temperature >0.34° C as a result of exercise or an intervention in a hot environment are likely changes and less influenced by error associated with the method.
Assessment of the Validity and Reproducibility of a Novel Standardized Test Meal for the Study of Postprandial Triacylglycerol Concentrations.

PubMed

Tentolouris, Nikolaos; Kanellos, Panagiotis T; Siami, Evangelia; Athanasopoulou, Elpida; Chaviaras, Nikolaos; Kolovou, Genovefa; Sfikakis, Petros P; Katsilambros, Nikolaos

2017-08-01

Lipotest ® is a standardized fat-rich meal designed for use as a test meal during a fat tolerance test (FTT) for the study of postprandial triacylglycerol (TAG) concentrations. Herein we examined the precision and reproducibility of examination using Lipotest ® on postprandial TAG levels. A total of 26 healthy consenting subjects were examined twice after 8-10 h fasting with an interval of approximately 1 week apart. Blood samples were collected at baseline and 1, 2, 3, and 4 h after consumption of the test meal for measurement of plasma total TAG levels. We examined agreement, precision, and accuracy between the two visits using the Altman plots and correlation coefficient. Reproducibility was tested using the coefficient of variation (CV) and intraclass correlation coefficient (ICC). Moreover, the area under the curve (AUC) as a summary measure of the overall postprandial TAG levels was calculated. The agreement, precision (r ≥ 0.74, p < 0.001), and accuracy (≥0.99) between the measurements in plasma TAG during Lipotest ® testing in the two visits were high. In terms of reproducibility, the values of CV were 15.59-23.83% while those of ICC were ≥0.75. The values of the AUCs in the visits were not different (p = 0.87). A single measurement of plasma TAG levels at 4 h after Lipotest ® consumption depicted peak postprandial TAG concentration. A FTT using Lipotest ® as a standardized meal has good precision and reproducibility for the study of postprandial TAG levels in healthy individuals. A single determination of plasma TAG concentration at 4 h after Lipotest ® consumption captures peak postprandial TAG response.
Intraclass reliability for assessing how well Taiwan constrained hospital-provided medical services using statistical process control chart techniques

PubMed Central

2012-01-01

Background Few studies discuss the indicators used to assess the effect on cost containment in healthcare across hospitals in a single-payer national healthcare system with constrained medical resources. We present the intraclass correlation coefficient (ICC) to assess how well Taiwan constrained hospital-provided medical services in such a system. Methods A custom Excel-VBA routine to record the distances of standard deviations (SDs) from the central line (the mean over the previous 12 months) of a control chart was used to construct and scale annual medical expenditures sequentially from 2000 to 2009 for 421 hospitals in Taiwan to generate the ICC. The ICC was then used to evaluate Taiwan’s year-based convergent power to remain unchanged in hospital-provided constrained medical services. A bubble chart of SDs for a specific month was generated to present the effects of using control charts in a national healthcare system. Results ICCs were generated for Taiwan’s year-based convergent power to constrain its medical services from 2000 to 2009. All hospital groups showed a gradually well-controlled supply of services that decreased from 0.772 to 0.415. The bubble chart identified outlier hospitals that required investigation of possible excessive reimbursements in a specific time period. Conclusion We recommend using the ICC to annually assess a nation’s year-based convergent power to constrain medical services across hospitals. Using sequential control charts to regularly monitor hospital reimbursements is required to achieve financial control in a single-payer nationwide healthcare system. PMID:22587736
Test-retest reliability of computer-based video analysis of general movements in healthy term-born infants.

PubMed

Valle, Susanne Collier; Støen, Ragnhild; Sæther, Rannei; Jensenius, Alexander Refsum; Adde, Lars

2015-10-01

A computer-based video analysis has recently been presented for quantitative assessment of general movements (GMs). This method's test-retest reliability, however, has not yet been evaluated. The aim of the current study was to evaluate the test-retest reliability of computer-based video analysis of GMs, and to explore the association between computer-based video analysis and the temporal organization of fidgety movements (FMs). Test-retest reliability study. 75 healthy, term-born infants were recorded twice the same day during the FMs period using a standardized video set-up. The computer-based movement variables "quantity of motion mean" (Qmean), "quantity of motion standard deviation" (QSD) and "centroid of motion standard deviation" (CSD) were analyzed, reflecting the amount of motion and the variability of the spatial center of motion of the infant, respectively. In addition, the association between the variable CSD and the temporal organization of FMs was explored. Intraclass correlation coefficients (ICC 1.1 and ICC 3.1) were calculated to assess test-retest reliability. The ICC values for the variables CSD, Qmean and QSD were 0.80, 0.80 and 0.86 for ICC (1.1), respectively; and 0.80, 0.86 and 0.90 for ICC (3.1), respectively. There were significantly lower CSD values in the recordings with continual FMs compared to the recordings with intermittent FMs (p<0.05). This study showed high test-retest reliability of computer-based video analysis of GMs, and a significant association between our computer-based video analysis and the temporal organization of FMs. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Intraclass reliability for assessing how well Taiwan constrained hospital-provided medical services using statistical process control chart techniques.

PubMed

Chien, Tsair-Wei; Chou, Ming-Ting; Wang, Wen-Chung; Tsai, Li-Shu; Lin, Weir-Sen

2012-05-15

Few studies discuss the indicators used to assess the effect on cost containment in healthcare across hospitals in a single-payer national healthcare system with constrained medical resources. We present the intraclass correlation coefficient (ICC) to assess how well Taiwan constrained hospital-provided medical services in such a system. A custom Excel-VBA routine to record the distances of standard deviations (SDs) from the central line (the mean over the previous 12 months) of a control chart was used to construct and scale annual medical expenditures sequentially from 2000 to 2009 for 421 hospitals in Taiwan to generate the ICC. The ICC was then used to evaluate Taiwan's year-based convergent power to remain unchanged in hospital-provided constrained medical services. A bubble chart of SDs for a specific month was generated to present the effects of using control charts in a national healthcare system. ICCs were generated for Taiwan's year-based convergent power to constrain its medical services from 2000 to 2009. All hospital groups showed a gradually well-controlled supply of services that decreased from 0.772 to 0.415. The bubble chart identified outlier hospitals that required investigation of possible excessive reimbursements in a specific time period. We recommend using the ICC to annually assess a nation's year-based convergent power to constrain medical services across hospitals. Using sequential control charts to regularly monitor hospital reimbursements is required to achieve financial control in a single-payer nationwide healthcare system.
Ultrasound measures of tendon thickness: Intra-rater, Inter-rater and Inter-machine reliability.

PubMed

Del Baño-Aledo, María Elena; Martínez-Payá, Jacinto Javier; Ríos-Díaz, José; Mejías-Suárez, Silvia; Serrano-Carmona, Sergio; de Groot-Ferrando, Ana

2017-01-01

Ultrasound imaging is often used by physiotherapists and other healthcare professionals but the reliability of image acquisition with different ultrasound machines is unknown. The objective was to compare the intra-rater, inter-rater and intermachine reliability of thickness measurements of the plantar fascia (PF), Achilles tendon (AT), patellar tendon (PT) and elbow common extensor tendon (ECET) with musculoskeletal ultrasound imaging (MSUS). Tendon thickness was measured in four anatomical structures (14 participants, 28 images per tendon) by two sonographers and with two different ultrasound machines. Intraclass Correlation Coefficients (ICCs) and Bland-Altman plots were calculated. The standard error of measurement (SEM) and minimum detectable difference (MDD) were calculated. Inter-rater reliability was excellent for AT (ICC=0.98; 95% CI= 0.96-0.99) and very good for PT (ICC=0.85; 95% CI = 0.67-0.93) and ECET (ICC=0.81; 95% CI= 0.72-0.94). Reliability for PF was moderate, with an ICC of 0.63 (CI 95%= 0.20-0.83). Bland-Altman plot for inter-machine reliability showed a mean difference of 1 m for PF measurements and a mean difference of 4 m and 20 m for AT and PT. The relative SEMs were below 7% and the MDCs were below 0.7 mm. The MSUS reliability in measuring thickness of the four tendons is confirmed by the homogeneous readings intra sonographers, between operators and between different machines. Level of evidence: Tendon thickness can be measured reliably on different ultrasound devices, which is an important step forward in the use of this technique in daily clinical practice and research. III.
Reliable classification of facial phenotypic variation in craniofacial microsomia: a comparison of physical exam and photographs.

PubMed

Birgfeld, Craig B; Heike, Carrie L; Saltzman, Babette S; Leroux, Brian G; Evans, Kelly N; Luquetti, Daniela V

2016-03-31

Craniofacial microsomia is a common congenital condition for which children receive longitudinal, multidisciplinary team care. However, little is known about the etiology of craniofacial microsomia and few outcome studies have been published. In order to facilitate large, multicenter studies in craniofacial microsomia, we assessed the reliability of phenotypic classification based on photographs by comparison with direct physical examination. Thirty-nine children with craniofacial microsomia underwent a physical examination and photographs according to a standardized protocol. Three clinicians completed ratings during the physical examination and, at least a month later, using respective photographs for each participant. We used descriptive statistics for participant characteristics and intraclass correlation coefficients (ICCs) to assess reliability. The agreement between ratings on photographs and physical exam was greater than 80 % for all 15 categories included in the analysis. The ICC estimates were higher than 0.6 for most features. Features with the highest ICC included: presence of epibulbar dermoids, ear abnormalities, and colobomas (ICC 0.85, 0.81, and 0.80, respectively). Orbital size, presence of pits, tongue abnormalities, and strabismus had the lowest ICC, values (0.17 or less). There was not a strong tendency for either type of rating, physical exam or photograph, to be more likely to designate a feature as abnormal. The agreement between photographs and physical exam regarding the presence of a prior surgery was greater than 90 % for most features. Our results suggest that categorization of facial phenotype in children with CFM based on photographs is reliable relative to physical examination for most facial features.
Reproducibility of Abdominal Aortic Aneurysm Diameter Measurement and Growth Evaluation on Axial and Multiplanar Computed Tomography Reformations

DOE Office of Scientific and Technical Information (OSTI.GOV)

Dugas, Alexandre; Therasse, Eric; Kauffmann, Claude

2012-08-15

Purpose: To compare different methods measuring abdominal aortic aneurysm (AAA) maximal diameter (Dmax) and its progression on multidetector computed tomography (MDCT) scan. Materials and Methods: Forty AAA patients with two MDCT scans acquired at different times (baseline and follow-up) were included. Three observers measured AAA diameters by seven different methods: on axial images (anteroposterior, transverse, maximal, and short-axis views) and on multiplanar reformation (MPR) images (coronal, sagittal, and orthogonal views). Diameter measurement and progression were compared over time for the seven methods. Reproducibility of measurement methods was assessed by intraclass correlation coefficient (ICC) and Bland-Altman analysis. Results: Dmax, as measuredmore » on axial slices at baseline and follow-up (FU) MDCTs, was greater than that measured using the orthogonal method (p = 0.046 for baseline and 0.028 for FU), whereas Dmax measured with the orthogonal method was greater those using all other measurement methods (p-value range: <0.0001-0.03) but anteroposterior diameter (p = 0.18 baseline and 0.10 FU). The greatest interobserver ICCs were obtained for the orthogonal and transverse methods (0.972) at baseline and for the orthogonal and sagittal MPR images at FU (0.973 and 0.977). Interobserver ICC of the orthogonal method to document AAA progression was greater (ICC = 0.833) than measurements taken on axial images (ICC = 0.662-0.780) and single-plane MPR images (0.772-0.817). Conclusion: AAA Dmax measured on MDCT axial slices overestimates aneurysm size. Diameter as measured by the orthogonal method is more reproducible, especially to document AAA progression.« less
2008 Niday Perinatal Database quality audit: report of a quality assurance project.

PubMed

Dunn, S; Bottomley, J; Ali, A; Walker, M

2011-12-01

This quality assurance project was designed to determine the reliability, completeness and comprehensiveness of the data entered into Niday Perinatal Database. Quality of the data was measured by comparing data re-abstracted from the patient record to the original data entered into the Niday Perinatal Database. A representative sample of hospitals in Ontario was selected and a random sample of 100 linked mother and newborn charts were audited for each site. A subset of 33 variables (representing 96 data fields) from the Niday dataset was chosen for re-abstraction. Of the data fields for which Cohen's kappa statistic or intraclass correlation coefficient (ICC) was calculated, 44% showed substantial or almost perfect agreement (beyond chance). However, about 17% showed less than 95% agreement and a kappa or ICC value of less than 60% indicating only slight, fair or moderate agreement (beyond chance). Recommendations to improve the quality of these data fields are presented.
Test-retest reliability of an fMRI paradigm for studies of cardiovascular reactivity.

PubMed

Sheu, Lei K; Jennings, J Richard; Gianaros, Peter J

2012-07-01

We examined the reliability of measures of fMRI, subjective, and cardiovascular reactions to standardized versions of a Stroop color-word task and a multisource interference task. A sample of 14 men and 12 women (30-49 years old) completed the tasks on two occasions, separated by a median of 88 days. The reliability of fMRI BOLD signal changes in brain areas engaged by the tasks was moderate, and aggregating fMRI BOLD signal changes across the tasks improved test-retest reliability metrics. These metrics included voxel-wise intraclass correlation coefficients (ICCs) and overlap ratio statistics. Task-aggregated ratings of subjective arousal, valence, and control, as well as cardiovascular reactions evoked by the tasks showed ICCs of 0.57 to 0.87 (ps < .001), indicating moderate-to-strong reliability. These findings support using these tasks as a battery for fMRI studies of cardiovascular reactivity. Copyright © 2012 Society for Psychophysiological Research.
Intrarater Reliability and Other Psychometrics of the Health Promoting Activities Scale (HPAS).

PubMed

Muskett, Rachel; Bourke-Taylor, Helen; Hewitt, Alana

The Health Promoting Activities Scale (HPAS) measures the self-rated frequency with which adults participate in activities that promote health. We evaluated the internal consistency, construct validity, and intrarater reliability of the HPAS with a cohort of mothers (N = 56) of school-age children. We used an online survey that included the HPAS and measures of mental and physical health. Statistical analysis included intraclass correlation coefficients (ICCs), measurement error, error range, limits of agreement, and minimum detectable change (MDC). The HPAS showed good internal consistency (Cronbach's α = .73). Construct validity was supported by a significant difference in HPAS scores among participants grouped by physical activity level; no other differences were significant. Results included a high aggregate ICC of .90 and an MDC of 5 points. Our evaluation of the HPAS revealed good reliability and stability, suggesting suitability for ongoing evaluation as an outcome measure. Copyright © 2017 by the American Occupational Therapy Association, Inc.
Development of a reliable method to assess footwear comfort during running.

PubMed

Mündermann, Anne; Nigg, Benno M; Stefanyshyn, Darren J; Humble, R Neil

2002-08-01

The purposes of this study were: (a) to determine whether subjects are able to distinguish between differences in footwear with respect to footwear comfort; and (b) to determine how reliably footwear comfort can be assessed using a visual analogue scale (VAS) and a protocol including a control condition during running. Intraclass correlation coefficients (ICCs) between comfort ratings for repeated conditions were high (ICC = 0.799). Differences in comfort ratings between the insert conditions were significant. A paired t-test revealed a significant difference in overall comfort ratings for the control insert when tested after the soft insert compared to when tested after the hard insert (P = 0.008). The results of this study showed that VASs provide a reliable measure to assess footwear comfort during running under the conditions that: (a) a control condition is included; and (b) the average comfort rating of sessions 4-6 is used. Copyright 2002 Elsevier Science B.V.
Reliability of infrared thermometric measurements of skin temperature in the hand.

PubMed

Packham, Tara L; Fok, Diana; Frederiksen, Karen; Thabane, Lehana; Buckley, Norman

2012-01-01

Clinical measurement study. Skin temperature asymmetries (STAs) are used in the diagnosis of complex regional pain syndrome (CRPS), but little evidence exists for reliability of the equipment and methods. This study examined the reliability of an inexpensive infrared (IR) thermometer and measurement points in the hand for the study of STA. ST was measured three times at five points on both hands with an IR thermometer by two raters in 20 volunteers (12 normals and 8 CRPS). ST measurement results using IR thermometers support inter-rater reliability: intraclass correlation coefficient (ICC) estimate for single measures 0.80; all ST measurement points were also highly reliable (ICC single measures, 0.83-0.91). The equipment demonstrated excellent reliability, with little difference in the reliability of the five measurement sites. These preliminary findings support their use in future CRPS research. Not applicable. Copyright © 2012 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
Inter-arch digital model vs. manual cast measurements: Accuracy and reliability.

PubMed

Kiviahde, Heikki; Bukovac, Lea; Jussila, Päivi; Pesonen, Paula; Sipilä, Kirsi; Raustia, Aune; Pirttiniemi, Pertti

2017-06-28

The purpose of this study was to evaluate the accuracy and reliability of inter-arch measurements using digital dental models and conventional dental casts. Thirty sets of dental casts with permanent dentition were examined. Manual measurements were done with a digital caliper directly on the dental casts, and digital measurements were made on 3D models by two independent examiners. Intra-class correlation coefficients (ICC), a paired sample t-test or Wilcoxon signed-rank test, and Bland-Altman plots were used to evaluate intra- and inter-examiner error and to determine the accuracy and reliability of the measurements. The ICC values were generally good for manual and excellent for digital measurements. The Bland-Altman plots of all the measurements showed good agreement between the manual and digital methods and excellent inter-examiner agreement using the digital method. Inter-arch occlusal measurements on digital models are accurate and reliable and are superior to manual measurements.
Telepsychiatry: assessment of televideo psychiatric interview reliability with present- and next-generation internet infrastructures.

PubMed

Yoshino, A; Shigemura, J; Kobayashi, Y; Nomura, S; Shishikura, K; Den, R; Wakisaka, H; Kamata, S; Ashida, H

2001-09-01

We assessed the reliability of remote video psychiatric interviews conducted via the internet using narrow and broad bandwidths. Televideo psychiatric interviews conducted with 42 in-patients with chronic schizophrenia using two bandwidths (narrow, 128 kilobits/s; broad, 2 megabits/s) were assessed in terms of agreement with face-to-face interviews in a test-retest fashion. As a control, agreement was assessed between face-to-face interviews. Psychiatric symptoms were rated using the Oxford version of the Brief Psychiatric Rating Scale (BPRS), and agreement between interviews was estimated as the intraclass correlation coefficient (ICC). The ICC was significantly lower in the narrow bandwidth than in the broad bandwidth and the control for both positive symptoms score and total score. While reliability of televideo psychiatric interviews is insufficient using the present narrow-band internet infrastructure, the next generation of infrastructure (broad-band) may permit reliable diagnostic interviews.
Reliability and validity of a self-administered tool for online neuropsychological testing: The Amsterdam Cognition Scan.

PubMed

Feenstra, Heleen E M; Murre, Jaap M J; Vermeulen, Ivar E; Kieffer, Jacobien M; Schagen, Sanne B

2018-04-01

To facilitate large-scale assessment of a variety of cognitive abilities in clinical studies, we developed a self-administered online neuropsychological test battery: the Amsterdam Cognition Scan (ACS). The current studies evaluate in a group of adult cancer patients: test-retest reliability of the ACS and the influence of test setting (home or hospital), and the relationship between our online and a traditional test battery (concurrent validity). Test-retest reliability was studied in 96 cancer patients (57 female; M age = 51.8 years) who completed the ACS twice. Intraclass correlation coefficients (ICCs) were used to assess consistency over time. The test setting was counterbalanced between home and hospital; influence on test performance was assessed by repeated measures analyses of variance. Concurrent validity was studied in 201 cancer patients (112 female; M age = 53.5 years) who completed both the online and an equivalent traditional neuropsychological test battery. Spearman or Pearson correlations were used to assess consistency between online and traditional tests. ICCs of the online tests ranged from .29 to .76, with an ICC of .78 for the ACS total score. These correlations are generally comparable with the test-retest correlations of the traditional tests as reported in the literature. Correlating online and traditional test scores, we observed medium to large concurrent validity (r/ρ = .42 to .70; total score r = .78), except for a visuospatial memory test (ρ = .36). Correlations were affected-as expected-by design differences between online tests and their offline counterparts. Although development and optimization of the ACS is an ongoing process, and reliability can be optimized for several tests, our results indicate that it is a highly usable tool to obtain (online) measures of various cognitive abilities. The ACS is expected to facilitate efficient gathering of data on cognitive functioning in the near future.
Reliability of a Computerized Neurocognitive Test in Baseline Concussion Testing of High School Athletes.

PubMed

MacDonald, James; Duerson, Drew

2015-07-01

Baseline assessments using computerized neurocognitive tests are frequently used in the management of sport-related concussions. Such testing is often done on an annual basis in a community setting. Reliability is a fundamental test characteristic that should be established for such tests. Our study examined the test-retest reliability of a computerized neurocognitive test in high school athletes over 1 year. Repeated measures design. Two American high schools. High school athletes (N = 117) participating in American football or soccer during the 2011-2012 and 2012-2013 academic years. All study participants completed 2 baseline computerized neurocognitive tests taken 1 year apart at their respective schools. The test measures performance on 4 cognitive tasks: identification speed (Attention), detection speed (Processing Speed), one card learning accuracy (Learning), and one back speed (Working Memory). Reliability was assessed by measuring the intraclass correlation coefficient (ICC) between the repeated measures of the 4 cognitive tasks. Pearson and Spearman correlation coefficients were calculated as a secondary outcome measure. The measure for identification speed performed best (ICC = 0.672; 95% confidence interval, 0.559-0.760) and the measure for one card learning accuracy performed worst (ICC = 0.401; 95% confidence interval, 0.237-0.542). All tests had marginal or low reliability. In a population of high school athletes, computerized neurocognitive testing performed in a community setting demonstrated low to marginal test-retest reliability on baseline assessments 1 year apart. Further investigation should focus on (1) improving the reliability of individual tasks tested, (2) controlling for external factors that might affect test performance, and (3) identifying the ideal time interval to repeat baseline testing in high school athletes. Computerized neurocognitive tests are used frequently in high school athletes, often within a model of baseline testing of asymptomatic individuals before the start of a sporting season. This study adds to the evidence that suggests in this population such testing may lack sufficient reliability to support clinical decision making.
Handgrip force steadiness in young and older adults: a reproducibility study.

PubMed

Blomkvist, Andreas W; Eika, Fredrik; de Bruin, Eling D; Andersen, Stig; Jorgensen, Martin

2018-04-02

Force steadiness is a quantitative measure of the ability to control muscle tonus. It is an independent predictor of functional performance and has shown to correlate well with different degrees of motor impairment following stroke. Despite being clinically relevant, few studies have assessed the validity of measuring force steadiness. The aim of this study was to explore the reproducibility of handgrip force steadiness, and to assess age difference in steadiness. Intrarater reproducibility (the degree to which a rating gives consistent result on separate occasions) was investigated in a test-retest design with seven days between sessions. Ten young and thirty older adults were recruited and handgrip steadiness was tested at 5%, 10% and 25% of maximum voluntary contraction (MVC) using Nintendo Wii Balance Board (WBB). Coefficients of variation were calculated from the mean force produced (CVM) and the target force (CVT). Area between the force curve and the target force line (Area) was also calculated. For the older adults we explored reliability using intraclass correlation coefficient (ICC) and agreement using standard error of measurement (SEM), limits of agreement (LOA) and smallest real difference (SRD). A systematic improvement in handgrip steadiness was found between sessions for all measures (CVM, CVT, Area). CVM and CVT at 5% of MVC showed good to high reliability, while Area had poor reliability for all percentages of MVC. Averaged ICC for CVM, CVT and Area was 0.815, 0.806 and 0.464, respectively. Averaged ICC on 5%, 10%, and 25% of MVC was 0.751, 0.667 and 0.668, respectively. Measures of agreement showed similar trends with better results for CVM and CVT than for Area. Young adults had better handgrip steadiness than older adults across all measures. The CVM and CVT measures demonstrated good reproducibility at lower percentages of MVC using the WBB, and could become relevant measures in the clinical setting. The Area measure had poor reproducibility. Young adults have better handgrip steadiness than old adults.

Cross-cultural adaptation and validation of the reliability of the Thai version of the Hip disability and Osteoarthritis Outcome Score (HOOS).

PubMed

Trathitiphan, Warayos; Paholpak, Permsak; Sirichativapee, Winai; Wisanuyotin, Taweechok; Laupattarakasem, Pat; Sukhonthamarn, Kamolsak; Jeeravipoolvarn, Polasak; Kosuwon, Weerachai

2016-10-01

HOOS was developed as an extension of the Western Ontario and McMaster Universities' Osteoarthritis Index questionnaire for measuring symptoms and functional limitations related to the hip(s) of patients with osteoarthritis. To determine the validity and reliability of the Thai version of the Hip disability and Osteoarthritis Outcome Score (HOOS) vis-à-vis hip osteoarthritis, the original HOOS was translated into a Thai version of HOOS, according to international recommendations. Patients with hip osteoarthritis (n = 57; 25 males) were asked to complete the Thai version of HOOS twice: once then again after a 3-week interval. The test-retest reliability was analyzed using the intraclass correlation coefficient (ICC). Internal consistencies were analyzed using Cronbach's alpha, while the construct validity was tested by comparing the Thai HOOS with the Thai modified SF-36 and calculating the Spearman's rank correlation coefficients. The Thai HOOS produced good reliability (i.e., the ICC was greater than 0.9 in all five subscales). All of the Cronbach's alpha showed that the Thai HOOS had high internal consistency (Cronbach's alpha greater than 0.8), especially for the pain and ADL subscales (0.89 and 0.90, respectively). The Spearman's rank correlation for all five subscales of the Thai HOOS had moderate correlation with the Bodily Pain subscale of the Thai SF-36. The pain subscale of the Thai HOOS had a high correlation with the Vitality and Social Function subscales of the Thai SF-36 (r = 0.55 and 0.54)-with which the symptom subscale had a moderate correlation. The Thai version of HOOS had excellent internal consistency, excellent test-retest reliability, and good construct validity. It can be used as a reliable tool for assessing quality of life for patients with hip osteoarthritis in Thailand.
Reliability and validity of a questionnaire for self-assessment of complete dentures.

PubMed

Komagamine, Yuriko; Kanazawa, Manabu; Kaiba, Yoshinori; Sato, Yusuke; Minakuchi, Shunsuke

2014-05-02

Demand for complete denture treatment is expected to rise over several decades. However, to date, no questionnaire on complete dentures, as evaluated by edentulous patients, has been shown to be reliable and valid. This study sought to assess the reliability and validity of Patient's Denture Assessment (PDA), which provides a multidimensional evaluation of dentures among edentulous patients. Patients, who had new complete dentures fabricated at the University Hospital of Dentistry, Tokyo Medical and Dental University through 2009 to 2010, were enrolled. The reliability of the PDA was determined by examining internal consistency and test-retest reliability. Internal consistency for all of the question items and the six subscales was measured using Cronbach's α and average inter-item correlation coefficients among 93 participants. For 33 of these participants, test-retest reliability was determined at a 2 month-interval using the interclass correlation coefficients (ICCs) and 95% confidence interval for the summary scores and the six subscale scores. The PDA was validated in 93 participants by examining the difference in the summary score and the six subscale scores of the PDA before and after replacement with new dentures by the paired t-test. Ability to detect change was also tested in 93 patients using effect size. The Cronbach's α for the PDA ranged from 0.56 to 0.93. The average inter-item correlation coefficients ranged from 0.28 to 0.83. ICCs for the PDA ranged from 0.37 to 0.83. The paired t-test showed a significant difference between the summary score and the six subscale scores before and after replacement with new dentures (p < 0.05) and the effect size was 0.97. The PDA demonstrated good reliability by assessing internal consistency and test-retest reliability. In addition, the PDA demonstrated good validity by assessing discriminant validity. Thus, the PDA could help dentists obtain a detailed understanding of the patients' perceptions in using their dentures.
The Trojan Lifetime Champions Health Survey: development, validity, and reliability.

PubMed

Sorenson, Shawn C; Romano, Russell; Scholefield, Robin M; Schroeder, E Todd; Azen, Stanley P; Salem, George J

2015-04-01

Self-report questionnaires are an important method of evaluating lifespan health, exercise, and health-related quality of life (HRQL) outcomes among elite, competitive athletes. Few instruments, however, have undergone formal characterization of their psychometric properties within this population. To evaluate the validity and reliability of a novel health and exercise questionnaire, the Trojan Lifetime Champions (TLC) Health Survey. Descriptive laboratory study. A large National Collegiate Athletic Association Division I university. A total of 63 university alumni (age range, 24 to 84 years), including former varsity collegiate athletes and a control group of nonathletes. Participants completed the TLC Health Survey twice at a mean interval of 23 days with randomization to the paper or electronic version of the instrument. Content validity, feasibility of administration, test-retest reliability, parallel-form reliability between paper and electronic forms, and estimates of systematic and typical error versus differences of clinical interest were assessed across a broad range of health, exercise, and HRQL measures. Correlation coefficients, including intraclass correlation coefficients (ICCs) for continuous variables and κ agreement statistics for ordinal variables, for test-retest reliability averaged 0.86, 0.90, 0.80, and 0.74 for HRQL, lifetime health, recent health, and exercise variables, respectively. Correlation coefficients, again ICCs and κ, for parallel-form reliability (ie, equivalence) between paper and electronic versions averaged 0.90, 0.85, 0.85, and 0.81 for HRQL, lifetime health, recent health, and exercise variables, respectively. Typical measurement error was less than the a priori thresholds of clinical interest, and we found minimal evidence of systematic test-retest error. We found strong evidence of content validity, convergent construct validity with the Short-Form 12 Version 2 HRQL instrument, and feasibility of administration in an elite, competitive athletic population. These data suggest that the TLC Health Survey is a valid and reliable instrument for assessing lifetime and recent health, exercise, and HRQL, among elite competitive athletes. Generalizability of the instrument may be enhanced by additional, larger-scale studies in diverse populations.
Validation of intracranial area as a surrogate measure of intracranial volume when using clinical MRI.

PubMed

Nandigam, R N Kaveer; Chen, Yu-Wei; Gurol, Mahmut E; Rosand, Jonathan; Greenberg, Steven M; Smith, Eric E

2007-01-01

We sought to determine whether mid-sagittal intracranial area (ICA) is a valid surrogate of intracranial volume (ICV) when using retrospective data with relatively thick (6-7 mm) sagittal slices. Data were retrospectively analyzed from 47 subjects who had two MRI scans taken at least nine months apart. Twenty-three subjects had manual segmentation of ICV on the T2-weighted sequence for comparison. Intraclass correlation coefficient (ICC) for intraobserver, interobserver, and intraobserver scan-rescan comparisons were 0.96, 0.97 and 0.95. Pearson correlation coefficients between ICV and ICA, averaging the cumulative 1, 2, 3, and 4 most midline slices, were 0.89, 0.94, 0.93, and 0.95. There was a significant marginal increase in explained variance of ICV by measuring two, rather than one, slices (P= 0.001). These data suggest that ICA, even measured without high-resolution imaging, is a reasonable substitute for ICV.
Reliability and measurement error of sagittal spinal motion parameters in 220 patients with chronic low back pain using a three-dimensional measurement device.

PubMed

Mieritz, Rune M; Bronfort, Gert; Jakobsen, Markus D; Aagaard, Per; Hartvigsen, Jan

2014-09-01

A basic premise for any instrument measuring spinal motion is that reliable outcomes can be obtained on a relevant sample under standardized conditions. The purpose of this study was to assess the overall reliability and measurement error of regional spinal sagittal plane motion in patients with chronic low back pain (LBP), and then to evaluate the influence of body mass index, examiner, gender, stability of pain, and pain distribution on reliability and measurement error. This study comprises a test-retest design separated by 7 to 14 days. The patient cohort consisted of 220 individuals with chronic LBP. Kinematics of the lumbar spine were sampled during standardized spinal extension-flexion testing using a 6-df instrumented spatial linkage system. Test-retest reliability and measurement error were evaluated using interclass correlation coefficients (ICC(1,1)) and Bland-Altman limits of agreement (LOAs). The overall test-retest reliability (ICC(1,1)) for various motion parameters ranged from 0.51 to 0.70, and relatively wide LOAs were observed for all parameters. Reliability measures in patient subgroups (ICC(1,1)) ranged between 0.34 and 0.77. In general, greater (ICC(1,1)) coefficients and smaller LOAs were found in subgroups with patients examined by the same examiner, patients with a stable pain level, patients with a body mass index less than below 30 kg/m(2), patients who were men, and patients in the Quebec Task Force classifications Group 1. This study shows that sagittal plane kinematic data from patients with chronic LBP may be sufficiently reliable in measurements of groups of patients. However, because of the large LOAs, this test procedure appears unusable at the individual patient level. Furthermore, reliability and measurement error varies substantially among subgroups of patients. Copyright © 2014 Elsevier Inc. All rights reserved.
Inter-study reproducibility of left ventricular torsion and torsion rate quantification using MR myocardial feature tracking.

PubMed

Kowallick, Johannes T; Morton, Geraint; Lamata, Pablo; Jogiya, Roy; Kutty, Shelby; Lotz, Joachim; Hasenfuß, Gerd; Nagel, Eike; Chiribiri, Amedeo; Schuster, Andreas

2016-01-01

To determine the inter-study reproducibility of MR feature tracking (MR-FT) derived left ventricular (LV) torsion and torsion rates for a combined assessment of systolic and diastolic myocardial function. Steady-state free precession (SSFP) cine LV short-axis stacks were acquired at 9:00 (Exam A), 9:30 (Exam B), and 14:00 (Exam C) in 16 healthy volunteers at 3 Tesla. SSFP images were analyzed offline using MR-FT to assess rotational displacement in apical and basal slices. Global peak torsion, peak systolic and peak diastolic torsion rates were calculated using different definitions ("twist", "normalized twist" and "circumferential-longitudinal (CL) shear angle"). Exam A and B were compared to assess the inter-study reproducibility. Morning and afternoon scans were compared to address possible diurnal variation. The different methods showed good inter-study reproducibility for global peak torsion (intraclass correlation coefficient [ICC]: 0.90-0.92; coefficient of variation [CoV]: 19.0-20.3%) and global peak systolic torsion rate (ICC: 0.82-0.84; CoV: 25.9-29.0%). Conversely, global peak diastolic torsion rate showed little inter-study reproducibility (ICC: 0.34-0.47; CoV: 40.8-45.5%). Global peak torsion as determined by the CL shear angle showed the best inter-study reproducibility (ICC: 0.90;CoV: 19.0%). MR-FT results were not measurably affected by diurnal variation between morning and afternoon scans (CL shear angle: 4.8 ± 1.4°, 4.8 ± 1.5°, and 4.1 ± 1.6° for Exam A, B, and C, respectively; P = 0.21). MR-FT based derivation of myocardial peak torsion and peak systolic torsion rate has high inter-study reproducibility as opposed to peak diastolic torsion rate. The CL shear angle was the most reproducible parameter independently of cardiac anatomy and may develop into a robust tool to quantify cardiac rotational mechanics in longitudinal MR-FT patient studies. © 2015 Wiley Periodicals, Inc.
Is computed tomography an accurate and reliable method for measuring total knee arthroplasty component rotation?

PubMed

Figueroa, José; Guarachi, Juan Pablo; Matas, José; Arnander, Magnus; Orrego, Mario

2016-04-01

Computed tomography (CT) is widely used to assess component rotation in patients with poor results after total knee arthroplasty (TKA). The purpose of this study was to simultaneously determine the accuracy and reliability of CT in measuring TKA component rotation. TKA components were implanted in dry-bone models and assigned to two groups. The first group (n = 7) had variable femoral component rotations, and the second group (n = 6) had variable tibial tray rotations. CT images were then used to assess component rotation. Accuracy of CT rotational assessment was determined by mean difference, in degrees, between implanted component rotation and CT-measured rotation. Intraclass correlation coefficient (ICC) was applied to determine intra-observer and inter-observer reliability. Femoral component accuracy showed a mean difference of 2.5° and the tibial tray a mean difference of 3.2°. There was good intra- and inter-observer reliability for both components, with a femoral ICC of 0.8 and 0.76, and tibial ICC of 0.68 and 0.65, respectively. CT rotational assessment accuracy can differ from true component rotation by approximately 3° for each component. It does, however, have good inter- and intra-observer reliability.
Comparison of 2-dimensional, 3-dimensional, and vascular ultrasonographic parameters for endometrial receptivity between 2 consecutive stimulated in vitro fertilization cycles.

PubMed

Ng, Ernest Hung Yu; Chan, Carina Chi Wai; Tang, Oi Shan; Ho, Pak Chung

2007-07-01

We compared the ultrasonographic parameters for endometrial receptivity between 2 consecutive in vitro fertilization (IVF) cycles in the same patients. Patients who had undergone 2 in vitro fertilization cycles between November 2002 and December 2004 were recruited. A 3-dimensional ultrasonographic examination with power Doppler imaging was performed on the day of oocyte retrieval to determine the endometrial thickness, endometrial pattern, pulsatility and resistive indices of uterine vessels, endometrial volume, vascularization index, flow index, and vascularization flow index of endometrial and subendometrial regions. Of 662 patients, 95 (14.4%) underwent 2 consecutive cycles using the same stimulation regimen during the study period. There were no significant differences in these ultrasonographic parameters between the first and second cycles. The intraclass correlation coefficient (ICC) for endometrial volume was significantly higher than that of other ultrasonographic parameters. The ICC for the endometrial thickness, uterine pulsatility index, and endometrial 3-dimensional power Doppler flow indices were similar. Ultrasonographic parameters for endometrial receptivity were comparable in the 2 consecutive stimulated cycles. The endometrial volume had the highest ICC among these ultrasonographic parameters and was most reproducible between 2 cycles.
Intrarater reliability of the Humac NORM isokinetic dynamometer for strength measurements of the knee and shoulder muscles.

PubMed

Habets, Bas; Staal, J Bart; Tijssen, Marsha; van Cingel, Robert

2018-01-10

To determine the intrarater reliability of the Humac NORM isokinetic dynamometer for concentric and eccentric strength tests of knee and shoulder muscles. 54 participants (50% female, average age 20.9 ± 3.1 years) performed concentric and eccentric strength measures of the knee extensors and flexors, and the shoulder internal and external rotators on two different Humac NORM isokinetic dynamometers, which were situated at two different centers. The knee extensors and flexors were tested concentrically at 60° and 180°/s, and eccentrically at 60° s. Concentric strength of the shoulder internal and external rotators, and eccentric strength of the external rotators were measured at 60° and 120°/s. We calculated intraclass correlation coefficients (ICCs), standard error of measurement, standard error of measurement expressed as a %, and the smallest detectable change to determine reliability and measurement error. ICCs for the knee tests ranged from 0.74 to 0.89, whereas ICC values for the shoulder tests ranged from 0.72 to 0.94. Measurement error was highest for the concentric test of the knee extensors and lowest for the concentric test of shoulder external rotators.
Inter-rater Agreement on Final Competency Testing Utilizing Standardized Patients.

PubMed

Bowman, Dixie H; Ferber, Kyle L; Sima, Adam P

2016-01-01

The purpose of this study was to determine whether licensed physical therapists (n=8) serving as standardized patients (SPs) for practical examinations evaluate physical therapy students (n=51) equivalently to the physical therapy course instructor (n=1). The SPs completed the same assessment based on the evaluation criteria as did the instructor. The scores for the practical examination, answers to three questions, and the documentation note were summarized separately for the SP and the instructor by means and standard deviations. A paired t-test and an intraclass correlation coefficient (ICC) for each aspect of the score were calculated. ICC(1,1) values were reported along with corresponding 95% confidence intervals. The instructor had significantly higher scores for the practical exam and the overall score compared to the ratings from the SPs. No differences were observed between the instructor and SP scores on the three answers to the questions and documentation note scores. Based on the ICC values identified in this study, a physical therapist serving as an SP may not be an adequate replacement for an instructor when it comes to grading physical therapy students on all aspects of their competency tests.
The Dutch Activity Card Sort institutional version was reproducible, but biased against women.

PubMed

Jong, A M; van Nes, F A; Lindeboom, R

2012-01-01

To examine the reproducibility of the institutional version of the Dutch Activity Card Sort (ACS-NL) and the possible presence of gender bias. Older rehabilitation inpatients (N = 52) were included. Intra- and inter-rater agreement for the ACS-NL total and subscale scores was examined by intraclass correlations (ICC), and agreement of individual items by the κ coefficient (k). Gender bias was examined by the proportion of men and women selecting an ACS item. ICC for inter-rater agreement of the ACS total score ranged between 0.78 and 0.87, ICC for intra-rater agreement ranged between 0.79 and 0.89. Median inter-rater κ for ACS-NL items was 0.72 (interquartile scores; 0.62-0.80). The inter-rater agreement (k = 0.43) and intra-rater agreement (k = 0.39) for the five most important activities was lower. Twenty ACS activities favoured men and seven activities favoured women. As a result, men scored systematically higher on the ACS-NL than women. Logistic regression analysis correcting for activity engagement level confirmed our findings. The reproducibility of the ACS-NL was high. The ACS-NL institutional version score may be biased in favour of men.
Test-Retest Reliability of a Novel Isokinetic Squat Device With Strength-Trained Athletes.

PubMed

Bridgeman, Lee A; McGuigan, Michael R; Gill, Nicholas D; Dulson, Deborah K

2016-11-01

Bridgeman, LA, McGuigan, MR, Gill, ND, and Dulson, DK. Test-retest reliability of a novel isokinetic squat device with strength-trained athletes. J Strength Cond Res 30(11): 3261-3265, 2016-The aim of this study was to investigate the test-retest reliability of a novel multijoint isokinetic squat device. The subjects in this study were 10 strength-trained athletes. Each subject completed 3 maximal testing sessions to assess peak concentric and eccentric force (N) over a 3-week period using the Exerbotics squat device. Mean differences between eccentric and concentric force across the trials were calculated. Intraclass correlation coefficients (ICCs) and coefficients of variation (CVs) for the variables of interest were calculated using an excel reliability spreadsheet. Between trials 1 and 2 an 11.0 and 2.3% increase in mean concentric and eccentric forces, respectively, was reported. Between trials 2 and 3 a 1.35% increase in the mean concentric force production and a 1.4% increase in eccentric force production was reported. The mean concentric peak force CV and ICC across the 3 trials was 10% (7.6-15.4) and 0.95 (0.87-0.98) respectively. However, the mean eccentric peak force CV and ICC across the trials was 7.2% (5.5-11.1) and 0.90 (0.76-0.97), respectively. Based on these findings it is suggested that the Exerbotics squat device shows good test-retest reliability. Therefore practitioners and investigators may consider its use to monitor changes in concentric and eccentric peak force.
Test-retest reliability of the Military Pre-training Questionnaire.

PubMed

Robinson, M; Stokes, K; Bilzon, J; Standage, M; Brown, P; Thompson, D

2010-09-01

Musculoskeletal injuries are a significant cause of morbidity during military training. A brief, inexpensive and user-friendly tool that demonstrates reliability and validity is warranted to effectively monitor the relationship between multiple predictor variables and injury incidence in military populations. To examine the test-retest reliability of the Military Pre-training Questionnaire (MPQ), designed specifically to assess risk factors for injury among military trainees across five domains (physical activity, injury history, diet, alcohol and smoking). Analyses were based on a convenience sample of 58 male British Army trainees. Kappa (kappa), weighted kappa (kappa(w)) and intraclass correlation coefficients (ICC) were used to evaluate the 2-week test-retest reliability of the MPQ. For index measures constituting the assessment of a given construct, internal consistency was assessed by Cronbach's alpha (alpha) coefficients. Reliability of individual items ranged from poor to almost perfect (kappa range = 0.45-0.86; kappa(w) range = 0.11-0.91; ICC range = 0.34-0.86) with most items demonstrating moderate reliability. Overall scores related to physical activity, diet, alcohol and smoking constructs were reliable between both administrations (ICC = 0.63-0.85). Support for the internal consistency of the incorporated alcohol (alpha = 0.78) and cigarette (alpha = 0.75) scales was also provided. The MPQ is a reliable self-report instrument for assessing multiple injury-related risk factors during initial military training. Further assessment of the psychometric properties of the MPQ (e.g. different types of validity) with military populations/samples will support its interpretation and use in future surveillance and epidemiological studies.
Validity and reliability of rectus femoris ultrasound measurements: Comparison of curved-array and linear-array transducers.

PubMed

Hammond, Kendra; Mampilly, Jobby; Laghi, Franco A; Goyal, Amit; Collins, Eileen G; McBurney, Conor; Jubran, Amal; Tobin, Martin J

2014-01-01

Muscle-mass loss augers increased morbidity and mortality in critically ill patients. Muscle-mass loss can be assessed by wide linear-array ultrasound transducers connected to cumbersome, expensive console units. Whether cheaper, hand-carried units equipped with curved-array transducers can be used as alternatives is unknown. Accordingly, our primary aim was to investigate in 15 nondisabled subjects the validity of measurements of rectus femoris cross-sectional area by using a curved-array transducer against a linear-array transducer-the reference-standard technique. In these subjects, we also determined the reliability of measurements obtained by a novice operator versus measurements obtained by an experienced operator. Lastly, the relationship between quadriceps strength and rectus area recorded by two experienced operators with a curved-array transducer was assessed in 17 patients with chronic obstructive pulmonary disease (COPD). In nondisabled subjects, the rectus cross-sectional area measured with the curved-array transducer by the novice and experienced operators was valid (intraclass correlation coefficient [ICC]: 0.98, typical percentage error [%TE]: 3.7%) and reliable (ICC: 0.79, %TE: 9.7%). In the subjects with COPD, both reliability (ICC: 0.99) and repeatability (%TE: 7.6% and 9.8%) were high. Rectus area was related to quadriceps strength in COPD for both experienced operators (coefficient of determination: 0.67 and 0.70). In conclusion, measurements of rectus femoris cross-sectional area recorded with a curved-array transducer connected to a hand-carried unit are valid, reliable, and reproducible, leading us to contend that this technique is suitable for cross-sectional and longitudinal studies.
iPhone Sensors in Tracking Outcome Variables of the 30-Second Chair Stand Test and Stair Climb Test to Evaluate Disability: Cross-Sectional Pilot Study

PubMed Central

Samaan, Michael A; Schultz, Brooke; Popovic, Tijana; Souza, Richard B; Majumdar, Sharmila

2017-01-01

Background Performance tests are important to characterize patient disabilities and functional changes. The Osteoarthritis Research Society International and others recommend the 30-second Chair Stand Test and Stair Climb Test, among others, as core tests that capture two distinct types of disability during activities of daily living. However, these two tests are limited by current protocols of testing in clinics. There is a need for an alternative that allows remote testing of functional capabilities during these tests in the osteoarthritis patient population. Objective Objectives are to (1) develop an app for testing the functionality of an iPhone’s accelerometer and gravity sensor and (2) conduct a pilot study objectively evaluating the criterion validity and test-retest reliability of outcome variables obtained from these sensors during the 30-second Chair Stand Test and Stair Climb Test. Methods An iOS app was developed with data collection capabilities from the built-in iPhone accelerometer and gravity sensor tools and linked to Google Firebase. A total of 24 subjects performed the 30-second Chair Stand Test with an iPhone accelerometer collecting data and an external rater manually counting sit-to-stand repetitions. A total of 21 subjects performed the Stair Climb Test with an iPhone gravity sensor turned on and an external rater timing the duration of the test on a stopwatch. App data from Firebase were converted into graphical data and exported into MATLAB for data filtering. Multiple iterations of a data processing algorithm were used to increase robustness and accuracy. MATLAB-generated outcome variables were compared to the manually determined outcome variables of each test. Pearson’s correlation coefficients (PCCs), Bland-Altman plots, intraclass correlation coefficients (ICCs), standard errors of measurement, and repeatability coefficients were generated to evaluate criterion validity, agreement, and test-retest reliability of iPhone sensor data against gold-standard manual measurements. Results App accelerometer data during the 30-second Chair Stand Test (PCC=.890) and gravity sensor data during the Stair Climb Test (PCC=.865) were highly correlated to gold-standard manual measurements. Greater than 95% of values on Bland-Altman plots comparing the manual data to the app data fell within the 95% limits of agreement. Strong intraclass correlation was found for trials of the 30-second Chair Stand Test (ICC=.968) and Stair Climb Test (ICC=.902). Standard errors of measurement for both tests were found to be within acceptable thresholds for MATLAB. Repeatability coefficients for the 30-second Chair Stand Test and Stair Climb Test were 0.629 and 1.20, respectively. Conclusions App-based performance testing of the 30-second Chair Stand Test and Stair Climb Test is valid and reliable, suggesting its applicability to future, larger-scale studies in the osteoarthritis patient population. PMID:29079549
Axial Globe Position Measurement: A Prospective Multicenter Study by the International Thyroid Eye Disease Society.

PubMed

Bingham, Chad M; Sivak-Callcott, Jennifer A; Gurka, Matthew J; Nguyen, John; Hogg, Jeffery P; Feldon, Steve E; Fay, Aaron; Seah, Lay-Leng; Strianese, Diego; Durairaj, Vikram D; Uddin, Jimmy; Devoto, Martin H; Harris, Matheson; Saunders, Justin; Osaki, Tammy H; Looi, Audrey; Teo, Livia; Davies, Brett W; Elefante, Andrea; Shen, Sunny; Realini, Tony; Fischer, William; Kazim, Michael

2016-01-01

Identify a reproducible measure of axial globe position (AGP) for multicenter studies on patients with thyroid eye disease (TED). This is a prospective, international, multicenter, observational study in which 3 types of AGP evaluation were examined: radiologic, clinical, and photographic. In this study, CT was the modality to which all other methods were compared. CT AGP was measured from an orthogonal line between the anterior lateral orbital rims to the cornea. All CT measurements were made at a single institution by 3 individual clinicians. Clinical evaluation was performed with exophthalmometry. Three clinicians from each clinical site assessed AGP with 3 different exophthalmometers and horizontal palpebral width using a ruler. Each physician made 3 separate measurements with each type of exophthalmometer not in succession. All photographic measurements were made at a single institution. AGP was measured from lateral photographs in which a standard marker was placed at the anterior lateral orbital rim. Horizontal and vertical palpebral fissure were measured from frontal photographs. Three trained readers measured 3 separate times not in succession. Exophthalmometry and photography method validity was assessed by agreement with CT (mean differences calculation, intraclass correlation coefficients [ICCs], Bland-Altman figures). Correlation between palpebral fissure and CT AGP was assessed with Pearson correlation. Intraclinician and interclinician reliability was evaluated using ICCs. Sixty-eight patients from 7 centers participated. CT mean AGP was 21.37 mm (15.96-28.90 mm) right and 21.22 mm (15.87-28.70 mm) left (ICC 0.996 and 0.995). Exophthalmometry AGP fell between 18 mm and 25 mm. Intraclinician agreement across exophthalmometers was ideal (ICC 0.948-0.983). Agreement between clinicians was greater than 0.85 for all upright exophthalmometry measurements. Photographic mean AGP was 20.47 mm (10.92-30.88 mm) right and 20.30 mm (8.61-28.72 mm) left. Intrareader and interreader agreement was ideal (ICC 0.991-0.989). All exophthalmometers' mean differences from CT ranged between -0.06 mm (±1.36 mm) and 0.54 mm (±1.61 mm); 95% confidence interval fell within 1 mm. Magnitude of AGP did not affect exophthalmometry validity. Oculus best estimated CT AGP but differences from other exophthalmometers were not clinically meaningful in upright measurements. Photographic AGP (right ICC = 0.575, left ICC = 0.355) and palpebral fissure do not agree with CT. Upright clinical exophthalmometry accurately estimates CT AGP in TED. AGP measurement was reliably reproduced by the same clinician and between clinicians at multiple institutions using the protocol in this study. These findings allow reliable measurement of AGP that will be of considerable value in future outcome studies.
Radiation detector device for rejecting and excluding incomplete charge collection events

DOEpatents

Bolotnikov, Aleksey E.; De Geronimo, Gianluigi; Vernon, Emerson; Yang, Ge; Camarda, Giuseppe; Cui, Yonggang; Hossain, Anwar; Kim, Ki Hyun; James, Ralph B.

2016-05-10

A radiation detector device is provided that is capable of distinguishing between full charge collection (FCC) events and incomplete charge collection (ICC) events based upon a correlation value comparison algorithm that compares correlation values calculated for individually sensed radiation detection events with a calibrated FCC event correlation function. The calibrated FCC event correlation function serves as a reference curve utilized by a correlation value comparison algorithm to determine whether a sensed radiation detection event fits the profile of the FCC event correlation function within the noise tolerances of the radiation detector device. If the radiation detection event is determined to be an ICC event, then the spectrum for the ICC event is rejected and excluded from inclusion in the radiation detector device spectral analyses. The radiation detector device also can calculate a performance factor to determine the efficacy of distinguishing between FCC and ICC events.
Reliability of Various Measurement Stations for Determining Plantar Fascia Thickness and Echogenicity.

PubMed

Bisi-Balogun, Adebisi; Cassel, Michael; Mayer, Frank

2016-04-13

This study aimed to determine the relative and absolute reliability of ultrasound (US) measurements of the thickness and echogenicity of the plantar fascia (PF) at different measurement stations along its length using a standardized protocol. Twelve healthy subjects (24 feet) were enrolled. The PF was imaged in the longitudinal plane. Subjects were assessed twice to evaluate the intra-rater reliability. A quantitative evaluation of the thickness and echogenicity of the plantar fascia was performed using Image J, a digital image analysis and viewer software. A sonography evaluation of the thickness and echogenicity of the PF showed a high relative reliability with an Intra class correlation coefficient of ≥0.88 at all measurement stations. However, the measurement stations for both the PF thickness and echogenicity which showed the highest intraclass correlation coefficient (ICCs) did not have the highest absolute reliability. Compared to other measurement stations, measuring the PF thickness at 3 cm distal and the echogenicity at a region of interest 1 cm to 2 cm distal from its insertion at the medial calcaneal tubercle showed the highest absolute reliability with the least systematic bias and random error. Also, the reliability was higher using a mean of three measurements compared to one measurement. To reduce discrepancies in the interpretation of the thickness and echogenicity measurements of the PF, the absolute reliability of the different measurement stations should be considered in clinical practice and research rather than the relative reliability with the ICC.
Reliability of Various Measurement Stations for Determining Plantar Fascia Thickness and Echogenicity

PubMed Central

Bisi-Balogun, Adebisi; Cassel, Michael; Mayer, Frank

2016-01-01

This study aimed to determine the relative and absolute reliability of ultrasound (US) measurements of the thickness and echogenicity of the plantar fascia (PF) at different measurement stations along its length using a standardized protocol. Twelve healthy subjects (24 feet) were enrolled. The PF was imaged in the longitudinal plane. Subjects were assessed twice to evaluate the intra-rater reliability. A quantitative evaluation of the thickness and echogenicity of the plantar fascia was performed using Image J, a digital image analysis and viewer software. A sonography evaluation of the thickness and echogenicity of the PF showed a high relative reliability with an Intra class correlation coefficient of ≥0.88 at all measurement stations. However, the measurement stations for both the PF thickness and echogenicity which showed the highest intraclass correlation coefficient (ICCs) did not have the highest absolute reliability. Compared to other measurement stations, measuring the PF thickness at 3 cm distal and the echogenicity at a region of interest 1 cm to 2 cm distal from its insertion at the medial calcaneal tubercle showed the highest absolute reliability with the least systematic bias and random error. Also, the reliability was higher using a mean of three measurements compared to one measurement. To reduce discrepancies in the interpretation of the thickness and echogenicity measurements of the PF, the absolute reliability of the different measurement stations should be considered in clinical practice and research rather than the relative reliability with the ICC. PMID:27089369
Validity and Reliability of Nintendo Wii Fit Balance Scores

PubMed Central

Wikstrom, Erik A.

2012-01-01

Context: Interactive gaming systems have the potential to help rehabilitate patients with musculoskeletal conditions. The Nintendo Wii Balance Board, which is part of the Wii Fit game, could be an effective tool to monitor progress during rehabilitation because the board and game can provide objective measures of balance. However, the validity and reliability of Wii Fit balance scores remain unknown. Objective: To determine the concurrent validity of balance scores produced by the Wii Fit game and the intrasession and intersession reliability of Wii Fit balance scores. Design: Descriptive laboratory study. Setting: Sports medicine research laboratory. Patients or Other Participants: Forty-five recreationally active participants (age = 27.0 ± 9.8 years, height = 170.9 ± 9.2 cm, mass = 72.4 ± 11.8 kg) with a heterogeneous history of lower extremity injury. Intervention(s): Participants completed a single-limb–stance task on a force plate and the Star Excursion Balance Test (SEBT) during the first test session. Twelve Wii Fit balance activities were completed during 2 test sessions separated by 1 week. Main Outcome Measure(s): Postural sway in the anteroposterior (AP) and mediolateral (ML) directions and the AP, ML, and resultant center-of-pressure (COP) excursions were calculated from the single-limb stance. The normalized reach distance was recorded for the anterior, posteromedial, and posterolateral directions of the SEBT. Wii Fit balance scores that the game software generated also were recorded. Results: All 96 of the calculated correlation coefficients among Wii Fit activity outcomes and established balance outcomes were interpreted as poor (r < 0.50). Intrasession reliability for Wii Fit balance activity scores ranged from good (intraclass correlation coefficient [ICC] = 0.80) to poor (ICC = 0.39), with 8 activities having poor intrasession reliability. Similarly, 11 of the 12 Wii Fit balance activity scores demonstrated poor intersession reliability, with scores ranging from fair (ICC = 0.74) to poor (ICC = 0.29). Conclusions: Wii Fit balance activity scores had poor concurrent validity relative to COP outcomes and SEBT reach distances. In addition, the included Wii Fit balance activity scores generally had poor intrasession and intersession reliability. PMID:22892412

Rectal cancer: assessment of complete response to preoperative combined radiation therapy with chemotherapy--conventional MR volumetry versus diffusion-weighted MR imaging.

PubMed

Curvo-Semedo, Luís; Lambregts, Doenja M J; Maas, Monique; Thywissen, Thomas; Mehsen, Rana T; Lammering, Guido; Beets, Geerard L; Caseiro-Alves, Filipe; Beets-Tan, Regina G H

2011-09-01

To determine diagnostic performance of diffusion-weighted (DW) magnetic resonance (MR) imaging for assessment of complete tumor response (CR) after combined radiation therapy with chemotherapy (CRT) in patients with locally advanced rectal cancer (LARC) by means of volumetric signal intensity measurements and apparent diffusion coefficient (ADC) measurements and to compare the performance of DW imaging with that of T2-weighted MR volumetry. A retrospective analysis of 50 patients with LARC, for whom clinical and imaging data were retrieved from a previous imaging study approved by the local institutional ethical committee and for which all patients provided informed consent, was conducted. Patients underwent pre- and post-CRT standard T2-weighted MR and DW MR. Two independent readers placed free-hand regions of interest (ROIs) in each tumor-containing section on both data sets to determine pre- and post-CRT tumor volumes and tumor volume reduction rates (volume). ROIs were copied to an ADC map to calculate tumor ADCs. Histopathologic findings were the standard of reference. Receiver operating characteristic (ROC) curves were generated to compare performance of T2-weighted and DW MR volumetry and ADC. The intraclass correlation coefficient (ICC) was used to evaluate interobserver variability and the correlation between T2-weighted and DW MR volumetry. Areas under the ROC curve (AUCs) for identification of a CR that was based on pre-CRT volume, post-CRT volume, and volume, respectively, were 0.57, 0.70, and 0.84 for T2-weighted MR versus 0.63, 0.93, and 0.92 for DW MR volumetry (P = .15, .02, .42). Pre- and post-CRT ADC and ADC AUCs were 0.55, 0.54, and 0.51, respectively. Interobserver agreement was excellent for all pre-CRT measurements (ICC, 0.91-0.96) versus good (ICC, 0.61-0.79) for post-CRT measurements. ICC between T2-weighted and DW MR volumetry was excellent (0.97) for pre-CRT measurements versus fair (0.25) for post-CRT measurements. Post-CRT DW MR volumetry provided high diagnostic performance in assessing CR and was significantly more accurate than T2-weighted MR volumetry. Post-CRT DW MR was equally as accurate as volume measurements of both T2-weighted and DW MR. Pre-CRT volumetry and ADC were not reliable.
Relative and absolute test-retest reliabilities of pressure pain threshold in patients with knee osteoarthritis.

PubMed

Srimurugan Pratheep, Neeraja; Madeleine, Pascal; Arendt-Nielsen, Lars

2018-04-25

Pressure pain threshold (PPT) and PPT maps are commonly used to quantify and visualize mechanical pain sensitivity. Although PPT's have frequently been reported from patients with knee osteoarthritis (KOA), the absolute and relative reliability of PPT assessments remain to be determined. Thus, the purpose of this study was to evaluate the test-retest relative and absolute reliability of PPT in KOA. For that purpose, intra- and interclass correlation coefficient (ICC) as well as the standard error of measurement (SEM) and the minimal detectable change (MDC) values within eight anatomical locations covering the most painful knee of KOA patients was measured. Twenty KOA patients participated in two sessions with a period of 2 weeks±3 days apart. PPT's were assessed over eight anatomical locations covering the knee and two remote locations over tibialis anterior and brachioradialis. The patients rated their maximum pain intensity during the past 24 h and prior to the recordings on a visual analog scale (VAS), and completed The Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) and PainDetect surveys. The ICC, SEM and MDC between the sessions were assessed. The ICC for the individual variability was expressed with coefficient of variance (CV). Bland-Altman plots were used to assess potential bias in the dataset. The ICC ranged from 0.85 to 0.96 for all the anatomical locations which is considered "almost perfect". CV was lowest in session 1 and ranged from 44.2 to 57.6%. SEM for comparison ranged between 34 and 71 kPa and MDC ranged between 93 and 197 kPa with a mean PPT ranged from 273.5 to 367.7 kPa in session 1 and 268.1-331.3 kPa in session 2. The analysis of Bland-Altman plot showed no systematic bias. PPT maps showed that the patients had lower thresholds in session 2, but no significant difference was observed for the comparison between the sessions for PPT or VAS. No correlations were seen between PainDetect and PPT and PainDetect and WOMAC. Almost perfect relative and absolute reliabilities were found for the assessment of PPT's for KOA patients. The present investigation implicates that PPT's is reliable for assessing pain sensitivity and sensitization in KOA patients.
Reproducibility of graph metrics of human brain structural networks.

PubMed

Duda, Jeffrey T; Cook, Philip A; Gee, James C

2014-01-01

Recent interest in human brain connectivity has led to the application of graph theoretical analysis to human brain structural networks, in particular white matter connectivity inferred from diffusion imaging and fiber tractography. While these methods have been used to study a variety of patient populations, there has been less examination of the reproducibility of these methods. A number of tractography algorithms exist and many of these are known to be sensitive to user-selected parameters. The methods used to derive a connectivity matrix from fiber tractography output may also influence the resulting graph metrics. Here we examine how these algorithm and parameter choices influence the reproducibility of proposed graph metrics on a publicly available test-retest dataset consisting of 21 healthy adults. The dice coefficient is used to examine topological similarity of constant density subgraphs both within and between subjects. Seven graph metrics are examined here: mean clustering coefficient, characteristic path length, largest connected component size, assortativity, global efficiency, local efficiency, and rich club coefficient. The reproducibility of these network summary measures is examined using the intraclass correlation coefficient (ICC). Graph curves are created by treating the graph metrics as functions of a parameter such as graph density. Functional data analysis techniques are used to examine differences in graph measures that result from the choice of fiber tracking algorithm. The graph metrics consistently showed good levels of reproducibility as measured with ICC, with the exception of some instability at low graph density levels. The global and local efficiency measures were the most robust to the choice of fiber tracking algorithm.
Validation of a Novel Scoring System for Changes in Skeletal Manifestations of Hypophosphatasia in Newborns, Infants, and Children: The Radiographic Global Impression of Change Scale.

PubMed

Whyte, Michael P; Fujita, Kenji P; Moseley, Scott; Thompson, David D; McAlister, William H

2018-05-01

Hypophosphatasia (HPP) is the heritable metabolic disease characterized by impaired skeletal mineralization due to low activity of the tissue-nonspecific isoenzyme of alkaline phosphatase. Although HPP during growth often manifests with distinctive radiographic skeletal features, no validated method was available to quantify them, including changes over time. We created the Radiographic Global Impression of Change (RGI-C) scale to assess changes in the skeletal burden of pediatric HPP. Site-specific pairs of radiographs of newborns, infants, and children with HPP from three clinical studies of asfotase alfa, an enzyme replacement therapy for HPP, were obtained at baseline and during treatment. Each pair was scored by three pediatric radiologists ("raters"), with nine raters across the three studies. Intrarater and interrater agreement was determined by weighted Kappa coefficients. Interrater reliability was assessed using intraclass correlation coefficients (ICCs) and by two-way random effects analysis of variance (ANOVA) and a mixed-model repeated measures ANOVA. Pearson correlation coefficients evaluated relationships of the RGI-C to the Rickets Severity Scale (RSS), Pediatric Outcomes Data Collection Instrument Global Function Parent Normative Score, Childhood Health Assessment Questionnaire Disability Index, 6-Minute Walk Test percent predicted, and Z-score for height in patients aged 6 to 12 years at baseline. Eighty-nine percent (8/9) of raters showed substantial or almost perfect intrarater agreement of sequential RGI-C scores (weighted Kappa coefficients, 0.72 to 0.93) and moderate or substantial interrater agreement (weighted Kappa coefficients, 0.53 to 0.71) in patients aged 0 to 12 years at baseline. Moderate-to-good interrater reliability was observed (ICC, 0.57 to 0.65). RGI-C scores were significantly (p ≤ 0.0065) correlated with the RSS and with measures of global function, disability, endurance, and growth in the patients aged 6 to 12 years at baseline. Thus, the RGI-C is valid and reliable for detecting clinically important changes in skeletal manifestations of severe HPP in newborns, infants, and children, including during asfotase alfa treatment. © 2018 The Authors. Journal of Bone and Mineral Research Published by Wiley Periodicals Inc. © 2018 The Authors. Journal of Bone and Mineral Research Published by Wiley Periodicals Inc.
Cross-cultural adaptation and validation of the Saudi Arabic version of the Knee Injury and Osteoarthritis Outcome Score (KOOS).

PubMed

Alfadhel, Saud A; Vennu, Vishal; Alnahdi, Ali H; Omar, Mohammed T; Alasmari, Saeed H; AlJafri, Zahra; Bindawas, Saad M

2018-06-07

The Knee Injury Osteoarthritis Outcome Score (KOOS) is a widely used joint-specific measure employed to evaluate pain, symptoms, activities of daily living, recreational activities, and quality of life in patients with knee osteoarthritis (OA). Although the original KOOS has been translated into many languages, a Saudi Arabic version is not available. This study aimed to culturally adapt and evaluate the psychometric properties of the Saudi Arabic version of the KOOS in patients with knee OA. The original KOOS was translated and adapted into Saudi Arabic version over six stages according to the guidelines suggested by Beaton and recommended by the American Association of Orthopedic Surgeons Outcome Committee. Patients diagnosed with knee OA (n = 136) were recruited to examine the psychometric properties, such as internal consistency that was tested using Cronbach's alpha, test-retest reliability that was analyzed using the intra-class correlation coefficient (ICC 2,1 ), and construct validity that examined by testing the correlations between the new version subscales, Form 36 Health Survey subscales, and the Visual Analog Scale, Spearman's correlation coefficient (r s ) was used to measure the correlations. A total of 122 (89.7%) of the 136 participants with knee OA completed the second re-test of new Saudi Arabic version. Excellent internal consistency (Cronbach's alpha = 0.87-0.92) was detected in the subscales of the adapted version, as well as excellent test-retest reliability (ICC 2,1 = 0.92-0.94). The pattern of correlation between the subscales of the Saudi Arabic version of the KOOS, SF-36 domains and the Visual Analog Scale for pain supported the construct validity of the adapted version. The Saudi Arabic version of the KOOS was well accepted and exhibited excellent reliability, internal consistency, and construct validity in Saudi patients with knee OA.
Correlation of apparent diffusion coefficient ratio on 3.0 T MRI with prostate cancer Gleason score.

PubMed

Jyoti, Rajeev; Jain, Tarun Pankaj; Haxhimolla, Hodo; Liddell, Heath; Barrett, Sean Edward

2018-01-01

The purpose was to investigate the usefulness of ADC ratio on Diffusion MRI to discriminate between benign and malignant lesions of Prostate. Images of patients who underwent in-gantry MRI guided prostate lesion biopsy were retrospectively analyzed. Prostate Cancers with 20% or more Gleason score (GS) pattern 3 + 3 = 6 in each core or any volume of higher Gleason score pattern were included. ADC ratio was calculated by two reviewers for each lesion. The ADC ratio was calculated for each lesion by dividing the lowest ADC value in a lesion and highest ADC value in normal prostate in peripheral zone (PZ). ADC ratio values were compared with the biopsy result. Data was analysed using independent samples T-test, Spearman correlation, intra-class correlation coefficient (ICC) and Receiver operating characteristic (ROC) curve. 45 lesions in 33 patients were analyzed. 12 lesions were in transitional zone (TZ) and 33 in perpheral zone PZ. All lesions demonstrated an ADC ratio of 0.45 or lower. GS demonstrated a negative correlation with both the ADC value and ADC ratio . However, ADC ratio (p < 0.001) demonstrated a stronger correlation compared to ADC value alone (p = 0.014). There was no significant statistical difference between GS 3 + 4 and GS 4 + 3 mean ADC tumour value (p = 0.167). However when using ADC ratio , there was a significant difference (p = 0.032). ROC curve analysis demonstrated an area under the curve of 0.83 using ADC ratio and 0.76 when using ADC tumour value when discriminating Gleason 6 from Gleason ≥7 tumours. Inter-observer reliability in the calculation of ADC ratios was excellent, with ICC of 0.964. ADC ratio is a reliable and reproducible tool in quantification of diffusion restriction for clinically significant prostate cancer foci.
Reliability and relative validity of three physical activity questionnaires in Taizhou population of China: the Taizhou Longitudinal Study.

PubMed

Hu, B; Lin, L F; Zhuang, M Q; Yuan, Z Y; Li, S Y; Yang, Y J; Lu, M; Yu, S Z; Jin, L; Ye, W M; Wang, X F

2015-09-01

To examine the test-retest reliabilities and relative validities of the Chinese version of short International Physical Activity Questionnaire (IPAQ-S-C), the Global Physical Activity Questionnaire (GPAQ-C), and the Total Energy Expenditure Questionnaire (TEEQ-C) in a population-based prospective study, the Taizhou Longitudinal Study (TZLS). A longitudinal comparative study. A total of 205 participants (male: 38.54%) aged 30-70 years completed three questionnaires twice (day one and day nine) and physical activity log (PA-log) over seven consecutive days. The test-retest reliabilities were evaluated using intra-class correlation coefficients (ICCs) and the relative validities were estimated by comparing the data from physical activity questionnaires (PAQs) and PA-log. Good reliabilities were observed between the repeated PAQs. The ICCs ranged from 0.51 to 0.80 for IPAQ-C, 0.67 to 0.85 for GPAQ-C, and 0.74 to 0.94 for TEEQ-C, respectively. Energy expenditure of most PA domains estimated by the three PAQs correlated moderately with the results recorded by PA-log except the walking domain of IPAQ-S-C. The partial correlation coefficients between the PAQs and PA-log ranged from 0.44 to 0.58 for IPAQ-S-C, 0.26 to 0.52 for GPAQ-C, and 0.41 to 0.72 for TEEQ-C, respectively. Bland-Altman plots showed acceptable agreement between the three PAQs and PA-log. The three PAQs, especially TEEQ-C, were relatively reliable and valid for assessment of physical activity and could be used in TZLS. Copyright © 2015 The Royal Society for Public Health. Published by Elsevier Ltd. All rights reserved.
Validity and reliability of a Dutch version of the Foot and Ankle Ability Measure.

PubMed

Weel, Hanneke; Zwiers, Ruben; Azim, Donija; Sierevelt, Inger N; Haverkamp, Daniel; van Dijk, C Niek; Kerkhoffs, Gino M M J

2016-04-01

The aim of the study was to develop a Dutch language version of the Foot and Ankle Ability Measure (FAAM) and evaluate its measurement properties according to the consensus-based standards for the selection of health measurement instruments (COSMIN) definitions. A forward-backward translation procedure was performed and subsequently the Dutch version of the FAAM was evaluated for its reliability and validity in 369 patients with a variety of foot and ankle complaints. The reliability was assessed by calculating the intraclass correlation coefficients (ICC, test-retest reliability), Cronbach's alpha (internal consistency), the standard error of measurement and the minimal detectable change (MDC). Additionally, this was done for athletes. The construct validity was assessed by the use of Spearman's correlation coefficient between FAAM domains and similar and contradictory domains of the Foot and Ankle Outcome Score, Short Form 36 and the Numeric Rating Scale for pain. The ICC of the subscales ranged from 0.62 to 0.86. Cronbach's alpha's minimum was 0.97. At individual level, the MDC ranged from 23.9 to 44.7 and at group level from 2.77 to 4.32. In the subgroup of athletes, the reliability was higher. The hypothesized correlations of the construct validity were supported by an 80% confirmation rate. The Dutch version of the FAAM met adequate measurement properties, although the reliability is not optimal. The FAAM-Sport subscale is more useful in athletes and the FAAM-Sport % seems not to contribute. In athletes with various foot and ankle symptoms, the FAAM can be used for functional assessment and follow-up at group level. For the general population, the FAAM is less appropriate. Diagnostic study, Level I.
Unilateral lower limb strength assessed using the Nintendo Wii Balance Board: a simple and reliable method.

PubMed

Blomkvist, A W; Andersen, S; de Bruin, E; Jorgensen, M G

2017-10-01

Lower limb weakness is an important risk factor for fall accidents and a predictor for all-cause mortality among older adults. Unilateral whole-lower limb strength may be a better measure of fall risk than the bilateral measure. In addition, a number of clinical conditions affect only one leg, and thus this type of assessment is relevant in clinical settings. To explore the intra-rater reproducibility of the Nintendo Wii Balance Board (WBB) to measure unilateral whole-lower limb strength and to compare the method with stationary isometric muscle apparatus (SID). Intra-rater test-retest design with 1 week between sessions. Thirty community-dwelling older adults (69 ± 4.2 years) were enrolled and examined for maximum lower limb strength in their dominant and non-dominant leg. Intraclass correlation coefficient (ICC) was calculated to describe relative reproducibility, while standard error of measurement (SEM), limits of agreement (LOA) and smallest real difference (SRD) were calculated to describe absolute reproducibility between test sessions. Concurrent validity with the SID was explored using the Pearson product-moment correlation coefficient (PCC). No systematic difference was observed between test sessions. ICC was 0.919-0.950 and SEM, LOA and SRD was 2.9-4.1 kg, 24.1-28.3 kg and 7.6-11.3 kg, respectively. Further, the PCC was 0.755 and 0.730 for the dominant limb and the non-dominant limb, respectively. A high relative and an acceptable absolute reproducibility was seen when using the Nintendo Wii Balance Board for testing unilateral lower limb strength in community-dwelling older adults. The WBB correlated strongly with the SID.
Environmental and Genetic Factors Explain Differences in Intraocular Scattering.

PubMed

Benito, Antonio; Hervella, Lucía; Tabernero, Juan; Pennos, Alexandros; Ginis, Harilaos; Sánchez-Romera, Juan F; Ordoñana, Juan R; Ruiz-Sánchez, Marcos; Marín, José M; Artal, Pablo

2016-01-01

To study the relative impact of genetic and environmental factors on the variability of intraocular scattering within a classical twin study. A total of 64 twin pairs, 32 monozygotic (MZ) (mean age: 54.9 ± 6.3 years) and 32 dizygotic (DZ) (mean age: 56.4 ± 7.0 years), were measured after a complete ophthalmologic exam had been performed to exclude all ocular pathologies that increase intraocular scatter as cataracts. Intraocular scattering was evaluated by using two different techniques based on a straylight parameter log(S) estimation: a compact optical instrument based in the principle of optical integration and a psychophysical measurement. Intraclass correlation coefficients (ICC) were used as descriptive statistics of twin resemblance, and genetic models were fitted to estimate heritability. No statistically significant difference was found for MZ and DZ groups for age (P = 0.203), best-corrected visual acuity (P = 0.626), cataract gradation (P = 0.701), sex (P = 0.941), optical log(S) (P = 0.386), or psychophysical log(S) (P = 0.568), with only a minor difference in equivalent sphere (P = 0.008). Intraclass correlation coefficients between siblings were similar for scatter parameters: 0.676 in MZ and 0.471 in DZ twins for optical log(S); 0.533 in MZ twins and 0.475 in DZ twins for psychophysical log(S). For equivalent sphere, ICCs were 0.767 in MZ and 0.228 in DZ twins. Conservative estimates of heritability for the measured scattering parameters were 0.39 and 0.20, respectively. Correlations of intraocular scatter (straylight) parameters in the groups of identical and nonidentical twins were similar. Heritability estimates were of limited magnitude, suggesting that genetic and environmental factors determine the variance of ocular straylight in healthy middle-aged adults.
The development of an instrument to measure quality of vision: the Quality of Vision (QoV) questionnaire.

PubMed

McAlinden, Colm; Pesudovs, Konrad; Moore, Jonathan E

2010-11-01

To develop an instrument to measure subjective quality of vision: the Quality of Vision (QoV) questionnaire. A 30-item instrument was designed with 10 symptoms rated in each of three scales (frequency, severity, and bothersome). The QoV was completed by 900 subjects in groups of spectacle wearers, contact lens wearers, and those having had laser refractive surgery, intraocular refractive surgery, or eye disease and investigated with Rasch analysis and traditional statistics. Validity and reliability were assessed by Rasch fit statistics, principal components analysis (PCA), person separation, differential item functioning (DIF), item targeting, construct validity (correlation with visual acuity, contrast sensitivity, total root mean square [RMS] higher order aberrations [HOA]), and test-retest reliability (two-way random intraclass correlation coefficients [ICC] and 95% repeatability coefficients [R(c)]). Rasch analysis demonstrated good precision, reliability, and internal consistency for all three scales (mean square infit and outfit within 0.81-1.27; PCA >60% variance explained by the principal component; person separation 2.08, 2.10, and 2.01 respectively; and minimal DIF). Construct validity was indicated by strong correlations with visual acuity, contrast sensitivity and RMS HOA. Test-retest reliability was evidenced by a minimum ICC of 0.867 and a minimum 95% R(c) of 1.55 units. The QoV Questionnaire consists of a Rasch-tested, linear-scaled, 30-item instrument on three scales providing a QoV score in terms of symptom frequency, severity, and bothersome. It is suitable for measuring QoV in patients with all types of refractive correction, eye surgery, and eye disease that cause QoV problems.
Chinese translation and validation of the Walking Impairment Questionnaire in patients with peripheral artery disease.

PubMed

Yan, Bryan P; Lau, James Y; Yu, Check-Man; Au, Kim; Chan, Ka-Wai; Yu, Doris S; Ma, Ronald C; Lam, Yat-Yin; Hiatt, William R

2011-06-01

The Walking Impairment Questionnaire (WIQ) is a frequently used questionnaire to evaluate patients with intermittent claudication on four subscales: pain severity, walking distance, walking speed and the ability to climb stairs. The aim of this study is to translate and validate the WIQ in Chinese. After translation and cultural adaptation of the WIQ, 134 patients with intermittent claudication completed the Chinese WIQ and European Quality of Life 5 Dimension (EQ-5D). Walking distances were determined by the 6-minute walk test (6MWT). Correlations between the WIQ, quality of life questionnaire and walking distances were calculated to determine validity. Reliability and internal consistency were determined using the intra-class correlation coefficient (ICC) and Cronbach's alpha (α), respectively. Significant correlations were found between the WIQ score, initial claudication distance (ICD), absolute claudication distance (ACD) and all domains of the EQ-5D (all p ≤ 0.01). Test-retest reliability (ICC = 0.74) and the overall internal consistency determined (α = 0.90) showed good agreement. A lower WIQ score corresponded to shorter walking distances. In conclusion, this study showed that the Chinese version of the WIQ is a valid, reliable and clinically relevant instrument for assessing walking impairment in patients with intermittent claudication.
Reliability of concentrations of organophosphate pesticide metabolites in serial urine specimens from pregnancy in the Generation R Study.

PubMed

Spaan, Suzanne; Pronk, Anjoeka; Koch, Holger M; Jusko, Todd A; Jaddoe, Vincent W V; Shaw, Pamela A; Tiemeier, Henning M; Hofman, Albert; Pierik, Frank H; Longnecker, Matthew P

2015-05-01

The widespread use of organophosphate (OP) pesticides has resulted in ubiquitous exposure in humans, primarily through their diet. Exposure to OP pesticides may have adverse health effects, including neurobehavioral deficits in children. The optimal design of new studies requires data on the reliability of urinary measures of exposure. In the present study, urinary concentrations of six dialkyl phosphate (DAP) metabolites, the main urinary metabolites of OP pesticides, were determined in 120 pregnant women participating in the Generation R Study in Rotterdam. Intra-class correlation coefficients (ICCs) across serial urine specimens taken at <18, 18-25, and >25 weeks of pregnancy were determined to assess reliability. Geometric mean total DAP metabolite concentrations were 229 (GSD 2.2), 240 (GSD 2.1), and 224 (GSD 2.2) nmol/g creatinine across the three periods of gestation. Metabolite concentrations from the serial urine specimens in general correlated moderately. The ICCs for the six DAP metabolites ranged from 0.14 to 0.38 (0.30 for total DAPs), indicating weak to moderate reliability. Although the DAP metabolite levels observed in this study are slightly higher and slightly more correlated than in previous studies, the low to moderate reliability indicates a high degree of within-person variability, which presents challenges for designing well-powered epidemiological studies.
Implicit Review Instrument to Evaluate Quality of Care Delivered by Physicians to Children in Emergency Departments.

PubMed

Marcin, James P; Romano, Patrick S; Dharmar, Madan; Chamberlain, James M; Dudley, Nanette; Macias, Charles G; Nigrovic, Lise E; Powell, Elizabeth C; Rogers, Alexander J; Sonnett, Meridith; Tzimenatos, Leah; Alpern, Elizabeth R; Andrews-Dickert, Rebecca; Borgialli, Dominic A; Sidney, Erika; Casper, Charlie; Dean, Jonathan Michael; Kuppermann, Nathan

2018-06-01

To evaluate the consistency, reliability, and validity of an implicit review instrument that measures the quality of care provided to children in the emergency department (ED). Medical records of randomly selected children from 12 EDs in the Pediatric Emergency Care Applied Research Network (PECARN). Eight pediatric emergency medicine physicians applied the instrument to 620 medical records. We determined internal consistency using Cronbach's alpha and inter-rater reliability using the intraclass correlation coefficient (ICC). We evaluated the validity of the instrument by correlating scores with four condition-specific explicit review instruments. Individual reviewers' Cronbach's alpha had a mean of 0.85 with a range of 0.76-0.97; overall Cronbach's alpha was 0.90. The ICC was 0.49 for the summary score with a range from 0.40 to 0.46. Correlations between the quality of care score and the four condition-specific explicit review scores ranged from 0.24 to 0.38. The quality of care instrument demonstrated good internal consistency, moderate inter-rater reliability, high inter-rater agreement, and evidence supporting validity. The instrument could be useful for systems' assessment and research in evaluating the care delivered to children in the ED. © Health Research and Educational Trust.
Accuracy of an infrared LED device to measure heart rate and energy expenditure during rest and exercise.

PubMed

Lee, C Matthew; Gorelick, Mark; Mendoza, Albert

2011-12-01

The purpose of this study was to examine the accuracy of the ePulse Personal Fitness Assistant, a forearm-worn device that provides measures of heart rate and estimates energy expenditure. Forty-six participants engaged in 4-minute periods of standing, 2.0 mph walking, 3.5 mph walking, 4.5 mph jogging, and 6.0 mph running. Heart rate and energy expenditure were simultaneously recorded at 60-second intervals using the ePulse, an electrocardiogram (EKG), and indirect calorimetry. The heart rates obtained from the ePulse were highly correlated (intraclass correlation coefficients [ICCs] ≥0.85) with those from the EKG during all conditions. The typical errors progressively increased with increasing exercise intensity but were <5 bpm only during rest and 2.0 mph. Energy expenditure from the ePulse was poorly correlated with indirect calorimetry (ICCs: 0.01-0.36) and the typical errors for energy expenditure ranged from 0.69-2.97 kcal · min(-1), progressively increasing with exercise intensity. These data suggest that the ePulse Personal Fitness Assistant is a valid device for monitoring heart rate at rest and low-intensity exercise, but becomes less accurate as exercise intensity increases. However, it does not appear to be a valid device to estimate energy expenditure during exercise.
Reliability and validity of the Iranian version of the QAPACE in adolescents.

PubMed

Amiri, Parisa; Jalali-Farahani, Sara; Zarkesh, Maryam; Barzin, Maryam; Kaviani, Robabeh; Ahmadizad, Sajad

2014-08-01

The aim of this study was to determine the reliability and validity of the Iranian version of the Quantification de l'Activite Physique en Altitude Chez les Enfants (QAPACE) in adolescents. After linguistic validation, the Iranian version of the QAPACE was completed by 359 (52.4 % girls) schoolchildren, aged 15-18 years. Test-retest reliability of the questionnaire was determined by intraclass correlation coefficients (ICCs). For validation purposes, two methods were used for (1) the correlation between VO2peak and the DEE and (2) known-group validity, which was examined by comparing the normal weight adolescents and those who were overweight/obese. ICCs for test-retest ranged from 0.79 to 0.98. The mean scores in test-retest surveys for total score and all of the subscores were significant (p < 0.05). Sex-specific analysis showed a significant correlation between VO2peak and DEE over 12-month, school, and vacation periods in girls (p < 0.05). The mean values for all activities except for transportation, other activities in school, personal artistic activities, sport competition, and home activities were significantly lower in overweight/obese group than normal group. Our results support the initial reliability and validity of the Iranian version of QAPACE as a daily physical activity measure in adolescents.
Evaluative frailty index for physical activity (EFIP): a reliable and valid instrument to measure changes in level of frailty.

PubMed

de Vries, Nienke M; Staal, J Bart; Olde Rikkert, Marcel G M; Nijhuis-van der Sanden, Maria W G

2013-04-01

Physical activity is assumed to be important in the prevention and treatment of frailty. It is unclear, however, to what extent frailty can be influenced because instruments designed to assess frailty have not been validated as evaluative outcome instruments in clinical practice. The aims of this study were: (1) to develop a frailty index (i.e., the evaluative frailty index for physical activity [EFIP]) based on the method of deficit accumulation and (2) to test the clinimetric properties of the EFIP. The content of the EFIP was determined using a written Delphi procedure. Intrarater reliability, interrater reliability, and construct validity were determined in an observational study (n=24). Intrarater reliability and interrater reliability were calculated using Cohen kappa and intraclass correlation coefficients (ICCs). Construct validity was determined by correlating the score on the EFIP with those on the timed "up & go" test (TUG), the performance-oriented mobility assessment (POMA), and the Cumulative Illness Rating Scale for Geriatrics (CIRS-G). Fifty items were included in the EFIP. Interrater reliability (Cohen kappa=0.72, ICC=.96) and intrarater reliability (Cohen kappa=0.77 and 0.80, ICC=.93 and .98) were good. As expected, a fair to moderate correlation with the TUG, POMA, and CIRS-G was found (.61, -.70, and .66, respectively). Reliability and validity of the EFIP have been tested in a small sample. These and other clinimetric properties, such as responsiveness, will be assessed or reassessed in a larger study population. The EFIP is a reliable and valid instrument to evaluate the effect of physical activity on frailty in research and in clinical practice.
The modified gait abnormality rating scale in patients with a conversion disorder: a reliability and responsiveness study.

PubMed

Vandenberg, Justin M; George, Deanna R; O'Leary, Andrea J; Olson, Lindsay C; Strassburg, Kaitlyn R; Hollman, John H

2015-01-01

Individuals with conversion disorder have neurologic symptoms that are not identified by an underlying organic cause. Often the symptoms manifest as gait disturbances. The modified gait abnormality rating scale (GARS-M) may be useful for quantifying gait abnormalities in these individuals. The purpose of this study was to examine the reliability, responsiveness and concurrent validity of GARS-M scores in individuals with conversion disorder. Data from 27 individuals who completed a rehabilitation program were included in this study. Pre- and post-intervention videos were obtained and walking speed was measured. Five examiners independently evaluated gait performance according to the GARS-M criteria. Inter- and intrarater reliability of GARS-M scores were estimated with intraclass correlation coefficients (ICCs). Responsiveness was estimated with the minimum detectable change (MDC). Pre- to post-treatment changes in GARS-M scores were analyzed with a dependent t-test. The correlation between GARS-M scores and walking speed was analyzed to assess concurrent validity. GARS-M scores were quantified with good-to-excellent inter- (ICC = 0.878) and intrarater reliability (ICC = 0.989). The MDC was 2 points. Mean GARS-M scores decreased from 7 ± 5 at baseline to 1 ± 2 at discharge (t26 = 7.411, p < 0.001) and 85% of patients improved beyond the MDC. Furthermore, GARS-M scores and walking speed measurements were moderately correlated (r = -0.582, p = 0.004), indicating that the GARS-M has acceptable concurrent validity. Our findings provide evidence that the GARS-M scores are reliable, valid and responsive for quantifying gait abnormalities in patients with conversion disorder. GARS-M scores provide objective measures upon which treatment effects can be assessed. Copyright © 2014 Elsevier B.V. All rights reserved.
Reliability and validity of two multidimensional self-reported physical activity questionnaires in people with chronic low back pain.

PubMed

Carvalho, Flávia A; Morelhão, Priscila K; Franco, Marcia R; Maher, Chris G; Smeets, Rob J E M; Oliveira, Crystian B; Freitas Júnior, Ismael F; Pinto, Rafael Z

2017-02-01

Although there is some evidence for reliability and validity of self-report physical activity (PA) questionnaires in the general adult population, it is unclear whether we can assume similar measurement properties in people with chronic low back pain (LBP). To determine the test-retest reliability of the International Physical Activity Questionnaire (IPAQ) long-version and the Baecke Physical Activity Questionnaire (BPAQ) and their criterion-related validity against data derived from accelerometers in patients with chronic LBP. Cross-sectional study. Patients with non-specific chronic LBP were recruited. Each participant attended the clinic twice (one week interval) and completed self-report PA. Accelerometer measures >7 days included time spent in moderate-and-vigorous physical activity, steps/day, counts/minute, and vector magnitude counts/minute. Intraclass Correlation Coefficients (ICC) and Bland and Altman method were used to determine reliability and spearman rho correlation were used for criterion-related validity. A total of 73 patients were included in our analyses. The reliability analyses revealed that the BPAQ and its subscales have moderate to excellent reliability (ICC 2,1 : 0.61 to 0.81), whereas IPAQ and most IPAQ domains (except walking) showed poor reliability (ICC 2,1 : 0.20 to 0.40). The Bland and Altman method revealed larger discrepancies for the IPAQ. For the validity analysis, questionnaire and accelerometer measures showed at best fair correlation (rho < 0.37). Although the BPAQ showed better reliability than the IPAQ long-version, both questionnaires did not demonstrate acceptable validity against accelerometer data. These findings suggest that questionnaire and accelerometer PA measures should not be used interchangeably in this population. Copyright © 2016 Elsevier Ltd. All rights reserved.
Compared to X-ray, three-dimensional computed tomography measurement is a reproducible radiographic method for normal proximal humerus.

PubMed

Jia, Xiaoyang; Chen, Yanxi; Qiang, Minfei; Zhang, Kun; Li, Haobo; Jiang, Yuchen; Zhang, Yijie

2016-07-15

Accurate comprehension of the normal humeral morphology is crucial for anatomical reconstruction in shoulder arthroplasty. However, traditional morphological measurements for humerus were mainly based on cadaver and radiography. The purpose of this study was to provide a series of precise and repeatable parameters of the normal proximal humerus for arthroplasty, based on the three-dimensional (3-D) measurements. Radiographic and 3-D computed tomography (CT) measurements of the proximal humerus were performed in a sample of 120 consecutive adults. Sex differences, two image modalities differences, and correlations of the parameters were evaluated. Intra- and inter-observer reproducibility was evaluated using intraclass correlation coefficients (ICCs). In the male group, all parameters except the neck-shaft angle of humerus, based on 3-D CT images, were greater than those in the female group (P < 0.05). All variables were significantly different between two image modalities (P < 0.05). In 3-D CT measurement, all parameters expect neck-shaft angle had correlation with each other (P < 0.001), particularly between two diameters of the humeral head (r = 0.907). All parameters in the 3-D CT measurement had excellent reproducibility (ICC range, 0.878 to 0.936) that was higher than those in the radiographs (ICC range, 0.741 to 0.858). The present study suggested that 3-D CT was more reproducible than plain radiography in the assessment of morphology of the normal proximal humerus. Therefore, this reproducible modality could be utilized in the preoperative planning. Our data could serve as an effective guideline for humeral component selection and improve the design of shoulder prosthesis.

[Characteristics of high resolution diffusion weighted imaging apparent diffusion coefficient histogram and its correlations with cancer stages in patients with nasopharyngeal carcinoma].

PubMed

Wang, G J; Wang, Y; Ye, Y; Chen, F; Lu, Y T; Li, S L

2017-11-07

Objective: To investigate the features of apparent diffusion coefficient (ADC) histogram parameters based on entire tumor volume data in high resolution diffusion weighted imaging of nasopharyngeal carcinoma (NPC) and to evaluate its correlations with cancer stages. Methods: This retrospective study included 154 cases of NPC patients[102 males and 52 females, mean age (48±11) years]who had received readout segmentation of long variable echo trains of MRI scan before radiation therapy. The area of tumor was delineated on each section of axial ADC maps to generate ADC histogram by using Image J. ADC histogram of entire tumor along with the histogram parameters-the tumor voxels, ADC(mean), ADC(25%), ADC(50%), ADC(75%), skewness and kurtosis were obtained by merging all sections with SPSS 22.0 software. Intra-observer repeatability was assessed by using intra-class correlation coefficients (ICC). The patients were subdivided into two groups according to cancer volume: small cancer group (<305 voxels, about 2 cm(3)) and large cancer group (≥2 cm(3)). The correlation between ADC histogram parameters and cancer stages was evaluated with Spearman test. Results: The ICC of measuring ADC histogram parameters of tumor voxels, ADC(mean), ADC(25%), ADC(50%), ADC(75%), skewness, kurtosis was 0.938, 0.861, 0.885, 0.838, 0.836, 0.358 and 0.456, respectively. The tumor voxels was positively correlated with T staging ( r =0.368, P <0.05). There were significant differences in tumor voxels among patients with different T stages ( K =22.306, P <0.05). There were significant differences in the ADC(mean), ADC(25%), ADC(50%) among patients with different T stages in the small cancer group( K =8.409, 8.187, 8.699, all P <0.05), and the up-mentioned three indices were positively correlated with T staging ( r =0.221, 0.209, 0.235, all P <0.05). Skewness and kurtosis differed significantly between the groups with different cancer volume( t =-2.987, Z =-3.770, both P <0.05). Conclusion: The tumor volume, tissue uniformity of NPC are important factors affecting ADC and cancer stages, parameters of ADC histogram (ADC(mean), ADC(25%), ADC(50%)) increases with T staging in NPC smaller than 2 cm(3).
Development of a clinical static and dynamic standing balance measurement tool appropriate for use in adolescents.

PubMed

Emery, Carolyn A; Cassidy, J David; Klassen, Terry P; Rosychuk, Rhonda J; Rowe, Brian B

2005-06-01

There is a need in sports medicine for a static and dynamic standing balance measure to quantify balance ability in adolescents. The purposes of this study were to determine the test-retest reliability of timed static (eyes open) and dynamic (eyes open and eyes closed) unipedal balance measurements and to examine factors associated with balance. Adolescents (n=123) were randomly selected from 10 Calgary high schools. This study used a repeated-measures design. One rater measured unipedal standing balance, including timed eyes-closed static (ECS), eyes-open dynamic (EOD), and eyes-closed dynamic (ECD) balance at baseline and 1 week later. Dynamic balance was measured on a foam surface. Reliability was examined using both intraclass correlation coefficients (ICCs) and Bland and Altman statistical techniques. Multiple linear regressions were used to examine other potentially influencing factors. Based on ICCs, test-retest reliability was adequate for ECS, EOD, and ECD balance (ICC=.69, .59, and .46, respectively). The results of Bland and Altman methods, however, suggest that caution is required in interpreting reliability based on ICCs alone. Although both ECS balance and ECD balance appear to demonstrate adequate test-retest reliability by ICC, Bland and Altman methods of agreement demonstrate sufficient reliability for ECD balance only. Thirty percent of the subjects reached the 180-second maximum on EOD balance, suggesting that this test is not appropriate for use in this population. Balance ability (ECS and ECD) was better in adolescents with no past history of lower-extremity injury. Timed ECD balance is an appropriate and reliable clinical measurement for use in adolescents and is influenced by previous injury.
Training less-experienced faculty improves reliability of skills assessment in cardiac surgery.

PubMed

Lou, Xiaoying; Lee, Richard; Feins, Richard H; Enter, Daniel; Hicks, George L; Verrier, Edward D; Fann, James I

2014-12-01

Previous work has demonstrated high inter-rater reliability in the objective assessment of simulated anastomoses among experienced educators. We evaluated the inter-rater reliability of less-experienced educators and the impact of focused training with a video-embedded coronary anastomosis assessment tool. Nine less-experienced cardiothoracic surgery faculty members from different institutions evaluated 2 videos of simulated coronary anastomoses (1 by a medical student and 1 by a resident) at the Thoracic Surgery Directors Association Boot Camp. They then underwent a 30-minute training session using an assessment tool with embedded videos to anchor rating scores for 10 components of coronary artery anastomosis. Afterward, they evaluated 2 videos of a different student and resident performing the task. Components were scored on a 1 to 5 Likert scale, yielding an average composite score. Inter-rater reliabilities of component and composite scores were assessed using intraclass correlation coefficients (ICCs) and overall pass/fail ratings with kappa. All components of the assessment tool exhibited improvement in reliability, with 4 (bite, needle holder use, needle angles, and hand mechanics) improving the most from poor (ICC range, 0.09-0.48) to strong (ICC range, 0.80-0.90) agreement. After training, inter-rater reliabilities for composite scores improved from moderate (ICC, 0.76) to strong (ICC, 0.90) agreement, and for overall pass/fail ratings, from poor (kappa = 0.20) to moderate (kappa = 0.78) agreement. Focused, video-based anchor training facilitates greater inter-rater reliability in the objective assessment of simulated coronary anastomoses. Among raters with less teaching experience, such training may be needed before objective evaluation of technical skills. Published by Elsevier Inc.
Translation, adaptation and inter-rater reliability of the administration manual for the Fugl-Meyer assessment.

PubMed

Michaelsen, Stella M; Rocha, André S; Knabben, Rodrigo J; Rodrigues, Luciano P; Fernandes, Claudia G C

2011-01-01

Recently, the reliability of the Brazilian version of the Fugl-Meyer Assessment (FMA) was assessed through the scoring given according to observations made by a single evaluator who applied the test. When different raters apply the scale, the reliability may depend on the interpretation given to the assessment sheet. In such cases, a clear administration manual is essential for ensuring homogeneity of application. To translate and adapt the French Canadian version of the FMA administration manual into Brazilian Portuguese and to evaluate the inter-rater reliability when different evaluators apply the FMA on the basis of the information contained in the manual. Eighteen adults (59±10 years) with chronic hemiparesis (38±35 months after a stroke) took part in this study. Eight patients participated in the first part of the study and 10 in the second part. Based on analyzing the results from part 1, an adapted version was developed, in which information and photos were added to illustrate the positions of the patient and evaluator. The inter-rater reliability was assessed using the intraclass correlation coefficient (ICC). The reliability of the FMA based on the adapted version of the manual was excellent for the total motor scores for the upper limbs (ICC=0.98) and lower limbs (ICC=0.90), as well as for movement sense (ICC=0.98) and upper and lower-limb passive range of motion (ICC=0.84 and 0.90, respectively). The reliability was moderate for tactile sensitivity (0.75). The joint pain assessment presented low reliability. The results showed that, except for pain assessment, application of the FMA based on the adapted version of the application manual for Brazilian Portuguese presented adequate inter-rater reliability.
Precision of higher-order aberration measurements with a new Placido-disk topographer and Hartmann-Shack wavefront sensor.

PubMed

López-Miguel, Alberto; Martínez-Almeida, Loreto; González-García, María J; Coco-Martín, María B; Sobrado-Calvo, Paloma; Maldonado, Miguel J

2013-02-01

To assess the intrasession and intersession precision of ocular, corneal, and internal higher-order aberrations (HOAs) measured using an integrated topographer and Hartmann-Shack wavefront sensor (Topcon KR-1W) in refractive surgery candidates. IOBA-Eye Institute, Valladolid, Spain. Evaluation of diagnostic technology. To analyze intrasession repeatability, 1 experienced examiner measured eyes 9 times successively. To study intersession reproducibility, the same clinician obtained measurements from another set of eyes in 2 consecutive sessions 1 week apart. Ocular, corneal, and internal HOAs were obtained. Coma and spherical aberrations, 3rd- and 4th-order aberrations, and total HOAs were calculated for a 6.0 mm pupil diameter. For intrasession repeatability (75 eyes), excellent intraclass correlation coefficients (ICCs) were obtained (ICC >0.87), except for internal primary coma (ICC = 0.75) and 3rd-order (ICC = 0.72) HOAs. Repeatability precision (1.96 × S(w)) values ranged from 0.03 μm (corneal primary spherical) to 0.08 μm (ocular primary coma). For intersession reproducibility (50 eyes), ICCs were good (>0.8) for ocular primary spherical, 3rd-order, and total higher-order aberrations; reproducibility precision values ranged from 0.06 μm (corneal primary spherical) to 0.21 μm (internal 3rd order), with internal HOAs having the lowest precision (≥0.12 μm). No systematic bias was found between examinations on different days. The intrasession repeatability was high; therefore, the device's ability to measure HOAs in a reliable way was excellent. Under intersession reproducibility conditions, dependable corneal primary spherical aberrations were provided. No author has a financial or proprietary interest in any material or method mentioned. Copyright © 2012 ASCRS and ESCRS. Published by Elsevier Inc. All rights reserved.
Isokinetic Strength and Endurance Tests used Pre- and Post-Spaceflight: Test-Retest Reliability

NASA Technical Reports Server (NTRS)

Laughlin, Mitzi S.; Lee, Stuart M. C.; Loehr, James A.; Amonette, William E.

2009-01-01

To assess changes in muscular strength and endurance after microgravity exposure, NASA measures isokinetic strength and endurance across multiple sessions before and after long-duration space flight. Accurate interpretation of pre- and post-flight measures depends upon the reliability of each measure. The purpose of this study was to evaluate the test-retest reliability of the NASA International Space Station (ISS) isokinetic protocol. Twenty-four healthy subjects (12 M/12 F, 32.0 +/- 5.6 years) volunteered to participate. Isokinetic knee, ankle, and trunk flexion and extension strength as well as endurance of the knee flexors and extensors were measured using a Cybex NORM isokinetic dynamometer. The first weekly session was considered a familiarization session. Data were collected and analyzed for weeks 2-4. Repeated measures analysis of variance (alpha=0.05) was used to identify weekly differences in isokinetic measures. Test-retest reliability was evaluated by intraclass correlation coefficients (ICC) (3,1). No significant differences were found between weeks in any of the strength measures and the reliability of the strength measures were all considered excellent (ICC greater than 0.9), except for concentric ankle dorsi-flexion (ICC=0.67). Although a significant difference was noted in weekly endurance measures of knee extension (p less than 0.01), the reliability of endurance measure by week were considered excellent for knee flexion (ICC=0.97) and knee extension (ICC=0.96). Except for concentric ankle dorsi-flexion, the isokinetic strength and endurance measures are highly reliable when following the NASA ISS protocol. This protocol should allow accurate interpretation isokinetic data even with a small number of crew members.
Reliability of the School Food Checklist for in-school audits and photograph analysis of children's packed lunches.

PubMed

Mitchell, S A; Miles, C L; Brennan, L; Matthews, J

2010-02-01

Assessment of children's diets is problematic, typically relying on error-prone parent or child recall or reporting, or resource intensive direct observation. The School Food Checklist (SFC) is an objective instrument comprising of 20 food and beverage categories designed to measure the foods contained in children's packed lunches. The present study aimed to assess intra-rater and inter-rater reliability of each of the food and beverage categories of the SFC for both in-school audits and photograph analysis of children's school lunches. Participants comprised 176 children aged 5-8 years from five primary schools in Northern Metropolitan Melbourne. The SFC was used to measure the foods contained in children's packed lunches in the school setting and using photographs. Photograph analysis was conducted by the auditors 2-3 months after completion of in-school audits. Both intra-rater [intra-class correlation coefficient (ICC) = 0.78-1] and inter-rater (ICC = 0.50-0.95) reliability analysis indicated strong agreement for in-school auditing. With the exception of the food category titled 'leftovers', there was strong intra-rater reliability for auditors' live audits and their analysis of photographs [ICC = 0.57-0.98 (Auditor 1); ICC = 0.72-0.90 (Auditor 2)], and strong inter-rater reliability for photograph analysis (ICC = 0.68-0.92). The SFC is a reliable method of measuring the foods and beverages contained in children's packed lunches when used in the school setting or for photograph analysis. This finding has broad implications, particularly for the use of photograph analysis, because this approach offers a convenient and cost effective method of measuring what food and beverages children bring to school in home packed lunches.
Strain analysis in CRT candidates using the novel segment length in cine (SLICE) post-processing technique on standard CMR cine images.

PubMed

Zweerink, Alwin; Allaart, Cornelis P; Kuijer, Joost P A; Wu, LiNa; Beek, Aernout M; van de Ven, Peter M; Meine, Mathias; Croisille, Pierre; Clarysse, Patrick; van Rossum, Albert C; Nijveldt, Robin

2017-12-01

Although myocardial strain analysis is a potential tool to improve patient selection for cardiac resynchronization therapy (CRT), there is currently no validated clinical approach to derive segmental strains. We evaluated the novel segment length in cine (SLICE) technique to derive segmental strains from standard cardiovascular MR (CMR) cine images in CRT candidates. Twenty-seven patients with left bundle branch block underwent CMR examination including cine imaging and myocardial tagging (CMR-TAG). SLICE was performed by measuring segment length between anatomical landmarks throughout all phases on short-axis cines. This measure of frame-to-frame segment length change was compared to CMR-TAG circumferential strain measurements. Subsequently, conventional markers of CRT response were calculated. Segmental strains showed good to excellent agreement between SLICE and CMR-TAG (septum strain, intraclass correlation coefficient (ICC) 0.76; lateral wall strain, ICC 0.66). Conventional markers of CRT response also showed close agreement between both methods (ICC 0.61-0.78). Reproducibility of SLICE was excellent for intra-observer testing (all ICC ≥0.76) and good for interobserver testing (all ICC ≥0.61). The novel SLICE post-processing technique on standard CMR cine images offers both accurate and robust segmental strain measures compared to the 'gold standard' CMR-TAG technique, and has the advantage of being widely available. • Myocardial strain analysis could potentially improve patient selection for CRT. • Currently a well validated clinical approach to derive segmental strains is lacking. • The novel SLICE technique derives segmental strains from standard CMR cine images. • SLICE-derived strain markers of CRT response showed close agreement with CMR-TAG. • Future studies will focus on the prognostic value of SLICE in CRT candidates.
Evaluation of Retinal and Choroidal Thickness by Swept-Source Optical Coherence Tomography: Repeatability and Assessment of Artifacts

PubMed Central

Mansouri, Kaweh; Medeiros, Felipe A.; Tatham, Andrew J.; Marchase, Nicholas; Weinreb, Robert N.

2017-01-01

PURPOSE To determine the repeatability of automated retinal and choroidal thickness measurements with swept-source optical coherence tomography (SS OCT) and the frequency and type of scan artifacts. DESIGN Prospective evaluation of new diagnostic technology. METHODS Thirty healthy subjects were recruited prospectively and underwent imaging with a prototype SS OCT instrument. Undilated scans of 54 eyes of 27 subjects (mean age, 35.1 ± 9.3 years) were obtained. Each subject had 4 SS OCT protocols repeated 3 times: 3-dimensional (3D) 6 × 6-mm raster scan of the optic disc and macula, radial, and line scan. Automated measurements were obtained through segmentation software. Interscan repeatability was assessed by intraclass correlation coefficients (ICCs). RESULTS ICCs for choroidal measurements were 0.92, 0.98, 0.80, and 0.91, respectively, for 3D macula, 3D optic disc, radial, and line scans. ICCs for retinal measurements were 0.39, 0.49, 0.71, and 0.69, respectively. Artifacts were present in up to 9% scans. Signal loss because of blinking was the most common artifact on 3D scans (optic disc scan, 7%; macula scan, 9%), whereas segmentation failure occurred in 4% of radial and 3% of line scans. When scans with image artifacts were excluded, ICCs for choroidal thickness increased to 0.95, 0.99, 0.87, and 0.93 for 3D macula, 3D optic disc, radial, and line scans, respectively. ICCs for retinal thickness increased to 0.88, 0.83, 0.89, and 0.76, respectively. CONCLUSIONS Improved repeatability of automated choroidal and retinal thickness measurements was found with the SS OCT after correction of scan artifacts. Recognition of scan artifacts is important for correct interpretation of SS OCT measurements. PMID:24531020
Pattern of age-associated decline of static and dynamic balance in community-dwelling older women.

PubMed

Takeshima, Nobuo; Islam, Mohammod M; Rogers, Michael E; Koizumi, Daisuke; Tomiyama, Naoki; Narita, Makoto; Rogers, Nicole L

2014-07-01

Falling is the leading cause of injury-related deaths in older adults, and a loss of balance is often the precursor to a fall. However, little is known about the rate at which balance declines with age. The objective of the present study was to determine whether there is an age-associated decline in static (SB) and/or dynamic (DB) balance in community-dwelling older women. SB and DB were determined in 971 older women. Intraclass correlation coefficients (ICC) were used to determine test-retest reliability. Sway velocity was used to measure SB standing on a platform and foam with eyes open and closed. DB was characterized by limits of stability (LOS) that measured end-point excursion (EXE) and maximum excursion (MXE) of the body's center of pressure. ICC for EXE and MXE for the LOS test were excellent (EPE = 0.96, MXE = 0.96). ICC for SB tests, except for the eyes open firm surface condition (ICC = 0.10), showed a high level of reproducibility (ICC = 0.88 and 0.90). Relationships existed between age and SB (r = 0.31, P < 0.001), and between age and DB (r = -0.46--0.48, P < 0.001). The rate of decline for both DB and SB was approximately 1% per year. Age was significantly associated with all balance measures. DB got significantly lower with advancing age until 80 years, and then plateaued. SB did not decline with age until 80 years, and then decreased significantly thereafter. Although large individual variation was found with balance ability, an age-related decline was found with both dynamic and static balance for Japanese older women. © 2013 Japan Geriatrics Society.
The Availability of Radiological Measurement of Femoral Anteversion Angle: Three-Dimensional Computed Tomography Reconstruction

PubMed Central

Byun, Ha Young; Shin, Heesuk; Lee, Eun Shin; Kong, Min Sik; Lee, Seung Hun

2016-01-01

Objective To assess the intra-rater and inter-rater reliability for measuring femoral anteversion angle (FAA) by a radiographic method using three-dimensional computed tomography reconstruction (3D-CT). Methods The study included 82 children who presented with intoeing gait. 3D-CT data taken between 2006 and 2014 were retrospectively reviewed. FAA was measured by 3D-CT. FAA is defined as the angle between the long axis of the femur neck and condylar axis of the distal femur. FAA measurement was performed twice at both lower extremities by each rater. The intra-rater and inter-rater reliability were calculated by intraclass correlation coefficient (ICC). Results One hundred and sixty-four lower limbs of 82 children (31 boys and 51 girls, 6.3±3.2 years old) were included. The ICCs of intra-rater measurement for the angle of femoral neck axis (NA) were 0.89 for rater A and 0.96 for rater B, and those of condylar axis (CA) were 0.99 for rater A and 0.99 for rater B, respectively. The ICC of inter-rater measurement for the angle of NA was 0.89 and that of CA was 0.92. By each rater, the ICCs of the intrarater measurement for FAA were 0.97 for rater A and 0.95 for rater B, respectively and the ICC of the inter-rater measurement for FAA was 0.89. Conclusion The 3D-CT measures for FAA are reliable within individual raters and between different raters. The 3D-CT measures of FAA can be a useful method for accurate diagnosis and follow-up of femoral anteversion. PMID:27152273
Intra- and inter-observer agreement on diagnosis of Dupuytren disease, measurements of severity of contracture, and disease extent.

PubMed

Broekstra, Dieuwke C; Lanting, Rosanne; Werker, Paul M N; van den Heuvel, Edwin R

2015-08-01

Dupuytren disease (DD) is a fibrosing disease affecting the palmar aponeurosis, and is mostly treated by surgery based on measurement of severity of flexion contracture of the fingers. Literature concerning the measurement reliability is scarce. This study aimed to determine the intra- and inter-observer agreement of four variables for diagnosing DD, determining severity of contracture, and disease extent. One of them is a new measurement on the area of nodules and cords for measuring the disease extent in early disease stages. An agreement study (n = 54) was performed by two trained investigators. Agreement was calculated per finger, based on an intraclass correlation coefficient (ICC) using a latent variable model on subjects for diagnosis and Tubiana stage. For total passive extension deficit (TPED) and the area of nodules and cords, agreement was calculated with an ICC using a one-way random effects model with subject as random effect. Inter-observer agreement was very good for diagnosing DD (ICC: 95.5%-99.9%) and good to very good for classifying Tubiana stage (ICC: 73.5%-94.9%). Agreements for area and TPED were moderate (middle finger) to very good (ICC: 48.4%-98.6% and 45.0%-99.5%, respectively). Intra-observer agreement was slightly higher on average than inter-observer agreement. Overall, the intra- and inter-observer agreement in diagnosing DD, and determining the severity of flexion contracture is high. Also, the newly introduced variable area of nodules and cords has high intra- and inter-observer agreement, indicating that it is suitable to measure disease extent. Copyright © 2015 Elsevier Ltd. All rights reserved.
The effect of the stability threshold on time to stabilization and its reliability following a single leg drop jump landing.

PubMed

Fransz, Duncan P; Huurnink, Arnold; de Boode, Vosse A; Kingma, Idsart; van Dieën, Jaap H

2016-02-08

We aimed to provide insight in how threshold selection affects time to stabilization (TTS) and its reliability to support selection of methods to determine TTS. Eighty-two elite youth soccer players performed six single leg drop jump landings. The TTS was calculated based on four processed signals: raw ground reaction force (GRF) signal (RAW), moving root mean square window (RMS), sequential average (SA) or unbounded third order polynomial fit (TOP). For each trial and processing method a wide range of thresholds was applied. Per threshold, reliability of the TTS was assessed through intra-class correlation coefficients (ICC) for the vertical (V), anteroposterior (AP) and mediolateral (ML) direction of force. Low thresholds resulted in a sharp increase of TTS values and in the percentage of trials in which TTS exceeded trial duration. The TTS and ICC were essentially similar for RAW and RMS in all directions; ICC's were mostly 'insufficient' (<0.4) to 'fair' (0.4-0.6) for the entire range of thresholds. The SA signals resulted in the most stable ICC values across thresholds, being 'substantial' (>0.8) for V, and 'moderate' (0.6-0.8) for AP and ML. The ICC's for TOP were 'substantial' for V, 'moderate' for AP, and 'fair' for ML. The present findings did not reveal an optimal threshold to assess TTS in elite youth soccer players following a single leg drop jump landing. Irrespective of threshold selection, the SA and TOP methods yielded sufficiently reliable TTS values, while for RAW and RMS the reliability was insufficient to differentiate between players. Copyright © 2016 Elsevier Ltd. All rights reserved.
Measurement of glenohumeral joint translation using real-time ultrasound imaging: A physiotherapist and sonographer intra-rater and inter-rater reliability study.

PubMed

Rathi, Sangeeta; Taylor, Nicholas F; Gee, Jamie; Green, Rodney A

2016-12-01

Ultrasonography is an economical and non-invasive method for measuring real-time joint movements. Although physiotherapists are increasingly using ultrasound imaging for rotator cuff disorders, there is a lack of evidence on their reliability in using ultrasonography to measure glenohumeral translation. The aim of this study was to evaluate the reliability of a physiotherapist in measuring anterior and posterior glenohumeral joint translation with ultrasound. Study design: within day reliability. Anterior and posterior glenohumeral translations were measured at rest, in response to passive accessory motion testing force, and with isometric internal and external rotation in 12 young healthy adults. All the measurements were made in real time by a physiotherapist and an experienced sonographer in two positions (neutral and abducted) and in two views (anterior and posterior). Intra-rater and inter-rater reliability were expressed using intraclass correlation coefficients (ICC) and measurement error (mm). Intra-rater reliability was good for both raters (ICC P : 0.86-0.98; ICC S : 0.85-0.96). The inter-rater reliability between the physiotherapist and sonographer was moderate to good for posterior measurements (ICC 0.50-0.75) and poor to moderate for anterior measurements (ICC 0.31-0.53). For both intra-rater and inter-rater measurements, posterior translation was more reliable than the anterior translation with smaller measurement errors (posterior: 0.1-0.2 mm, anterior: 0.2-0.3 mm). A physiotherapist with minimal training was reliable in measuring glenohumeral joint translations. The ultrasound method was reliable for repeated measurement of both anterior and posterior glenohumeral translations with posterior measurements being more reliable than anterior. This method is recommended for future research to investigate the stabilising role of rotator cuff muscles. Copyright © 2016 Elsevier Ltd. All rights reserved.
Translational aspects of rectal evoked potentials: a comparative study in rats and humans

PubMed Central

Nissen, Thomas Dahl; Graversen, Carina; Coen, Steven J.; Hultin, Leif; Aziz, Qasim; Lykkesfeldt, Jens; Drewes, Asbjørn Mohr

2013-01-01

Inconsistencies between species has stunted the progress of developing new analgesics. To increase the success of translating results between species, improved comparable models are required. Twelve rats received rectal balloon distensions on 2 different days separated by 24.3 (SD 24.6) days. Rectal balloon distensions were also performed in 18 humans (mean age: 34 yr; range: 21–56 yr; 12 men) on two separate occasions, separated by 9.3 (SD 5.5) days. In rats, cerebral evoked potentials (CEPs) were recorded by use of implanted skull-electrodes to distension pressure of 80 mmHg. In humans surface electrodes and individualized pressure, corresponding to pain detection threshold, were used. Comparison of morphology was assessed by wavelet analysis. Within- and between-day reproducibility was assessed in terms of latencies, amplitudes, and frequency content. In rats CEPs showed triphasic morphology. No differences in latencies, amplitudes, and power distribution were seen within or between days (all P ≥ 0.5). Peak-to-peak amplitude between the first positive and negative potential were the most reproducible characteristic within and between days (evaluated by intraclass correlation coefficients, ICC) (ICC = 0.99 and ICC = 9.98, respectively). In humans CEPs showed a triphasic morphology. No differences in latencies, amplitudes, or power distribution were seen within or between days (all P ≥ 0.2). Latency to the second negative potential (ICC = 0.98) and the second positive potential (ICC = 0.95) was the most reproducible characteristic within and between days. A unique and reliable translational platform was established assessing visceral sensitivity in rats and humans, which may improve the translational process of developing new drugs targeting visceral pain. PMID:23703652
Reliability of Examination Findings in Suspected Community-Acquired Pneumonia.

PubMed

Florin, Todd A; Ambroggio, Lilliam; Brokamp, Cole; Rattan, Mantosh S; Crotty, Eric J; Kachelmeyer, Andrea; Ruddy, Richard M; Shah, Samir S

2017-09-01

The authors of national guidelines emphasize the use of history and examination findings to diagnose community-acquired pneumonia (CAP) in outpatient children. Little is known about the interrater reliability of the physical examination in children with suspected CAP. This was a prospective cohort study of children with suspected CAP presenting to a pediatric emergency department from July 2013 to May 2016. Children aged 3 months to 18 years with lower respiratory signs or symptoms who received a chest radiograph were included. We excluded children hospitalized ≤14 days before the study visit and those with a chronic medical condition or aspiration. Two clinicians performed independent examinations and completed identical forms reporting examination findings. Interrater reliability for each finding was reported by using Fleiss' kappa (κ) for categorical variables and intraclass correlation coefficient (ICC) for continuous variables. No examination finding had substantial agreement (κ/ICC > 0.8). Two findings (retractions, wheezing) had moderate to substantial agreement (κ/ICC = 0.6-0.8). Nine findings (abdominal pain, pleuritic pain, nasal flaring, skin color, overall impression, cool extremities, tachypnea, respiratory rate, and crackles/rales) had fair to moderate agreement (κ/ICC = 0.4-0.6). Eight findings (capillary refill time, cough, rhonchi, head bobbing, behavior, grunting, general appearance, and decreased breath sounds) had poor to fair reliability (κ/ICC = 0-0.4). Only 3 examination findings had acceptable agreement, with the lower 95% confidence limit >0.4: wheezing, retractions, and respiratory rate. In this study, we found fair to moderate reliability of many findings used to diagnose CAP. Only 3 findings had acceptable levels of reliability. These findings must be considered in the clinical management and research of pediatric CAP. Copyright © 2017 by the American Academy of Pediatrics.
Intrarater reliability of goniometry and hand-held dynamometry for shoulder and elbow examinations in female team handball athletes and asymptomatic volunteers.

PubMed

Fieseler, Georg; Molitor, Thomas; Irlenbusch, Lars; Delank, Karl-Stefan; Laudner, Kevin G; Hermassi, Souhail; Schwesig, Rene

2015-12-01

To evaluate the intrarater reliability for examining active range of motion (ROM) and isometric strength of the shoulder and elbow among asymptomatic female team handball athletes and a control group using a manual goniometer and hand-held dynamometry (HHD). 22 female team handball athletes (age: 21.0 ± 3.7 years) and 25 volunteers (13 female, 12 male, age: 21.9 ± 1.24 years) participated to determine bilateral ROM for shoulder rotation and elbow flexion/extension, as well as isometric shoulder rotation and elbow flexion/extension strength. Subjects were assessed on two separate test sessions with 7 days between sessions. Relative (intraclass correlation coefficients (ICC) and standard error of measurement (SEM) reliability were calculated. Reliability for ROM and strength were good to excellent for both shoulders and groups (athletes: ICC = 0.94-0.97, SEM 1.07°-4.76 N, controls: ICC = 0.96-1.00, SEM = 0.00 N-4.48 N). Elbow measurements for both groups also showed good-to-excellent reliability (athletes: ICC = 0.79-0.97, SEM = 0.98°-5.94 N, controls: ICC = 0.87-1.00, SEM = 0.00 N-5.43 N). It is important to be able to reliably reproduce active ROM and isometric strength evaluations. Using a standardized testing position, goniometry and HHD are reliable instruments in the assessment of shoulder and elbow joint performance testing. We showed good-to-excellent reproducible results for male and female control subjects and female handball athletes, although the single parameters in ROM and strength were different for each group and between the shoulders and elbows.
Reliability of Hypernasality Rating: Comparison of 3 Different Methods for Perceptual Assessment.

PubMed

Yamashita, Renata Paciello; Borg, Elisabet; Granqvist, Svante; Lohmander, Anette

2018-01-01

To compare reliability in auditory-perceptual assessment of hypernasality for 3 different methods and to explore the influence of language background. Comparative methodological study. Participants and Materials: Audio recordings of 5-year-old Swedish-speaking children with repaired cleft lip and palate consisting of 73 stimuli of 9 nonnasal single-word strings in 3 different randomized orders. Four experienced speech-language pathologists (2 native speakers of Brazilian-Portuguese and 2 native speakers of Swedish) participated as listeners. After individual training, each listener performed the hypernasality rating task. Each order of stimuli was analyzed individually using the 2-step, VISOR and Borg centiMax scale methods. Comparison of intra- and inter-rater reliability, and consistency for each method within language of the listener and between listener languages (Swedish and Brazilian-Portuguese). Good to excellent intra-rater reliability was found within each listener for all methods, 2-step: κ = 0.59-0.93; VISOR: intraclass correlation coefficient (ICC) = 0.80-0.99; Borg centiMax (cM) scale: ICC = 0.80-1.00. The highest inter-rater reliability was demonstrated for VISOR (ICC = 0.60-0.90) and Borg cM-scale (ICC = 0.40-0.80). High consistency within each method was found with the highest for the Borg cM scale (ICC = 0.89-0.91). There was a significant difference in the ratings between the Swedish and the Brazilian listeners for all methods. The category-ratio scale Borg cM was considered most reliable in the assessment of hypernasality. Language background of Brazilian-Portuguese listeners influenced the perceptual ratings of hypernasality in Swedish speech samples, despite their experience in perceptual assessment of cleft palate speech disorders.
Reproducibility of current perception threshold with the Neurometer(®) vs the Stimpod NMS450 peripheral nerve stimulator in healthy volunteers: an observational study.

PubMed

Tsui, Ban C H; Shakespeare, Timothy J; Leung, Danika H; Tsui, Jeremy H; Corry, Gareth N

2013-08-01

Current methods of assessing nerve blocks, such as loss of perception to cold sensation, are subjective at best. Transcutaneous nerve stimulation is an alternative method that has previously been used to measure the current perception threshold (CPT) in individuals with neuropathic conditions, and various devices to measure CPT are commercially available. Nevertheless, the device must provide reproducible results to be used as an objective tool for assessing nerve blocks. We recruited ten healthy volunteers to examine CPT reproducibility using the Neurometer(®) and the Stimpod NMS450 peripheral nerve stimulator. Each subject's CPT was determined for the median (second digit) and ulnar (fifth digit) nerve sensory distributions on both hands - with the Neurometer at 5 Hz, 250 Hz, and 2000 Hz and with the Stimpod at pulse widths of 0.1 msec, 0.3 msec, 0.5 msec, and 1.0 msec, both at 5 Hz and 2 Hz. Intraclass correlation coefficients (ICC) were also calculated to assess reproducibility; acceptable ICCs were defined as ≥ 0.4. The ICC values for the Stimpod ranged from 0.425-0.79, depending on pulse width, digit, and stimulation; ICCs for the Neurometer were 0.615 and 0.735 at 250 and 2,000 Hz, respectively. These values were considered acceptable; however, the Neurometer performed less efficiently at 5 Hz (ICCs for the second and fifth digits were 0.292 and 0.318, respectively). Overall, the Stimpod device displayed good to excellent reproducibility in measuring CPT in healthy volunteers. The Neurometer displayed poor reproducibility at low frequency (5 Hz). These results suggest that peripheral nerve stimulators may be potential devices for measuring CPT to assess nerve blocks.
Comprehensive neuromechanical assessment in stroke patients: reliability and responsiveness of a protocol to measure neural and non-neural wrist properties.

PubMed

van der Krogt, Hanneke; Klomp, Asbjørn; de Groot, Jurriaan H; de Vlugt, Erwin; van der Helm, Frans Ct; Meskers, Carel Gm; Arendzen, J Hans

2015-03-13

Understanding movement disorder after stroke and providing targeted treatment for post stroke patients requires valid and reliable identification of biomechanical (passive) and neural (active and reflexive) contributors. Aim of this study was to assess test-retest reliability of passive, active and reflexive parameters and to determine clinical responsiveness in a cohort of stroke patients with upper extremity impairments and healthy volunteers. Thirty-two community-residing chronic stroke patients with an impairment of an upper limb and fourteen healthy volunteers were assessed with a comprehensive neuromechanical assessment protocol consisting of active and passive tasks and different stretch reflex-eliciting measuring velocities, using a haptic manipulator and surface electromyography of wrist flexor and extensor muscles (Netherlands Trial Registry number NTR1424). Intraclass correlation coefficients (ICC) and Standard Error of Measurement were calculated to establish relative and absolute test-retest reliability of passive, active and reflexive parameters. Clinical responsiveness was tested with Kruskal Wallis test for differences between groups. ICC of passive parameters were fair to excellent (0.45 to 0.91). ICC of active parameters were excellent (0.88-0.99). ICC of reflexive parameters were fair to good (0.50-0.74). Only the reflexive loop time of the extensor muscles performed poor (ICC 0.18). Significant differences between chronic stroke patients and healthy volunteers were found in ten out of fourteen parameters. Passive, active and reflexive parameters can be assessed with high reliability in post-stroke patients. Parameters were responsive to clinical status. The next step is longitudinal measurement of passive, active and reflexive parameters to establish their predictive value for functional outcome after stroke.

Selection for family medicine residency training in Canada: How consistently are the same students ranked by different programs?

PubMed

Wycliffe-Jones, Keith; Hecker, Kent G; Schipper, Shirley; Topps, Maureen; Robinson, Jeanine; Abedin, Tasnima

2018-02-01

To examine the consistency of the ranking of Canadian and US medical graduates who applied to Canadian family medicine (FM) residency programs between 2007 and 2013. Descriptive cross-sectional study. Family medicine residency programs in Canada. All 17 Canadian medical schools allowed access to their anonymized program rank-order lists of students applying to FM residency programs submitted to the first iteration of the Canadian Resident Matching Service match from 2007 to 2013. The rank position of medical students who applied to more than 1 FM residency program on the rank-order lists submitted by the programs. Anonymized ranking data submitted to the Canadian Resident Matching Service from 2007 to 2013 by all 17 FM residency programs were used. Ranking data of eligible Canadian and US medical graduates were analyzed to assess the within-student and between-student variability in rank score. These covariance parameters were then used to calculate the intraclass correlation coefficient (ICC) for all programs. Program descriptions and selection criteria were also reviewed to identify sites with similar profiles for subset ICC analysis. Between 2007 and 2013, the consistency of ranking by all programs was fair at best (ICC = 0.34 to 0.39). The consistency of ranking by larger urban-based sites was weak to fair (ICC = 0.23 to 0.36), and the consistency of ranking by sites focusing on training for rural practice was weak to moderate (ICC = 0.16 to 0.55). In most cases, there is a low level of consistency of ranking of students applying for FM training in Canada. This raises concerns regarding fairness, particularly in relation to expectations around equity and distributive justice in selection processes. Copyright© the College of Family Physicians of Canada.
Reliability and validity of CODA motion analysis system for measuring cervical range of motion in patients with cervical spondylosis and anterior cervical fusion.

PubMed

Gao, Zhongyang; Song, Hui; Ren, Fenggang; Li, Yuhuan; Wang, Dong; He, Xijing

2017-12-01

The aim of the present study was to evaluate the reliability of the Cartesian Optoelectronic Dynamic Anthropometer (CODA) motion system in measuring the cervical range of motion (ROM) and verify the construct validity of the CODA motion system. A total of 26 patients with cervical spondylosis and 22 patients with anterior cervical fusion were enrolled and the CODA motion analysis system was used to measure the three-dimensional cervical ROM. Intra- and inter-rater reliability was assessed by interclass correlation coefficients (ICCs), standard error of measurement (SEm), Limits of Agreements (LOA) and minimal detectable change (MDC). Independent samples t-tests were performed to examine the differences of cervical ROM between cervical spondylosis and anterior cervical fusion patients. The results revealed that in the cervical spondylosis group, the reliability was almost perfect (intra-rater reliability: ICC, 0.87-0.95; LOA, -12.86-13.70; SEm, 2.97-4.58; inter-rater reliability: ICC, 0.84-0.95; LOA, -13.09-13.48; SEm, 3.13-4.32). In the anterior cervical fusion group, the reliability was high (intra-rater reliability: ICC, 0.88-0.97; LOA, -10.65-11.08; SEm, 2.10-3.77; inter-rater reliability: ICC, 0.86-0.96; LOA, -10.91-13.66; SEm, 2.20-4.45). The cervical ROM in the cervical spondylosis group was significantly higher than that in the anterior cervical fusion group in all directions except for left rotation. In conclusion, the CODA motion analysis system is highly reliable in measuring cervical ROM and the construct validity was verified, as the system was sufficiently sensitive to distinguish between the cervical spondylosis and anterior cervical fusion groups based on their ROM.
Reliability and Validity of Two Self-report Measures to Assess Sedentary Behavior in Older Adults

PubMed Central

Gennuso, Keith P.; Matthews, Charles E.; Colbert, Lisa H.

2015-01-01

Background The purpose of this study was to examine the reliability and validity of two currently available physical activity surveys for assessing time spent in sedentary behavior (SB) in older adults. Methods Fifty-eight adults (≥65 years) completed the Yale Physical Activity Survey for Older Adults (YPAS) and Community Health Activities Model Program for Seniors (CHAMPS) before and after a 10-day period during which they wore an ActiGraph accelerometer (ACC). Intraclass correlation coefficients (ICC) examined test-retest reliability. Overall percent agreement and a kappa statistic examined YPAS validity. Lin’s concordance correlation, Pearson correlation, and Bland-Altman analysis examined CHAMPS validity. Results Both surveys had moderate test-retest reliability (ICC: YPAS=0.59 (P<0.001), CHAMPS=0.64 (P<0.001)) and significantly underestimated SB time. Agreement between YPAS and ACC was low (κ=−0.0003); however, there was a linear increase (P< 0.01) in ACC-derived SB time across YPAS response categories. There was poor agreement between ACC-derived SB and CHAMPS (Lin’s r=0.005; 95% CI, −0.010 to 0.020), and no linear trend across CHAMPS quartiles (p=0.53). Conclusions Neither of the surveys should be used as the sole measure of SB in a study; though the YPAS has the ability to rank individuals, providing it with some merit for use in correlational SB research. PMID:25110344
Validation and reliability of a Behcet's Syndrome Activity Scale in Korea.

PubMed

Choi, Hyo Jin; Seo, Mi Ryoung; Ryu, Hee Jung; Baek, Han Joo

2016-01-01

We prepared a cross-cultural adaptation of the Behcet's Syndrome Activity Scale (BSAS) and evaluated its reliability and validity in Korea. Fifty patients with Behcet's disease (BD) who attended the Rheumatology Clinic of Gachon University Gil Medical Center were included in this study. The first BSAS questionnaire was administered at each clinic visit, and the second questionnaire was completed at home within 24 hours of the visit. A Behcet's Disease Current Activity Form (BDCAF) and a Behcet's Disease Quality of Life (BDQOL) form were also given to patients. The test-retest reliability was analyzed by intraclass correlation coefficients (ICC). To assess the validity, the total BSAS score was compared with the BDCAF score, the patient/physician global assessment, and the BDQOL by Spearman rank correlation. Twelve males and 38 females were enrolled. The mean age was 48.5 years and the mean disease duration was 6.7 years. Thirty-eight patients (76.0%) returned the questionnaire by mail. For the test-retest reliability, the two assessments were significantly correlated on all 10 items of the BSAS questionnaire (p < 0.05) and the total BSAS score (ICC, 0.925; p < 0.001). The total BSAS score was statistically correlated with the BDQOL, BDCAF, and patient/physician global assessment (p < 0.01). The Korean version of BSAS is a reliable and valid instrument to measure BD activity.
Inflow-vascular space occupancy (iVASO) reproducibility in the hippocampus and cortex at different blood water nulling times.

PubMed

Rane, Swati; Talati, Pratik; Donahue, Manus J; Heckers, Stephan

2016-06-01

Inflow-vascular space occupancy (iVASO) measures arterial cerebral blood volume (aCBV) using accurate blood water nulling (inversion time [TI]) when arterial blood reaches the capillary, i.e., at the arterial arrival time. This work assessed the reproducibility of iVASO measurements in the hippocampus and cortex at multiple TIs. The iVASO approach was implemented at multiple TIs in 10 healthy volunteers at 3 Tesla. aCBV values were measured at each TI in the left and right hippocampus, and the cortex. Reproducibility of aCBV measurements within scans (same day) and across sessions (different days) was assessed using the intraclass correlation coefficient (ICC). Overall hippocampal aCBV was significantly higher than cortical aCBV, likely due to higher gray matter volume. Hippocampal ICC values were high at short TIs (≤914 ms; intrascan values = 0.80-0.96, interscan values = 0.61-0.91). Cortically, high ICC values were observed at intermediate TIs of 914 (intra: 0.93, inter: 0.87) and 1034 ms (intra: 0.96, inter: 0.86). The ICC values were comparable to established contrast-based CBV measures. iVASO measurements are reproducible within and across sessions. TIs for iVASO measurements should be chosen carefully, taking into account heterogeneous arterial arrival times in different brain regions. Magn Reson Med 75:2379-2387, 2016. © 2015 Wiley Periodicals, Inc. © 2015 Wiley Periodicals, Inc.
Measuring physical activity during pregnancy - Cultural adaptation of the Pregnancy Physical Activity Questionnaire (PPAQ) and assessment of its reliability in Polish conditions.

PubMed

Krzepota, Justyna; Sadowska, Dorota; Sempolska, Katarzyna; Pelczar, Małgorzata

2017-12-23

The assessment of physical activity during pregnancy is crucial in perinatal care and it is an important research topic. Unfortunately, in Poland there is a lack of one commonly accepted questionnaire of physical activity during pregnancy. The aim of this study was to adapt the Pregnancy Physical Activity Questionnaire (PPAQ) to Polish conditions and assess the reliability of its Polish version (PPAQ-PL). The PPAQ was translated from English into Polish and its reliability tested. 64 correctly completed (twice, one week apart) questionnaires were qualified for analysis. Test-retest reliability was assessed using Intraclass Correlation Coefficient (ICC). As a result of the adaptation and psychometric assessment, in the Polish version of the questionnaire the number of questions was reduced from 36 to 35 by removing the question concerning 'mowing lawn while on a riding mower'. The ICC value for total activity was 0.75, which confirms a substantial level of reliability. The ICC values for subscales of intensity ranged from 0.53 (light) - 0.86 (vigorous). For subscales of type, ICC values ranged from 0.59 (transportation) - 0.89 (household/caregiving). The PPAQ-PL can be accepted as a reliable tool for the assessing physical activity of pregnant women in Poland. Information obtained using the questionnaire might be helpful in monitoring health behaviours, preventing obesity, as well as designing and promoting physical activity programmes for pregnant women.
Analyses of inter-rater reliability between professionals, medical students and trained school children as assessors of basic life support skills.

PubMed

Beck, Stefanie; Ruhnke, Bjarne; Issleib, Malte; Daubmann, Anne; Harendza, Sigrid; Zöllner, Christian

2016-10-07

Training of lay-rescuers is essential to improve survival-rates after cardiac arrest. Multiple campaigns emphasise the importance of basic life support (BLS) training for school children. Trainings require a valid assessment to give feedback to school children and to compare the outcomes of different training formats. Considering these requirements, we developed an assessment of BLS skills using MiniAnne and tested the inter-rater reliability between professionals, medical students and trained school children as assessors. Fifteen professional assessors, 10 medical students and 111-trained school children (peers) assessed 1087 school children at the end of a CPR-training event using the new assessment format. Analyses of inter-rater reliability (intraclass correlation coefficient; ICC) were performed. Overall inter-rater reliability of the summative assessment was high (ICC = 0.84, 95 %-CI: 0.84 to 0.86, n = 889). The number of comparisons between peer-peer assessors (n = 303), peer-professional assessors (n = 339), and peer-student assessors (n = 191) was adequate to demonstrate high inter-rater reliability between peer- and professional-assessors (ICC: 0.76), peer- and student-assessors (ICC: 0.88) and peer- and other peer-assessors (ICC: 0.91). Systematic variation in rating of specific items was observed for three items between professional- and peer-assessors. Using this assessment and integrating peers and medical students as assessors gives the opportunity to assess hands-on skills of school children with high reliability.
The Design, Development, and Reliability Testing of a New Innovative Device to Measure Ankle Joint Dorsiflexion.

PubMed

Charles, James

2016-09-02

In clinical and research settings, ankle joint dorsiflexion needs to be reliably measured. Dorsiflexion is often measured by goniometry, but the intrarater and interrater reliability of this technique have been reported to be poor. Many devices to measure dorsiflexion have been developed for clinical and research use. An evaluation of 12 current tools showed that none met all of the desirable criteria. The purpose of this study was to design and develop a device that rates highly in all of the criteria and that can be proved to be highly reliable. While supine on a treatment table, 14 participants had a foot placed in the Charles device and ankle joint dorsiflexion measured and recorded three times with a digital inclinometer. The mean of the three readings was determined to be the ankle joint dorsiflexion. The analysis used was intraclass correlation coefficient (ICC). There was very little difference in ICC single or average measures between left and right feet, so data were pooled (N = 28). The single-measure ICC was 0.998 (95% confidence interval, 0.996-0.998). The average-measure ICC was 0.998 (95% confidence interval, 0.995-0.999). Limits of agreement for the average measure were also very good: -1.30° to 1.65°. The Charles device meets all of the desirable criteria and has many innovative features, increasing its appropriateness for clinical and research applications. It has a suitable design for measuring dorsiflexion and high intrarater and interrater reliability.
Ultrasonographic measurement of the acromiohumeral distance in spinal cord injury: Reliability and effects of shoulder positioning.

PubMed

Lin, Yen-Sheng; Boninger, Michael L; Day, Kevin A; Koontz, Alicia M

2015-11-01

To investigate the reliability of ultrasonographic measurement of acromiohumeral distance (AHD) and the effects of shoulder positioning on AHD among manual wheelchair users (MWUs) with spinal cord injury (SCI) and an able-bodied control group. Ten MWUs with SCI and 10 able-bodied subjects participated in this study. The ultrasonographic measurements of AHD from each subject were obtained by two raters during passive and active scapular plane arm elevation in neutral, 45°, 90° with and without resistance and in a weight relief raise position. The measurements were recorded again by each rater using the same procedures after a 30-minute time interval. All raters were blinded to each other's measurements. University Laboratories and Veteran Affairs Healthcare System. Intra-rater (intraclass correlation coefficient, ICC > 0.83) and inter-rater (ICC > 0.78) reliability was excellent for both the MWUs with SCI and able-bodied groups across all arm positions except for the 45° position in the control group for one of the raters (intra-rater: ICC < 0.40 and inter-rater: ICC < 0.60). AHD significantly reduced when the shoulder was in the 90° arm elevated positions with or without resistance. Findings from our study demonstrated that ultrasonography is a reliable means to evaluate AHD in both able bodied and individuals with SCI, who are known to have significant shoulder pathology. This technique could be used to develop reference measures and to identify changes in AHD caused by interventions.
Ultrasonographic measurements of lower trapezius muscle thickness at rest and during isometric contraction: a reliability study.

PubMed

Talbott, Nancy R; Witt, Dexter W

2014-07-01

The purpose of this study was to determine the intra-rater reliability and inter-rater reliability of ultrasound imaging (USI) thickness measurements of the lower trapezius (LT) at rest and during active contractions when the transverse process and the lamina were used as reference sites for the measurement process. Twenty healthy individuals between the ages of 22 and 32 years volunteered. With the subject prone and the shoulder in 145° of abduction, images of the LT were taken bilaterally by one examiner as the subject: (1) rested; (2) actively held the test position; and (3) actively held the test position while holding a weight. Ten subjects returned and testing was repeated by the same examiner and by a second examiner. LT thickness measurements were recorded at the level of the transverse process and at the level of the lamina. Intra-class correlation coefficients (ICC) for within session intra-rater reliability (ICC3,3) ranged from 0.951 to 0.986 for both measurement sites while between session intra-rater reliability (ICC3,2) ranged from 0.935 to 0.962. Within session inter-rater reliability (ICC2,2) ranged from 0.934 to 0.973. USI can be used to reliably measure LT thickness at rest, during active contraction and during active contraction when holding a weight. The described protocol can be utilized during shoulder examinations to provide an additional assessment tool for monitoring changes in LT thickness.
Reliability, standard error, and minimum detectable change of clinical pressure pain threshold testing in people with and without acute neck pain.

PubMed

Walton, David M; Macdermid, Joy C; Nielson, Warren; Teasell, Robert W; Chiasson, Marco; Brown, Lauren

2011-09-01

Clinical measurement. To evaluate the intrarater, interrater, and test-retest reliability of an accessible digital algometer, and to determine the minimum detectable change in normal healthy individuals and a clinical population with neck pain. Pressure pain threshold testing may be a valuable assessment and prognostic indicator for people with neck pain. To date, most of this research has been completed using algometers that are too resource intensive for routine clinical use. Novice raters (physiotherapy students or clinical physiotherapists) were trained to perform algometry testing over 2 clinically relevant sites: the angle of the upper trapezius and the belly of the tibialis anterior. A convenience sample of normal healthy individuals and a clinical sample of people with neck pain were tested by 2 different raters (all participants) and on 2 different days (healthy participants only). Intraclass correlation coefficient (ICC), standard error of measurement, and minimum detectable change were calculated. A total of 60 healthy volunteers and 40 people with neck pain were recruited. Intrarater reliability was almost perfect (ICC = 0.94-0.97), interrater reliability was substantial to near perfect (ICC = 0.79-0.90), and test-retest reliability was substantial (ICC = 0.76-0.79). Smaller change was detectable in the trapezius compared to the tibialis anterior. This study provides evidence that novice raters can perform digital algometry with adequate reliability for research and clinical use in people with and without neck pain.
Test-retest reliability of quantitative sensory testing for mechanical somatosensory and pain modulation assessment of masticatory structures.

PubMed

Costa, Y M; Morita-Neto, O; de Araújo-Júnior, E N S; Sampaio, F A; Conti, P C R; Bonjardim, L R

2017-03-01

Assessing the reliability of medical measurements is a crucial step towards the elaboration of an applicable clinical instrument. There are few studies that evaluate the reliability of somatosensory assessment and pain modulation of masticatory structures. This study estimated the test-retest reliability, that is over time, of the mechanical somatosensory assessment of anterior temporalis, masseter and temporomandibular joint (TMJ) and the conditioned pain modulation (CPM) using the anterior temporalis as the test site. Twenty healthy women were evaluated in two sessions (1 week apart) by the same examiner. Mechanical detection threshold (MDT), mechanical pain threshold (MPT), wind-up ratio (WUR) and pressure pain threshold (PPT) were assessed on the skin overlying the anterior temporalis, masseter and TMJ of the dominant side. CPM was tested by comparing PPT before and during the hand immersion in a hot water bath. anova and intra-class correlation coefficients (ICCs) were applied to the data (α = 5%). The overall ICCs showed acceptable values for the test-retest reliability of mechanical somatosensory assessment of masticatory structures. The ICC values of 75% of all quantitative sensory measurements were considered fair to excellent (fair = 8·4%, good = 33·3% and excellent = 33·3%). However, the CPM paradigm presented poor reliability (ICC = 0·25). The mechanical somatosensory assessment of the masticatory structures, but not the proposed CPM protocol, can be considered sufficiently reliable over time to evaluate the trigeminal sensory function. © 2016 John Wiley & Sons Ltd.
Optimization of Scan Parameters to Reduce Acquisition Time for Diffusion Kurtosis Imaging at 1.5T.

PubMed

Yokosawa, Suguru; Sasaki, Makoto; Bito, Yoshitaka; Ito, Kenji; Yamashita, Fumio; Goodwin, Jonathan; Higuchi, Satomi; Kudo, Kohsuke

2016-01-01

To shorten acquisition of diffusion kurtosis imaging (DKI) in 1.5-tesla magnetic resonance (MR) imaging, we investigated the effects of the number of b-values, diffusion direction, and number of signal averages (NSA) on the accuracy of DKI metrics. We obtained 2 image datasets with 30 gradient directions, 6 b-values up to 2500 s/mm(2), and 2 signal averages from 5 healthy volunteers and generated DKI metrics, i.e., mean, axial, and radial kurtosis (MK, K∥, and K⊥) maps, from various combinations of the datasets. These maps were estimated by using the intraclass correlation coefficient (ICC) with those from the full datasets. The MK and K⊥ maps generated from the datasets including only the b-value of 2500 s/mm(2) showed excellent agreement (ICC, 0.96 to 0.99). Under the same acquisition time and diffusion directions, agreement was better of MK, K∥, and K⊥ maps obtained with 3 b-values (0, 1000, and 2500 s/mm(2)) and 4 signal averages than maps obtained with any other combination of numbers of b-value and varied NSA. Good agreement (ICC > 0.6) required at least 20 diffusion directions in all the metrics. MK and K⊥ maps with ICC greater than 0.95 can be obtained at 1.5T within 10 min (b-value = 0, 1000, and 2500 s/mm(2); 20 diffusion directions; 4 signal averages; slice thickness, 6 mm with no interslice gap; number of slices, 12).
The Overt Behaviour Scale-Self-Report (OBS-SR) for acquired brain injury: exploratory analysis of reliability and validity.

PubMed

Kelly, Glenn; Simpson, Grahame K; Brown, Suzanne; Kremer, Peter; Gillett, Lauren

2017-05-23

The objectives were to test the properties, via a psychometric study, of the Overt Behaviour Scale-Self-Report (OBS-SR), a version of the OBS-Adult Scale developed to provide a client perspective on challenging behaviours after acquired brain injury. Study sample 1 consisted of 37 patients with primary brain tumour (PBT) and a family-member informant. Sample 2 consisted of 34 clients with other acquired brain injury (mixed brain injury, MBI) and a service-provider informant. Participants completed the OBS-SR (at two time points), and the Awareness Questionnaire (AQ) and Mayo Portland Adaptability Inventory-III (MPAI-III) once; informants completed the OBS-Adult and AQ once only. PBT-informant dyads displayed "good" levels of agreement (ICC 2,k = .74; OBS-SR global index). Although MBI-informant dyads displayed no agreement (ICC 2,k = .22; OBS-SR global index), the sub-group (17/29) rated by clinicians as having moderate to good levels of awareness displayed "fair" agreement (ICC 2,k = .58; OBS-SR global index). Convergent/divergent validity was demonstrated by significant correlations between OBS-SR subscales and MPAI-III subscales with behavioural content (coefficients in the range .36 -.61). Scores had good reliability across one week (ICC 2,k = .69). The OBS-SR took approximately 15 minutes to complete. It was concluded that the OBS-SR demonstrated acceptable reliability and validity, providing a useful resource in understanding clients' perspectives about their behaviour.
Heritability and reliability of automatically segmented human hippocampal formation subregions

PubMed Central

Whelan, Christopher D.; Hibar, Derrek P.; van Velzen, Laura S.; Zannas, Anthony S.; Carrillo-Roa, Tania; McMahon, Katie; Prasad, Gautam; Kelly, Sinéad; Faskowitz, Joshua; deZubiracay, Greig; Iglesias, Juan E.; van Erp, Theo G.M.; Frodl, Thomas; Martin, Nicholas G.; Wright, Margaret J.; Jahanshad, Neda; Schmaal, Lianne; Sämann, Philipp G.; Thompson, Paul M.

2016-01-01

The human hippocampal formation can be divided into a set of cytoarchitecturally and functionally distinct subregions, involved in different aspects of memory formation. Neuroanatomical disruptions within these subregions are associated with several debilitating brain disorders including Alzheimer’s disease, major depression, schizophrenia, and bipolar disorder. Multi-center brain imaging consortia, such as the Enhancing Neuro Imaging Genetics through Meta-Analysis (ENIGMA) consortium, are interested in studying disease effects on these subregions, and in the genetic factors that affect them. For large-scale studies, automated extraction and subsequent genomic association studies of these hippocampal subregion measures may provide additional insight. Here, we evaluated the test–retest reliability and transplatform reliability (1.5 T versus 3 T) of the subregion segmentation module in the FreeSurfer software package using three independent cohorts of healthy adults, one young (Queensland Twins Imaging Study, N = 39), another elderly (Alzheimer’s Disease Neuroimaging Initiative, ADNI-2, N = 163) and another mixed cohort of healthy and depressed participants (Max Planck Institute, MPIP, N = 598). We also investigated agreement between the most recent version of this algorithm (v6.0) and an older version (v5.3), again using the ADNI-2 and MPIP cohorts in addition to a sample from the Netherlands Study for Depression and Anxiety (NESDA) (N = 221). Finally, we estimated the heritability (h2) of the segmented subregion volumes using the full sample of young, healthy QTIM twins (N = 728). Test–retest reliability was high for all twelve subregions in the 3 T ADNI-2 sample (intraclass correlation coefficient (ICC) = 0.70–0.97) and moderate-to-high in the 4 T QTIM sample (ICC = 0.5–0.89). Transplatform reliability was strong for eleven of the twelve subregions (ICC = 0.66–0.96); however, the hippocampal fissure was not consistently reconstructed across 1.5 T and 3 T field strengths (ICC = 0.47–0.57). Between-version agreement was moderate for the hippocampal tail, subiculum and presubiculum (ICC = 0.78–0.84; Dice Similarity Coefficient (DSC) = 0.55–0.70), and poor for all other subregions (ICC = 0.34–0.81; DSC = 0.28–0.51). All hippocampal subregion volumes were highly heritable (h2 = 0.67–0.91). Our findings indicate that eleven of the twelve human hippocampal subregions segmented using FreeSurfer version 6.0 may serve as reliable and informative quantitative phenotypes for future multi-site imaging genetics initiatives such as those of the ENIGMA consortium. PMID:26747746
The root coverage esthetic score: Intra-examiner reliability among students and faculty at tufts university school of dental medicine.

PubMed

Isaia, Federica; Gyurko, Robert; Roomian, Tamar C; Hawley, Charles E

2018-04-06

The Root Coverage Esthetic Score (RES) was published in 2009 as an esthetic scoring system to measure visible final outcomes of root coverage procedures performed on Miller I and II recession defects. The aim of this study was to evaluate the intra-examiner, intra-group, and inter-examiner reliability of the (Root Coverage Esthetic Score) RES when used among periodontal faculty, post-graduate students in periodontology, and pre-doctoral DMD students when using the RES at Tufts University School of Dental Medicine (TUSDM). Thirty-three participants (12 second year DMD students, 11 periodontal residents, and 10 faculty members) were assembled to evaluate 25 baseline and 6-months post-treatment outcomes of mucogingival surgeries using the RES. Each projection was shown for 30 seconds during which the participants were asked to use the RES scoring system to evaluate the surgical outcomes. The results were then recorded on a standardized worksheet grid. To test intra-examiner reliability, 7 of the 25 projections were shown twice. Intra-examiner reliability and inter-examiner reliability were assessed using intraclass correlation coefficient using a two-way mixed effects model, and stratified by education level. PG residents had the highest tendency to agree with each other with an interclass correlation (ICC) of 0.53 (95%CI 0.36 - 0.74). DMD students had an ICC: 0.51 (95%CI: 0.33 - 0.75), and PG faculty members produced an ICC: 0.41 (95%CI: 0.24 - 0.64). There was no statistically significant difference in ICC among the three groups of participants (Kruskal-Wallis test, P = 0.2440). When the data for each RES element were then combined, the mean ICC for the total interrater agreement for RES was 0.48 (95% CI: 0.32-0.71). This corresponds to an overall moderate agreement among all participants using the RES to evaluate the 25 surgical outcomes. The intra-examiner reliability within each of the three groups was quite high. The highest mean ICC was produced by the PG Faculty (0.908). The mean ICCs for PG residents was 0.867, and the mean ICC for DMD students was 0.855. The Kruskal-Wallis test (p = 0.46) failed to find any statistical difference in intra-examiner reliability between the three groups of participants CONCLUSIONS: The RES is a "moderately" reliable scoring system for mucogingival treatments in a dental school setting and can be used even by operators with different level of periodontal experience. This scoring system can be repeated by the same examiner obtaining reliable results. This article is protected by copyright. All rights reserved. © 2018 American Academy of Periodontology.
A consistency evaluation of signal-to-noise ratio in the quality assessment of human brain magnetic resonance images.

PubMed

Yu, Shaode; Dai, Guangzhe; Wang, Zhaoyang; Li, Leida; Wei, Xinhua; Xie, Yaoqin

2018-05-16

Quality assessment of medical images is highly related to the quality assurance, image interpretation and decision making. As to magnetic resonance (MR) images, signal-to-noise ratio (SNR) is routinely used as a quality indicator, while little knowledge is known of its consistency regarding different observers. In total, 192, 88, 76 and 55 brain images are acquired using T 2 * , T 1 , T 2 and contrast-enhanced T 1 (T 1 C) weighted MR imaging sequences, respectively. To each imaging protocol, the consistency of SNR measurement is verified between and within two observers, and white matter (WM) and cerebral spinal fluid (CSF) are alternately used as the tissue region of interest (TOI) for SNR measurement. The procedure is repeated on another day within 30 days. At first, overlapped voxels in TOIs are quantified with Dice index. Then, test-retest reliability is assessed in terms of intra-class correlation coefficient (ICC). After that, four models (BIQI, BLIINDS-II, BRISQUE and NIQE) primarily used for the quality assessment of natural images are borrowed to predict the quality of MR images. And in the end, the correlation between SNR values and predicted results is analyzed. To the same TOI in each MR imaging sequence, less than 6% voxels are overlapped between manual delineations. In the quality estimation of MR images, statistical analysis indicates no significant difference between observers (Wilcoxon rank sum test, p w ≥ 0.11; paired-sample t test, p p ≥ 0.26), and good to very good intra- and inter-observer reliability are found (ICC, p icc ≥ 0.74). Furthermore, Pearson correlation coefficient (r p ) suggests that SNR wm correlates strongly with BIQI, BLIINDS-II and BRISQUE in T 2 * (r p ≥ 0.78), BRISQUE and NIQE in T 1 (r p ≥ 0.77), BLIINDS-II in T 2 (r p ≥ 0.68) and BRISQUE and NIQE in T 1 C (r p ≥ 0.62) weighted MR images, while SNR csf correlates strongly with BLIINDS-II in T 2 * (r p ≥ 0.63) and in T 2 (r p ≥ 0.64) weighted MR images. The consistency of SNR measurement is validated regarding various observers and MR imaging protocols. When SNR measurement performs as the quality indicator of MR images, BRISQUE and BLIINDS-II can be conditionally used for the automated quality estimation of human brain MR images.
The Unsupported Upper Limb Exercise Test in People Without Disabilities: Assessing the Within-Day Test-Retest Reliability and the Effects of Age and Gender.

PubMed

Oliveira, Ana; Cruz, Joana; Jácome, Cristina; Marques, Alda

2018-01-01

Purpose: To estimate the within-day test-retest reliability and standard error of measurement (SEM) of the unsupported upper limb exercise test (UULEX) in adults without disabilities and to determine the effects of age and gender on performance of the UULEX. Method: A cross-sectional study was conducted with 100 adults without disabilities (44 men, mean age 44.2 [SD 26] y; 56 women, mean age 38.1 [SD 24.1] y). Participants performed three UULEX tests to establish within-day reliability, measured using an intra-class correlation coefficient (ICC) model 2 (two-way random effects) with a single rater (ICC[2,1]) and SEM. The effects of age and gender were examined using two-factor mixed-design analysis of variance (ANOVA) and one-way repeated-measures ANOVA. For analysis purposes, four sub-groups were created: younger adults, older adults, men, and women. Results: Excellent within-day reliability and a small SEM were found in the four sub-groups (younger adults: ICC[2,1]=0.88; 95% CI: 0.82, 0.92; SEM∼40 s; older adults: ICC[2,1]=0.82; 95% CI: 0.72, 0.90; SEM∼50 s; men: ICC[2,1]=0.93; 95% CI: 0.88, 0.96; SEM∼30 s; women: ICC[2,1]=0.85; 95% CI: 0.78, 0.91; SEM∼45 s). Younger adults took, on average, 308.24 seconds longer than older adults to perform the test; older adults performed significantly better on the third test ( p <0.0001; η 2 =0.096). Gender effects were not found ( p >0.05). Conclusion: The within-day test-retest reliability and SEM values of the UULEX may be used to define the magnitude of the error obtained with repeated measures. One UULEX test seems to be adequate for younger adults to achieve reliable results, whereas three tests seem to be needed for older adults.
Real-time sonoelastography using an external reference material: test-retest reliability of healthy Achilles tendons.

PubMed

Schneebeli, Alessandro; Del Grande, Filippo; Vincenzo, Gabriele; Cescon, Corrado; Clijsen, Ron; Biordi, Fulvio; Barbero, Marco

2016-08-01

To establish the test-retest reliability of sonoelastography (SE) on healthy Achilles tendons in contracted and relaxed states using an external reference system. Forty-eight Achilles tendons from 24 healthy volunteers were assessed using ultrasound and real-time SE with an external reference material. Tendons were analyzed under relaxed and contracted conditions. Strain ratios between the tendons and the reference material were calculated. The intraclass correlation coefficient (ICC2.k) and Bland-Altman plot were used to assess test-retest reliability. The reliability of SE measurements under relaxed conditions ranged from high to very high, with an ICC2.k of 0.84 (95 % CI: 0.64-0.92) for reference material, 0.91 (95 % CI: 0.83-0.95) for Achilles tendons and 0.95 (95 % CI: 0.91-0.97) for Kager fat pads (KFP). The ICC2.k value for skin was 0.30 (95 % CI: -0.26 to 0.61). Reliability for measurements in the contracted state ranged from high to very high, with an ICC2.k of 0.93 (95 % CI: 0.87-0.96) for reference material, 0.72 (95 % CI: 0.50-0.84) for skin, 0.93 (95 % CI: 0.87-0.96) for Achilles tendons, and 0.81 (95 % CI: 0.66-0.89) for KFP. Reliability of the strain ratio (tendon/reference) under relaxed conditions was high with an ICC2.k of 0.87 (95 % CI: 0.75-0.93), and in the contracted state, it was very high with an ICC2.k of 0.94 (95 % CI: 0.90-0.97). Sonoelastography using an external reference material is a reliable and simple technique for the assessment of the elasticity of healthy Achilles tendons. The use of an external material as a reference, along with strain ratios, could provide a quantitative measure of elasticity.
Agreement among Goldmann applanation tonometer, iCare, and Icare PRO rebound tonometers; non-contact tonometer; and Tonopen XL in healthy elderly subjects.

PubMed

Kato, Yoshitake; Nakakura, Shunsuke; Matsuo, Naoko; Yoshitomi, Kayo; Handa, Marina; Tabuchi, Hitoshi; Kiuchi, Yoshiaki

2018-04-01

To evaluate the inter-device agreement among the Goldmann applanation tonometer (GAT), iCare and Icare PRO rebound tonometers, non-contact tonometer (NCT), and Tonopen XL tonometer. Sixty healthy elderly subjects were enrolled. The intraocular pressure (IOP) in each subject's right eye was measured thrice using each of the five tonometers. Intra-device agreement was evaluated by calculating intraclass correlation coefficients (ICCs). Inter-device agreement was evaluated by ICC and Bland-Altman analyses. ICCs for intra-device agreement for each tonometer were >0.8. IOP as measured by iCare (mean ± SD, 11.6 ± 2.5 mmHg) was significantly lower (p < 0.05) than that measured by GAT (14.0 ± 2.8 mmHg), NCT (13.6 ± 2.5 mmHg), Tonopen XL (13.7 ± 4.1 mmHg), and Icare PRO (12.6 ± 2.2 mmHg; Bonferroni test). There was no significant difference in mean IOP among GAT, NCT, and Tonopen XL. Regarding inter-device agreement, ICC was lower between Tonopen XL and other tonometers (all ICCs < 0.4). However, ICCs of GAT, iCare, Icare PRO, and NCT showed good agreement (0.576-0.700). The Bland-Altman analysis revealed that the width of the 95% limits of agreement was larger between the Tonopen XL and the other tonometers ranged from 14.94 to 16.47 mmHg. Among the other tonometers, however, the widths of 95% limits of agreement ranged from 7.91 to 9.24 mmHg. There was good inter-device agreement among GAT, rebound tonometers, and NCT. Tonopen XL shows the worst agreement with the other tonometers; therefore, we should pay attention to its' respective IOP. Japan Clinical Trials Register; number: UMIN000011544.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.